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In many respects, we understand the structure of the universe better than the 
workings of living cells. Scientists can calculate the age of the Sun and predict 
when it will cease to shine, but we cannot explain how it is that a human being 
may live for eighty years but a mouse for only two. We know the complete 
genome sequences of these and many other species, but we still cannot predict 
how a cell will behave if we mutate a previously unstudied gene. Stars may be 
10 43 times bigger, but cells are more complex, more intricately structured, and 
more astonishing products of the laws of physics and chemistry. Through hered- 
ity and natural selection, operating from the beginnings of life on Earth to the 
present day— that is, for about 20% of the age of the universe — living cells have 
been progressively refining and extending their molecular machinery, and 
recording the results of their experiments in the genetic instructions they pass 
on to their progeny. 

With each edition of this book, we marvel at the new information that cell 
biologists have gathered in just a few years. But we are even more amazed and 
daunted at the sophistication of the mechanisms that we encounter. The deeper 
we probe into the cell, the more we realize how much remains to be understood. 
In the days of our innocence, working on the first edition, we hailed the identi- 
fication of a single protein — a signal receptor, say— as a great step forward. Now 
we appreciate that each protein is generally part of a complex with many others, 
working together as a system, regulating one another's activities in subtle ways, 
and held in specific positions by binding to scaffold proteins that give the chem- 
ical factory a definite spatial structure. Genome sequencing has given us virtu- 
ally complete molecular parts-lists for many different organisms; genetics and 
biochemistry have told us a great deal about what those parts are capable of 
individually and which ones interact with which others; but we have only the 
most primitive grasp of the dynamics of these biochemical systems, with all 
their interlocking control loops. Therefore, although there are great achieve- 
ments to report, cell biologists face even greater challenges for the future. 

In this edition, we have included new material on many topics, ranging from 
epigenetics, histone modifications, small RNAs, and comparative genomics, to 
genetic noise, cytoskeletal dynamics, cell-cycle control, apoptosis, stem cells, 
and novel cancer therapies. As in previous editions, we have tried above all to 
give readers a conceptual framework for the mass of information that we now 
have about cells. This means going beyond the recitation of facts. The goal is to 
learn how to put the facts to use — to reason, to predict, and to control the 
behavior of living systems. 

To help readers on the way to an active understanding, we have for the first 
time incorporated end- of- chapter problems, written by lohn Wilson and Tim 
Hunt. These emphasize a quantitative approach and the art of reasoning from 
experiments. A companion volume, Molecular Biology of the Cell Fifth Edition: 
The Problems Book (ISBN 978-0-8153-4110-9), by the same authors, gives com- 
plete answers to these problems and also contains more than 1700 additional 
problems and solutions. 

A further major adjunct to the main book is the attached Media DVD-ROM 
disc. This provides hundreds of movies and animations, including many that are 
new in this edition, showing cells and cellular processes in action and bringing 
the text to life; the disc also now includes all the figures and tables from the main 
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book, pre-loaded into PowerPoint® presentations. Other ancillaries available for 
the book include a bank of test questions and lecture outlines, available to qual- 
ified instructors, and a set of 200 full-color overhead transparencies. 

Perhaps the biggest change is in the physical structure of the book. In an 
effort to make the standard Student Edition somewhat more portable, we are 
providing Chapters 21-25, covering multicellular systems, in electronic (PDF) 
form on the accompanying disc, while retaining in the printed volume Chapters 
1-20, covering the core of the usual cell biology curriculum. But we should 
emphasize that the final chapters have been revised and updated as thoroughly 
as the rest of the book and we sincerely hope that they will be read! A Reference 
Edition (ISBN 978-0-8153-4111-6), containing the full set of chapters as printed 
pages, is also available for those who prefer it. 

Full details of the conventions adopted in the book are given in the Note to 
the Reader that follows this Preface. As explained there, we have taken a drastic 
approach in confronting the different rules for the writing of gene names in dif- 
ferent species: throughout this book, we use the same style, regardless of 
species, and often in defiance of the usual species-specific conventions. 

As always, we are indebted to many people. Full acknowledgments for sci- 
entific help are given separately, but we must here single out some exceptionally 
important contributions: Julie Theriot is almost entirely responsible for Chap- 
ters 16 (Cytoskeleton) and 24 (Pathogens, Infection, and Innate Immunity), and 
David Morgan likewise for Chapter 17 (Cell Cycle). Wallace Marshall and Laura 
Attardi provided substantial help with Chapters 8 and 20, respectively, as did 
Maynard Olson for the genomics section of Chapter 4, XiaodongWang for Chap- 
ter 18, and Nicholas Harberd for the plant section of Chapter 15. 

We also owe a huge debt to the staff of Garland Science and others who 
helped convert writers' efforts into a polished final product. Denise Schanck 
directed the whole enterprise and shepherded the wayward authors along the 
road with wisdom, skill, and kindness. Nigel Orme put the artwork into its final 
form and supervised the visual aspects of the book, including the back cover, with 
his usual flair. Matthew McClements designed the book and its front cover. 
Emma Jeffcock laid out its pages with extraordinary speed and unflappable effi- 
ciency, dealing impeccably with innumerable corrections. Michael Morales man- 
aged the transformation of a mass of animations, video clips, and other materi- 
als into a user-friendly DVD-ROM. Eleanor Lawrence and Sherry Granum 
updated and enlarged the glossary. Jackie Harbor and Sigrid Masson kept us orga- 
nized. Adam Sendroff kept us aware of our readers and their needs and reactions. 
Marjorie Anderson, Bruce Goady, and Sherry Granum combed the text for obscu- 
rities, infelicities, and errors. We thank them all, not only for their professional 
skill and dedication and for efficiency far surpassing our own, but also for their 
unfailing helpfulness and friendship: they have made it a pleasure to work on the 
book. 

Lastly, and with no less gratitude, we thank our spouses, families, friends 
and colleagues. Without their patient, enduring support, we could not have pro- 
duced any of the editions of this book. 
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Chapter 21 Sexual Reproduction: Meiosis, 
Germ Cells, and Fertilization 

OVERVIEW OF SEXUAL REPRODUCTION 

The Haploid Phase in Higher Eucaryotes Is Brief 

Meiosis Creates Genetic Diversity 

Sexual Reproduction Gives Organisms a Competitive Advantage 
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MEIOSIS 
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SPERM 
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Summary 
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A Note to the Reader 



Structure of the Book 

Although the chapters of this book can be read independently of one another, 
they are arranged in a logical sequence of five parts. The first three chapters of 
Part I cover elementary principles and basic biochemistry. They can serve either 
as an introduction for those who have not studied biochemistry or as a refresher 
course for those who have. 

Part II deals with the storage, expression and transmission of genetic infor- 
mation. 

Part III deals with the principles of the main experimental methods for 
investigating cells. It is not necessary to read these two chapters in order to 
understand the later chapters, but a reader will find it a useful reference. 

Part IV discusses the internal organization of the cell. 

Part V follows the behavior of cells in multicellular systems, starting with 
cell-cell junctions and extracellular matrix and concluding with two chapters on 
the immune system. Chapters 21-25 can be found on the Media DVD-ROM 
which is packaged with each book, providing increased portability for students. 

End-of-Chapter Problems 

A selection of problems, written by John Wilson and Tim Hunt, now appears in 
the text at the end of each chapter. The complete solutions to these problems 
can be found in Molecular Biology of the Cell, Fifth Edition: The Problems Book. 

References 

A concise list of selected references is included at the end of each chapter. These 
are arranged in alphabetical order under the main chapter section headings. 
These references frequently include the original papers in which important dis- 
coveries were first reported. Chapter 8 includes several tables giving the dates of 
crucial developments along with the names of the scientists involved. Elsewhere 
in the book the policy has been to avoid naming individual scientists. 

Media Codes 

Media codes are integrated throughout the text to indicate when relevant videos 
and animations are available on the DVD-ROM. The four-letter codes are 
enclosed in brackets and highlighted in color, like this <ATCG>. The interface for 
the Cell Biology Interactive media player on the DVD-ROM contains a window 
where you enter the 4-letter code. When the code is typed into the interface, the 
corresponding media item will load into the media player. 

Glossary Terms 

Throughout the book, boldface type has been used to highlight key terms at the 
point in a chapter where the main discussion of them occurs. Italic is used to set 
off important terms with a lesser degree of emphasis. At the end of the book is 
the expanded glossary, covering technical terms that are part of the common 
currency of cell biology; it is intended as a first resort for a reader who encoun- 
ters an unfamiliar term used without explanation. 

Nomenclature for Genes and Proteins 

Each species has its own conventions for naming genes; the only common fea- 
ture is that they are always set in italics. In some species (such as humans), gene 
names are spelled out all in capital letters; in other species (such as zebrafish), 



case and rest in lower case; or (as in Drosophila) with different combinations of 
upper and lower case, according to whether the first mutant allele to be discov- 
ered gave a dominant or recessive phenotype. Conventions for naming protein 
products are equally varied. 

This typographical chaos drives everyone crazy. It is not just tiresome and 
absurd; it is also unsustainable. We cannot independently define a fresh con- 
vention for each of the next few million species whose genes we may wish to 
study. Moreover, there are many occasions, especially in a book such as this, 
where we need to refer to a gene generically, without specifying the mouse ver- 
sion, the human version, the chick version, or the hippopotamus version, 
because they are all equivalent for the purposes of the discussion. What con- 
vention then should we use? 

We have decided in this book to cast aside the conventions for individual 
species and follow a uniform rule: we write all gene names, like the names of peo- 
ple and places, with the first letter in upper case and the rest in lower case, but all 
in italics, thus: Apc y Bazooka, Cdc2 t Dishevelled, EglL The corresponding protein, 
where it is named after the gene, will be written in the same way, but in roman 
rather than italic letters: Ape, Bazooka, Cdc2, Dishevelled, Egll. When it is neces- 
sary to specify the organism, this can be done with a prefix to the gene name. 

For completeness, we list a few further details of naming rules that we shall 
follow. In some instances an added letter in the gene name is traditionally used 
to distinguish between genes that are related by function or evolution; for those 
genes we put that letter in upper case if it is usual to do so [LacZ, RecA, HoxA4). 
We use no hyphen to separate added letters or numbers from the rest of the 
name. Proteins are more of a problem. Many of them have names in their own 
right, assigned to them before the gene was named. Such protein names take 
many forms, although most of them traditionally begin with a lower-case letter 
(actin, hemoglobin, catalase), like the names of ordinary substances (cheese, 
nylon), unless they are acronyms (such as GFP, for Green Fluorescent Protein, or 
BMP4, for Bone Morphogenetic Protein #4). To force all such protein names into 
a uniform style would do too much violence to established usages, and we shall 
simply write them in the traditional way (actin, GFP, etc.). For the corresponding 
gene names in all these cases, we shall nevertheless follow our standard rule: 
Actin, Hemoglobin, Catalase, Bmp4, Gfp. Occasionally in our book we need to 
highlight a protein name by setting it in italics for emphasis; the intention will 
generally be clear from the context. 

For those who wish to know them, the Table below shows some of the offi- 
cial conventions for individual species— conventions that we shall mostly vio- 
late in this book, in the manner shown. 
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Ancillaries 

Molecular Biology of the Cell, Fifth Edition: The Problems Book 
by John Wilson and Tim Hunt (ISBN: 978-0-8153-4110-9) 

The Problems Book is designed to help students appreciate the ways in which 
experiments and simple calculations can lead to an understanding of how cells 
work. It provides problems to accompany Chapters 1-20 of Molecular Biology of 
the Cell Each chapter of problems is divided into sections that correspond to 
those of the main textbook and review key terms, test for understanding basic 
concepts, and pose research-based problems. Molecular Biology of the Cell, Fifth 
Edition: The Problems Book should be useful for homework assignments and as 
a basis for class discussion. It could even provide ideas for exam questions. Solu- 
tions for all of the problems are provided on the CD-ROM which accompanies 
the book. Solutions for the end-of-chapter problems in the main textbook are 
also found in The Problems Book 

MBoC5 Media DVD-ROM 

The DVD included with every copy of the book contains the figures, tables, and 
micrographs from the book, pre-loaded into PowerPoint® presentations, one for 
each chapter. A separate folder contains individual versions of each figure, table, 
and micrograph in JPEG format. The panels are available in PDF format. There 
are also over 125 videos, animations, molecular structure tutorials, and high-res- 
olution micrographs on the DVD. The authors have chosen to include material 
that not only reinforces basic concepts but also expands the content and scope 
of the book. The multimedia can be accessed either as individual files or through 
the Cell Biology Interactive media player. As discussed above, the media player 
has been programmed to work with the Media Codes integrated throughout the 
book. A complete table of contents and overview of all electronic resources is 
contained in the MBoC5 Media Viewing Guide, a PDF file located on the root 
level of the DVD-ROM and in the Appendix of the media player. The DVD-ROM 
also contains Chapters 21-25 which cover multicellular systems. The chapters 
are in PDF format and can be easily printed or searched using Adobe® Acrobat® 
Reader or other PDF software. 

Teaching Supplements 

Upon request, teaching supplements for Molecular Biology of the Cell are avail- 
able to qualified instructors. 

MBoC5 Transparency Set 

Provides 200 full-color overhead acetate transparencies of the most important 
figures from the book. 

MBoC5 Test Questions 

A selection of test questions will be available. Written by Kirsten Benjamin 
(Amyris Biotechnologies, Emeryville, California) and Linda Huang (University of 
Massachusetts, Boston), these thought questions will test students' understand- 
ing of the chapter material. 

MBoC5 Lecture Outlines 

Lecture outlines created from the concept heads for the text are provided. 

Garland Science Classwire™ 

All of the teaching supplements on the DVD-ROM (these include figures in Pow- 
erPoint and JPEG format; Chapters 21-25 in PDF format; 125 videos, animations, 
and movies) and the test questions and lecture outlines are available to qualified 
instructors online at the Garland Science Classwire™ Web site. Garland Science 
Classwire™ offers access to other instructional resources from all of the Garland 
Science textbooks, and provides free online course management tools. For addi- 
tional information, please visit http://www.classwire.com/garlandscience or 
e-mail science@garland.com. {Classwire is a trademark of Chalkfree, Inc.) 

Adobe and Acrobat are either registered trademarks or trademarks of Adobe Systems Incorporated 
in the United States and/or other countries 

PowerPoint is either a registered trademark or trademark of Microsoft Corporation in the United 
States and/or other countries 



INTRODUCTION TO THE CELL 




Cells and Genomes 



The surface of our planet is populated by living things — curious, intricately 
organized chemical factories that take in matter from their surroundings and 
use these raw materials to generate copies of themselves. The living organisms 
appear extraordinarily diverse. What could be more different than a tiger and a 
piece of seaweed, or a bacterium and a tree? Yet our ancestors, knowing nothing 
of cells or DNA, saw that all these things had something in common. They called 
that something "life," marveled at it, struggled to define it, and despaired of 
explaining what it was or how it worked in terms that relate to nonliving matter. 
The discoveries of the past century have not diminished the marvel — quite 
the contrary. But they have lifted away the mystery as to the nature of life. We can 
now see that all living things are made of cells, and that these units of living mat- 
ter all share the same machinery for their most basic functions. Living things, 
though infinitely varied when viewed from the outside, are fundamentally simi- 
lar inside. The whole of biology is a counterpoint between the two themes: 
astonishing variety in individual particulars; astonishing constancy in funda- 
mental mechanisms. In this first chapter we begin by outlining the universal fea- 
tures common to all life on our planet. We then survey, briefly, the diversity of 
cells. And we see how, thanks to the common code in which the specifications 
for all living organisms are written, it is possible to read, measure, and decipher 
these specifications to achieve a coherent understanding of all the forms of life, 
from the smallest to the greatest. 
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THE UNIVERSAL FEATURES OF CELLS ON EARTH 

It is estimated that there are more than 10 million — perhaps 100 million — living 
species on Earth today. Each species is different, and each reproduces itself 
faithfully, yielding progeny that belong to the same species: the parent organism 
hands down information specifying, in extraordinary detail, the characteristics 
that the offspring shall have. This phenomenon of heredity is central to the def- 
inition of life: it distinguishes life from other processes, such as the growth of a 
crystal, or the burning of a candle, or the formation of waves on water, in which 
orderly structures are generated but without the same type of link between the 
peculiarities of parents and the peculiarities of offspring. Like the candle 
flame, the living organism consumes free energy to create and maintain its 
organization; but the free energy drives a hugely complex system of chemical 
processes that is specified by the hereditary information. 

Most living organisms are single cells; others, such as ourselves, are vast 
multicellular cities in which groups of cells perform specialized functions and 
are linked by intricate systems of communication. But in all cases, whether we 
discuss the solitary bacterium or the aggregate of more than 10 13 cells that form 
a human body, the whole organism has been generated by cell divisions from a 
single cell. The single cell, therefore, is the vehicle for the hereditary information 
that defines the species (Figure 1-1). And specified by this information, the cell 
includes the machinery to gather raw materials from the environment, and to 
construct out of them a new cell in its own image, complete with a new copy of 
the hereditary information. Nothing less than a cell has this capability. 
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Figure 1-1 The hereditary information in the fertilized egg cell determines the nature of the whole multicellular 
organism. (A and B) A sea urchin egg gives rise to a sea urchin. (C and D) A mouse egg gives rise to a mouse. (E and F) An 
egg of the seaweed Fucus gives rise to a Fucus seaweed. (A, courtesy of David McClay; B, courtesy of M. Gibbs, Oxford 
Scientific Films; C, courtesy of Patricia Calarco, from G. Martin, Science 209:768-776, 1980. With permission from AAAS; 
D, courtesy of O. Newman, Oxford Scientific Films; E and F, courtesy of Colin Brownlee.) 



All Cells Store Their Hereditary Information in the Same Linear 
Chemical Code (DNA) 

Computers have made us familiar with the concept of information as a measur- 
able quantity — a million bytes (to record a few hundred pages of text or an image 
from a digital camera), 600 million for the music on a CD, and so on. They have 
also made us well aware that the same information can be recorded in many dif- 
ferent physical forms. As the computer world has evolved, the discs and tapes 
that we used 10 years ago for our electronic archives have become unreadable on 
present-day machines. Living cells, like computers, deal in information, and it is 
estimated that they have been evolving and diversifying for over 3.5 billion years. 
It is scarcely to be expected that they should all store their information in the 
same form, or that the archives of one type of cell should be readable by the infor- 
mation-handling machinery of another. And yet it is so. All living cells on Earth, 
without any known exception, store their hereditary information in the form of 
double-stranded molecules of DNA — long unbranched paired polymer chains, 
formed always of the same four types of monomers. These monomers have nick- 
names drawn from a four-letter alphabet — A, T, C, G — and they are strung 
together in a long linear sequence that encodes the genetic information, just as 
the sequence of Is and 0s encodes the information in a computer file. We can take 
a piece of DNA from a human cell and insert it into a bacterium, or a piece of bac- 
terial DNA and insert it into a human cell, and the information will be success- 
fully read, interpreted, and copied. Using chemical methods, scientists can read 
out the complete sequence of monomers in any DNA molecule — extending for 
millions of nucleotides — and thereby decipher the hereditary information that 
each organism contains. 
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All Cells Replicate Their Hereditary Information byTemplated 
Polymerization 

The mechanisms that make life possible depend on the structure of the double- 
stranded DNA molecule. Each monomer in a single DNA strand — that is, each 
nucleotide — consists of two parts: a sugar (deoxyribose) with a phosphate 
group attached to it, and a base, which may be either adenine (A), guanine (G), 
cytosine (C) or thymine (T) (Figure 1-2). Each sugar is linked to the next via the 
phosphate group, creating a polymer chain composed of a repetitive sugar- 
phosphate backbone with a series of bases protruding from it. The DNA polymer 
is extended by adding monomers at one end. For a single isolated strand, these 
can, in principle, be added in any order, because each one links to the next in the 
same way, through the part of the molecule that is the same for all of them. In 
the living cell, however, DNA is not synthesized as a free strand in isolation, but 
on a template formed by a preexisting DNA strand. The bases protruding from 
the existing strand bind to bases of the strand being synthesized, according to a 
strict rule defined by the complementary structures of the bases: A binds to T, 
and C binds to G. This base-pairing holds fresh monomers in place and thereby 
controls the selection of which one of the four monomers shall be added to the 
growing strand next. In this way, a double-stranded structure is created, consist- 
ing of two exactly complementary sequences of As, Cs, Ts, and Gs. The two 
strands twist around each other, forming a double helix (Figure 1-2E). 
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Figure 1-2 DNA and its building blocks. (A) DNA is made from simple subunits, called nucleotides, each consisting of a sugar-phosphate 
molecule with a nitrogen-containing sidegroup, or base, attached to it. The bases are of four types (adenine, guanine, cytosine, and thymine), 
corresponding to four distinct nucleotides, labeled A, G, C, and T. (B) A single strand of DNA consists of nucleotides joined together by sugar- 
phosphate linkages. Note that the individual sugar-phosphate units are asymmetric, giving the backbone of the strand a definite directionality, 
or polarity. This directionality guides the molecular processes by which the information in DNA is interpreted and copied in cells: the 
information is always "read" in a consistent order, just as written English text is read from left to right. (C) Through templated polymerization, 
the sequence of nucleotides in an existing DNA strand controls the sequence in which nucleotides are joined together in a new DNA strand; 
T in one strand pairs with A in the other, and G in one strand with C in the other. The new strand has a nucleotide sequence complementary to 
that of the old strand, and a backbone with opposite directionality: corresponding to the GTAA... of the original strand, it has ...TTAC. 
(D) A normal DNA molecule consists of two such complementary strands. The nucleotides within each strand are linked by strong (covalent) 
chemical bonds; the complementary nucleotides on opposite strands are held together more weakly, by hydrogen bonds. (E) The two strands 
twist around each other to form a double helix — a robust structure that can accommodate any sequence of nucleotides without altering its 
basic structure. 
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The bonds between the base pairs are weak compared with the sugar-phos- 
phate links, and this allows the two DNA strands to be pulled apart without 
breakage of their backbones. Each strand then can serve as a template, in the 
way just described, for the synthesis of a fresh DNA strand complementary to 
itself — a fresh copy, that is, of the hereditary information (Figure 1-3). In differ- 
ent types of cells, this process of DNA replication occurs at different rates, with 
different controls to start it or stop it, and different auxiliary molecules to help it 
along. But the basics are universal: DNA is the information store, and templated 
polymerization is the way in which this information is copied throughout the 
living world. 



Figure 1 -3 The copying of genetic 
information by DNA replication. In this 
process, the two strands of a DNA double 
helix are pulled apart, and each serves as 
a template for synthesis of a new 
complementary strand. 



All Cells Transcribe Portions of Their Hereditary Information into 
the Same Intermediary Form (RNA) 

To carry out its information-bearing function, DNA must do more than copy 
itself. It must also express its information, by letting it guide the synthesis of 
other molecules in the cell. This also occurs by a mechanism that is the same 
in all living organisms, leading first and foremost to the production of two other 
key classes of polymers: RNAs and proteins. The process (discussed in detail in 
Chapters 6 and 7) begins with a templated polymerization called transcription, 
in which segments of the DNA sequence are used as templates for the synthesis 
of shorter molecules of the closely related polymer ribonucleic acid, or RNA. 
Later, in the more complex process of translation, many of these RNA molecules 
direct the synthesis of polymers of a radically different chemical class — the pro- 
teins (Figure 1-4). 

In RNA, the backbone is formed of a slightly different sugar from that of 
DNA — ribose instead of deoxyribose — and one of the four bases is slightly dif- 
ferent — uracil (U) in place of thymine (T); but the other three bases — A, C, and 
G — are the same, and all four bases pair with their complementary counterparts 
in DNA— the A, U, C, and G of RNA with the T, A, G, and C of DNA. During tran- 
scription, RNA monomers are lined up and selected for polymerization on a 
template strand of DNA, just as DNA monomers are selected during replication. 
The outcome is a polymer molecule whose sequence of nucleotides faithfully 
represents a part of the cell's genetic information, even though written in a 
slightly different alphabet, consisting of RNA monomers instead of DNA 
monomers. 

The same segment of DNA can be used repeatedly to guide the synthesis of 
many identical RNA transcripts. Thus, whereas the cell's archive of genetic infor- 
mation in the form of DNA is fixed and sacrosanct, the RNA transcripts are 
mass-produced and disposable (Figure 1-5). As we shall see, these transcripts 
function as intermediates in the transfer of genetic information: they mainly 
serve as messenger RNA (mRNA) to guide the synthesis of proteins according to 
the genetic instructions stored in the DNA. 

RNA molecules have distinctive structures that can also give them other spe- 
cialized chemical capabilities. Being single- stranded, their backbone is flexible, 
so that the polymer chain can bend back on itself to allow one part of the 
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Figure 1-4 From DNA to protein. 

Genetic information is read out and put 
to use through a two-step process. First, 
in transcription, segments of the DNA 
sequence are used to guide the synthesis 
of molecules of RNA. Then, in translation, 
the RNA molecules are used to guide the 
synthesis of molecules of protein. 
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molecule to form weak bonds with another part of the same molecule. This 
occurs when segments of the sequence are locally complementary: a ...GGGG... 
segment, for example, will tend to associate with a ...CCCC... segment. These 
types of internal associations can cause an RNA chain to fold up into a specific 
shape that is dictated by its sequence (Figure 1-6). The shape of the RNA 
molecule, in turn, may enable it to recognize other molecules by binding to them 
selectively — and even, in certain cases, to catalyze chemical changes in the 
molecules that are bound. As we see in Chapter 6, a few chemical reactions cat- 
alyzed by RNA molecules are crucial for several of the most ancient and funda- 
mental processes in living cells, and it has been suggested that more extensive 
catalysis by RNA played a central part in the early evolution of life. 



Figure 1-5 How genetic information is 
broadcast for use inside the cell. Each 
cell contains a fixed set of DNA 
molecules — its archive of genetic 
information. A given segment of this DNA 
guides the synthesis of many identical 
RNA transcripts, which serve as working 
copies of the information stored in the 
archive. Many different sets of RNA 
molecules can be made by transcribing 
selected parts of a long DNA sequence, 
allowing each cell to use its information 
store differently. 



All Cells Use Proteins as Catalysts 

Protein molecules, like DNA and RNA molecules, are long unbranched polymer 
chains, formed by stringing together monomeric building blocks drawn from a 
standard repertoire that is the same for all living cells. Like DNA and RNA, they 
carry information in the form of a linear sequence of symbols, in the same way 
as a human message written in an alphabetic script. There are many different 
protein molecules in each cell, and — leaving out the water — they form most of 
the cell's mass. 

The monomers of protein, the amino acids, are quite different from those of 
DNA and RNA, and there are 20 types, instead of 4. Each amino acid is built 
around the same core structure through which it can be linked in a standard way 
to any other amino acid in the set; attached to this core is a side group that gives 
each amino acid a distinctive chemical character. Each of the protein molecules, 
or polypeptides, created by joining amino acids in a particular sequence folds 
into a precise three-dimensional form with reactive sites on its surface (Figure 
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Figure 1 -6 The conformation of an RNA 
molecule. (A) Nucleotide pairing 
between different regions of the same 
RNA polymer chain causes the molecule 
to adopt a distinctive shape. (B) The 
three-dimensional structure of an actual 
RNA molecule, from hepatitis delta virus, 
that catalyzes RNA strand cleavage. The 
blue ribbon represents the sugar- 
phosphate backbone; the bars represent 
base pairs. (B, based on A.R. Ferre 
D'Amare, K. Zhou and J.A. Doudna, Nature 
395:567-574, 1998. With permission from 
Macmillan Publishers Ltd.) 
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Figure 1-7 How a protein molecule acts as catalyst for a chemical reaction. 

(A) In a protein molecule the polymer chain folds up to into a specific shape 
defined by its amino acid sequence. A groove in the surface of this particular 
folded molecule, the enzyme lysozyme, forms a catalytic site. (B) A polysaccharide 
molecule (red) — a polymer chain of sugar monomers — binds to the catalytic site 
of lysozyme and is broken apart, as a result of a covalent bond-breaking reaction 
catalyzed by the amino acids lining the groove. 



1-7 A). These amino acid polymers thereby bind with high specificity to other 
molecules and act as enzymes to catalyze reactions that make or break covalent 
bonds. In this way they direct the vast majority of chemical processes in the cell 
(Figure 1-7B). Proteins have many other functions as well — maintaining struc- 
tures, generating movements, sensing signals, and so on — each protein 
molecule performing a specific function according to its own genetically speci- 
fied sequence of amino acids. Proteins, above all, are the molecules that put the 
cell's genetic information into action. 

Thus, polynucleotides specify the amino acid sequences of proteins. Pro- 
teins, in turn, catalyze many chemical reactions, including those by which new 
DNA molecules are synthesized, and the genetic information in DNA is used to 
make both RNA and proteins. This feedback loop is the basis of the auto catalytic, 
self- reproducing behavior of living organisms (Figure 1-8). 



All Cells Translate RNA into Protein in the Same Way 

The translation of genetic information from the 4-letter alphabet of polynu- 
cleotides into the 20-letter alphabet of proteins is a complex process. The rules 
of this translation seem in some respects neat and rational, in other respects 
strangely arbitrary, given that they are (with minor exceptions) identical in all 
living things. These arbitrary features, it is thought, reflect frozen accidents in 
the early history of life — chance properties of the earliest organisms that were 
passed on by heredity and have become so deeply embedded in the constitution 
of all living cells that they cannot be changed without disastrous effects. 

The information in the sequence of a messenger RNA molecule is read out in 
groups of three nucleotides at a time: each triplet of nucleotides, or codon, speci- 
fies (codes for) a single amino acid in a corresponding protein. Since there are 64 
(=4x4x4) possible codons, all of which occur in nature, but only 20 amino acids, 
there are necessarily many cases in which several codons correspond to the same 
amino acid. The code is read out by a special class of small RNA molecules, the 
transfer RNAs (tRNAs). Each type of tRNA becomes attached at one end to a spe- 
cific amino acid, and displays at its other end a specific sequence of three 
nucleotides — an anticodon — that enables it to recognize, through base-pairing, a 
particular codon or subset of codons in mRNA (Figure 1-9). 

For synthesis of protein, a succession of tRNA molecules charged with their 
appropriate amino acids have to be brought together with an mRNA molecule and 
matched up by base-pairing through their anticodons with each of its successive 
codons. The amino acids then have to be linked together to extend the growing 
protein chain, and the tRNAs, relieved of their burdens, have to be released. This 
whole complex of processes is carried out by a giant multimolecular machine, 
the ribosome, formed of two main chains of RNA, called ribosomal RNAs 
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Figure 1-8 Life as an autocatalytic 
process. Polynucleotides (nucleotide 
polymers) and proteins (amino acid 
polymers) provide the sequence 
information and the catalytic functions 
that serve — through a complex set of 
chemical reactions — to bring about the 
synthesis of more polynucleotides and 
proteins of the same types. 
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(rRNAs), and more than 50 different proteins. This evolutionarily ancient molec- 
ular juggernaut latches onto the end of an mRNA molecule and then trundles 
along it, capturing loaded tRNA molecules and stitching together the amino 
acids they carry to form a new protein chain (Figure 1-10). 



The Fragment of Genetic Information Corresponding to One 
Protein Is One Gene 



DNA molecules as a rule are very large, containing the specifications for thou- 
sands of proteins. Individual segments of the entire DNA sequence are tran- 
scribed into separate mRNA molecules, with each segment coding for a different 
protein. Each such DNA segment represents one gene. A complication is that 
RNA molecules transcribed from the same DNA segment can often be processed 
in more than one way, so as to give rise to a set of alternative versions of a pro- 
tein, especially in more complex cells such as those of plants and animals. A 
gene therefore is defined, more generally, as the segment of DNA sequence cor- 
responding to a single protein or set of alternative protein variants (or to a sin- 
gle catalytic or structural RNA molecule for those genes that produce RNA but 
not protein). 

In all cells, the expression of individual genes is regulated: instead of manufac- 
turing its full repertoire of possible proteins at full tilt all the time, the cell adjusts 
the rate of transcription and translation of different genes independently, accord- 
ing to need. Stretches of regulatory DNA are interspersed among the segments 
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Figure 1 -9 Transfer RNA. (A) A tRNA 
molecule specific for the amino acid 
tryptophan. One end of the tRNA 
molecule has tryptophan attached to it, 
while the other end displays the triplet 
nucleotide sequence CCA (its anticodon), 
which recognizes the tryptophan codon 
in messenger RNA molecules. (B) The 
three-dimensional structure of the 
tryptophan tRNA molecule. Note that the 
codon and the anticodon in (A) are in 
antiparallel orientations, like the two 
strands in a DNA double helix (see Figure 
1 -2), so that the sequence of the 
anticodon in the tRNA is read from right 
to left, while that of the codon in the 
mRNA is read from left to right. 
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Figure 1 -1 A ribosome at work. (A) The diagram shows how a ribosome 
moves along an mRNA molecule, capturing tRNA molecules that match the 
codons in the mRNA and using them to join amino acids into a protein 
chain. The mRNA specifies the sequence of amino acids. (B) The three- 
dimensional structure of a bacterial ribosome (pale green and blue), moving 
along an mRNA molecule (orange beadsj, with three tRNA molecules 
(yellow, green, and pink) at different stages in their process of capture and 
release. The ribosome is a giant assembly of more than 50 individual 
protein and RNA molecules. (B, courtesy of Joachim Frank, Yanhong Li and 
Rajendra Agarwal.) 

that code for protein, and these noncoding regions bind to special protein 
molecules that control the local rate of transcription (Figure 1-11). Other non- 
coding DNA is also present, some of it serving, for example, as punctuation, 
defining where the information for an individual protein begins and ends. The 
quantity and organization of the regulatory and other noncoding DNA vary 
widely from one class of organisms to another, but the basic strategy is univer- 
sal. In this way, the genome of the cell — that is, the total of its genetic informa- 
tion as embodied in its complete DNA sequence — dictates not only the nature of 
the cell's proteins, but also when and where they are to be made. 



Life Requires Free Energy 

A living cell is a dynamic chemical system, operating far from chemical equilib- 
rium. For a cell to grow or to make a new cell in its own image, it must take in 
free energy from the environment, as well as raw materials, to drive the neces- 
sary synthetic reactions. This consumption of free energy is fundamental to life. 
When it stops, a cell decays towards chemical equilibrium and soon dies. 

Genetic information is also fundamental to life. Is there any connection? The 
answer is yes: free energy is required for the propagation of information. For 
example, to specify one bit of information — that is, one yes/no choice between 
two equally probable alternatives — costs a defined amount of free energy that 
can be calculated. The quantitative relationship involves some deep reasoning 
and depends on a precise definition of the term "free energy," discussed in 
Chapter 2. The basic idea, however, is not difficult to understand intuitively. 

Picture the molecules in a cell as a swarm of objects endowed with thermal 
energy, moving around violently at random, buffeted by collisions with one 
another. To specify genetic information — in the form of a DNA sequence, for 
example — molecules from this wild crowd must be captured, arranged in a spe- 
cific order defined by some preexisting template, and linked together in a fixed 
relationship. The bonds that hold the molecules in their proper places on the 
template and join them together must be strong enough to resist the disorder- 
ing effect of thermal motion. The process is driven forward by consumption of 
free energy, which is needed to ensure that the correct bonds are made, and 
made robustly. In the simplest case, the molecules can be compared with 
spring-loaded traps, ready to snap into a more stable, lower-energy attached 
state when they meet their proper partners; as they snap together into the 
bonded arrangement, their available stored energy — their free energy — like the 
energy of the spring in the trap, is released and dissipated as heat. In a cell, the 
chemical processes underlying information transfer are more complex, but the 
same basic principle applies: free energy has to be spent on the creation of order. 

To replicate its genetic information faithfully, and indeed to make all its 
complex molecules according to the correct specifications, the cell therefore 
requires free energy, which has to be imported somehow from the surroundings. 



All Cells Function as Biochemical Factories Dealing with the Same 
Basic Molecular Building Blocks 

Because all cells make DNA, RNA, and protein, and these macromolecules are 
composed of the same set of subunits in every case, all cells have to contain and 
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manipulate a similar collection of small molecules, including simple sugars, 
nucleotides, and amino acids, as well as other substances that are universally 
required for their synthesis. All cells, for example, require the phosphorylated 
nucleotide ATP (adenosine triphosphate) as a building block for the synthesis of 
DNA and RNA; and all cells also make and consume this molecule as a carrier of 
free energy and phosphate groups to drive many other chemical reactions. 

Although all cells function as biochemical factories of a broadly similar type, 
many of the details of their small-molecule transactions differ, and it is not as 
easy as it is for the informational macromolecules to point out the features that 
are strictly universal. Some organisms, such as plants, require only the simplest 
of nutrients and harness the energy of sunlight to make from these almost all 
their own small organic molecules; other organisms, such as animals, feed on 
living things and obtain many of their organic molecules ready-made. We return 
to this point below. 



All Cells Are Enclosed in a Plasma Membrane Across Which 
Nutrients and Waste Materials Must Pass 



There is, however, at least one other feature of cells that is universal: each one is 
enclosed by a membrane — the plasma membrane. This container acts as a 
selective barrier that enables the cell to concentrate nutrients gathered from its 
environment and retain the products it synthesizes for its own use, while excret- 
ing its waste products. Without a plasma membrane, the cell could not maintain 
its integrity as a coordinated chemical system. 

The molecules forming this membrane have the simple physico-chemical 
property of being amphiphilic — that is, consisting of one part that is hydropho- 
bic (water-insoluble) and another part that is hydrophilic (water-soluble). Such 
molecules placed in water aggregate spontaneously, arranging their hydropho- 
bic portions to be as much in contact with one another as possible to hide them 
from the water, while keeping their hydrophilic portions exposed. Amphiphilic 
molecules of appropriate shape, such as the phospholipid molecules that com- 
prise most of the plasma membrane, spontaneously aggregate in water to form a 
bilayer that creates small closed vesicles (Figure 1-12). The phenomenon can be 
demonstrated in a test tube by simply mixing phospholipids and water together; 
under appropriate conditions, small vesicles form whose aqueous contents are 
isolated from the external medium. 

Although the chemical details vary, the hydrophobic tails of the predomi- 
nant membrane molecules in all cells are hydrocarbon polymers 
(-CH2-CH2-CH2-), and their spontaneous assembly into a bilayered vesicle is 
but one of many examples of an important general principle: cells produce 
molecules whose chemical properties cause them to self-assemble into the 
structures that a cell needs. 

The cell boundary cannot be totally impermeable. If a cell is to grow and 
reproduce, it must be able to import raw materials and export waste across its 
plasma membrane. All cells therefore have specialized proteins embedded in 
their membrane that transport specific molecules from one side to the other 
(Figure 1-13). Some of these membrane transport proteins, like some of the pro- 
teins that catalyze the fundamental small-molecule reactions inside the cell, 



Figure 1-1 1 Gene regulation by protein binding to regulatory DNA. 

(A) A diagram of a small portion of the genome of the bacterium Escherichia 
coli, containing genes (called Lad, LacZ, LacY, and LacA) coding for four 
different proteins. The protein-coding DNA segments (red) have regulatory 
and other noncoding DNA segments (yellow) between them. (B) An electron 
micrograph of DNA from this region, with a protein molecule (encoded by 
the Lad gene) bound to the regulatory segment; this protein controls the 
rate of transcription of the LacZ, LacY, and LacA genes. (C) A drawing of the 
structures shown in (B). (B, courtesy of Jack Griffith.) 
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have been so well preserved over the course of evolution that we can recognize 
the family resemblances between them in comparisons of even the most dis- 
tantly related groups of living organisms. 

The transport proteins in the membrane largely determine which molecules 
enter the cell, and the catalytic proteins inside the cell determine the reactions 
that those molecules undergo. Thus, by specifying the proteins that the cell is to 
manufacture, the genetic information recorded in the DNA sequence dictates 
the entire chemistry of the cell; and not only its chemistry, but also its form and 
its behavior, for these too are chiefly constructed and controlled by the cell's pro- 
teins. 



A Living Cell Can Exist with Fewer Than 500 Genes 

The basic principles of biological information transfer are simple enough, but 
how complex are real living cells? In particular, what are the minimum require- 
ments? We can get a rough indication by considering a species that has one of 
the smallest known genomes — the bacterium Mycoplasma genitalium (Figure 
1-14). This organism lives as a parasite in mammals, and its environment pro- 
vides it with many of its small molecules ready-made. Nevertheless, it still has to 
make all the large molecules — DNA, RNAs, and proteins — required for the basic 
processes of heredity. It has only about 480 genes in its genome of 580,070 
nucleotide pairs, representing 145,018 bytes of information — about as much as 
it takes to record the text of one chapter of this book. Cell biology may be com- 
plicated, but it is not impossibly so. 

The minimum number of genes for a viable cell in today's environments is 
probably not less than 200-300, although there are only about 60 genes in the 
core set shared by all living species without any known exception. 
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Figure 1-12 Formation of a membrane 
by amphiphilic phospholipid molecules. 

These have a hydrophilic (water-loving, 
phosphate) head group and a 
hydrophobic (water-avoiding, 
hydrocarbon) tail. At an interface 
between oil and water, they arrange 
themselves as a single sheet with their 
head groups facing the water and their 
tail groups facing the oil. When immersed 
in water, they aggregate to form bilayers 
enclosing aqueous compartments. 
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Figure 1-13 Membrane transport proteins. (A) Structure of a molecule of 
bacteriorhodopsin, from the archaeon (archaebacterium) Halobacterium 
halobium. This transport protein uses the energy of absorbed light to 
pump protons (H + ions) out of the cell. The polypeptide chain threads to 
and fro across the membrane; in several regions it is twisted into a helical 
conformation, and the helical segments are arranged to form the walls of a 
channel through which ions are transported. (B) Diagram of the set of 
transport proteins found in the membrane of the bacterium Thermotoga 
maritima.Jhe numbers in parentheses refer to the number of different 
membrane transport proteins of each type. Most of the proteins within 
each class are evolutionary related to one another and to their 
counterparts in other species. 
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Figure 1-14 Mycoplasma genitalium. (A) Scanning electron micrograph 
showing the irregular shape of this small bacterium, reflecting the lack of 
any rigid wall. (B) Cross section (transmission electron micrograph) of a 
Mycoplasma cell. Of the 477 genes of Mycoplasma genitalium, 37 code for 
transfer, ribosomal, and other nonmessenger RNAs. Functions are known, 
or can be guessed, for 297 of the genes coding for protein: of these, 
1 53 are involved in replication, transcription, translation, and related 
processes involving DNA, RNA, and protein; 29 in the membrane and 
surface structures of the cell; 33 in the transport of nutrients and other 
molecules across the membrane; 71 in energy conversion and the 
synthesis and degradation of small molecules; and 1 1 in the regulation of 
cell division and other processes. (A, from S. Razin et al., Infect Immun. 
30:538-546, 1 980. With permission from the American Society for 
Microbiology; B, courtesy of Roger Cole, in Medical Microbiology, 4th ed. 
[S. Baron ed.]. Galveston: University of Texas Medical Branch, 1996.) 



Summary 

Living organisms reproduce themselves by transmitting genetic information to their 
progeny. The individual cell is the minimal self-reproducing unit, and is the vehicle for 
transmission of the genetic information in all living species. Every cell on our planet 
stores its genetic information in the same chemical form — as double-stranded DNA. The 
cell replicates its information by separating the paired DNA strands and using each as 
a template for polymerization to make a new DNA strand with a complementary 
sequence of nucleotides. The same strategy of templated polymerization is used to 
transcribe portions of the information from DNA into molecules of the closely related 
polymer, RNA. These in turn guide the synthesis of protein molecules by the more com- 
plex machinery of translation, involving a large multimolecular machine, the ribo- 
some, which is itself composed of RNA and protein. Proteins are the principal catalysts 
for almost all the chemical reactions in the cell; their other functions include the selec- 
tive import and export of small molecules across the plasma membrane that forms the 
cell's boundary. The specific function of each protein depends on its amino acid 
sequence, which is specified by the nucleotide sequence of a corresponding segment of 
the DNA — the gene that codes for that protein. In this way, the genome of the cell deter- 
mines its chemistry; and the chemistry of every living cell is fundamentally similar, 
because it must provide for the synthesis of DNA, RNA, and protein. The simplest 
known cells have just under 500 genes. 




THE DIVERSITY OF GENOMES AND THE TREE 
OF LIFE 

The success of living organisms based on DNA, RNA, and protein, out of the 
infinitude of other chemical forms that we might conceive of, has been spectac- 
ular. They have populated the oceans, covered the land, infiltrated the Earth's 
crust, and molded the surface of our planet. Our oxygen-rich atmosphere, the 
deposits of coal and oil, the layers of iron ores, the cliffs of chalk and limestone 
and marble — all these are products, directly or indirectly, of past biological 
activity on Earth. 

Living things are not confined to the familiar temperate realm of land, water, 
and sunlight inhabited by plants and plant-eating animals. They can be found in 
the darkest depths of the ocean, in hot volcanic mud, in pools beneath the 
frozen surface of the Antarctic, and buried kilometers deep in the Earth's crust. 
The creatures that live in these extreme environments are generally unfamiliar, 
not only because they are inaccessible, but also because they are mostly micro- 
scopic. In more homely habitats, too, most organisms are too small for us to see 
without special equipment: they tend to go unnoticed, unless they cause a dis- 
ease or rot the timbers of our houses. Yet microorganisms make up most of the 
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total mass of living matter on our planet. Only recently, through new methods of 
molecular analysis and specifically through the analysis of DNA sequences, have 
we begun to get a picture of life on Earth that is not grossly distorted by our 
biased perspective as large animals living on dry land. 

In this section we consider the diversity of organisms and the relationships 
among them. Because the genetic information for every organism is written in 
the universal language of DNA sequences, and the DNA sequence of any given 
organism can be obtained by standard biochemical techniques, it is now possi- 
ble to characterize, catalogue, and compare any set of living organisms with ref- 
erence to these sequences. From such comparisons we can estimate the place of 
each organism in the family tree of living species — the 'tree of life'. But before 
describing what this approach reveals, we need first to consider the routes by 
which cells in different environments obtain the matter and energy they require 
to survive and proliferate, and the ways in which some classes of organisms 
depend on others for their basic chemical needs. 



Cells Can Be Powered by a Variety of Free Energy Sources 

Living organisms obtain their free energy in different ways. Some, such as ani- 
mals, fungi, and the bacteria that live in the human gut, get it by feeding on 
other living things or the organic chemicals they produce; such organisms are 
called organotrophic (from the Greek word trophe, meaning "food"). Others 
derive their energy directly from the nonliving world. These fall into two 
classes: those that harvest the energy of sunlight, and those that capture their 
energy from energy- rich systems of inorganic chemicals in the environment 
(chemical systems that are far from chemical equilibrium). Organisms of the 
former class are called phototrophic (feeding on sunlight); those of the latter 
are called lithotrophic (feeding on rock). Organotrophic organisms could not 
exist without these primary energy converters, which are the most plentiful 
form of life. 

Phototrophic organisms include many types of bacteria, as well as algae and 
plants, on which we — and virtually all the living things that we ordinarily see 
around us — depend. Phototrophic organisms have changed the whole chem- 
istry of our environment: the oxygen in the Earth's atmosphere is a by-product 
of their biosynthetic activities. 

Lithotrophic organisms are not such an obvious feature of our world, 
because they are microscopic and mostly live in habitats that humans do not 
frequent — deep in the ocean, buried in the Earth's crust, or in various other 
inhospitable environments. But they are a major part of the living world, and are 
especially important in any consideration of the history of life on Earth. 

Some lithotrophs get energy from aerobic reactions, which use molecular 
oxygen from the environment; since atmospheric O2 is ultimately the product of 
living organisms, these aerobic lithotrophs are, in a sense, feeding on the prod- 
ucts of past life. There are, however, other lithotrophs that live anaerobically, in 
places where little or no molecular oxygen is present, in circumstances similar to 
those that must have existed in the early days of life on Earth, before oxygen had 
accumulated. 

The most dramatic of these sites are the hot hydrothermal vents found deep 
down on the floor of the Pacific and Atlantic Oceans, in regions where the ocean 
floor is spreading as new portions of the Earth's crust form by a gradual 
upwelling of material from the Earth's interior (Figure 1-15). Downward-perco- 
lating seawater is heated and driven back upward as a submarine geyser, carry- 
ing with it a current of chemicals from the hot rocks below. A typical cocktail 
might include H 2 S, H 2 , CO, Mn 2+ , Fe 2+ , Ni 2+ , CH 2 , NH 4 + , and phosphorus-con- 
taining compounds. A dense population of microbes lives in the neighborhood 
of the vent, thriving on this austere diet and harvesting free energy from reac- 
tions between the available chemicals. Other organisms — clams, mussels, and 
giant marine worms — in turn live off the microbes at the vent, forming an entire 
ecosystem analogous to the system of plants and animals that we belong to, but 
powered by geochemical energy instead of light (Figure 1-16). 
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Some Cells Fix Nitrogen and Carbon Dioxide for Others 

To make a living cell requires matter, as well as free energy. DNA, RNA, and protein 
are composed of just six elements: hydrogen, carbon, nitrogen, oxygen, sulfur, and 
phosphorus. These are all plentiful in the nonliving environment, in the Earth's 
rocks, water, and atmosphere, but not in chemical forms that allow easy incor- 
poration into biological molecules. Atmospheric N 2 and CO2, in particular, are 
extremely unreactive, and a large amount of free energy is required to drive the 
reactions that use these inorganic molecules to make the organic compounds 
needed for further biosynthesis — that is, to fix nitrogen and carbon dioxide, so 
as to make N and C available to living organisms. Many types of living cells lack 
the biochemical machinery to achieve this fixation, and rely on other classes of 
cells to do the job for them. We animals depend on plants for our supplies of 
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Figure 1 -1 5 The geology of a hot 
hydrothermal vent in the ocean floor. 

Water percolates down toward the hot 
molten rock upwelling from the Earth's 
interior and is heated and driven back 
upward, carrying minerals leached from 
the hot rock. A temperature gradient is 
set up, from more than 350°C near the 
core of the vent, down to 2-3°C in the 
surrounding ocean. Minerals precipitate 
from the water as it cools, forming a 
chimney. Different classes of organisms, 
thriving at different temperatures, live in 
different neighborhoods of the chimney. 
A typical chimney might be a few meters 
tall, with a flow rate of 1-2 m/sec. 
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Figure 1-16 Living organisms at a hot 
hydrothermal vent. Close to the vent, at 
temperatures up to about 120°C, various 
lithotrophic species of bacteria and 
archaea (archaebacteria) live, directly 
fuelled by geochemical energy. A little 
further away, where the temperature is 
lower, various invertebrate animals live 
by feeding on these microorganisms. 
Most remarkable are the giant (2-meter) 
tube worms, which, rather than feed on 
the lithotrophic cells, live in symbiosis 
with them: specialized organs in the 
worms harbor huge numbers of 
symbiotic sulfur-oxidizing bacteria. These 
bacteria harness geochemical energy and 
supply nourishment to their hosts, which 
have no mouth, gut, or anus. The 
dependence of the tube worms on the 
bacteria for the harnessing of geothermal 
energy is analogous to the dependence 
of plants on chloroplasts for the 
harnessing of solar energy, discussed 
later in this chapter. The tube worms, 
however, are thought to have evolved 
from more conventional animals, and to 
have become secondarily adapted to life 
at hydrothermal vents. (Courtesy of 
Dudley Foster, Woods Hole 
Oceanographic Institution.) 
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organic carbon and nitrogen compounds. Plants in turn, although they can fix 
carbon dioxide from the atmosphere, lack the ability to fix atmospheric nitro- 
gen, and they depend in part on nitrogen-fixing bacteria to supply their need for 
nitrogen compounds. Plants of the pea family, for example, harbor symbiotic 
nitrogen-fixing bacteria in nodules in their roots. 

Living cells therefore differ widely in some of the most basic aspects of their 
biochemistry. Not surprisingly, cells with complementary needs and capabilities 
have developed close associations. Some of these associations, as we see below, 
have evolved to the point where the partners have lost their separate identities 
altogether: they have joined forces to form a single composite cell. 



micrometers in linear dimension, 
there are also some giant species. 
An extreme example (not shown) 
is the cigar-shaped bacterium 
Epulopiscium fishelsoni, which lives 
in the gut of a surgeonfish and 
can be up to 600 Jim long. 



The Greatest Biochemical Diversity Exists Among Procaryotic 
Cells 

From simple microscopy, it has long been clear that living organisms can be 
classified on the basis of cell structure into two groups: the eucaryotes and the 
procaryotes. Eucaryotes keep their DNA in a distinct membrane-enclosed intra- 
cellular compartment called the nucleus. (The name is from the Greek, meaning 
"truly nucleated," from the words eu, "well" or "truly," and karyon, "kernel" or 
"nucleus".) Procaryotes have no distinct nuclear compartment to house their 
DNA. Plants, fungi, and animals are eucaryotes; bacteria are procaryotes, as are 
archaea — a separate class of procaryotic cells, discussed below. 

Most procaryotic cells are small and simple in outward appearance (Figure 
1-17), and they live mostly as independent individuals or in loosely organized 
communities, rather than as multicellular organisms. They are typically spherical 
or rod-shaped and measure a few micrometers in linear dimension. They often 
have a tough protective coat, called a cell wall, beneath which a plasma mem- 
brane encloses a single cytoplasmic compartment containing DNA, RNA, pro- 
teins, and the many small molecules needed for life. In the electron microscope, 
this cell interior appears as a matrix of varying texture without any discernible 
organized internal structure (Figure 1-18). 

Figure 1-1 8 The structure of a bacterium. (A) The bacterium Vibrio 
cholerae, showing its simple internal organization. Like many other species, 
Vibrio has a helical appendage at one end — a flagellum — that rotates as a 
propeller to drive the cell forward. (B) An electron micrograph of a 
longitudinal section through the widely studied bacterium Escherichia coli 
(E. coli). This is related to Vibrio but has many flagella (not visible in this 
section) distributed over its surface. The cell's DNA is concentrated in the 
lightly stained region. (B, courtesy of E. Kellenberger.) 
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Procaryotic cells live in an enormous variety of ecological niches, and they 
are astonishingly varied in their biochemical capabilities — far more so than 
eucaryotic cells. Organotrophic species can utilize virtually any type of organic 
molecule as food, from sugars and amino acids to hydrocarbons and methane 
gas. Phototrophic species (Figure 1-19) harvest light energy in a variety of ways, 
some of them generating oxygen as a byproduct, others not. Lithotrophic species 
can feed on a plain diet of inorganic nutrients, getting their carbon from CO2, and 
relying on H 2 S to fuel their energy needs (Figure 1-20) — or on H 2 , or Fe 2+ , or ele- 
mental sulfur, or any of a host of other chemicals that occur in the environment. 

Many parts of this world of microscopic organisms are virtually unexplored. 
Traditional methods of bacteriology have given us an acquaintance with those 
species that can be isolated and cultured in the laboratory. But DNA sequence 
analysis of the populations of bacteria in samples from natural habitats — such 
as soil or ocean water, or even the human mouth — has opened our eyes to the 
fact that most species cannot be cultured by standard laboratory techniques. 
According to one estimate, at least 99% of procaryotic species remain to be 
characterized. 



Figure 1 -1 9 The phototrophic 
bacterium Anabaena cylindrica viewed 
in the light microscope. The cells of this 
species form long, multicellular filaments. 
Most of the cells (labeled V) perform 
photosynthesis, while others become 
specialized for nitrogen fixation (labeled 
H), or develop into resistant spores 
(labeled S). (Courtesy of Dave G. Adams.) 



The Tree of Life Has Three Primary Branches: Bacteria, Archaea, 
and Eucaryotes 

The classification of living things has traditionally depended on comparisons of 
their outward appearances: we can see that a fish has eyes, jaws, backbone, 
brain, and so on, just as we do, and that a worm does not; that a rosebush is 
cousin to an apple tree, but less similar to a grass. As Darwin showed, we can 
readily interpret such close family resemblances in terms of evolution from 
common ancestors, and we can find the remains of many of these ancestors pre- 
served in the fossil record. In this way, it has been possible to begin to draw a 
family tree of living organisms, showing the various lines of descent, as well as 
branch points in the history, where the ancestors of one group of species 
became different from those of another. 

When the disparities between organisms become very great, however, these 
methods begin to fail. How do we decide whether a fungus is closer kin to a plant 
or to an animal? When it comes to procaryotes, the task becomes harder still: 
one microscopic rod or sphere looks much like another. Microbiologists have 
therefore sought to classify procaryotes in terms of their biochemistry and nutri- 
tional requirements. But this approach also has its pitfalls. Amid the bewildering 
variety of biochemical behaviors, it is difficult to know which differences truly 
reflect differences of evolutionary history. 

Genome analysis has given us a simpler, more direct, and more powerful 
way to determine evolutionary relationships. The complete DNA sequence of an 
organism defines its nature with almost perfect precision and in exhaustive 
detail. Moreover, this specification is in a digital form — a string of letters — that 
can be entered straightforwardly into a computer and compared with the corre- 
sponding information for any other living thing. Because DNA is subject to ran- 
dom changes that accumulate over long periods of time (as we shall see shortly), 
the number of differences between the DNA sequences of two organisms can 
provide a direct, objective, quantitative indication of the evolutionary distance 
between them. 

This approach has shown that the organisms that were traditionally classed 
together as "bacteria" can be as widely divergent in their evolutionary origins as 
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Figure 1-20 A lithotrophic bacterium. 

Beggiatoa, which lives in sulfurous 
environments, gets its energy by 
oxidizing H 2 S and can fix carbon even in 
the dark. Note the yellow deposits of 
sulfur inside the cells. (Courtesy of Ralph 
W. Wolfe.) 
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Figure 1-21 The three major divisions (domains) of the living world. Note that traditionally the word 
bacteria has been used to refer to procaryotes in general, but more recently has been redefined to refer to 
eubacteria specifically. The tree shown here is based on comparisons of the nucleotide sequence of a 
ribosomal RNA subunit in the different species, and the distances in the diagram represent estimates of the 
numbers of evolutionary changes that have occurred in this molecule in each lineage (see Figure 1-22). 
The parts of the tree shrouded in gray cloud represent uncertainties about details of the true pattern of 
species divergence in the course of evolution: comparisons of nucleotide or amino acid sequences of 
molecules other than rRNA, as well as other arguments, lead to somewhat different trees. There is general 
agreement, however, as to the early divergence of the three most basic domains — the bacteria, the 
archaea, and the eucaryotes. 



is any procaryote from any eucaryote. It now appears that the procaryotes com- 
prise two distinct groups that diverged early in the history of life on Earth, either 
before the ancestors of the eucaryotes diverged as a separate group or at about 
the same time. The two groups of procaryotes are called the bacteria (or eubac- 
teria) and the archaea (or archaebacteria). The living world therefore has three 
major divisions or domains: bacteria, archaea, and eucaryotes (Figure 1-21). 

Archaea are often found inhabiting environments that we humans avoid, 
such as bogs, sewage treatment plants, ocean depths, salt brines, and hot acid 
springs, although they are also widespread in less extreme and more homely 
environments, from soils and lakes to the stomachs of cattle. In outward appear- 
ance they are not easily distinguished from bacteria. At a molecular level, 
archaea seem to resemble eucaryotes more closely in their machinery for han- 
dling genetic information (replication, transcription, and translation), but bac- 
teria more closely in their apparatus for metabolism and energy conversion. We 
discuss below how this might be explained. 



Some Genes Evolve Rapidly; Others Are Highly Conserved 

Both in the storage and in the copying of genetic information, random accidents 
and errors occur, altering the nucleotide sequence — that is, creating mutations. 
Therefore, when a cell divides, its two daughters are often not quite identical to 
one another or to their parent. On rare occasions, the error may represent a 
change for the better; more probably, it will cause no significant difference in the 
cell's prospects; and in many cases, the error will cause serious damage — for 
example, by disrupting the coding sequence for a key protein. Changes due to 
mistakes of the first type will tend to be perpetuated, because the altered cell has 
an increased likelihood of reproducing itself. Changes due to mistakes of the 
second type — selectively neutral changes — may be perpetuated or not: in the 
competition for limited resources, it is a matter of chance whether the altered 
cell or its cousins will succeed. But changes that cause serious damage lead 
nowhere: the cell that suffers them dies, leaving no progeny. Through endless 
repetition of this cycle of error and trial — of mutation and natural selection — 
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organisms evolve: their genetic specifications change, giving them new ways to 
exploit the environment more effectively, to survive in competition with others, 
and to reproduce successfully 

Clearly, some parts of the genome change more easily than others in the 
course of evolution. A segment of DNA that does not code for protein and has no 
significant regulatory role is free to change at a rate limited only by the frequency 
of random errors. In contrast, a gene that codes for a highly optimized essential 
protein or RNA molecule cannot alter so easily: when mistakes occur, the faulty 
cells are almost always eliminated. Genes of this latter sort are therefore highly 
conserved. Through 3.5 billion years or more of evolutionary history, many fea- 
tures of the genome have changed beyond all recognition; but the most highly 
conserved genes remain perfectly recognizable in all living species. 

These latter genes are the ones we must examine if we wish to trace family 
relationships between the most distantly related organisms in the tree of life. 
The studies that led to the classification of the living world into the three 
domains of bacteria, archaea, and eucaryotes were based chiefly on analysis of 
one of the two main RNA components of the ribosome — the so-called small - 
subunit ribosomal RNA. Because translation is fundamental to all living cells, 
this component of the ribosome has been well conserved since early in the his- 
tory of life on Earth (Figure 1-22). 

Most Bacteria and Archaea Have 1000-6000 Genes 

Natural selection has generally favored those procaryotic cells that can reproduce 
the fastest by taking up raw materials from their environment and replicating 
themselves most efficiently, at the maximal rate permitted by the available food 
supplies. Small size implies a large ratio of surface area to volume, thereby help- 
ing to maximize the uptake of nutrients across the plasma membrane and 
boosting a cell's reproductive rate. 

Presumably for these reasons, most procaryotic cells carry very little super- 
fluous baggage; their genomes are small, with genes packed closely together and 
minimal quantities of regulatory DNA between them. The small genome size 
makes it relatively easy to determine the complete DNA sequence. We now have 
this information for many species of bacteria and archaea, and a few species of 
eucaryotes. As shown in Table 1-1, most bacterial and archaeal genomes con- 
tain between 10 6 and 10 7 nucleotide pairs, encoding 1000-6000 genes. 

A complete DNA sequence reveals both the genes an organism possesses 
and the genes it lacks. When we compare the three domains of the living world, 
we can begin to see which genes are common to all of them and must therefore 
have been present in the cell that was ancestral to all present-day living things, 
and which genes are peculiar to a single branch in the tree of life. To explain the 
findings, however, we need to consider a little more closely how new genes arise 
and genomes evolve. 
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Figure 1 -22 Genetic information conserved since the days of the last common ancestor of all living 
things. A part of the gene for the smaller of the two main RNA components of the ribosome is shown. (The 
complete molecule is about 1500-1900 nucleotides long, depending on species.) Corresponding segments 
of nucleotide sequence from an archaean (Methanococcusjannaschii), a bacterium (Escherichia coli) and a 
eucaryote (Homo sapiens) are aligned. Sites where the nucleotides are identical between species are 
indicated by a vertical line; the human sequence is repeated at the bottom of the alignment so that all 
three two-way comparisons can be seen. A dot halfway along the E. coli sequence denotes a site where a 
nucleotide has been either deleted from the bacterial lineage in the course of evolution, or inserted in the 
other two lineages. Note that the sequences from these three organisms, representative of the three 
domains of the living world, all differ from one another to a roughly similar degree, while still retaining 
unmistakable similarities. 
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Table 1-1 Some Genomes That Have Been Completely Sequenced 



SPECIES 


SPECIAL FEATURES 


HABITAT 


GENOME SIZE 
(1000s OF 
NUCLEOTIDE 


ESTIMATED 
NUMBER 
OF GENES 








PAIRS PER 


CODING FOR 








HAPLOID GENOME) 


PROTEINS 


BACTERIA 


Mycoplasma genitalium 


has one of the smallest of all 
known cell genomes 


human genital tract 


580 


468 


Synechocystis sp. 


photosynthetic, oxygen-generating 
(cyanobacterium) 


lakes and streams 


3573 


3168 


Escherichia coli 


laboratory favorite 


human gut 


4639 


4289 


Helicobacter pylori 


causes stomach ulcers and 
predisposes to stomach cancer 


human stomach 


1667 


1590 


Bacillus anthracis 


causes anthrax 


soil 


5227 


5634 


Aquifex aeolicus 


lithotrophic; lives at high 
temperatures 


hydrothermal vents 


1551 


1544 


Streptomyces coelicolor 


source of antibiotics; giant genome 


soil 


8667 


7825 


Treponema pallidum 


spirochete; causes syphilis 


human tissues 


1138 


1041 


Rickettsia prowazekii 


bacterium most closely related to 
mitochondria; causes typhus 


lice and humans 
(intracellular parasite) 


1111 


834 


Thermotoga maritima 


organotrophic; lives at very high 
temperatures 


hydrothermal vents 


1860 


1877 


ARCHAEA 


Methanococcus jannaschii 


lithotrophic, anaerobic, 
methane-producing 


hydrothermal vents 


1664 


1750 


Archaeoglobus fulgidus 


lithotrophic or organotrophic, 
anaerobic, sulfate-reducing 


hydrothermal vents 


2178 


2493 


Nanoarchaeum equitans 


smallest known archaean; anaerobic; 
parasitic on another, larger 
archaean 


hydrothermal and 
volcanic hot vents 


491 


552 


EUCARYOTES 










Saccharomyces cerevisiae 


minimal model eucaryote 


grape skins, beer 


12,069 


-6300 


(budding yeast) 










Arabidopsis thaliana 


model organism for flowering 


soil and air 


-142,000 


-26,000 


(Thale cress) 


plants 








Caenorhabditis elegans 


simple animal with perfectly 


soil 


-97,000 


-20,000 


(nematode worm) 


predictable development 








Drosophila melanogaster 


key to the genetics of animal 


rotting fruit 


-137,000 


-14,000 


(fruit fly) 


development 








Homo sapiens (human) 


most intensively studied mammal 


houses 


-3,200,000 


-24,000 



Genome size and gene number vary between strains of a single species, especially for bacteria and archaea. The table shows data for particular 
strains that have been sequenced. For eucaryotes, many genes can give rise to several alternative variant proteins, so that the total number of 
proteins specified by the genome is substantially greater than the number of genes. 



New Genes Are Generated from Preexisting Genes 

The raw material of evolution is the DNA sequence that already exists: there is 
no natural mechanism for making long stretches of new random sequence. In 
this sense, no gene is ever entirely new Innovation can, however, occur in sev- 
eral ways (Figure 1-23): 

1. Intragenic mutation: an existing gene can be modified by changes in its 
DNA sequence, through various types of error that occur mainly in the pro- 
cess of DNA replication. 

2. Gene duplication: an existing gene can be duplicated so as to create a pair 
of initially identical genes within a single cell; these two genes may then 
diverge in the course of evolution. 
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Figure 1 -23 Four modes of genetic 
innovation and their effects on the DNA 
sequence of an organism. A special form 
of horizontal transfer occurs when two 
different types of cells enter into a 
permanent symbiotic association. Genes 
from one of the cells then may be 
transferred to the genome of the other, 
as we shall see below when we discuss 
mitochondria and chloroplasts. 
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3. Segment shuffling: two or more existing genes can be broken and rejoined 
to make a hybrid gene consisting of DNA segments that originally 
belonged to separate genes. 

4. Horizontal (intercellular) transfer: a piece of DNA can be transferred from 
the genome of one cell to that of another — even to that of another species. 
This process is in contrast with the usual vertical transfer of genetic infor- 
mation from parent to progeny. 

Each of these types of change leaves a characteristic trace in the DNA 
sequence of the organism, providing clear evidence that all four processes have 
occurred. In later chapters we discuss the underlying mechanisms, but for the 
present we focus on the consequences. 



Gene Duplications Give Rise to Families of Related Genes Within a 
Single Cell 

A cell duplicates its entire genome each time it divides into two daughter cells. 
However, accidents occasionally result in the inappropriate duplication of just 
part of the genome, with retention of original and duplicate segments in a single 
cell. Once a gene has been duplicated in this way, one of the two gene copies is 
free to mutate and become specialized to perform a different function within the 
same cell. Repeated rounds of this process of duplication and divergence, over 
many millions of years, have enabled one gene to give rise to a family of genes 
that may all be found within a single genome. Analysis of the DNA sequence of 
procaryotic genomes reveals many examples of such gene families: in Bacillus 
subtilis, for example, 47% of the genes have one or more obvious relatives (Fig- 
ure 1-24). 

When genes duplicate and diverge in this way, the individuals of one species 
become endowed with multiple variants of a primordial gene. This evolutionary 
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Figure 1-24 Families of evolutionarily 
related genes in the genome of Bacillus 
subtilis. The biggest family consists of 
77 genes coding for varieties of ABC 
transporters — a class of membrane 
transport proteins found in all three 
domains of the living world. (Adapted 
from F. Kunst et al., Nature 390:249-256, 
1997. With permission from Macmillan 
Publishers Ltd.) 
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process has to be distinguished from the genetic divergence that occurs when 
one species of organism splits into two separate lines of descent at a branch 
point in the family tree — when the human line of descent became separate from 
that of chimpanzees, for example. There, the genes gradually become different 
in the course of evolution, but they are likely to continue to have corresponding 
functions in the two sister species. Genes that are related by descent in this 
way — that is, genes in two separate species that derive from the same ancestral 
gene in the last common ancestor of those two species — are called orthologs. 
Related genes that have resulted from a gene duplication event within a single 
genome — and are likely to have diverged in their function — are called paralogs. 
Genes that are related by descent in either way are called homologs, a general 
term used to cover both types of relationship (Figure 1-25). 

The family relationships between genes can become quite complex (Figure 
1-26). For example, an organism that possesses a family of paralogous genes (for 
example, the seven hemoglobin genes a, (3, y, 5, £, £, and 6) may evolve into two 
separate species (such as humans and chimpanzees) each possessing the entire 
set of paralogs. All 14 genes are homologs, with the human hemoglobin a orthol- 
ogous to the chimpanzee hemoglobin a, but paralogous to the human or chim- 
panzee hemoglobin (3, and so on. Moreover, the vertebrate hemoglobins (the 
oxygen-binding proteins of blood) are homologous to the vertebrate myo- 
globins (the oxygen-binding proteins of muscle), as well as to more distant 
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Figure 1 -25 Paralogous genes and orthologous genes: two types of gene 
homology based on different evolutionary pathways. (A) and (B) The most 
basic possibilities. (C) A more complex pattern of events that can occur. 
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Figure 1 -26 A complex family of 
homologous genes. This diagram shows 
the pedigree of the hemoglobin (Hb), 
myoglobin, and globin genes of human, 
chick, shark, and Drosophila. The lengths 
of the horizontal lines represent the 
amount of divergence in amino acid 
sequence. 
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genes that code for oxygen-binding proteins in invertebrates, plants, fungi, and 
bacteria. From the DNA sequences, it is usually easy to recognize that two genes 
in different species are homologous; it is much more difficult to decide, without 
other information, whether they stand in the precise evolutionary relationship 
of orthologs. 



Genes Can Be Transferred Between Organisms, Both in the 
Laboratory and in Nature 

Procaryotes also provide examples of the horizontal transfer of genes from one 
species of cell to another. The most obvious tell-tale signs are sequences recog- 
nizable as being derived from bacterial viruses, also called bacteriophages (Figure 
1-27). Viruses are not themselves living cells but can act as vectors for gene 
transfer: they are small packets of genetic material that have evolved as parasites 
on the reproductive and biosynthetic machinery of host cells. They replicate in 
one cell, emerge from it with a protective wrapping, and then enter and infect 
another cell, which may be of the same or a different species. Often, the infected 
cell will be killed by the massive proliferation of virus particles inside it; but 
sometimes, the viral DNA, instead of directly generating these particles, may per- 
sist in its host for many cell generations as a relatively innocuous passenger, 
either as a separate intracellular fragment of DNA, known as a plasmid, or as a 
sequence inserted into the cell's regular genome. In their travels, viruses can acci- 
dentally pick up fragments of DNA from the genome of one host cell and ferry 
them into another cell. Such transfers of genetic material frequently occur in pro- 
caryotes, and they can also occur between eucaryotic cells of the same species. 

Horizontal transfers of genes between eucaryotic cells of different species 
are very rare, and they do not seem to have played a significant part in eucary- 
ote evolution (although massive transfers from bacterial to eucaryotic genomes 
have occurred in the evolution of mitochondria and chloroplasts, as we discuss 
below). In contrast, horizontal gene transfers occur much more frequently 
between different species of procaryotes. Many procaryotes have a remarkable 
capacity to take up even nonviral DNA molecules from their surroundings and 
thereby capture the genetic information these molecules carry. By this route, or 
by virus-mediated transfer, bacteria and archaea in the wild can acquire genes 
from neighboring cells relatively easily. Genes that confer resistance to an 
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antibiotic or an ability to produce a toxin, for example, can be transferred from 
species to species and provide the recipient bacterium with a selective advan- 
tage. In this way, new and sometimes dangerous strains of bacteria have been 
observed to evolve in the bacterial ecosystems that inhabit hospitals or the var- 
ious niches in the human body For example, horizontal gene transfer is respon- 
sible for the spread, over the past 40 years, of penicillin-resistant strains of Neis- 
seria gonorrheae, the bacterium that causes gonorrhea. On a longer time scale, 
the results can be even more profound; it has been estimated that at least 18% 
of all of the genes in the present-day genome of E. coli have been acquired by 
horizontal transfer from another species within the past 100 million years. 



Sex Results in Horizontal Exchanges of Genetic Information 
Within a Species 

Horizontal exchanges of genetic information are important in bacterial and 
archaeal evolution in today's world, and they may have occurred even more fre- 
quently and promiscuously in the early days of life on Earth. Such early hori- 
zontal exchanges could explain the otherwise puzzling observation that the 
eucaryotes seem more similar to archaea in their genes for the basic informa- 
tion-handling processes of DNA replication, transcription, and translation, but 
more similar to bacteria in their genes for metabolic processes. In any case, 
whether horizontal gene transfer occurred most freely in the early days of life on 
Earth, or has continued at a steady low rate throughout evolutionary history, it 
has the effect of complicating the whole concept of cell ancestry, by making each 
cell's genome a composite of parts derived from separate sources. 

Horizontal gene transfer among procaryotes may seem a surprising process, 
but it has a parallel in a phenomenon familiar to us all: sex. In addition to the 
usual vertical transfer of genetic material from parent to offspring, sexual repro- 
duction causes a large-scale horizontal transfer of genetic information between 
two initially separate cell lineages — those of the father and the mother. A key 
feature of sex, of course, is that the genetic exchange normally occurs only 
between individuals of the same species. But no matter whether they occur 
within a species or between species, horizontal gene transfers leave a character- 
istic imprint: they result in individuals who are related more closely to one set of 
relatives with respect to some genes, and more closely to another set of relatives 
with respect to others. By comparing the DNA sequences of individual human 
genomes, an intelligent visitor from outer space could deduce that humans 
reproduce sexually, even if it knew nothing about human behavior. 

Sexual reproduction is widespread (although not universal), especially 
among eucaryotes. Even bacteria indulge from time to time in controlled sex- 
ual exchanges of DNA with other members of their own species. Natural selec- 
tion has clearly favored organisms that can reproduce sexually, although evo- 
lutionary theorists dispute precisely what the selective advantage of sex is. 



The Function of a Gene Can Often Be Deduced from Its Sequence 

Family relationships among genes are important not just for their historical 
interest, but because they simplify the task of deciphering gene functions. Once 
the sequence of a newly discovered gene has been determined, a scientist can 
tap a few keys on a computer to search the entire database of known gene 
sequences for genes related to it. In many cases, the function of one or more of 
these homologs will have been already determined experimentally, and thus, 
since gene sequence determines gene function, one can frequently make a good 
guess at the function of the new gene: it is likely to be similar to that of the 
already-known homologs. 

In this way, it is possible to decipher a great deal of the biology of an organ- 
ism simply by analyzing the DNA sequence of its genome and using the infor- 
mation we already have about the functions of genes in other organisms that 
have been more intensively studied. 
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Figure 1 -27 The viral transfer of DNA 
from one cell to another. (A) An electron 
micrograph of particles of a bacterial 
virus, the T4 bacteriophage. The head of 
this virus contains the viral DNA; the tail 
contains the apparatus for injecting the 
DNA into a host bacterium. (B) A cross 
section of a bacterium with aT4 
bacteriophage latched onto its surface. 
The large dark objects inside the 
bacterium are the heads of new T4 
particles in course of assembly. When 
they are mature, the bacterium will burst 
open to release them. (A, courtesy of 
James Paulson; B, courtesy of Jonathan 
King and Erika Hartwig from G. Karp, Cell 
and Molecular Biology, 2nd ed. New York: 
John Wiley & Sons, 1 999. With permission 
from John Wiley & Sons.) 
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More Than 200 Gene Families Are Common to All Three Primary 
Branches of the Tree of Life 

Given the complete genome sequences of representative organisms from all three 
domains — archaea, bacteria, and eucaryotes — we can search systematically for 
homologies that span this enormous evolutionary divide. In this way we can begin 
to take stock of the common inheritance of all living things. There are considerable 
difficulties in this enterprise. For example, individual species have often lost some 
of the ancestral genes; other genes have almost certainly been acquired by hori- 
zontal transfer from another species and therefore are not truly ancestral, even 
though shared. In fact, genome comparisons strongly suggest that both lineage- 
specific gene loss and horizontal gene transfer, in some cases between evolution- 
arily distant species, have been major factors of evolution, at least among pro- 
caryotes. Finally, in the course of 2 or 3 billion years, some genes that were initially 
shared will have changed beyond recognition by current methods. 

Because of all these vagaries of the evolutionary process, it seems that only 
a small proportion of ancestral gene families have been universally retained in a 
recognizable form. Thus, out of 4873 protein- coding gene families defined by 
comparing the genomes of 50 species of bacteria, 13 archaea, and 3 unicellular 
eucaryotes, only 63 are truly ubiquitous (that is, represented in all the genomes 
analyzed). The great majority of these universal families include components of 
the translation and transcription systems. This is not likely to be a realistic 
approximation of an ancestral gene set. A better — though still crude — idea of the 
latter can be obtained by tallying the gene families that have representatives in 
multiple, but not necessarily all, species from all three major domains. Such an 
analysis reveals 264 ancient conserved families. Each family can be assigned a 
function (at least in terms of general biochemical activity, but usually with more 
precision), with the largest number of shared gene families being involved in 
translation and in amino acid metabolism and transport (Table 1-2). This set of 
highly conserved gene families represents only a very rough sketch of the com- 
mon inheritance of all modern life; a more precise reconstruction of the gene 
complement of the last universal common ancestor might be feasible with fur- 
ther genome sequencing and more careful comparative analysis. 

Mutations Reveal the Functions of Genes 

Without additional information, no amount of gazing at genome sequences will 
reveal the functions of genes. We may recognize that gene B is like gene A, but 
how do we discover the function of gene A in the first place? And even if we know 
the function of gene A, how do we test whether the function of gene B is truly the 
same as the sequence similarity suggests? How do we connect the world of 
abstract genetic information with the world of real living organisms? 

The analysis of gene functions depends on two complementary approaches: 
genetics and biochemistry. Genetics starts with the study of mutants: we either 
find or make an organism in which a gene is altered, and examine the effects on 
the organism's structure and performance (Figure 1-28). Biochemistry exam- 
ines the functions of molecules: we extract molecules from an organism and 
then study their chemical activities. By combining genetics and biochemistry 
and examining the chemical abnormalities in a mutant organism, it is possible 
to find those molecules whose production depends on a given gene. At the same 
time, studies of the performance of the mutant organism show us what role 
those molecules have in the operation of the organism as a whole. Thus, genet- 
ics and biochemistry together provide a way to relate genes, molecules, and the 
structure and function of the organism. 

In recent years, DNA sequence information and the powerful tools of molec- 
ular biology have allowed rapid progress. From sequence comparisons, we can 
often identify particular subregions within a gene that have been preserved 
nearly unchanged over the course of evolution. These conserved subregions are 
likely to be the most important parts of the gene in terms of function. We can test 
their individual contributions to the activity of the gene product by creating in 
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Table 1 -2 The Numbers of Gene Families, Classified by Function, That Are Common 
to All Three Domains of the Living World 



GENE FAMILY FUNCTION 




NUMBER OF 




"UNIVERSAL" FAMILIES 


Information processing 






Translation 




63 


Transcription 




7 


Replication, recombination, and repair 




13 


Cellular processes and signaling 






Cell cycle control, mitosis, and meiosis 




2 


Defense mechanisms 




3 


Signal transduction mechanisms 




1 


Cell wall/membrane biogenesis 




2 


Intracellular trafficking and secretion 




4 


Post-translational modification, protein turnover 


, chaperones 


8 


Metabolism 






Energy production and conversion 




19 


Carbohydrate transport and metabolism 




16 


Amino acid transport and metabolism 




43 


Nucleotide transport and metabolism 




15 


Coenzyme transport and metabolism 




22 


Lipid transport and metabolism 




9 


Inorganic ion transport and metabolism 




8 


Secondary metabolite biosynthesis, transport, and catabolism 


5 


Poorly characterized 






General biochemical function predicted; specific 


biological role 


24 


unknown 







For the purpose of this analysis, gene families are defined as "universal" if they are represented in 
the genomes of at least two diverse archaea (Archaeoglobus fulgidus and Aeropyrum pernix), two 
evolutionary distant bacteria (Escherichia coli and Bacillus subtilis) and one eucaryote (yeast, 
Saccharomyces cerevisiae). (Data from R.L Tatusov, E.V. Koonin and DJ. Lipman, Science 278:63 1-637, 
1997, with permission from AAAS; R.L. Tatusov et al., BMC Bioinformatics 4:41 , 2003, with permission 
from BioMed Central; and the COGs database at the US National Library of Medicine.) 



the laboratory mutations of specific sites within the gene, or by constructing 
artificial hybrid genes that combine part of one gene with part of another. 
Organisms can be engineered to make either the RNA or the protein specified by 
the gene in large quantities to facilitate biochemical analysis. Specialists in 
molecular structure can determine the three-dimensional conformation of the 
gene product, revealing the exact position of every atom in it. Biochemists can 
determine how each of the parts of the genetically specified molecule con- 
tributes to its chemical behavior. Cell biologists can analyze the behavior of cells 
that are engineered to express a mutant version of the gene. 

There is, however, no one simple recipe for discovering a gene's function, 
and no simple standard universal format for describing it. We may discover, for 
example, that the product of a given gene catalyzes a certain chemical reaction, 
and yet have no idea how or why that reaction is important to the organism. The 
functional characterization of each new family of gene products, unlike the 
description of the gene sequences, presents a fresh challenge to the biologist's 
ingenuity. Moreover, we never fully understand the function of a gene until we 
learn its role in the life of the organism as a whole. To make ultimate sense of 
gene functions, therefore, we have to study whole organisms, not just molecules 
or cells. 



Molecular Biologists Have Focused a Spotlight on E. coli 

Because living organisms are so complex, the more we learn about any partic- 
ular species, the more attractive it becomes as an object for further study. Each 
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Figure 1-28 A mutant phenotype 
reflecting the function of a gene. 

A normal yeast (of the species 
Schizosaccharomyces pombe) is compared 
with a mutant in which a change in a 
single gene has converted the cell from a 
cigar shape (left) to a T shape (right). The 
mutant gene therefore has a function in 
the control of cell shape. But how, in 
molecular terms, does the gene product 
perform that function? That is a harder 
question, and needs biochemical analysis 
to answer it. (Courtesy of Kenneth Sawin 
and Paul Nurse.) 
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Figure 1 -29 The genome of E. coli. 

(A) A cluster of E. coli cells. (B) A diagram 
of the genome of E. coli strain K-1 2. The 
diagram is circular because the DNA of 
E. coli, like that of other procaryotes, 
forms a single, closed loop. Protein- 
coding genes are shown as yellow or 
orange bars, depending on the DNA 
strand from which they are transcribed; 
genes encoding only RNA molecules are 
indicated by green arrows. Some genes 
are transcribed from one strand of the 
DNA double helix (in a clockwise 
direction in this diagram), others from the 
other strand (counterclockwise). 
(A, courtesy of Dr. Tony Brain and David 
Parker/Photo Researchers; B, adapted 
from F.R. Blattner et al., Science 
277:1453-1462, 1997. With permission 
from A A AS.) 



discovery raises new questions and provides new tools with which to tackle 
general questions in the context of the chosen organism. For this reason, large 
communities of biologists have become dedicated to studying different aspects 
of the same model organism. 

In the enormously varied world of bacteria, the spotlight of molecular 
biology has for a long time focused intensely on just one species: Escherichia 
coli, or E. coli (see Figures 1-17 and 1-18). This small, rod-shaped bacterial cell 
normally lives in the gut of humans and other vertebrates, but it can be grown 
easily in a simple nutrient broth in a culture bottle. It adapts to variable chem- 
ical conditions and reproduces rapidly, and it can evolve by mutation and 
selection at a remarkable speed. As with other bacteria, different strains of E. 
coli, though classified as members of a single species, differ genetically to a 
much greater degree than do different varieties of a sexually reproducing 
organism such as a plant or animal. One E. coli strain may possess many hun- 
dreds of genes that are absent from another, and the two strains could have as 
little as 50% of their genes in common. The standard laboratory strain E. coli 
K-12 has a genome of approximately 4.6 million nucleotide pairs, contained in 
a single circular molecule of DNA, coding for about 4300 different kinds of pro- 
teins (Figure 1-29). 

In molecular terms, we know more about E. coli than about any other living 
organism. Most of our understanding of the fundamental mechanisms of life — 
for example, how cells replicate their DNA, or how they decode the instructions 
represented in the DNA to direct the synthesis of specific proteins — has come 
from studies of E. coli. The basic genetic mechanisms have turned out to be 
highly conserved throughout evolution: these mechanisms are therefore essen- 
tially the same in our own cells as in E. coli. 
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Summary 

Procaryotes (cells without a distinct nucleus) are biochemically the most diverse 
organisms and include species that can obtain all their energy and nutrients from 
inorganic chemical sources, such as the reactive mixtures of minerals released at 
hydrothermal vents on the ocean floor — the sort of diet that may have nourished the 
first living cells 3.5 billion years ago. DNA sequence comparisons reveal the family 
relationships of living organisms and show that the procaryotes fall into two groups 
that diverged early in the course of evolution: the bacteria (or eubacteria) and the 
archaea. Together with the eucaryotes (cells with a membrane-enclosed nucleus), these 
constitute the three primary branches of the tree of life. Most bacteria and archaea are 
small unicellular organisms with compact genomes comprising 1000-6000 genes. 
Many of the genes within a single organism show strong family resemblances in their 
DNA sequences, implying that they originated from the same ancestral gene through 
gene duplication and divergence. Family resemblances (homologies) are also clear 
when gene sequences are compared between different species, and more than 200 gene 
families have been so highly conserved that they can be recognized as common to most 
species from all three domains of the living world. Thus, given the DNA sequence of a 
newly discovered gene, it is often possible to deduce the gene's function from the known 
function of a homologous gene in an intensively studied model organism, such as the 
bacterium E. coli. 



GENETIC INFORMATION IN EUCARYOTES 

Eucaryotic cells, in general, are bigger and more elaborate than procaryotic cells, 
and their genomes are bigger and more elaborate, too. The greater size is accom- 
panied by radical differences in cell structure and function. Moreover, many 
classes of eucaryotic cells form multicellular organisms that attain levels of com- 
plexity unmatched by any procaryote. 

Because they are so complex, eucaryotes confront molecular biologists with 
a special set of challenges, which will concern us in the rest of this book. Increas- 
ingly, biologists meet these challenges through the analysis and manipulation of 
the genetic information within cells and organisms. It is therefore important at 
the outset to know something of the special features of the eucaryotic genome. 
We begin by briefly discussing how eucaryotic cells are organized, how this 
reflects their way of life, and how their genomes differ from those of procaryotes. 
This leads us to an outline of the strategy by which molecular biologists, by 
exploiting genetic information, are attempting to discover how eucaryotic 
organisms work. 

Eucaryotic Cells May Have Originated as Predators 

By definition, eucaryotic cells keep their DNA in an internal compartment 
called the nucleus. The nuclear envelope, a double layer of membrane, sur- 
rounds the nucleus and separates the DNA from the cytoplasm. Eucaryotes also 
have other features that set them apart from procaryotes (Figure 1-30). Their 
cells are, typically, 10 times bigger in linear dimension, and 1000 times larger in 
volume. They have a cytoskeleton — a system of protein filaments crisscrossing 
the cytoplasm and forming, together with the many proteins that attach to 
them, a system of girders, ropes, and motors that gives the cell mechanical 
strength, controls its shape, and drives and guides its movements. <GTTA> 
<ATGG> <TCGC> The nuclear envelope is only one part of a set of internal 
membranes, each structurally similar to the plasma membrane and enclosing 
different types of spaces inside the cell, many of them involved in digestion and 
secretion. Lacking the tough cell wall of most bacteria, animal cells and the 
free-living eucaryotic cells called protozoa can change their shape rapidly and 
engulf other cells and small objects by phagocytosis (Figure 1-31). 

It is still a mystery how all these properties evolved, and in what sequence. 
One plausible view, however, is that they are all reflections of the way of life of a 
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primordial eucaryotic cell that was a predator, living by capturing other cells and 
eating them (Figure 1-32). Such a way of life requires a large cell with a flexible 
plasma membrane, as well as an elaborate cytoskeleton to support and move 
this membrane. It may also require that the cell's long, fragile DNA molecules be 
sequestered in a separate nuclear compartment, to protect the genome from 
damage by the movements of the cytoskeleton. 



Modern Eucaryotic Cells Evolved from a Symbiosis 

A predatory way of life helps to explain another feature of eucaryotic cells. 
Almost all such cells contain mitochondria (Figure 1-33). These small bodies in 
the cytoplasm, enclosed by a double layer of membrane, take up oxygen and 
harness energy from the oxidation of food molecules — such as sugars — to pro- 
duce most of the ATP that powers the cell's activities. Mitochondria are similar 
in size to small bacteria, and, like bacteria, they have their own genome in the 
form of a circular DNA molecule, their own ribosomes that differ from those 
elsewhere in the eucaryotic cell, and their own transfer RNAs. It is now generally 
accepted that mitochondria originated from free-living oxygen-metabolizing 
(aerobic) bacteria that were engulfed by an ancestral eucaryotic cell that could 
otherwise make no such use of oxygen (that is, was anaerobic). Escaping diges- 
tion, these bacteria evolved in symbiosis with the engulfing cell and its progeny, 



Figure 1-30 The major features of 
eucaryotic cells. The drawing depicts a 
typical animal cell, but almost all the 
same components are found in plants 
and fungi and in single-celled eucaryotes 
such as yeasts and protozoa. Plant cells 
contain chloroplasts in addition to the 
components shown here, and their 
plasma membrane is surrounded by a 
tough external wall formed of cellulose. 
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Figure 1-31 Phagocytosis. This series of 
stills from a movie shows a human white 
blood cell (a neutrophil) engulfing a red 
blood cell (artificially colored red) that 
has been treated with antibody. 
(Courtesy of Stephen E. Malawista and 
Anne de Boisfleury Chevance.) 
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Figure 1-32 A single-celled eucaryote 
that eats other cells. (A) Didinium is a 
carnivorous protozoan, belonging to the 
group known as ciliates. It has a globular 
body, about 150 Jim in diameter, 
encircled by two fringes of cilia — sinuous, 
whiplike appendages that beat 
continually; its front end is flattened 
except for a single protrusion, rather like 
a snout. (B) Didinium normally swims 
around in the water at high speed by 
means of the synchronous beating of its 
cilia. When it encounters a suitable prey, 
usually another type of protozoan, it 
releases numerous small paralyzing darts 
from its snout region. Then, the Didinium 
attaches to and devours the other cell by 
phagocytosis, inverting like a hollow ball 
to engulf its victim, which is almost as 
large as itself. (Courtesy of D. Barlow.) 



receiving shelter and nourishment in return for the power generation they per- 
formed for their hosts (Figure 1-34). This partnership between a primitive 
anaerobic eucaryotic predator cell and an aerobic bacterial cell is thought to 
have been established about 1.5 billion years ago, when the Earth's atmosphere 
first became rich in oxygen. 




100 nm 

Figure 1-33 A mitochondrion. (A) A cross section, as seen in the electron 
microscope. (B) A drawing of a mitochondrion with part of it cut away to 
show the three-dimensional structure. (C) A schematic eucaryotic cell, with 
the interior space of a mitochondrion, containing the mitochondrial DNA 
and ribosomes, colored. Note the smooth outer membrane and the 
convoluted inner membrane, which houses the proteins that generate ATP 
from the oxidation of food molecules. (A, courtesy of Daniel S. Friend.) 
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Figure 1-34 The origin of mitochondria. 

An ancestral eucaryotic cell is thought to 
have engulfed the bacterial ancestor of 
mitochondria, initiating a symbiotic 
relationship. 



Many eucaryotic cells — specifically, those of plants and algae — also contain 
another class of small membrane-enclosed organelles somewhat similar to mito- 
chondria — the chloroplasts (Figure 1-35). Chloroplasts perform photosynthesis, 
using the energy of sunlight to synthesize carbohydrates from atmospheric 
carbon dioxide and water, and deliver the products to the host cell as food. Like 
mitochondria, chloroplasts have their own genome and almost certainly origi- 
nated as symbiotic photosynthetic bacteria, acquired by cells that already pos- 
sessed mitochondria (Figure 1-36). 

A eucaryotic cell equipped with chloroplasts has no need to chase after 
other cells as prey; it is nourished by the captive chloroplasts it has inherited 
from its ancestors. Correspondingly, plant cells, although they possess the 
cytoskeletal equipment for movement, have lost the ability to change shape 
rapidly and to engulf other cells by phagocytosis. Instead, they create around 
themselves a tough, protective cell wall. If the ancestral eucaryote was indeed a 
predator on other organisms, we can view plant cells as eucaryotes that have 
made the transition from hunting to farming. 

Fungi represent yet another eucaryotic way of life. Fungal cells, like animal 
cells, possess mitochondria but not chloroplasts; but in contrast with animal 
cells and protozoa, they have a tough outer wall that limits their ability to move 
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Figure 1-35 Chloroplasts. These organelles 
capture the energy of sunlight in plant cells 
and some single-celled eucaryotes. 
(A) A single cell isolated from a leaf of a 
flowering plant, seen in the light 
microscope, showing the green 
chloroplasts. (B) A drawing of one of the 
chloroplasts, showing the highly folded 
system of internal membranes containing 
the chlorophyll molecules by which light is 
absorbed. (A, courtesy of Preeti Dahiya.) 
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rapidly or to swallow up other cells. Fungi, it seems, have turned from hunters 
into scavengers: other cells secrete nutrient molecules or release them upon 
death, and fungi feed on these leavings — performing whatever digestion is nec- 
essary extracellularly, by secreting digestive enzymes to the exterior. 



Eucaryotes Have Hybrid Genomes 

The genetic information of eucaryotic cells has a hybrid origin — from the ances- 
tral anaerobic eucaryote, and from the bacteria that it adopted as symbionts. 
Most of this information is stored in the nucleus, but a small amount remains 
inside the mitochondria and, for plant and algal cells, in the chloroplasts. The 
mitochondrial DNA and the chloroplast DNA can be separated from the nuclear 
DNA and individually analyzed and sequenced. The mitochondrial and chloro- 
plast genomes are found to be degenerate, cut-down versions of the corre- 
sponding bacterial genomes, lacking genes for many essential functions. In a 
human cell, for example, the mitochondrial genome consists of only 16,569 
nucleotide pairs, and codes for only 13 proteins, two ribosomal RNA compo- 
nents, and 22 transfer RNAs. 

The genes that are missing from the mitochondria and chloroplasts have not 
all been lost; instead, many of them have been somehow moved from the sym- 
biont genome into the DNA of the host cell nucleus. The nuclear DNA of humans 
contains many genes coding for proteins that serve essential functions inside 
the mitochondria; in plants, the nuclear DNA also contains many genes specify- 
ing proteins required in chloroplasts. 



Figure 1-36 The origin of chloroplasts. 

An early eucaryotic cell, already 
possessing mitochondria, engulfed a 
photosynthetic bacterium 
(a cyanobacterium) and retained it in 
symbiosis. All present-day chloroplasts 
are thought to trace their ancestry back 
to a single species of cyanobacterium 
that was adopted as an internal 
symbiont (an endosymbiont) over a 
billion years ago. 



Eucaryotic Genomes Are Big 

Natural selection has evidently favored mitochondria with small genomes, just 
as it has favored bacteria with small genomes. By contrast, the nuclear genomes 
of most eucaryotes seem to have been free to enlarge. Perhaps the eucaryotic 
way of life has made large size an advantage: predators typically need to be big- 
ger than their prey, and cell size generally increases in proportion to genome 
size. Perhaps enlargement of the genome has been driven by the accumulation 
of parasitic transposable elements (discussed in Chapter 5) — "selfish" segments 
of DNA that can insert copies of themselves at multiple sites in the genome. 
Whatever the explanation, the genomes of most eucaryotes are orders of magni- 
tude larger than those of bacteria and archaea (Figure 1-37). And the freedom to 
be extravagant with DNA has had profound implications. 

Eucaryotes not only have more genes than procaryotes; they also have vastly 
more DNA that does not code for protein or for any other functional product 
molecule. The human genome contains 1000 times as many nucleotide pairs as 
the genome of a typical bacterium, 20 times as many genes, and about 10,000 
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Figure 1-37 Genome sizes compared. 

Genome size is measured in nucleotide 
pairs of DNA per haploid genome, that is, 
per single copy of the genome. (The cells 
of sexually reproducing organisms such as 
ourselves are generally diploid: they 
contain two copies of the genome, one 
inherited from the mother, the other from 
the father.) Closely related organisms can 
vary widely in the quantity of DNA in their 
genomes, even though they contain 
similar numbers of functionally distinct 
genes. (Data from W.H. Li, Molecular 
Evolution, pp. 380-383. Sunderland, 
MA:Sinauer, 1997.) 
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times as much noncoding DNA (-98.5% of the genome for a human is noncod- 
ing, as opposed to 11% of the genome for the bacterium E. coif). 



Eucaryotic Genomes Are Rich in Regulatory DNA 

Much of our noncoding DNA is almost certainly dispensable junk, retained like 
a mass of old papers because, when there is little pressure to keep an archive 
small, it is easier to retain everything than to sort out the valuable information 
and discard the rest. Certain exceptional eucaryotic species, such as the puffer 
fish (Figure 1-38), bear witness to the profligacy of their relatives; they have 
somehow managed to rid themselves of large quantities of noncoding DNA. Yet 
they appear similar in structure, behavior, and fitness to related species that 
have vastly more such DNA. 

Even in compact eucaryotic genomes such as that of puffer fish, there is 
more noncoding DNA than coding DNA, and at least some of the noncoding 
DNA certainly has important functions. In particular, it regulates the expression 
of adjacent genes. With this regulatory DNA, eucaryotes have evolved distinctive 
ways of controlling when and where a gene is brought into play. This sophisti- 
cated gene regulation is crucial for the formation of complex multicellular 
organisms. 



The Genome Defines the Program of Multicellular Development 

The cells in an individual animal or plant are extraordinarily varied. Fat cells, 
skin cells, bone cells, nerve cells — they seem as dissimilar as any cells could be. 
Yet all these cell types are the descendants of a single fertilized egg cell, and all 
(with minor exceptions) contain identical copies of the genome of the species. 

The differences result from the way in which the cells make selective use of 
their genetic instructions according to the cues they get from their surroundings 
in the developing embryo. The DNA is not just a shopping list specifying the 
molecules that every cell must have, and the cell is not an assembly of all the 
items on the list. Rather, the cell behaves as a multipurpose machine, with sen- 
sors to receive environmental signals and with highly developed abilities to call 
different sets of genes into action according to the sequences of signals to which 
the cell has been exposed. The genome in each cell is big enough to accommo- 
date the information that specifies an entire multicellular organism, but in any 
individual cell only part of that information is used. 

A large fraction of the genes in the eucaryotic genome code for proteins that 
regulate the activities of other genes. Most of these gene regulatory proteins act by 




Figure 1-38 The puffer fish (Fugu 
rubripes). This organism has a genome 
size of 400 million nucleotide pairs — 
about one-quarter as much as a 
zebrafish, for example, even though the 
two species offish have similar numbers 
of genes. (From a woodcut by Hiroshige, 
courtesy of Arts and Designs of Japan.) 
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Figure 1-39 Controlling gene readout 
by environmental signals. Regulatory 
DNA allows gene expression to be 
controlled by regulatory proteins, which 
are in turn the products of other genes. 
This diagram shows how a cell's gene 
expression is adjusted according to a 
signal from the cell's environment. The 
initial effect of the signal is to activate a 
regulatory protein already present in the 
cell; the signal may, for example, trigger 
the attachment of a phosphate group to 
the regulatory protein, altering its 
chemical properties. 
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...to produce yet more proteins, including 
some additional gene-regulatory proteins 

binding, directly or indirectly, to the regulatory DNA adjacent to the genes that 
are to be controlled (Figure 1-39), or by interfering with the abilities of other pro- 
teins to do so. The expanded genome of eucaryotes therefore not only specifies 
the hardware of the cell, but also stores the software that controls how that hard- 
ware is used (Figure 1-40). 

Cells do not just passively receive signals; rather, they actively exchange sig- 
nals with their neighbors. Thus, in a developing multicellular organism, the 
same control system governs each cell, but with different consequences 
depending on the messages exchanged. The outcome, astonishingly, is a pre- 
cisely patterned array of cells in different states, each displaying a character 
appropriate to its position in the multicellular structure. 



Many Eucaryotes Live as Solitary Cells: the Protists 

Many species of eucaryotic cells lead a solitary life — some as hunters (the proto- 
zoa), some as photosynthesizers (the unicellular algae), some as scavengers (the 
unicellular fungi, or yeasts). Figure 1-41 conveys something of the variety of 
forms of these single-celled eucaryotes, or protists. The anatomy of protozoa, 




Figure 1-40 Genetic control of the 
program of multicellular development. 

The role of a regulatory gene is 
demonstrated in the snapdragon 
Antirrhinum. In this example, a mutation 
in a single gene coding for a regulatory 
protein causes leafy shoots to develop in 
place of flowers: because a regulatory 
protein has been changed, the cells 
adopt characters that would be 
appropriate to a different location in the 
normal plant. The mutant is on the left, 
the normal plant on the right. (Courtesy 
of Enrico Coen and Rosemary Carpenter.) 
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especially, is often elaborate and includes such structures as sensory bristles, 
photoreceptors, sinuously beating cilia, leglike appendages, mouth parts, sting- 
ing darts, and musclelike contractile bundles. Although they are single cells, 
protozoa can be as intricate, as versatile, and as complex in their behavior as 
many multicellular organisms (see Figure 1-32). <ATGG> <TCGC> 

In terms of their ancestry and DNA sequences, protists are far more diverse 
than the multicellular animals, plants, and fungi, which arose as three compara- 
tively late branches of the eucaryotic pedigree (see Figure 1-21). As with procary- 
otes, humans have tended to neglect the protists because they are microscopic. 
Only now, with the help of genome analysis, are we beginning to understand 
their positions in the tree of life, and to put into context the glimpses these 
strange creatures offer us of our distant evolutionary past. 



Figure 1-41 An assortment of protists: a 
small sample of an extremely diverse 
class of organisms. The drawings are 
done to different scales, but in each case 
the scale bar represents 10 jam. The 
organisms in (A), (B), (E), (F), and (I) are 
ciliates; (C) is a euglenoid; (D) is an 
amoeba; (G) is a dinoflagellate; (H) is a 
heliozoan. (From M.A. Sleigh, Biology of 
Protozoa. Cambridge, UK: Cambridge 
University Press, 1973.) 



A Yeast Serves as a Minimal Model Eucaryote 

The molecular and genetic complexity of eucaryotes is daunting. Even more 
than for procaryotes, biologists need to concentrate their limited resources on a 
few selected model organisms to fathom this complexity. 

To analyze the internal workings of the eucaryotic cell, without the addi- 
tional problems of multicellular development, it makes sense to use a species 
that is unicellular and as simple as possible. The popular choice for this role of 
minimal model eucaryote has been the yeast Saccharomyces cerevisiae (Figure 
1-42) — the same species that is used by brewers of beer and bakers of bread. 

S. cerevisiae is a small, single-celled member of the kingdom of fungi and 
thus, according to modern views, at least as closely related to animals as it is to 
plants. It is robust and easy to grow in a simple nutrient medium. Like other 
fungi, it has a tough cell wall, is relatively immobile, and possesses mitochondria 
but not chloroplasts. When nutrients are plentiful, it grows and divides almost as 
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rapidly as a bacterium. It can reproduce either vegetatively (that is, by simple 
cell division), or sexually: two yeast cells that are haploid (possessing a single 
copy of the genome) can fuse to create a cell that is diploid (containing a double 
genome); and the diploid cell can undergo meiosis (a reduction division) to pro- 
duce cells that are once again haploid (Figure 1-43). In contrast with higher 
plants and animals, the yeast can divide indefinitely in either the haploid or the 
diploid state, and the process leading from the one state to the other can be 
induced at will by changing the growth conditions. 

In addition to these features, the yeast has a further property that makes it a 
convenient organism for genetic studies: its genome, by eucaryotic standards, is 
exceptionally small. Nevertheless, it suffices for all the basic tasks that every 
eucaryotic cell must perform. As we shall see later in this book, studies on yeasts 
(using both S. cerevisiae and other species) have provided a key to many crucial 
processes, including the eucaryotic cell-division cycle — the critical chain of 
events by which the nucleus and all the other components of a cell are dupli- 
cated and parceled out to create two daughter cells from one. The control sys- 
tem that governs this process has been so well conserved over the course of evo- 
lution that many of its components can function interchangeably in yeast and 
human cells: if a mutant yeast lacking an essential yeast cell- division- cycle gene 
is supplied with a copy of the homologous cell- division- cycle gene from a 
human, the yeast is cured of its defect and becomes able to divide normally. 



Figure 1 -42 The yeast Saccharomyces 
cerevisiae. (A) A scanning electron 
micrograph of a cluster of the cells. This 
species is also known as budding yeast; it 
proliferates by forming a protrusion or 
bud that enlarges and then separates 
from the rest of the original cell. Many 
cells with buds are visible in this 
micrograph. (B) A transmission electron 
micrograph of a cross section of a yeast 
cell, showing its nucleus, mitochondrion, 
and thick cell wall. (A, courtesy of Ira 
Herskowitz and Eric Schabatach.) 



The Expression Levels of All The Genes of An Organism Can Be 
Monitored Simultaneously 

The complete genome sequence of S. cerevisiae, determined in 1997, consists of 
approximately 13,117,000 nucleotide pairs, including the small contribution 
(78,520 nucleotide pairs) of the mitochondrial DNA. This total is only about 2.5 
times as much DNA as there is in E. coli, and it codes for only 1.5 times as many 
distinct proteins (about 6300 in all). The way of life of S. cerevisiae is similar in 
many ways to that of a bacterium, and it seems that this yeast has likewise been 
subject to selection pressures that have kept its genome compact. 

Knowledge of the complete genome sequence of any organism — be it a yeast 
or a human — opens up new perspectives on the workings of the cell: things that 
once seemed impossibly complex now seem within our grasp. Using techniques 

Figure 1-43 The reproductive cycles of the yeast S. cerevisiae. Depending 
on environmental conditions and on details of the genotype, cells of this 
species can exist in either a diploid (2/i) state, with a double chromosome 
set, or a haploid (n) state, with a single chromosome set. The diploid form 
can either proliferate by ordinary cell-division cycles or undergo meiosis to 
produce haploid cells. The haploid form can either proliferate by ordinary 
cell-division cycles or undergo sexual fusion with another haploid cell to 
become diploid. Meiosis is triggered by starvation and gives rise to 
spores — haploid cells in a dormant state, resistant to harsh environmental 
conditions. 



■ j; x 2 „ 



proliferation 
^of diploid cells/ 

2n 



meiosis and sporulation 
(triggered by starvation) 



mating (usually 

immediately after 

spores hatch) 



h 




spores hatch 



^^F /proliferationX 
I of haploid ] 

A cells J 

BUDDING YEAST LIFE CYCLE 



GENETIC INFORMATION IN EUCARYOTES 



35 




■ DNA/RNA/protein biosynthesis ■ environmental response 

■ cell cycle ■ developmental processes ■ metabolism 

to be described in Chapter 8, it is now possible, for example, to monitor, simul- 
taneously, the amount of mRNA transcript that is produced from every gene in 
the yeast genome under any chosen conditions, and to see how this whole pat- 
tern of gene activity changes when conditions change. The analysis can be 
repeated with mRNA prepared from mutants lacking a chosen gene — any gene 
that we care to test. In principle, this approach provides a way to reveal the 
entire system of control relationships that govern gene expression — not only in 
yeast cells, but in any organism whose genome sequence is known. 



Figure 1 -44 The network of interactions 
between gene regulatory proteins and 
the genes that code for them in a yeast 
cell. Results are shown for 1 06 out of the 
total of 141 gene regulatory proteins in 
Saccharomyces cerevisiae. Each protein in 
the set was tested for its ability to bind to 
the regulatory DNA of each of the genes 
coding for this set of proteins. In the 
diagram, the genes are arranged in a 
circle, and an arrow pointing from gene A 
to gene B means that the protein 
encoded by A binds to the regulatory 
DNA of B, and therefore presumably 
regulates the expression of B. Small 
circles with arrowheads indicate genes 
whose products directly regulate their 
own expression. Genes governing 
different aspects of cell behavior are 
shown in different colors. For a 
multicellular plant or animal, the number 
of gene regulatory proteins is about 
1 times greater, and the amount of 
regulatory DNA perhaps 100 times 
greater, so that the corresponding 
diagram would be vastly more complex. 
(From T.I. Lee et al., Science 298:799-804, 
2002. With permission from AAAS.) 



To Make Sense of Cells, We Need Mathematics, Computers, and 
Quantitative Information 

Through methods such as these, exploiting our knowledge of complete genome 
sequences, we can list the genes and proteins in a cell and begin to depict the 
web of interactions between them (Figure 1-44). But how are we to turn all this 
information into an understanding of how cells work? Even for a single cell type 
belonging to a single species of organism, the current deluge of data seems over- 
whelming. The sort of informal reasoning on which biologists usually rely seems 
totally inadequate in the face of such complexity. In fact, the difficulty is more 
than just a matter of information overload. Biological systems are, for example, 
full of feedback loops, and the behavior of even the simplest of systems with 
feedback is remarkably difficult to predict by intuition alone (Figure 1-45); small 



Figure 1-45 A very simple gene regulatory circuit — a single gene 
regulating its own expression by the binding of its protein product to 
its own regulatory DNA. Simple schematic diagrams such as this are often 
used to summarize what we know (as in Figure 1-44), but they leave many 
questions unanswered. When the protein binds, does it inhibit or 
stimulate transcription? How steeply does the transcription rate depend 
on the protein concentration? How long, on average, does a molecule of 
the protein remain bound to the DNA? How long does it take to make 
each molecule of mRNA or protein, and how quickly does each type of 
molecule get degraded? Mathematical modeling shows that we need 
quantitative answers to all these and other questions before we can 
predict the behavior of even this single-gene system. For different 
parameter values, the system may settle to a unique steady state; or it may 
behave as a switch, capable of existing in one or other of a set of 
alternative states; or it may oscillate; or it may show large random 
fluctuations. 
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changes in parameters can cause radical changes in outcome. To go from a cir- 
cuit diagram to a prediction of the behavior of the system, we need detailed 
quantitative information, and to draw deductions from that information we 
need mathematics and computers. 

These tools for quantitative reasoning are essential, but they are not all- 
powerful. You might think that, knowing how each protein influences each other 
protein, and how the expression of each gene is regulated by the products of oth- 
ers, we should soon be able to calculate how the cell as a whole will behave, just 
as an astronomer can calculate the orbits of the planets, or a chemical engineer 
can calculate the flows through a chemical plant. But any attempt to perform 
this feat for an entire living cell rapidly reveals the limits of our present state of 
knowledge. The information we have, plentiful as it is, is full of gaps and uncer- 
tainties. Moreover, it is largely qualitative rather than quantitative. Most often, 
cell biologists studying the cell's control systems sum up their knowledge in sim- 
ple schematic diagrams — this book is full of them — rather than in numbers, 
graphs, and differential equations. To progress from qualitative descriptions and 
intuitive reasoning to quantitative descriptions and mathematical deduction is 
one of the biggest challenges for contemporary cell biology. So far, the challenge 
has been met only for a few very simple fragments of the machinery of living 
cells — subsystems involving a handful of different proteins, or two or three 
cross-regulatory genes, where theory and experiment can go closely hand in 
hand. We shall discuss some of these examples later in the book. 



Arabidopsis Has Been Chosen Out of 300,000 Species As a Model 
Plant 

The large multicellular organisms that we see around us — the flowers and trees 
and animals — seem fantastically varied, but they are much closer to one another 
in their evolutionary origins, and more similar in their basic cell biology, than 
the great host of microscopic single-celled organisms. Thus, while bacteria and 
eucaryotes are separated by more than 3000 million years of divergent evolu- 
tion, vertebrates and insects are separated by about 700 million years, fish and 
mammals by about 450 million years, and the different species of flowering 
plants by only about 150 million years. 

Because of the close evolutionary relationship between all flowering plants, 
we can, once again, get insight into the cell and molecular biology of this whole 
class of organisms by focusing on just one or a few species for detailed analysis. 
Out of the several hundred thousand species of flowering plants on Earth today, 
molecular biologists have chosen to concentrate their efforts on a small weed, the 
common Thale cress Arabidopsis thaliana (Figure 1-46), which can be grown 
indoors in large numbers, and produces thousands of offspring per plant after 
8-10 weeks. Arabidopsis has a genome of approximately 140 million nucleotide 
pairs, about 11 times as much as yeast, and its complete sequence is known. 




Figure 1-46 Arabidopsis thaliana, the 
plant chosen as the primary model for 
studying plant molecular genetics. 

(Courtesy of Toni Hayden and the John 
Innes Foundation.) 



The World of Animal Cells Is Represented By a Worm, a Fly, a 
Mouse, and a Human 

Multicellular animals account for the majority of all named species of living 
organisms, and for the largest part of the biological research effort. Four species 
have emerged as the foremost model organisms for molecular genetic studies. In 
order of increasing size, they are the nematode worm Caenorhabditis elegans, 
the fly Drosophila melanogaster, the mouse Mus musculus, and the human, 
Homo sapiens. Each of these has had its genome sequenced. 

Caenorhabditis elegans (Figure 1-47) is a small, harmless relative of the eel- 
worm that attacks crops. With a life cycle of only a few days, an ability to survive 
in a freezer indefinitely in a state of suspended animation, a simple body plan, 
and an unusual life cycle that is well suited for genetic studies (described in 
Chapter 23), it is an ideal model organism. C. elegans develops with clockwork 
precision from a fertilized egg cell into an adult worm with exactly 959 body cells 
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Figure 1-47 Caenorhabditis elegans, the 
first multicellular organism to have its 
complete genome sequence 
determined. This small nematode, about 
1 mm long, lives in the soil. Most 
individuals are hermaphrodites, 
producing both eggs and sperm. The 
animal is viewed here using interference 
contrast optics, showing up the 
boundaries of the tissues in bright colors; 
the animal itself is not colored when 
viewed with ordinary lighting. (Courtesy 
of Ian Hope.) 
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(plus a variable number of egg and sperm cells) — an unusual degree of regular- 
ity for an animal. We now have a minutely detailed description of the sequence 
of events by which this occurs, as the cells divide, move, and change their char- 
acters according to strict and predictable rules. The genome of 97 million 
nucleotide pairs codes for about 19,000 proteins, and many mutants and other 
tools are available for the testing of gene functions. Although the worm has a 
body plan very different from our own, the conservation of biological mecha- 
nisms has been sufficient for the worm to be a model for many of the develop- 
mental and cell-biological processes that occur in the human body. Studies of 
the worm help us to understand, for example, the programs of cell division and 
cell death that determine the numbers of cells in the body — a topic of great 
importance in developmental biology and cancer research. 



Studies in Drosophila Provide a Key to Vertebrate Development 

The fruitfly Drosophila melanogaster (Figure 1-48) has been used as a model 
genetic organism for longer than any other; in fact, the foundations of classical 
genetics were built to a large extent on studies of this insect. Over 80 years ago, it 
provided, for example, definitive proof that genes — the abstract units of heredi- 
tary information — are carried on chromosomes, concrete physical objects whose 
behavior had been closely followed in the eucaryotic cell with the light micro- 
scope, but whose function was at first unknown. The proof depended on one of 
the many features that make Drosophila peculiarly convenient for genetics — the 




Figure 1-48 Drosophila melanogaster. 

Molecular genetic studies on this fly have 
provided the main key to understanding 
how all animals develop from a fertilized 
egg into an adult. (From E.B. Lewis, 
Science 221 :cover, 1983. With permission 
from AAAS.) 
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giant chromosomes, with characteristic banded appearance, that are visible in 
some of its cells (Figure 1-49). Specific changes in the hereditary information, 
manifest in families of mutant flies, were found to correlate exactly with the loss 
or alteration of specific giant-chromosome bands. 

In more recent times, Drosophila, more than any other organism, has shown 
us how to trace the chain of cause and effect from the genetic instructions 
encoded in the chromosomal DNA to the structure of the adult multicellular 
body. Drosophila mutants with body parts strangely misplaced or mispatterned 
provided the key to the identification and characterization of the genes required 
to make a properly structured body, with gut, limbs, eyes, and all the other parts 
in their correct places. Once these Drosophila genes were sequenced, the 
genomes of vertebrates could be scanned for homologs. These were found, and 
their functions in vertebrates were then tested by analyzing mice in which the 
genes had been mutated. The results, as we see later in the book, reveal an 
astonishing degree of similarity in the molecular mechanisms of insect and ver- 
tebrate development. 

The majority of all named species of living organisms are insects. Even if 
Drosophila had nothing in common with vertebrates, but only with insects, it 
would still be an important model organism. But if understanding the molecular 
genetics of vertebrates is the goal, why not simply tackle the problem head-on? 
Why sidle up to it obliquely, through studies in Drosophila 7 . 

Drosophila requires only 9 days to progress from a fertilized egg to an adult; 
it is vastly easier and cheaper to breed than any vertebrate, and its genome is 
much smaller — about 170 million nucleotide pairs, compared with 3200 million 
for a human. This genome codes for about 14,000 proteins, and mutants can 
now be obtained for essentially any gene. But there is also another, deeper rea- 
son why genetic mechanisms that are hard to discover in a vertebrate are often 
readily revealed in the fly. This relates, as we now explain, to the frequency of 
gene duplication, which is substantially greater in vertebrate genomes than in 
the fly genome and has probably been crucial in making vertebrates the com- 
plex and subtle creatures that they are. 



The Vertebrate Genome Is a Product of Repeated Duplication 

Almost every gene in the vertebrate genome has paralogs — other genes in the 
same genome that are unmistakably related and must have arisen by gene dupli- 
cation. In many cases, a whole cluster of genes is closely related to similar clus- 
ters present elsewhere in the genome, suggesting that genes have been dupli- 
cated in linked groups rather than as isolated individuals. According to one 
hypothesis, at an early stage in the evolution of the vertebrates, the entire 
genome underwent duplication twice in succession, giving rise to four copies of 
every gene. In some groups of vertebrates, such as fish of the salmon and carp 
families (including the zebrafish, a popular research animal), it has been sug- 
gested that there was yet another duplication, creating an eightfold multiplicity 
of genes. 

The precise course of vertebrate genome evolution remains uncertain, 
because many further evolutionary changes have occurred since these ancient 
events. Genes that were once identical have diverged; many of the gene copies 
have been lost through disruptive mutations; some have undergone further 
rounds of local duplication; and the genome, in each branch of the vertebrate 
family tree, has suffered repeated rearrangements, breaking up most of the orig- 
inal gene orderings. Comparison of the gene order in two related organisms, such 
as the human and the mouse, reveals that — on the time scale of vertebrate evo- 
lution — chromosomes frequently fuse and fragment to move large blocks of DNA 
sequence around. Indeed, it is possible, as we shall discuss in Chapter 7, that the 
present state of affairs is the result of many separate duplications of fragments of 
the genome, rather than duplications of the genome as a whole. 

There is, however, no doubt that such whole-genome duplications do 
occur from time to time in evolution, for we can see recent instances in which 
duplicated chromosome sets are still clearly identifiable as such. The frog 
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Figure 1-49 Giant chromosomes from 
salivary gland cells of Drosophila. 

Because many rounds of DNA replication 
have occurred without an intervening cell 
division, each of the chromosomes in 
these unusual cells contains over 1000 
identical DNA molecules, all aligned in 
register. This makes them easy to see in 
the light microscope, where they display 
a characteristic and reproducible banding 
pattern. Specific bands can be identified 
as the locations of specific genes: a 
mutant fly with a region of the banding 
pattern missing shows a phenotype 
reflecting loss of the genes in that region. 
Genes that are being transcribed at a 
high rate correspond to bands with a 
"puffed" appearance. The bands stained 
dark brown in the micrograph are sites 
where a particular regulatory protein is 
bound to the DNA. (Courtesy of B. Zink 
and R. Paro, from R. Paro, Trends Genet. 
6:416-421, 1990. With permission 
from Elsevier.) 
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Figure 1 -50 Two species of the frog genus Xenopus. X. tropicalis, above, 
has an ordinary diploid genome; X. laevis, below, has twice as much DNA 
per cell. From the banding patterns of their chromosomes and the 
arrangement of genes along them, as well as from comparisons of gene 
sequences, it is clear that the large-genome species have evolved through 
duplications of the whole genome. These duplications are thought to have 
occurred in the aftermath of matings between frogs of slightly divergent 
Xenopus species. (Courtesy of E. Amaya, M. Offield and R. Grainger, Trends 
Genet 14:253-255, 1998. With permission from Elsevier.) 



genus Xenopus, for example, comprises a set of closely similar species related 
to one another by repeated duplications or triplications of the whole genome. 
Among these frogs are X tropicalis, with an ordinary diploid genome; the com- 
mon laboratory species X. laevis, with a duplicated genome and twice as much 
DNA per cell; and X. ruwenzoriensis, with a sixfold reduplication of the original 
genome and six times as much DNA per cell (108 chromosomes, compared 
with 36 in X. laevis, for example). These species are estimated to have diverged 
from one another within the past 120 million years (Figure 1-50). 



Genetic Redundancy Is a Problem for Geneticists, But It Creates 
Opportunities for Evolving Organisms 

Whatever the details of the evolutionary history, it is clear that most genes in the 
vertebrate genome exist in several versions that were once identical. The related 
genes often remain functionally interchangeable for many purposes. This phe- 
nomenon is called genetic redundancy. For the scientist struggling to discover 
all the genes involved in some particular process, it complicates the task. If gene 
A is mutated and no effect is seen, it cannot be concluded that gene A is func- 
tionally irrelevant — it may simply be that this gene normally works in parallel 
with its relatives, and these suffice for near-normal function even when gene A 
is defective. In the less repetitive genome of Drosophila, where gene duplication 
is less common, the analysis is more straightforward: single gene functions are 
revealed directly by the consequences of single-gene mutations (the single- 
engined plane stops flying when the engine fails). 

Genome duplication has clearly allowed the development of more complex 
life forms; it provides an organism with a cornucopia of spare gene copies, 
which are free to mutate to serve divergent purposes. While one copy becomes 
optimized for use in the liver, say, another can become optimized for use in the 
brain or adapted for a novel purpose. In this way, the additional genes allow for 
increased complexity and sophistication. As the genes take on divergent func- 
tions, they cease to be redundant. Often, however, while the genes acquire indi- 
vidually specialized roles, they also continue to perform some aspects of their 
original core function in parallel, redundantly. Mutation of a single gene then 
causes a relatively minor abnormality that reveals only a part of the gene's 
function (Figure 1-51). Families of genes with divergent but partly overlap- 
ping functions are a pervasive feature of vertebrate molecular biology, and they 
are encountered repeatedly in this book. 




The Mouse Serves as a Model for Mammals 

Mammals have typically three or four times as many genes as Drosophila, a 
genome that is 20 times larger, and millions or billions of times as many cells in 
their adult bodies. In terms of genome size and function, cell biology, and 
molecular mechanisms, mammals are nevertheless a highly uniform group of 
organisms. Even anatomically, the differences among mammals are chiefly a 
matter of size and proportions; it is hard to think of a human body part that does 
not have a counterpart in elephants and mice, and vice versa. Evolution plays 
freely with quantitative features, but it does not readily change the logic of the 
structure. 
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For a more exact measure of how closely mammalian species resemble one 
another genetically, we can compare the nucleotide sequences of corresponding 
(orthologous) genes, or the amino acid sequences of the proteins that these 
genes encode. The results for individual genes and proteins vary widely But typ- 
ically, if we line up the amino acid sequence of a human protein with that of the 
orthologous protein from, say, an elephant, about 85% of the amino acids are 
identical. A similar comparison between human and bird shows an amino acid 
identity of about 70% — twice as many differences, because the bird and the 
mammalian lineages have had twice as long to diverge as those of the elephant 
and the human (Figure 1-52). 

The mouse, being small, hardy, and a rapid breeder, has become the fore- 
most model organism for experimental studies of vertebrate molecular genetics. 
Many naturally occurring mutations are known, often mimicking the effects of 
corresponding mutations in humans (Figure 1-53). Methods have been devel- 
oped, moreover, to test the function of any chosen mouse gene, or of any non- 
coding portion of the mouse genome, by artificially creating mutations in it, as 
we explain later in the book. 

One made-to-order mutant mouse can provide a wealth of information for 
the cell biologist. It reveals the effects of the chosen mutation in a host of differ- 
ent contexts, simultaneously testing the action of the gene in all the different 
kinds of cells in the body that could in principle be affected. 



Humans Report on Their Own Peculiarities 

As humans, we have a special interest in the human genome. We want to know 
the full set of parts from which we are made, and to discover how they work. But 
even if you were a mouse, preoccupied with the molecular biology of mice, 
humans would be attractive as model genetic organisms, because of one special 
property: through medical examinations and self-reporting, we catalog our own 
genetic (and other) disorders. The human population is enormous, consisting 
today of some 6 billion individuals, and this self- documenting property means 
that a huge database of information exists on human mutations. The complete 
human genome sequence of more than 3 billion nucleotide pairs has now been 
determined, making it easier than ever before to identify at a molecular level the 
precise gene responsible for each human mutant characteristic. 

By drawing together the insights from humans, mice, flies, worms, yeasts, 
plants, and bacteria — using gene sequence similarities to map out the corre- 
spondences between one model organism and another — we enrich our under- 
standing of them all. 



Figure 1-51 The consequences of gene 
duplication for mutational analyses of 
gene function. In this hypothetical 
example, an ancestral multicellular 
organism has a genome containing a 
single copy of gene G, which performs its 
function at several sites in the body, 
indicated in green. (A) Through gene 
duplication, a modern descendant of the 
ancestral organism has two copies of 
gene G, called Gi and G2. These have 
diverged somewhat in their patterns of 
expression and in their activities at the 
sites where they are expressed, but they 
still retain important similarities. At some 
sites, they are expressed together, and 
each independently performs the same 
old function as the ancestral gene G 
(alternating green and yellow stripes); at 
other sites, they are expressed alone and 
may serve new purposes. (B) Because of a 
functional overlap, the loss of one of the 
two genes by mutation (red cross) reveals 
only a part of its role; only the loss of 
both genes in the double mutant reveals 
the full range of processes for which 
these genes are responsible. Analogous 
principles apply to duplicated genes that 
operate in the same place (for example, 
in a single-celled organism) but are called 
into action together or individually in 
response to varying circumstances. Thus, 
gene duplications complicate genetic 
analyses in all organisms. 
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We Are All Different in Detail 

What precisely do we mean when we speak of the human genome? Whose 
genome? On average, any two people taken at random differ in about one or 
two in every 1000 nucleotide pairs in their DNA sequence. The Human Genome 
Project has arbitrarily selected DNA from a small number of anonymous indi- 
viduals for sequencing. The human genome — the genome of the human 
species — is, properly speaking, a more complex thing, embracing the entire pool 
of variant genes that are found in the human population and continually 
exchanged and reassorted in the course of sexual reproduction. Ultimately, we 
can hope to document this variation too. Knowledge of it will help us under- 
stand, for example, why some people are prone to one disease, others to another; 
why some respond well to a drug, others badly. It will also provide new clues to 
our history — the population movements and minglings of our ancestors, the 
infections they suffered, the diets they ate. All these things leave traces in the 
variant forms of genes that have survived in human communities. 



Figure 1 -52 Times of divergence of 
different vertebrates. The scale on the 
left shows the estimated date and 
geological era of the last common 
ancestor of each specified pair of animals. 
Each time estimate is based on 
comparisons of the amino acid 
sequences of orthologous proteins; the 
longer a pair of animals have had to 
evolve independently, the smaller the 
percentage of amino acids that remain 
identical. Data from many different 
classes of proteins have been averaged to 
arrive at the final estimates, and the time 
scale has been calibrated to match the 
fossil evidence that the last common 
ancestor of mammals and birds lived 310 
million years ago. The figures on the right 
give data on sequence divergence for 
one particular protein (chosen 
arbitrarily) — the a chain of hemoglobin. 
Note that although there is a clear 
general trend of increasing divergence 
with increasing time for this protein, 
there are also some irregularities. These 
reflect the randomness within the 
evolutionary process and, probably, the 
action of natural selection driving 
especially rapid changes of hemoglobin 
sequence in some organisms that 
experienced special physiological 
demands. On average, within any 
particular evolutionary lineage, 
hemoglobins accumulate changes at a 
rate of about 6 altered amino acids per 
1 00 amino acids every 1 00 million years. 
Some proteins, subject to stricter 
functional constraints, evolve much more 
slowly than this, others as much as 
5 times faster. All this gives rise to 
substantial uncertainties in estimates of 
divergence times, and some experts 
believe that the major groups of 
mammals diverged from one another as 
much as 60 million years more recently 
than shown here. (Adapted from 
S. Kumar and S.B. Hedges, Nature 
392:917-920, 1998. With permission from 
Macmillan Publishers Ltd.) 
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Figure 1-53 Human and mouse: similar 
genes and similar development. The 

human baby and the mouse shown here 
have similar white patches on their 
foreheads because both have mutations in 
the same gene (called Kit), required for the 
development and maintenance of pigment 
cells. (Courtesy of R.A. Fleischman.) 
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Knowledge and understanding bring the power to intervene — with humans, 
to avoid or prevent disease; with plants, to create better crops; with bacteria, to 
turn them to our own uses. All these biological enterprises are linked, because 
the genetic information of all living organisms is written in the same language. 
The new-found ability of molecular biologists to read and decipher this lan- 
guage has already begun to transform our relationship to the living world. The 
account of cell biology in the subsequent chapters will, we hope, prepare you to 
understand, and possibly to contribute to, the great scientific adventure of the 
twenty-first century. 



Summary 

Eucaryotic cells, by definition, keep their DN A in a separate membrane-enclosed com- 
partment, the nucleus. They have, in addition, a cytoskeleton for support and move- 
ment, elaborate intracellular compartments for digestion and secretion, the capacity 
(in many species) to engulf other cells, and a metabolism that depends on the oxida- 
tion of organic molecules by mitochondria. These properties suggest that eucaryotes 
may have originated as predators on other cells. Mitochondria — and, in plants, 
chloroplasts — contain their own genetic material, and evidently evolved from bacteria 
that were taken up into the cytoplasm of the eucaryotic cell and survived as symbionts. 
Eucaryotic cells have typically 3-30 times as many genes as procaryotes, and often 
thousands of times more noncoding DNA. The noncodingDNA allows for complex reg- 
ulation of gene expression, as required for the construction of complex multicellular 
organisms. Many eucaryotes are, however, unicellular — among them the yeast Sac- 
charomyces cerevisiae, which serves as a simple model organism for eucaryotic cell 
biology, revealing the molecular basis of conserved fundamental processes such as the 
eucaryotic cell division cycle. A small number of other organisms have been chosen as 
primary models for multicellular plants and animals, and the sequencing of their 
entire genomes has opened the way to systematic and comprehensive analysis of gene 
functions, gene regulation, and genetic diversity. As a result of gene duplications dur- 
ing vertebrate evolution, vertebrate genomes contain multiple closely related 
homologs of most genes. This genetic redundancy has allowed diversification and spe- 
cialization of genes for new purposes, but it also makes gene functions harder to deci- 
pher. There is less genetic redundancy in the nematode Caenorhabditis elegans and the 
fly Drosophila melanogaster, which have thus played a key part in revealing universal 
genetic mechanisms of animal development. 



PROBLEMS 

Which statements are true? Explain why or why not. 

1-1 The human hemoglobin genes, which are arranged 
in two clusters on two chromosomes, provide a good exam- 
ple of an orthologous set of genes. 

1 -2 Horizontal gene transfer is more prevalent in single - 
celled organisms than in multicellular organisms. 

1 -3 Most of the DNA sequences in a bacterial genome 
code for proteins, whereas most of the sequences in the 
human genome do not. 



Discuss the following problems. 

1 -4 Since it was deciphered four decades ago, some have 
claimed that the genetic code must be a frozen accident, 
while others have argued that it was shaped by natural selec- 
tion. A striking feature of the genetic code is its inherent resis- 
tance to the effects of mutation. For example, a change in the 
third position of a codon often specifies the same amino acid 
or one with similar chemical properties. The natural code 



resists mutation more effectively (is less susceptible to error) 
than most other possible versions, as illustrated in Figure 
Ql-1. Only one in a million computer- generated "random" 
codes is more error-resistant than the natural genetic code. 
Does the extraordinary mutation resistance of the genetic 
code argue in favor of its origin as a frozen accident or as a 
result of natural selection? Explain your reasoning. 



^ 25 



£ 20 




5 10 15 

susceptibility to mutation 



Figure Ql-1 Susceptibility of 
the natural code relative to 
millions of computer- 
generated codes (Problem 
1-4). Susceptibility measures 
the average change in amino 
acid properties caused by 
random mutations. A small 
value indicates that 
mutations tend to cause 
20 minor changes. (Data 

courtesy of Steve Freeland.) 



1-5 You have begun to characterize a sample obtained 
from the depths of the oceans on Europa, one of Jupiter's 
moons. Much to your surprise, the sample contains a life- 
form that grows well in a rich broth. Your preliminary analysis 



END-OF-CHAPTER PROBLEMS 
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shows that it is cellular and contains DNA, RNA, and pro- 
tein. When you show your results to a colleague, she sug- 
gests that your sample was contaminated with an organism 
from Earth. What approaches might you try to distinguish 
between contamination and a novel cellular life-form based 
on DNA, RNA, and protein? 

1 -6 It is not so difficult to imagine what it means to feed 
on the organic molecules that living things produce. That is, 
after all, what we do. But what does it mean to "feed" on sun- 
light, as phototrophs do? Or, even stranger, to "feed" on 
rocks, as lithotrophs do? Where is the "food," for example, in 
the mixture of chemicals (H 2 S, H 2 , CO, Mn + , Fe 2+ , Ni 2+ , CH 4 , 
and NH 4 + ) spewed forth from a hydrothermal vent? 

1-7 How many possible different trees (branching pat- 
terns) can be drawn for eubacteria, archaea, and eucaryotes, 
assuming that they all arose from a common ancestor? 

1-8 The genes for ribosomal RNA are highly conserved 
(relatively few sequence changes) in all organisms on Earth; 
thus, they have evolved very slowly over time. Were riboso- 
mal RNA genes "born" perfect? 

1-9 Genes participating in informational processes such 
as replication, transcription, and translation are transferred 
between species much less often than are genes involved in 
metabolism. The basis for this inequality is unclear at pre- 
sent, but one suggestion is that it relates to the underlying 
complexity. Informational processes tend to involve large 
aggregates of different gene products, whereas metabolic 
reactions are usually catalyzed by enzymes composed of a 
single protein. Why would the complexity of the underlying 
process — informational or metabolic — have any effect on 
the rate of horizontal gene transfer? 

1-10 The process of gene transfer from the mitochondrial 
to the nuclear genome can be analyzed in plants. The respi- 
ratory gene Cox2 f which encodes subunit 2 of cytochrome 
oxidase, was functionally transferred to the nucleus during 
flowering plant evolution. Extensive analyses of plant gen- 
era have pinpointed the time of appearance of the nuclear 
form of the gene and identified several likely intermedi- 
ates in the ultimate loss from the mitochondrial genome. 
A summary of Cox2 gene distributions between mitochon- 
dria and nuclei, along with data on their transcription, is 
shown in a phylogenetic context in Figure Ql-2. 

A. Assuming that transfer of the mitochondrial gene to the 
nucleus occurred only once (an assumption supported by 
the structures of the nuclear genes), indicate the point in the 
phylogenetic tree where the transfer occurred. 

B. Are there any examples of genera in which the trans- 
ferred gene and the mitochondrial gene both appear func- 
tional? Indicate them. 

C. What is the minimal number of times that the mito- 
chondrial gene has been inactivated or lost? Indicate those 
events on the phylogenetic tree. 

D. What is the minimal number of times that the nuclear 
gene has been inactivated or lost? Indicate those events on 
the phylogenetic tree. 

E. Based on this information, propose a general scheme 
for transfer of mitochondrial genes to the nuclear genome. 

1-11 When plant hemoglobin genes were first discovered 
in legumes, it was so surprising to find a gene typical of ani- 
mal blood that it was hypothesized that the plant gene arose 
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Figure Ql-2 Summary of Cox2 gene distribution and transcript data 
in a phylogenetic context (Problem 1-10). The presence of the 
intact gene or a functional transcript is indicated by (+); the absence 
of the intact gene or a functional transcript is indicated by (-). mt, 
mitochondria; nuc, nuclei. 

by horizontal transfer from an animal. Many more 
hemoglobin genes have now been sequenced, and a phylo- 
genetic tree based on some of these sequences is shown in 
Figure Q 1-3. 

A. Does this tree support or refute the hypothesis that the 
plant hemoglobins arose by horizontal gene transfer? 

B. Supposing that the plant hemoglobin genes were origi- 
nally derived from a parasitic nematode, for example, what 
would you expect the phylogenetic tree to look like? 
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Figure Q1-3 Phylogenetic tree for hemoglobin genes from a variety 
of species (Problem 1-1 1).The legumes are highlighted in red. 
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1-12 Rates of evolution appear to vary in different lin- 
eages. For example, the rate of evolution in the rat lineage is 
significantly higher than in the human lineage. These rate 
differences are apparent whether one looks at changes in 
protein sequences that are subject to selective pressure or at 



changes in noncoding nucleotide sequences, which are not 
under obvious selection pressure. Can you offer one or more 
possible explanations for the slower rate of evolutionary 
change in the human lineage versus the rat lineage? 
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Cell Chemistry and 
Biosynthesis 




It is at first sight difficult to accept the idea that each of the living creatures 
described in Chapter 1 is merely a chemical system. The incredible diversity of 
living forms, their seemingly purposeful behavior, and their ability to grow and 
reproduce appear to set them apart from the world of solids, liquids, and gases 
that chemistry normally describes. Indeed, until the nineteenth century animals 
were believed to contain a Vital Force — an "animus" — that was responsible for 
their distinctive properties. 

We now know there is nothing in living organisms that disobeys chemical 
and physical laws. However, the chemistry of life is special. First, it is based over- 
whelmingly on carbon compounds, whose study is therefore known as organic 
chemistry. Second, cells are 70 percent water, and life depends largely on chem- 
ical reactions that take place in aqueous solution. Third, and most important, 
cell chemistry is enormously complex: even the simplest cell is vastly more com- 
plicated in its chemistry than any other chemical system known. Although cells 
contain a variety of small carbon-containing molecules, most of the carbon 
atoms in cells are incorporated into enormous polymeric molecules — chains of 
chemical subunits linked end-to-end. It is the unique properties of these macro- 
molecules that enable cells and organisms to grow and reproduce — as well as to 
do all the other things that are characteristic of life. 
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THE CHEMICAL COMPONENTS OF A CELL 

Matter is made of combinations of elements — substances such as hydrogen or 
carbon that cannot be broken down or converted into other substances by 
chemical means. The smallest particle of an element that still retains its distinc- 
tive chemical properties is an atom (Figure 2-1). However, the characteristics of 
substances other than pure elements — including the materials from which liv- 
ing cells are made — depend on the way their atoms are linked together in groups 
to form molecules. In order to understand how living organisms are built from 
inanimate matter, therefore, it is crucial to know how all of the chemical bonds 
that hold atoms together in molecules are formed. 



Cells Are Made From a Few Types of Atoms 

The atomic weight of an atom, or the molecular weight of a molecule, is its 
mass relative to that of a hydrogen atom. This is essentially equal to the number 
of protons plus neutrons that the atom or molecule contains, since the electrons 
are much lighter and contribute almost nothing to the total. Thus the major iso- 
tope of carbon has an atomic weight of 12 and is symbolized as 12 C, whereas an 
unstable isotope of carbon has an atomic weight of 14 and is written as 14 C. The 
mass of an atom or a molecule is often specified in daltons, one dalton being an 
atomic mass unit approximately equal to the mass of a hydrogen atom. 

Atoms are so small that it is hard to imagine their size. An individual carbon 
atom is roughly 0.2 nm in diameter, so that it would take about 5 million of them, 
laid out in a straight line, to span a millimeter. One proton or neutron weighs 



45 



46 Chapter 2: Cell Chemistry and Biosynthesis 



neutron electron 



proton 





carbon atom 

atomic number = 6 
atomic weight = 12 



hydrogen atom 

atomic number = 1 
atomic weight = 1 



approximately 1/(6 x 10 23 ) gram, so one gram of hydrogen contains 6 x 10 23 
atoms. This huge number (6 x 10 23 , called Avogadro's number) is the key scale 
factor describing the relationship between everyday quantities and quantities 
measured in terms of individual atoms or molecules. If a substance has a molec- 
ular weight of X, 6 x 10 23 molecules of it will have a mass of X grams. This quan- 
tity is called one mole of the substance (Figure 2-2). 

There are 89 naturally occurring elements, each differing from the others in 
the number of protons and electrons in its atoms. Living organisms, however, 
are made of only a small selection of these elements, four of which — carbon (C), 
hydrogen (H), nitrogen (N), and oxygen (O) — make up 96.5% of an organism's 
weight. This composition differs markedly from that of the nonliving inorganic 
environment (Figure 2-3) and is evidence of a distinctive type of chemistry. 



The Outermost Electrons Determine How Atoms Interact 

To understand how atoms bond together to form the molecules that make up 
living organisms, we focus on their electrons. Protons and neutrons are welded 
tightly to one another in the nucleus and change partners only under extreme 
conditions— during radioactive decay for example, or in the interior of the sun 
or of a nuclear reactor. In living tissues, it is only the electrons of an atom that 
undergo rearrangements. They form the exterior of an atom and specify the 
rules of chemistry by which atoms combine to form molecules. 

Electrons are in continuous motion around the nucleus, but motions on this 
submicroscopic scale obey very different laws from those familiar in everyday 
life. These laws dictate that electrons in an atom can exist only in certain discrete 
states, called orbitals, and that there is a strict limit to the number of electrons 
that can be accommodated in an orbital of a given type — a so-called electron 
shell The electrons closest on average to the positive nucleus are attracted most 
strongly to it and occupy the innermost, most tightly bound shell. This shell 
holds a maximum of two electrons. The second shell is farther away from the 
nucleus, and its electrons are less tightly bound. This second shell holds up to 
eight electrons. The third shell contains electrons that are even less tightly 
bound; it also holds up to eight electrons. The fourth and fifth shells can hold 18 
electrons each. Atoms with more than four shells are very rare in biological 
molecules. 

The electron arrangement of an atom is most stable when all the electrons 
are in the most tightly bound states that are possible — that is, when they occupy 
the innermost shells. Therefore, with certain exceptions in the larger atoms, the 
electrons of an atom fill the orbitals in order — the first shell before the second, 
the second before the third, and so on. An atom whose outermost shell is 
entirely filled with electrons is especially stable and therefore chemically unre- 
active. Examples are helium with 2 electrons, neon with 2 + 8, and argon with 
2 + 8 + 8; these are all inert gases. Hydrogen, by contrast, with only one electron 



Figure 2-1 Highly schematic 
representations of an atom of carbon 
and an atom of hydrogen. The nucleus of 
every atom except hydrogen consists of 
both positively charged protons and 
electrically neutral neutrons. The number 
of electrons in an atom is equal to its 
number of protons (the atomic number), 
so that the atom has no net charge. 
Because it is the electrons that determine 
the chemical behavior of an atom, all of 
the atoms of a given element have the 
same atomic number. 

Neutrons are uncharged subatomic 
particles of essentially the same mass as 
protons. They contribute to the structural 
stability of the nucleus— if there are too 
many or too few, the nucleus may 
disintegrate by radioactive decay — but 
they do not alter the chemical properties 
of the atom. Because of neutrons, an 
element can exist in several physically 
distinguishable but chemically identical 
forms, called isotopes, each isotope 
having a different number of neutrons 
but the same number of protons. 
Multiple isotopes of almost all the 
elements occur naturally, including some 
that are unstable. For example, while 
most carbon on Earth exists as the stable 
isotope carbon 1 2, with six protons and 
six neutrons, there are also small 
amounts of an unstable isotope, the 
radioactive carbon 14, whose atoms have 
six protons and eight neutrons. Carbon 
14 undergoes radioactive decay at a slow 
but steady rate. This forms the basis for a 
technique known as carbon 14 dating, 
which is used in archaeology to 
determine the time of origin of organic 
materials. 

The neutrons, protons, and electrons 
are in reality minute in relation to the 
atom as a whole; their size is greatly 
exaggerated here. In addition, the 
diameter of the nucleus is only about 
1 0" 4 that of the electron cloud. Finally, 
although the electrons are shown here as 
individual particles, in reality their 
behavior is governed by the laws of 
quantum mechanics, and there is no way 
of predicting exactly where an electron is 
at any given instant of time. 
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and only a half- filled shell, is highly reactive. Likewise, the other atoms found in 
living tissues have incomplete outer electron shells and can donate, accept, or 
share electrons with each other to form both molecules and ions (Figure 2-4). 

Because an unfilled electron shell is less stable than a filled one, atoms with 
incomplete outer shells tend to interact with other atoms in a way that causes 
them to either gain or lose enough electrons to achieve a completed outermost 
shell. This electron exchange occurs either by transferring electrons from one 
atom to another or by sharing electrons between two atoms. These two strate- 
gies generate two types of chemical bonds between atoms: an ionic bond is 
formed when electrons are donated by one atom to another, whereas a covalent 
bond is formed when two atoms share a pair of electrons (Figure 2-5). Often, the 
pair of electrons is shared unequally, with a partial transfer between two atoms 
that attract electrons differently— one more electronegative than the other: this 
intermediate strategy results in a polar covalent bond, as we shall discuss later. 

An H atom, which needs only one electron to fill its shell, generally acquires 
it by electron sharing, forming one covalent bond with another atom; often this 
bond is polar — meaning that the electrons are shared unequally The other com- 
mon elements in living cells — C, N, and 0, with an incomplete second shell, and 
P and S, with an incomplete third shell (see Figure 2-4) — generally share elec- 
trons and achieve a filled outer shell of eight electrons by forming several cova- 
lent bonds. The number of electrons that an atom must acquire or lose (either 
by sharing or by transfer) to fill its outer shell is known as its valence. 

The crucial role of the outer electron shell in determining the chemical prop- 
erties of an element means that, when the elements are listed in order of their 
atomic number, there is a periodic recurrence of elements with similar proper- 
ties: an element with, say, an incomplete second shell containing one electron 
will behave in much the same way as an element that has filled its second shell 



A mole is X grams of a substance, 
where X is its relative molecular mass 
(molecular weight). A mole will contain 
6 x 10 23 molecules of the substance. 

1 mole of carbon weighs 12 g 

1 mole of glucose weighs 180 g 

1 mole of sodium chloride weighs 58 g 

Molar solutions have a concentration 
of 1 mole of the substance in 1 liter of 
solution. A molar solution (denoted as 
1 M) of glucose, for example, has 
1 80 g/l, while a millimolar solution 
(1 mM)has180mg/l. 

The standard abbreviation for gram is g; 
the abbreviation for liter is I. 



Figure 2-2 Moles and molar solutions. 
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Figure 2-3 The abundances of some 
chemical elements in the nonliving 
world (the Earth's crust) compared with 
their abundances in the tissues of an 
animal. The abundance of each element 
is expressed as a percentage of the total 
number of atoms present including 
water. Thus, because of the abundance of 
water, more than 60% of the atoms in a 
living organism are hydrogen atoms. The 
relative abundance of elements is similar 
in all living things. 
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Figure 2-4 Filled and unfilled electron 
shells in some common elements. All 

the elements commonly found in living 
organisms have unfilled outermost shells 
(red) and can thus participate in chemical 
reactions with other atoms. For 
comparison, some elements that have 
only filled shells (yellow) are shown; these 
are chemically unreactive. 



and has an incomplete third shell containing one electron. The metals, for 
example, have incomplete outer shells with just one or a few electrons, whereas, 
as we have just seen, the inert gases have full outer shells. This pattern gives rise 
to the famous periodic table of the elements, presented in Figure 2-6 with the 
elements found in living organisms highlighted. 



Covalent Bonds Form by the Sharing of Electrons 

All the characteristics of a cell depend on the molecules it contains. A molecule 
is defined as a cluster of atoms held together by covalent bonds; here electrons 
are shared between atoms to complete the outer shells, rather than being trans- 
ferred between them. In the simplest possible molecule— a molecule of hydro- 
gen (H 2 )— two H atoms, each with a single electron, share two electrons, which 
is the number required to fill the first shell. These shared electrons form a cloud 
of negative charge that is densest between the two positively charged nuclei and 
helps to hold them together, in opposition to the mutual repulsion between like 
charges that would otherwise force them apart. The attractive and repulsive 
forces are in balance when the nuclei are separated by a characteristic distance, 
called the bond length. 

Another property of any bond— covalent or noncovalent— is its bond strength, 
which is measured by the amount of energy that must be supplied to break that 
bond. This is often expressed in units of kilocalories per mole (kcal/mole), where 
a kilocalorie is the amount of energy needed to raise the temperature of one liter 
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Figure 2-5 Comparison of covalent and 
ionic bonds. Atoms can attain a more 
stable arrangement of electrons in their 
outermost shell by interacting with one 
another. An ionic bond is formed when 
electrons are transferred from one atom 
to the other. A covalent bond is formed 
when electrons are shared between 
atoms. The two cases shown represent 
extremes; often, covalent bonds form 
with a partial transfer (unequal sharing of 
electrons), resulting in a polar covalent 
bond (see Figure 2-43). 
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of water by one degree Celsius (centigrade). Thus if 1 kilocalorie must be supplied 
to break 6 x 10 23 bonds of a specific type (that is, 1 mole of these bonds), then the 
strength of that bond is 1 kcal/mole. An equivalent, widely used measure of 
energy is the kilojoule, which is equal to 0.239 kilocalories. 

To understand bond strengths, it is helpful to compare them with the aver- 
age energies of the impacts that molecules are constantly experiencing from col- 
lisions with other molecules in their environment (their thermal, or heat, 
energy), as well as with other sources of biological energy such as light and glu- 
cose oxidation (Figure 2-7). Typical covalent bonds are stronger than the ther- 
mal energies by a factor of 100, so they resist being pulled apart by thermal 
motions and are normally broken only during specific chemical reactions with 
other atoms and molecules. The making and breaking of covalent bonds are 
violent events, and in living cells they are carefully controlled by highly specific 
catalysts, called enzymes. Noncovalent bonds as a rule are much weaker; we 
shall see later that they are important in the cell in the many situations where 
molecules have to associate and dissociate readily to carry out their functions. 

Whereas an H atom can form only a single covalent bond, the other com- 
mon atoms that form covalent bonds in cells — O, N, S, and P, as well as the all- 
important C atom — can form more than one. The outermost shell of these 
atoms, as we have seen, can accommodate up to eight electrons, and they form 
covalent bonds with as many other atoms as necessary to reach this number. 
Oxygen, with six electrons in its outer shell, is most stable when it acquires an 
extra two electrons by sharing with other atoms and therefore forms up to two 
covalent bonds. Nitrogen, with five outer electrons, forms a maximum of three 
covalent bonds, while carbon, with four outer electrons, forms up to four cova- 
lent bonds — thus sharing four pairs of electrons (see Figure 2-4). 

When one atom forms covalent bonds with several others, these multiple 
bonds have definite arrangements in space relative to one another, reflecting the 
orientations of the orbits of the shared electrons. The covalent bonds of such an 
atom are therefore characterized by specific bond angles as well as by bond 
lengths and bond energies (Figure 2-8) . The four covalent bonds that can form 
around a carbon atom, for example, are arranged as if pointing to the four cor- 
ners of a regular tetrahedron. The precise orientation of covalent bonds forms 
the basis for the three-dimensional geometry of organic molecules. 



Figure 2-6 Elements ordered by their 
atomic number form the periodic table. 

Elements fall into groups that show 
similar properties based on the number 
of electrons each element possesses in its 
outer shell. For example, Mg and Ca tend 
to give away the two electrons in their 
outer shells; C, N, and O complete their 
second shells by sharing electrons. The 
four elements highlighted in red 
constitute 99% of the total number of 
atoms present in the human body. An 
additional seven elements, highlighted in 
blue, together represent about 0.9% of 
the total. Other elements, shown in green, 
are required in trace amounts by humans. 
It remains unclear whether those 
elements shown in yellow are essential in 
humans or not. The chemistry of life, it 
seems, is therefore predominantly the 
chemistry of lighter elements. 

Atomic weights, given by the sum of 
the protons and neutrons in the atomic 
nucleus, will vary with the particular 
isotope of the element. The atomic 
weights shown here are those of the 
most common isotope of each element. 
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Figure 2-7 Some energies important for 
cells. Note that these energies are 
compared on a logarithmic scale, 
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water (H 2 0) 



Figure 2-8 The geometry of covalent 
bonds. (A) The spatial arrangement of the 
covalent bonds that can be formed by 
oxygen, nitrogen, and carbon. 
(B) Molecules formed from these atoms 
have a precise three-dimensional 
structure, as shown here by ball-and-stick 
models for water and propane. 
A structure can be specified by the bond 
angles and bond lengths for each 
covalent linkage. The atoms are colored 
according to the following, generally 
used convention: H, white; C, black; 
0, red; N, blue. 



(B) 



propane (CH 3 -CH 2 -CH 3 ) 



There Are Different Types of Covalent Bonds 

Most covalent bonds involve the sharing of two electrons, one donated by each 
participating atom; these are called single bonds. Some covalent bonds, however, 
involve the sharing of more than one pair of electrons. Four electrons can be 
shared, for example, two coming from each participating atom; such a bond is 
called a double bond. Double bonds are shorter and stronger than single bonds 
and have a characteristic effect on the three-dimensional geometry of molecules 
containing them. A single covalent bond between two atoms generally allows 
the rotation of one part of a molecule relative to the other around the bond axis. 
A double bond prevents such rotation, producing a more rigid and less flexible 
arrangement of atoms (Figure 2-9 and Panel 2-1, pp. 106-107). 

In some molecules, electrons are shared among three or more atoms, pro- 
ducing bonds that have a hybrid character intermediate between single and 
double bonds. The highly stable benzene molecule, for example, consists of a 
ring of six carbon atoms in which the bonding electrons are evenly distributed 
(although usually depicted as an alternating sequence of single and double 
bonds, as shown in Panel 2-1). 

When the atoms joined by a single covalent bond belong to different ele- 
ments, the two atoms usually attract the shared electrons to different degrees. 
Compared with a C atom, for example, O and N atoms attract electrons rela- 
tively strongly, whereas an H atom attracts electrons more weakly. By definition, 
a polar structure (in the electrical sense) is one with positive charge concentrated 
toward one end (the positive pole) and negative charge concentrated toward the 
other (the negative pole). Covalent bonds in which the electrons are shared 
unequally in this way are therefore known as polar covalent bonds (Figure 2-10). 
For example, the covalent bond between oxygen and hydrogen, -O-H, or 
between nitrogen and hydrogen, -N-H, is polar, whereas that between carbon 
and hydrogen, -C-H, has the electrons attracted much more equally by both 
atoms and is relatively nonpolar. 

Polar covalent bonds are extremely important in biology because they cre- 
ate permanent dipoles that allow molecules to interact through electrical forces. 
Any large molecule with many polar groups will have a pattern of partial positive 
and negative charges on its surface. When such a molecule encounters a second 
molecule with a complementary set of charges, the two molecules will be 
attracted to each other by electrostatic interactions that resemble (but are 
weaker than) the ionic bonds discussed previously. 




(A) ethane 




(B) ethene 

Figure 2-9 Carbon-carbon double bonds 
and single bonds compared. (A) The 

ethane molecule, with a single covalent 
bond between the two carbon atoms, 
illustrates the tetrahedral arrangement of 
single covalent bonds formed by carbon. 
One of the CH 3 groups joined by the 
covalent bond can rotate relative to the 
other around the bond axis. (B) The 
double bond between the two carbon 
atoms in a molecule of ethene (ethylene) 
alters the bond geometry of the carbon 
atoms and brings all the atoms into the 
same plane (blue); the double bond 
prevents the rotation of one CH 2 group 
relative to the other. 
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An Atom Often Behaves as if It Has a Fixed Radius 

When a covalent bond forms between two atoms, the sharing of electrons brings 
the nuclei of these atoms unusually close together. But most of the atoms that 
are rapidly jostling each other in cells are located in separate molecules. What 
happens when two such atoms touch? RoshanKetab 021-66950639 

For simplicity and clarity, atoms and molecules are usually represented 
schematically— either as a line drawing of the structural formula or as a ball- 
and-stick model. Space-filling models, however, give us a more accurate repre- 
sentation of molecular structure. In these models, a solid envelope represents 
the radius of the electron cloud at which strong repulsive forces prevent a closer 
approach of any second, non-bonded atom — the so-called van der Waals radius 
for an atom. This is possible because the amount of repulsion increases very 
steeply as two such atoms approach each other closely. At slightly greater dis- 
tances, any two atoms will experience a weak attractive force, known as a van der 
Waals attraction. As a result, there is a distance at which repulsive and attractive 
forces precisely balance to produce an energy minimum in each atom's interac- 
tion with an atom of a second, non-bonded element (Figure 2-11). 

Depending on the intended purpose, we shall represent small molecules as 
line drawings, ball-and-stick models, or space-filling models. For comparison, 
the water molecule is represented in all three ways in Figure 2-12. When repre- 
senting very large molecules, such as proteins, we shall often need to further 
simplify the model used (see, for example, Panel 3-2, pp. 132-133). 





tm 

water 






oxygen 

Figure 2-1 Polar and nonpolar 
covalent bonds. The electron 
distributions in the polar water molecule 
(H2O) and the nonpolar oxygen molecule 
(O2) are compared (8 + , partial positive 
charge; 8~ partial negative charge). 



Water Is the Most Abundant Substance in Cells 

Water accounts for about 70% of a cell's weight, and most intracellular reactions 
occur in an aqueous environment. Life on Earth began in the ocean, and the 
conditions in that primeval environment put a permanent stamp on the chem- 
istry of living things. Life therefore hinges on the properties of water. 

In each water molecule (H 2 0) the two H atoms are linked to the O atom by 
covalent bonds (see Figure 2-12). The two bonds are highly polar because the O 
is strongly attractive for electrons, whereas the H is only weakly attractive. Con- 
sequently there is an unequal distribution of electrons in a water molecule, with 
a preponderance of positive charge on the two H atoms and of negative charge 
on the O (see Figure 2-10). When a positively charged region of one water 
molecule (that is, one of its H atoms) approaches a negatively charged region 
(that is, the O) of a second water molecule, the electrical attraction between them 
can result in a weak bond called a hydrogen bond (see Figure 2-15). These bonds 
are much weaker than covalent bonds and are easily broken by the random ther- 
mal motions due to the heat energy of the molecules, so each bond lasts only a 
short time. But the combined effect of many weak bonds can be profound. Each 
water molecule can form hydrogen bonds through its two H atoms to two other 
water molecules, producing a network in which hydrogen bonds are being con- 
tinually broken and formed (Panel 2-2, pp. 108-109). It is only because of the 




van der Waals force equilibrium at this point 



Figure 2-1 1 The balance of van der 
Waals forces between two atoms. 

As the nuclei of two atoms approach 
each other, they initially show a weak 
bonding interaction due to their 
fluctuating electric charges. However, the 
same atoms will strongly repel each other 
if they are brought too close together. 
The balance of these van der Waals 
attractive and repulsive forces occurs at 
the indicated energy minimum. This 
minimum determines the contact 
distance between any two noncovalently 
bonded atoms; this distance is the sum of 
their van der Waals radii. By definition, 
zero energy (indicated by the dotted red 
line) is the energy when the two nuclei 
are at infinite separation. 
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hydrogen bonds that link water molecules together that water is a liquid at 
room temperature, with a high boiling point and high surface tension — rather 
than a gas. 

Molecules, such as alcohols, that contain polar bonds and that can form 
hydrogen bonds with water dissolve readily in water. Molecules carrying plus or 
minus charges (ions) likewise interact favorably with water. Such molecules are 
termed hydrophilic, meaning that they are water-loving. A large proportion of 
the molecules in the aqueous environment of a cell necessarily fall into this cat- 
egory, including sugars, DNA, RNA, and most proteins. Hydrophobic (water- 
hating) molecules, by contrast, are uncharged and form few or no hydrogen 
bonds, and so do not dissolve in water. Hydrocarbons are an important example 
(see Panel 2-1, pp. 106-107). In these molecules the H atoms are covalently 
linked to C atoms by a largely nonpolar bond. Because the H atoms have almost 
no net positive charge, they cannot form effective hydrogen bonds to other 
molecules. This makes the hydrocarbon as a whole hydrophobic — a property 
that is exploited in cells, whose membranes are constructed from molecules that 
have long hydrocarbon tails, as we shall see in Chapter 10. 



Figure 2-1 2 Three representations of a 
water molecule. (A) The usual line 
drawing of the structural formula, in 
which each atom is indicated by its 
standard symbol, and each line 
represents a covalent bond joining two 
atoms. (B) A ball-and-stick model, in 
which atoms are represented by spheres 
of arbitrary diameter, connected by sticks 
representing covalent bonds. Unlike (A), 
bond angles are accurately represented 
in this type of model (see also Figure 
2-8). (C) A space-filling model, in which 
both bond geometry and van der Waals 
radii are accurately represented. 



Some Polar Molecules Are Acids and Bases 

One of the simplest kinds of chemical reaction, and one that has profound sig- 
nificance in cells, takes place when a molecule containing a highly polar covalent 
bond between a hydrogen and a second atom dissolves in water. The hydrogen 
atom in such a molecule has largely given up its electron to the companion atom 
and so resembles an almost naked positively charged hydrogen nucleus — in 
other words, a proton (H + ) . When water molecules surround the polar molecule, 
the proton is attracted to the partial negative charge on the O atom of an adja- 
cent water molecule and can dissociate from its original partner to associate 
instead with the oxygen atoms of the water molecule to generate a hydronium 
ion (H30 + ) (Figure 2-1 3A). The reverse reaction also takes place very readily, so 
one has to imagine an equilibrium state in which billions of protons are con- 
stantly flitting to and fro from one molecule in the solution to another. 

The same type of reaction takes place in a solution of pure water itself. As 
illustrated in Figure 2-13B, water molecules are constantly exchanging protons 
with each other. As a result, pure water contains an equal, very low concentra- 
tion of H30 + and OH" ions, both being present at 10~ 7 M. (The concentration of 
H 2 in pure water is 55.5 M.) 

Substances that release protons to form H 3 + when they dissolve in water 
are termed acids. The higher the concentration of H 3 + , the more acidic the 
solution. As H 3 + rises, the concentration of OH" falls, according to the equilib- 
rium equation for water: [H 3 + ][OH _ ] = 1.0 x 10" 14 , where square brackets 
denote molar concentrations to be multiplied. By tradition, the H 3 + concen- 
tration is usually referred to as the H + concentration, even though nearly all H + 
in an aqueous solution is present as H 3 + . To avoid the use of unwieldy num- 
bers, the concentration of H + is expressed using a logarithmic scale called the 
pH scale, as illustrated in Panel 2-2 (pp. 108-109). Pure water has a pH of 7.0, 
and is neutral — that is, neither acidic (pH < 7.0) nor basic (pH > 7.0). 
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Because the proton of a hydronium ion can be passed readily to many types 
of molecules in cells, altering their character, the concentration of HsO + inside a 
cell (the acidity) must be closely regulated. The interior of a cell is kept close to 
neutrality, and it is buffered by the presence of many chemical groups that can 
take up and release protons near pH 7. 

The opposite of an acid is a base. Just as the defining property of an acid is 
that it donates protons to a water molecule so as to raise the concentration of 
H30 + ions, the defining property of a base is that it accepts protons so as to lower 
the concentration of H 3 + ions, and thereby raise the concentration of hydroxyl 
ions (OH - ). A base can either combine with protons directly or form hydroxyl 
ions that immediately combine with protons to produce H2O. Thus sodium 
hydroxide (NaOH) is basic (or alkaline) because it dissociates in aqueous solu- 
tion to form Na + ions and 0H~ ions. Other bases, especially important in living 
cells, contain NH2 groups. These groups directly take up a proton from water: 
-NH 2 + H 2 -» -NH 3 + + OH". 

All molecules that accept protons from water will do so most readily when 
the concentration of H30 + is high (acidic solutions). Likewise, molecules that 
can give up protons do so more readily if the concentration of Hs0 + in solution 
is low (basic solutions), and they will tend to receive them back if this concen- 
tration is high. 



Four Types of Noncovalent Attractions Help Bring Molecules 
Together in Cells 

In aqueous solutions, covalent bonds are 10-100 times stronger than the other 
attractive forces between atoms, allowing their connections to define the 
boundaries of one molecule from another. But much of biology depends on the 
specific binding of different molecules to each other. This binding is mediated 
by a group of noncovalent attractions that are individually quite weak, but 
whose energies can sum to create an effective force between two separate 
molecules. We have previously introduced three of these attractive forces: elec- 
trostatic attractions (ionic bonds), hydrogen bonds, and van der Waals attrac- 
tions. Table 2-1 compares the strengths of these three types of noncovalent 
bonds with that of a typical covalent bond, both in the presence and in the 
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Figure 2-13 Acids in water. (A) The 
reaction that takes place when a 
molecule of acetic acid dissolves in water. 
(B) Water molecules are continuously 
exchanging protons with each other to 
form hydronium and hydroxyl ions. These 
ions in turn rapidly recombine to form 
water molecules. 



Table 2-1 Covalent and Noncovalent Chemical Bonds 









STRENGTH 


(kcal/mole) 


BOND TYPE 




LENGTH <nm) 


IN VACUUM 


IN WATER 


Covalent 




0.15 


90 


90 


Noncovalent: 


ionic* 


0.25 


80 


3 




hydrogen 


0.30 


4 


1 




van der Waals attraction 


0.35 


0.1 


0.1 




(per atom) 









*An ionic bond is an electrostatic attraction between two fully charged atoms. 
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absence of water. Because of their fundamental importance in all biological sys- 
tems, we summarize their properties here: 

• Electrostatic attractions. These result from the attractive forces between 
oppositely charged atoms. Electrostatic attractions are quite strong in the 
absence of water. They readily form between permanent dipoles, but are 
greatest when the two atoms involved are fully charged (ionic bonds). How- 
ever, the polar water molecules cluster around both fully charged ions and 
polar molecules that contain permanent dipoles (Figure 2-14). This greatly 
reduces the attractiveness of these charged species for each other in most 
biological settings. 

• Hydrogen bonds. The structure of a typical hydrogen bond is illustrated in 
Figure 2-15. This bond represents a special form of polar interaction in 
which an electropositive hydrogen atom is partially shared by two elec- 
tronegative atoms. Its hydrogen can be viewed as a proton that has par- 
tially dissociated from a donor atom, allowing it to be shared by a second 
acceptor atom. Unlike a typical electrostatic interaction, this bond is 
highly directional — being strongest when a straight line can be drawn 
between all three of the involved atoms. As already discussed, water weak- 
ens these bonds by forming competing hydrogen-bond interactions with 
the involved molecules. 

• van der Waals attractions. The electron cloud around any nonpolar atom 
will fluctuate, producing a flickering dipole. Such dipoles will transiently 
induce an oppositely polarized flickering dipole in a nearby atom. This 
interaction generates a very weak attraction between atoms. But since 
many atoms can be simultaneously in contact when two surfaces fit 
closely the net result is often significant. Water does not weaken these so- 
called van der Waals attractions. 

The fourth effect that often brings molecules together in water is not, strictly 
speaking, a bond at all. However, a very important hydrophobic force is caused 
by a pushing of nonpolar surfaces out of the hydrogen-bonded water network, 
where they would otherwise physically interfere with the highly favorable inter- 
actions between water molecules. Bringing any two nonpolar surfaces together 
reduces their contact with water; in this sense, the force is nonspecific. Never- 
theless, we shall see in Chapter 3 that hydrophobic forces are central to the 
proper folding of protein molecules. 

Panel 2-3 provides an overview of the four types of attractions just 
described. And Figure 2-16 illustrates schematically how many such interac- 
tions can sum to hold together the matching surfaces of two macromolecules, 
even though each interaction by itself would be much too weak to be effective in 
the face of thermal motions. 



A Cell Is Formed from Carbon Compounds 

Having looked at the ways atoms combine into small molecules and how these 
molecules behave in an aqueous environment, we now examine the main 
classes of small molecules found in cells and their biological roles. We shall see 
that a few basic categories of molecules, formed from a handful of different ele- 
ments, give rise to all the extraordinary richness of form and behavior shown by 
living things. 

If we disregard water and inorganic ions such as potassium, nearly all the 
molecules in a cell are based on carbon. Carbon is outstanding among all the ele- 
ments in its ability to form large molecules; silicon is a poor second. Because it is 
small and has four electrons and four vacancies in its outermost shell, a carbon 
atom can form four covalent bonds with other atoms. Most important, one car- 
bon atom can join to other carbon atoms through highly stable covalent C-C 

Figure 2-1 5 Hydrogen bonds. (A) Ball-and-stick model of a typical 
hydrogen bond. The distance between the hydrogen and the oxygen atom 
here is less than the sum of their van der Waals radii, indicating a partial 
sharing of electrons. (B) The most common hydrogen bonds in cells. 





Figure 2-1 4 How the dipoles on water 
molecules orient to reduce the affinity 
of oppositely charged ions or polar 
groups for each other. 
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bonds to form chains and rings and hence generate large and complex molecules 
with no obvious upper limit to their size (see Panel 2-1, pp. 106-107). The small 
and large carbon compounds made by cells are called organic molecules. 

Certain combinations of atoms, such as the methyl (-CH 3 ), hydroxyl (-OH), 
carboxyl (-COOH), carbonyl (-00), phosphate (-PO3 2 -), sulfhydryl (-SH), and 
amino (-NH 2 ) groups, occur repeatedly in organic molecules. Each such chemi- 
cal group has distinct chemical and physical properties that influence the behav- 
ior of the molecule in which the group occurs. The most common chemical 
groups and some of their properties are summarized in Panel 2-1, pp. 106-107. 

Cells Contain Four Major Families of Small Organic Molecules 

The small organic molecules of the cell are carbon-based compounds that have 
molecular weights in the range 100-1000 and contain up to 30 or so carbon 
atoms. They are usually found free in solution and have many different fates. 
Some are used as monomer subunits to construct the giant polymeric macro- 
molecules — the proteins, nucleic acids, and large polysaccharides — of the cell. 
Others act as energy sources and are broken down and transformed into other 
small molecules in a maze of intracellular metabolic pathways. Many small 
molecules have more than one role in the cell — for example, acting both as a 
potential subunit for a macromolecule and as an energy source. Small organic 
molecules are much less abundant than the organic macromolecules, account- 
ing for only about one-tenth of the total mass of organic matter in a cell (Table 
2-2). As a rough guess, there may be a thousand different kinds of these small 
molecules in a typical cell. 

All organic molecules are synthesized from and are broken down into the 
same set of simple compounds. Both their synthesis and their breakdown occur 
through sequences of limited chemical changes that follow definite rules. As a 
consequence, the compounds in a cell are chemically related and most can be 
classified into a few distinct families. Broadly speaking, cells contain four major 
families of small organic molecules: the sugars, the fatty acids, the amino acids, 
and the nucleotides (Figure 2-17). Although many compounds present in cells 
do not fit into these categories, these four families of small organic molecules, 
together with the macromolecules made by linking them into long chains, 
account for a large fraction of cell mass (see Table 2-2). 





Figure 2-16 Schematic indicating how 
two macromolecules with 
complementary surfaces can bind 
tightly to one another through 
noncovalent interactions. 



Sugars Provide an Energy Source for Cells and Are the Subunits of 
Polysaccharides 

The simplest sugars — the monosaccharides — are compounds with the general 
formula (CH 2 0) n , where n is usually 3, 4, 5, 6, 7, or 8. Sugars, and the molecules 
made from them, are also called carbohydrates because of this simple formula. 
Glucose, for example, has the formula C 6 Hi 2 06 (Figure 2-18). The formula, how- 
ever, does not fully define the molecule: the same set of carbons, hydrogens, and 



Table 2-2 The Types of Molecules That Form a Bacterial Cell 
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Figure 2-1 7 The four main families of 
small organic molecules in cells. These 
small molecules form the monomeric 
building blocks, or subunits, for most of 
the macromolecules and other 
assemblies of the cell. Some, such as the 
sugars and the fatty acids, are also energy 
sources. 



oxygens can be joined together by covalent bonds in a variety of ways, creating 
structures with different shapes. As shown in Panel 2-4 (pp. 112-113), for exam- 
ple, glucose can be converted into a different sugar — mannose or galactose — 
simply by switching the orientations of specific OH groups relative to the rest of 
the molecule. Each of these sugars, moreover, can exist in either of two forms, 
called the D-form and the L-form, which are mirror images of each other. Sets of 
molecules with the same chemical formula but different structures are called 
isomers, and the subset of such molecules that are mirror-image pairs are called 
optical isomers. Isomers are widespread among organic molecules in general, 
and they play a major part in generating the enormous variety of sugars. 

Panel 2-4 presents an outline of sugar structure and chemistry. Sugars can 
exist as rings or as open chains. In their open-chain form, sugars contain a 
number of hydroxyl groups and either one aldehyde ( tt^ :::: C = 0) or one ketone 
(^C = 0) group. The aldehyde or ketone group plays a special role. First, it can 
react with a hydroxyl group in the same molecule to convert the molecule into 
a ring; in the ring form the carbon of the original aldehyde or ketone group can 
be recognized as the only one that is bonded to two oxygens. Second, once the 
ring is formed, this same carbon can become further linked, via oxygen, to one 
of the carbons bearing a hydroxyl group on another sugar molecule. This cre- 
ates a disaccharide such as sucrose, which is composed of a glucose and a fruc- 
tose unit. Larger sugar polymers range from the oligosaccharides (trisaccha- 
rides, tetrasaccharides, and so on) up to giant polysaccharides, which can con- 
tain thousands of monosaccharide units. 

The way that sugars are linked together to form polymers illustrates some 
common features of biochemical bond formation. A bond is formed between an 
-OH group on one sugar and an -OH group on another by a condensation reac- 
tion, in which a molecule of water is expelled as the bond is formed (Figure 
2-19). Subunits in other biological polymers, such as nucleic acids and proteins, 
are also linked by condensation reactions in which water is expelled. The bonds 
created by all of these condensation reactions can be broken by the reverse pro- 
cess of hydrolysis, in which a molecule of water is consumed (see Figure 2-19). 
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Figure 2-1 8 The structure of glucose, a 
simple sugar. As illustrated previously for 
water (see Figure 2-1 2), any molecule can 
be represented in several ways. In the 
structural formulas shown in (A), (B) and 
(C), the atoms are shown as chemical 
symbols linked together by lines 
representing the covalent bonds. The 
thickened lines here are used to indicate 
the plane of the sugar ring, in an attempt 
to emphasize that the -H and -OH 
groups are not in the same plane as the 
ring. (A) The open-chain form of this 
sugar, which is in equilibrium with the 
more stable cyclic or ring form in (B). 
(C) The chair form is an alternative way to 
draw the cyclic molecule that reflects the 
geometry more accurately than the 
structural formula in (B). (D) A space- 
filling model, which, as well as depicting 
the three-dimensional arrangement of 
the atoms, also uses the van der Waals 
radii to represent the surface contours of 
the molecule. (E) A ball-and-stick model 
in which the three-dimensional 
arrangement of the atoms in space is 
shown. (H, white; C, black; O, red; N, blue.) 
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Because each monosaccharide has several free hydroxyl groups that can 
form a link to another monosaccharide (or to some other compound), sugar 
polymers can be branched, and the number of possible polysaccharide struc- 
tures is extremely large. Even a simple disaccharide consisting of two glucose 
units can exist in eleven different varieties (Figure 2-20), while three different 
hexoses (C6H12O6) can join together to make several thousand trisaccharides. 
For this reason it is a much more complex task to determine the arrangement of 
sugars in a polysaccharide than to determine the nucleotide sequence of a DNA 
molecule, where each unit is joined to the next in exactly the same way 

The monosaccharide glucose is a key energy source for cells. In a series of 
reactions, it is broken down to smaller molecules, releasing energy that the cell 
can harness to do useful work, as we shall explain later. Cells use simple polysac- 
charides composed only of glucose units — principally glycogen in animals and 
starch in plants — as energy stores. 



Figure 2-1 9 The reaction of two 
monosaccharides to form a 
disaccharide. This reaction belongs to a 
general category of reactions termed 
condensation reactions, in which two 
molecules join together as a result of the 
loss of a water molecule. The reverse 
reaction (in which water is added) is 
termed hydroiysis. Note that the reactive 
carbon at which the new bond is formed 
(on the monosaccharide on the left here) 
is the carbon joined to two oxygens as a 
result of sugar ring formation (see Figure 
2-1 8). As indicated, this common type of 
covalent bond between two sugar 
molecules is known as a glycosidic bond 
(see also Figure 2-20). 
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Figure 2-20 Eleven disaccharides 
consisting of two D-glucose units. 

Although these differ only in the type of 
linkage between the two glucose units, 
they are chemically distinct. Since the 
oligosaccharides associated with proteins 
and lipids may have six or more different 
kinds of sugar joined in both linear and 
branched arrangements through 
glycosidic bonds such as those illustrated 
here, the number of distinct types of 
oligosaccharides that can be used in cells 
is extremely large. For an explanation 
of a and fi linkages, see Panel 2-4 
(pp. 112-11 3). Short black lines ending 
"blind" indicate OH positions. {Red lines 
merely indicate disaccharide bond 
orientations and "corners" do not imply 
extra atoms.) 
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Sugars do not function only in the production and storage of energy. They 
can also be used, for example, to make mechanical supports. Thus, the most 
abundant organic chemical on Earth — the cellulose of plant cell walls — is a 
polysaccharide of glucose. Because the glucose-glucose linkages in cellulose dif- 
fer from those in starch and glycogen, however, humans cannot digest cellulose 
and use its glucose. Another extraordinarily abundant organic substance, the 
chitin of insect exoskeletons and fungal cell walls, is also an indigestible polysac- 
charide — in this case a linear polymer of a sugar derivative called AT-acetylglu- 
cosamine (see Panel 2-4). Other polysaccharides are the main components of 
slime, mucus, and gristle. 

Smaller oligosaccharides can be covalently linked to proteins to form glyco- 
proteins and to lipids to form glycolipids y both of which are found in cell mem- 
branes. As described in Chapter 10, most cell surfaces are clothed and decorated 
with glycoproteins and glycolipids in the cell membrane. The sugar side chains 
on these molecules are often recognized selectively by other cells. And differ- 
ences between people in the details of their cell-surface sugars are the molecu- 
lar basis for the different major human blood groups, termed A, B, AB, and O. 



Fatty Acids Are Components of Cell Membranes, as Well as a 
Source of Energy 

A fatty acid molecule, such as palmitic acid, has two chemically distinct regions 
(Figure 2-21). One is a long hydrocarbon chain, which is hydrophobic and not 
very reactive chemically. The other is a carboxyl (-COOH) group, which behaves 
as an acid (carboxylic acid): it is ionized in solution (-COO"), extremely 
hydrophilic, and chemically reactive. Almost all the fatty acid molecules in a cell 
are covalently linked to other molecules by their carboxylic acid group. 

The hydrocarbon tail of palmitic acid is saturated: it has no double bonds 
between carbon atoms and contains the maximum possible number of hydro- 
gens. Stearic acid, another one of the common fatty acids in animal fat, is also 
saturated. Some other fatty acids, such as oleic acid, have unsaturated tails, with 
one or more double bonds along their length. The double bonds create kinks in 
the molecules, interfering with their ability to pack together in a solid mass. It is 
this that accounts for the difference between hard margarine (saturated) and 
liquid vegetable oils (polyunsaturated). The many different fatty acids found in 
cells differ only in the length of their hydrocarbon chains and the number and 
position of the carbon-carbon double bonds (see Panel 2-5, pp. 114-115). 

Fatty acids are stored in the cytoplasm of many cells in the form of droplets 
of triacylglycerol molecules, which consist of three fatty acid chains joined to a 
glycerol molecule (see Panel 2-5); these molecules are the animal fats found in 
meat, butter, and cream, and the plant oils such as corn oil and olive oil. When 
required to provide energy, the fatty acid chains are released from triacylglycerols 
and broken down into two-carbon units. These two-carbon units are identical to 
those derived from the breakdown of glucose and they enter the same energy- 
yielding reaction pathways, as will be described later in this chapter. Triglycerides 
serve as a concentrated food reserve in cells, because they can be broken down 
to produce about six times as much usable energy, weight for weight, as glucose. 

Fatty acids and their derivatives such as triacylglycerols are examples of 
lipids. Lipids comprise a loosely defined collection of biological molecules that 
are insoluble in water, while being soluble in fat and organic solvents such as 
benzene. They typically contain either long hydrocarbon chains, as in the fatty 
acids and isoprenes, or multiple linked rings, as in the steroids. 

The most important function of fatty acids in cells is in the construction of 
cell membranes. These thin sheets enclose all cells and surround their internal 
organelles. They are composed largely of phospholipids, which are small 
molecules that, like triacylglycerols, are constructed mainly from fatty acids and 
glycerol. In phospholipids the glycerol is joined to two fatty acid chains, however, 
rather than to three as in triacylglycerols. The "third" site on the glycerol is linked 
to a hydrophilic phosphate group, which is in turn attached to a small 
hydrophilic compound such as choline (see Panel 2-5). Each phospholipid 



hydrophilic carboxylic acid head — i 




(A) 



hydrophobic hydrocarbon tail ■ 
(B) (C) 



Figure 2-21 A fatty acid. A fatty acid is 
composed of a hydrophobic hydrocarbon 
chain to which is attached a hydrophilic 
carboxylic acid group. Palmitic acid is 
shown here. Different fatty acids have 
different hydrocarbon tails. (A) Structural 
formula. The carboxylic acid group is 
shown in its ionized form. (B) Ball-and- 
stick model. (C) Space-filling model. 
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molecule, therefore, has a hydrophobic tail composed of the two fatty acid 
chains and a hydrophilic head, where the phosphate is located. This gives them 
different physical and chemical properties from triacylglycerols, which are pre- 
dominantly hydrophobic. Molecules such as phospholipids, with both hydro- 
phobic and hydrophilic regions, are termed amphiphilic. 

The membrane-forming property of phospholipids results from their 
amphophilic nature. Phospholipids will spread over the surface of water to 
form a monolayer of phospholipid molecules, with the hydrophobic tails fac- 
ing the air and the hydrophilic heads in contact with the water. Two such 
molecular layers can readily combine tail-to-tail in water to make a phospho- 
lipid sandwich, or lipid bilayer. This bilayer is the structural basis of all cell 
membranes (Figure 2-22). 



Amino Acids Are the Subunits of Proteins 

Amino acids are a varied class of molecules with one defining property: they all 
possess a carboxylic acid group and an amino group, both linked to a single car- 
bon atom called the a-carbon (Figure 2-23). Their chemical variety comes from 
the side chain that is also attached to the a-carbon. The importance of amino 
acids to the cell comes from their role in making proteins, which are polymers 
of amino acids joined head-to-tail in a long chain that is then folded into a three- 
dimensional structure unique to each type of protein. The covalent linkage 
between two adjacent amino acids in a protein chain forms an amide (see Panel 
2-1), and it is called a peptide bond; the chain of amino acids is also known as a 
polypeptide (Figure 2-24). Regardless of the specific amino acids from which it 
is made, the polypeptide has an amino (NH2) group at one end (its N-terminus) 
and a carboxyl (COOH) group at its other end (its C- terminus). This gives it a def- 
inite directionality — a structural (as opposed to an electrical) polarity. 

Each of the 20 amino acids found commonly in proteins has a different side 
chain attached to the a-carbon atom (see Panel 3-1, pp. 128-129). All organisms, 



Figure 2-22 Phospholipid structure and 
the orientation of phospholipids in 
membranes. In an aqueous environment, 
the hydrophobic tails of phospholipids 
pack together to exclude water. Here 
they have formed a bilayer with the 
hydrophilic head of each phospholipid 
facing the water. Lipid bilayers are the 
basis for cell membranes, as discussed in 
detail in Chapter 10. 
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Figure 2-23 The amino acid alanine. 

(A) In the cell, where the pH is close to 7, 
the free amino acid exists in its ionized 
form; but when it is incorporated into a 
polypeptide chain, the charges on the 
amino and carboxyl groups disappear. 

(B) A ball-and-stick model and (C) a 
space-filling model of alanine (H, white; 
C, black; O, red; N, blue). 
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Figure 2-24 A small part of a protein molecule. The four amino acids 
shown are linked together by three peptide bonds, one of which is 
highlighted in yellow. One of the amino acids is shaded in gray. The amino 
acid side chains are shown in red. The two ends of a polypeptide chain are 
chemically distinct. One end, the N-terminus, terminates in an amino 
group, and the other, the C-terminus, in a carboxyl group. The sequence is 
always read from the N-terminal end; hence this sequence is 
Phe-Ser-Glu-Lys. 

whether bacteria, archea, plants, or animals, have proteins made of the same 20 
amino acids. How this precise set of 20 came to be chosen is one of the myster- 
ies of the evolution of life; there is no obvious chemical reason why other amino 
acids could not have served just as well. But once the choice was established, it 
could not be changed; too much depended on it. 

Like sugars, all amino acids, except glycine, exist as optical isomers in D- and 
L-forms (see Panel 3-1) . But only L-forms are ever found in proteins (although D- 
amino acids occur as part of bacterial cell walls and in some antibiotics). The 
origin of this exclusive use of L-amino acids to make proteins is another evolu- 
tionary mystery. 

The chemical versatility of the 20 amino acids is essential to the function of 
proteins. Five of the 20 amino acids have side chains that can form ions in neu- 
tral aqueous solution and thereby can carry a charge (Figure 2-25). The others 
are uncharged; some are polar and hydrophilic, and some are nonpolar and 
hydrophobic. As we discuss in Chapter 3, the properties of the amino acid side 
chains underlie the diverse and sophisticated functions of proteins. 
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Figure 2-25 The charge on amino acid 
side chains depends on the pH. The five 
different side chains that can carry a 
charge are shown. Carboxylic acids can 
readily lose H + in aqueous solution to 
form a negatively charged ion, which is 
denoted by the suffix "-ate," as in 
aspartate or glutamare. A comparable 
situation exists for amines, which in 
aqueous solution can take up H + to form 
a positively charged ion (which does not 
have a special name). These reactions are 
rapidly reversible, and the amounts of the 
two forms, charged and uncharged, 
depend on the pH of the solution. At a 
high pH, carboxylic acids tend to be 
charged and amines uncharged. At a low 
pH, the opposite is true— the carboxylic 
acids are uncharged and amines are 
charged. The pH at which exactly half of 
the carboxylic acid or amine residues are 
charged is known as the p/( of that amino 
acid side chain (indicated by ye//ow 
stripe). 

In the cell the pH is close to 7, and 
almost all carboxylic acids and amines are 
in their fully charged form. 
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Figure 2-26 Chemical structure of 
adenosine triphosphate (ATP). 

(A) Structural formula. (B) Space-filling 
model. In (B) the colors of the atoms are 
C, black; N, blue; H, white; O, red; and 
P, yellow. 



A nucleotide is a molecule made up of a nitrogen-containing ring compound 
linked to a five-carbon sugar, which in turn carries one or more phosphate 
groups (Panel 2-6, pp. 116-117). The five-carbon sugar can be either ribose or 
deoxyribose. Nucleotides containing ribose are known as ribonucleotides, and 
those containing deoxyribose as deoxyribonucleotides. The nitrogen-containing 
rings are generally referred to as bases for historical reasons: under acidic condi- 
tions they can each bind an H + (proton) and thereby increase the concentration 
of OH~ ions in aqueous solution. There is a strong family resemblance between 
the different bases. Cytosine (Q, thymine (T), and uracil (U) are called pyrim- 
idines because they all derive from a six-membered pyrimidine ring; guanine (G) 
and adenine (A) are purine compounds, and they have a second, five-membered 
ring fused to the six-membered ring. Each nucleotide is named for the base it 
contains (see Panel 2-6). 

Nucleotides can act as short-term carriers of chemical energy. Above all oth- 
ers, the ribonucleotide adenosine triphosphate, or ATP (Figure 2-26), transfers 
energy in hundreds of different cell reactions. ATP is formed through reactions 
that are driven by the energy released by the oxidative breakdown of foodstuffs. 
Its three phosphates are linked in series by two phosphoanhydride bonds, whose 
rupture releases large amounts of useful energy. The terminal phosphate group in 
particular is frequently split off by hydrolysis, often transferring a phosphate to 
other molecules and releasing energy that drives energy- requiring biosynthetic 
reactions (Figure 2-27). Other nucleotide derivatives are carriers for the transfer 
of other chemical groups, as will be described later. 

The most fundamental role of nucleotides in the cell, however, is in the stor- 
age and retrieval of biological information. Nucleotides serve as building blocks 
for the construction of nucleic acids— long polymers in which nucleotide subunits 
are covalently linked by the formation of a phosphodiester bond between the 
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Figure 2-27 The ATP molecule serves as 
an energy carrier in cells. The energy- 
requiring formation of ATP from ADP and 
inorganic phosphate is coupled to the 
energy-yielding oxidation of foodstuffs 
(in animal cells, fungi, and some bacteria) 
or to the capture of light energy (in plant 
cells and some bacteria). The hydrolysis 
of this ATP back to ADP and inorganic 
phosphate in turn provides the energy to 
drive many cell reactions. 
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Figure 2-28 A small part of one chain of a deoxyribonucleic acid (DNA) 
molecule. Four nucleotides are shown. One of the phosphodiester bonds 
that links adjacent nucleotide residues is highlighted in yellow, and one of 
the nucleotides is shaded in gray. Nucleotides are linked together by a 
phosphodiester linkage between specific carbon atoms of the ribose, 
known as the 5' and 3' atoms. For this reason, one end of a polynucleotide 
chain, the 5' end, will have a free phosphate group and the other, the 
3' end, a free hydroxyl group. The linear sequence of nucleotides in a 
polynucleotide chain is commonly abbreviated by a one-letter code, and 
the sequence is always read from the 5' end. In the example illustrated the 
sequence is G-A-T-C. 

phosphate group attached to the sugar of one nucleotide and a hydroxyl group on 
the sugar of the next nucleotide (Figure 2-28). Nucleic acid chains are synthesized 
from energy-rich nucleoside triphosphates by a condensation reaction that 
releases inorganic pyrophosphate during phosphodiester bond formation. 

There are two main types of nucleic acids, differing in the type of sugar in 
their sugar-phosphate backbone. Those based on the sugar ribose are known as 
ribonucleic acids, or RNA, and normally contain the bases A, G, C, and U. Those 
based on deoxyribose (in which the hydroxyl at the 2' position of the ribose car- 
bon ring is replaced by a hydrogen are known as deoxyribonucleic acids, or 
DNA, and contain the bases A, G, C, and T (T is chemically similar to the U in 
RNA, merely adding the methyl group on the pyrimidine ring; see Panel 2-6). 
RNA usually occurs in cells as a single polynucleotide chain, but DNA is virtually 
always a double-stranded molecule — a DNA double helix composed of two 
polynucleotide chains running antiparallel to each other and held together by 
hydrogen-bonding between the bases of the two chains. 

The linear sequence of nucleotides in a DNA or an RNA encodes the genetic 
information of the cell. The ability of the bases in different nucleic acid 
molecules to recognize and pair with each other by hydrogen-bonding (called 
base-pairing)— -G with C, and A with either T or U— underlies all of heredity and 
evolution, as explained in Chapter 4. 



The Chemistry of Cells Is Dominated by Macromolecules 
with Remarkable Properties 

By weight, macromolecules are the most abundant carbon-containing 
molecules in a living cell (Figure 2-29 and Table 2-3). They are the principal 
building blocks from which a cell is constructed and also the components that 
confer the most distinctive properties of living things. The macromolecules in 
cells are polymers that are constructed by covalently linking small organic 
molecules (called monomers) into long chains (Figure 2-30). Yet they have 
remarkable properties that could not have been predicted from their simple 
constituents. 

Proteins are especially abundant and versatile. They perform thousands of 
distinct functions in cells. Many proteins serve as enzymes, the catalysts that 
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Figure 2-29 Macromolecules are 
abundant in cells. The approximate 
composition of a bacterial cell is shown 
by weight. The composition of an animal 
cell is similar (see Table 2-3). 
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Table 2-3 Approximate Chemical Compositions of a Typical Bacterium and a 
Typical Mammalian Cell 





PERCENT OF TOTAL CELL WEIGHT 
E COU BACTERIUM MAMMALIAN CELL 


COMPONENT 


H 2 


70 70 


Inorganic ions (Na + , K + , Mg 2+ , 


1 1 


Ca 2+ , CI" etc.) 




Miscellaneous small metabolites 


3 3 


Proteins 


15 18 


RNA 


6 1.1 


DNA 


1 0.25 


Phospholipids 


2 3 


Other lipids 


2 


Polysaccharides 


2 2 


Total cell volume 


2x10- 12 cm 3 4x10~ 9 cm 3 


Relative cell volume 


1 2000 



Proteins, polysaccharides, DNA, and RNA are macromolecules Lipids are not generally classed as 
macromolecules even though they share some of their features; for example, most are synthesized 
as linear polymers of a smaller molecule (the acetyl group on acetyl CoA), and they self-assemble 
into I,"! 'oer structures (membranes), Note that water and protein comprise most -of the mass of 

both mammalian and bacterial cells 



direct the many covalent bond-making and bond-breaking reactions that the 
ceil needs. Enzymes catalyze all of the reactions whereby cells extract energy 
from food molecules, for example, and an enzyme called ribulose bisphosphate 
carboxylase helps to convert C0 2 to sugars in photosynthetic organisms, pro- 
ducing most of the organic matter needed for life on Earth. Other proteins are 
used to build structural components, such as tubulin, a protein that self-assem- 
bles to make the cell's long microtubules, or histones, proteins that compact the 
DNA in chromosomes. Yet other proteins act as molecular motors to produce 
force and movement, as in the case of myosin in muscle. Proteins perform many 
other functions, and we shall examine the molecular basis for many of them 
later in this book. Here we identify some general principles of macromolecular 
chemistry that make such functions possible. 

Although the chemical reactions for adding subunits to each polymer are 
different in detail for proteins, nucleic acids, and polysaccharides, they share 
important features. Each polymer grows by the addition of a monomer onto 
the end of a growing polymer chain in a condensation reaction, in which a 
molecule of water is lost with each sub unit added (see Figure 2-19). The stepwise 
polymerization of monomers into a long chain is a simple way to manufacture a 
large, complex molecule, since die subunits are added by the same reaction per- 
formed over and over again by die same set of enzymes. In a sense, the process 
resembles the repetitive operation of a machine in a factory— except in one cru- 
cial respect. Apart from some of the polysaccharides, most macromolecules are 
made from a set of monomers that are slightly different from one another— for 
example, the 20 different amino acids from which proteins are made. It is criti- 
cal to life that the polymer chain is not assembled at random from these sub- 
units; instead the subunits are added in a particular order, or sequence. The 
elaborate mechanisms that allow this to be accomplished by enzymes are 
described in detail in Chapters 5 and 6. 



Noncovalent Bonds Specify Both the Precise Shape of a 
Macromolecule and its Binding to Other Molecules 

Most of the covalent bonds in a macromolecule allow rotation of the atoms they 
join, giving the polymer chain great flexibility. In principle, this allows a macro- 
molecule to adopt an almost unlimited number of shapes, or conformations, as 
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Figure 2-30 Three families of 
macromolecules. Each is a polymer 
formed from small molecules (called 
monomers) linked together by 
covalent bonds. 



64 Chapter 2: Cell Chemistry and Biosynthesis 




/ 






$ 



* 



Figure 2-31 Most proteins and many 
RNA molecules fold into only one stable 
conformation. If the noncovalent bonds 
maintaining this stable conformation are 
disrupted, the molecule becomes a 
flexible chain that usually has no 
biological value. 



many unstable 
conformations 



one stable folded 
conformation 



random thermal energy causes the polymer chain to writhe and rotate. However, 
the shapes of most biological macromolecules are highly constrained because of 
the many weak noncovalent bonds that form between different parts of the same 
molecule. If these noncovalent bonds are formed in sufficient numbers, the 
polymer chain can strongly prefer one particular conformation, determined by 
the linear sequence of monomers in its chain. Most protein molecules and many 
of the small RNA molecules found in cells fold tightly into one highly preferred 
conformation in this way (Figure 2-31). 

The four types of noncovalent interactions important in biological 
molecules were described earlier, and they are reviewed in Panel 2-3 (pp. 
110-111). Although individually very weak, these interactions cooperate to fold 
biological macromolecules into unique shapes. In addition, they can also add up 
to create a strong attraction between two different molecules when these 
molecules fit together very closely, like a hand in a glove. This form of molecular 
interaction provides for great specificity, inasmuch as the multipoint contacts 
required for strong binding make it possible for a macromolecule to select out — 
through binding— just one of the many thousands of other types of molecules 
present inside a cell. Moreover, because the strength of the binding depends on 
the number of noncovalent bonds that are formed, interactions of almost any 
affinity are possible — allowing rapid dissociation when necessary. 

Binding of this type underlies all biological catalysis, making it possible for 
proteins to function as enzymes. Noncovalent interactions also allow macro- 
molecules to be used as building blocks for the formation of larger structures. In 
cells, macromolecules often bind together into large complexes, thereby form- 
ing intricate machines with multiple moving parts that perform such complex 
tasks as DNA replication and protein synthesis (Figure 2-32). 



SUBUNITS 



covalent bonds 



MACROMOLECULES 



e g , sugars, amino acids, 
and nucleotides 



'^1 



e.g., globular proteins 
and RNA 



noncovalent bonds 



MACROMOLECULAR 
ASSEMBLIES 







30 nm 
e.g., ribosome 



Figure 2-32 Small molecules, proteins, and a ribosome drawn approximately to scale. Ribosomes are a central part of the machinery that the 
cell uses to make proteins: each ribosome is formed as a complex of about 90 macromolecules (protein and RNA molecules). 
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Summary 

Living organisms are autonomous, self- propagating chemical systems. They are made 
from a distinctive and restricted set of small carbon-based molecules that are essen- 
tially the same for every living species. Each of these molecules is composed of a small 
set of atoms linked to each other in a precise configuration through covalent bonds. 
The main categories are sugars, fatty acids, amino acids, and nucleotides. Sugars are a 
primary source of chemical energy for cells and can be incorporated into polysaccha- 
rides for energy storage. Fatty acids are also important for energy storage, but their 
most critical function is in the formation of cell membranes. Polymers consisting of 
amino acids constitute the remarkably diverse and versatile macromolecules known 
as proteins. Nucleotides play a central part in energy transfer. They are also the sub- 
units for the informational macromolecules, RNA and DNA. 

Most of the dry mass of a cell consists of macromolecules that have been produced 
as linear polymers of amino acids (proteins) or nucleotides (DNA and RNA), covalently 
linked to each other in an exact order. Most of the protein molecules and many of the 
RNAs fold into a unique conformation that depends on their sequence of subunits. 
This folding process creates unique surfaces, and it depends on a large set of weak 
attractions produced by noncovalent forces between atoms. These forces are of four 
types: electrostatic attractions, hydrogen bonds, van der Waals attractions, and an 
interaction between nonpolar groups caused by their hydrophobic expulsion from 
water. The same set of weak forces governs the specific binding of other molecules to 
macromolecules, making possible the myriad associations between biological 
molecules that produce the structure and the chemistry of a cell 
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One property of living things above all makes them seem almost miraculously 
different from nonliving matter: they create and maintain order, in a universe 
that is tending always to greater disorder (Figure 2-33). To create this order, the 
cells in a living organism must perform a never-ending stream of chemical reac- 
tions. In some of these reactions, small organic molecules — amino acids, sugars, 
nucleotides, and lipids — are being taken apart or modified to supply the many 
other small molecules that the cell requires. In other reactions, these small 
molecules are being used to construct an enormously diverse range of proteins, 
nucleic acids, and other macromolecules that endow living systems with all of 
their most distinctive properties. Each cell can be viewed as a tiny chemical fac- 
tory, performing many millions of reactions every second. 




Figure 2-33 Order in biological structures. Well-defined, ornate, and beautiful spatial patterns can be 
found at every level of organization in living organisms. In order of increasing size: (A) protein molecules in 
the coat of a virus; (B) the regular array of microtubules seen in a cross section of a sperm tail; (C) surface 
contours of a pollen grain (a single cell); (D) close-up of the wing of a butterfly showing the pattern created 
by scales, each scale being the product of a single cell; (E) spiral array of seeds, made of millions of cells, in 
the head of a sunflower. (A, courtesy of R.A. Grant and J.M. Hogle; B, courtesy of Lewis Tilney; C, courtesy of 
Colin MacFarlane and Chris Jeffree; D and E, courtesy of Kjell B. Sandved.) 



66 Chapter 2: Cell Chemistry and Biosynthesis 



molecule 



molecule 



molecule 



molecule 



molecule 



molecule 



catalysis by 
enzyme 1 



catalysis by 
enzyme 2 



catalysis by 
enzyme 3 



catalysis by 
enzyme 4 



catalysis by 
enzyme 5 




Figure 2-34 How a set of enzyme-catalyzed reactions generates a metabolic pathway. Each enzyme 
catalyzes a particular chemical reaction, leaving the enzyme unchanged. In this example, a set of enzymes 
acting in series converts molecule A to molecule F, forming a metabolic pathway. 



Cell Metabolism Is Organized by Enzymes 

The chemical reactions that a cell carries out would normally occur only at 
much higher temperatures than those existing inside cells. For this reason, each 
reaction requires a specific boost in chemical reactivity. This requirement is cru- 
cial, because it allows the cell to control each reaction. The control is exerted 
through the specialized proteins called enzymes, each of which accelerates, or 
catalyzes, just one of the many possible kinds of reactions that a particular 
molecule might undergo. Enzyme -catalyzed reactions are usually connected in 
series, so that the product of one reaction becomes the starting material, or sub- 
strate, for the next (Figure 2-34). These long linear reaction pathways are in turn 
linked to one another, forming a maze of interconnected reactions that enable 
the cell to survive, grow, and reproduce (Figure 2-35). 

Two opposing streams of chemical reactions occur in cells: (1) the catabolic 
pathways break down foodstuffs into smaller molecules, thereby generating 
both a useful form of energy for the cell and some of the small molecules that the 
cell needs as building blocks, and (2) the anabolic, or biosynthetic, pathways use 
the energy harnessed by catabolism to drive the synthesis of the many other 
molecules that form the cell. Together these two sets of reactions constitute the 
metabolism of the cell (Figure 2-36). 

Many of the details of cell metabolism form the traditional subject of bio- 
chemistry and need not concern us here. But the general principles by which 
cells obtain energy from their environment and use it to create order are central 
to cell biology. We begin with a discussion of why a constant input of energy is 
needed to sustain living organisms. 



Biological Order Is Made Possible by the Release of Heat Energy 
from Cells 

The universal tendency of things to become disordered is a fundamental law of 
physics — the second law of thermodynamics — which states that in the universe, 
or in any isolated system (a collection of matter that is completely isolated from 
the rest of the universe), the degree of disorder only increases. This law has such 
profound implications for all living things that we restate it in several ways. 

For example, we can present the second law in terms of probability and state 
that systems will change spontaneously toward those arrangements that have 
the greatest probability. If we consider, for example, a box of 100 coins all lying 
heads up, a series of accidents that disturbs the box will tend to move the 
arrangement toward a mixture of 50 heads and 50 tails. The reason is simple: 
there is a huge number of possible arrangements of the individual coins in the 
mixture that can achieve the 50-50 result, but only one possible arrangement 
that keeps all of the coins oriented heads up. Because the 50-50 mixture is there- 
fore the most probable, we say that it is more "disordered." For the same reason, 



Figure 2-35 Some of the metabolic pathways and their interconnections 
in a typical cell. About 500 common metabolic reactions are shown 
diagrammatically, with each molecule in a metabolic pathway represented 
by a filled circle, as in the yellow box in Figure 2-34. The pathway that is 
highlighted in this diagram with larger circles and connecting lines is the 
central pathway of sugar metabolism, which will be discussed shortly. 
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Figure 2-36 Schematic representation of the relationship between 
catabolic and anabolic pathways in metabolism. As suggested here, since 
a major portion of the energy stored in the chemical bonds of food 
molecules is dissipated as heat, the mass of food required by any organism 
that derives all of its energy from catabolism is much greater than the mass 
of the molecules that can be produced by anabolism. 



it is a common experience that one's living space will become increasingly dis- 
ordered without intentional effort: the movement toward disorder is a sponta- 
neous process, requiring a periodic effort to reverse it (Figure 2-37). 

The amount of disorder in a system can be quantified and expressed as the 
entropy of the system: the greater the disorder, the greater the entropy. Thus, 
another way to express the second law of thermodynamics is to say that systems 
will change spontaneously toward arrangements with greater entropy. 

Living cells — by surviving, growing, and forming complex organisms — are 
generating order and thus might appear to defy the second law of thermody- 
namics. How is this possible? The answer is that a cell is not an isolated system: 
it takes in energy from its environment in the form of food, or as photons from 
the sun (or even, as in some chemosynthetic bacteria, from inorganic molecules 
alone), and it then uses this energy to generate order within itself. In the course 
of the chemical reactions that generate order, the cell converts part of the energy 
it uses into heat. The heat is discharged into the cell's environment and disorders 
it, so that the total entropy — that of the cell plus its surroundings — increases, as 
demanded by the laws of thermodynamics. 

To understand the principles governing these energy conversions, think of 
a cell surrounded by a sea of matter representing the rest of the universe. As the 
cell lives and grows, it creates internal order. But it constantly releases heat 
energy as it synthesizes molecules and assembles them into cell structures. 
Heat is energy in its most disordered form — the random jostling of molecules. 
When the cell releases heat to the sea, it increases the intensity of molecular 
motions there (thermal motion) — thereby increasing the randomness, or disor- 
der, of the sea. The second law of thermodynamics is satisfied because the 
increase in the amount of order inside the cell is more than compensated for by 
an even greater decrease in order (increase in entropy) in the surrounding sea 
of matter (Figure 2-38). 

Where does the heat that the cell releases come from? Here we encounter 
another important law of thermodynamics. The first law of thermodynamics 
states that energy can be converted from one form to another, but that it cannot 
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Figure 2-37 An everyday illustration of 
the spontaneous drive toward disorder. 

Reversing this tendency toward disorder 
requires an intentional effort and an 
input of energy: it is not spontaneous. In 
fact, from the second law of 
thermodynamics, we can be certain that 
the human intervention required will 
release enough heat to the environment 
to more than compensate for the 
reordering of the items in this room. 



68 Chapter 2: Cell Chemistry and Biosynthesis 



sea of matter 


cell 










/ 


i ? 


<{• 


1 -• 


• \ 


** / 




s 


\ 






• 


• 

t 






• 


s 


V»-«/ 




N 


m 






v » 


• 


• • 


^ 




\ 


\ / 


4 


fe 






•- 






increased disorder increased order 



Figure 2-38 A simple thermodynamic analysis of a living cell. In the diagram on the left the 
molecules of both the cell and the rest of the universe (the sea of matter) are depicted in a 
relatively disordered state. In the diagram on the right the cell has taken in energy from food 
molecules and released heat by a reaction that orders the molecules the cell contains. Because 
the heat increases the disorder in the environment around the cell (depicted by the jagged 
arrows and distorted molecules, indicating the increased molecular motions caused by heat), 
the second law of thermodynamics — which states that the amount of disorder in the universe 
must always increase — is satisfied as the cell grows and divides. For a detailed discussion, see 
Panel 2-7 (pp. 118-119). 



be created or destroyed. Figure 2-39 illustrates some interconversions between 
different forms of energy. The amount of energy in different forms will change as 
a result of the chemical reactions inside the cell, but the first law tells us that the 
total amount of energy must always be the same. For example, an animal cell 
takes in foodstuffs and converts some of the energy present in the chemical 
bonds between the atoms of these food molecules (chemical bond energy) into 
the random thermal motion of molecules (heat energy). As described above, this 
conversion of chemical energy into heat energy is essential if the reactions that 
create order inside the cell are to cause the universe as a whole to become more 
disordered. 

The cell cannot derive any benefit from the heat energy it releases unless the 
heat- generating reactions inside the cell are directly linked to the processes that 
generate molecular order. It is the tight coupling of heat production to an 
increase in order that distinguishes the metabolism of a cell from the wasteful 
burning of fuel in a fire. Later, we shall illustrate how this coupling occurs. For 
now, it is sufficient to recognize that a direct linkage of the "burning" of food 
molecules to the generation of biological order is required for cells to create and 
maintain an island of order in a universe tending toward chaos. 



Photosynthetic Organisms Use Sunlight to Synthesize 
Organic Molecules 

All animals live on energy stored in the chemical bonds of organic molecules 
made by other organisms, which they take in as food. The molecules in food also 
provide the atoms that animals need to construct new living matter. Some ani- 
mals obtain their food by eating other animals. But at the bottom of the animal 
food chain are animals that eat plants. The plants, in turn, trap energy directly 
from sunlight. As a result, the sun is the ultimate source of the energy used by 
animal cells. 

Solar energy enters the living world through photosynthesis in plants and 
photosynthetic bacteria. Photosynthesis converts the electromagnetic energy 
in sunlight into chemical bond energy in the cell. Plants obtain all the atoms 
they need from inorganic sources: carbon from atmospheric carbon dioxide, 
hydrogen and oxygen from water, nitrogen from ammonia and nitrates in the 
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Figure 2-39 Some interconversions 
between different forms of energy. 

All energy forms are, in principle, 
interconvertible. In all these processes 
the total amount of energy is conserved. 
Thus, for example, from the height and 
weight of the brick in (1), we can predict 
exactly how much heat will be released 
when it hits the floor. In (2), note that the 
large amount of chemical bond energy 
released when water is formed is initially 
converted to very rapid thermal motions 
in the two new water molecules; but 
collisions with other molecules almost 
instantaneously spread this kinetic 
energy evenly throughout the 
surroundings (heat transfer), making the 
new molecules indistinguishable from all 
the rest. 
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soil, and other elements needed in smaller amounts from inorganic salts in the 
soil. They use the energy they derive from sunlight to build these atoms into 
sugars, amino acids, nucleotides, and fatty acids. These small molecules in 
turn are converted into the proteins, nucleic acids, polysaccharides, and lipids 
that form the plant. All of these substances serve as food molecules for ani- 
mals, if the plants are later eaten. 

The reactions of photosynthesis take place in two stages (Figure 2-40). In 
the first stage, energy from sunlight is captured and transiently stored as chem- 
ical bond energy in specialized small molecules that act as carriers of energy and 
reactive chemical groups. (We discuss these "activated carrier" molecules later.) 
Molecular oxygen (O2 gas) derived from the splitting of water by light is released 
as a waste product of this first stage. 

In the second stage, the molecules that serve as energy carriers are used to 
help drive a carbon fixation process in which sugars are manufactured from car- 
bon dioxide gas (CO2) and water (H 2 0), thereby providing a useful source of 
stored chemical bond energy and materials— both for the plant itself and for any 
animals that eat it. We describe the elegant mechanisms that underlie these two 
stages of photosynthesis in Chapter 14. 
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Figure 2-40 Photosynthesis. The two stages of photosynthesis. The energy carriers created in the first 
stage are two molecules that we discuss shortly — ATP and NADPH. 



The net result of the entire process of photosynthesis, so far as the green 
plant is concerned, can be summarized simply in the equation 

light energy + CO2 + H2O — > sugars + O2 + heat energy 

The sugars produced are then used both as a source of chemical bond energy 
and as a source of materials to make the many other small and large organic 
molecules that are essential to the plant cell. 



Cells Obtain Energy by the Oxidation of Organic Molecules 

All animal and plant cells are powered by energy stored in the chemical bonds of 
organic molecules, whether they are sugars that a plant has photosynthesized as 
food for itself or the mixture of large and small molecules that an animal has 
eaten. Organisms must extract this energy in usable form to live, grow, and 
reproduce. In both plants and animals, energy is extracted from food molecules 
by a process of gradual oxidation, or controlled burning. 

The Earth's atmosphere contains a great deal of oxygen, and in the presence 
of oxygen the most energetically stable form of carbon is CO2 and that of hydro- 
gen is H 2 0. A cell is therefore able to obtain energy from sugars or other organic 
molecules by allowing their carbon and hydrogen atoms to combine with oxygen 
to produce C0 2 and H 2 0, respectively — a process called respiration. 

Photosynthesis and respiration are complementary processes (Figure 2-41). 
This means that the transactions between plants and animals are not all one 
way. Plants, animals, and microorganisms have existed together on this planet 
for so long that many of them have become an essential part of the others' envi- 
ronments. The oxygen released by photosynthesis is consumed in the combus- 
tion of organic molecules by nearly all organisms. And some of the C0 2 
molecules that are fixed today into organic molecules by photosynthesis in a 
green leaf were yesterday released into the atmosphere by the respiration of an 
animal — or by that of a fungus or bacterium decomposing dead organic matter. 
We therefore see that carbon utilization forms a huge cycle that involves the bio- 
sphere (all of the living organisms on Earth) as a whole, crossing boundaries 
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Figure 2-41 Photosynthesis and 
respiration as complementary processes 
in the living world. Photosynthesis uses 
the energy of sunlight to produce sugars 
and other organic molecules. These 
molecules in turn serve as food for other 
organisms. Many of these organisms 
carry out respiration, a process that uses 
2 to form C0 2 from the same carbon 
atoms that had been taken up as CO2 and 
converted into sugars by photosynthesis. 
In the process, the organisms that respire 
obtain the chemical bond energy that 
they need to survive. The first cells on the 
Earth are thought to have been capable 
of neither photosynthesis nor respiration 
(discussed in Chapter 14). However, 
photosynthesis must have preceded 
respiration on the Earth, since there is 
strong evidence that billions of years of 
photosynthesis were required before O2 
had been released in sufficient quantity 
to create an atmosphere rich in this gas. 
(The Earth's atmosphere currently 
contains 20% 2 .) 
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Figure 2-42 The carbon cycle. Individual 
carbon atoms are incorporated into 
organic molecules of the living world by 
the photosynthetic activity of bacteria 
and plants (including algae). They pass to 
animals, microorganisms, and organic 
material in soil and oceans in cyclic paths. 
C0 2 is restored to the atmosphere when 
organic molecules are oxidized by cells or 
burned by humans as fuels. 



between individual organisms (Figure 2-42). Similarly, atoms of nitrogen, phos- 
phorus, and sulfur move between the living and nonliving worlds in cycles that 
involve plants, animals, fungi, and bacteria. 



Oxidation and Reduction Involve Electron Transfers 

The cell does not oxidize organic molecules in one step, as occurs when organic 
material is burned in a fire. Through the use of enzyme catalysts, metabolism 
takes the molecules through a large number of reactions that only rarely involve 
the direct addition of oxygen. Before we consider some of these reactions and 
their purpose, we discuss what is meant by the process of oxidation. 

Oxidation does not mean only the addition of oxygen atoms; rather, it 
applies more generally to any reaction in which electrons are transferred from 
one atom to another. Oxidation in this sense refers to the removal of electrons, 
and reduction — the converse of oxidation — means the addition of electrons. 
Thus, Fe 2+ is oxidized if it loses an electron to become Fe 3+ , and a chlorine atom 
is reduced if it gains an electron to become CI". Since the number of electrons is 
conserved (no loss or gain) in a chemical reaction, oxidation and reduction 
always occur simultaneously: that is, if one molecule gains an electron in a reac- 
tion (reduction) , a second molecule loses the electron (oxidation) . When a sugar 
molecule is oxidized to CO2 and H2O, for example, the O2 molecules involved in 
forming H2O gain electrons and thus are said to have been reduced. 

The terms "oxidation" and "reduction" apply even when there is only a par- 
tial shift of electrons between atoms linked by a covalent bond (Figure 2-43). 
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Figure 2-43 Oxidation and reduction. (A) When two atoms form a polar covalent bond 
(see p. 50), the atom ending up with a greater share of electrons is said to be reduced, 
while the other atom acquires a lesser share of electrons and is said to be oxidized. The 
reduced atom has acquired a partial negative charge (8") as the positive charge on the 
atomic nucleus is now more than equaled by the total charge of the electrons 
surrounding it, and conversely, the oxidized atom has acquired a partial positive charge 
(5 + ). (B) The single carbon atom of methane can be converted to that of carbon dioxide 
by the successive replacement of its covalently bonded hydrogen atoms with oxygen 
atoms. With each step, electrons are shifted away from the carbon (as indicated by the 
blue shading), and the carbon atom becomes progressively more oxidized. Each of 
these steps is energetically favorable under the conditions present inside a cell. 
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When a carbon atom becomes covalently bonded to an atom with a strong affin- 
ity for electrons, such as oxygen, chlorine, or sulfur, for example, it gives up more 
than its equal share of electrons and forms a polar covalent bond: the positive 
charge of the carbon nucleus is now somewhat greater than the negative charge 
of its electrons, and the atom therefore acquires a partial positive charge and is 
said to be oxidized. Conversely, a carbon atom in a C-H linkage has slightly more 
than its share of electrons, and so it is said to be reduced (see Figure 2-43). 

When a molecule in a cell picks up an electron {e~) t it often picks up a pro- 
ton (H + ) at the same time (protons being freely available in water). The net effect 
in this case is to add a hydrogen atom to the molecule 

Even though a proton plus an electron is involved (instead of just an electron), 
such hydrogenation reactions are reductions, and the reverse, dehydrogenation 
reactions, are oxidations. It is especially easy to tell whether an organic molecule 
is being oxidized or reduced: reduction is occurring if its number of C-H bonds 
increases, whereas oxidation is occurring if its number of C-H bonds decreases 
(see Figure 2-43B). 

Cells use enzymes to catalyze the oxidation of organic molecules in small 
steps, through a sequence of reactions that allows useful energy to be harvested. 
We now need to explain how enzymes work and some of the constraints under 
which they operate. 



Enzymes Lower the Barriers That Block Chemical Reactions 

Consider the reaction 

paper + O2 -^ smoke + ashes + heat + CO2 + H2O 

The paper burns readily, releasing to the atmosphere both energy as heat and 
water and carbon dioxide as gases, but the smoke and ashes never spontaneously 
retrieve these entities from the heated atmosphere and reconstitute themselves 
into paper. When the paper burns, its chemical energy is dissipated as heat— not 
lost from the universe, since energy can never be created or destroyed, but irre- 
trievably dispersed in the chaotic random thermal motions of molecules. At the 
same time, the atoms and molecules of the paper become dispersed and disor- 
dered. In the language of thermodynamics, there has been a loss of free energy, 
that is, of energy that can be harnessed to do work or drive chemical reactions. 
This loss reflects a loss of orderliness in the way the energy and molecules were 
stored in the paper. We shall discuss free energy in more detail shortly, but the 
general principle is clear enough intuitively: chemical reactions proceed sponta- 
neously only in the direction that leads to a loss of free energy; in other words, the 
spontaneous direction for any reaction is the direction that goes "downhill." A 
"downhill" reaction in this sense is often said to be energetically favorable. 

Although the most energetically favorable form of carbon under ordinary 
conditions is CO2, and that of hydrogen is H2O, a living organism does not dis- 
appear in a puff of smoke, and the book in your hands does not burst into 
flames. This is because the molecules both in the living organism and in the 
book are in a relatively stable state, and they cannot be changed to a state of 
lower energy without an input of energy: in other words, a molecule requires 
activation energy— a kick over an energy barrier — before it can undergo a 
chemical reaction that leaves it in a more stable state (Figure 2-44). In the case 
of a burning book, the activation energy is provided by the heat of a lighted 
match. For the molecules in the watery solution inside a cell, the kick is deliv- 
ered by an unusually energetic random collision with surrounding molecules — 
collisions that become more violent as the temperature is raised. 

In a living cell, the kick over the energy barrier is greatly aided by a special- 
ized class of proteins — the enzymes. Each enzyme binds tightly to one or more 
molecules, called substrates, and holds them in a way that greatly reduces the 
activation energy of a particular chemical reaction that the bound substrates 
can undergo. A substance that can lower the activation energy of a reaction is 
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Figure 2-44 The important principle of activation energy. (A) Compound Y (a reactant) is in a 
relatively stable state, and energy is required to convert it to compound X (a product), even 
though X is at a lower overall energy level than Y. This conversion will not take place, therefore, 
unless compound Y can acquire enough activation energy (energy a minus energy b) from its 
surroundings to undergo the reaction that converts it into compound X.This energy may be 
provided by means of an unusually energetic collision with other molecules. For the reverse 
reaction, X — » Y, the activation energy will be much larger (energy a minus energy c); this reaction 
will therefore occur much more rarely. Activation energies are always positive; note, however, 
that the total energy change for the energetically favorable reaction Y -^ X is energy c minus 
energy b, a negative number. (B) Energy barriers for specific reactions can be lowered by 
catalysts, as indicated by the line marked d. Enzymes are particularly effective catalysts because 
they greatly reduce the activation energy for the reactions they perform. 



termed a catalyst; catalysts increase the rate of chemical reactions because they 
allow a much larger proportion of the random collisions with surrounding 
molecules to kick the substrates over the energy barrier, as illustrated in Figure 
2-45. Enzymes are among the most effective catalysts known, capable of speed- 
ing up reactions by factors of 10 14 or more. They thereby allow reactions that 
would not otherwise occur to proceed rapidly at normal temperatures. 

Enzymes are also highly selective. Each enzyme usually catalyzes only one 
particular reaction: in other words, it selectively lowers the activation energy of 
only one of the several possible chemical reactions that its bound substrate 
molecules could undergo. In this way, enzymes direct each of the many different 
molecules in a cell along specific reaction pathways (Figure 2-46). 

The success of living organisms is attributable to a cell's ability to make 
enzymes of many types, each with precisely specified properties. Each enzyme 
has a unique shape containing an active site, a pocket or groove in the enzyme 
into which only particular substrates will fit (Figure 2-47). Like all other cata- 
lysts, enzyme molecules themselves remain unchanged after participating in a 
reaction and therefore can function over and over again. In Chapter 3, we dis- 
cuss further how enzymes work. 
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Figure 2-45 Lowering the activation 
energy greatly increases the probability 
of reaction. At any given instant, a 
population of identical substrate 
molecules will have a range of energies, 
distributed as shown on the graph. The 
varying energies come from collisions 
with surrounding molecules, which make 
the substrate molecules jiggle, vibrate, 
and spin. For a molecule to undergo a 
chemical reaction, the energy of the 
molecule must exceed the activation 
energy barrier for that reaction; for most 
biological reactions, this almost never 
happens without enzyme catalysis. Even 
with enzyme catalysis, the substrate 
molecules must experience a particularly 
energetic collision to react (red shaded 
area). Raising the temperature can also 
increase the number of molecules with 
sufficient energy to overcome the 
activation energy needed for a reaction; 
but in contrast to enzyme catalysis, this 
effect is nonselective, speeding up all 
reactions. 
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How Enzymes Find Their Substrates: The Enormous Rapidity 
of Molecular Motions 

An enzyme will often catalyze the reaction of thousands of substrate molecules 
every second. This means that it must be able to bind a new substrate molecule 
in a fraction of a millisecond. But both enzymes and their substrates are present 
in relatively small numbers in a cell. How do they find each other so fast? Rapid 
binding is possible because the motions caused by heat energy are enormously 
fast at the molecular level. These molecular motions can be classified broadly 
into three kinds: (1) the movement of a molecule from one place to another 
(translational motion), (2) the rapid back-and-forth movement of covalently 
linked atoms with respect to one another (vibrations), and (3) rotations. All of 
these motions help to bring the surfaces of interacting molecules together. 

The rates of molecular motions can be measured by a variety of spectro- 
scopic techniques. A large globular protein is constantly tumbling, rotating about 
its axis about a million times per second. Molecules are also in constant transla- 
tional motion, which causes them to explore the space inside the cell very effi- 
ciently by wandering through it— a process called diffusion. In this way, every 
molecule in a cell collides with a huge number of other molecules each second. 
As the molecules in a liquid collide and bounce off one another, an individual 
molecule moves first one way and then another, its path constituting a random 
walk (Figure 2-48). In such a walk, the average net distance that each molecule 
travels (as the crow flies) from its starting point is proportional to the square root 
of the time involved: that is, if it takes a molecule 1 second on average to travel 
1 jim, it takes 4 seconds to travel 2 \im, 100 seconds to travel 10 jam, and so on. 

The inside of a cell is very crowded (Figure 2-49) . Nevertheless, experiments 
in which fluorescent dyes and other labeled molecules are injected into cells 



Figure 2-46 Floating ball analogies for 
enzyme catalysis. <TAAA> (A) A barrier 
dam is lowered to represent enzyme 
catalysis. The green ball represents a 
potential reactant (compound Y) that is 
bouncing up and down in energy level 
due to constant encounters with waves 
(an analogy for the thermal 
bombardment of the reactant molecule 
with the surrounding water molecules). 
When the barrier (activation energy) is 
lowered significantly, it allows the 
energetically favorable movement of the 
ball (the reactant) downhill. (B) The four 
walls of the box represent the activation 
energy barriers for four different chemical 
reactions that are all energetically 
favorable, in the sense that the products 
are at lower energy levels than the 
reactants. In the left-hand box, none of 
these reactions occurs because even the 
largest waves are not large enough to 
surmount any of the energy barriers. In 
the right-hand box, enzyme catalysis 
lowers the activation energy for reaction 
number 1 only; now the jostling of the 
waves allows passage of the reactant 
molecule over this energy barrier, 
inducing reaction 1. (C) A branching river 
with a set of barrier dams (yellow boxes) 
serves to illustrate how a series of 
enzyme-catalyzed reactions determines 
the exact reaction pathway followed by 
each molecule inside the cell. 
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Figure 2-47 How enzymes work. Each 
enzyme has an active site to which one 
or more substrate molecules bind, 
forming an enzyme-substrate complex. 
A reaction occurs at the active site, 
producing an enzyme-product complex. 
The product is then released, allowing the 
enzyme to bind further substrate 
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show that small organic molecules diffuse through the watery gel of the cytosol 
nearly as rapidly as they do through water. A small organic molecule, for exam- 
ple, takes only about one-fifth of a second on average to diffuse a distance of 
10 jum. Diffusion is therefore an efficient way for small molecules to move the 
limited distances in the cell (a typical animal cell is 15 jam in diameter). 

Since enzymes move more slowly than substrates in cells, we can think of 
them as sitting still. The rate of encounter of each enzyme molecule with its sub- 
strate will depend on the concentration of the substrate molecule. For example, 
some abundant substrates are present at a concentration of 0.5 mM. Since pure 
water is 55.5 M, there is only about one such substrate molecule in the cell for 
every 10 5 water molecules. Nevertheless, the active site on an enzyme molecule 
that binds this substrate will be bombarded by about 500,000 random collisions 
with the substrate molecule per second. (For a substrate concentration tenfold 
lower, the number of collisions drops to 50,000 per second, and so on.) A ran- 
dom encounter between the surface of an enzyme and the matching surface of 
its substrate molecule often leads immediately to the formation of an 
enzyme-substrate complex that is ready to react. A reaction in which a covalent 
bond is broken or formed can now occur extremely rapidly When one appreci- 
ates how quickly molecules move and react, the observed rates of enzymatic 
catalysis do not seem so amazing. 

Once an enzyme and substrate have collided and snuggled together properly 
at the active site, they form multiple weak bonds with each other that persist until 
random thermal motion causes the molecules to dissociate again. In general, the 
stronger the binding of the enzyme and substrate, the slower their rate of disso- 
ciation. However, when two colliding molecules have poorly matching surfaces, 
they form few noncovalent bonds and their total energy is negligible compared 
with that of thermal motion. In this case the two molecules dissociate as rapidly 
as they come together, preventing incorrect and unwanted associations between 
mismatched molecules, such as between an enzyme and the wrong substrate. 
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Figure 2-48 A random walk. <GGTA> 
Molecules in solution move in a random 
fashion as a result of the continual 
buffeting they receive in collisions with 
other molecules. This movement allows 
small molecules to diffuse rapidly from 
one part of the cell to another, as 
described in the text. 



The Free-Energy Change for a Reaction Determines Whether It 
Can Occur 

We must now digress briefly to introduce some fundamental chemistry Cells are 
chemical systems that must obey all chemical and physical laws. Although 
enzymes speed up reactions, they cannot by themselves force energetically unfa- 
vorable reactions to occur. In terms of a water analogy, enzymes by themselves 
cannot make water run uphill. Cells, however, must do just that in order to grow 
and divide: they must build highly ordered and energy- rich molecules from small 
and simple ones. We shall see that this is done through enzymes that directly cou- 
ple energetically favorable reactions, which release energy and produce heat, to 
energetically unfavorable reactions, which produce biological order. 

Before examining how such coupling is achieved, we must consider more 
carefully the term "energetically favorable." According to the second law of ther- 
modynamics, a chemical reaction can proceed spontaneously only if it results in 
a net increase in the disorder of the universe (see Figure 2-38). The criterion for 
an increase in disorder of the universe can be expressed most conveniently in 
terms of a quantity called the free energy, G, of a system. The value of G is of 
interest only when a system undergoes a change, and the change in G, denoted 
AG (delta G), is critical. Suppose that the system being considered is a collection 
of molecules. As explained in Panel 2-7 (pp. 118-119), free energy has been 
defined such that AG directly measures the amount of disorder created in the 
universe when a reaction takes place that involves these molecules. Energetically 
favorable reactions, by definition, are those that decrease free energy; in other 
words, they have a negative AG and disorder the universe (Figure 2-50). 

An example of an energetically favorable reaction on a macroscopic scale is 
the "reaction" by which a compressed spring relaxes to an expanded state, 
releasing its stored elastic energy as heat to its surroundings; an example on a 
microscopic scale is salt dissolving in water. Conversely, energetically unfavor- 
able reactions, with a positive AG^such as the joining of two amino acids to 
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Figure 2-49 The structure of the 

cytoplasm. The drawing is approximately 
to scale and emphasizes the crowding in 
the cytoplasm. Only the macromolecules 
are shown: RNAs are shown in blue, 
ribosomes in green, and proteins in red. 
Enzymes and other macromolecules diffuse 
relatively slowly in the cytoplasm, in part 
because they interact with many other 
macromolecules; small molecules, by 
contrast, diffuse nearly as rapidly as they do 
in water. (Adapted from D.S. Goodsell, 
Trends Biochem. Sci. 1 6:203-206, 1 991 . With 
permission from Elsevier.) 
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form a peptide bond — by themselves create order in the universe. Therefore, 
these reactions can take place only if they are coupled to a second reaction with 
a negative AGso large that the AGof the entire process is negative (Figure 2-51). 



The Concentration of Reactants Influences the Free-Energy 
Change and a Reaction's Direction 

As we have just described, a reaction Y ^ X will go in the direction Y -> X when 
the associated free-energy change, AG, is negative, just as a tensed spring left to 
itself will relax and lose its stored energy to its surroundings as heat. For a chem- 
ical reaction, however, AG depends not only on the energy stored in each indi- 
vidual molecule, but also on the concentrations of the molecules in the reaction 
mixture. Remember that AG reflects the degree to which a reaction creates a 
more disordered — in other words, a more probable — state of the universe. 
Recalling our coin analogy, it is very likely that a coin will flip from a head to a 
tail orientation if a jiggling box contains 90 heads and 10 tails, but this is a less 
probable event if the box has 10 heads and 90 tails. 

The same is true for a chemical reaction. For a reversible reaction Y ^ X, a 
large excess of Y over X will tend to drive the reaction in the direction Y -> X; that 
is, there will be a tendency for there to be more molecules making the transition 
Y -> X than there are molecules making the transition X -> Y. If the ratio of Y to 
X increases, the AG becomes more negative for the transition Y -> X (and more 
positive for the transition X -> Y). 

How much of a concentration difference is needed to compensate for a 
given decrease in chemical bond energy (and accompanying heat release)? The 
answer is not intuitively obvious, but it can be determined from a thermody- 
namic analysis that makes it possible to separate the concentration-dependent 
and the concentration-independent parts of the free-energy change. The AG for 
a given reaction can thereby be written as the sum of two parts: the first, called 
the standard free-energy change, AG , depends on the intrinsic characters of the 
reacting molecules; the second depends on their concentrations. For the simple 
reactionY^Xat37°C, 



AG: 



= AG° + 0.616 In M = AG° + 1.42 log -j^- 



where AG is in kilocalories per mole, [Y] and [X] denote the concentrations ofY 
and X, In is the natural logarithm, and the constant 0.616 is equal to RT: the 
product of the gas constant, R, and the absolute temperature, T. 

Note that AG equals the value of AG when the molar concentrations of Y 
and X are equal (log 1 = 0). As expected, AG becomes more negative as the ratio 
of X to Y decreases (the log of a number < 1 is negative) . 

Inspection of the above equation reveals that the AG equals the value of AG 9 
when the concentrations of Y and X are equal. But as the favorable reaction Y -» X 
proceeds, the concentration of the product X increases and the concentration of 
the substrate Y decreases. This change in relative concentrations will cause [X] / [ Y] 
to become increasingly large, making the initially favorable AG less and less nega- 
tive. Eventually, when AG = 0, a chemical equilibrium will be attained; here the 
concentration effect just balances the push given to the reaction by AG , and the 
ratio of substrate to product reaches a constant value (Figure 2-52). 

How far will a reaction proceed before it stops at equilibrium? To address 
this question, we need to introduce the equilibrium constant, K The value of K 
is different for different reactions, and it reflects the ratio of product to substrate 
at equilibrium. For the reaction Y -> X : 

K- [X ] 

The equation that connects AG and the ratio [X] / [ Y] allows us to connect AG° 
directly to K. Since AG = at equilibrium, the concentrations of Y and X at this 
point are such that: 



AG° = -1.42 log 
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universe would 
become more 
ordered. 



ENERGETICALLY 
UNFAVORABLE 
REACTION 



this reaction can occur only if 

it is coupled to a second, 

energetically favorable reaction 



Figure 2-50 The distinction between 
energetically favorable and 
energetically unfavorable reactions. 
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the energetically unfavorable 
reaction X-**Y is driven by the 
energetically favorable 
reaction C-HD, because the net 
free-energy change for the 
pair of coupled reactions is less 
than zero 



or, AG 9 = -1.42 logtf 



Figure 2-51 How reaction coupling is 
used to drive energetically unfavorable 
reactions. 
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Figure 2-52 Chemical equilibrium. 

When a reaction reaches equilibrium, the 
forward and backward fluxes of reacting 
molecules are equal and opposite. 



The formation of X is energetically favored in this example. In other words, the 
AG of Y -» X is negative and the AG of X -> Y is positive But because of thermal 
bombardments, there will always be some X converting to Y and vice versa. 

SUPPOSE WE START WITH AN EQUAL NUMBER OF Y AND X MOLECULES 



conversion of 
Y to X will 
occur often. 



Conversion of X to Y 

will occur less often, 

because it requires a 

more energetic collision 

than the transition 

Y^X. 




therefore the ratio of X to Y 
molecules will increase 



EVENTUALLY there will be a large enough excess of X over Y to just 
compensate for the slow rate of X -» Y. Equilibrium will then be attained 



v 
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AT EQUILIBRIUM the number of Y molecules being converted to X molecules 
each second is exactly equal to the number of X molecules being converted to 
Y molecules each second, so that there is no net change in the ratio of Y to X. 



Using the last equation, we can see how the equilibrium ratio of X to Y 
(expressed as an equilibrium constant, K) depends on the intrinsic character of 
the molecules, as expressed in the value of AG 3 (Table 2-4). Note that for every 
1.4 kcal/mole (5.9 kj/mole) difference in free energy at 37°C, the equilibrium 
constant changes by a factor of 10. 

When an enzyme (or any catalyst) lowers the activation energy for the reac- 
tion Y -» X, it also lowers the activation energy for the reaction X -> Y by exactly 
the same amount (see Figure 2-44). The forward and backward reactions will 
therefore be accelerated by the same factor by an enzyme, and the equilibrium 
point for the reaction (and AG ) is unchanged (Figure 2-53). 



For Sequential Reactions, AG Values Are Additive 

We can predict quantitatively the course of most reactions. A large body of ther- 
modynamic data has been collected that makes it possible to calculate the stan- 
dard change in free energy, AG , for most of the important metabolic reactions 
of the cell. The overall free-energy change for a metabolic pathway is then sim- 
ply the sum of the free-energy changes in each of its component steps. Consider, 
for example, two sequential reactions 

X->Y and Y^Z 
whose AG values are +5 and -13 kcal/mole, respectively (Recall that a mole is 6 
x 10 23 molecules of a substance.) If these two reactions occur sequentially the 
AG for the coupled reaction will be -8 kcal/mole. Thus, the unfavorable reaction 
X -> Y, which will not occur spontaneously, can be driven by the favorable reac- 
tion Y -> Z, provided that this second reaction follows the first. 

Cells can therefore cause the energetically unfavorable transition, X -> Y, to 
occur if an enzyme catalyzing the X -> Y reaction is supplemented by a second 
enzyme that catalyzes the energetically favorable reaction, Y -» Z. In effect, the 
reaction Y -» Z will then act as a "siphon" to drive the conversion of all of 
molecule X to molecule Y, and thence to molecule Z (Figure 2-54) . For example, 



Table 2-4 Relationship 
Between the Standard Free- 
Energy Change, AG°, and the 
Equilibrium Constant 



EQUILIBRIUM 


FREE ENERGY 


CONSTANT 


OF X MINUS 


[X] 


FREE ENERGY 


7yi~ — 


OFY 




kcal/mole (kJ/mole) 


10 5 


-7.1 (-29.7) 


10 4 


-5.7 (-23.8) 


10 3 


-4.3 (-18.0) 


10 2 


-2.8 (-11.7) 


10 1 


-1.4 (-5.9) 


1 


0(0) 


10 


1.4(5.9) 


10" 2 


2.8(11.7) 


10" 3 


4.3(18.0) 


10" 4 


5.7 (23.8) 


io- 5 


7.1 (297) 



Values of the equilibrium constant 
were calculated for the simple 
chemical reaction Y ^ X using the 
equation given in the text. 

The AG given here is in kilocalories 
per mole at 37°C, with kilojoules 
per mole in parentheses (1 kilocalorie 
is equal to 4,184 kilojoules) As 
explained in the text, AG° represents 
the free-energy difference under 
standard conditions (where all 
components are present at a 
concentration of 1 ,0 mole/liter). 

From this table, we see that if there is 
a favorable standard free-energy change 
(AG ) of -4,3 kcal/mole (-18.0 kJ/mole) for 
the transition Y -» X, there will be 1 000 
times more molecules in state X than 
in state Y at equilibrium (K= 1000). 
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several of the reactions in the long pathway that converts sugars into CO2 and 
H 2 would be energetically unfavorable if considered on their own. But the 
pathway nevertheless proceeds because the total AG for the series of sequential 
reactions has a large negative value. 

But forming a sequential pathway is not adequate for many purposes. Often 
the desired pathway is simply X -> Y, without further conversion of Y to some 
other product. Fortunately, there are other more general ways of using enzymes 
to couple reactions together. How these work is the topic we discuss next. 



Figure 2-53 Enzymes cannot change 
the equilibrium point for reactions. 

Enzymes, like all catalysts, speed up the 
forward and backward rates of a reaction 
by the same factor. Therefore, for both 
the catalyzed and the uncatalyzed 
reactions shown here, the number of 
molecules undergoing the transition 
X -> Y is equal to the number of 
molecules undergoing the transition 
Y -> X when the ratio of Y molecules to X 
molecules is 3.5 to 1 . In other words, the 
two reactions reach equilibrium at 
exactly the same point 



Activated Carrier Molecules Are Essential for Biosynthesis 

The energy released by the oxidation of food molecules must be stored tem- 
porarily before it can be channeled into the construction of the many other 
molecules needed by the cell. In most cases, the energy is stored as chemical 
bond energy in a small set of activated "carrier molecules," which contain one or 
more energy-rich covalent bonds. These molecules diffuse rapidly throughout 
the cell and thereby carry their bond energy from sites of energy generation to the 
sites where energy is used for biosynthesis and other cell activities (Figure 2-55). 
The activated carriers store energy in an easily exchangeable form, either as 
a readily transferable chemical group or as high-energy electrons, and they can 
serve a dual role as a source of both energy and chemical groups in biosynthetic 
reactions. For historical reasons, these molecules are also sometimes referred to 
as coenzymes. The most important of the activated carrier molecules are ATP 
and two molecules that are closely related to each other, NADH and NADPH— 
as we discuss in detail shortly. We shall see that cells use activated carrier 
molecules like money to pay for reactions that otherwise could not take place. 



m 



or. 



51 




(C) 



equilibrium point for 
X^Y reaction alone 



s , 



equilibrium point for 
Y— ^2 reaction alone 



♦♦ 



•1*:: 



equilibrium point for sequential reactions X-*-Y-^Z 



Figure 2-54 How an energetically 
unfavorable reaction can be driven by a 
second, following reaction. (A) At 

equilibrium, there are twice as many 
X molecules as Y molecules, because X is 
of lower energy than Y. (B) At equilibrium, 
there are 25 times more Z molecules than 
Y molecules, because Z is of much lower 
energy than Y. (C) If the reactions in (A) 
and (B) are coupled, nearly all of the X 
molecules will be converted to Z 
molecules, as shown. 
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Figure 2-55 Energy transfer and the role 
of activated carriers in metabolism. By 

serving as energy shuttles, activated 
carrier molecules perform their function 
as go-betweens that link the breakdown 
of food molecules and the release of 
energy (catabolism) to the energy- 
requiring biosynthesis of small and large 
organic molecules (anabofism). 
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The Formation of an Activated Carrier Is Coupled to an 
Energetically Favorable Reaction 

When a fuel molecule such as glucose is oxidized in a cell, enzyme-catalyzed 
reactions ensure that a large part of the free energy that is released by oxidation 
is captured in a chemically useful form, rather than being released as heat. This 
is achieved by means of a coupled reaction, in which an energetically favorable 
reaction drives an energetically unfavorable one that produces an activated car- 
rier molecule or some other useful energy store. Coupling mechanisms require 
enzymes and are fundamental to all the energy transactions of the cell. 

The nature of a coupled reaction is illustrated by a mechanical analogy in 
Figure 2-56, in which an energetically favorable chemical reaction is represented 
by rocks falling from a cliff. The energy of falling rocks would normally be entirely 
wasted in the form of heat generated by friction when the rocks hit the ground 
(see the falling brick diagram in Figure 2-39). By careful design, however, part of 
this energy could be used instead to drive a paddle wheel that lifts a bucket of 
water (Figure 2-56B). Because the rocks can now reach the ground only after 
moving the paddle wheel, we say that the energetically favorable reaction of rock 
falling has been directly coupled to the energetically unfavorable reaction of lift- 
ing the bucket of water. Note that because part of the energy is used to do work 
in (B), the rocks hit the ground with less velocity than in (A), and correspond- 
ingly less energy is dissipated as heat. 

Similar processes occur in cells, where enzymes play the role of the paddle 
wheel in our analogy. By mechanisms that will be discussed later in this chap- 
ter, they couple an energetically favorable reaction, such as the oxidation of 
foodstuffs, to an energetically unfavorable reaction, such as the generation of 



Figure 2-56 A mechanical model 
illustrating the principle of coupled 
chemical reactions. The spontaneous 
reaction shown in (A) could serve as an 
analogy for the direct oxidation of 
glucose to CO2 and H2O, which produces 
heat only. In (B) the same reaction is 
coupled to a second reaction; this second 
reaction is analogous to the synthesis of 
activated carrier molecules. The energy 
produced in (B) is in a more useful form 
than in (A) and can be used to drive a 
variety of otherwise energetically 
unfavorable reactions (C). 




kinetic energy of falling rocks is 
transformed into heat energy only 



part of the kinetic energy is used to lift 
a bucket of water, and a correspondingly 
smaller amount is transformed into heat 



the potential kinetic energy stored in 
the raised bucket of water can be 
used to drive hydraulic machines that 
carry out a variety of useful tasks 
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an activated carrier molecule. As a result, the amount of heat released by the oxi- 
dation reaction is reduced by exactly the amount of energy that is stored in the 
energy- rich covalent bonds of the activated carrier molecule. The activated car- 
rier molecule in turn picks up a packet of energy of a size sufficient to power a 
chemical reaction elsewhere in the cell. 



Figure 2-57 The hydrolysis of ATP to 
ADP and inorganic phosphate. The two 

outermost phosphates in ATP are held to 
the rest of the molecule by high-energy 
phosphoanhydride bonds and are readily 
transferred. As indicated, water can be 
added to ATP to form ADP and inorganic 
phosphate (Pj). This hydrolysis of the 
terminal phosphate of ATP yields 
between 1 1 and 1 3 kcal/mole of usable 
energy, depending on the intracellular 
conditions. The large negative AG of this 
reaction arises from several factors. 
Release of the terminal phosphate group 
removes an unfavorable repulsion 
between adjacent negative charges; in 
addition, the inorganic phosphate ion (Pj) 
released is stabilized by resonance and 
by favorable hydrogen-bond formation 
with water. 



ATP Is the Most Widely Used Activated Carrier Molecule 

The most important and versatile of the activated carriers in cells is ATP (adeno- 
sine triphosphate). Just as the energy stored in the raised bucket of water in Fig- 
ure 2-56B can drive a wide variety of hydraulic machines, ATP is a convenient 
and versatile store, or currency, of energy used to drive a variety of chemical 
reactions in cells. ATP is synthesized in an energetically unfavorable phosphory- 
lation reaction in which a phosphate group is added to ADP (adenosine diphos- 
phate). When required, ATP gives up its energy packet through its energetically 
favorable hydrolysis to ADP and inorganic phosphate (Figure 2-57). The regen- 
erated ADP is then available to be used for another round of the phosphoryla- 
tion reaction that forms ATP 

The energetically favorable reaction of ATP hydrolysis is coupled to many 
otherwise unfavorable reactions through which other molecules are synthe- 
sized. We shall encounter several of these reactions later in this chapter. Many of 
them involve the transfer of the terminal phosphate in ATP to another molecule, 
as illustrated by the phosphorylation reaction in Figure 2-58. 
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Figure 2-58 An example of a phosphate 
transfer reaction. Because an energy-rich 
phosphoanhydride bond in ATP is 
converted to a phosphoester bond, this 
reaction is energetically favorable, having 
a large negative AG. Reactions of this 
type are involved in the synthesis of 
phospholipids and in the initial steps of 
reactions that catabolize sugars. 
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Figure 2-59 An example of an energetically unfavorable 
biosynthetic reaction driven by ATP hydrolysis. (A) Schematic 
illustration of the formation of A-B in the condensation reaction 
described in the text. (B) The biosynthesis of the common amino 
acid glutamine. Glutamic acid is first converted to a high-energy 
phosphorylated intermediate (corresponding to the compound 
B-O-PO3 described in the text), which then reacts with ammonia 
(corresponding to A-H) to form glutamine. In this example both 
steps occur on the surface of the same enzyme, glutamine 
synthetase. The high energy bonds are shaded red. 




CONDENSATION 
STEP 



H 3 N + — CH — COO" 
glutamic acid 



ATP is the most abundant activated carrier in cells. As one example, it sup- 
plies energy for many of the pumps that transport substances into and out of 
the cell (discussed in Chapter 11). It also powers the molecular motors that 
enable muscle cells to contract and nerve cells to transport materials from one 
end of their long axons to another (discussed in Chapter 16). 



H 3 N + — CH — COO 
glutamine 



Energy Stored in ATP Is Often Harnessed to Join Two Molecules 
Together 

We have previously discussed one way in which an energetically favorable reac- 
tion can be coupled to an energetically unfavorable reaction, X -> Y, so as to 
enable it to occur. In that scheme a second enzyme catalyzes the energetically 
favorable reaction Y -> Z, pulling all of the X to Y in the process (see Figure 2-54). 
But when the required product is Y and not Z, this mechanism is not useful. 

A typical biosynthetic reaction is one in which two molecules, A and B, are 
joined together to produce A-B in the energetically unfavorable condensation 
reaction 

A-H + B-OH -> A-B + H 2 

There is an indirect pathway that allows A-H and B-OH to form A-B, in which a 
coupling to ATP hydrolysis makes the reaction go. Here energy from ATP hydrol- 
ysis is first used to convert B-OH to a higher-energy intermediate compound, 
which then reacts directly with A-H to give A-B. The simplest possible mecha- 
nism involves the transfer of a phosphate from ATP to B-OH to make B-OPO3, 
in which case the reaction pathway contains only two steps: 



1. 
2. 



B-OH + ATP -> B-O-PO3 + ADP 
A-H + B-O-PO3 -> A-B + Pi 



Net result: B-OH + ATP + A-H -> A-B + ADP + Pi 

The condensation reaction, which by itself is energetically unfavorable, is forced 
to occur by being directly coupled to ATP hydrolysis in an enzyme-catalyzed 
reaction pathway (Figure 2-59A). 

A biosynthetic reaction of exactly this type synthesizes the amino acid glu- 
tamine (Figure 2-59B). We will see shortly that similar (but more complex) 
mechanisms are also used to produce nearly all of the large molecules of the cell. 
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NADH and NADPH Are Important Electron Carriers 

Other important activated carrier molecules participate in oxidation-reduction 
reactions and are commonly part of coupled reactions in cells. These activated 
carriers are specialized to carry high-energy electrons and hydrogen atoms. 
The most important of these electron carriers are NAD + (nicotinamide ade- 
nine dinucleotide) and the closely related molecule NADP + (nicotinamide 
adenine dinucleotide phosphate). Later, we examine some of the reactions in 
which they participate. NAD + and NADP + each pick up a "packet of energy" cor- 
responding to two high-energy electrons plus a proton (H + ) — being converted to 
NADH {reduced nicotinamide adenine dinucleotide) and NADPH {reduced 
nicotinamide adenine dinucleotide phosphate), respectively. These molecules 
can therefore also be regarded as carriers of hydride ions (the H + plus two elec- 
trons, or H"). 

Like ATP, NADPH is an activated carrier that participates in many important 
biosynthetic reactions that would otherwise be energetically unfavorable. The 
NADPH is produced according to the general scheme shown in Figure 2-60A. Dur- 
ing a special set of energy-yielding catabolic reactions, a hydrogen atom plus two 
electrons are removed from the substrate molecule and added to the nicotinamide 
ring of NADP + to form NADPH, with a proton (H + ) being released into solution. 
This is a typical oxidation-reduction reaction; the substrate is oxidized and NADP + 
is reduced. The structures of NADP + and NADPH are shown in Figure 2-60B. 

NADPH readily gives up the hydride ion it carries in a subsequent oxida- 
tion-reduction reaction, because the nicotinamide ring can achieve a more sta- 
ble arrangement of electrons without it. In this subsequent reaction, which 
regenerates NADP\ it is the NADPH that is oxidized and the substrate that is 
reduced. The NADPH is an effective donor of its hydride ion to other molecules 
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Figure 2-60 NADPH, an important 
carrier of electrons. (A) NADPH is 
produced in reactions of the general type 
shown on the left, in which two 
hydrogen atoms are removed from a 
substrate. The oxidized form of the carrier 
molecule, NADP+, receives one hydrogen 
atom plus an electron (a hydride ion); the 
proton (H + ) from the other H atom is 
released into solution. Because NADPH 
holds its hydride ion in a high-energy 
linkage, the added hydride ion can easily 
be transferred to other molecules, as 
shown on the right. (B) The structures of 
NADP + and NADPH. The part of the 
NADP + molecule known as the 
nicotinamide ring accepts two electrons 
together with a proton (the equivalent of 
a hydride ion, H - ), forming NADPH. The 
molecules NAD + and NADH are identical 
in structure to NADP + and NADPH, 
respectively, except that the indicated 
phosphate group is absent from both. 
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for the same reason that ATP readily transfers a phosphate: in both cases the 
transfer is accompanied by a large negative free-energy change. One example of 
the use of NADPH in biosynthesis is shown in Figure 2-61 . 

The extra phosphate group on NADPH has no effect on the electron-trans- 
fer properties of NADPH compared with NADH, being far away from the region 
involved in electron transfer (see Figure 2-60B). It does, however, give a 
molecule of NADPH a slightly different shape from that of NADH, making it pos- 
sible for NADPH and NADH to bind as substrates to completely different sets of 
enzymes. Thus the two types of carriers are used to transfer electrons (or hydride 
ions) between two different sets of molecules. 

Why should there be this division of labor? The answer lies in the need to 
regulate two sets of electron-transfer reactions independently NADPH operates 
chiefly with enzymes that catalyze anabolic reactions, supplying the high-energy 
electrons needed to synthesize energy- rich biological molecules. NADH, by con- 
trast, has a special role as an intermediate in the catabolic system of reactions 
that generate ATP through the oxidation of food molecules, as we will discuss 
shortly. The genesis of NADH from NAD + and that of NADPH from NADP + occur 
by different pathways and are independently regulated, so that the cell can 
adjust the supply of electrons for these two contrasting purposes. Inside the cell 
the ratio of NAD + to NADH is kept high, whereas the ratio of NADP + to NADPH 
is kept low. This provides plenty of NAD + to act as an oxidizing agent and plenty 
of NADPH to act as a reducing agent — as required for their special roles in 
catabolism and anabolism, respectively. 



There Are Many Other Activated Carrier Molecules in Cells 

Other activated carriers also pick up and carry a chemical group in an easily 
transferred, high-energy linkage. For example, coenzyme A carries an acetyl 
group in a readily transferable linkage, and in this activated form is known as 
acetyl CoA (acetyl coenzyme A). Acetyl CoA (Figure 2-62) is used to add two car- 
bon units in the biosynthesis of larger molecules. 

In acetyl CoA as in other carrier molecules, the transferable group makes up 
only a small part of the molecule. The rest consists of a large organic portion that 
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Figure 2-61 The final stage in one of the 
biosynthetic routes leading to 
cholesterol. As in many other 
biosynthetic reactions, the reduction of 
the C=C bond is achieved by the transfer 
of a hydride ion from the carrier molecule 
NADPH, plus a proton (H + ) from the 
solution. 
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Figure 2-62 The structure of the 
important activated carrier molecule 
acetyl CoA. A space-filling model is 
shown above the structure. The sulfur 
atom (yellow) forms a thioester bond to 
acetate. Because this is a high-energy 
linkage, releasing a large amount of free 
energy when it is hydrolyzed, the acetate 
molecule can be readily transferred to 
other molecules. 
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Table 2-5 Some Activated Carrier Molecules Widely Used in Metabolism 


ACTIVATED CARRIER 


GROUP CARRIED IN HIGH-ENERGY LINKAGE 


ATP 


phosphate 


NADH, NADPH, FADH 2 


electrons and hydrogens 


Acetyl CoA 


acetyl group 


Carboxylated biotin 


carboxyl group 


S-Adenosylmethionine 


methyl group 


Uridine diphosphate glucose 


glucose 



serves as a convenient "handle," facilitating the recognition of the carrier 
molecule by specific enzymes. As with acetyl CoA, this handle portion very often 
contains a nucleotide (usually adenosine), a curious fact that may be a relic from 
an early stage of evolution. It is currently thought that the main catalysts for 
early life-forms — before DNA or proteins — were RNA molecules (or their close 
relatives), as described in Chapter 6. It is tempting to speculate that many of the 
carrier molecules that we find today originated in this earlier RNA world, where 
their nucleotide portions could have been useful for binding them to RNA 
enzymes. 

Figures 2-58 and 2-61 have presented examples of the type of transfer reac- 
tions powered by the activated carrier molecules ATP (transfer of phosphate) 
and NADPH (transfer of electrons and hydrogen). The reactions of other acti- 
vated carrier molecules involve the transfer of a methyl, carboxyl, or glucose 
group for the purpose of biosynthesis (Table 2-5). These activated carriers are 
generated in reactions that are coupled to ATP hydrolysis, as in the example in 
Figure 2-63. Therefore, the energy that enables their groups to be used for 
biosynthesis ultimately comes from the catabolic reactions that generate ATP 
Similar processes occur in the synthesis of the very large molecules of the cell — 
the nucleic acids, proteins, and polysaccharides — that we discuss next. 



The Synthesis of Biological Polymers Is Driven by ATP Hydrolysis 

As discussed previously, the macromolecules of the cell constitute most of its 
dry mass— that is, of the mass not due to water (see Figure 2-29). These 



Figure 2-63 A carboxyl group transfer 
reaction using an activated carrier 
molecule. Carboxylated biotin is used by 
the enzyme pyruvate carboxylase to 
transfer a carboxyl group in the 
production of oxaloacetate, a molecule 
needed for the citric acid cycle. The 
acceptor molecule for this group transfer 
reaction is pyruvate. Other enzymes use 
biotin to transfer carboxyl groups to 
other acceptor molecules. Note that 
synthesis of carboxylated biotin 
requires energy that is derived from 
ATP— a general feature of many 
activated carriers. 
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molecules are made from subunits (or monomers) that are linked together in a 
condensation reaction, in which the constituents of a water molecule (OH plus 
H) are removed from the two reactants. Consequently, the reverse reaction — 
the breakdown of all three types of polymers — occurs by the enzyme-catalyzed 
addition of water {hydrolysis). This hydrolysis reaction is energetically favor- 
able, whereas the biosynthetic reactions require an energy input (Figure 2-64). 
The nucleic acids (DNA and RNA), proteins, and polysaccharides are all 
polymers that are produced by the repeated addition of a monomer onto one 
end of a growing chain. The synthesis reactions for these three types of macro- 
molecules are outlined in Figure 2-65. As indicated, the condensation step in 
each case depends on energy from nucleoside triphosphate hydrolysis. And yet, 
except for the nucleic acids, there are no phosphate groups left in the final 
product molecules. How are the reactions that release the energy of ATP hydrol- 
ysis coupled to polymer synthesis? 



Figure 2-64 Condensation and 
hydrolysis as opposite reactions. The 

macromolecules of the cell are polymers 
that are formed from subunits (or 
monomers) by a condensation reaction 
and are broken down by hydrolysis. The 
condensation reactions are all 
energetically unfavorable. 
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Figure 2-65 The synthesis of polysaccharides, proteins, and nucleic 
acids. Synthesis of each kind of biological polymer involves the loss of 
water in a condensation reaction. Not shown is the consumption of 
high-energy nucleoside triphosphates that is required to activate each 
monomer before its addition. In contrast, the reverse reaction— the 
breakdown of all three types of polymers— occurs by the simple addition 
of water (hydrolysis). 
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For each type of macromolecule, an enzyme -catalyzed pathway exists which 
resembles that discussed previously for the synthesis of the amino acid glu- 
tamine (see Figure 2-59). The principle is exactly the same, in that the OH group 
that will be removed in the condensation reaction is first activated by becoming 
involved in a high-energy linkage to a second molecule. However, the actual 
mechanisms used to link ATP hydrolysis to the synthesis of proteins and 
polysaccharides are more complex than that used for glutamine synthesis, since 
a series of high-energy intermediates is required to generate the final high- 
energy bond that is broken during the condensation step (discussed in Chapter 
6 for protein synthesis). 

Each activated carrier has limits in its ability to drive a biosynthetic reaction. 
The AG for the hydrolysis of ATP to ADP and inorganic phosphate (Pi) depends 
on the concentrations of all of the reactants, but under the usual conditions in a 
cell it is between -1 1 and -13 kcal/mole (between -46 and -54 kj/mole). In prin- 
ciple, this hydrolysis reaction could drive an unfavorable reaction with a AG of, 
perhaps, +10 kcal/mole, provided that a suitable reaction path is available. For 
some biosynthetic reactions, however, even -13 kcal/mole may not be enough. 
In these cases the path of ATP hydrolysis can be altered so that it initially pro- 
duces AMP and pyrophosphate (PPJ, which is itself then hydrolyzed in a subse- 
quent step (Figure 2-66). The whole process makes available a total free-energy 
change of about -26 kcal/mole. An important type of biosynthetic reaction that 
is driven in this way is the synthesis of nucleic acids (polynucleotides) from 
nucleoside triphosphates, as illustrated on the right side of Figure 2-67. 

Note that the repetitive condensation reactions that produce macro- 
molecules can be oriented in one of two ways, giving rise to either the head poly- 
merization or the tail polymerization of monomers. In so-called head polymer- 
ization the reactive bond required for the condensation reaction is carried on 
the end of the growing polymer, and it must therefore be regenerated each time 
that a monomer is added. In this case, each monomer brings with it the reactive 
bond that will be used in adding the next monomer in the series. In tail poly- 
merization the reactive bond carried by each monomer is instead used immedi- 
ately for its own addition (Figure 2-68). 

We shall see in later chapters that both these types of polymerization are 
used. The synthesis of polynucleotides and some simple polysaccharides occurs 
by tail polymerization, for example, whereas the synthesis of proteins occurs by 
a head polymerization process. 
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Figure 2-66 An alternative pathway of 
ATP hydrolysis, in which pyrophosphate 
is first formed and then hydrolyzed. This 
route releases about twice as much free 
energy as the reaction shown earlier in 
Figure 2-57, and it forms AMP instead of 
ADP. (A) In the two successive hydrolysis 
reactions, oxygen atoms from the 
participating water molecules are 
retained in the products, as indicated, 
whereas the hydrogen atoms dissociate 
to form free hydrogen ions (H + , not 
shown). (B) Diagram of overall reaction in 
summary form. 
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Figure 2-67 Synthesis of a 
polynucleotide, RNA or DNA, is a 
multistep process driven by ATP 
hydrolysis. In the first step, a nucleoside 
monophosphate is activated by the 
sequential transfer of the terminal 
phosphate groups from two ATP 
molecules. The high-energy intermediate 
formed— a nucleoside triphosphate— 
exists free in solution until it reacts with 
the growing end of an RNA or a DNA 
chain with release of pyrophosphate. 
Hydrolysis of the latter to inorganic 
phosphate is highly favorable and helps 
to drive the overall reaction in the 
direction of polynucleotide synthesis. For 
details, see Chapter 5. 
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Summary 

Living cells are highly ordered and need to create order within themselves to survive 
and grow. This is thermodynamically possible only because of a continual input of 
energy, part of which must be released from the cells to their environment as heat. The 
energy comes ultimately from the electromagnetic radiation of the sun, which drives 
the formation of organic molecules in photosynthetic organisms such as green plants. 
Animals obtain their energy by eating these organic molecules and oxidizing them in 
a series of enzyme- catalyzed reactions that are coupled to the formation of ATP — a 
common currency of energy in all cells. 

To make possible the continual generation of order in cells, the energetically favor- 
able hydrolysis of ATP is coupled to energetically unfavorable reactions. In the biosyn- 
thesis of macromolecules, this is accomplished by the transfer of phosphate groups to 
form reactive phosphorylated intermediates. Because the energetically unfavorable 
reaction now becomes energetically favorable, ATP hydrolysis is said to drive the reac- 
tion. Polymeric molecules such as proteins, nucleic acids, and polysaccharides are 
assembled from small activated precursor molecules by repetitive condensation reac- 
tions that are driven in this way Other reactive molecules, called either active carriers 
or coenzymes, transfer other chemical groups in the course of biosynthesis: NADPH 
transfers hydrogen as a proton plus two electrons (a hydride ion), for example, whereas 
acetyl CoA transfers an acetyl group. 
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Figure 2-68 The orientation of the active intermediates in the repetitive condensation reactions that form 
biological polymers. The head growth of polymers is compared with its alternative, tail growth. As indicated, these 
two mechanisms are used to produce different types of biological macromolecules. 
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HOW CELLS OBTAIN ENERGY FROM FOOD 

The constant supply of energy that cells need to generate and maintain the bio- 
logical order that keeps them alive comes from the chemical bond energy in 
food molecules, which thereby serve as fuel for cells. 

The proteins, lipids, and polysaccharides that make up most of the food we 
eat must be broken down into smaller molecules before our cells can use them— 
either as a source of energy or as building blocks for other molecules. Enzymatic 
digestion breaks down the large polymeric molecules in food into their 
monomer subunits— proteins into amino acids, polysaccharides into sugars, 
and fats into fatty acids and glycerol. After digestion, the small organic 
molecules derived from food enter the cytosol of cells, where their gradual oxi- 
dation begins. 

Sugars are particularly important fuel molecules, and they are oxidized in 
small controlled steps to carbon dioxide (C0 2 ) and water (Figure 2-69). In this 
section we trace the major steps in the breakdown, or catabolism, of sugars and 
show how they produce ATF> NADH, and other activated carrier molecules in 
animal cells. A very similar pathway also operates in plants, fungi, and many 
bacteria. As we shall see, the oxidation of fatty acids is equally important for 
cells. Other molecules, such as proteins, can also serve as energy sources when 
they are funneled through appropriate enzymatic pathways. 



Glycolysis Is a Central ATP-Producing Pathway 

The major process for oxidizing sugars is the sequence of reactions known as 
glycolysis— from the Greek ghtkus, "sweet," and lusis, "rupture." Glycolysis pro- 
duces ATP without the involvement of molecular oxygen (G? gas). It occurs in 
the cytosol of most cells, including many anaerobic microorganisms (those that 
can live without using molecular oxygen). Glycolysis probably evolved early in 
the history of life, before photosynthetic organisms introduced oxygen into die 
atmosphere. During glycolysis, a glucose molecule with six carbon atoms is con- 
verted into two molecules of pyruvate, each of which contains three carbon 
atoms. For each glucose molecule, two molecules of ATP are hydrolyzed to pro- 
vide energy to drive the early steps, but four molecules of ATP are produced in 
the later steps. At the end of glycolysis, there is consequently a net gain of two 
molecules of ATP for each glucose molecule broken down. 

The glycolytic pathway is outlined in Figure 2-70 and shown in more detail 
in Panel 2-8 (pp. 120-121). Glycolysis involves a sequence of 10 separate reac- 
tions, each producing a different sugar intermediate and each catalyzed by a 



(A) stepwise oxidation of sugar in cells 
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Figure 2-69 Schematic representation 
of the controlled stepwise oxidation of 
sugar in a cell, compared with ordinary 
burning. (A) In the cell, enzymes catalyze 
oxidation via a series of small steps in 
which free energy is transferred in 
conveniently sized packets to carrier 
molecules— most often ATP and NADH. 
At each step, an enzyme controls the 
reaction by reducing the activation 
energy barrier that has to be surmounted 
before the specific reaction can occur. 
The total free energy released is exactly 
the same in (A) and (B). But if the sugar 
were instead oxidized to C0 2 and H 2 in 
a single step, as in (B), it would release an 
amount of energy much larger than 
could be captured for useful purposes. 
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Figure 2-70 An outline of glycolysis. 

<GGGC> Each of the 10 steps shown is 
catalyzed by a different enzyme. Note 
that step 4 cleaves a six-carbon sugar 
into two three-carbon sugars, so that the 
number of molecules at every stage after 
this doubles. As indicated, step 6 begins 
the energy generation phase of 
glycolysis. Because two molecules of ATP 
are hydrolyzed in the early energy 
investment phase, glycolysis results in the 
net synthesis of 2 ATP and 2 NADH 
molecules per molecule of glucose (see 
also Panel 2-8). 



different enzyme. Like most enzymes, these have names ending in ase— such as 
isomevase and dehydrogenase— to indicate the type of reaction they catalyze. 

Although no molecular oxygen is used in glycolysis, oxidation occurs, in that 
electrons are removed by NAD + (producing NADH) from some of the carbons 
derived from the glucose molecule. The stepwise nature of the process releases 
the energy of oxidation in small packets, so that much of it can be stored in acti- 
vated carrier molecules rather than all of it being released as heat (see Figure 
2-69). Thus, some of the energy released by oxidation drives the direct synthesis 
of ATP molecules from ADP and Pi, and some remains with the electrons in the 
high-energy electron carrier NADH. 

Two molecules of NADH are formed per molecule of glucose in the course of 
glycolysis. In aerobic organisms (those that require molecular oxygen to live), 
these NADH molecules donate their electrons to the electron-transport chain 
described in Chapter 14, and the NAD + formed from the NADH is used again for 
glycolysis (see step 6 in Panel 2-8, pp. 120-121). 



Fermentations Produce ATP in the Absence of Oxygen 

For most animal and plant cells, glycolysis is only a prelude to the final stage of the 
breakdown of food molecules. In these cells, the pyruvate formed by glycolysis is 
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rapidly transported into the mitochondria, where it is converted into C0 2 plus 
acetyl CoA, which is then completely oxidized to C0 2 and H 2 0. 

In contrast, for many anaerobic organisms— which do not utilize molecular 
oxygen and can grow and divide without it— glycolysis is the principal source of 
the cell's ATP. This is also true for certain animal tissues, such as skeletal muscle, 
that can continue to function when molecular oxygen is limiting. In these anaer- 
obic conditions, the pyruvate and the NADH electrons stay in the cytosol. The 
pyruvate is converted into products excreted from the cell— for example, into 
ethanol and C0 2 in the yeasts used in brewing and breadmaking, or into lactate 
in muscle. In this process, the NADH gives up its electrons and is converted back 
into NAD + . This regeneration of NAD + is required to maintain the reactions of 
glycolysis (Figure 2-71). 

Anaerobic energy-yielding pathways like these are called fermentations. 
Studies of the commercially important fermentations carried out by yeasts 
inspired much of early biochemistry Work in the nineteenth century led in 1896 to 
the then startling recognition that these processes could be studied outside living 
organisms, in cell extracts. This revolutionary discovery eventually made it possi- 
ble to dissect out and study each of the individual reactions in the fermentation 
process. The piecing together of the complete glycolytic pathway in the 1930s was 
a major triumph of biochemistry, and it was quickly followed by the recognition of 
the central role of ATP in cell processes. Thus, most of the fundamental concepts 
discussed in this chapter have been understood for many years. 
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Figure 2-71 Two pathways for the 
anaerobic breakdown of pyruvate. 
(A) When there is inadequate oxygen, for 
example, in a muscle cell undergoing 
vigorous contraction, the pyruvate 
produced by glycolysis is converted to 
lactate as shown. This reaction 
regenerates the NAD + consumed in step 
6 of glycolysis, but the whole pathway 
yields much less energy overall than 
complete oxidation. (B) In some 
organisms that can grow anaerobically, 
such as yeasts, pyruvate is converted via 
acetaldehyde into carbon dioxide and 
ethanol. Again, this pathway regenerates 
NAD + from NADH, as required to enable 
glycolysis to continue. Both (A) and (B) 
are examples of fermentations. 
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Glycolysis Illustrates How Enzymes Couple Oxidation to Energy 
Storage 

Returning to the paddle-wheel analogy that we used to introduce coupled reac- 
tions (see Figure 2-56), we can now equate enzymes with the paddle wheel. 
Enzymes act to harvest useful energy from the oxidation of organic molecules 
by coupling an energetically unfavorable reaction with a favorable one. To 
demonstrate this coupling, we examine a step in glycolysis to see exactly how 
such coupled reactions occur. 

Two central reactions in glycolysis (steps 6 and 7) convert the three-carbon 
sugar intermediate glyceraldehyde 3-phosphate (an aldehyde) into 3-phospho- 
glycerate (a carboxylic acid; see Panel 2-8, pp. 120-121). This entails the oxidation 
of an aldehyde group to a carboxylic acid group in a reaction that occurs in two 
steps. The overall reaction releases enough free energy to convert a molecule of 
ADP to ATP and to transfer two electrons from the aldehyde to NAD + to form 
NADH, while still releasing enough heat to the environment to make the overall 
reaction energetically favorable (AG for the overall reaction is -3.0 kcal/mole). 

Figure 2-72 outlines the means by which this remarkable feat of energy har- 
vesting is accomplished. The indicated chemical reactions are precisely guided 
by two enzymes to which the sugar intermediates are tightly bound. In fact, as 
detailed in Figure 2-72, the first enzyme (glyceraldehyde 3-phosphate dehy- 
drogenase) forms a short-lived covalent bond to the aldehyde through a reac- 
tive -SH group on the enzyme, and catalyzes its oxidation by NAD + in this 
attached state. The reactive enzyme-substrate bond is then displaced by an 
inorganic phosphate ion to produce a high-energy phosphate intermediate, 
which is released from the enzyme. This intermediate binds to the second 
enzyme (phosphoglycerate kinase), which catalyzes the energetically favorable 
transfer of the high-energy phosphate just created to ADP, forming ATP and 
completing the process of oxidizing an aldehyde to a carboxylic acid. 

We have shown this particular oxidation process in some detail because it 
provides a clear example of enzyme -mediated energy storage through coupled 
reactions (Figure 2-73). Steps 6 and 7 are the only reactions in glycolysis that 
create a high-energy phosphate linkage directly from inorganic phosphate. As 
such, they account for the net yield of two ATP molecules and two NADH 
molecules per molecule of glucose (see Panel 2-8, pp. 120-121). 

As we have just seen, ATP can be formed readily from ADP when a reaction 
intermediate is formed with a phosphate bond of higher- energy than the phos- 
phate bond in ATP Phosphate bonds can be ordered in energy by comparing the 
standard free-energy change (AG°) for the breakage of each bond by hydrolysis. 
Figure 2-74 compares the high-energy phosphoanhydride bonds in ATP with 
the energy of some other phosphate bonds, several of which are generated dur- 
ing glycolysis. 

Organisms Store Food Molecules in Special Reservoirs 

All organisms need to maintain a high ATP/ADP ratio to maintain biological 
order in their cells. Yet animals have only periodic access to food, and plants 
need to survive overnight without sunlight, when they are unable to produce 
sugar from photosynthesis. For this reason, both plants and animals convert 
sugars and fats to special forms for storage (Figure 2-75). 

To compensate for long periods of fasting, animals store fatty acids as fat 
droplets composed of water- insoluble triacylglycerols, largely in the cytoplasm 
of specialized fat cells, called adipocytes. For shorter-term storage, sugar is 
stored as glucose subunits in the large branched polysaccharide glycogen, 
which is present as small granules in the cytoplasm of many cells, including liver 
and muscle. The synthesis and degradation of glycogen are rapidly regulated 
according to need. When cells need more ATP than they can generate from the 
food molecules taken in from the bloodstream, they break down glycogen in a 
reaction that produces glucose 1 -phosphate, which is rapidly converted to glu- 
cose 6-phosphate for glycolysis. 



92 Chapter 2: Cell Chemistry and Biosynthesis 



(A) 



O 



^ 



H — C— OH 

CH.O® 3-phosphate 



glyceraldehyde 




-SH 



^■■^lE 3+n^ 



high-energy bond 

(thioester) 



HO— 





r+ HO- 


o phc 

II 

-P — OH 


o 

II i 


i 


1 
OH 


PMC 


) 





HO C = 

I 
H — C— OH 

CH 2 0(V) 



1,3-bisphosphoglycerate 



a. 



A covalent bond is formed between 
glyceraldehyde 3-phosphate (the 
substrate) and the -SH group of a 
cysteine side chain of the enzyme 
glyceraldehyde 3-phosphate 
dehydrogenase, which also binds 
noncovalently to NAD + . 



Oxidation of glyceraldehyde 
3-phosphate occurs, as two 
electrons plus a proton (a hydride 
ion, see Figure 2-60) are 
transferred from glyceraldehyde 
3-phosphate to the bound NAD\ 
forming NADH. Part of the energy 
released by the oxidation of the 
aldehyde is thus stored in NADH, 
and part goes into converting the 
bond between the enzyme and its 
substrate glyceraldehyde 
3-phosphate into a high-energy 
thioester bond 



A molecule of inorganic phosphate 
displaces the high-energy bond to 
the enzyme to create 1,3-bis- 
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a high-energy acyl-anhydride 
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Figure 2-72 Energy storage in steps 6 
and 7 of glycolysis. In these steps the 
oxidation of an aldehyde to a carboxylic 
acid is coupled to the formation of ATP 
and NADH. (A) Step 6 begins with the 
formation of a covalent bond between 
the substrate (glyceraldehyde 
3-phosphate) and an -SH group exposed 
on the surface of the enzyme 
(glyceraldehyde 3-phosphate 
dehydrogenase). The enzyme then 
catalyzes transfer of hydrogen (as a 
hydride ion — a proton plus two 
electrons) from the bound 
glyceraldehyde 3-phosphate to a 
molecule of NAD + . Part of the energy 
released in this oxidation is used to form 
a molecule of NADH and part is used to 
convert the original linkage between the 
enzyme and its substrate to a high- 
energy thioester bond (shown in red). 
A molecule of inorganic phosphate then 
displaces this high-energy bond on the 
enzyme, creating a high-energy sugar- 
phosphate bond instead (red). At this 
point the enzyme has not only stored 
energy in NADH, but also coupled the 
energetically favorable oxidation of an 
aldehyde to the energetically unfavorable 
formation of a high-energy phosphate 
bond. The second reaction has been 
driven by the first thereby acting like the 
"paddle-wheel" coupler in Figure 2-56. 

In reaction step 7, the high-energy 
sugar-phosphate intermediate just made, 
1,3-bisphosphoglycerate, binds to a 
second enzyme, phosphoglycerate 
kinase. The reactive phosphate is 
transferred to ADP, forming a molecule of 
ATP and leaving a free carboxylic acid 
group on the oxidized sugar. 

(B) Summary of the overall chemical 
change produced by reactions 6 and 7. 




Much of the energy of oxidation 
has been stored in the activated 
carriers ATP and NADH. 
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Figure 2-73 Schematic view of the 
coupled reactions that form NADH and 
ATP in steps 6 and 7 of glycolysis. The 

C-H bond oxidation energy drives the 
formation of both NADH and a high- 
energy phosphate bond. The breakage of 
the high-energy bond then drives ATP 
formation. 
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Figure 2-74 Phosphate bonds have different energies. Examples of different types of phosphate bonds with their sites of hydrolysis are 
shown in the molecules depicted on the left. Those starting with a gray carbon atom show only part of a molecule. Examples of molecules 
containing such bonds are given on the right, with the free-energy change for hydrolysis in kilocalories (kilojoules in parentheses). The transfer 
of a phosphate group from one molecule to another is energetically favorable if the standard free-energy change (AG°) for hydrolysis of the 
phosphate bond of the first molecule is more negative than that for hydrolysis of the phosphate bond in the second. Thus, a phosphate group 
is readily transferred from 1,3-bisphosphoglycerate to ADP to form ATP. The hydrolysis reaction can be viewed as the transfer of the phosphate 
group to water. 
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Figure 2-75 The storage of sugars and 
fats in animal and plant cells. (A) The 

structures of starch and glycogen, the 
storage form of sugars in plants and 
animals, respectively. Both are storage 
polymers of the sugar glucose and differ 
only in the frequency of branch points 
(the region in yellow is shown enlarged 
below). There are many more branches in 
glycogen than in starch. (B) An electron 
micrograph shows glycogen granules in 
the cytoplasm of a liver cell. (C) A thin 
section of a single chloroplast from a 
plant cell, showing the starch granules 
and lipid (fat droplets) that have 
accumulated as a result of the 
biosyntheses occurring there. (D) Fat 
droplets (stained red) beginning to 
accumulate in developing fat cells of an 
animal. (B, courtesy of Robert Fletterick 
and Daniel S. Friend; C, courtesy of 
K. Plaskitt; D, courtesy of Ronald M. Evans 
and Peter Totonoz.) 
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Quantitatively, fat is far more important than glycogen as an energy store for 
animals, presumably because it provides for more efficient storage. The oxidation 
of a gram of fat releases about twice as much energy as the oxidation of a gram 
of glycogen. Moreover, glycogen differs from fat in binding a great deal of water, 
producing a sixfold difference in the actual mass of glycogen required to store 
the same amount of energy as fat. An average adult human stores enough glyco- 
gen for only about a day of normal activities but enough fat to last for nearly a 
month. If our main fuel reservoir had to be carried as glycogen instead of fat, 
body weight would increase by an average of about 60 pounds. 

Although plants produce NADPH and ATP by photosynthesis, this important 
process occurs in a specialized organelle, called a chloroplast, which is isolated 
from the rest of the plant cell by a membrane that is impermeable to both types 
of activated carrier molecules. Moreover, the plant contains many other cells- 
such as those in the roots— that lack chloroplasts and therefore cannot produce 
their own sugars. Therefore, for most of its ATP production, the plant relies on an 
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export of sugars from its chloroplasts to the mitochondria that are located in all 
cells of the plant. Most of the ATP needed by the plant is synthesized in these 
mitochondria and exported from them to the rest of the plant cell, using exactly 
the same pathways for the oxidative breakdown of sugars as in nonphotosyn- 
thetic organisms (Figure 2-76). 

During periods of excess photosynthetic capacity during the day, chloro- 
plasts convert some of the sugars that they make into fats and into starch, a 
polymer of glucose analogous to the glycogen of animals. The fats in plants are 
triacylglycerols, just like the fats in animals, and differ only in the types of fatty 
acids that predominate. Fat and starch are both stored in the chloroplast as 
reservoirs to be mobilized as an energy source during periods of darkness (see 
Figure 2-75C). 

The embryos inside plant seeds must live on stored sources of energy for a 
prolonged period, until they germinate to produce leaves that can harvest the 
energy in sunlight. For this reason plant seeds often contain especially large 
amounts of fats and starch — which makes them a major food source for animals, 
including ourselves (Figure 2-77). 



Figure 2-76 How the ATP needed for 
most plant cell metabolism is made. In 

plants, the chloroplasts and mitochondria 
collaborate to supply cells with 
metabolites and ATP. (For details, see 
Chapter 14.) 



Most Animal Cells Derive Their Energy from Fatty Acids Between 
Meals 

After a meal, most of the energy that an animal needs is derived from sugars 
derived from food. Excess sugars, if any, are used to replenish depleted glycogen 
stores, or to synthesize fats as a food store. But soon the fat stored in adipose tis- 
sue is called into play, and by the morning after an overnight fast, fatty acid oxi- 
dation generates most of the ATP we need. 

Low glucose levels in the blood trigger the breakdown of fats for energy pro- 
duction. As illustrated in Figure 2-78, the triacylglycerols stored in fat droplets 
in adipocytes are hydrolyzed to produce fatty acids and glycerol, and the fatty 
acids released are transferred to cells in the body through the bloodstream. 
While animals readily convert sugars to fats, they cannot convert fatty acids to 
sugars. Instead, the fatty acids are oxidized directly. 




I 





Figure 2-77 Some plant seeds that 
serve as important foods for humans. 

Corn, nuts, and peas all contain rich 
stores of starch and fat that provide the 
young plant embryo in the seed with 
energy and building blocks for 
biosynthesis. (Courtesy of the John Innes 
Foundation.) 
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Figure 2-78 How stored fats are 
mobilized for energy production in 
animals. Low glucose levels in the blood 
trigger the hydrolysis of the 
triacylglycerol molecules in fat droplets 
to free fatty acids and glycerol, as 
illustrated. These fatty acids enter the 
bloodstream, where they bind to the 
abundant blood protein, serum albumin. 
Special fatty acid transporters in the 
plasma membrane of cells that oxidize 
fatty acids, such as muscle cells, then pass 
these fatty acids into the cytosol, from 
which they are moved into mitochondria 
for energy production {see Figure 2-80). 



Sugars and Fats Are Both Degraded to Acetyl CoA in 
Mitochondria 

In aerobic metabolism, the pyruvate that was produced by glycolysis from sug- 
ars in the cytosol is transported into the mitochondria of eucaryotic cells. 
There, it is rapidly decarboxylated by a giant complex of three enzymes* called 
the pyruvate dehydrogenase complex. The products of pyruvate decarboxylation 
are a molecule of C0 2 (a waste product], a molecule of NADH, and acetyl CoA 
(Figure 2-79). 

The fatty acids imported from the bloodstream are moved into mitochon- 
dria, where all of their oxidation takes place (Figure 2-80). Each molecule of 
fatty acid (as the activated molecule fatty acyl CoA) is broken down completely 
by a cycle of reactions that trims two carbons at a time from its carboxyl end, 
generating one molecule of acetyl CoA for each turn of the cycle. A molecule of 
NADH and a molecule of FADH 2 are also produced in this process (Figure 2-81 ) . 

Sugars and fats are the major energy sources for most non-photosynthetic 
organisms, including humans. However, most of the useful energy that can be 
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Figure 2-79 The oxidation of pyruvate 
to acetyl CoA and C0 2 . (A) The structure 
of the pyruvate dehydrogenase complex, 
which contains 60 polypeptide chains. 
This is an example of a large 
multienzyme complex in which reaction 
intermediates are passed directly from 
one enzyme to another. In eucaryotic 
cells it is located in the mitochondrion. 
(B) The reactions carried out by the 
pyruvate dehydrogenase complex. The 
complex converts pyruvate to acetyl CoA 
in the mitochondrial matrix; NADH is also 
produced in this reaction. A, B, and C are 
the three enzymes pyruvate 
decarboxylase, lipoamide reductase- 
transacetylase, and dihydrolipoyl 
dehydrogenase, respectively. These 
enzymes are illustrated in (A); their 
activities are linked as shown. 
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Figure 2-80 Pathways for the production of acetyl CoA from sugars and fats. The mitochondrion in 
eucaryotic cells is the place where acetyl CoA is produced from both types of major food molecules. It is 
therefore the place where most of the cell's oxidation reactions occur and where most of its ATP is made. 
The structure and function of mitochondria are discussed in detail in Chapter 14. 



extracted from the oxidation of both types of foodstuffs remains stored in the 
acetyl CoA molecules that are produced by the two types of reactions just 
described. The citric acid cycle of reactions, in which the acetyl group in acetyl 
CoA is oxidized to CO2 and H2O, is therefore central to the energy metabolism of 
aerobic organisms. In eucaryotes these reactions all take place in mitochondria. 
We should therefore not be surprised to discover that the mitochondrion is the 
place where most of the ATP is produced in animal cells. In contrast, aerobic 
bacteria carry out all of their reactions in a single compartment, the cytosol, and 
it is here that the citric acid cycle takes place in these cells. 



The Citric Acid Cycle Generates NADH by Oxidizing Acetyl Groups 
toC0 2 

In the nineteenth century, biologists noticed that in the absence of air (anaero- 
bic conditions) cells produce lactic acid (for example, in muscle) or ethanol (for 
example, in yeast), while in its presence (aerobic conditions) they consume O2 
and produce C0 2 and H 2 0. Efforts to define the pathways of aerobic metabolism 



Figure 2-81 The oxidation of fatty acids 
to acetyl CoA. (A) Electron micrograph of 
a lipid droplet in the cytoplasm (top), and 
the structure of fats (bottom). Fats are 
triacylglycerols. The glycerol portion, to 
which three fatty acids are linked 
through ester bonds, is shown here in 
blue. Fats are insoluble in water and form 
large lipid droplets in the specialized fat 
cells (called adipocytes) in which they are 
stored. (B) The fatty acid oxidation cycle. 
The cycle is catalyzed by a series of four 
enzymes in the mitochondrion. Each 
turn of the cycle shortens the fatty acid 
chain by two carbons (shown in red) and 
generates one molecule of acetyl CoA 
and one molecule each of NADH and 
FADH 2 . The structure of FADH 2 is 
presented in Figure 2-83B. (A, courtesy 
of DanielS. Friend.) 
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eventually focused on the oxidation of pyruvate and led in 1937 to the discovery 
of the citric acid cycle, also known as the tricarboxylic acid cycle or the Krebs 
cycle. The citric acid cycle accounts for about two-thirds of the total oxidation of 
carbon compounds in most cells, and its major end products are CO2 and high- 
energy electrons in the form of NADH. The C0 2 is released as a waste product, 
while the high-energy electrons from NADH are passed to a membrane-bound 
electron-transport chain (discussed in Chapter 14), eventually combining with 
2 to produce H 2 0. Although the citric acid cycle itself does not use 2 , it 
requires 2 in order to proceed because there is no other efficient way for the 
NADH to get rid of its electrons and thus regenerate the NAD + that is needed to 
keep the cycle going. 

The citric acid cycle takes place inside mitochondria in eucaryotic cells. It 
results in the complete oxidation of the carbon atoms of the acetyl groups in 
acetyl CoA, converting them into C0 2 . But the acetyl group is not oxidized 
directly Instead, this group is transferred from acetyl CoA to a larger, four-car- 
bon molecule, oxaloacetate, to form the six-carbon tricarboxylic acid, citric acid, 
for which the subsequent cycle of reactions is named. The citric acid molecule is 
then gradually oxidized, allowing the energy of this oxidation to be harnessed to 
produce energy-rich activated carrier molecules. The chain of eight reactions 
forms a cycle because at the end the oxaloacetate is regenerated and enters a 
new turn of the cycle, as shown in outline in Figure 2-82. 

We have thus far discussed only one of the three types of activated carrier 
molecules that are produced by the citric acid cycle, the NAD+-NADH pair (see 
Figure 2-60). In addition to three molecules of NADH, each turn of the cycle also 
produces one molecule of FADH 2 (reduced flavin adenine dinucleotide) from 
FAD and one molecule of the ribonucleotide GTP (guanosine triphosphate) 
from GDP. The structures of these two activated carrier molecules are illustrated 
in Figure 2-83. GTP is a close relative of ATR and the transfer of its terminal 
phosphate group to ADP produces one ATP molecule in each cycle. Like NADH, 
FADH 2 is a carrier of high-energy electrons and hydrogen. As we discuss shortly, 
the energy that is stored in the readily transferred high-energy electrons of 
NADH and FADH 2 will be utilized subsequently for ATP production through the 
process of oxidative phosphorylation, the only step in the oxidative catabolism 
of foodstuffs that directly requires gaseous oxygen (0 2 ) from the atmosphere. 

Panel 2-9 (pp. 122-123) presents the complete citric acid cycle. Water, rather 
than molecular oxygen, supplies the extra oxygen atoms required to make C0 2 
from the acetyl groups entering the citric acid cycle. As illustrated in the panel, 

O 



H 3 C— C — S-CoA 
acetyl CoA 
2C 



citrate 




CEJUJ + h' 



NET RESULT: ONE TURN OF THE CYCLE PRODUCES THREE NADH, ONE GTP, AND 
ONE FADH 2 , AND RELEASES TWO MOLECULES OF C0 2 



Figure 2-82 Simple overview of the 
citric acid cycle. <TAGT> The reaction of 
acetyl CoA with oxaloacetate starts the 
cycle by producing citrate (citric acid). In 
each turn of the cycle, two molecules of 
CO2 are produced as waste products, plus 
three molecules of NADH, one molecule 
of GTP, and one molecule of FADH 2 . The 
number of carbon atoms in each 
intermediate is shown in a yellow box. 
For details, see Panel 2-9 (pp. 1 22-1 23). 
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three molecules of water are split in each cycle, and the oxygen atoms of some 
of them are ultimately used to make CO2. 

In addition to pyruvate and fatty acids, some amino acids pass from the 
cytosol into mitochondria, where they are also converted into acetyl CoA or one 
of the other intermediates of the citric acid cycle. Thus, in the eucaryotic cell, the 
mitochondrion is the center toward which all energy-yielding processes lead, 
whether they begin with sugars, fats, or proteins. 

Both the citric acid cycle and glycolysis also function as starting points for 
important biosynthetic reactions by producing vital carbon-containing inter- 
mediates, such as oxaloacetate and cc-ketoglutarate. Some of these substances 
produced by catabolism are transferred back from the mitochondrion to the 
cytosol, where they serve in anabolic reactions as precursors for the synthesis of 
many essential molecules, such as amino acids (Figure 2-84). 
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Figure 2-83 The structures of GTP and 
FADH 2 . (A) GTP and GDP are close 
relatives of ATP and ADP, respectively. 
(B) FADH 2 is a carrier of hydrogens and 
high-energy electrons, like NADH and 
NADPH. It is shown here in its oxidized 
form (FAD) with the hydrogen-carrying 
atoms highlighted in ye//oiv. 
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Figure 2-84 Glycolysis and the citric 
acid cycle provide the precursors 
needed to synthesize many important 
biological molecules. The amino acids, 
nucleotides, lipids, sugars, and other 
molecules — shown here as products — in 
turn serve as the precursors for the many 
macromoleculesof the cell. Each black 
arrow in this diagram denotes a single 
enzyme-catalyzed reaction; the red 
arrows generally represent pathways with 
many steps that are required to produce 
the indicated products. 
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Electron Transport Drives the Synthesis of the Majority of the ATP 
in Most Cells 

Most chemical energy is released in the last step in the degradation of a food 
molecule. In this final process the electron carriers NADH and FADH2 transfer 
the electrons that they have gained when oxidizing other molecules to the elec- 
tron-transport chain, which is embedded in the inner membrane of the mito- 
chondrion (see Figure 14-10). As the electrons pass along this long chain of spe- 
cialized electron acceptor and donor molecules, they fall to successively lower 
energy states. The energy that the electrons release in this process pumps H + 
ions (protons) across the membrane — from the inner mitochondrial compart- 
ment to the outside — generating a gradient of H + ions (Figure 2-85). This gradi- 
ent serves as a source of energy, being tapped like a battery to drive a variety of 
energy- requiring reactions. The most prominent of these reactions is the gener- 
ation of ATP by the phosphorylation of ADR 

At the end of this series of electron transfers, the electrons are passed to 
molecules of oxygen gas (O2) that have diffused into the mitochondrion, which 
simultaneously combine with protons (H + ) from the surrounding solution to 
produce water molecules. The electrons have now reached their lowest energy 
level, and therefore all the available energy has been extracted from the oxidized 
food molecule. This process, termed oxidative phosphorylation (Figure 2-86), 
also occurs in the plasma membrane of bacteria. As one of the most remarkable 
achievements of cell evolution, it is a central topic of Chapter 14. 

In total, the complete oxidation of a molecule of glucose to H2O and CO2 is 
used by the cell to produce about 30 molecules of ATP. In contrast, only 2 
molecules of ATP are produced per molecule of glucose by glycolysis alone. 



Amino Acids and Nucleotides Are Part of the Nitrogen Cycle 

So far we have concentrated mainly on carbohydrate metabolism and have not 
yet considered the metabolism of nitrogen or sulfur. These two elements are 
important constituents of biological macromolecules. Nitrogen and sulfur 
atoms pass from compound to compound and between organisms and their 
environment in a series of reversible cycles. 

Although molecular nitrogen is abundant in the Earth's atmosphere, nitro- 
gen is chemically unreactive as a gas. Only a few living species are able to incor- 
porate it into organic molecules, a process called nitrogen fixation. Nitrogen 
fixation occurs in certain microorganisms and by some geophysical processes, 
such as lightning discharge. It is essential to the biosphere as a whole, for with- 
out it life could not exist on this planet. Only a small fraction of the nitrogenous 
compounds in today's organisms, however, is due to fresh products of nitrogen 
fixation from the atmosphere. Most organic nitrogen has been in circulation for 
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Figure 2-85 The generation of an 
H + gradient across a membrane by 
electron-transport reactions. 

A high-energy electron (derived, for 
example, from the oxidation of a 
metabolite) is passed sequentially by 
carriers A, B, and C to a lower energy 
state. In this diagram carrier B is arranged 
in the membrane in such a way that it 
takes up H + from one side and releases it 
to the other as the electron passes. The 
result is an H + gradient. As discussed in 
Chapter 14, this gradient is an important 
form of energy that is harnessed by other 
membrane proteins to drive the 
formation of ATP. 
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Figure 2-86 The final stages of oxidation 
of food molecules. Molecules of NADH 
and FADH 2 (FADH 2 is not shown) are 
produced by the citric acid cycle. These 
activated carriers donate high-energy 
electrons that are eventually used to 
reduce oxygen gas to water. 
A major portion of the energy released 
during the transfer of these electrons 
along an electron-transfer chain in the 
mitochondrial inner membrane (or in the 
plasma membrane of bacteria) is 
harnessed to drive the synthesis of ATP — 
hence the name oxidative 
phosphorylation (discussed in Chapter 14). 
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some time, passing from one living organism to another. Thus present-day 
nitrogen-fixing reactions can be said to perform a "topping-up" function for the 
total nitrogen supply. 

Vertebrates receive virtually all of their nitrogen from their dietary intake of 
proteins and nucleic acids. In the body these macromolecules are broken down 
to amino acids and the components of nucleotides, and the nitrogen they con- 
tain is used to produce new proteins and nucleic acids — or utilized to make 
other molecules. About half of the 20 amino acids found in proteins are essen- 
tial amino acids for vertebrates (Figure 2-87), which means that they cannot be 
synthesized from other ingredients of the diet. The others can be so synthesized, 
using a variety of raw materials, including intermediates of the citric acid cycle 
as described previously. The essential amino acids are made by plants and other 
organisms, usually by long and energetically expensive pathways that have been 
lost in the course of vertebrate evolution. RoshanKetab 021-66950639 

The nucleotides needed to make RNA and DNA can be synthesized using 
specialized biosynthetic pathways. All of the nitrogens in the purine and pyrim- 
idine bases (as well as some of the carbons) are derived from the plentiful 
amino acids glutamine, aspartic acid, and glycine, whereas the ribose and 
deoxyribose sugars are derived from glucose. There are no "essential 
nucleotides" that must be provided in the diet. 

Amino acids not used in biosynthesis can be oxidized to generate metabolic 
energy. Most of their carbon and hydrogen atoms eventually form CO2 or H2O, 
whereas their nitrogen atoms are shuttled through various forms and eventually 
appear as urea, which is excreted. Each amino acid is processed differently, and 
a whole constellation of enzymatic reactions exists for their catabolism. 

Sulfur is abundant on Earth in its most oxidized form, sulfate (SO4 2 "). To con- 
vert it to forms useful for life, sulfate must be reduced to sulfide (S 2 ~), the oxida- 
tion state of sulfur required for the synthesis of essential biological molecules. 
These molecules include the amino acids methionine and cysteine, coenzyme A 
(see Figure 2-62), and the iron-sulfur centers essential for electron transport (see 
Figure 14-23). The process begins in bacteria, fungi, and plants, where a special 
group of enzymes use ATP and reducing power to create a sulfate assimilation 
pathway. Humans and other animals cannot reduce sulfate and must therefore 
acquire the sulfur they need for their metabolism in the food that they eat. 
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Figure 2-87 The nine essential amino 

acids. These cannot be synthesized by 
human cells and so must be supplied in 
the diet. 



Metabolism Is Organized and Regulated 

One gets a sense of the intricacy of a cell as a chemical machine from the rela- 
tion of glycolysis and the citric acid cycle to the other metabolic pathways 
sketched out in Figure 2-88. This type of chart, which was used earlier in this 
chapter to introduce metabolism, represents only some of the enzymatic path- 
ways in a cell. It is obvious that our discussion of cell metabolism has dealt with 
only a tiny fraction of cellular chemistry. 

All these reactions occur in a cell that is less than 0.1 mm in diameter, and 
each requires a different enzyme. As is clear from Figure 2-88, the same 
molecule can often be part of many different pathways. Pyruvate, for example, is 
a substrate for half a dozen or more different enzymes, each of which modifies it 
chemically in a different way. One enzyme converts pyruvate to acetyl CoA, 
another to oxaloacetate; a third enzyme changes pyruvate to the amino acid ala- 
nine, a fourth to lactate, and so on. All of these different pathways compete for 
the same pyruvate molecule, and similar competitions for thousands of other 
small molecules go on at the same time. 

The situation is further complicated in a multicellular organism. Different 
cell types will in general require somewhat different sets of enzymes. And differ- 
ent tissues make distinct contributions to the chemistry of the organism as a 
whole. In addition to differences in specialized products such as hormones or 
antibodies, there are significant differences in the "common" metabolic path- 
ways among various types of cells in the same organism. 

Although virtually all cells contain the enzymes of glycolysis, the citric acid 
cycle, lipid synthesis and breakdown, and amino acid metabolism, the levels of 
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Figure 2-88 Glycolysis and the citric 
acid cycle are at the center of 
metabolism. Some 500 metabolic 
reactions of a typical cell are shown 
schematically with the reactions of 
glycolysis and the citric acid cycle in red. 
Other reactions either lead into these 
two central pathways — delivering small 
molecules to be catabolized with 
production of energy— or they lead 
outward and thereby supply carbon 
compounds for the purpose of 
biosynthesis. 



these processes required in different tissues are not the same. For example, 
nerve cells, which are probably the most fastidious cells in the body, maintain 
almost no reserves of glycogen or fatty acids and rely almost entirely on a con- 
stant supply of glucose from the bloodstream. In contrast, liver cells supply glu- 
cose to actively contracting muscle cells and recycle the lactic acid produced by 
muscle cells back into glucose. All types of cells have their distinctive metabolic 
traits, and they cooperate extensively in the normal state, as well as in response 
to stress and starvation. One might think that the whole system would need to 
be so finely balanced that any minor upset, such as a temporary change in 
dietary intake, would be disastrous. 

In fact, the metabolic balance of a cell is amazingly stable. Whenever the bal- 
ance is perturbed, the cell reacts so as to restore the initial state. The cell can 
adapt and continue to function during starvation or disease. Mutations of many 
kinds can damage or even eliminate particular reaction pathways, and yet — pro- 
vided that certain minimum requirements are met — the cell survives. It does so 
because an elaborate network of control mechanisms regulates and coordinates 
the rates of all of its reactions. These controls rest, ultimately on the remarkable 
abilities of proteins to change their shape and their chemistry in response to 
changes in their immediate environment. The principles that underlie how large 
molecules such as proteins are built and the chemistry behind their regulation 
will be our next concern. 



END-OF-CHAPTER PROBLEMS 



103 



Summary 

Glucose and other food molecules are broken down by controlled stepwise oxidation to 
provide chemical energy in the form of ATP and NADH. There are three main sets of 
reactions that act in series — the products of each being the starting material for the 
next: glycolysis (which occurs in the cytosol), the citric acid cycle (in the mitochondrial 
matrix), and oxidative phosphorylation (on the inner mitochondrial membrane). The 
intermediate products of glycolysis and the citric acid cycle are used both as sources of 
metabolic energy and to produce many of the small molecules used as the raw materi- 
als for biosynthesis. Cells store sugar molecules as glycogen in animals and starch in 
plants; both plants and animals also use fats extensively as a food store. These storage 
materials in turn serve as a major source of food for humans, along with the proteins 
that comprise the majority of the dry mass of most of the cells in the foods we eat. 



PROBLEMS 



Which statements are true? Explain why or why not. 

2-1 Of the original radioactivity in a sample, only about 
1/1000 will remain after 10 half-lives. 

2-2 A 10" 8 M solution of HC1 has a pH of 8. 

2-3 Most of the interactions between macromolecules 
could be mediated just as well by covalent bonds as by non- 
covalent bonds. 

2-4 Animals and plants use oxidation to extract energy 
from food molecules. 

2-5 If an oxidation occurs in a reaction, it must be 
accompanied by a reduction. 

2-6 Linking the energetically unfavorable reaction A -» B 
to a second, favorable reaction B -> C will shift the equilib- 
rium constant for the first reaction. 

2-7 The criterion for whether a reaction proceeds spon- 
taneously is AG not AG , because AG takes into account the 
concentrations of the substrates and products. 

2-8 Because glycolysis is only a prelude to the oxidation 
of glucose in mitochondria, which yields 15-fold more ATP, 
glycolysis is not really important for human cells. 

2-9 The oxygen consumed during the oxidation of glu- 
cose in animal cells is returned as C0 2 to the atmosphere. 

Discuss the following problems. 

2-10 The organic chemistry of living cells is said to be spe- 
cial for two reasons: it occurs in an aqueous environment 
and it accomplishes some very complex reactions. But do 
you suppose it is really all that much different from the 
organic chemistry carried out in the top laboratories in the 
world? Why or why not? 

2-1 1 The molecular weight of ethanol (CH 3 CH 2 OH) is 46 
and its density is 0.789 g/cm 3 . 

A. What is the molarity of ethanol in beer that is 5% 
ethanol by volume? [Alcohol content of beer varies from 
about 4% (lite beer) to 8% (stout beer).] 

B. The legal limit for a driver's blood alcohol content 
varies, but 80 mg of ethanol per 100 mL of blood (usually 
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referred to as a blood alcohol level of 0.08) is typical. What is 
the molarity of ethanol in a person at this legal limit? 

C. How many 12-oz (355-mL) bottles of 5% beer could a 
70-kg person drink and remain under the legal limit? A 70-kg 
person contains about 40 liters of water. Ignore the 
metabolism of ethanol, and assume that the water content 
of the person remains constant. 

D. Ethanol is metabolized at a constant rate of about 120 
mg per hour per kg body weight, regardless of its concentra- 
tion. If a 70-kg person were at twice the legal limit (160 
mg/100 mL), how long would it take for their blood alcohol 
level to fall below the legal limit? 

2-12 Specific activity refers to the amount of radioactiv- 
ity per unit amount of substance, usually in biology 
expressed on a molar basis, for example, as Ci/mmol. [One 
curie (Ci) corresponds to 2.22 x 10 12 disintegrations per 
minute (dpm).] As apparent in Table Q2-1, which lists 
properties of four isotopes commonly used in biology, 
there is an inverse relationship between maximum specific 
activity and half-life. Do you suppose this is just a coinci- 
dence or is there an underlying reason? Explain your 
answer. 

2-1 3 By a convenient coincidence the ion product of water, 
K w = [H + ] [OH"], is a nice round number: 1.0 x 10" 14 M 2 . 

A. Why is a solution at pH 7.0 said to be neutral? 

B. What is the H + concentration and pH of a 1 mM solution 
ofNaOH? 

C If the pH of a solution is 5.0, what is the concentration 
of OH" ions? 

2- 1 4 Suggest a rank order for the pK values (from lowest to 
highest) for the carboxyl group on the aspartate side chain 
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in the following environments in a protein. Explain your 
ranking. 

1 . An aspartate side chain on the surface of a protein with 
no other ionizable groups nearby. 

2. An aspartate side chain buried in a hydrophobic pocket 
on the surface of a protein. 

3. An aspartate side chain in a hydrophobic pocket adja- 
cent to a glutamate side chain. 

4. An aspartate side chain in a hydrophobic pocket adja- 
cent to a lysine side chain. 

2-1 5 A histidine side chain is known to play an important 
role in the catalytic mechanism of an enzyme; however, it is 
not clear whether histidine is required in its protonated 
(charged) or unprotonated (uncharged) state. To answer this 
question you measure enzyme activity over a range of pH, 
with the results shown in Figure Q2-1. Which form of histi- 
dine is required for enzyme activity? 



120 




Figure Q2-1 Enzyme 
activity as a function of 
pH (Problem 2-15). 



2-1 6 During an all-out sprint, muscles metabolize glucose 
anaerobically, producing a high concentration of lactic acid, 
which lowers the pH of the blood and of the cytosol and 
contributes to the fatigue sprinters experience well before 
their fuel reserves are exhausted. The main blood buffer 
against pH changes is the bicarbonate/C02 system. 
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To improve their performance, would you advise sprinters to 
hold their breath or to breathe rapidly for a minute immedi- 
ately before the race? Explain your answer. 

2-17 The three molecules in Figure Q2-2 contain the 
seven most common reactive groups in biology. Most 
molecules in the cell are built from these functional groups. 
Indicate and name the functional groups in these 
molecules. 

2-18 "Diffusion" sounds slow— and over everyday dis- 
tances it is— but on the scale of a cell it is very fast. The aver- 
age instantaneous velocity of a particle in solution, that is, 
the velocity between collisions, is 

v={kTlm) 1/2 

where k = 1.38 x 10" 16 g cm 2 /K sec 2 , T = temperature in K 
(37°C is 310 K), m = mass in g/molecule. 
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Figure Q2-2 Three molecules that illustrate the 
seven most common functional groups in 
biology (Problem 2-17). 1,3-Bisphosphoglycerate 
and pyruvate are intermediates in glycolysis and 
cysteine is an amino acid. 
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1,3-bisphosphoglycerate pyruvate cysteine 



Calculate the instantaneous velocity of a water molecule 
(molecular mass =18 daltons), a glucose molecule (molecu- 
lar mass = 180 daltons), and a myoglobin molecule (molec- 
ular mass = 15,000 daltons) at 37°C. Just for fun, convert 
these numbers into kilometers/hour. Before you do any cal- 
culations, try to guess whether the molecules are moving at 
a slow crawl (<1 km/hr), an easy walk (5 km/hr), or a record- 
setting sprint (40 km/hr). 

2-19 Polymerization of tubulin subunits into micro- 
tubules occurs with an increase in the orderliness of the 
subunits (Figure Q2-3). Yet tubulin polymerization occurs 
with an increase in entropy (decrease in order). How can 
that be? 



POLYMERIZATION 




Figure Q2-3 Polymerization of tubulin subunits into a microtubule 
(Problem 2-1 9). The fates of one subunit [shaded) and its associated 
water molecules {small spheres) are shown. 



2-20 A 70-kg adult human (154 lb) could meet his or her 
entire energy needs for one day by eating 3 moles of glucose 
(540 g). (We don't recommend this.) Each molecule of glu- 
cose generates 30 ATP when it is oxidized to CO2. The con- 
centration of ATP is maintained in cells at about 2 mM, and 
a 70-kg adult has about 25 liters of intracellular fluid. Given 
that the ATP concentration remains constant in cells, calcu- 
late how many times per day, on average, each ATP molecule 
in the body is hydrolyzed and resynthesized. 

2-21 Assuming that there are 5 x 10 13 cells in the human 
body and that ATP is turning over at a rate of 10 9 ATP per 
minute in each cell, how many watts is the human body 
consuming? (A watt is a joule per second, and there are 4.18 
joules/calorie.) Assume that hydrolysis of ATP yields 12 
kcal/mole. 

2-22 Does a Snickers™ candy bar (65 g, 325 kcal) provide 
enough energy to climb from Zermatt (elevation 1660 m) to 
the top of the Matterhorn (4478 m, Figure Q2-4), or might 
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Figure Q2-4The 
Matterhorn (Problem 
2-22). (Courtesy of 
Zermatt Tourism.) 



you need to stop at Hornli Hut (3260 m) to eat another one? 
Imagine that you and your gear have a mass of 75 kg, and 
that all of your work is done against gravity (that is, you are 
just climbing straight up). Remember from your introduc- 
tory physics course that 

work (J) = mass (kg) x g (m/sec 2 ) x height gained (m) 

where gis acceleration due to gravity (9.8 m/sec 2 ). One joule 
is 1 kg m 2 /sec 2 and there are 4.18 kj per kcal. 

What assumptions made here will greatly underestimate 
how much candy you need? 

2-23 At first glance, fermentation of pyruvate to lactate 
appears to be an optional add-on reaction to glycolysis. After 
all, could cells growing in the absence of oxygen not simply 
discard pyruvate as a waste product? In the absence of fer- 
mentation, which products derived from glycolysis would 
accumulate in cells under anaerobic conditions? Could the 
metabolism of glucose via the glycolytic pathway continue in 
the absence of oxygen in cells that cannot carry out fermen- 
tation? Why or why not? 

2-24 In the absence of oxygen, cells consume glucose at a 
high, steady rate. When oxygen is added, glucose consump- 
tion drops precipitously and is then maintained at the lower 
rate. Why is glucose consumed at a high rate in the absence 
of oxygen and at a low rate in its presence? 

2-25 The liver provides glucose to the rest of the body 
between meals. It does so by breaking down glycogen, form- 
ing glucose 6-phosphate in the penultimate step. Glucose 6- 
phosphate is converted to glucose by splitting off the phos- 
phate (AG° = -3.3 kcal/mole). Why do you suppose the liver 
removes the phosphate by hydrolysis, rather than reversing 
the reaction by which glucose 6-phosphate (G6P) is formed 
from glucose (glucose + ATP -> G6P + ADP, AG° = -4.0 
kcal/mole)? By reversing this reaction the liver could gener- 
ate both glucose and ATP. 

2-26 In 1904 Franz Knoop performed what was probably 
the first successful labeling experiment to study metabolic 
pathways. He fed many different fatty acids labeled with a 
terminal benzene ring to dogs and analyzed their urine for 
excreted benzene derivatives. Whenever the fatty acid had 
an even number of carbon atoms, phenylacetate was 
excreted (Figure Q2-5A). Whenever the fatty acid had an 
odd number of carbon atoms, benzoate was excreted (Fig- 
ure Q2-5B). 

From these experiments Knoop deduced that oxidation of 
fatty acids to CO2 and H2O involved the removal of two -car- 
bon fragments from the carboxylic acid end of the chain. 
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Figure Q2-5 The original labeling experiment to analyze fatty acid 
oxidation (Problem 2-26). (A) Fed and excreted derivatives of an 
even-number fatty acid chain. (B) Fed and excreted derivatives of an 
odd-number fatty acid chain. 



Can you explain the reasoning that led him to conclude that 
two -carbon fragments, as opposed to any other number, 
were removed, and that degradation was from the carboxylic 
acid end, as opposed to the other end? 

2-27 Pathways for synthesis of amino acids in microor- 
ganisms were worked out in part by cross-feeding experi- 
ments among mutant organisms that were defective for 
individual steps in the pathway. Results of cross-feeding 
experiments for three mutants defective in the tryptophan 
pathway— TrpB~, TrpLr, and TrpEr—diXe shown in Figure 
Q2-6. The mutants were streaked on a Petri dish and 
allowed to grow briefly in the presence of a very small 
amount of tryptophan, producing three pale streaks. As 
shown, heavier growth was observed at points where some 
streaks were close to other streaks. These spots of heavier 
growth indicate that one mutant can cross-feed (supply an 
intermediate) to the other one. 

From the pattern of cross-feeding shown in Figure Q2-6, 
deduce the order of the steps controlled by the products of 
the TrpB, TrpD, and TrpE genes. Explain your reasoning. 



CROSS-FEEDING RESULT 



TrpE~ 



TrpD~ 



TrpB~ 



Figure Q2-6 Defining the pathway for tryptophan synthesis using 
cross-feeding experiments (Problem 2-27). Results of a cross- 
feeding experiment among mutants defective for steps in the 
tryptophan biosynthetic pathway. Dark areas on the Petri dish show 
regions of cell growth. 
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CARBON SKELETONS 

Carbon has a unique role in the cell because of its 
ability to form strong covalent bonds with other 
carbon atoms. Thus carbon atoms can join to form 

chains. 
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COVALENT BONDS 

A covalent bond forms when two atoms come very close 
together and share one or more of their electrons. In a single 
bond one electron from each of the two atoms is shared; in 
a double bond a total of four electrons are shared. 

Each atom forms a fixed number of covalent bonds in a 
defined spatial arrangement. For example, carbon forms four 
single bonds arranged tetrahedrally, whereas nitrogen forms 
three single bonds and oxygen forms two single bonds arranged 
as shown below. 
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Double bonds exist and have a different spatial arrangement. 

fife 




Atoms joined by two 
or more covalent bonds 
cannot rotate freely 
around the bond axis. 
This restriction is a 
major influence on the 
three-dimensional shape 
of many macromolecules. 



HYDROCARBONS 



Carbon and hydrogen combine 
together to make stable 
compounds (or chemical groups) 
called hydrocarbons. These are 
nonpolar, do not form 
hydrogen bonds, and are 
generally insoluble in water. 
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ALTERNATING DOUBLE BONDS 

The carbon chain can include double 
bonds. If these are on alternate carbon 
atoms, the bonding electrons move 
within the molecule, stabilizing the 
structure by a phenomenon called 
resonance. 
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these two structures 
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Alternating double bonds in a ring 
can generate a very stable structure. 




often written as 
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part of the hydrocarbon "tail" 
of a fatty acid molecule 
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C-0 CHEMICAL GROUPS 


Many biological compounds contain a carbon 


bonded to an oxygen. For example, 


alcohol H 


The -OH is called a 
hydroxyl group. 
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carboxylk acid The _ C00H is ca |, ed a 

— q carboxyl group. In water 


\ this loses an H + ion to 
become -COO". 


esters Esters are formed by a condensation reaction 


between acid and an alcohol. 
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C-N CHEMICAL GROUPS 

Amines and amides are two important examples of 
compounds containing a carbon linked to a nitrogen. 

Amines in water combine with an H + ion to become 
positively charged. 



H 



H 



1 /x ^ 
C — N + H + ^= 


1 /" 

— C — N— H 


| \ 


1 V 


1 H 


1 H 



Amides are formed by combining an acid and an 
amine. Unlike amines, amides are uncharged in water. 
An example is the peptide bond that joins amino acids 
in a protein. 
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Nitrogen also occurs in several ring compounds, including 
important constituents of nucleic acids: purines and pyrimidines. 
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cytosine (a pyrimidine) 



SULFHYDRYL GROUP The — C-SH is called a sulfhydryl group. In the amino acid cysteine the sulfhydryl group 

i i 



may exist in the reduced form,— C—SH 

i 
or more rarely in an oxidized, cross-bridging form, -C-S-S-C- 



PHOSPHATES 

Inorganic phosphate is a stable ion formed from Phosphate esters can form between a phosphate and a free hydroxyl group, 
phosphoric acid, H 3 P0 4 . It is often written as Pj. Phosphate groups are often attached to proteins in this way. 
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The combination of a phosphate and a carboxyl group, or two or more phosphate groups, gives an acid anhydride. 
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high-energy acyl phosphate 
bond (carboxylic-phosphoric 
acid anhydride) found in 
some metabolites 



phosphoanhydride— a high- 
energy bond found in 
molecules such as ATP 
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WATER 

Two atoms, connected by a covalent bond, may exert different attractions for 
the electrons of the bond. In such cases the bond is polar, with one end 
slightly negatively charged (6~) and the other slightly positively charged (8 + ). 



electropositive 
region 




electronegative 
region 



Although a water molecule has an overall neutral charge (having the same 
number of electrons and protons), the electrons are asymmetrically distributed, 
which makes the molecule polar. The oxygen nucleus draws electrons away 
from the hydrogen nuclei, leaving these nuclei with a small net positive charge. 
The excess of electron density on the oxygen atom creates weakly negative 
regions at the other two corners of an imaginary tetrahedron. 



WATER STRUCTURE 

Molecules of water join together transiently 
in a hydrogen-bonded lattice. Even at 37°C, 
15% of the water molecules are joined to 
four others in a short-lived assembly known 
as a "flickering cluster." 
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The cohesive nature of water is 
responsible for many of its unusual 
properties, such as high surface tension, 
specific heat, and heat of vaporization. 



HYDROGEN BONDS 

Because they are polarized, two 

adjacent H 2 molecules can form H 

a linkage known as a hydrogen 
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hydrogen bond 
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bond. Hydrogen bonds have 

only about 1/20 the strength f 

of a covalent bond. 
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Hydrogen bonds are strongest when H 
the three atoms lie in a straight line. 


hydrogen bond 



HYDROPHILIC MOLECULES 

Substances that dissolve readily in water are termed hydrophilic. They are 
composed of ions or polar molecules that attract water molecules through 
electrical charge effects. Water molecules surround each ion or polar molecule 
on the surface of a solid substance and carry it into solution. 
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Ionic substances such as sodium chloride 
dissolve because water molecules are 
attracted to the positive (Na + ) or negative 
(CI) charge of each ion. 
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Polar substances such as urea 
dissolve because their molecules 
form hydrogen bonds with the 
surrounding water molecules. 



HYDROPHOBIC MOLECULES 

Molecules that contain a preponderance 
of nonpolar bonds are usually insoluble in 
water and are termed hydrophobic. This is 
true, especially, of hydrocarbons, which 
contain many C-H bonds. Water molecules 
are not attracted to such molecules and so 
have little tendency to surround them and 
carry them into solution. 



/ 



H H 

\/ 
C 

A 



H-m 



H 



H 



O 



\ 



-c 

\/ H 

c 

/\ 

H H 

H 



II 
I 
O— Hi O v 



\ b 



, 



If 






CHAPTER 2 PANELS 



WATER AS A SOLVENT 

Many substances, such as household sugar, dissolve in water. That is, their 
molecules separate from each other, each becoming surrounded by water molecules. 
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When a substance dissolves in a 
liquid, the mixture is termed a solution. 
The dissolved substance (in this case 
sugar) is the solute, and the liquid that 
does the dissolving (in this case water) 
is the solvent. Water is an excellent 
solvent for many substances because 
of its polar bonds. 



ACIDS 

Substances that release hydrogen ions into solution 
are called acids. 



HCI 



hydrochloric acid 
(strong acid) 



H + + cr 

hydrogen ion chloride ion 



Many of the acids important in the cell are only partially 
dissociated, and they are therefore weak acids— for example, 
the carboxyl group (-COOH), which dissociates to give a 
hydrogen ion in solution 
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(weak acid) 
Note that this is a reversible reaction. 
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HYDROGEN ION EXCHANGE 

Positively charged hydrogen ions (H + ) can spontaneously 
move from one water molecule to another, thereby creating 
two ionic species. 
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hydronium ion 

(water acting as 

a weak base) 



hydroxyl ion 

(water acting as 

a weak acid) 



often written as: hUO 



i H + + OH" 
hydrogen hydroxyl 



Since the process is rapidly reversible, hydrogen ions are 
continually shuttling between water molecules. Pure water 
contains a steady-state concentration of hydrogen ions and 
hydroxyl ions (both 10" 7 M). 
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For convenience we 
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use the pH scale, where 
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BASES 

Substances that reduce the number of hydrogen ions in 
solution are called bases. Some bases, such as ammonia, 
combine directly with hydrogen ions. 



NH 3 + H + - 
ammonia hydrogen ion 



NH 4 + 
ammonium ion 



Other bases, such as sodium hydroxide, reduce the number of 
H + ions indirectly, by making OH" ions that then combine 
directly with H + ions to make H 2 0. 



NaOH 



sodium hydroxide 
(strong base) 



Na + + OH" 

sodium hydroxyl 



ion 



ion 



Many bases found in cells are partially dissociated and are termed 
weak bases. This is true of compounds that contain an amino 
group (-NH 2 ), which has a weak tendency to reversibly accept 
an H + ion from water, increasing the quantity of free OH" ions. 
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WEAK CHEMICAL BONDS 

Organic molecules can interact with other molecules through three 
types of short-range attractive forces known as noncovalent bonds: 
van der Waals attractions, electrostatic attractions, and hydrogen 
bonds. The repulsion of hydrophobic groups from water is also 
important for ordering biological macromolecules. 




Weak chemical bonds have less than 1/20 the strength of a strong 
covalent bond. They are strong enough to provide tight binding 
only when many of them are formed simultaneously. 



HYDROGEN BONDS 

As already described for water (see Panel 2-2), 
hydrogen bonds form when a hydrogen atom is 
"sandwiched" between two electron-attracting atoms 
(usually oxygen or nitrogen). 



Hydrogen bonds are strongest when the three atoms are 
in a straight line: 
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VAN DER WAALS ATTRACTIONS 

If two atoms are too close together they repel each other 
very strongly. For this reason, an atom can often be 
treated as a sphere with a fixed radius. The characteristic 
"size" for each atom is specified by a unique van der 
Waals radius. The contact distance between any two 
noncovalently bonded atoms is the sum of their van der 
Waals radii. 
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At very short distances any two atoms show a weak 
bonding interaction due to their fluctuating electrical 
charges. The two atoms will be attracted to each other 
in this way until the distance between their nuclei is 
approximately equal to the sum of their van der Waals 
radii. Although they are individually very weak, van der 
Waals attractions can become important when two 
macromolecular surfaces fit very close together, 
because many atoms are involved. 
Note that when two atoms form a covalent bond, the 
centers of the two atoms (the two atomic nuclei) are 
much closer together than the sum of the two van der 
Waals radii. Thus, 






0,4 nm 


0.15 nm 


0.13 nm 


no n on -bonded 
carbon atoms 


single-bonded 
carbons 


double-bonded 
carbons 



Examples in macromolecules: 

Amino acids in polypeptide chains hydrogen-bonded 
together. 
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Two bases, G and C, hydrogen-bonded in DNA or RNA. 
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HYDROGEN BONDS IN WATER 

Any molecules that can form hydrogen bonds to each other 
can alternatively form hydrogen bonds to water molecules. 
Because of this competition with water molecules, the 
hydrogen bonds formed between two molecules dissolved 
in water are relatively weak. 
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HYDROPHOBIC FORCES 
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ELECTROSTATIC ATTRACTIONS 

Attractive forces occur both between fully charged 
groups (ionic bond) and between the partially charged 
groups on polar molecules. 




The force of attraction between the two charges, 5 + 
and 5", falls off rapidly as the distance between the 
charges increases. 

In the absence of water, electrostatic forces are very strong. 
They are responsible for the strength of such minerals as 
marble and agate, and for crystal formation in common 
table salt, NaCI. 
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a crystal of 
salt, NaCI 
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Water forces hydrophobic groups together, 
because doing so minimizes their disruptive 
effects on the hydrogen-bonded water 
network. Hydrophobic groups held 
together in this way are sometimes said 
to be held together by "hydrophobic 
bonds," even though the apparent attraction 
is actually caused by a repulsion from the 
water. 



ELECTROSTATIC ATTRACTIONS IN 
AQUEOUS SOLUTIONS 

Charged groups are shielded by their 
interactions with water molecules. 
Electrostatic attractions are therefore 
quite weak in water. 
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Similarly, ions in solution can cluster around 
charged groups and further weaken 
these attractions. 
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Despite being weakened by water and salt, 
electrostatic attractions are very important in 
biological systems. For example, an enzyme that 
binds a positively charged substrate will often 
have a negatively charged amino acid side chain 
at the appropriate place. 
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MONOSACCHARIDES 

Monosaccharides usually have the general formula (CH 2 0) n , where n can be 3, 4, 5, 6, 7, or 8, and have two or more hydroxyl groups. 
They either contain an aldehyde group (- C \ H ) and are called aldoses or a ketone group ( ^c=o ) and are called ketoses. 
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RING FORMATION 

In aqueous solution, the aldehyde or ketone group of a sugar 
molecule tends to react with a hydroxyl group of the same 
molecule, thereby closing the molecule into a ring. 
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Note that each carbon atom 
has a number. 



ISOMERS 

Many monosaccharides differ only in the spatial arrangement 
of atoms — that is, they are isomers. For example, glucose, 
galactose, and mannose have the same formula (C 6 H 12 6 ) but 
differ in the arrangement of groups around one or two carbon 
atoms. 
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galactose 
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mannose 
These small differences make only minor changes in the 
chemical properties of the sugars. But they are recognized by 
enzymes and other proteins and therefore can have important 
biological effects. 
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a AND (3 LINKS 

The hydroxyl group on the carbon that carries the 
aldehyde or ketone can rapidly change from one 
position to the other. These two positions are 
called a and p. 





p hydroxyl a hydroxyl 

As soon as one sugar is linked to another, the a or 
p form is frozen. 



SUGAR DERIVATIVES 

The hydroxyl groups of a simple CH 2 QH 

monosaccharide can be replaced 
by other groups. For example, 



CH.OH 




OH 



HO 



/V-dcety [glucosamine C = 
CH 3 



HO 




NH, 
glucosamine 



glucuronic acid u ' 
HO 




OH 



OH 



DISACCHARIDES 

The carbon that carries the aldehyde 
or the ketone can react with any 
hydroxyl group on a second sugar 
molecule to form a disaccharide. 
The linkage is called a glycosidic 
bond. 

Three common disaccharides are 

maltose (glucose + glucose) 
lactose (galactose + glucose) 
sucrose (glucose + fructose) 

The reaction forming sucrose is 
shown here. 



a glucose CH 2 OH 




|-1 fructose 



CH-.OH 



CH,OH 



OLIGOSACCHARIDES AND POLYSACCHARIDES 

Large linear and branched molecules can be made from simple repeating sugar 
subunits. Short chains are called oligosaccharides, while long chains are called 
polysaccharides. Glycogen, for example, is a polysaccharide made entirely of 
glucose units joined together. 



branch points 




glycogen 




COMPLEX OLIGOSACCHARIDES 

In many cases a sugar sequence 
is nonrepetitive. Many different 
molecules are possible. Such 
complex oligosaccharides are 
usually linked to proteins or to lipids, 
as is this oligosaccharide, which is 
part of a cell-surface molecule 
that defines a particular blood group. 



CH-.OH 



CH,OH 



CH>OH 




OH 
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COMMON FATTY 
ACIDS 


These are carboxylic acids 
with long hydrocarbon tails. 
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CH 2 

1 
CH 3 


palmitic 

acid 

(C 16 ) 


CH 2 

1 

CH 3 


stearic 
acid (C 18 ) 




oleic 
acid (Cis) 



TRIACYLGLYCEROLS 

i i 



Fatty acids are stored as an energy reserve (fats and 
oils) through an ester linkage to glycerol to form 
triacylglycerols, also known as triglycerides. 



H,C — O 



ML — O 



o 

II 

o 

II 



-OH 
OH 

ii c: 

glycerol 



Hundreds of different kinds of fatty acids exist. Some have one or more double bonds in their 
hydrocarbon tail and are said to be unsaturated. Fatty acids with no double bonds are saturated. 

"O O 

\ • 

C 



This double bond 
is rigid and creates 
a kink in the chain. 
The rest of the chain 
is free to rotate 
about the other C-C 
bonds. 




stearic 
acid 



space-filling model carbon skeleton 

UNSATURATED 



J 
SATURATED 



CARBOXYL GROUP 



PHOSPHOLIPIDS 



Phospholipids are the major constituents 
of cell membranes. 



If free, the carboxyl group of a 
fatty acid will be ionized. 



But more usually it is linked to 
other groups to form either esters 



or amides. 



O 



// 



: ) 




O 



// 



H 



CH, — GH — CH 2 




hydrophilic 
group 



choline 



\ hydrophobic 

"-- fatty acid tails 

space-filling model of 
the phospholipid 
phosphatidylcholine 

In phospholipids two of the -OH groups in glycerol are 
linked to fatty acids, while the third -OH group is linked 
general structure to phosphoric acid. The phosphate is further linked to 
of a phospholipid one of a variety of small polar groups (alcohols). 
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LIPID AGGREGATES 

Fatty acids have a hydrophflic head 
and a hydrophobic taik- 



In water they can form a surface film 
or form small micelles. 




micelle 



POLYISOPRENOIDS 

long-chain polymers of isoprene 

Cr 

I 

o=p— o- 

I 

o 



Their derivatives can form larger aggregates held together by hydrophobic forces: 



Triglycerides can form large spherical 
fat droplets in the cell cytoplasm. 



Phospholipids and glycolipids form self-sealing lipid | 
bilayers that are the basis for all cell membranes. 



C 



200 nm 




OTHER LIPIDS 



Lipids are defined as the water-insoluble 
molecules in cells that are soluble in organic 
solvents. Two other common types of lipids 
are steroids and polyisoprenoids. Both are 
made from isoprene units. 



CH, 



CH 2 



C— CH = CH 2 



isoprene 



STEROIDS 



Steroids have a common multiple-ring structure. 




OH 




cholesterol — found in many membranes 



(> 



testosterone— male steroid hormone 



GLYCOLIPIDS 

Like phospholipids, these compounds are composed of a hydrophobic 
region, containing two long hydrocarbon tails, and a polar region, 
which, however, contains one or more sugars and no phosphate. 

I; 



< H-l 
I r 

H 



galactose 



O 

I 



hydrocarbon tails 



C-NH 

II 
O 



\ 
sugar 



a simple 
glycolipid 



dolichol phosphate — used 
to carry activated sugars 
in the membrane-associated 
synthesis of glycoproteins 
and some polysaccharides 
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NUCLEOTIDES 

A nucleotide consists of a nitrogen-containing 
base, a five-carbon sugar, and one or more 
phosphate groups. 



V/S 







Nucleotides 
are the 

subunits of 
the nucleic arids 



PHOSPHATES 

The phosphates are normally joined to 
the C5 hydroxyl of the ribose or 
deoxyribose sugar (designated 5'). Mono-, 
di-, and triphosphates are common. 
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as in 
AMP 



HEHjj 

I 



as in 
ADP 



-P — O— CH 2 



O" 



I 



ATP 



The phosphate makes a nucleotide 
negatively charged. 



BASIC SUGAR 
LINKAGE 



W-glycosidic 
bond 




The base is linked to 
the same carbon (C1) 
used in sugar-sugar 
bonds. 



BASES 



NH 2 



HC' 



U 



HC] ^N 

(2 cytosine 

HC X. 



N 
H 



'NH 



uracil 



%, 



O 



II T | 

thymine N ^O 

H 



The bases are nitrogen-containing ring 
compounds, either pyrimidines or purines. 




PYRIMIDINE 




PURINE 



ST 

h 



NH, 



adenine N^ ^ '^^ 

Hcf f| A 



', 1 1 



o 



X . 



A? C NH 

\ I G ' 

guanine 'm^ X. f^ " 

H N ^NH 2 



SUGARS 



PENTOSE 

a five-carbon sugar 




HOCH, 



two kinds are used 




HOCH 



Each numbered carbon on the sugar of a nucleotide is 
followed by a prime mark; therefore, one speaks of the 
"5-prime carbon," etc. 




[3-D- ribose 

used in ribonucleic acid 



p-o-2-deoxy ribose 

used in deoxyribonucleic acid 
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NOMENCLATURE 



BASE 


NUCLEOSIDE 


AB8R. 


adenine 


adenosine 


A 


guanine 


guanoslne 


G 


cytosine 


cytidine 


C 


uracil 


uridine 


u 


thymine 


thymidine 


T 



A nucleoside or nucleotide is named according to its nitrogenous base 
Single letter abbreviations are used variously as 
shorthand for (1) the base alone, (2) the 
nucleoside, or (3) the whole nucleotide — 
the context will usually make clear which of 
the three entities is meant. When the context 
is not sufficient, we will add the terms "base", 
"nucleoside", "nucleotide", or — as in the 
examples below — use the full 3-letter nucleotide 
code. 

AMP = adenosine monophosphate 
dAMP = deoxyadenosine monophosphate 
UDP = uridine diphosphate 
ATP = adenosine triphosphate 




BASE + SUGAR = NUCLEOSIDE 




BASE + SUGAR + PHOSPHATE = NUCLEOTIDE 



NUCLEIC ACIDS 
Nucleotides are joined together by a 
phosphodiester linkage between 5' and 
3' carbon atoms to form nucleic acids. 
The linear sequence of nucleotides in a 
nucleic acid chain is commonly 
abbreviated by a one-letter code, 
A— G— C— T— T— A— C— A, with the 5' 
end of the chain at the left. 




O 5' end of chain base 

II 5' 

"O— P— O — Chn 

o- 




phosphodtester 

linkage - Q _ P =o 



example: DNA 




r OH 

y end of chsm 



NUCLEOTIDES HAVE MANY OTHER FUNCTIONS 
) They carry chemical energy in their easily hydrolyzed phosphoanhydride bonds. 

phosphoanhydride bonds 



O 



O 

II 



o 



"O— P — O— P — O— P — O — CH 



O" 



O" 
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example: ATP (or E 1) 



OH OH 



2 They combine with other groups to form coenzymes. 



HH OHH OHCH 3 H O O 

II II I I II I II II II 

HS-C-C-N-C-C-C-N-C-C-C— C-O— P— O— P— O— CH 2 

I I I I I I I I I I I 

H H H H H H HO CH 3 H O" O" 



example: coenzyme A (CoA) 



3 They are used as specific signaling molecules in the cell. 



example: cyclic AMP (cAMP) 
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THE IMPORTANCE OF FREE ENERGY FOR CELLS 
Life is possible because of the complex network of interacting 
chemical reactions occurring in every cell. In viewing the 
metabolic pathways that comprise this network, one might 
suspect that the cell has had the ability to evolve an enzyme to 
carry out any reaction that it needs. But this is not so. Although 
enzymes are powerful catalysts, they can speed up only those 
reactions that are thermodynamically possible; other reactions 
proceed in cells only because they are coupled to very favorable 
reactions that drive them. The question of whether a reaction 



can occur spontaneously, or instead needs to be coupled to 
another reaction, is central to cell biology. The answer is 
obtained by reference to a quantity called the free energy: the 
total change in free energy during a set of reactions determines 
whether or not the entire reaction sequence can occur. In this 
panel we shall explain some of the fundamental ideas — derived 
from a special branch of chemistry and physics called thermo- 
dynamics — that are required for understanding what free 
energy is and why it is so important to cells. 



ENERGY RELEASED BY CHANGES IN CHEMICAL BONDING IS CONVERTED INTO HEAT 




SEA 



UNIVERSE 



An enclosed system is defined as a collection of molecules that 
does not exchange matter with the rest of the universe (for 
example, the "cell in a box" shown above). Any such system will 
contain molecules with a total energy E . This energy will be 
distributed in a variety of ways: some as the translational energy 
of the molecules, some as their vibrational and rotational energies, 
but most as the bonding energies between the individual atoms 
that make up the molecules. Suppose that a reaction occurs in 
the system. The first law of thermodynamics places a constraint 
on what types of reactions are possible: it states that "in any 
process, the total energy of the universe remains constant." 
For example, suppose that reaction A -> B occurs somewhere in 
the box and releases a great deal of chemical bond energy. This 
energy will initially increase the intensity of molecular motions 
(translational, vibrational, and rotational) in the system, which 
is equivalent to raising its temperature. However, these increased 
motions will soon be transferred out of the system by a series 



of molecular collisions that heat up first the walls of the box 
and then the outside world (represented by the sea in 
our example). In the end, the system returns to its initial 
temperature, by which time all the chemical bond energy 
released in the box has been converted into heat energy and 
transferred out of the box to the surroundings. According to 
the first law, the change in the energy in the box (A£ box , which 
we shall denote as AE) must be equal and opposite to the 
amount of heat energy transferred, which we shall designate 
as h: that is, AE = ~h. Thus, the energy in the box (£) decreases 
when heat leaves the system. 

E also can change during a reaction as a result of work being 
done on the outside world. For example, suppose that there is 
a small increase in the volume (AV) of the box during a reaction.) 
Since the walls of the box must push against the constant 
pressure (P) in the surroundings in order to expand, this does 
work on the outside world and requires energy. The energy 
used is P(AV), which according to the first law must decrease 
the energy in the box (£) by the same amount. In most reactions | 
chemical bond energy is converted into both work and heat. 
Enthalpy (H) is a composite function that includes both of these | 
(H = E + PV). To be rigorous, it is the change in enthalpy 
(AH) in an enclosed system, and not the change in energy, that 
is equal to the heat transferred to the outside world during a 
reaction. Reactions in which H decreases release heat to the 
surroundings and are said to be "exothermic," while reactions 
in which H increases absorb heat from the surroundings and 
are said to be "endothermic." Thus, -h = AH. However, the 
volume change is negligible in most biological reactions, so to 
a good approximation 

-h = AH*AE 



THE SECOND LAW OF THERMODYNAMICS 

Consider a container in which 1000 coins are all lying heads up. 
If the container is shaken vigorously, subjecting the coins to 
the types of random motions that all molecules experience due 
to their frequent collisions with other molecules, one will end 
up with about half the coins oriented heads down. The 
reason for this reorientation is that there is only a single way in 
which the original orderly state of the coins can be reinstated 
(every coin must lie heads up), whereas there are many different 
ways (about 10 298 ) to achieve a disorderly state in which there is 
an equal mixture of heads and tails; in fact, there are more ways 



to achieve a 50-50 state than to achieve any other state. Each 
state has a probability of occurrence that is proportional to the | 
number of ways it can be realized. The second law of thermo- 
dynamics states that "systems will change spontaneously from 
states of lower probability to states of higher probability." 
Since states of lower probability are more "ordered" than 
states of high probability, the second law can be restated: 
"the universe constantly changes so as to become more 
disordered." 
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THE ENTROPY, S 

The second law (but not the first law) allows one to predict the 
direction of a particular reaction. But to make it useful for this 
purpose, one needs a convenient measure of the probability or, 
equivalently, the degree of disorder of a state. The entropy (S) 
is such a measure. It is a logarithmic function of the probability 
such that the change in entropy (AS) that occurs when the 
reaction A — > B converts one mole of A into one mole of B is 

AS= R In p B /p A 

where p A and p B are the probabilities of the two states A and B, 
R is the gas constant (2 cal deg" 1 mole" 1 ), and AS is measured 
in entropy units (eu). In our initial example of 1000 coins, the 
relative probability of all heads (state A) versus half heads and 
half tails (state B) is equal to the ratio of the number of different 
ways that the two results can be obtained. One can calculate 
that p A = 1 and p B = 10001(500! x 500!) = 10 299 . Therefore, 
the entropy change for the reorientation of the coins when their 



container is vigorously shaken and an equal mixture of heads 
and tails is obtained is/? In (10 298 ), or about 1370 eu per mole of 
such containers (6 x 10 23 containers). We see that because AS 
defined above is positive for the transition from state A to 
state B (Pb/Pa> 1)# reactions with a large increase in S (that is, 
for which AS > 0) are favored and will occur spontaneously. 
As discussed in Chapter 2, heat energy causes the random 
commotion of molecules. Because the transfer of heat from an 
enclosed system to its surroundings increases the number of 
different arrangements that the molecules in the outside world 
can have, it increases their entropy. It can be shown that the 
release of a fixed quantity of heat energy has a greater disor- 
dering effect at low temperature than at high temperature, and 
that the value of AS for the surroundings, as defined above 
(AS sea ), is precisely equal to n, the amount of heat transferred to 
the surroundings from the system, divided by the absolute 
temperature (7): 

AS^ a -h/T 



THE GIBBS FREE ENERGY, G 

When dealing with an enclosed biological system, one would 
like to have a simple way of predicting whether a given reaction 
will or will not occur spontaneously in the system. We have 
seen that the crucial question is whether the entropy change for 
the universe is positive or negative when that reaction occurs. 
In our idealized system, the cell in a box, there are two separate 
components to the entropy change of the universe — the entropy 
change for the system enclosed in the box and the entropy 
change for the surrounding "sea" — and both must be added 
together before any prediction can be made. For example, it is 
possible for a reaction to absorb heat and thereby decrease the 
entropy of the sea (AS sea < 0) and at the same time to cause 
such a large degree of disordering inside the box (AS box > 0) 
that the total A5 universe = A5 sea + AS box is greater than 0. In this 
case the reaction will occur spontaneously, even though the 
sea gives up heat to the box during the reaction. An example of 
such a reaction is the dissolving of sodium chloride in a beaker 
containing water (the "box"), which is a spontaneous process 
even though the temperature of the water drops as the salt 
goes into solution. 

Chemists have found it useful to define a number of new 
"composite functions" that describe combinations of physical 
properties of a system. The properties that can be combined 
include the temperature (7"), pressure (P), volume (V), energy 
(£), and entropy (S). The enthalpy (H) is one such composite 
function. But by far the most useful composite function for 
biologists is the Gibbs free energy, G. It serves as an accounting 
device that allows one to deduce the entropy change of the 
universe resulting from a chemical reaction in the box, while 
avoiding any separate consideration of the entropy change in 
the sea. The definition of G is 

where, for a box of volume V, H is the enthalpy described above 
(E + PV), T is the absolute temperature, and S is the entropy. 
Each of these quantities applies to the inside of the box only. The 
change in free energy during a reaction in the box (the G of the 
products minus the G of the starting materials) is denoted as AG 
and, as we shall now demonstrate, it is a direct measure of the 
amount of disorder that is created in the universe when the 
reaction occurs. 



At constant temperature the change in free energy (AG) 
during a reaction equals AH- TAS. Remembering that AH = 
-h, the heat absorbed from the sea, we have 

-AG - -AH * TAS 
-AG = h + TAS, SO -&GtT= h/T-h AS 

But NT is equal to the entropy change of the sea (AS sea ), and 
the AS in the above equation is A5 box . Therefore 

-AG/T= AS^a t A5 boK = A5 uriverse 

We conclude that the free-energy change is a direct measure 
of the entropy change of the universe. A reaction will proceed 
in the direction that causes the change in the free energy (AG) 
to be less than zero, because in this case there will be a positive 
entropy change in the universe when the reaction occurs. 

For a complex set of coupled reactions involving many 
different molecules, the total free-energy change can be com- 
puted simply by adding up the free energies of all the different 
molecular species after the reaction and comparing this value 
with the sum of free energies before the reaction; for common 
substances the required free-energy values can be found from 
published tables. In this way one can predict the direction of 
a reaction and thereby readily check the feasibility of any 
proposed mechanism. Thus, for example, from the observed 
values for the magnitude of the electrochemical proton gradient | 
across the inner mitochondrial membrane and the AG for ATP 
hydrolysis inside the mitochondrion, one can be certain that ATP I 
synthase requires the passage of more than one proton for each 
molecule of ATP that it synthesizes. 

The value of AG for a reaction is a direct measure of how far 
the reaction is from equilibrium. The large negative value for 
ATP hydrolysis in a cell merely reflects the fact that cells keep 
the ATP hydrolysis reaction as much as 10 orders of magnitude 
away from equilibrium. If a reaction reaches equilibrium, 
AG = 0, the reaction then proceeds at precisely equal rates 
in the forward and backward direction. For ATP hydrolysis, 
equilibrium is reached when the vast majority of the ATP 
has been hydrolyzed, as occurs in a dead cell. 



1 20 PANEL 2-8: Details of the 10 Steps of Glycolysis 



For each step, the part of the molecule that undergoes a change is shadowed in blue, 
and the name of the enzyme that catalyzes the reaction is in a yellow box. 



STEP 1 Glucose is 
phosphorylated by ATP to 
form a sugar phosphate. 
The negative charge of the 
phosphate prevents passage 
of the sugar 
phosphate through the 
plasma membrane, 
trapping glucose inside 
the cell. 



CH ,OH 




CH-,0 P 



hexo kinase 



OH 



OH 



glucose 



HO 




-1- ADP + H* 



uii 



OH 

glucose 6-phosphate 



STEP 2 A readily 
reversible rearrange- 
ment of the chemical 
structure 
{ isomer Izatf on) 
moves the 
carbonyl oxygen 
from carbon 1 to 
carbon 2, forming 
a ketose from an 
aldose sugar (See 
Panel 2-4.) 



O. 
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H — C — OH 
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HO— C — H 



H — C — OH 
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H — C — OH 
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6 CH 2°'£ 
(open chain form) 
glucose 6 -phosphate 



phosphoglucose 
iso me rase 



,CH 2 OH 
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x=o 



HO— C — H 



H — C- 

H — C- 



P DM (. 



-Ol f 
-OH 




CH 2 OH 



6 CH 2 0^ 
(open chain form) 

fructose 6- phosphate 



STEP 3 The new hydroxyl 
group on carbon 1 is 
phosphorylated by ATP, in 
preparation for the formation 
of two three-carbon sugar 
phosphates. The entry of sugars 
into glycolysis is controlled at this 
step, through regulation of the 
enzyme phosphofructokinase. 



!>OH 2 C 




phosphofructokinase 

+ 



P OH 2 C o CH 2 P 




HO 

OH 
OH 
fructose 1,6-bisphosphate 



+ ADP + H + 



STEP 4 The 
six-carbon sugar is 
cleaved to produce 
two three-carbon 
molecules. Only the 
glyceraldehyde 
3-phosphate can 
proceed immediately 
through glycolysis 



P mH,<. 
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CH 2 P 

(open chain form) 
fructose 1,6-bisphosphate 



aldolase 
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dihydroxyacetone 
phosphate 



H O 
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I 
H — C — OH 

I 
CH 2 P 

glyceraldehyde 
3-phosphate 



STEP 5 The other 
product of step 4, 
dihydroxyacetone 
phosphate, is 
isomerized to form 
glyceraldehyde 
3-phosphate. 



triose phosphate isomerase 



dihydroxyacetone phosphate 



CH 2 P 
glyceraldehyde 3-phosphate 
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STEP 6 The two molecules 

of glyceraldehyde 3-phosphate 
are oxidized. The energy 
generation phase of glycolysis 
begins, as NADH and a new 
high-energy anhydride linkage 
to phosphate are formed {see 
Figure 2-73) 
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I- 



-OH 



CH-rO-P- 



glyceraldehyde 3-phosphate 
dehydrogenase 



+ tWMTj + P 



glyceraldehyde 3-phosphate 



C 

I 
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CH 2 P 
1,3-bisphosphoglycerate 



STEP 7 The transfer 
to ADPof the 
high-energy phosphate 


C 

H — C — OH + 

I 

CH 2 P 

1,3-bisphosphoglycerate 


phosphogly cerate kinase 


y . 

H — C^OH 
S-phosphoglycerate 




group that was 
generated in step 6 
forms ATP. 







STEP 8 The remaining 
phosphate ester linkage in 
3-phosphoglycerate, which has 
a relatively low free energy of 
hydrolysis, is moved from 
carbon 3 to carbon 2 to form 
2-phosphoglycerate. 
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H — C— OH 

2 | 
3 CH 2 P> 

3-phosphoglycerate 
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I 

h— c— oy> 

CH 2 OH 
2-phosphoglycerate 


phosphoglycerate 


mutase 















STEP 9 The removal of 
water from 2-phosphoglycerate 
creates a high-energy enol 
phosphate linkage. 
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I 
H™C — O P 

I 
CH 2 OH 

2-phosphoglycerate 



enolase 



C 

C — O P + H 2 

II 
CH 2 

phosphoenol pyruvate 



STEP 10 The transfer to 
ADP of the high-energy 
phosphate group that was 
generated in step 9 forms 
ATP, completing glycolysis. 
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pyruvate kinase 
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phosphoenolpyruvate 
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CH 3 
pyruvate 



NET RESULT OF GLYCOLYSIS 
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glucose 
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In addition to the pyruvate, the net products are 
two molecules of ATP and two molecules of NADH 
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two molecules 
of pyruvate 
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pyruvate 



HS-CoA 

acetyl CoA (2C) 
S-CoA 

HS-CoA 



The complete citric acid cycle. The two 
carbons from acetyl CoA that enter this 
turn of the cycle (shadowed in tQd ) will 
be converted to C0 2 in subsequent turns 
of the cycle: it is the two carbons 
shadowed in blue that are converted to 
C0 2 in this cycle. 



next cycle 
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oxaloacetate (4C) 



Step 1 
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Details of the eight steps are shown below. For each step, the part of the molecule that undergoes a change is shadowed in blue; 
and the name of the enzyme that catalyzes the reaction is in a yellow box. 



STEP 1 After the enzyme 
removes a proton from the 
CH 3 group on acetyl CoA, 
the negatively charged 
CH 2 ~ forms a bond to a 
carbonyl carbon of 
oxaloacetate. The 
subsequent loss by 
hydrolysis of the coenzyme 
A (CoA) drives the reaction 
strongly forward. 
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STEP 2 An isomerization 
reaction, in which water is 
first removed and then 
added back, moves the 
hydroxyl group from one 
carbon atom to its neighbor. 
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STEP 3 In the first of 
four oxidation steps in the 
cycle, the carbon carrying 
the hydroxyl group is 
converted to a carbonyl 
group. The immediate 
product is unstable, losing 
C0 2 while still bound to 
the enzyme. 
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STEP 4 The a-ketoglutarate 
dehydrogenase complex closely 
resembles the large enzyme 
complex that converts pyruvate 
to acetyl CoA {pyruvate 
dehydrogenase). It likewise 
catalyzes an oxidation that 
produces NADH, C0 2 , and a 
high-energy thioester bond to 
coenzyme A (CoA). 
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STEP 5 A phosphate 
molecule from solution 
displaces the CoA, forming a 
high-energy phosphate 
linkage to succinate. This 
phosphate is then passed to 
GDP to form GTP. (In bacteria 
and plants, ATP is formed 
instead.) 
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STEP 6 In the third 
oxidation step in the cycle, FAD 
removes two hydrogen atoms 
from succinate. 
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STEP 7 The addition of 
water to fumarate places a 
hydroxyl group next to a 
carbonyl carbon. 
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STEP 8 In the last of four 
oxidation steps in the cycle, the 
carbon carrying the hydroxyl 
group is converted 
to a carbonyl group, 
regenerating the oxaloacetate 
needed for step 1. 
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Chapter 3 



Proteins 




When we look at a cell through a microscope or analyze its electrical or bio- 
chemical activity, we are, in essence, observing proteins. Proteins constitute 
most of a cell's dry mass. They are not only the cell's building blocks; they also 
execute nearly all the cell's functions. Thus, enzymes provide the intricate 
molecular surfaces in a cell that promote its many chemical reactions. Proteins 
embedded in the plasma membrane form channels and pumps that control the 
passage of small molecules into and out of the cell. Other proteins carry mes- 
sages from one cell to another, or act as signal integrators that relay sets of sig- 
nals inward from the plasma membrane to the cell nucleus. Yet others serve as 
tiny molecular machines with moving parts: kinesin, for example, propels 
organelles through the cytoplasm; topoisomerase can untangle knotted DNA 
molecules. Other specialized proteins act as antibodies, toxins, hormones, 
antifreeze molecules, elastic fibers, ropes, or sources of luminescence. Before we 
can hope to understand how genes work, how muscles contract, how nerves 
conduct electricity, how embryos develop, or how our bodies function, we must 
attain a deep understanding of proteins. 



In This Chapter 

THE SHAPE AND 125 

STRUCTURE OF PROTEINS 



PROTEIN FUNCTION 
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THE SHAPE AND STRUCTURE OF PROTEINS 

From a chemical point of view, proteins are by far the most structurally complex 
and functionally sophisticated molecules known. This is perhaps not surprising, 
once we realize that the structure and chemistry of each protein has been devel- 
oped and fine-tuned over billions of years of evolutionary history. Yet, even to 
experts, the remarkable versatility of proteins can seem truly amazing. 

In this section, we consider how the location of each amino acid in the long 
string of amino acids that forms a protein determines its three-dimensional 
shape. Later in the chapter, we use this understanding of protein structure at the 
atomic level to describe how the precise shape of each protein molecule deter- 
mines its function in a cell. 



The Shape of a Protein Is Specified by Its Amino Acid Sequence 

There are 20 types of amino acids in proteins, each with different chemical prop- 
erties. A protein molecule is made from a long chain of these amino acids, each 
linked to its neighbor through a covalent peptide bond. Proteins are therefore 
also known as polypeptides. Each type of protein has a unique sequence of 
amino acids, and there are many thousands of different proteins, each with its 
own particular amino acid sequence. 

The repeating sequence of atoms along the core of the polypeptide chain is 
referred to as the polypeptide backbone. Attached to this repetitive chain are 
those portions of the amino acids that are not involved in making a peptide 
bond and that give each amino acid its unique properties: the 20 different amino 
acid side chains (Figure 3-1). Some of these side chains are nonpolar and 
hydrophobic ("water- fearing"), others are negatively or positively charged, some 
readily form covalent bonds, and so on. Panel 3-1 (pp. 128-129) shows their 
atomic structures and Figure 3-2 lists their abbreviations. 
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Figure 3-1 The components 
of a protein. A protein 
consists of a polypeptide 
backbone with attached side 
chains. Each type of protein 
differs in its sequence and 
number of amino acids; 
therefore, it is the sequence 
of the chemically different 
side chains that makes each 
carboxyl terminus protein distinct. The two ends 
or C-terminus f a polypeptide chain are 

chemically different: the end 
carrying the free amino group 
(NH 3 + , also written NH 2 ) is the 
amino terminus, or 
N-terminus, and that carrying 
the free carboxyl group 
(COO - , also written COOH) is 
the carboxyl terminus or 
C-terminus. The amino acid 
sequence of a protein is 
always presented in the 
N-to-C direction, reading 
from left to right. 
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As discussed in Chapter 2, atoms behave almost as if they were hard spheres 
with a definite radius (their van derWaals radius). The requirement that no two 
atoms overlap limits greatly the possible bond angles in a polypeptide chain 
(Figure 3-3). This constraint and other steric interactions severely restrict the 
possible three-dimensional arrangements of atoms (or conformations). Never- 
theless, a long flexible chain, such as a protein, can still fold in an enormous 
number of ways. 

The folding of a protein chain is, however, further constrained by many dif- 
ferent sets of weak noncovalent bonds that form between one part of the chain 
and another. These involve atoms in the polypeptide backbone, as well as atoms 
in the amino acid side chains. There are three types of weak bonds: hydrogen 
bonds, electrostatic attractions, and van der Waals attractions, as explained in 
Chapter 2 (see p. 54). Individual noncovalent bonds are 30-300 times weaker 
than the typical covalent bonds that create biological molecules. But many weak 
bonds acting in parallel can hold two regions of a polypeptide chain tightly 
together. In this way, the combined strength of large numbers of such noncova- 
lent bonds determines the stability of each folded shape (Figure 3-4). 
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Figure 3-2 The 20 amino 
acids found in proteins. Each 
amino acid has a three-letter 
and a one-letter abbreviation. 
There are equal numbers of 
polar and nonpolar side 
chains; however, some side 
chains listed here as polar are 
large enough to have some 
non-polar properties (for 
example, Tyr, Thr, Arg, Lys). For 
atomic structures, see Panel 
3-1 (pp. 128-129). 



POLAR AMINO ACIDS 



NONPOLAR AMINO ACIDS 



A fourth weak force also has a central role in determining the shape of a pro- 
tein. As described in Chapter 2, hydrophobic molecules, including the nonpolar 
side chains of particular amino acids, tend to be forced together in an aqueous 
environment in order to minimize their disruptive effect on the hydrogen- 
bonded network of water molecules (see p. 54 and Panel 2-2, pp. 108-109). 
Therefore, an important factor governing the folding of any protein is the distri- 
bution of its polar and nonpolar amino acids. The nonpolar (hydrophobic) side 
chains in a protein — belonging to such amino acids as phenylalanine, leucine, 
valine, and tryptophan — tend to cluster in the interior of the molecule (just as 
hydrophobic oil droplets coalesce in water to form one large droplet). This 
enables them to avoid contact with the water that surrounds them inside a cell. 
In contrast, polar groups — such as those belonging to arginine, glutamine, and 
histidine — tend to arrange themselves near the outside of the molecule, where 
they can form hydrogen bonds with water and with other polar molecules (Fig- 
ure 3-5). Polar amino acids buried within the protein are usually hydrogen- 
bonded to other polar amino acids or to the polypeptide backbone. 



(A) 



amino acid 



(B) +180 




N" , H VX' 



H 

N : 



O 



peptide bonds 



il 
I 



pSl 




+ 180 



Figure 3-3 Steric limitations on the bond angles in a polypeptide chain. (A) Each amino acid contributes three bonds 
(red) to the backbone of the chain. The peptide bond is planar (gray shading) and does not permit rotation. By contrast, 
rotation can occur about the C a -C bond, whose angle of rotation is called psi (\|/), and about the N-C a bond, whose angle 
of rotation is called phi ((J)). By convention, an R group is often used to denote an amino acid side chain (green circles). 
(B) The conformation of the main-chain atoms in a protein is determined by one pair of § and \j/ angles for each amino acid; 
because of steric collisions between atoms within each amino acid, most pairs of and \|/ angles do not occur. In this so- 
called Ramachandran plot, each dot represents an observed pair of angles in a protein. The cluster of dots in the bottom 
left quadrant represents all of the amino acids that are located in cc-helix structures (see Figure 3-7 A). (B, from 
J. Richardson, Adv. Prot Chem. 34:1 74-1 75, 1 981 . With permission from Academic Press.) 
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The general formula of an amino acid is 
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Proteins consist exclusively of L-amino acids. 



FAMILIES OF 
AMINO ACIDS 
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These nitrogens have a 
relatively weak affinity for an 
H + and are only partly positive 
at neutral pH. 



PEPTIDE BONDS 

Amino acids are commonly joined together by an amide linkage, 
called a peptide bond. 
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Peptide bond: The four atoms in each gray box form a rigid 
planar unit. There is no rotation around the C-N bond. 
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Proteins are long polymers 
of amino acids linked by 
peptide bonds, and they 
are always written with the 
N-terminus toward the left. 
The sequence of this tripeptide 
is histidine-cysteine-valine. 
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These two single bonds allow rotation, so that long chains of 
amino acids are very flexible. 
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Although the amide N is not charged at 
neutral pH, it is polar. 
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The -OH group is polar. 



Disulfide bonds can form between two cysteine side chains 
in proteins. 
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Proteins Fold into a Conformation of Lowest Energy 

As a result of all of these interactions, most proteins have a particular three- 
dimensional structure, which is determined by the order of the amino acids in its 
chain. The final folded structure, or conformation, of any polypeptide chain is 
generally the one that minimizes its free energy. Biologists have studied protein 
folding in a test tube by using highly purified proteins. Treatment with certain 
solvents, which disrupt the noncovalem interactions holding the folded chain 
together, unfolds, or denatures, a protein. This treatment converts the protein 
into a flexible polypeptide chain that has lost its natural shape. When the dena- 
turing solvent is removed, the protein often refolds spontaneously or renatures t 
into its original conformation (Figure 3^6) > This indicates thai the amino acid 
sequence contains all the information needed for specifying the three-dimen- 
sional shape of a protein, which is a critical point for understanding cell function. 

Each protein normally folds up into a single stable conformation. However, 
the conformation changes slightly when the protein interacts with other 
molecules in the cell. This change in shape is often crucial to the function of the 
protein, as we see later. 

Although a protein chain can fold into its correct conformation without out- 
side help, in a living cell special proteins called molecular chaperones often assist 
in protein folding. Molecular chaperones bind to partly folded polypeptide 
chains and help them progress along the most energetically favorable folding 
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Figure 3-4 Three types of noncovalent 
bonds help proteins fold. Although a 
single one of these bonds is quite weak, 
many of them often form together to 
create a strong bonding arrangement, as 
in the example shown. As in the previous 
figure, R is used as a general designation 
for an amino acid side chain. 



unfolded polypeptide 



folded conformation in aqueous environment 



Figure 3-5 How a protein folds into a 
compact conformation. The polar amino 
acid side chains tend to gather on the 
outside of the protein, where they can 
interact with water; the nonpolar amino 
acid side chains are buried on the inside 
to form a tightly packed hydrophobic 
core of atoms that are hidden from water. 
In this schematic drawing, the protein 
contains only about 30 amino acids. 
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Figure 3-6 The refolding of a 
denatured protein. (A) This type of 
experiment, first performed more 
than 40 years ago, demonstrates 
that a protein's conformation is 
determined solely by its amino acid 
sequence. (B) The structure of urea. 
Urea is very soluble in water and 
unfolds proteins at high 
concentrations, where there is 
about one urea molecule for every 
six water molecules. 



pathway. In the crowded conditions of the cytoplasm, chaperones prevent the 
temporarily exposed hydrophobic regions in newly synthesized protein chains 
from associating with each other to form protein aggregates (see p. 388). How- 
ever, the final three-dimensional shape of the protein is still specified by its 
amino acid sequence: chaperones simply make the folding process more reliable. 

Proteins come in a wide variety of shapes, and they are generally between 50 
and 2000 amino acids long. Large proteins usually consist of several distinct pro- 
tein domains — structural units that fold more or less independently of each 
other, as we discuss below. Since the detailed structure of any protein is compli- 
cated, several different representations are used to depict the protein's structure, 
each emphasizing different features. 

Panel 3-2 (pp. 132-133) presents four different representations of a protein 
domain called SH2, which has important functions in eucaryotic cells. Con- 
structed from a string of 100 amino acids, the structure is displayed as (A) a 
polypeptide backbone model, (B) a ribbon model, (C) a wire model that 
includes the amino acid side chains, and (D) a space-filling model. Each of the 
three horizontal rows shows the protein in a different orientation, and the 
image is colored in a way that allows the polypeptide chain to be followed from 
its N-terminus (purple) to its C-terminus (red). <GTGA> 

Panel 3-2 shows that a protein's conformation is amazingly complex, even 
for a structure as small as the SH2 domain. But the description of protein struc- 
tures can be simplified because they are built up from combinations of several 
common structural motifs, as we discuss next. 



The a Helix and the (3 Sheet Are Common Folding Patterns 

When we compare the three-dimensional structures of many different protein 
molecules, it becomes clear that, although the overall conformation of each pro- 
tein is unique, two regular folding patterns are often found in parts of them. 
Both patterns were discovered more than 50 years ago from studies of hair and 
silk. The first folding pattern to be discovered, called the a helix, was found in 
the protein a-keratin, which is abundant in skin and its derivatives— such as 
hair, nails, and horns. Within a year of the discovery of the a helix, a second 
folded structure, called a P sheet, was found in the protein fibroin, the major 
constituent of silk. These two patterns are particularly common because they 
result from hydrogen-bonding between the N-H and C=0 groups in the 
polypeptide backbone, without involving the side chains of the amino acids. 
Thus, many different amino acid sequences can form them. In each case, the 
protein chain adopts a regular, repeating conformation. Figure 3-7 shows these 
two conformations, as well as the abbreviations that are used to denote them in 
ribbon models of proteins. 

The core of many proteins contains extensive regions of (3 sheet. As shown in 
Figure 3-8, these (3 sheets can form either from neighboring polypeptide chains 
that run in the same orientation (parallel chains) or from a polypeptide chain 
that folds back and forth upon itself, with each section of the chain running in 
the direction opposite to that of its immediate neighbors (antiparallel chains). 
Both types of p sheet produce a very rigid structure, held together by hydrogen 
bonds that connect the peptide bonds in neighboring chains (see Figure 3-7D). 



1 32 PANEL 3-2: Four Different Ways of Depicting a Small Protein, the SH2 Domain 








(A) Backbone: Shows the overall organization of the polypeptide 
chain; a clean way to compare structures of related proteins. 




(B) Ribbon: Easy way to visualize secondary structures, such as 
a helices and p sheets. 
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(C) Wire: Highlights side chains and their relative proximities; useful for 
predicting which amino acids might be involved in a protein's activity, 
particularly if the protein is an enzyme. 







(D) Space-filling: Provides contour map of the protein; gives a feel for the 
shape of the protein and shows which amino acid side chains are exposed 
on its surface. Shows how the protein might look to a small molecule, 
such as water, or to another protein. 
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Figure 3-7 The regular conformation of the polypeptide backbone in the a helix and the P sheet. <GTAG> <TGCT> 
(A, B, and C) The a helix. The N-H of every peptide bond is hydrogen-bonded to the C=0 of a neighboring peptide bond 
located four peptide bonds away in the same chain. Note that all of the N-H groups point up in this diagram and that all of 
the C=0 groups point down (toward the C-terminus); this gives a polarity to the helix, with the C-terminus having a partial 
negative and the N-terminus a partial positive charge. <D, E, and F) The p sheet. In this example, adjacent peptide chains 
run in opposite (antiparallel) directions. Hydrogen-bonding between peptide bonds in different strands holds the 
individual polypeptide chains (strands) together in a p sheet, and the amino acid side chains in each strand alternately 
project above and below the plane of the sheet. (A) and (D) show all the atoms in the polypeptide backbone, but the 
amino acid side chains are truncated and denoted by R. In contrast, (B) and (E) show the backbone atoms only, while (C) 
and (F) display the shorthand symbols that are used to represent the a helix and the p sheet in ribbon drawings of proteins 
(see Panel 3-2B). 
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An a helix is generated when a single polypeptide chain twists around on 
itself to form a rigid cylinder. A hydrogen bond forms between every fourth pep- 
tide bond, linking the C=0 of one peptide bond to the N-H of another (see Fig- 
ure 3-7A). This gives rise to a regular helix with a complete turn every 3.6 amino 
acids. Note that the protein domain illustrated in Panel 3-2 contains two a 
helices, as well as a three-stranded antiparallel p sheet. 

Regions of a helix are especially abundant in proteins located in cell mem- 
branes, such as transport proteins and receptors. As we discuss in Chapter 10, 
those portions of a transmembrane protein that cross the lipid bilayer usually 
cross as an a helix composed largely of amino acids with nonpolar side chains. 
The polypeptide backbone, which is hydrophilic, is hydrogen-bonded to itself in 
the a helix and shielded from the hydrophobic lipid environment of the mem- 
brane by its protruding nonpolar side chains (see also Figure 3-78). 

In other proteins, a helices wrap around each other to form a particularly 
stable structure, known as a coiled-coil. This structure can form when the two 
(or in some cases three) a helices have most of their nonpolar (hydrophobic) 
side chains on one side, so that they can twist around each other with these side 
chains facing inward (Figure 3-9). Long rodlike coiled-coils provide the struc- 
tural framework for many elongated proteins. Examples are cc-keratin, which 
forms the intracellular fibers that reinforce the outer layer of the skin and its 
appendages, and the myosin molecules responsible for muscle contraction. 



Protein Domains Are Modular Units from which Larger Proteins 
Are Built 

Even a small protein molecule is built from thousands of atoms linked together by 
precisely oriented covalent and noncovalent bonds, and it is extremely difficult to 
visualize such a complicated structure without a three-dimensional display. For 





Figure 3-8 Two types of P sheet 
structures. (A) An antiparallel (3 sheet 
(see Figure 3-7D). (B) A parallel p sheet. 
Both of these structures are common 
in proteins. 
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Figure 3-9 A coiled-coil. <CGGA> 
(A) A single a helix, with successive amino 
acid side chains labeled in a sevenfold 
sequence, "abcdefg" (from bottom to top). 
Amino acids "a" and "d" in such a sequence 
lie close together on the cylinder surface, 
forming a "stripe" {red) that winds slowly 
around the a helix. Proteins that form 
coiled-coils typically have nonpolar amino 
acids at positions "a" and "d." Consequently, 
as shown in (B), the two a helices can wrap 
around each other with the nonpolar side 
chains of one a helix interacting with the 
nonpolar side chains of the other, while the 
more hydrophilic amino acid side chains are 
left exposed to the aqueous environment. 
(C) The atomic structure of a coiled-coil 
determined by x-ray crystallography. The 
red side chains are nonpolar. 
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this reason, biologists use various graphic and computer-based aids. A DVD that 
accompanies this book contains computer-generated images of selected pro- 
teins, displayed and rotated on the screen in a variety of formats. 

Biologists distinguish four levels of organization in the structure of a protein. 
The amino acid sequence is known as the primary structure. Stretches of 
polypeptide chain that form a helices and p sheets constitute the protein's sec- 
ondary structure. The full three-dimensional organization of a polypeptide 
chain is sometimes referred to as the tertiary structure, and if a particular pro- 
tein molecule is formed as a complex of more than one polypeptide chain, the 
complete structure is designated as the quaternary structure. 

Studies of the conformation, function, and evolution of proteins have also 
revealed the central importance of a unit of organization distinct from these 
four. This is the protein domain, a substructure produced by any part of a 
polypeptide chain that can fold independently into a compact, stable structure. 
A domain usually contains between 40 and 350 amino acids, and it is the mod- 
ular unit from which many larger proteins are constructed. 

The different domains of a protein are often associated with different func- 
tions. Figure 3-10 shows an example— the Src protein kinase, which functions in 
signaling pathways inside vertebrate cells (Src is pronounced "sarc"). This pro- 
tein is considered to have three domains: the SH2 and SH3 domains have regu- 
latory roles, while the C-terminal domain is responsible for the kinase catalytic 
activity. Later in the chapter, we shall return to this protein, in order to explain 
how proteins can form molecular switches that transmit information through- 
out cells. 

Figure 3-11 presents ribbon models of three differently organized protein 
domains. As these examples illustrate, the polypeptide chain tends to cross the 
entire domain before making a sharp turn at the surface. The central core of a 
domain can be constructed from a helices, from (3 sheets, or from various com- 
binations of these two fundamental folding elements. <CAGT> 

The smallest protein molecules contain only a single domain, whereas 
larger proteins can contain as many as several dozen domains, often connected 
to each other by short, relatively unstructured lengths of polypeptide chain. 

Few of the Many Possible Polypeptide Chains Will Be Useful to 
Cells 

Since each of the 20 amino acids is chemically distinct and each can, in princi- 
ple, occur at any position in a protein chain, there are 20 x 20 x 20 x 20 = 160,000 
different possible polypeptide chains four amino acids long, or 20" different 
possible polypeptide chains n amino acids long. For a typical protein length of 



Figure 3-10 A protein formed from 
multiple domains. In the Src protein 
shown, a C-terminal domain with two 
lobes {yellow and orange) forms a protein 
kinase enzyme, while the SH2 and SH3 
domains perform regulatory functions. 
(A) A ribbon model, with ATP substrate in 
red. (B) A spacing-filling model, with ATP 
substrate in red. Note that the site that 
binds ATP is positioned at the interface of 
the two lobes that form the kinase. The 
detailed structure of the SH2 domain is 
illustrated in Panel 3-2 (pp. 132-133). 
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about 300 amino acids, a cell could theoretically make more than 10 390 (20 300 ) 
different polypeptide chains. This is such an enormous number that to produce 
just one molecule of each kind would require many more atoms than exist in the 
universe. 

Only a very small fraction of this vast set of conceivable polypeptide chains 
would adopt a single, stable three-dimensional conformation— by some esti- 
mates, less than one in a billion. And yet the vast majority of proteins present in 
cells adopt unique and stable conformations. How is this possible? The answer 
lies in natural selection. A protein with an unpredictably variable structure and 
biochemical activity is unlikely to help the survival of a cell that contains it. Such 
proteins would therefore have been eliminated by natural selection through the 
enormously long trial-and-error process that underlies biological evolution. 

Because evolution has selected for protein function in living organisms, the 
amino acid sequence of most present-day proteins is such that a single confor- 
mation is extremely stable. In addition, this conformation has its chemical prop- 
erties finely tuned to enable the protein to perform a particular catalytic or 
structural function in the cell. Proteins are so precisely built that the change of 
even a few atoms in one amino acid can sometimes disrupt the structure of the 
whole molecule so severely that all function is lost. 



Figure 3-1 1 Ribbon models of three 
different protein domains. 

(A) Cytochrome 6 56 2, a single-domain 
protein involved in electron transport in 
mitochondria. This protein is composed 
almost entirely of a helices. (B) The 
NAD-binding domain of the enzyme 
lactic dehydrogenase, which is composed 
of a mixture of a helices and parallel p 
sheets. (C) The variable domain of an 
immunoglobulin (antibody) light chain, 
composed of a sandwich of two 
antiparallel (3 sheets. In these examples, 
the a helices are shown in green, while 
strands organized as (3 sheets are 
denoted by red arrows. 
Note how the polypeptide chain 
generally traverses back and forth across 
the entire domain, making sharp turns 
only at the protein surface. It is the 
protruding loop regions (yellow) that 
often form the binding sites for other 
molecules. (Adapted from drawings 
courtesy of Jane Richardson.) 



Proteins Can Be Classified into Many Families 

Once a protein had evolved that folded up into a stable conformation with use- 
ful properties, its structure could be modified during evolution to enable it to 
perform new functions. This process has been greatly accelerated by genetic 
mechanisms that occasionally duplicate genes, allowing one gene copy to evolve 
independently to perform a new function (discussed in Chapter 4), This type of 
event has occurred quite often in the past; asS a result, many presenr-day pro- 
teins can be grouped into protein families, each family member having an 
amino acid sequence and a three-dimensional conformation that resemble 
those of the other family members. 

Consider, for example, the serine proteases, a large family of protein-cleaving 
(proteolytic) enzymes that includes the digestive enzymes chymotrypsin, 
trypsin, and elastase, and several proteases involved in blood clotting. When the 
protease portions of any two of these enzymes are compared, parts of their 
amino acid sequences are found to match, the similarity of their three-dimen- 
sional conformations is even more striking: most of the detailed twists and turns 
in their polypeptide chains, which are several hundred amino acids long, are vir- 
tually identical (Figure 3-12). The many different serine proteases nevertheless 
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Figure 3-1 2 A comparison of the 
conformations of two serine proteases. 

The backbone conformations of elastase 
and chymotrypsin. Although only those 
amino acids in the polypeptide chain 
shaded in green are the same in the two 
proteins, the two conformations are very 
similar nearly everywhere. The active site 
of each enzyme is circled in red; this is 
where the peptide bonds of the proteins 
that serve as substrates are bound and 
cleaved by hydrolysis. The serine 
proteases derive their name from the 
amino acid serine, whose side chain is 
part of the active site of each enzyme 
and directly participates in the 
cleavage reaction. 



have distinct enzymatic activities, each cleaving different proteins or the pep- 
tide bonds between different types of amino acids. Each therefore performs a 
distinct function in an organism. 

The story we have told for the serine proteases could be repeated for hun- 
dreds of other protein families. In general, the structure of the different mem- 
bers of a protein family has been more highly conserved than has the amino acid 
sequence. In many cases, the amino acid sequences have diverged so far that we 
cannot be certain of a family relationship between two proteins without deter- 
mining their three-dimensional structures, The yeast a2 protein and the 
Drosophila engrailed protein, for example, are both gene regulatory proteins in 
the homeodomain family (discussed in Chapter 7). Because they are identical in 
only 17 of lueir 60 amino acid residues, their relationship became certain only 
by comparing their three-dimensional structures (Figure 3-13), Many similar 
examples show that two proteins with more than 25% identity in their amino 
acid sequences usually share the same overall structure. 

The various members of a large protein family often have distinct functions. 
Some of ihc amino acid changes that maJce family members different were no 
doubt selected in the course of evolution because they resulted in useful 
changes in biological activity, giving the individual family members the different 
functional properties they have today But many other amino acid changes are 
effectively "neutral: - having neither a beneficial nor a damaging effect on the 
basic structure and function of the protein, tn addition, since mutation is a ran- 
dom process, there must also have been many deleterious changes that altered 
the three-dimensional structure of these proteins sufficiently to harm them. 



Figure 3-13 A comparison of a class of 
DNA-binding domains, called 
homeodomains, in a pair of proteins 
from two organisms separated by more 
than a billion years of evolution. 
(A) A ribbon model of the structure 
common to both proteins. (B) A trace of 
the a-carbon positions. The three- 
dimensional structures shown were 
determined by x-ray crystallography for 
the yeast oe2 protein (green) and the 
Drosophila engrailed protein (red). 
(C) A comparison of amino acid 
sequences for the region of the proteins 
shown in (A) and (B). Block dots mark sites 
with identical amino acids. Orange 
dots indicate the position of a three 
amino acid insert in the cc2 protein. 
(Adapted from C. Wolberger et al„ 
Cell 67:51 7-528, 1991. With permission 
from Elsevier.) 
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Such faulty proteins would have been lost whenever the individual organisms 
making them were at enough of a disadvantage to be eliminated by natural 
selection. 

Protein families are readily recognized when the genome of any organism is 
sequenced; for example, the determination of the DNA sequence for the entire 
human genome has revealed that we contain about 24,000 protein-coding genes. 
Through sequence comparisons, we can assign the products of about 40 percent 
of these genes to known protein structures, belonging to more than 500 different 
protein families. Most of the proteins in each family have evolved to perform 
somewhat different functions, as for the enzymes elastase and chymotrypsin 
illustrated previously in Figure 3-12. These are sometimes called paralogs to dis- 
tinguish them from the corresponding proteins in different organisms 
{orthologs, such as mouse and human elastase). 

As described in Chapter 8, because of the powerful techniques of x-ray crys- 
tallography and nuclear magnetic resonance (NMR), we now know the three- 
dimensional shapes, or conformations, of more than 20,000 proteins. <GGCC> 
By carefully comparing the conformations of these proteins, structural biolo- 
gists (that is, experts on the structure of biological molecules) have concluded 
that there are a limited number of ways in which protein domains fold up in 
nature— maybe as few as 2000. The structures for about 800 of these protein 
folds have thus far been determined. These known folds tend to be those most 
represented in the universe of protein structures: for example, 50 folds account 
for nearly three-fourths of the domain families with predicted structures. A com- 
plete catalog of the most significant protein folds that exist in living organisms 
would therefore seem to be within our reach. 
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Sequence Searches Can Identify Close Relatives 

The present database of known protein sequences contains more than ten mil- 
lion entries, and it is growing very rapidly as more and more genomes are 
sequenced — revealing huge numbers of new genes that encode proteins. Pow- 
erful computer search programs are available that allow us to compare each 
newly discovered protein with this entire database, looking for possible relatives. 
Many proteins whose genes have evolved from a common ancestral gene can be 
identified by the discovery of statistically significant similarities in amino acid 
sequences. 

With such a large number of proteins in the database, the search programs 
find many nonsignificant matches, resulting in a background noise level that 
makes it very difficult to pick out all but the closest relatives. Generally speak- 
ing, one requires a 30% identity in sequence to consider that two proteins 
match. However, we know the function of many short signature sequences 
("fingerprints"), and these are widely used to find more distant relationships 
(Figure 3-14). 

Protein comparisons are important because related structures often imply 
related functions. Many years of experimentation can be saved by discovering 
that a new protein has an amino acid sequence similarity with a protein of 
known function. Such sequence relationships, for example, first indicated that 
certain genes that cause mammalian cells to become cancerous are protein 
kinases. In the same way, many of the proteins that control pattern formation 
during the embryonic development of the fruit fly Drosophila were quickly rec- 
ognized to be gene regulatory proteins. 
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Figure 3-14The use of short signature 
sequences to find related protein 
domains. The two short sequences of 
15 and 9 amino acids shown (green) can 
be used to search large databases for a 
protein domain that is found in many 
proteins, the SH2 domain. Here, the first 
50 amino acids of the SH2 domain of 
1 00 amino acids is compared for the 
human and Drosophila Src protein (see 
Figure 3-10). In the computer-generated 
sequence comparison (yellow row), exact 
matches between the human and 
Drosophila proteins are noted by the one- 
letter abbreviation for the amino acid; the 
positions with a similar but nonidentical 
amino acid are denoted by +, and 
nonmatches are blank. In this diagram, 
wherever one or both proteins contain an 
exact match to a position in the green 
sequences, both aligned sequences are 
colored red. 
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Some Protein Domains Form Parts of Many Different Proteins 

As previously stated, most proteins are composed of a series of protein domains, 
in which different regions of the polypeptide chain have folded independently 
to form compact structures. Such multidomain proteins are believed to have 
originated from the accidental joining of the DNA sequences that encode each 
domain, creating a new gene. Novel binding surfaces have often been created at 
the juxtaposition of domains, and many of the functional sites where proteins 
bind to small molecules are found to be located there. In an evolutionary pro- 
cess called domain shuffling, many large proteins have evolved through the 
joining of preexisting domains in new combinations (Figure 3-15). 

A subset of protein domains have been especially mobile during evolution; 
these seem to have particularly versatile structures and are sometimes referred 
to as protein modules. The structure of one such module, the SH2 domain, was 
illustrated in Panel 3-2 (pp. 132-133). Some other abundant protein domains are 
illustrated in Figure 3-16. 

Each of the domains shown has a stable core structure formed from strands 
of p sheet, from which less-ordered loops of polypeptide chain protrude (green). 
The loops are ideally situated to form binding sites for other molecules, as most 
clearly demonstrated for the immunoglobulin fold, which forms the basis for 
antibody molecules (see Figure 3-41). Most likely, such p-sheet-based domains 
have achieved their evolutionary success because they provide a convenient 
framework for the generation of new binding sites for ligands through small 
changes to their protruding loops. 

A second feature of these protein domains that explains their utility is the 
ease with which they can be integrated into other proteins. Five of the six 
domains illustrated in Figure 3-16 have their N- and C-terminal ends at oppo- 
site poles of the domain. When the DNA encoding such a domain undergoes 
tandem duplication, which is not unusual in the evolution of genomes (dis- 
cussed in Chapter 4), the duplicated modules with this "in-line" arrangement 
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Figure 3-15 Domain shuffling. An 

extensive shuffling of blocks of protein 
sequence (protein domains) has occurred 
during protein evolution. Those portions 
of a protein denoted by the same shape 
and color in this diagram are 
evolutionary related. Serine proteases 
like chymotrypsin are formed from two 
domains (brown). In the three other 
proteases shown, which are highly 
regulated and more specialized, these 
two protease domains are connected to 
one or more domains that are similar to 
domains found in epidermal growth 
factor (EGF; green), to a calcium-binding 
protein (yellow), or to a"kringle"domain 
(blue) that contains three internal 
disulfide bridges. Chymotrypsin is 
illustrated in Figure 3-12. 
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Figure 3-16 The three-dimensional 
structures of some protein modules. In 

these ribbon diagrams, [3-sheet strands are 
shown as arrows, and the N- and 
C-termini are indicated by red spheres. 
(Adapted from M. Baron, D.G. Norman and 
I.D. Campbell, Trends Biochem. Sci. 1 6:1 3-1 7, 
1991, with permission from Elsevier, and 
D.J. Leahy et al., Sc/ence 258:987-991, 1992, 
with permission from AAAS.) 
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can be readily linked in series to form extended structures — either with them- 
selves or with other in-line domains (Figure 3-17). Stiff extended structures 
composed of a series of domains are especially common in extracellular matrix 
molecules and in the extracellular portions of cell-surface receptor proteins. 
Other modules, including the SH2 domain and the kringle domain illustrated in 
Figure 3-16, are of a "plug-in" type, with their N- and C-termini close together. 
After genomic rearrangements, such modules are usually accommodated as an 
insertion into a loop region of a second protein. 

A comparison of the relative frequency of domain utilization in different 
eucaryotes reveals that, for many common domains, such as protein kinases, 
this frequency is similar in organisms as diverse as yeast, plants, worms, flies, 
and humans (Figure 3-18). But there are some notable exceptions, such as the 
Major Histocompatibility Complex (MHC) antigen-recognition domain (see 
Figure 25-52) that is present in 57 copies in humans, but absent in the other 
four organisms just mentioned. Such domains presumably have specialized 
functions that are not shared with the other eucaryotes, being strongly selected 
for during evolution so as to give rise to the multiple copies observed. Similarly, 
a domain like SH2 that shows an unusual increase in its numbers in higher 
eucaryotes might be assumed to be especially useful for multicellularity (com- 
pare the multicellular organisms with yeast in Figure 3-18). 



Certain Pairs of Domains Are Found Together in Many Proteins 

We can construct a large table displaying domain usage for each organism whose 
genome sequence is known. For example, the human genome is estimated to 
contain about 1000 immunoglobulin domains, 500 protein kinase domains, 250 
DNA-binding homeodomains, 300 SH3 domains, and 120 SH2 domains. Impor- 
tant additional information can be derived by comparing the frequencies and 
arrangements of domains in the more than 100 eucaryotic, bacterial, and 
archaeal genomes that have been completely sequenced. For example, we find 
that more than two-thirds of proteins consist of two or more domains, and that 
the same pairs of domains occur repeatedly in the same relative arrangement in 
a protein. Although half of all domain families are common to archaea, bacteria, 
and eucaryotes, only about 5 percent of the two-domain combinations are simi- 
larly shared. This pattern suggests that most proteins containing especially use- 
ful two-domain combinations arose relatively late in evolution. 

The 200 most abundant two-domain combinations occur in about one- 
fourth of all of the proteins with recognizable domains in the complete data set. 
It would therefore be very useful to determine the precise three-dimensional 
structure for at least one protein from each common two-domain combination, 
so as to reveal how the domains interact in that type of protein structure. 




Figure 3-17 An extended structure 
formed from a series of in-line protein 
modules. Fourfibronectin type 3 modules 
(see Figure 3-1 6) from the extracellular 
matrix molecule fibronectin are illustrated 
in (A) ribbon and <B) space-filling models. 
(Adapted from DJ. Leahy, I. Aukhil and 
H.P. Erickson, Ce// 84:1 55-1 64, 1996. With 
permission from Elsevier.) 
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Figure 3-1 8 The relative frequencies of 
three protein domains in five eucaryotic 
organisms. The approximate percentages 
given have been determined by dividing 
the number of copies of each domain by 
the total number of distinct proteins 
thought to be encoded by each 
organism, as determined from the 
sequence of its genome. Thus, for SH2 
domains in humans, 1 20/24,000 = 0.005. 
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The Human Genome Encodes a Complex Set of Proteins, 
Revealing Much That Remains Unknown 

The result of sequencing the human genome has been surprising, because it 
reveals that our chromosomes contain only about 25,000 genes. Based on gene 
number alone, we would appear to be no more complex than the tiny mustard 
weed, Arabidopsis, and only about 1.3-fold more complex than a nematode 
worm. The genome sequences also reveal that vertebrates have inherited nearly 
all of their protein domains from invertebrates— with only 7 percent of identi- 
fied human domains being vertebrate-specific. 

Each of our proteins is on average more complicated, however (Figure 
3-19). Domain shuffling during vertebrate evolution has given rise to many 
novel combinations of protein domains, with the result that there are nearly 
twice as many combinations of domains found in human proteins as in a worm 
or a fly. Thus, for example, the trypsinlike serine protease domain is linked to at 
least 18 other types of protein domains in human proteins, whereas it is found 
covalently joined to only 5 different domains in the worm. This extra variety in 
our proteins greatly increases the range of protein-protein interactions possible 
(see Figure 3-82), but how it contributes to making us human is not known. 

The complexity of living organisms is staggering, and it is quite sobering to 
note that we currently lack even the tiniest hint of what the function might be for 
more than 10,000 of the proteins that have thus far been identified in the human 
genome. There are certainly enormous challenges ahead for the next generation 
of cell biologists, with no shortage of fascinating mysteries to solve. 
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HD Ep2 Br BMB 

Figure 3-1 9 Domain structure of a 
group of evolutionary related proteins 
that are thought to have a similar 
function. In general, there is a tendency 
for the proteins in more complex 
organisms, such as humans, to contain 
additional domains — as is the 
case for the DNA-binding protein 
compared here. 



Larger Protein Molecules Often Contain More Than One 
Polypeptide Chain <gcct> 

The same weak noncovalent bonds that enable a protein chain to fold into a spe- 
cific conformation also allow proteins to bind to each other to produce larger 
structures in the cell. Any region of a protein's surface that can interact with 
another molecule through sets of noncovalent bonds is called a binding site. A 
protein can contain binding sites for various large and small molecules. If a 
binding site recognizes the surface of a second protein, the tight binding of two 
folded polypeptide chains at this site creates a larger protein molecule with a 
precisely defined geometry. Each polypeptide chain in such a protein is called a 
protein subunit. 

In the simplest case, two identical folded polypeptide chains bind to each 
other in a "head-to-head" arrangement, forming a symmetric complex of two 
protein subunits (a dimer) held together by interactions between two identical 
binding sites. The Cro repressor protein— a viral gene regulatory protein that 
binds to DNA to turn viral genes off in an infected bacterial cell— provides an 
example (Figure 3-20). Cells contain many other types of symmetric protein 
complexes, formed from multiple copies of a single polypeptide chain. The 
enzyme neuraminidase, for example, consists of four identical protein subunits, 
each bound to the next in a "head-to-tail" arrangement that forms a closed ring 
(Figure 3-21). 

Many of the proteins in cells contain two or more types of polypeptide 
chains. Hemoglobin, the protein that carries oxygen in red blood cells, contains 




Figure 3-20 Two identical protein 
subunits binding together to form a 
symmetric protein dimer. The Cro 

repressor protein from bacteriophage 
lambda binds to DNA to turn off viral 
genes. Its two identical subunits bind 
head-to-head, held together by a 
combination of hydrophobic forces (blue) 
and a set of hydrogen bonds (yellow 
region). (Adapted from D.H. Ohlendorf, 
D.E. Tronrud and B.W. Matthews, J. Mol. 
Biol. 280:129-136, 1998. With permission 
from Academic Press.) 



THE SHAPE AND STRUCTURE OF PROTEINS 



143 




Figure 3-21 A protein molecule 
containing multiple copies of a single 
protein subunit. The enzyme 
neuraminidase exists as a ring of four 
identical polypeptide chains. Each of 
these chains is formed from six repeats of 
a four-stranded p sheet, as indicated by 
the colored arrows. The small diagram 
shows how the repeated use of the same 
binding interaction forms the structure. 




tetramer of neuraminidase protein 



two identical a-globin subunits and two identical (3-globin subunits, symmetri- 
cally arranged (Figure 3-22). Such multisubunit proteins are very common in 
cells, and they can be very large. Figure 3-23 shows a sample of proteins whose 
exact structures are known, and it compares the sizes and shapes of a few larger 
proteins with some of the relatively small proteins that we have thus far pre- 
sented as models. 



Some Proteins Form Long Helical Filaments 

Some protein molecules can assemble to form filaments that may span the 
entire length of a cell Most simply, a long chain of identical protein molecules 
can be constructed if each molecule has a binding site complementary to 
another region of the surface of the same molecule (Figure 3-24). An actin fila- 
ment, for example, is a long helical structure produced from many molecules of 
the protein actin (Figure 3-25). Actin is very abundant in eucaryotic cells, where 
it constitutes one of the major filament systems of the cytoskeleton (discussed 
in Chapter 16). 

Why is a helix such a common structure in biology? As we have seen, bio- 
logical structures are often formed by linking similar subunits— such as amino 
acids or protein molecules — into long, repetitive chains. If all the subunits are 
identical, the neighboring subunits in the chain can often fit together in only 
one way, adjusting their relative positions to minimize the free energy of the 
contact between them. As a result, each subunit is positioned in exactly the 
same way in relation to the next, so that subunit 3 fits onto subunit 2 in the same 
way that subunit 2 fits onto subunit 1, and so on. Because it is very rare for sub- 
units to join up in a straight line, this arrangement generally results in a helix— 
a regular structure that resembles a spiral staircase, as illustrated in Figure 3-26. 
Depending on the twist of the staircase, a helix is said to be either right-handed 
or left-handed (see Figure 3-26E). Handedness is not affected by turning the 
helix upside down, but it is reversed if the helix is reflected in the mirror. 




Figure 3-22 A protein formed as a 
symmetric assembly of two different 
subunits. Hemoglobin is an abundant 
protein in red blood cells that contains two 
copies of a-globin and two copies of 
P-globin. Each of these four polypeptide 
chains contains a heme molecule (red), 
which is the site that binds oxygen (0 2 ). 
Thus, each molecule of hemoglobin in the 
blood carries four molecules of oxygen. 
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Figure 3-23 A collection of protein molecules, shown at the same scale. For comparison, a DNA molecule bound to a protein is also 
illustrated. These space-filling models represent a range of sizes and shapes. Hemoglobin, catalase, porin, alcohol dehydrogenase, and 
aspartate transcarbamoylase are formed from multiple copies of subunits. The SH2 domain (top left) is presented in detail in Panel 3-2 
(pp. 1 32-1 33). (Adapted from David S. Goodsell, Our Molecular Nature. New York: Springer-Verlag, 1 996. With permission from Springer Science 
and Business Media.) 
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Figure 3-24 Protein assemblies. 

(A) A protein with just one binding site 
can form a dimer with another identical 
protein. (B) Identical proteins with two 
different binding sites often form a long 
helical filament. (C) If the two binding 
sites are disposed appropriately in 
relation to each other, the protein 
subunits may form a closed ring instead 
of a helix. (For an example of A, see 
Figure 3-20; for an example of C, see 
Figure 3 21.) 



Helices occur commonly in biological structures, whether the subunits are 
small molecules linked together by covalent bonds (for example, the amino 
acids in an a helix) or large protein molecules that are linked by noncovalent 
forces (for example, the actin molecules in actin filaments). This is not surpris- 
ing. A helix is an unexceptional structure, and it is generated simply by placing 
many similar subunits next to each other, each in the same strictly repeated rela- 
tionship to the one before — that is, with a fixed rotation followed by a fixed 
translation along the helix axis, as in a spiral staircase. 
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Many Protein Molecules Have Elongated, Fibrous Shapes 

Most of the proteins that we have discussed so far are globular proteins, in which 
the polypeptide chain folds up into a compact shape like a ball with an irregular 
surface. Enzymes tend to be globular proteins: even though many are large and 
complicated, with multiple subunits, most have an overall rounded shape (see 
Figure 3-23). In contrast, other proteins have roles in the cell that require each 
individual protein molecule to span a large distance. These proteins generally 
have a relatively simple, elongated three-dimensional structure and are com- 
monly referred to as fibrous proteins. 

One large family of intracellular fibrous proteins consists of a-keratin, intro- 
duced when we presented the a helix, and its relatives. Keratin filaments are 
extremely stable and are the main component in long-lived structures such as 
hair, horn, and nails. An a-keratin molecule is a dimer of two identical subunits, 
with the long a helices of each subunit forming a coiled-coil (see Figure 3-9). 
The coiled-coil regions are capped at each end by globular domains containing 
binding sites. This enables this class of protein to assemble into ropelike inter- 
mediate filaments— an important component of the cytoskeleton that creates 
the cell's internal structural framework (see Figure 16-19). 

Fibrous proteins are especially abundant outside the cell, where they are a 
main component of the gel-like extracellular matrix that helps to bind collec- 
tions of cells together to form tissues. Cells secrete extracellular matrix proteins 
into their surroundings, where they often assemble into sheets or long fibrils. 
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Figure 3-25 Actin filaments. 

(A) Transmission electron micrographs 
of negatively stained actin filaments. 

(B) The helical arrangement of actin 
molecules in an actin filament. 

(A, courtesy of Roger Craig.) 
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Collagen is the most abundant of these proteins in animal tissues. A collagen 
molecule consists of three long polypeptide chains, each containing the non- 
polar amino acid glycine at every third position. This regular structure allows the 
chains to wind around one another to generate a long regular triple helix (Figure 
3-27 A). Many collagen molecules then bind to one another side-by-side and 
end-to-end to create long overlapping arrays— thereby generating the extremely 
tough collagen fibrils that give connective tissues their tensile strength, as 
described in Chapter 19. 



Many Proteins Contain a Surprisingly Large Amount of 
Unstructured Polypeptide Chain 

It has been well known for a long time that, in complete contrast to collagen, 
another abundant protein in the extracellular matrix, elastin, is formed as a 
highly disordered polypeptide. This disorder is essential for elastin's function. Its 



Figure 3-26 Some properties of a helix. 

(A-D) A helix forms when a series of 
subunits bind to each other in a regular 
way. At the bottom, the interaction 
between two subunits is shown; behind 
them are the helices that result. These 
helices have two (A), three (B), and six 
(C and D) subunits per helical turn. The 
photographs at the top show the 
arrangement of subunits viewed from 
directly above the helix. Note that the 
helix in (D) has a wider path than that in 
(C), but the same number of subunits per 
turn. (E) A helix can be either right- 
handed or left-handed. As a reference, it 
is useful to remember that standard 
metal screws, which insert when turned 
clockwise, are right-handed. Note that a 
helix retains the same handedness when 
it is turned upside down. 



Figure 3-27 Collagen and elastin. 

(A) Collagen is a triple helix formed by 
three extended protein chains that wrap 
around one another (bottom). Many 
rodlike collagen molecules are cross- 
linked together in the extracellular space 
to form unextendable collagen fibrils 
(top) that have the tensile strength of 
steel. The striping on the collagen fibril is 
caused by the regular repeating 
arrangement of the collagen molecules 
within the fibril. (B) Elastin polypeptide 
chains are cross-linked together to form 
rubberlike, elastic fibers. Each elastin 
molecule uncoils into a more extended 
conformation when the fiber is stretched 
and recoils spontaneously as soon as the 
stretching force is relaxed. 
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relatively loose and unstructured polypeptide chains are covalently cross-linked 
to produce a rubberlike elastic meshwork that can be reversibly pulled from one 
conformation to another, as illustrated in Figure 3-27B. The elastic fibers that 
result enable skin and other tissues, such as arteries and lungs, to stretch and 
recoil without tearing. 

Intrinsically unstructured regions of proteins are quite frequent in nature, 
having important functions in the interior of cells. As we have already seen, pro- 
teins use the short loops of polypeptide chain that generally protrude from the 
core region of protein domains to bind other molecules. Similarly, many pro- 
teins have much longer regions of unstructured amino acid sequences that 
interact with another molecule (often DNA or a protein), undergoing a struc- 
tural transition to a specific folded conformation when the other molecule is 
bound. Other proteins appear to resemble elastin, in so far as their function 
requires that they remain largely unstructured. For example, the abundant 
nucleoporins that coat the inner surface of the nuclear pore complex form a ran- 
dom coil meshwork that is intimately involved in nuclear transport (see Figure 
12-10). Finally, as will be discussed later in this chapter (see Figure 3-80C), 
unstructured regions of polypeptide chain are often used to connect the binding 
sites for proteins that function together to catalyze a biological reaction. Thus, 
for example, in facilitating cell signaling, large scaffold proteins use such flexible 
regions as "tethers" that concentrate sets of interacting proteins, often confining 
them to particular sites in the cell (discussed in Chapter 15). 

We can recognize the unstructured regions in many proteins by their biased 
amino acid composition: they contain very few of the bulky hydrophobic amino 
acids that normally form the core of a folded protein, being composed instead 
of a high proportion of the amino acids Gin, Ser, Pro, Glu, and Lys. Such "natively 
unfolded" regions also frequently contain repeated sequences of amino acids. 



Covalent Cross-Linkages Often Stabilize Extracellular Proteins 

Many protein molecules are either attached to the outside of a cell's plasma 
membrane or secreted as part of the extracellular matrix. All such proteins are 
directly exposed to extracellular conditions. To help maintain their structures, 
the polypeptide chains in such proteins are often stabilized by covalent cross- 
linkages. These linkages can either tie two amino acids in the same protein 
together, or connect different polypeptide chains in a multisubunit protein. The 
most common cross-linkages in proteins are covalent sulfur-sulfur bonds. 
These disulfide bonds (also called S-S bonds) form as cells prepare newly syn- 
thesized proteins for export. As described in Chapter 12, their formation is cat- 
alyzed in the endoplasmic reticulum by an enzyme that links together two pairs 
of-SH groups of cysteine side chains that are adjacent in the folded protein (Fig- 
ure 3-28). Disulfide bonds do not change the conformation of a protein but 
instead act as atomic staples to reinforce its most favored conformation. For 
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Figure 3-28 Disulfide bonds. <ATAC> 
This diagram illustrates how covalent 
disulfide bonds form between adjacent 
cysteine side chains. As indicated, these 
cross-linkages can join either two parts of 
the same polypeptide chain or two 
different polypeptide chains. Since the 
energy required to break one covalent 
bond is much larger than the energy 
required to break even a whole set of 
noncovalent bonds (see Table 2-1, p. 53), 
a disulfide bond can have a major 
stabilizing effect on a protein. 
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example, lysozyme — an enzyme in tears that dissolves bacterial cell walls- 
retains its antibacterial activity for a long time because it is stabilized by such 
cross-linkages. 

Disulfide bonds generally fail to form in the cell cytosol, where a high con- 
centration of reducing agents converts S-S bonds back to cysteine -SH groups. 
Apparently, proteins do not require this type of reinforcement in the relatively 
mild environment inside the cell. 



Figure 3-29 An example of the 
assembly of a single protein subunit 
requiring multiple protein-protein 
contacts. Hexagonally packed globular 
protein subunits can form either a flat 
sheet or a tube. 



Protein Molecules Often Serve as Subunits for the Assembly of 
Large Structures 

The same principles that enable a protein molecule to associate with itself to 
form rings or filaments also operate to generate much larger structures in the 
cell — supramolecular structures such as enzyme complexes, ribosomes, protein 
filaments, viruses, and membranes. These large objects are not made as single, 
giant, covalently linked molecules. Instead they are formed by the noncovalent 
assembly of many separately manufactured molecules, which serve as the sub- 
units of the final structure. 

The use of smaller subunits to build larger structures has several advantages: 

1. A large structure built from one or a few repeating smaller subunits 
requires only a small amount of genetic information. 

2. Both assembly and disassembly can be readily controlled, reversible pro- 
cesses, because the subunits associate through multiple bonds of relatively 
low energy. 

3. Errors in the synthesis of the structure can be more easily avoided, since 
correction mechanisms can operate during the course of assembly to 
exclude malformed subunits. 

Some protein subunits assemble into flat sheets in which the subunits are 
arranged in hexagonal patterns. Specialized membrane proteins are sometimes 
arranged this way in lipid bilayers. With a slight change in the geometry of the 
individual subunits, a hexagonal sheet can be converted into a tube (Figure 
3-29) or, with more changes, into a hollow sphere. Protein tubes and spheres 
that bind specific RNA and DNA molecules in their interior form the coats of 
viruses. 

The formation of closed structures, such as rings, tubes, or spheres, provides 
additional stability because it increases the number of bonds between the pro- 
tein subunits. Moreover, because such a structure is created by mutually depen- 
dent, cooperative interactions between subunits, a relatively small change that 
affects each subunit individually can cause the structure to assemble or disas- 
semble. These principles are dramatically illustrated in the protein coat or cap- 
sid of many simple viruses, which takes the form of a hollow sphere based on an 
icosahedron (Figure 3-30). Capsids are often made of hundreds of identical pro- 
tein subunits that enclose and protect the viral nucleic acid (Figure 3-31). The 
protein in such a capsid must have a particularly adaptable structure: not only 
must it make several different kinds of contacts to create the sphere, it must also 
change this arrangement to let the nucleic acid out to initiate viral replication 
once the virus has entered a cell. 
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Figure 3-30 The capsids of some viruses, all shown at the same scale. (A) Tomato bushy stunt virus; (B) poliovirus; 

(C) simian virus 40 (SV40); (D) satellite tobacco necrosis virus. The structures of all of these capsids have been determined 

by x-ray crystallography and are known in atomic detail. {Courtesy of Robert Grant, Stephan Crainic, and James M. Hogle.) 



Many Structures in Cells Are Capable of Self-Assembly 

The information for forming many of the complex assemblies of macro- 
molecules in cells must be contained in the subunits themselves, because puri- 
fied subunits can spontaneously assemble into the final structure under the 
appropriate conditions. The first large macromolecular aggregate shown to be 
capable of self-assembly from its component parts was tobacco mosaic virus 
(TMV). This virus is a long rod in which a cylinder of protein is arranged around 
a helical RNA core (Figure 3-32). If the dissociated RNA and protein subunits are 
mixed together in solution, they recombine to form fully active viral particles. 
The assembly process is unexpectedly complex and includes the formation of 
double rings of protein, which serve as intermediates that add to the growing 
viral coat. 

Another complex macromolecular aggregate that can reassemble from its 
component parts is the bacterial ribosome. This structure is composed of about 
55 different protein molecules and 3 different rRNA molecules. Incubating the 
individual components under appropriate conditions in a test tube causes them 
to spontaneously re-form the original structure. Most importantly, such recon- 
stituted ribosomes are able to catalyze protein synthesis. As might be expected, 
the reassembly of ribosomes follows a specific pathway: after certain proteins 
have bound to the RNA, this complex is then recognized by other proteins, and 
so on, until the structure is complete. 

It is still not clear how some of the more elaborate self-assembly processes 
are regulated. Many structures in the cell, for example, seem to have a precisely 
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defined length that is many times greater than that of their component macro- 
molecules. How such length determination is achieved is in many cases a mys- 
tery Three possible mechanisms are illustrated in Figure 3-33. In the simplest 



Figure 3-31 The structure of a spherical 
virus. In many viruses, identical protein 
subunits pack together to create a 
spherical shell (a capsid) that encloses 
the viral genome, composed of either 
RNA or DNA (see also Figure 3-30). For 
geometric reasons, no more than 60 
identical subunits can pack together in a 
precisely symmetric way. If slight 
irregularities are allowed, however, more 
subunits can be used to produce a larger 
capsid that retains icosahedral symmetry. 
The tomato bushy stunt virus (TBSV) 
shown here, for example, is a spherical 
virus about 33 nm in diameter formed 
from 180 identical copies of a 386 amino 
acid capsid protein plus an RNA genome 
of 4500 nucleotides. To construct such a 
large capsid, the protein must be able to 
fit into three somewhat different 
environments, each of which is differently 
colored in the virus particle shown here. 
The postulated pathway of assembly is 
shown; the precise three-dimensional 
structure has been determined by x-ray 
diffraction. (Courtesy of Steve Harrison.) 



Figure 3-32 The structure of tobacco mosaic virus (TMV). (A) An electron 
micrograph of the viral particle, which consists of a single long RNA 
molecule enclosed in a cylindrical protein coat composed of identical 
protein subunits. (B) A model showing part of the structure of TMV. A 
single-stranded RNA molecule of 6395 nucleotides is packaged in a helical 
coat constructed from 21 30 copies of a coat protein 1 58 amino acids long. 
Fully infective viral particles can self-assemble in a test tube from purified 
RNA and protein molecules. (A, courtesy of Robley Williams; B, courtesy of 
Richard J. Feldmann.) 
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Figure 3-33 Three mechanisms of length determination for large protein 
assemblies. (A) Coassembly along an elongated core protein or other 
macromolecule that acts as a measuring device. (B) Termination of 
assembly because of strain that accumulates in the polymeric structure as 
additional subunits are added, so that beyond a certain length the energy 
required to fit another subunit onto the chain becomes excessively large. 
(C) A vernier type of assembly, in which two sets of rodlike molecules 
differing in length form a staggered complex that grows until their ends 
exactly match. The name derives from a measuring device based on the 
same principle, used in mechanical instruments. 



case, a long core protein or other macromolecule provides a scaffold that deter- 
mines the extent of the final assembly. This is the mechanism that determines 
the length of the TMV particle, where the RNA chain provides the core. Similarly 
a core protein is thought to determine the length of the thin filaments in mus- 
cle, as well as the length of the long tails of some bacterial viruses (Figure 3-34). 



Assembly Factors Often Aid the Formation of Complex Biological 
Structures 

Not all cellular structures held together by noncovalent bonds self- assemble. A 
mitochondrion, a cilium, or a myofibril of a muscle cell, for example, cannot 
form spontaneously from a solution of its component macromolecules. In these 
cases, part of the assembly information is provided by special enzymes and 
other proteins that perform the function of templates, guiding construction but 
taking no part in the final assembled structure. 

Even relatively simple structures may lack some of the ingredients necessary 
for their own assembly. In the formation of certain bacterial viruses, for exam- 
ple, the head, which is composed of many copies of a single protein subunit, is 
assembled on a temporary scaffold composed of a second protein. Because the 
second protein is absent from the final viral particle, the head structure cannot 
spontaneously reassemble once it has been taken apart. Other examples are 
known in which proteolytic cleavage is an essential and irreversible step in the 
normal assembly process. This is even the case for some small protein assem- 
blies, including the structural protein collagen and the hormone insulin (Figure 
3-35). From these relatively simple examples, it seems certain that the assembly 
of a structure as complex as a mitochondrion or a cilium will involve temporal 
and spatial ordering imparted by numerous other cell components. 



Figure 3-34 An electron micrograph of bacteriophage lambda. The tip of 

the virus tail attaches to a specific protein on the surface of a bacterial cell, 
after which the tightly packaged DNA in the head is injected through the 
tail into the cell. The tail has a precise length, determined by the 
mechanism shown in Figure 3-33A. 
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Figure 3-35 Proteolytic cleavage in 
insulin assembly. The polypeptide 
hormone insulin cannot spontaneously 
re-form efficiently if its disulfide bonds 
are disrupted. It is synthesized as a larger 
protein (proinsulin) that is cleaved by a 
proteolytic enzyme after the protein 
chain has folded into a specific shape. 
Excision of part of the proinsulin 
polypeptide chain removes some of the 
information needed for the protein to 
fold spontaneously into its normal 
conformation. Once insulin has been 
denatured and its two polypeptide 
chains have separated, its ability to 
reassemble is lost. 
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Summary 

A protein molecule's amino acid sequence determines its three-dimensional confor- 
mation. Noncovalent interactions between different parts of the polypeptide chain sta- 
bilize its folded structure. The amino acids with hydrophobic side chains tend to clus- 
ter in the interior of the molecule, and local hydrogen-bond interactions between 
neighboring peptide bonds give rise to a helices and ft sheets. 

Globular regions, known as domains, are the modular units from which many 
proteins are constructed; such domains generally contain 40-350 amino acids. Small 
proteins typically consist of only a single domain, while large proteins are formed from 
several domains linked together by various lengths of polypeptide chain, some of 
which can be relatively disordered. As proteins have evolved, domains have been mod- 
ified and combined with other domains to construct new proteins. Thus far, about 800 
different ways of folding up a domain have been observed, among more than 20,000 
known protein structures. 

Proteins are brought together into larger structures by the same noncovalent forces 
that determine protein folding. Proteins with binding sites for their own surface can 
assemble into dimers, closed rings, spherical shells, or helical polymers. Although mix- 
tures of proteins and nucleic acids can assemble spontaneously into complex struc- 
tures in a test tube, many biological assembly processes involve irreversible steps. Con- 
sequently, not all structures in the cell are capable of spontaneous reassembly after 
they have been dissociated into their component parts. 



PROTEIN FUNCTION 

We have seen that each type of protein consists of a precise sequence of amino 
acids that allows it to fold up into a particular three-dimensional shape, or con- 
formation. But proteins are not rigid lumps of material. They often have precisely 
engineered moving parts whose mechanical actions are coupled to chemical 
events. It is this coupling of chemistry and movement that gives proteins the 
extraordinary capabilities that underlie the dynamic processes in living cells. 
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In this section, we explain how proteins bind to other selected molecules 
and how their activity depends on such binding. We show that the ability to bind 
to other molecules enables proteins to act as catalysts, signal receptors, 
switches, motors, or tiny pumps. The examples we discuss in this chapter by no 
means exhaust the vast functional repertoire of proteins. You will encounter the 
specialized functions of many other proteins elsewhere in this book, based on 
similar principles. 



All Proteins Bind to Other Molecules 

A protein molecule's physical interaction with other molecules determines its 
biological properties. Thus, antibodies attach to viruses or bacteria to mark 
them for destruction, the enzyme hexokinase binds glucose and ATP so as to cat- 
alyze a reaction between them, actin molecules bind to each other to assemble 
into actin filaments, and so on. Indeed, all proteins stick, or bind, to other 
molecules. In some cases, this binding is very tight; in others it is weak and 
short-lived. But the binding always shows great specificity, in the sense that each 
protein molecule can usually bind just one or a few molecules out of the many 
thousands of different types it encounters. The substance that is bound by the 
protein — whether it is an ion, a small molecule, or a macromolecule such as 
another protein — is referred to as a ligand for that protein (from the Latin word 
ligare, meaning "to bind"). 

The ability of a protein to bind selectively and with high affinity to a ligand 
depends on the formation of a set of weak, noncovalent bonds — hydrogen 
bonds, electrostatic attractions, and van der Waals attractions — plus favorable 
hydrophobic interactions (see Panel 2 3, pp. 110-111). Because each individual 
bond is weak, effective binding occurs only when many of these bonds form 
simultaneously. Such binding is possible only if the surface contours of the lig- 
and molecule fit very closely to the protein, matching it like a hand in a glove 
(Figure 3-36). 

The region of a protein that associates with a ligand, known as the ligand's 
binding site, usually consists of a cavity in the protein surface formed by a par- 
ticular arrangement of amino acids. These amino acids can belong to different 
portions of the polypeptide chain that are brought together when the protein 
folds (Figure 3-37). Separate regions of the protein surface generally provide 
binding sites for different ligands, allowing the protein's activity to be regulated, 
as we shall see later. And other parts of the protein act as a handle to position the 
protein in the cell — an example is the SH2 domain discussed previously, which 
often moves a protein containing it to particular intracellular sites in response to 
particular signals. 

Although the atoms buried in the interior of the protein have no direct con- 
tact with the ligand, they form the framework that gives the surface its contours 
and its chemical and mechanical properties. Even small changes to the amino 
acids in the interior of a protein molecule can change its three-dimensional 
shape enough to destroy a binding site on the surface. 



noncovalent bonds 

/ 




protein 



Figure 3-36 The selective binding of a 
protein to another molecule. Many weak 
bonds are needed to enable a protein to 
bind tightly to a second molecule, which 
is called a ligand for the protein. A ligand 
must therefore fit precisely into a 
protein's binding site, like a hand into a 
glove, so that a large number of 
noncovalent bonds form between the 
protein and the ligand. 
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Figure 3-37 The binding site of a protein. (A) The folding of the 
polypeptide chain typically creates a crevice or cavity on the protein 
surface. This crevice contains a set of amino acid side chains disposed in 
such a way that they can form noncovalent bonds only with certain 
ligands. (B) A close-up of an actual binding site showing the hydrogen 
bonds and electrostatic interactions formed between a protein and its 
ligand. In this example, cyclic AMP is the bound ligand. 
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The Surface Conformation of a Protein Determines Its Chemistry 

Proteins have impressive chemical capabilities because the neighboring chemi- 
cal groups on their surface often interact in ways that enhance the chemical 
reactivity of amino acid side chains. These interactions fall into two main cate- 
gories. 

First, the interaction of neighboring parts of the polypeptide chain may 
restrict the access of water molecules to that protein's ligand-binding sites. This 
is important because water molecules readily form hydrogen bonds that can 
compete with ligands for sites on the protein surface. Proteins and their ligands 
form tighter hydrogen bonds (and electrostatic interactions) if the protein can 
exclude water molecules from its binding sites. It might be hard to imagine a 
mechanism that would exclude a molecule as small as water from a protein sur- 
face without affecting the access of the ligand itself. However, because of the 
strong tendency of water molecules to form water-water hydrogen bonds, water 
molecules exist in a large hydrogen-bonded network (see Panel 2-2, pp. 
108-109). In effect, a protein can keep a ligand-binding site dry because it is 
energetically unfavorable for individual water molecules to break away from this 
network, as they must do to reach into a crevice on a protein's surface. 

Second, the clustering of neighboring polar amino acid side chains can alter 
their reactivity. If protein folding forces together a number of negatively charged 
side chains against their mutual repulsion, for example, the affinity of the site for 
a positively charged ion is greatly increased. In addition, when amino acid side 
chains interact with one another through hydrogen bonds, normally unreactive 
side groups (such as the -CH 2 OH on the serine shown in Figure 3-38) can 
become reactive, enabling them to be used to make or break selected covalent 
bonds. 

The surface of each protein molecule therefore has a unique chemical reac- 
tivity that depends not only on which amino acid side chains are exposed, but 
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also on their exact orientation relative to one another. For this reason, two even 
slightly different conformations of the same protein molecule may differ greatly 
in their chemistry. 

Sequence Comparisons Between Protein Family Members 
Highlight Crucial Ligand-Binding Sites 

As we have described previously, genome sequences allow us to group many of 
the domains in proteins into families that show clear evidence of their evolution 
from a common ancestor. The three-dimensional structures of the members of 
the same domain family are remarkably similar. For example, even when the 
amino acid sequence identity falls to 25%, the backbone atoms in a domain fol- 
low a common protein fold within 0.2 nanometers (2 A). 

We can therefore use a method called evolutionary tracing to identify those 
sites in a protein domain that are the most crucial to the domain's function. For 
this purpose, those amino acids that are unchanged, or nearly unchanged, in all 
of the known protein family members are mapped onto a model of the three- 
dimensional structure of one family member. When this is done, the most 
invariant positions often form one or more clusters on the protein surface, as 
illustrated in Figure 3-39A for the SH2 domain described previously (see Panel 
3-2, pp. 132-133). These clusters generally correspond to ligand binding sites. 

The SH2 domain is a module that functions in protein-protein interactions. 
It binds the protein containing it to a second protein that contains a phospho- 
rylated tyrosine side chain in a specific amino acid sequence context, as shown 
in Figure 3-39B. The amino acids located at the binding site for the phosphory- 
lated polypeptide have been the slowest to change during the long evolutionary 
process that produced the large SH2 family of peptide recognition domains. 
Because mutation is a random process, this result is attributed to the preferen- 
tial elimination during evolution of all organisms whose SH2 domains became 
altered in a way that inactivated the SH2-binding site, thereby destroying the 
function of the SH2 domain. 



Figure 3-38 An unusually reactive 
amino acid at the active site of an 
enzyme. This example is the "catalytic 
triad" found in chymotrypsin, elastase, 
and other serine proteases (see Figure 
3-12).Theasparticacid side chain (Asp 
1 02) induces the histidine (His 57) to 
remove the proton from serine 1 95. This 
activates the serine to form a covalent 
bond with the enzyme substrate, 
hydrolyzing a peptide bond. The many 
convolutions of the polypeptide chain 
are omitted here. 



Figure 3-39 The evolutionary trace 
method applied to the SH2 domain. 

(A) Front and back views of a space-filling 
model of the SH2 domain, with 
evolutionary conserved amino acids on 
the protein surface colored yellow, and 
those more toward the protein interior 
colored red. (B) The structure of the SH2 
domain with its bound polypeptide. Here, 
those amino acids located within 0.4 nm 
of the bound ligand are colored blue. 
The two key amino acids of the ligand 
are yellow, and the others are purple. Note 
the high degree of correspondance 
between (A) and (B). (Adapted from 
0. Lichtarge, H.R. Bourne and F.E. Cohen, 
J. Mol. Biol. 257:342-358, 1 996. With 
permission from Elsevier.) 
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In this era of extensive genome sequencing, many new protein families have 
been discovered whose functions are unknown. Once a three-dimensional 
structure has been determined for one family member, evolutionary tracing 
allows biologists to determine binding sites for the members of that family, 
thereby helping to decipher protein function. 



Proteins Bind to Other Proteins Through Several Types of 
Interfaces 

Proteins can bind to other proteins in at least three ways. In many cases, a por- 
tion of the surface of one protein contacts an extended loop of polypeptide 
chain (a "string") on a second protein (Figure 3-40A). Such a surface-string 
interaction, for example, allows the SH2 domain to recognize a phosphorylated 
polypeptide loop on a second protein, as just described, and it also enables a 
protein kinase to recognize the proteins that it will phosphorylate (see below). 

A second type of protein-protein interface forms when two a helices, one 
from each protein, pair together to form a coiled-coil (Figure 3-40B). This type 
of protein interface is found in several families of gene regulatory proteins, as 
discussed in Chapter 7. 

The most common way for proteins to interact, however, is by the precise 
matching of one rigid surface with that of another (Figure 3-40C). Such interac- 
tions can be very tight, since a large number of weak bonds can form between 
two surfaces that match well. For the same reason, such surface-surface inter- 
actions can be extremely specific, enabling a protein to select just one partner 
from the many thousands of different proteins found in a cell. 



Figure 3-40 Three ways in which two 
proteins can bind to each other. Only 
the interacting parts of the two proteins 
are shown. (A) A rigid surface on one 
protein can bind to an extended loop of 
polypeptide chain (a "string") on a second 
protein. (B) Two a helices can bind 
together to form a coiled-coil. 
(C) Two complementary rigid surfaces 
often link two proteins together. 



Antibody Binding Sites Are Especially Versatile <gccg> 

All proteins must bind to particular ligands to carry out their various functions. 
The antibody family is notable for its capacity for tight selective binding (dis- 
cussed in detail in Chapter 25). 

Antibodies, or immunoglobulins, are proteins produced by the immune 
system in response to foreign molecules, such as those on the surface of an 
invading microorganism. Each antibody binds tightly to a particular target 
molecule, thereby either inactivating the target molecule directly or marking it 
for destruction. An antibody recognizes its target (called an antigen) with 
remarkable specificity. Because there are potentially billions of different anti- 
gens that humans might encounter, we have to be able to produce billions of dif- 
ferent antibodies. 

Antibodies are Y-shaped molecules with two identical binding sites that are 
complementary to a small portion of the surface of the antigen molecule. A 
detailed examination of the antigen-binding sites of antibodies reveals that they 
are formed from several loops of polypeptide chain that protrude from the ends 
of a pair of closely juxtaposed protein domains (Figure 3-41). Different anti- 
bodies generate an enormous diversity of antigen-binding sites by changing 
only the length and amino acid sequence of these loops, without altering the 
basic protein structure. 

Loops of this kind are ideal for grasping other molecules. They allow a large 
number of chemical groups to surround a ligand so that the protein can link to 
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it with many weak bonds. For this reason, loops often form the ligand-binding 
sites in proteins. 



The Equilibrium Constant Measures Binding Strength 

Molecules in the cell encounter each other very frequently because of their con- 
tinual random thermal movements. Colliding molecules with poorly matching 
surfaces form few noncovalent bonds with one another, and the two molecules 
dissociate as rapidly as they come together. At the other extreme, when many 
noncovalent bonds form between two colliding molecules, the association can 
persist for a very long time (Figure 3-42). Strong interactions occur in cells 
whenever a biological function requires that molecules remain associated for a 
long time — for example, when a group of RNA and protein molecules come 
together to make a subcellular structure such as a ribosome. 

We can measure the strength with which any two molecules bind to each 
other. As an example, consider a population of identical antibody molecules that 
suddenly encounters a population of ligands diffusing in the fluid surrounding 
them. At frequent intervals, one of the ligand molecules will bump into the bind- 
ing site of an antibody and form an antibody-ligand complex. The population of 
antibody-ligand complexes will therefore increase, but not without limit: over 
time, a second process, in which individual complexes break apart because of 
thermally induced motion, will become increasingly important. Eventually, any 
population of antibody molecules and ligands will reach a steady state, or equi- 
librium, in which the number of binding (association) events per second is pre- 
cisely equal to the number of "unbinding" (dissociation) events (see Figure 2-52). 

From the concentrations of the ligand, antibody, and antibody-ligand com- 
plex at equilibrium, we can calculate a convenient measure — the equilibrium 
constant (K) — of the strength of binding (Figure 3-43A). The equilibrium con- 
stant for a reaction in which two molecules (A and B) bind to each other to form 
a complex (AB) has units of liters/mole, and half of the binding sites will be 
occupied by ligand when that ligand's concentration (in moles/liter) reaches a 
value that is equal to UK, This equilibrium constant is larger the greater the 
binding strength, and it is a direct measure of the free-energy difference 



Figure 3-41 An antibody molecule. 

(A) A typical antibody molecule is 
Y-shaped and has two identical binding 
sites for its antigen, one on each arm of 
the Y. The protein is composed of four 
polypeptide chains (two identical heavy 
chains and two identical and smaller light 
chains) held together by disulfide bonds. 
Each chain is made up of several different 
immunoglobulin domains, here shaded 
either blue or gray. The antigen-binding 
site is formed where a heavy-chain 
variable domain (V H ) and a light-chain 
variable domain (V L ) come close 
together. These are the domains that 
differ most in their sequence and 
structure in different antibodies. Each 
domain at the end of the two arms of the 
antibody molecule forms loops that bind 
to the antigen. In (B) we can see these 
fingerlike loops (red) contributed by the 
Vl domain. 
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molecule A randomly encounters 
other molecules (B, C, and D) 




the surfaces of molecules A and B, 
and A and C, are a poor match and 
are capable of forming only a few 
weak bonds; thermal motion rapidly 
breaks them apart 



between the bound and free states (Figure 3-43B and C). Even a change of a few 
noncovalent bonds can have a striking effect on a binding interaction, as shown 
by the example in Figure 3-44. (Note that the equilibrium constant, as defined 
here is also known as the association or affinity constant, K&.) 

We have used the case of an antibody binding to its ligand to illustrate the 
effect of binding strength on the equilibrium state, but the same principles 
apply to any molecule and its ligand. Many proteins are enzymes, which, as we 
now discuss, first bind to their ligands and then catalyze the breakage or forma- 
tion of covalent bonds in these molecules. 



the surfaces of molecules A and D 
match well and therefore can form 
enough weak bonds to withstand 
thermal jolting; they therefore 
stay bound to each other 



Figure 3-42 How noncovalent bonds 
mediate interactions between 
macromolecules. 



Enzymes Are Powerful and Highly Specific Catalysts 

Many proteins can perform their function simply by binding to another 
molecule. An actin molecule, for example, need only associate with other actin 
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Although joules and 
kilojoules (1000 joules) are 
standard units of energy, 
cell biologists usually refer 
to free energy values in 
terms of calories and 
kilocalories. 

One kilocalorie (kcal) is 
equal to 4.184 kilojoules 
(kJ). 

The relationship between 
the free-energy change, 
AG, and the equilibrium 
constant is 

AG = -0.00458 7 log K 

where AG is in kilocalories 
and T is the absolute 
temperature in Kelvins 
(310K = 37°C) 
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Figure 3-43 Relating binding energies to the equilibrium constant for an association reaction. (A) The equilibrium 
between molecules A and B and the complex AB is maintained by a balance between the two opposing reactions shown in 
panels 1 and 2. Molecules A and B must collide if they are to react, and the association rate is therefore proportional to the 
product of their individual concentrations [A] x [B]. (Square brackets indicate concentration.) As shown in panel 3, the ratio 
of the rate constants for the association and the dissociation reactions is equal to the equilibrium constant (K) for the 
reaction. (B) The equilibrium constant in panel 3 is that for the reaction A + B ^ AB, and the larger its value, the stronger 
the binding between A and B. Note that for every 1 .41 kcal/mole (5.91 kJ/mole) decrease in free energy the equilibrium 
constant increases by a factor of 1 at 37°C. 

The equilibrium constant here has units of liters/mole: for simple binding interactions it is also called the affinity constant 
or association constant, denoted K a , The reciprocal of K a is called the dissociation constant, K d (in units of moles/liter). 
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molecules to form a filament. There are other proteins, however, for which lig- 
and binding is only a necessary first step in their function. This is the case for the 
large and very important class of proteins called enzymes. As described in 
Chapter 2, enzymes are remarkable molecules that determine all the chemical 
transformations that make and break covalent bonds in cells. They bind to one 
or more ligands, called substrates, and convert them into one or more chemi- 
cally modified products, doing this over and over again with amazing rapidity. 
Enzymes speed up reactions, often by a factor of a million or more, without 
themselves being changed — that is, they act as catalysts that permit cells to 
make or break covalent bonds in a controlled way. It is the catalysis of organized 
sets of chemical reactions by enzymes that creates and maintains the cell, mak- 
ing life possible. 

We can group enzymes into functional classes that perform similar chemical 
reactions (Table 3-1). Each type of enzyme within such a class is highly specific, 
catalyzing only a single type of reaction. Thus, hexokinase adds a phosphate 
group to D-glucose but ignores its optical isomer L-glucose; the blood-clotting 
enzyme thrombin cuts one type of blood protein between a particular arginine 
and its adjacent glycine and nowhere else, and so on. As discussed in detail in 
Chapter 2, enzymes work in teams, with the product of one enzyme becoming 
the substrate for the next. The result is an elaborate network of metabolic path- 
ways that provides the cell with energy and generates the many large and small 
molecules that the cell needs (see Figure 2-35). 



Substrate Binding Is the First Step in Enzyme Catalysis 

For a protein that catalyzes a chemical reaction (an enzyme), the binding of each 
substrate molecule to the protein is an essential prelude. In the simplest case, if 
we denote the enzyme by E, the substrate by S, and the product by P, the basic 
reaction path isE + S^ES^EP^E + E From this reaction path, we see that 
there is a limit to the amount of substrate that a single enzyme molecule can 
process in a given time. An increase in the concentration of substrate also 
increases the rate at which product is formed, up to a maximum value (Figure 
3-45). At that point the enzyme molecule is saturated with substrate, and the 
rate of reaction {V max ) depends only on how rapidly the enzyme can process the 
substrate molecule. This maximum rate divided by the enzyme concentration is 



Consider 1000 molecules of A and 
1000 molecules of B in a eucaryotic 
cell. The concentration of both will 
be about 10" 9 M. 

If the equilibrium constant (K) 
for A + B *=* AB is 10 10 , then one can 
calculate that at equilibrium there 
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a loss of 2.8 kcal/mole of binding 
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Figure 3-44 Small changes in the 
number of weak bonds can have drastic 
effects on a binding interaction. This 
example illustrates the dramatic effect of 
the presence or absence of a few weak 
noncovalent bonds in a biological 
context. 



Table 3-1 Some Common Types of Enzymes 
ENZYME 

Hydrolases 



Nucleases 

Proteases 

Synthases 

Isomerases 

Polymerases 

Kinases 

Phosphatases 
Oxido-Reductases 



ATPases 



REACTION CATALYZE 

general term for enzymes that catalyze a hydrolytic cleavage reaction; nucleases and proteoses 

are more specific names for subclasses of these enzymes, 
break down nucleic acids by hydrolyzing bonds between nucleotides, 
break down proteins by hydrolyzing bonds between amino acids. 

synthesize molecules in anabolic reactions by condensing two smaller molecules together, 
catalyze the rearrangement of bonds within a single molecule, 
catalyze polymerization reactions such as the synthesis of DNA and RNA. 
catalyze the addition of phosphate groups to molecules. Protein kinases are an important group 

of kinases that attach phosphate groups to proteins, 
catalyze the hydrolytic removal of a phosphate group from a molecule, 
general name for enzymes that catalyze reactions in which one molecule is oxidized while the 

other is reduced. Enzymes of this type are often more specifically named either oxidases, 

reductases, or dehydrogenases. 
hydrolyze ATP. Many proteins with a wide range of roles have an energy-harnessing ATPase 

activity as part of their function, for example, motor proteins such as myosin and membrane 

transport proteins such as the sodium-potassium pump. 



Enzyme names typically end in "-ase," with the exception of some enzymes, such as pepsin, trypsin, thrombin and lysozyme that were discovered 
and named before the convention became generally accepted at the end of the nineteenth century The common name of an enzyme usually 
indicates the substrate and the nature of the reaction catalyzed. For example, citrate synthase catalyzes the synthesis of citrate by a reaction 

between acetyl CoA and oxaloacetate 
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called the turnover number. The turnover number is often about 1000 substrate 
molecules processed per second per enzyme molecule, although turnover num- 
bers between 1 and 10,000 are known. 

The other kinetic parameter frequently used to characterize an enzyme is its 
K m> the concentration of substrate that allows the reaction to proceed at one- 
half its maximum rate (0.5 V max ) (see Figure 3-45). A lowK m value means that the 
enzyme reaches its maximum catalytic rate at a low concentration of substrate 
and generally indicates that the enzyme binds to its substrate very tightly, 
whereas a high K m value corresponds to weak binding. The methods used to 
characterize enzymes in this way are explained in Panel 3-3 (pp. 162-163). 



Figure 3-45 Enzyme kinetics. The rate of 
an enzyme reaction (V) increases as the 
substrate concentration increases until a 
maximum value (V max ) is reached. At this 
point all substrate-binding sites on the 
enzyme molecules are fully occupied, 
and the rate of reaction is limited by the 
rate of the catalytic process on the 
enzyme surface. For most enzymes, the 
concentration of substrate (K m ) at which 
the reaction rate is half-maximal {black 
dot) is a measure of how tightly the 
substrate is bound, with a large value of 
/C m corresponding to weak binding. 



Enzymes Speed Reactions by Selectively Stabilizing Transition 
States 

Enzymes achieve extremely high rates of chemical reaction — rates that are far 
higher than for any synthetic catalysts. There are several reasons for this effi- 
ciency First, the enzyme increases the local concentration of substrate 
molecules at the catalytic site and holds all the appropriate atoms in the correct 
orientation for the reaction that is to follow. More importantly however, some of 
the binding energy contributes directly to the catalysis. Substrate molecules 
must pass through a series of intermediate states of altered geometry and elec- 
tron distribution before they form the ultimate products of the reaction. The free 
energy required to attain the most unstable transition state is called the activa- 
tion energy fox the reaction, and it is the major determinant of the reaction rate. 
Enzymes have a much higher affinity for the transition state of the substrate 
than they have for the stable form. Because this tight binding greatly lowers the 
energies of the transition state, the enzyme greatly accelerates a particular reac- 
tion by lowering the activation energy that is required (Figure 3-46). 

By intentionally producing antibodies that act like enzymes, we can 
demonstrate that stabilizing a transition state can greatly increase a reaction 
rate. Consider, for example, the hydrolysis of an amide bond, which is similar to 
the peptide bond that joins two adjacent amino acids in a protein. In an aque- 
ous solution, an amide bond hydrolyzes very slowly by the mechanism shown 
in Figure 3-47A. In the central intermediate, or transition state, the carbonyl 
carbon is bonded to four atoms arranged at the corners of a tetrahedron. By 
generating monoclonal antibodies that bind tightly to a stable analog of this 
very unstable tetrahedral intermediate, one can obtain an antibody that func- 
tions like an enzyme (Figure 3-47B). Because this catalytic antibody binds to and 
stabilizes the tetrahedral intermediate, it increases the spontaneous rate of 
amide-bond hydrolysis more than 10,000-fold. 



Enzymes Can Use Simultaneous Acid and Base Catalysis 

Figure 3-48 compares the spontaneous reaction rates and the corresponding 
enzyme-catalyzed rates for five enzymes. Rate accelerations range from 10 9 to 
10 23 . Clearly, enzymes are much better catalysts than catalytic antibodies. 



activation energy 

for uncatalyzed reaction 




progress 
of reaction 



activation energy 
for catalyzed reaction 

Figure 3-46 Enzymatic acceleration of 
chemical reactions by decreasing the 
activation energy. Often both the 
uncatalyzed reaction (A) and the enzyme- 
catalyzed reaction (B) can go through 
several transition states. It is the 
transition state with the highest energy 
(S T and ES T ) that determines the 
activation energy and limits the rate of 
the reaction. (S = substrate; P = product 
of the reaction; ES = enzyme-substrate 
complex; EP = enzyme-product 
complex.) 
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(B) TRANSITION-STATE ANALOG FOR AMIDE HYDROLYSIS 
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Figure 3-47 Catalytic antibodies. The 

stabilization of a transition state by an 
antibody creates an enzyme. (A) The 
reaction path for the hydrolysis of an 
amide bond goes through a tetrahedral 
intermediate, the high-energy transition 
state for the reaction. (B) The molecule on 
the left was covalently linked to a protein 
and used as an antigen to generate an 
antibody that binds tightly to the region 
of the molecule shown in yellow. Because 
this antibody also bound tightly to the 
transition state in (A), it was found to 
function as an enzyme that efficiently 
catalyzed the hydrolysis of the amide 
bond in the molecule on the right. 



amide 



Enzymes not only bind tightly to a transition state, they also contain precisely 
positioned atoms that alter the electron distributions in those atoms that par- 
ticipate directly in the making and breaking of covalent bonds. Peptide bonds, 
for example, can be hydrolyzed in the absence of an enzyme by exposing a 
polypeptide to either a strong acid or a strong base, as illustrated in Figure 3-49. 
Enzymes are unique, however, in being able to use acid and base catalysis simul- 
taneously, since the rigid framework of the protein binds the acidic and basic 
residues and prevents them from combining with each other (as they would do 
in solution) (Figure 3-49D). 

The fit between an enzyme and its substrate needs to be precise. A small 
change introduced by genetic engineering in the active site of an enzyme can 
have a profound effect. Replacing a glutamic acid with an aspartic acid in one 
enzyme, for example, shifts the position of the catalytic carboxylate ion by only 
1 A (about the radius of a hydrogen atom); yet this is enough to decrease the 
activity of the enzyme a thousandfold. 



Lysozyme Illustrates How an Enzyme Works <agca> 

To demonstrate how enzymes catalyze chemical reactions, we examine an 
enzyme that acts as a natural antibiotic in egg white, saliva, tears, and other 
secretions. Lysozyme catalyzes the cutting of polysaccharide chains in the cell 
walls of bacteria. Because the bacterial cell is under pressure from osmotic 
forces, cutting even a small number of polysaccharide chains causes the cell wall 
to rupture and the cell to burst. Lysozyme is a relatively small and stable protein 



10 b years 



half-time for reaction 
1 year 1 sec 



1 1 

msec psec 



r 



i 



OMP decarboxylase 
staphylococcal nuclease 
ade nosi ne dea m i n ase 



uiosephosphaTe 
isomer ase 



carbonic 
anhydrase 



L 



J 



UNCATALYZED 



CATALYZED 



Figure 3-48 The rate accelerations 
caused by five different enzymes. 

(Adapted from A. Radzicka and 

R. Wolfenden, Science 267:90-93, 1 995. 

With permission from AAAS.) 
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WHY ANALYZE THE KINETICS OF ENZYMES? 



Enzymes are the most selective and powerful catalysts known. 
An understanding of their detailed mechanisms provides a 
critical tool for the discovery of new drugs, for the large-scale 
industrial synthesis of useful chemicals, and for appreciating 
the chemistry of cells and organisms. A detailed study of the 
rates of the chemical reactions that are catalyzed by a purified 
enzyme — more specifically how these rates change with 
changes in conditions such as the concentrations of substrates, 
products, inhibitors, and regulatory ligands — allows 



biochemists to figure out exactly how each enzyme works. 
For example, this is the way that the ATP-producing reactions 
of glycolysis, shown previously in Figure 2-72, were 
deciphered — allowing us to appreciate the rationale for this 
critical enzymatic pathway. 

In this Panel, we introduce the important field of enzyme 
kinetics, which has been indispensable for deriving much of 
the detailed knowledge that we now have about cell 
chemistry. 



STEADY-STATE ENZYME KINETICS 



Many enzymes have only one substrate, which they bind and 
then process to produce products according to the scheme 
outlined in Figure 3-50A. In this case, the reaction is written as 



E + S 



*1. *cat 

^~" ES * E + P 



At this steady state, [ES] is nearly constant, so that 



rate of ES breakdown 
*-i [ES] + k cat [ES] 



rate of ES formation 

*i [E][S] 



Here we have assumed that the reverse reaction, in which E + P 
recombine to form EP and then ES, occurs so rarely that we can 
ignore it. In this case, EP need not be represented, and we can 
express the rate of the reaction — known as its velocity, V, as 

V=k cat [ES] 

where [ES] is the concentration of the enzyme-substrate complex, 
and k cat is the turnover number, a rate constant that has a value 
equal to the number of substrate molecules processed per 
enzyme molecule each second. 

But how does the value of [ES] relate to the concentrations that 
we know directly, which are the total concentration of the 
enzyme, [EJ, and the concentration of the substrate, [S]? When 
enzyme and substrate are first mixed, the concentration [ES] will 
rise rapidly from zero to a so-called steady-state level, as 
illustrated below. 



or, since the concentration of the free enzyme, [E], is equal 
to [E ] - [ES], 



[ES] = 



*1 



*-1 + K; 



[E][S] 



fc-1 + K: 



[E ]-[ES] [S] 



Rearranging, and defining the constant K m as 

^-1 + ^cat 



we get 



[ES] 



[E P ][S] 
Km + [S] 




pre-steady 

state: 
ES forming 



time 



steady state: 
ES almost constant 



or, remembering that V= /c cat [ES], we obtain the famous 
Michaelis-Menten equation 



V = 



k cat [E ][S] 
Km + H 



As [S] is increased to higher and higher levels, essentially all of 
the enzyme will be bound to substrate at steady state; at this 
point, a maximum rate of reaction, V max , will be reached where 
v= ^max = ^cat [E ]. Thus, it is convenient to rewrite the 
Michaelis-Menten equation as 



V = 



;[S] 



Km + [51 
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THE DOUBLE-RECIPROCAL PLOT 

Atypical plot of ^versus [S] for an enzyme that follows 
rvlkhaelis-Menten kinetics is shown below. From this plot, 
neither the value of V mM nor of K m is immediately clear, 




2 4 6 

[5] mmole/liter 



THE SIGNIFICANCE OF K mt k CBV and /c C3t /K m 

As described in the text r K m is an approximate measure of 
substrate affinity for the enzyme: it is numerically equal to 
the concentration of [S] at V= 0.5 V mdJ( . In general, a lower 
value of K m means tighter substrate binding. In fact, for 
those cases where ic cat is much smaller than k_ v the K m will 
be equal to K^, the dissociation constant for substrate 
binding to the enzyme (K d = 1//C a ). 

We have seen that k^ t is the turnover number for the 
enzyme. At very low substrate concentrations, where 
[S] « K m , most of the enzyme is free. Thus we can think of 
[EI == t%i $o that the Mkhaelis-Menten equation becomes 
l/= Jt cat //C m [E][S]. Thus, the ratio k„^K m is equivalent to the 
rate constant for the reaction between free enzyme and 
free substrate, 

A comparison of k r ^ r /K m for the same enzyme with 
different substrates, or for two enzymes with their 
different substrates, is widely used as a measure of enzyme 

effectiveness, 



To obtain V majt and K m from such data, a double-reciprocal 
plot is often used, in which the Michael is-Menten equation 
has merely been rearranged, so that W can be plotted versus 
1/[S]. 



VV 
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For simplicity, in this Panel we have discussed enzymes 
that have only one substrate, such as the lysozyme enzyme 
described in the text (see p. 164). Most enzymes have two 
substrates, one of which is often an active carrier 
molecule— such as NADH or ATR 

A similar, but more complex, analysis is used to determine 
the kinetics of such enzymes— allowing the order of substrate 
binding and the presence of covalent intermediates along 
the pathway to be revealed. 




SOME ENZYMES ARE DIFFUSION LIMITED 
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The values of fc cat , K, 
enzymes are given below; 



and k €ai /K m for some selected 



enzyme 


substrate ^ 1v 
(sec 


(M) 


k at/K m 
(sec-iM - 1 ) 


acetylcholinesterase 


acetylcholine 1,4x10 4 


9x10" 5 


1,6xtO* 


catalase 


H,Oj 4x10 7 


1 


4x1 7 


fumarase 


fumarate flxlG z 


5x10^ 


1.6x10 s 



Because an enzyme and its substrate must collide before 
they can react, k c&r fK m has a maximum possible value that is 
limited by collision rates. If every collision forms an 
enzyme-substrate complex, one can calculate from diffusion 
theory that k C3t /K w will be between 10 B and 10 9 sec _1 M" T r in 
the case where all subsequent steps proceed immediately. 
Thus, it is claimed that enzymes like acetylcholinesterase and 
fumarase are "perfect enzymes," each enzyme having 
evolved to the point where nearly every collision with its 
substrate converts the substrate to a product 
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that can be easily isolated in large quantities. For these reasons, it has been 
intensively studied, and it was the first enzyme to have its structure worked out 
in atomic detail by x-ray crystallography. 

The reaction that lysozyme catalyzes is a hydrolysis: it adds a molecule of 
water to a single bond between two adjacent sugar groups in the polysaccharide 
chain, thereby causing the bond to break (see Figure 2-19). The reaction is ener- 
getically favorable because the free energy of the severed polysaccharide chain 
is lower than the free energy of the intact chain. However, the pure polysaccha- 
ride can remain for years in water without being hydrolyzed to any detectable 
degree. This is because there is an energy barrier to the reaction, as discussed in 
Chapter 2 (see Figure 2-46) . A colliding water molecule can break a bond linking 
two sugars only if the polysaccharide molecule is distorted into a particular 
shape— the transition state— in which the atoms around the bond have an 
altered geometry and electron distribution. Because of this distortion, random 
collisions must supply a very large activation energy for the reaction to take 
place. In an aqueous solution at room temperature, the energy of collisions 
almost never exceeds the activation energy. Consequently, hydrolysis occurs 
extremely slowly, if at all. 

This situation changes drastically when the polysaccharide binds to 
lysozyme. The active site of lysozyme, because its substrate is a polymer, is a long 
groove that holds six linked sugars at the same time. As soon as the polysaccha- 
ride binds to form an enzyme-substrate complex, the enzyme cuts the polysac- 
charide by adding a water molecule across one of its sugar-sugar bonds. The 
product chains are then quickly released, freeing the enzyme for further cycles 
of reaction (Figure 3-50) . 

The chemistry of the binding of lysozyme to its substrate is the same as that 
for antibody binding to its antigen— the formation of multiple noncovalent 



Figure 3-49 Acid catalysis and base 
catalysis. (A) The start of the uncatalyzed 
reaction shown in Figure 3-47A, with 
blue indicating electron distribution in 
the water and carbonyl bonds. (B) An acid 
likes to donate a proton (H + ) to other 
atoms. By pairing with the carbonyl 
oxygen, an acid causes electrons to move 
away from the carbonyl carbon, making 
this atom much more attractive to the 
electronegative oxygen of an attacking 
water molecule. (C) A base likes to take 
up H + . By pairing with a hydrogen of the 
attacking water molecule, a base causes 
electrons to move toward the water 
oxygen, making it a better attacking 
group for the carbonyl carbon. (D) By 
having appropriately positioned atoms 
on its surface, an enzyme can perform 
both acid catalysis and base catalysis at 
the same time. 



Figure 3-50 The reaction catalyzed by lysozyme. (A) The enzyme 
lysozyme (E) catalyzes the cutting of a polysaccharide chain, which is its 
substrate (S). The enzyme first binds to the chain to form an 
enzyme-substrate complex (ES) and then catalyzes the cleavage of a 
specific covalent bond in the backbone of the polysaccharide, forming an 
enzyme-product complex (EP) that rapidly dissociates. Release of the 
severed chain (the products P) leaves the enzyme free to act on another 
substrate molecule. (B) A space-filling model of the lysozyme molecule 
bound to a short length of polysaccharide chain before cleavage. 
(B, courtesy of Richard J. Feidmann.) 
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bonds. However, lysozyme holds its polysaccharide substrate in a particular way, 
so that it distorts one of the two sugars in the bond to be broken from its normal, 
most stable conformation. The bond to be broken is also held close to two 
amino acids with acidic side chains (a glutamic acid and an aspartic acid) within 
the active site. 

Conditions are thereby created in the microenvironment of the lysozyme 
active site that greatly reduce the activation energy necessary for the hydrolysis 
to take place. Figure 3-51 shows three central steps in this enzymatically cat- 
alyzed reaction. 

1 . The enzyme stresses its bound substrate, so that the shape of one sugar 
more closely resembles the shape of high-energy transition states formed 
during the reaction. 

2. The negatively charged aspartic acid reacts with the CI carbon atom on the 
distorted sugar, and the glutamic acid donates its proton to the oxygen that 
links this sugar to its neighbor. This breaks the sugar-sugar bond and 
leaves the aspartic acid side chain covalently linked to the site of bond 
cleavage. 

3. Aided by the negatively charged glutamic acid, a water molecule reacts 
with the CI carbon atom, displacing the aspartic acid side chain and com- 
pleting the process of hydrolysis. 

The overall chemical reaction, from the initial binding of the polysaccharide 
on the surface of the enzyme through the final release of the severed chains, 
occurs many millions of times faster than it would in the absence of enzyme. 

Other enzymes use similar mechanisms to lower activation energies and 
speed up the reactions they catalyze. In reactions involving two or more reao 
tants, the active site also acts like a template, or mold, that brings the substrates 
together in the proper orientation for a reaction to occur between them (Figure 



Figure 3-51 Events at the active site of 
lysozyme. <TGGT> The top left and top 
right drawings show the free substrate 
and the free products, respectively, 
whereas the other three drawings show 
the sequential events at the enzyme 
active site. Note the change in the 
conformation of sugar D in the 
enzyme-substrate complex; this shape 
change stabilizes the oxocarbenium 
ion-like transition states required for 
formation and hydrolysis of the covalent 
intermediate shown in the middle panel. 
It is also possible that a carbonium ion 
intermediate forms in step 2, as the 
covalent intermediate shown in the 
middle panel has been detected only with 
a synthetic substrate. (See DJ. Vocadlo et 
al., Nature 41 2:835-838, 2001 .) 



SUBSTRATE 

This substrate is an oligosaccharide of six sugars, 
labeled A-F. Only sugars D and E are shown in detail. 

# CH 2 OH 

CH ? OH 



PRODUCTS 

The final products are an oligosaccharide of four sugars 
(left) and a disaccharide {right), produced by hydrolysis. 



CH 2 OH 




In the enzyme-substrate complex (ES), the 
enzyme forces sugar D into a strained 
conformation, with Glu 35 positioned to serve as 
an acid that attacks the adjacent sugar-sugar 
bond by donating a proton (H + ) to sugar E, and 
Asp 52 poised to attack the C1 carbon atom. 



The Asp 52 has formed a covalent bond between 
the enzyme and the C1 carbon atom of sugar D. 
The Glu 35 then polarizes a water molecule (red), 
so that its oxygen can readily attack the C1 
carbon atom and displace Asp 52 



The reaction of the water molecule (red) 
completes the hydrolysis and returns the enzyme 
to its initial state, forming the final enzyme- 
product complex (EP). 
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(A) enzyme binds to two 
substrate molecules and 
orients them precisely to 
encourage a reaction to 
occur between them 



(B) binding of substrate 
to enzyme rearranges 
electrons in the substrate, 
creating partial negative 
and positive charges 
that favor a reaction 



(C) enzyme strains the 
bound substrate 
molecule, forcing it 
toward a transition 
state to favor a reaction 



3-52A). As we saw for lysozyme, the active site of an enzyme contains precisely 
positioned atoms that speed up a reaction by using charged groups to alter the 
distribution of electrons in the substrates (Figure 3-52B). In addition, when a 
substrate binds to an enzyme, bonds in the substrate often bend, changing the 
substrate shape. These changes, along with mechanical forces, drive a substrate 
toward a particular transition state (Figure 3-52C). Finally, like lysozyme, many 
enzymes participate intimately in the reaction by briefly forming a covalent 
bond between the substrate and a side chain of the enzyme. Subsequent steps 
in the reaction restore the side chain to its original state, so that the enzyme 
remains unchanged after the reaction (see also Figure 2-72). 



Tightly Bound Small Molecules Add Extra Functions to Proteins 

Although we have emphasized the versatility of proteins as chains of amino 
acids that perform different functions, there are many instances in which the 
amino acids by themselves are not enough. Just as humans employ tools to 
enhance and extend the capabilities of their hands, proteins often use small 
nonprotein molecules to perform functions that would be difficult or impossi- 
ble to do with amino acids alone. Thus, the signal receptor protein rhodopsin, 
which is made by the photoreceptor cells in the retina, detects light by means of 
a small molecule, retinal, embedded in the protein (Figure 3-53A). Retinal 
changes its shape when it absorbs a photon of light, and this change causes the 
protein to trigger a cascade of enzymatic reactions that eventually lead to an 
electrical signal being carried to the brain. 

Another example of a protein that contains a nonprotein portion is 
hemoglobin (see Figure 3-22). A molecule of hemoglobin carries four heme 
groups, ring-shaped molecules each with a single central iron atom (Figure 
3-53B). Heme gives hemoglobin (and blood) its red color. By binding reversibly 
to oxygen gas through its iron atom, heme enables hemoglobin to pick up oxy- 
gen in the lungs and release it in the tissues. 

Sometimes these small molecules are attached covalently and permanently 
to their protein, thereby becoming an integral part of the protein molecule itself. 
We shall see in Chapter 10 that proteins are often anchored to cell membranes 
through covalently attached lipid molecules. And membrane proteins exposed 



Figure 3-52 Some general strategies of 
enzyme catalysis. (A) Holding substrates 
together in a precise alignment. 
(B) Charge stabilization of reaction 
intermediates. (C) Applying forces that 
distort bonds in the substrate to increase 
the rate of a particular reaction. 
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Figure 3-53 Retinal and heme. (A) The 

structure of retinal, the light-sensitive 
molecule attached to rhodopsin in the 
eye. (B) The structure of a heme group. 
The carbon-containing heme ring is red 
and the iron atom at its center is orange. 
A heme group is tightly bound to each of 
the four polypeptide chains in 
hemoglobin, the oxygen-carrying protein 
whose structure is shown in Figure 3-22. 
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Table 3-2 Many Vitamins Provide Critical Coenzymes for Human Cells 


VITAMIN 


COENZYME 


ENZYME-CATALYZED REACTIONS REQUIRING 






THESE COENZYMES 


Thiamine (vitamin Bi) 


thiamine pyrophosphate 


activation and transfer of aldehydes 


Riboflavin (vitamin B2) 


FADH 


oxidation-reduction 


Niacin 


NADH, NADPH 


oxidation-reduction 


Pantothenic acid 


coenzyme A 


acyi group activation and transfer 


Pyridoxine 


pyridoxal phosphate 


amino acid activation; also glycogen phosphorylase 


Biotin 


biotin 


C0 2 activation and transfer 


Lipoic acid 


lipoamide 


acyl group activation; oxidation-reduction 


Folic acid 


tetrahydrofolate 


activation and transfer of single carbon groups 


Vitamin B12 


cobalamin coenzymes 


isomerization and methyl group transfers 



on the surface of the cell, as well as proteins secreted outside the cell, are often 
modified by the covalent addition of sugars and oligosaccharides. 

Enzymes frequently have a small molecule or metal atom tightly associated 
with their active site that assists with their catalytic function. Carboxypeptidase, 
for example, an enzyme that cuts polypeptide chains, carries a tightly bound 
zinc ion in its active site. During the cleavage of a peptide bond by carboxypep- 
tidase, the zinc ion forms a transient bond with one of the substrate atoms, 
thereby assisting the hydrolysis reaction. In other enzymes, a small organic 
molecule serves a similar purpose. Such organic molecules are often referred to 
as coenzymes. An example is biotin, which is found in enzymes that transfer a 
carboxylate group (-COO") from one molecule to another (see Figure 2-63). 
Biotin participates in these reactions by forming a transient covalent bond to the 
-COO" group to be transferred, being better suited to this function than any of 
the amino acids used to make proteins. Because it cannot be synthesized by 
humans, and must therefore be supplied in small quantities in our diet, biotin is 
a vitamin. Many other coenzymes are produced from vitamins (Table 3-2). Vita- 
mins are also needed to make other types of small molecules that are essential 
components of our proteins; vitamin A, for example, is needed in the diet to 
make retinal, the light-sensitive part of rhodopsin. 



Molecular Tunnels Channel Substrates in Enzymes with Multiple 
Catalytic Sites 

Some of the chemical reactions catalyzed by enzymes in cells produce interme- 
diates that are either very unstable or that could readily diffuse out of the cell 
through the plasma membrane if released into the cytosol. To preserve these 
intermediates, enzymes have evolved molecular tunnels that connect two or 
more active sites, allowing the intermediate to be rapidly processed to a final 
product — without ever leaving the enzyme. 

Consider, for example, the enzyme carbamoyl phosphate synthetase, which 
uses ammonia derived from glutamine plus two molecules of ATP to convert 
bicarbonate (HCO3") to carbamoyl phosphate — an important intermediate in 
several metabolic pathways (Figure 3-54). This enzyme contains three widely 
separated active sites that are connected to each other by a tunnel. The reaction 
starts at active site 2, located in the middle of the tunnel, where ATP is used to 
phosphorylate (add a phosphate group to) bicarbonate, forming carboxy phos- 
phate. This event triggers the hydrolysis of glutamine to glutamic acid at active 
site 1, releasing ammonia into the tunnel. The ammonia immediately diffuses 
through the first half of the tunnel to active site 2, where it reacts with the car- 
boxyphosphate to form carbamate. This unstable intermediate then diffuses 
through the second half of the tunnel to active site 3, where it is phosphorylated 
by ATP to the final product, carbamoyl phosphate. 
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Figure 3-54 The tunneling of reaction intermediates in the 
enzyme carbamoyl phosphate synthetase. (A) Diagram of the 
structure of the enzyme, in which a red ribbon has been used to 
outline the tunnel on the inside of the protein connecting its three 
active sites. The small and large subunits of this dimeric enzyme 
are color coded yellow and blue, respectively. (B) The path of the 
reaction. As indicated, active site 1 produces ammonia, which 
diffuses through the tunnel to active site 2, where it combines 
with carboxy phosphate to form carbamate. This highly unstable 
intermediate then diffuses through the tunnel to active site 3, 
where it is phosphorylated by ATP to produce the final product 
carbamoyl phosphate. (A, modified from F.M. Raushel, J.B. Thoden, 
and H.M. Holden, Ace. Chem. Res. 36:539-548, 2003. With 
permission from American Chemical Society.) 
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Several other well characterized enzymes contain similar molecular tunnels. 
Ammonia, a readily diffusable intermediate that might otherwise be lost from 
the cell, is the substrate most frequently channeled in the examples thus far 
known. 



Multienzyme Complexes Help to Increase the Rate of Cell 
Metabolism 

The efficiency of enzymes in accelerating chemical reactions is crucial to the 
maintenance of life. Cells, in effect, must race against the unavoidable pro- 
cesses of decay which— if left unattended— cause macromolecules to run 
downhill toward greater and greater disorder. If the rates of desirable reactions 
were not greater than the rates of competing side reactions, a cell would soon 
die. We can get some idea of the rate at which cell metabolism proceeds by 
measuring the rate of ATP utilization. A typical mammalian cell "turns over" 
(i.e., hydrolyzes and restores by phosphorylation) its entire ATP pool once every 
1 or 2 minutes. For each cell, this turnover represents the utilization of roughly 
10 7 molecules of ATP per second (or, for the human body, about 1 gram of ATP 
every minute). 
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The rates of reactions in cells are rapid because enzyme catalysis is so effec- 
tive. Many important enzymes have become so efficient that there is no possi- 
bility of further useful improvement. The factor that limits the reaction rate is no 
longer the enzyme's intrinsic speed of action; rather, it is the frequency with 
which the enzyme collides with its substrate. Such a reaction is said to be diffu- 
sion-limited (see Panel 3-3, p. 162-163). 

If an enzyme -catalyzed reaction is diffusion-limited, its rate depends on 
the concentration of both the enzyme and its substrate. If a sequence of reac- 
tions is to occur extremely rapidly, each metabolic intermediate and enzyme 
involved must be present in high concentration. However, given the enormous 
number of different reactions performed by a cell, there are limits to the con- 
centrations that can be achieved. In fact, most metabolites are present in 
micromolar (10~ 6 M) concentrations, and most enzyme concentrations are 
much lower. How is it possible, therefore, to maintain very fast metabolic rates? 

The answer lies in the spatial organization of cell components. The cell can 
increase reaction rates without raising substrate concentrations by bringing the 
various enzymes involved in a reaction sequence together to form a large pro- 
tein assembly known as a multienzyme complex (Figure 3-55). Because this 
allows the product of enzyme A to be passed directly to enzyme B, and so on, dif- 
fusion rates need not be limiting, even when the concentrations of the sub- 
strates in the cell as a whole are very low. It is perhaps not surprising, therefore, 
that such enzyme complexes are very common, and they are involved in nearly 
all aspects of metabolism — including the central genetic processes of DNA, 
RNA, and protein synthesis. In fact, few enzymes in eucaryotic cells diffuse freely 
in solution; instead, most seem to have evolved binding sites that concentrate 
them with other proteins of related function in particular regions of the cell, 
thereby increasing the rate and efficiency of the reactions that they catalyze. 

Eucaryotic cells have yet another way of increasing the rate of metabolic 
reactions: using their intracellular membrane systems. These membranes can 
segregate particular substrates and the enzymes that act on them into the same 
membrane-enclosed compartment, such as the endoplasmic reticulum or the 
cell nucleus. If, for example, a compartment occupies a total of 10% of the vol- 
ume of the cell, the concentration of reactants in that compartment may be 
increased by 10 times compared with a cell with the same number of enzymes 
and substrate molecules, but no compartmentalization. Reactions limited by 
the speed of diffusion can thereby be speeded up by a factor of 10. 



The Cell Regulates the Catalytic Activities of its Enzymes 

A living cell contains thousands of enzymes, many of which operate at the same 
time and in the same small volume of the cytosol. By their catalytic action, these 
enzymes generate a complex web of metabolic pathways, each composed of 
chains of chemical reactions in which the product of one enzyme becomes the 
substrate of the next. In this maze of pathways, there are many branch points 
(nodes) where different enzymes compete for the same substrate. The system is 
so complex (see Figure 2-88) that elaborate controls are required to regulate 
when and how rapidly each reaction occurs. 
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Figure 3-55 The structure of pyruvate 
dehydrogenase. This enzyme complex 
catalyzes the conversion of pyruvate to 
acetyl CoA, as part of the pathway that 
oxidizes sugars to C0 2 and H 2 (see Figure 
2-79). It is an example of a large 
multienzyme complex in which reaction 
intermediates are passed directly from one 
enzyme to another. 
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Regulation occurs at many levels. At one level, the cell controls how many 
molecules of each enzyme it makes by regulating the expression of the gene that 
encodes that enzyme (discussed in Chapter 7). The cell also controls enzymatic 
activities by confining sets of enzymes to particular subcellular compartments, 
enclosed by distinct membranes (discussed in Chapters 12 and 14). As will be 
discussed later in this chapter, enzymes are frequently covalently modified to 
control their activity. The rate of protein destruction by targeted proteolysis rep- 
resents yet another important regulatory mechanism (see p. 395). But the most 
general process that adjusts reaction rates operates through a direct, reversible 
change in the activity of an enzyme in response to the specific small molecules 
that it encounters. 

The most common type of control occurs when a molecule other than one 
of the substrates binds to an enzyme at a special regulatory site outside the 
active site, thereby altering the rate at which the enzyme converts its substrates 
to products. For example, in feedback inhibition a product produced late in a 
reaction pathway inhibits an enzyme that acts earlier in the pathway. Thus, 
whenever large quantities of the final product begin to accumulate, this product 
binds to the enzyme and slows down its catalytic action, thereby limiting the fur- 
ther entry of substrates into that reaction pathway (Figure 3-56). Where path- 
ways branch or intersect, there are usually multiple points of control by differ- 
ent final products, each of which works to regulate its own synthesis (Figure 
3-57). Feedback inhibition can work almost instantaneously, and it is rapidly 
reversed when the level of the product falls. 




negative 

regulation 



Figure 3-56 Feedback inhibition of a 
single biosynthetic pathway. The end- 
product Z inhibits the first enzyme that is 
unique to its synthesis and thereby 
controls its own level in the cell. This is an 
example of negative regulation. 
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Figure 3-57 Multiple feedback 
inhibition. In this example, which shows 
the biosynthetic pathways for four 
different amino acids in bacteria, the red 
arrows indicate positions at which 
products feed back to inhibit enzymes. 
Each amino acid controls the first enzyme 
specific to its own synthesis, thereby 
controlling its own levels and avoiding a 
wasteful, or even dangerous, buildup of 
intermediates. The products can also 
separately inhibit the initial set of 
reactions common to all the syntheses; in 
this case, three different enzymes 
catalyze the initial reaction, each 
inhibited bv a different oroduct. 
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Feedback inhibition is negative regulation: it prevents an enzyme from act- 
ing. Enzymes can also be subject to positive regulation, in which a regulatory 
molecule stimulates the enzyme's activity rather than shutting the enzyme 
down. Positive regulation occurs when a product in one branch of the metabolic 
network stimulates the activity of an enzyme in another pathway. As one exam- 
ple, the accumulation of ADP activates several enzymes involved in the oxida- 
tion of sugar molecules, thereby stimulating the cell to convert more ADP to ATP. 

Allosteric Enzymes Have Two or More Binding Sites That Interact 

A striking feature of both positive and negative feedback regulation is that the reg- 
ulatory molecule often has a shape totally different from the shape of the substrate 
of the enzyme. This is why the effect on a protein is termed allostery (from the 
Greek words alios, meaning "other," and stereos, meaning "solid" or "three-dimen- 
sional"). As biologists learned more about feedback regulation, they recognized 
that the enzymes involved must have at least two different binding sites on their 
surface — an active site that recognizes the substrates, and a regulatory site that 
recognizes a regulatory molecule. These two sites must somehow communicate so 
that the catalytic events at the active site can be influenced by the binding of the 
regulatory molecule at its separate site on the protein's surface. 

The interaction between separated sites on a protein molecule is now 
known to depend on a conformational change in the protein: binding at one of 
the sites causes a shift from one folded shape to a slightly different folded shape. 
During feedback inhibition, for example, the binding of an inhibitor at one site 
on the protein causes the protein to shift to a conformation that incapacitates its 
active site, located elsewhere in the protein. 

It is thought that most protein molecules are allosteric. They can adopt two 
or more slightly different conformations, and a shift from one to another caused 
by the binding of a ligand can alter their activity. This is true not only for 
enzymes but also for many other proteins, including receptors, structural pro- 
teins, and motor proteins. In all instances of allosteric regulation, each confor- 
mation of the protein has somewhat different surface contours, and the protein's 
binding sites for ligands are altered when the protein changes shape. Moreover 
as we discuss next, each ligand will stabilize the conformation that it binds to 
most strongly, and thus — at high enough concentrations— will tend to "switch" 
the protein toward the conformation that the ligand prefers. 

Two Ligands Whose Binding Sites Are Coupled Must Reciprocally 
Affect Each Other's Binding 

The effects of ligand binding on a protein follow from a fundamental chemical 
principle known as linkage. Suppose, for example, that a protein that binds glu- 
cose also binds another molecule, X, at a distant site on the protein's surface. If 
the binding site for X changes shape as part of the conformational change 
induced by glucose binding, the binding sites for X and for glucose are said to be 
coupled. Whenever two ligands prefer to bind to the same conformation of an 
allosteric protein, it follows from basic thermodynamic principles that each lig- 
and must increase the affinity of the protein for the other. Thus, if the shift of the 
protein in Figure 3-58 to the closed conformation that binds glucose best also 
causes the binding site for X to fit X better, then the protein will bind glucose 
more tightly when X is present than when X is absent. 

Conversely, linkage operates in a negative way if two ligands prefer to bind 
to different conformations of the same protein. In this case, the binding of the 
first ligand discourages the binding of the second ligand. Thus, if a shape change 
caused by glucose binding decreases the affinity of a protein for molecule X, the 
binding of X must also decrease the protein's affinity for glucose (Figure 3-59). 
The linkage relationship is quantitatively reciprocal, so that, for example, if glu- 
cose has a very large effect on the binding of X, X has a very large effect on the 
binding of glucose. 
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The relationships shown in Figures 3-58 and 3-59 apply to all proteins, and 
they underlie all of cell biology. They seem so obvious in retrospect that we now 
take it for granted. But the discovery of linkage in studies of a few enzymes in the 
1950s, followed by an extensive analysis of allosteric mechanisms in proteins in 
the early 1960s, had a revolutionary effect on our understanding of biology. 
Since molecule X in these examples binds at a site on the enzyme that is distinct 
from the site where catalysis occurs, it need have no chemical relationship to 
glucose or to any other ligand that binds at the active site. Moreover, as we have 
just seen, for enzymes that are regulated in this way, molecule X can either turn 
the enzyme on (positive regulation) or turn it off (negative regulation). By such 
a mechanism, allosteric proteins serve as general switches that, in principle, 
allow one molecule in a cell to affect the fate of any other. 



Figure 3-58 Positive regulation caused 
by conformational coupling between 
two distant binding sites. In this 
example, both glucose and molecule X 
bind best to the closed conformation of a 
protein with two domains. Because both 
glucose and molecule X drive the protein 
toward its closed conformation, each 
ligand helps the other to bind. Glucose 
and molecule X are therefore said to bind 
cooperatively to the protein. 



Symmetric Protein Assemblies Produce Cooperative Allosteric 
Transitions 

A single-subunit enzyme that is regulated by negative feedback can at most 
decrease from 90% to about 10% activity in response to a 100-fold increase in the 
concentration of an inhibitory ligand that it binds (Figure 3-60, red line). 
Responses of this type are apparently not sharp enough for optimal cell regula- 
tion, and most enzymes that are turned on or off by ligand binding consist of 
symmetric assemblies of identical subunits. With this arrangement, the binding 
of a molecule of ligand to a single site on one subunit can promote an allosteric 
change in the entire assembly that helps the neighboring subunits bind the 
same ligand. As a result, a cooperative allosteric transition occurs (Figure 3-60, 
blue line), allowing a relatively small change in ligand concentration in the cell 
to switch the whole assembly from an almost fully active to an almost fully inac- 
tive conformation (or vice versa). 
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Figure 3-59 Negative regulation caused 
by conformational coupling between 
two distant binding sites. The scheme 
here resembles that in the previous 
figure, but here molecule X prefers the 
open conformation, while glucose prefers 
the closed conformation. Because glucose 
and molecule X drive the protein toward 
opposite conformations (closed and 
open, respectively), the presence of 
either ligand interferes with the binding 
of the other. 
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The principles involved in a cooperative "all-or-none" transition are the 
same for all proteins, whether or not they are enzymes. But they are perhaps eas- 
iest to visualize for an enzyme that forms a symmetric dimer. In the example 
shown in Figure 3-61, the first molecule of an inhibitory ligand binds with great 
difficulty since its binding disrupts an energetically favorable interaction 
between the two identical monomers in the dimer. A second molecule of 
inhibitory ligand now binds more easily, however, because its binding restores 
the energetically favorable monomer-monomer contacts of a symmetric dimer 
(this also completely inactivates the enzyme). 

As an alternative to this induced fit model for a cooperative allosteric transi- 
tion, we can view such a symmetrical enzyme as having only two possible con- 
formations, corresponding to the "enzyme on" and "enzyme off" structures in 
Figure 3-61. In this view, ligand binding perturbs an all-or-none equilibrium 
between these two states, thereby changing the proportion of active molecules. 
Both models represent true and useful concepts; it is the second model that we 
shall describe next. 



Figure 3-60 Enzyme activity versus the 
concentration of inhibitory ligand for 
single-subunit and multisubunit 
allosteric enzymes. For an enzyme with a 
single subunit (red line), a drop from 90% 
enzyme activity to 10% activity (indicated 
by the two dots on the curve) requires a 
100-fold increase in the concentration of 
inhibitor. The enzyme activity is 
calculated from the simple equilibrium 
relationship K = [IP]/[I][P], where P is 
active protein, I is inhibitor, and IP is the 
inactive protein bound to inhibitor. An 
identical curve applies to any simple 
binding interaction between two 
molecules, A and B. In contrast, a 
multisubunit allosteric enzyme can 
respond in a switchlike manner to a 
change in ligand concentration: the steep 
response is caused by a cooperative 
binding of the ligand molecules, as 
explained in Figure 3-61 . Here, the green 
line represents the idealized result 
expected for the cooperative binding of 
two inhibitory ligand molecules to an 
allosteric enzyme with two subunits, and 
the blue line shows the idealized 
response of an enzyme with four 
subunits. As indicated by the two dots on 
each of these curves, the more complex 
enzymes drop from 90% to 1 0% activity 
over a much narrower range of inhibitor 
concentration than does the enzyme 
composed of a single subunit. 



The Allosteric Transition in Aspartate Transcarbamoylase Is 
Understood in Atomic Detail 

One enzyme used in the early studies of allosteric regulation was aspartate 
transcarbamoylase from E. coll It catalyzes the important reaction that begins 
the synthesis of the pyrimidine ring of C, U, and T nucleotides: carbamoyl phos- 
phate + aspartate -> Af-carbamoylaspartate. One of the final products of this 
pathway cytosine triphosphate (CTP), binds to the enzyme to turn it off when- 
ever CTP is plentiful. 

Aspartate transcarbamoylase is a large complex of six regulatory and six cat- 
alytic subunits. The catalytic subunits form two trimers, each arranged in the 
shape of an equilateral triangle; the two trimers face each other and are held 



Figure 3-61 A cooperative allosteric transition in an enzyme composed 
of two identical subunits. This diagram illustrates how the conformation of 
one subunit can influence that of its neighbor. The binding of a single 
molecule of an inhibitory ligand (yellow) to one subunit of the enzyme 
occurs with difficulty because it changes the conformation of this subunit 
and thereby disrupts the symmetry of the enzyme. Once this 
conformational change has occurred, however, the energy gained by 
restoring the symmetric pairing interaction between the two subunits 
makes it especially easy for the second subunit to bind the inhibitory 
ligand and undergo the same conformational change. Because the binding 
of the first molecule of ligand increases the affinity with which the other 
subunit binds the same ligand, the response of the enzyme to changes in 
the concentration of the ligand is much steeper than the response of an 
enzyme with only one subunit (see Figure 3-60). 
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Figure 3-62 The transition between 
R and T states in the enzyme aspartate 
transcarbamoylase. <CTAA> The enzyme 
consists of a complex of six catalytic 
subunits and six regulatory subunits, and 
the structures of its inactive (T state) and 
active (R state) forms have been 
determined by x-ray crystallography. The 
enzyme is turned off by feedback 
inhibition when CTP concentrations rise. 
Each regulatory subunit can bind one 
molecule of CTP, which is one of the final 
products in the pathway. By means of this 
negative feedback regulation, the pathway 
is prevented from producing more CTP 
than the cell needs. (Based on K.L. Krause, 
K.W. Volz and W.N. Lipscomb, Proc. Natl 
Acad. Sci. U.S.A. 82:1 643-1 647, 1 985. 
With permission from National Academy 
of Sciences.) 



together by three regulatory dimers that form a bridge between them. The entire 
molecule is poised to undergo a concerted, all-or-none, allosteric transition 
between two conformations, designated as T (tense) and R (relaxed) states (Fig- 
ure 3-62). 

The binding of substrates (carbamoyl phosphate and aspartate) to the cat- 
alytic trimers drives aspartate transcarbamoylase into its catalytically active R 
state, from which the regulatory CTP molecules dissociate. By contrast, the 
binding of CTP to the regulatory dimers converts the enzyme to the inactive T 
state, from which the substrates dissociate. This tug-of-war between CTP and 
substrates is identical in principle to that described previously in Figure 3-59 for 
a simpler allosteric protein. But because the tug-of-war occurs in a symmetric 
molecule with multiple binding sites, the enzyme undergoes a cooperative 
allosteric transition that will turn it on suddenly as substrates accumulate (form- 
ing the R state) or shut it off rapidly when CTP accumulates (forming the T state) . 

A combination of biochemistry and x-ray crystallography has revealed many 
fascinating details of this allosteric transition. Each regulatory subunit has two 
domains, and the binding of CTP causes the two domains to move relative to 
each other, so that they function like a lever that rotates the two catalytic trimers 
and pulls them closer together into the T state (see Figure 3-62). When this 
occurs, hydrogen bonds form between opposing catalytic subunits. This helps 
widen the cleft that forms the active site within each catalytic subunit, thereby 
disrupting the binding sites for the substrates (Figure 3-63). Adding large 
amounts of substrate has the opposite effect, favoring the R state by binding in 
the cleft of each catalytic subunit and opposing the above conformational 
change. Conformations that are intermediate between R and T are unstable, so 
that the enzyme mostly clicks back and forth between its R and T forms, pro- 
ducing a mixture of these two species in proportions that depend on the relative 
concentrations of CTP and substrates. 
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Figure 3-63 Part of the on-off switch in 
the catalytic subunits of aspartate 
transcarbamoylase. Changes in the 
indicated hydrogen-bonding interactions 
are partly responsible for switching this 
enzyme's active site between active 
(yellow) and inactive conformations. 
Hydrogen bonds are indicated by thin red 
lines. The amino acids involved in the 
subunit-subunit interaction in the T state 
are shown in red, while those that form 
the active site of the enzyme in the R state 
are shown in blue. The large drawings 
show the catalytic site in the interior of 
the enzyme; the boxed sketches show the 
same subunits viewed from the enzyme's 
external surface. (Adapted from 
E.R. Kantrowitz and W.N. Lipscomb, Trends 
Biochem.Sci. 15:53-59, 1990. With 
permission from Elsevier.) 
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Many Changes in Proteins Are Driven by Protein Phosphorylation 

Proteins are regulated by more than the reversible binding of other molecules. A 
second method that eucaryotic cells use to regulate a protein's function is the 
covalent addition of a smaller molecule to one or more of its amino acid side 
chains. The most common such regulatory modification in higher eucaryotes is 
the addition of a phosphate group. We shall therefore use protein phosphoryla- 
tion to illustrate some of the general principles involved in the control of protein 
function through the modification of amino acid side chains. 

A phosphorylation event can affect the protein that is modified in two 
important ways. First, because each phosphate group carries two negative 
charges, the enzyme-catalyzed addition of a phosphate group to a protein can 
cause a major conformational change in the protein by, for example, attracting 
a cluster of positively charged amino acid side chains. This can, in turn, affect 
the binding of ligands elsewhere on the protein surface, dramatically changing 
the protein's activity. When a second enzyme removes the phosphate group, the 
protein returns to its original conformation and restores its initial activity. 

Second, an attached phosphate group can form part of a structure that the 
binding sites of other proteins recognize. As previously discussed, certain pro- 
tein domains, sometimes referred to as modules, appear very frequently as parts 
of larger proteins. One such module is the SH2 domain, described earlier, which 
binds to a short peptide sequence containing a phosphorylated tyrosine side 
chain (see Figure 3-39B). More than ten other common domains provide bind- 
ing sites for attaching their protein to phosphorylated peptides in other protein 
molecules, each recognizing a phosphorylated amino acid side chain in a differ- 
ent protein context. As a result, protein phosphorylation and dephosphoryla- 
tion very often drive the regulated assembly and disassembly of protein com- 
plexes (see Figure 15-22). 

Reversible protein phosphorylation controls the activity, structure, and 
cellular localization of both enzymes and many other types of proteins in 
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eucaryotic cells. In fact, this regulation is so extensive that more than one-third 
of the 10,000 or so proteins in a typical mammalian cell are thought to be phos- 
phorylated at any given time — many with more than one phosphate. As might 
be expected, the addition and removal of phosphate groups from specific pro- 
teins often occur in response to signals that specify some change in a cell's state. 
For example, the complicated series of events that takes place as a eucaryotic 
cell divides is largely timed in this way (discussed in Chapter 17), and many of 
the signals mediating cell-cell interactions are relayed from the plasma mem- 
brane to the nucleus by a cascade of protein phosphorylation events (discussed 
in Chapter 15). 



A Eucaryotic Cell Contains a Large Collection of Protein Kinases 
and Protein Phosphatases 

Protein phosphorylation involves the enzyme-catalyzed transfer of the terminal 
phosphate group of an ATP molecule to the hydroxyl group on a serine, threo- 
nine, or tyrosine side chain of the protein (Figure 3-64). A protein kinase cat- 
alyzes this reaction, and the reaction is essentially unidirectional because of the 
large amount of free energy released when the phosphate-phosphate bond in 
ATP is broken to produce ADP (discussed in Chapter 2). A protein phosphatase 
catalyzes the reverse reaction of phosphate removal, or dephosphorylation. Cells 
contain hundreds of different protein kinases, each responsible for phosphory- 
lating a different protein or set of proteins. There are also many different protein 
phosphatases; some are highly specific and remove phosphate groups from only 
one or a few proteins, whereas others act on a broad range of proteins and are 
targeted to specific substrates by regulatory subunits. The state of phosphoryla- 
tion of a protein at any moment, and thus its activity, depends on the relative 
activities of the protein kinases and phosphatases that modify it. 

The protein kinases that phosphorylate proteins in eucaryotic cells belong 
to a very large family of enzymes, which share a catalytic (kinase) sequence of 
about 290 amino acids. The various family members contain different amino 
acid sequences on either end of the kinase sequence (for example, see Figure 
3-10), and often have short amino acid sequences inserted into loops within it 
(red arrowheads in Figure 3-65) . Some of these additional amino acid sequences 
enable each kinase to recognize the specific set of proteins it phosphorylates, or 
to bind to structures that localize it in specific regions of the cell. Other parts of 
the protein regulate the activity of each kinase, so it can be turned on and off in 
response to different specific signals, as described below. 

By comparing the number of amino acid sequence differences between the 
various members of a protein family, we can construct an "evolutionary tree" 
that is thought to reflect the pattern of gene duplication and divergence that 
gave rise to the family. Figure 3-66 shows an evolutionary tree of protein 
kinases. Kinases with related functions are often located on nearby branches of 
the tree: the protein kinases involved in cell signaling that phosphorylate tyro- 
sine side chains, for example, are all clustered in the top left corner of the tree. 
The other kinases shown phosphorylate either a serine or a threonine side 
chain, and many are organized into clusters that seem to reflect their function- 
in transmembrane signal transduction, intracellular signal amplification, cell- 
cycle control, and so on. 

Figure 3-65 The three-dimensional structure of a protein kinase. 

Superimposed on this structure are red arrowheads to indicate sites where 
insertions of 5-1 00 amino acids are found in some members of the protein 
kinase family. These insertions are located in loops on the surface of the 
enzyme where other ligands interact with the protein. Thus, they 
distinguish different kinases and confer on them distinctive interactions 
with other proteins. The ATP (which donates a phosphate group) and the 
peptide to be phosphorylated are held in the active site, which extends 
between the phosphate-binding loop (yellow) and the catalytic loop 
(orange). See also Figure 3-10. (Adapted from D.R. Knighton et al., Science 
253:407-414, 1991. With permission from AAAS.) 
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Figure 3-64 Protein phosphorylation. 

Many thousands of proteins in a typical 
eucaryotic cell are modified by the covalent 
addition of a phosphate group. 

(A) The general reaction, shown here, 
transfers a phosphate group from ATP to an 
amino acid side chain of the target protein 
by a protein kinase. Removal of the 
phosphate group is catalyzed by a second 
enzyme, a protein phosphatase. In this 
example, the phosphate is added to a serine 
side chain; in other cases, the phosphate is 
instead linked to the -OH group of a 
threonine or a tyrosine in the protein. 

(B) The phosphorylation of a protein by a 
protein kinase can either increase or 
decrease the protein's activity, depending 
on the site of phosphorylation and the 
structure of the protein. 
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As a result of the combined activities of protein kinases and protein phos- 
phatases, the phosphate groups on proteins are continually turning over — being 
added and then rapidly removed. Such phosphorylation cycles may seem waste- 
ful, but they are important in allowing the phosphorylated proteins to switch 
rapidly from one state to another: the more rapid the cycle, the faster a popula- 
tion of protein molecules can change its state of phosphorylation in response to 
a sudden change in the phosphorylation rate (see Figure 15-11). The energy 
required to drive this phosphorylation cycle is derived from the free energy of 
ATP hydrolysis, one molecule of which is consumed for each phosphorylation 
event. 



Figure 3-66 An evolutionary tree of 
selected protein kinases. Although a 
higher eucaryotic cell contains hundreds 
of such enzymes, and the human 
genome codes for more than 500, only 
some of those discussed in this book are 
shown. 



The Regulation of Cdk and Src Protein Kinases Shows How a 
Protein Can Function as a Microchip 

The hundreds of different protein kinases in a eucaryotic cell are organized into 
complex networks of signaling pathways that help to coordinate the cell's activ- 
ities, drive the cell cycle, and relay signals into the cell from the cell's environ- 
ment. Many of the extracellular signals involved need to be both integrated and 
amplified by the cell. Individual protein kinases (and other signaling proteins) 
serve as input-output devices, or "microchips," in the integration process. An 
important part of the input to these signal processing proteins comes from the 
control that is exerted by phosphates added and removed from them by protein 
kinases and protein phosphatases, respectively. 

In general, specific sets of phosphate groups serve to activate the protein, 
while other sets can inactivate it. A cyclin-dependent protein kinase (Cdk) pro- 
vides a good example. Kinases in this class phosphorylate serines and threonines, 
and they are central components of the cell-cycle control system in eucaryotic 
cells, as discussed in detail in Chapter 17. In a vertebrate cell, individual Cdk pro- 
teins turn on and off in succession, as a cell proceeds through the different 
phases of its division cycle. When a particular kinase is on, it influences various 
aspects of cell behavior through effects on the proteins it phosphorylates. 

A Cdk protein becomes active as a serine/threonine protein kinase only 
when it is bound to a second protein called a cyclin. But, as Figure 3-67 shows, 
the binding of cyclin is only one of three distinct "inputs" required to activate the 
Cdk. In addition to cyclin binding, a phosphate must be added to a specific thre- 
onine side chain, and a phosphate elsewhere in the protein (covalently bound to 
a specific tyrosine side chain) must be removed. Cdk thus monitors a specific set 
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Figure 3-67 How a Cdk protein acts as 
an integrating device. <TAGA>The 
function of these central regulators of the 
cell cycle is discussed in Chapter 17. 
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of cell components— a cyclin, a protein kinase, and a protein phosphatase — and 
it acts as an input-output device that turns on if, and only if, each of these com- 
ponents has attained its appropriate activity state. Some cyclins rise and fall in 
concentration in step with the cell cycle, increasing gradually in amount until 
they are suddenly destroyed at a particular point in the cycle. The sudden 
destruction of a cyclin (by targeted proteolysis) immediately shuts off its partner 
Cdk enzyme, and this triggers a specific step in the cell cycle. 

"Hie Src family of protein kinases (see Figure 3-lOJ exhibits a similar type of 
microchip behavior. The Src protein {pronounced "sarc" and named for the type 
of tumor, a sarcoma, that its deregulation can cause) was the first tyrosine 
kinase to be discovered, ft is now known to be part of a subfamily of nine very 
similar protein kinases, which are round only in multicellular animals, As indi- 
cated by the evolutionary tree in Figure 3-66, sequence comparisons suggest 
that tyrosine kinases as a group were a relatively late innovation that branched 
off from the serine/threonine kinases, with the Src subfamily being only one 
subgroup of the tyrosine kinases created in this way 

The Src protein and its relatives contain a short N-terminal region that 
becomes covalently linked to a strongly hydrophobic fatty acid, which holds the 
kinase at the cytoplasmic face of the plasma membrane. Next come two pep- 
tide-binding modules, a Src homology 3 (SH3) domain and a SH2 domain, fol- 
lowed by the kinase catalytic domain (Figure 3-68). These kinases normally exist 
in an inactive conformation, in which a phosphorylated tyrosine near the C-ter- 
minus is bound to the SH2 domain, and the SH3 domain is bound to an internal 
peptide in a way that distorts the active site of the enzyme and helps to render it 
inactive. 

Turning the kinase on involves at least two specific inputs: removal of the C- 
terminal phosphate and the binding of the SH3 domain by a specific activating 
protein (Figure 3-69). like the activation of the Cdk protein, liie activation of 
the Src kinase signals the completion of a particular set of separate upstream 
events [Figure 3-70). Thus, both the Cdk and Src families of proteins serve as 
specific signal integrators, helping to generate the complex web of information- 
processing events that enable the cell to compute logical responses to a complex 
set of conditions. 



Figure 3-68 The domain structure of the 
Src family of protein kinases, mapped 
along the amino acid sequence. For the 

three-dimensional structure of Src, see 
Figure 3-10. 



Proteins That Bind and Hydrolyze GTP Are Ubiquitous Cellular 
Regulators 

We have described how the addition or removal of phosphate groups on a protein 
can be used by a cell to control the protein's activity. In the examples discussed so 



activating 



Figure 3-69 The activation of a Src-type 
protein kinase by two sequential events. 

(Adapted from S.C. Harrison et al., Cell 
1 1 2:737-740, 2003. With permission from 
Elsevier.) 
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far, the phosphate is transferred from an ATP molecule to an amino acid side 
chain of the protein in a reaction catalyzed by a specific protein kinase. Eucary- 
otic cells also have another way to control protein activity by phosphate addi- 
tion and removal. In this case, the phosphate is not attached directly to the pro- 
tein; instead, it is a part of the guanine nucleotide GTP, which binds very tightly 
to the protein. In general, proteins regulated in this way are in their active con- 
formations with GTP bound. The loss of a phosphate group occurs when the 
bound GTP is hydrolyzed to GDP in a reaction catalyzed by the protein itself, 
and in its GDP-bound state the protein is inactive. In this way, GTP-binding pro- 
teins act as on-off switches whose activity is determined by the presence or 
absence of an additional phosphate on a bound GDP molecule (Figure 3-71). 

GTP-binding proteins (also called GTPases because of the GTP hydrolysis 
they catalyze) comprise a large family of proteins that all contain variations on 
the same GTP-binding globular domain. When the tightly bound GTP is 
hydrolyzed to GDP, this domain undergoes a conformational change that inac- 
tivates it. The three-dimensional structure of a prototypical member of this fam- 
ily, the monomeric GTPase called Ras, is shown in Figure 3-72. 

The Ras protein has an important role in cell signaling (discussed in Chap- 
ter 15). In its GTP-bound form, it is active and stimulates a cascade of protein 
phosphorylations in the cell. Most of the time, however, the protein is in its inac- 
tive, GDP-bound form. It becomes active when it exchanges its GDP for a GTP 
molecule in response to extracellular signals, such as growth factors, that bind to 
receptors in the plasma membrane (see Figure 15-58). 



Regulatory Proteins Control the Activity of GTP-Binding Proteins 
by Determining Whether GTP or GDP Is Bound 

GTP-binding proteins are controlled by regulatory proteins that determine 
whether GTP or GDP is bound, just as phosphorylated proteins are turned on 
and off by protein kinases and protein phosphatases. Thus, Ras is inactivated by 
a GTPase-activating protein (GAP), which binds to the Ras protein and induces 
it to hydro lyze its bound GTP molecule to GDP— which remains tightly bound— 
and inorganic phosphate (Pj), which is rapidly released. The Ras protein stays in 
its inactive, GDP-bound conformation until it encounters a guanine nucleotide 
exchange factor (GEF), which binds to GDP-Ras and causes it to release its GDP 
Because the empty nucleotide -binding site is immediately filled by a GTP 
molecule (GTP is present in large excess over GDP in cells), the GEF activates Ras 
by indirectly adding back the phosphate removed by GTP hydrolysis. Thus, in a 
sense, the roles of GAP and GEF are analogous to those of a protein phosphatase 
and a protein kinase, respectively (Figure 3-73). 
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Figure 3-70 How a Src-type protein 
kinase acts as an integrating device. The 

disruption of the SH3 domain interaction 
(green) involves replacing its binding to 
the indicated red linker region with a 
tighter interaction with an activating 
ligand, as illustrated in Figure 3-69. 



Large Protein Movements Can Be Generated From Small Ones 

The Ras protein belongs to a large superfamily of monomeric GTPases, each of 
which consists of a single GTP-binding domain of about 200 amino acids. Over 
the course of evolution, this domain has also become joined to larger proteins 
with additional domains, creating a large family of GTP-binding proteins. Fam- 
ily members include the receptor-associated trimeric G proteins involved in 
cell signaling (discussed in Chapter 15), proteins regulating the traffic of vesi- 
cles between intracellular compartments (discussed in Chapter 13), and pro- 
teins that bind to transfer RNA and are required as assembly factors for protein 
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Figure 3-71 GTP-binding proteins as 
molecular switches. The activity of a 
GTP-binding protein (also called a 
GTPase) generally requires the presence 
of a tightly bound GTP molecule (switch 
"on"). Hydrolysis of this GTP molecule 
produces GDP and inorganic phosphate 
(Pi), and it causes the protein to convert 
to a different, usually inactive, 
conformation (switch "off"). As shown 
here, resetting the switch requires the 
tightly bound GDP to dissociate, a slow 
step that is greatly accelerated by specific 
signals; once the GDP has dissociated, a 
molecule of GTP is quickly rebound. 



180 Chapter 3: Proteins 



COOH 




switch 
helix 



Figure 3-72 The structure of the Ras 
protein in its GTP-bound form. <GAAC> 
This monomeric GTPase illustrates the 
structure of a GTP-binding domain, which 
is present in a large family of GTP-binding 
proteins. The red regions change their 
conformation when the GTP molecule is 
hydrolyzed to GDP and inorganic 
phosphate by the protein; the GDP 
remains bound to the protein, while the 
inorganic phosphate is released. The 
special role of the "switch helix" in 
proteins related to Ras is explained next 
(see Figure 3-75). 
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synthesis on the ribosome (discussed in Chapter 6). In each case, an important 
biological activity is controlled by a change in the protein's conformation that is 
caused by GTP hydrolysis in a Ras-like domain. 

The EF-Tu protein provides a good example of how this family of proteins 
works. EF-Tu is an abundant molecule that serves as an elongation factor (hence 
the EF) in protein synthesis, loading each aminoacyl tRNA molecule onto the 
ribosome. The tRNA molecule forms a tight complex with the GTP-bound form 
of EF-Tu (Figure 3-74). In this complex, the amino acid attached to the tRNA is 
improperly positioned for protein synthesis. The tRNA can transfer its amino 
acid only after the GTP bound to EF-Tu is hydrolyzed on the ribosome, allowing 
the EF-Tu to dissociate. Since the GTP hydrolysis is triggered by a proper fit of 
the tRNA to the mRNA molecule on the ribosome, the EF-Tu serves as a factor 
that discriminates between correct and incorrect mRNA-tRNA pairings (see Fig- 
ure 6-67 for a further discussion of this function of EF-Tu). 

By comparing the three-dimensional structure of EF-Tu in its GTP-bound 
and GDP-bound forms, we can see how the repositioning of the tRNA occurs. 
The dissociation of the inorganic phosphate group (PJ, which follows the reac- 
tion GTP -> GDP + Pi, causes a shift of a few tenths of a nanometer at the GTP- 
binding site, just as it does in the Ras protein. This tiny movement, equivalent to 
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Figure 3-73 A comparison of the two 
major intracellular signaling 
mechanisms in eucaryotic cells. In both 
cases, a signaling protein is activated by 
the addition of a phosphate group and 
inactivated by the removal of this 
phosphate. To emphasize the similarities 
in the two pathways, ATP and GTP are 
drawn as APPP and GPPP, and ADP and 
GDP as APP and GPP, respectively. As 
shown in Figure 3-64, the addition of a 
phosphate to a protein can also be 
inhibitory. 
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Figure 3-74 An aminoacyl tRNA molecule bound to EF-Tu. The three 
domains of the EF-Tu protein are colored differently, to match Figure 3-75. 
This is a bacterial protein; however, a very similar protein exists in 
eucaryotes, where it is called EF-1 . (Coordinates determined by P. Nissen et 
al., Science 270:1464-1472, 1995. With permission from AAAS.) 



a few times the diameter of a hydrogen atom, causes a conformational change 
to propagate along a crucial piece of a helix, called the switch helix, in the Ras- 
like domain of the protein. The switch helix seems to serve as a latch that 
adheres to a specific site in another domain of the molecule, holding the protein 
in a "shut" conformation. The conformational change triggered by GTP hydrol- 
ysis causes the switch helix to detach, allowing separate domains of the protein 
to swing apart, through a distance of about 4 nm. This releases the bound tRNA 
molecule, allowing its attached amino acid to be used (Figure 3-75). 

Notice in this example how cells have exploited a simple chemical change 
that occurs on the surface of a small protein domain to create a movement 50 
times larger. Dramatic shape changes of this type also cause the very large move- 
ments that occur in motor proteins, as we discuss next. 



Motor Proteins Produce Large Movements in Cells 

We have seen that conformational changes in proteins have a central role in 
enzyme regulation and cell signaling. We now discuss proteins whose major 
function is to move other molecules. These motor proteins generate the forces 
responsible for muscle contraction and the crawling and swimming of cells. 
Motor proteins also power smaller-scale intracellular movements: they help to 
move chromosomes to opposite ends of the cell during mitosis (discussed in 
Chapter 17), to move organelles along molecular tracks within the cell (discussed 
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Figure 3-75 The large conformational change in EF-Tu caused by GTP hydrolysis. <GTAA> (A) The three-dimensional structure of EF-Tu with 
GTP bound. The domain at the top has a structure similar to the Ras protein, and its red a helix is the switch helix, which moves after GTP 
hydrolysis. (B) The change in the conformation of the switch helix in domain 1 causes domains 2 and 3 to rotate as a single unit by about 90° 
toward the viewer, which releases the tRNA that was shown bound to this structure in Figure 3-74. (A, adapted from H. Berchtold et al., Nature 
365:1 26-1 32, 1 993. With permission from Macmillan Publishers Ltd. B, courtesy of Mathias Sprinzl and Rolf Hilgenfeld.) 
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in Chapter 16), and to move enzymes along a DNA strand during the synthesis 
of a new DNA molecule (discussed in Chapter 5). All these fundamental pro- 
cesses depend on proteins with moving parts that operate as force-generating 
machines. 

How do these machines work? In other words, how do cells use shape 
changes in proteins to generate directed movements? If, for example, a protein 
is required to walk along a narrow thread such as a DNA molecule, it can do this 
by undergoing a series of conformational changes, such as those shown in Fig- 
ure 3-76. But with nothing to drive these changes in an orderly sequence, they 
are perfectly reversible, and the protein can only wander randomly back and 
forth along the thread. We can look at this situation in another way. Since the 
directional movement of a protein does work, the laws of thermodynamics (dis- 
cussed in Chapter 2) demand that such movement use free energy from some 
other source (otherwise the protein could be used to make a perpetual motion 
machine). Therefore, without an input of energy, the protein molecule can only 
wander aimlessly. 

How can the cell make such a series of conformational changes unidirec- 
tional? To force the entire cycle to proceed in one direction, it is enough to make 
any one of the changes in shape irreversible. Most proteins that are able to walk 
in one direction for long distances achieve this motion by coupling one of the 
conformational changes to the hydrolysis of an ATP molecule bound to the pro- 
tein. The mechanism is similar to the one just discussed that drives allosteric 
protein shape changes by (TIP hydrolysis. Because ATP for GTP) hydrolysis 
releases a great deal of free energy it is very unlikely that the nucleotide-binding 
protein will undergo the reverse shape change needed for moving backward — 
since this would require that it also reverse the ATP hydrolysis by adding a phos- 
phate molecule to ADP to form ATP 

In the model shown in Figure 3-77, ATP binding shifts a motor protein from 
conformation 1 to conformation 2. The bound ATP is then hydrolyzed to pro- 
duce ADP and inorganic phosphate (Pi), causing a change from conformation 2 
to conformation 3 + Finally, the release of the bound ADP and P s drives the pro- 
tein back to conformation 1„ Because the energy provided by ATP hydrolysis 
drives the transition 2 -*- 3 t this series of conformational changes is effectively 
irreversible. Thus, the entire cycle goes in only one direction, causing the proieiu 
molecule to walk continuously to the right in this example. 

Many motor proteins generate directional movement in this general way, 
including the muscle motor protein myosin, which walks along actin filaments 
to generate muscle contraction, and the kinesin proteins that walk along micro- 
tubules (both discussed in Chapter 16). These movements can be rapid: some of 
the motor proteins involved in DNA replication (the DNA helicases) propel 
themselves along a DNA strand at rates as high as 1000 nucleotides per second. 

Membrane-Bound Transporters Harness Energy to Pump 
Molecules Through Membranes 

We have thus far seen how allosteric proteins can act as microchips (Cdk and Src 
kinases), as assembly factors (EF-Tu), and as generators of mechanical force and 
motion (motor proteins). Allosteric proteins can also harness energy derived 
from ATP hydrolysis, ion gradients, or electron transport processes to pump spe- 
cific ions or small molecules across a membrane. We consider one example here; 
others will be discussed in Chapter 11. 

The ABC transporters constitute an important class of membrane-bound 
pump proteins. In humans at least 48 different genes encode them. These trans- 
porters mostly function to export hydrophobic molecules from the cytoplasm, 

Figure 3-77 An allosteric motor protein. The transition between three 
different conformations includes a step driven by the hydrolysis of a bound 
ATP molecule, and this makes the entire cycle essentially irreversible. By 
repeated cycles, the protein therefore moves continuously to the right 
along the thread. 




Figure 3-76 An allosteric "walking" 
protein. Although its three different 
conformations allow it to wander 
randomly back and forth while bound to 
a thread or a filament, the protein cannot 
move uniformly in a single direction. 
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Figure 3-78 The ABC (ATP-binding 
cassette) transporter, a protein machine 
that pumps large hydrophobic molecules 
through a membrane. (A) The bacterial 
BtuCD protein, which imports vitamin B12 
into £ coli using the energy of ATP 
hydrolysis. The binding of two molecules of 
lipid ATP clamps together the two ATP-binding 

bilayer subunits. The structure is shown in its ADP- 

bound state, where the channel to the 
extracellular space can be seen to be open 
but the gate to the cytosol remains closed. 
(B) Schematic illustration of substrate 
pumping by ABC transporters. In bacteria, 
the binding of a substrate molecule to the 
extracellular face of the protein complex 
triggers ATP hydrolysis followed by ADP 
release, which opens the cytoplasmic gate; 
the pump is then reset for another cycle. In 
eucaryotes, an opposite process occurs, 
causing substrate molecules to be pumped 
out of the cell. (A, adapted from K.P. Locher, 
Curr. Opin. Struct Biol. 14:426-441, 2004. 
With permission from Elsevier.) 
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serving to remove toxic molecules at the mucosal surface of the intestinal tract, 
for example, or at the blood-brain barrier. The study of ABC transporters is of 
intense interest in clinical medicine, because the overproduction of proteins in 
this class contributes to the resistance of tumor cells to chemotherapeutic 
drugs. And in bacteria, the same type of proteins primarily function to import 
essential nutrients into the cell. 

The ABC transporter is a tetramer, with a pair of membrane-spanning sub- 
units linked to a pair of ATP binding subunits located just below the plasma 
membrane (Figure 3-78A). As in other examples we have discussed, the hydrol- 
ysis of the bound ATP molecules drives conformational changes in the protein, 
transmitting forces that cause the membrane-spanning subunits to move their 
bound molecules across the lipid bilayer (Figure 3-78B). 

Humans have invented many different types of mechanical pumps, and it 
should not be surprising that cells also contain membrane-bound pumps that 
function in other ways. Among the most notable are the rotary pumps that 
couple the hydrolysis of ATP to the transport of H + ions (protons). These pumps 
resemble miniature turbines, and they are used to acidify the interior of lyso- 
somes and other eucaryotic organelles. Like other ion pumps that create ion gra- 
dients, they can function in reverse to catalyze the reaction ADP + Pi -> ATR if the 
gradient across their membrane of the ion that they transport is steep enough. 

One such pump, the ATP synthase, harnesses a gradient of proton concen- 
tration produced by electron transport processes to produce most of the ATP 
used in the living world. This ubiquitous pump has a central role in energy con- 
version, and we shall discuss its three-dimensional structure and mechanism in 
Chapter 14. 
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Proteins Often Form Large Complexes That Function as Protein 
Machines <actt> <atcg> 

Large proteins formed from many domains are able to perform more elaborate 
functions than small, single-domain proteins. But large protein assemblies 
formed from many protein molecules perform the most impressive tasks. Now 
that it is possible to reconstruct most biological processes in cell-free systems in 
the laboratory it is clear that each of the central processes in a cell— such as 
DNA replication, protein synthesis, vesicle budding, or transmembrane signal- 
ing—is catalyzed by a highly coordinated, linked set of 10 or more proteins. In 
most such protein machines, an energetically favorable reaction such as the 
hydrolysis of bound nucleoside triphosphates (ATP or GTP) drives an ordered 
series of conformational changes in one or more of the individual protein sub- 
units, enabling the ensemble of proteins to move coordinately. In this way each 
enzyme can be moved directly into position, as the machine catalyzes succes- 
sive reactions in a series. This is what occurs, for example, in protein synthesis 
on a ribosome (discussed in Chapter 6)— or in DNA replication, where a large 
multiprotein complex moves rapidly along the DNA (discussed in Chapter 5). 

Cells have evolved protein machines for the same reason that humans have 
invented mechanical and electronic machines. For accomplishing almost any 
task, manipulations that are spatially and temporally coordinated through 
linked processes are much more efficient than the use of individual tools. 



Protein Machines with Interchangeable Parts Make Efficient Use 
of Genetic Information 

To probe more deeply into the nature of protein machines, we shall consider a 
relatively simple one: the SCF ubiquitin ligase. This protein complex binds dif- 
ferent "target proteins" at different times in the cell cycle, and it covalently adds 
multiubiquitin polypeptide chains to these proteins. Its C-shaped structure is 
formed from five protein subunits, the largest of which is a molecule that serves 
as a scaffold protein on which the rest of the structure is built. The structure 
underlies a remarkable mechanism (Figure 3-79). At one end of the C is an E2 
ubiquitin-conjugating enzyme. At the other end is a substrate-binding arm, a 
subunit known as an F-box protein. These two subunits are separated by a gap of 
about 5 nm. When this protein complex is activated, the F-box protein binds to 
a specific site on a target protein, positioning the protein in the gap so that some 
of its lysine side chains contact the ubiquitin-conjugating enzyme. This enzyme 
can then catalyze the repeated addition of a ubiquitin polypeptide to these 
lysines (see Figure 3-79C), producing a polyubiquitin chain that marks the tar- 
get protein for rapid destruction in a proteasome (see p. 393). 

In this manner, specific proteins are targeted for rapid destruction in 
response to specific signals, thereby helping to drive die cell cycle (discussed in 
Chapter 1 7). The timing of the destruction often involves creating a specific pat- 
tern of phosphorylation on the target protein that is required for its recognition 
by the F-box subunit. It also requires the activation of an SCF ubiquitin ligase 
that carries the appropriate substrate-binding arm. Many of these arms (the F- 
box subunits) are interchangeable in the protein complex (see Figure 3-79B), 
and there are more than 70 human genes that encode them. 

As emphasized previously, once a successful protein has evolved, its genetic 
information tends to be duplicated to produce a family of related proteins. Thus, 
for example, not only are there many F-box proteins— milking possible the 
recognition of different sets of target proteins— but there is also a family of scaf- 
folds (known as cullins) that give rise to a family of SCF-like ubiquitin ligases. 

The pressure on organisms to minimize the number of genes (see p. 265) pre- 
sumably helps to explain why RNA splicing is so prevalent in higher eucaryotes* 
allowing multiple related proteins to be synthesized from a single gene (dis- 
cussed in Chapter 6). A protein machine like the SCF ubiquitin ligase, with its 
interchangeable parts, likewise makes economical use of the genetic information 
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in cells, inasmuch as new functions can evolve for the entire complex simply by 
producing an alternative version of one of its subunits. 



Figure 3-79 The structure and mode of 
action of a SCF ubiquitin ligase. (A) The 

structure of the five-protein complex that 
includes an E2 ubiquitin ligase. The 
protein denoted here as adapter protein 
1 istheRbx1/Hrt1 protein, adaptor 
protein 2 is the Skp1 protein, and the 
cullin is the Cull protein. (B) Comparison 
of the same complex with two different 
substrate-binding arms, the F-box 
proteins Skp2 (top) and (3-trCP1 (bottom), 
respectively. (C) The binding and 
ubiquitylation of a target protein by the 
SCF ubiquitin ligase. If, as indicated, a 
chain of ubiquitin molecules is added to 
the same lysine of the target protein, that 
protein is marked for rapid destruction by 
the proteasome. (A and B, adapted from 
G. Wu et al., Mol. Ceil 1 1:1445-1456, 2003. 
With permission from Elsevier.) 



The Activation of Protein Machines Often Involves Positioning 
Them at Specific Sites 

As scientists have learned more of the details of cell biology, they have recog- 
nized increasing degrees of sophistication in cell chemistry. Thus, not only do we 
now know that protein machines play a predominant role, but it has recently 
become clear that most of these machines form at specific sites in the cell, being 
activated only where and when they are needed. Using fluorescent, GFP- tagged 
fusion proteins in living cells (see p. 593), cell biologists are able to follow the 
repositioning of individual proteins that occurs in response to specific signals. 
Thus, when certain extracellular signaling molecules bind to receptor proteins 
in the plasma membrane, they often recruit a set of other proteins to the inside 
surface of the plasma membrane to form protein machines that pass the signal 
on. As an example, Figure 3-80A illustrates the rapid movement of a protein 
kinase C (PKC) enzyme to a complex in the plasma membrane, where it associ- 
ates with specific substrate proteins that it phosphorylates. 

There are more than 10 distinct PKC enzymes in human cells, which differ 
both in their mode of regulation and in their functions. When activated, these 
enzymes move from the cytoplasm to different intracellular locations, forming 
specific complexes with other proteins that allow them to phosphorylate differ- 
ent protein substrates (Figure 3-80B). The SCF ubiquitin ligases can also move 
to specific sites of function at appropriate times. As will be explained when we 
discuss cell signaling in Chapter 15, the mechanisms frequently involve protein 
phosphorylation, as well as scaffold proteins that link together a set of activat- 
ing, inhibiting, adaptor, and substrate proteins at a specific location in a cell. 

This general phenomenon is known as induced proximity, and it explains 
the otherwise puzzling observation that slightly different forms of enzymes 
with the same catalytic site will often have very different biological functions. 
Cells change the locations of their proteins by covalently modifying them in a 
variety of different ways, as part of a "regulatory code" to be described next. 
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These modifications create sites on proteins that bind them to particular scaf- 
fold proteins, thereby clustering the proteins required for particular reactions in 
specific regions of the cell. Most biological reactions are catalyzed by sets of 5 or 
more proteins, and such a clustering of proteins is often required for the reac- 
tion to occur. Scaffolds thereby allow cells to compartmentalize reactions even 
in the absence of membranes. Although only recently recognized as a 
widespread phenomenon, this type of clustering is particularly obvious in the 
cell nucleus (see Figure 4-69). 

Many scaffolds appear to be quite different from the cullin illustrated previ- 
ously in Figure 3-79: rather than holding their bound proteins in precise posi- 
tions relative to each other, the interacting proteins are linked by unstructured 
regions of polypeptide chain. This tethers the proteins together, causing them to 
collide frequently with each other in random orientations— some of which will 
lead to a productive reaction (Figure 3-80C). In essence, this mechanism greatly 
speeds reactions by creating a very high local concentration of the reacting 
species. For this reason, the use of scaffold proteins represents an especially ver- 
satile way of controlling cell chemistry (see also Figure 15-61). 



Figure 3-80 The assembly of protein 
machines at specific sites in a cell. 

(A) In response to a signal (here a phorbol 
ester), the gamma subspecies of protein 
kinase C moves rapidly from the cytosol 
to the plasma membrane. The protein 
kinase is fluorescent in these living cells 
because an engineered gene inside the 
cell encodes a fusion protein that links 
the kinase to green fluorescent protein 
(GFP). (B) The specific association of a 
different subspecies of protein kinase C 
(aPKC) with the apical tip of a 
differentiating neuroblast in an early 
Drosophilo embryo. The kinase is stained 
red, and the cell nucleus green. 
(C) Diagram illustrating how a simple 
proximity created by scaffold proteins 
can greatly speed reactions in a cell. In 
this example, long unstructured regions 
of polypeptide chain in a large scaffold 
protein connect a series of structured 
domains that bind a set of reacting 
proteins. The unstructured regions serve 
as flexible "tethers" that greatly speed 
reaction rates by causing a rapid, random 
collision of all of the proteins that are 
bound to the scaffold. (For a simple 
example of tethering, see Figure 1 6-38.) 
(A, from N. Sakai et al, J. Cell Biol 
139:1465-1476, 1997. With permission 
from The Rockefeller University Press. 
B, courtesy of Andreas Wodarz, Institute 
of Genetics, University of Dusseldorf, 
Germany.) 



Many Proteins Are Controlled by Multisite Covalent Modification 

We have thus far described only one type of posttranslational modification of 
proteins— that in which a phosphate is covalently attached to an amino acid 
side chain (see Figure 3-64). But a large number of other such modifications also 
occur, more than 200 distinct types being known. To give a sense of the variety, 
Table 3-3 presents a subset of modifying groups with known regulatory roles. As 

Table 3-3 Some Molecules Covalently Attached to Proteins Regulate Protein Function 



MODIFYING GROUP 

Phosphate on Ser,Thr, orTyr 
Methyl on Lys 



Acetyl on Lys 
Palmityl group on Cys 



SOME PROMINENT FUNCTIONS 

Drives the assembly of a protein into larger complexes (see Figure 15-19). 
Helps to creates histone code in chromatin through forming either mono-, 
di-, or tri-methyl lysine (see Figure 4-38). 



Helps to creates histone code in chromatin (see Figure 4-38). 
This fatty acid addition drives protein association with membranes (see 
Figure 10-20). 

Controls enzyme activity and gene expression in glucose homeostasis. 

Monoubiquitin addition regulates the transport of membrane proteins in vesicles 
(see Figure 13-58). 

A polyubiquitin chain targets a protein for degradation (see Figure 3-79). 

Ubiquitin is a 76 amino acid polypeptide; there are at least 10 other ubiquitin-related proteins, such as SUMO, that modify proteins in similar ways 



/V-acetylglucosamine on Ser orThr 
Ubiquitin on Lys 
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in phosphate addition, these groups are added and then removed from proteins 
according to the needs of the cell. 

A large number of proteins are now known to be modified on more than one 
amino acid side chain, with different regulatory events producing a different pat- 
tern of such modifications. A striking example is the protein p53, which plays a 
central part in controlling a cell's response to adverse circumstances (seep. 1105). 
Through one of four different types of molecular additions, this protein can be 
modified at 20 different sites (Figure 3-81A). Because an enormous number of 
different combinations of these 20 modifications are possible, the protein's 
behavior can in principle be altered in a huge number of ways. Moreover, the 
pattern of modifications on a protein can determine its susceptibility to further 
modification, as illustrated by histone H3 in Figure 3-8 IB. 

Cell biologists have only recently come to recognize that each protein's set of 
covalent modifications constitutes an important combinatorial regulatory code. 
As specific modifying groups are added to or removed from a protein, this code 
causes a different set of protein behaviors — changing the activity or stability of 
the protein, its binding partners, and its specific location within the cell (Figure 
3-8 1C). This helps the cell respond rapidly and with great versatility to changes 
in its condition or environment. 



Figure 3-81 Multisite protein 
modification and its effects. A protein 
that carries a post-translational addition 
to more than one of its amino acid side 
chains can be considered to carry a 
combinatorial regulatory code. (A) The 
pattern of known covalent modifications 
to the protein p53; ubiquitin and SUMO 
are related polypeptides (see Table 3-3). 
(B) The possible modifications on the first 
20 amino acids at the N-terminus of 
histone H3, showing not only their 
locations but also their activating (blue) 
and inhibiting (red) effects on the 
addition of neighboring covalent 
modifications. In addition to the effects 
shown, the acetylation and methylation 
of a lysine are mutually exclusive 
reactions (see Figure 4-38). (C) Diagram 
showing the general manner in which 
multisite modifications are added to (and 
removed from) a protein through 
signaling networks, and how the 
resulting combinatorial regulatory code 
on the protein is read to alter its behavior 
in the cell. 



A Complex Network of Protein Interactions Underlies Cell 
Function 

There are many challenges facing cell biologists in this "post-genome" era when 
complete genome sequences are known. One is the need to dissect and recon- 
struct each one of the thousands of protein machines that exist in an organism 
such as ourselves. To understand these remarkable protein complexes, each 
must be reconstituted from its purified protein parts, so that we can study its 
detailed mode of operation under controlled conditions in a test tube, free from 
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all other cell components. This alone is a massive task. But we now know that 
each of these subcomponents of a cell also interacts with other sets of macro- 
molecules, creating a large network of protein-protein and protein-nucleic acid 
interactions throughout the cell. To understand the cell, therefore, we need to 
analyze most of these other interactions as well. 

We can gain some idea of the complexity of intracellular protein networks 
from a particularly well-studied example described in Chapter 16: the many 
dozens of proteins that interact with the actin cytoskeleton in the yeast Saccha- 
romyces cerevisiae (see Figure 16-18). The extent of such protein-protein inter- 
actions can also be estimated more generally. An enormous amount of valuable 
information is now freely available in protein databases on the Internet: tens of 
thousands of three-dimensional protein structures plus tens of millions of pro- 
tein sequences derived from the nucleotide sequences of genes. Scientists have 
been developing new methods for mining this great resource to increase our 
understanding of cells. In particular, computer-based bioinformatics tools are 
being combined with robotics and microarray technologies (see p. 574) to allow 
thousands of proteins to be investigated in a single set of experiments. Pro- 
teomics is a term that is often used to describe such research focused on the 
large-scale analysis of proteins, analogous to the term genomics describing the 
large-scale analysis of DNA sequences and genes. 

Biologists use two different large-scale methods to map the direct binding 
interactions between the many different proteins in a cell. The initial method of 
choice was based on genetics: through an ingenious technique known as the 
yeast two-hybrid screen (see Figure 8-24), tens of thousands of interactions 
between thousands of proteins have been mapped in yeast, a nematode, and the 
fruit fly Drosophila. More recently, a biochemical method based on affinity tag- 
ging and mass spectroscopy has gained favor (discussed in Chapter 8), because 
it appears to produce fewer spurious results. The results of these and other anal- 
yses that predict protein binding interactions have been tabulated and orga- 
nized in Internet databases. This allows a cell biologist studying a small set of 
proteins to readily discover which other proteins in the same cell are thought to 
bind to, and thus interact with, that set of proteins. When displayed graphically 
as a protein interaction map, each protein is represented by a box or dot in a two- 
dimensional network, with a straight line connecting those proteins that have 
been found to bind to each other. 

When hundreds or thousands of proteins are displayed on the same map, 
the network diagram becomes bewilderingly complicated, serving to illustrate 
how much more we have to learn before we can claim to really understand the 
cell. Much more useful are small subsections of these maps, centered on a few 
proteins of interest. Thus, Figure 3-82 shows a network of protein-protein inter- 
actions for the five proteins that form the SCF ubiquitin ligase in a yeast cell (see 
Figure 3-79). Four of the subunits of this ligase are located at the bottom right of 
Figure 3-82. The remaining subunit, the F-box protein that serves as its sub- 
strate-binding arm, appears as a set of 15 different gene products that bind to 
adaptor protein 2 (the Skpl protein). Along the top and left of the figure are sets 
of additional protein interactions marked with yellow and green shading: as indi- 
cated, these protein sets function at the origin of DNA replication, in cell cycle 
regulation, in methionine synthesis, in the kinetochore, and in vacuolar H + - 
ATPase assembly We shall use this figure to explain how such protein interaction 
maps are used, and what they do and do not mean. 

1. Protein interaction maps are useful for identifying the likely function of 
previously uncharacterized proteins. Examples are the products of the 
genes that have thus far only been inferred to exist from the yeast genome 
sequence, which are the six proteins in the figure that lack a simple three- 
letter abbreviation (white letters beginning with Y). One, the product of so- 
called open reading frame YDR196C, is located in the origin of replication 
group, and it is therefore likely to have a role in starting new replication 
forks. The remaining five in this diagram are F-box proteins that bind to 
Skpl; these are therefore likely to function as part of the ubiquitin ligase, 
serving as substrate-binding arms that recognize different target proteins. 
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3. 



However, as we discuss next, neither assignment can be considered certain 
without additional data. 

Protein interaction networks need to be interpreted with caution because, 
as a result of evolution making efficient use of each organism's genetic 
information, the same protein can be used as part of two different protein 
complexes that have different types of functions. Thus, although protein A 
binds to protein B and protein B binds to protein C, proteins A and C need 
not function in the same process. For example, we know from detailed bio- 
chemical studies that the functions of Skpl in the kinetochore and in vac- 
uolar H + -ATPase assembly (yellow shading) are separate from its function 
in the SCF ubiquitin ligase. In fact, only the remaining three functions of 
Skpl illustrated in the diagram — methionine synthesis, cell cycle regula- 
tion, and origin of replication (green shading) — involve ubiquitylation. 
In cross-species comparisons, those proteins displaying similar patterns 
of interactions in the two protein interaction maps are likely to have the 
same function in the cell. Thus, as scientists generate more and more 
highly detailed maps for multiple organisms, the results will become 
increasingly useful for inferring protein function. These map comparisons 
are a particularly powerful tool for deciphering the functions of human 
proteins. There is a vast amount of direct information about protein func- 
tion that can be obtained from genetic engineering, mutational, and 
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Figure 3-82 A map of some protein- protein interactions of the SCF ubiquitin ligase and other proteins in the yeast 
S. cerevisiae. The symbols and/or colors used for the 5 proteins of the ligase are those in Figure 3-79. Note that 1 5 different 
F-box proteins are shown {purple); those with white lettering (beginning with Y) are only known from the genome 
sequence as open reading frames. For additional details, see text. (Courtesy of Peter Bowers and David Eisenberg, 
UCLA-DOE Institute for Genomics and Proteomics, UCLA.) 
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Figure 3-83 A network of protein-binding interactions in a yeast cell. 

Each line connecting a pair of dots (proteins) indicates a protein-protein 
interaction. (From A. Guimera and M. Sales-Pardo, Mol. Syst. Biol. 2:42, 2006. 
With permission from Macmillan Publishers Ltd.) 



genetic analyses in model organisms — such as yeast, worms, and flies — 
that is not available in humans. 

The available data suggest that a typical protein in a human cell may inter- 
act with between 5 and 15 different partners. Often, each of the different 
domains in a multidomain protein binds to a different set of partners; in fact, we 
can speculate that the unusually extensive multidomain structures observed for 
human proteins may have evolved to facilitate these interactions. Given the 
enormous complexity of the interacting networks of macromolecules in cells 
(Figure 3-83), deciphering their full functional meaning may well keep scien- 
tists busy for centuries. 
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Summary 

Proteins can form enormously sophisticated chemical devices, whose functions largely 
depend on the detailed chemical properties of their surfaces. Binding sites for ligands 
are formed as surface cavities in which precisely positioned amino acid side chains are 
brought together by protein folding. In this way normally unreactive amino acid side 
chains can be activated to make and break covalent bonds. Enzymes are catalytic pro- 
teins that greatly speed up reaction rates by binding the high-energy transition states 
for a specific reaction path; they also perform acid catalysis and base catalysis simul- 
taneously. The rates of enzyme reactions are often so fast that they are limited only by 
diffusion; rates can be further increased if enzymes that act sequentially on a substrate 
are joined into a single multienzyme complex, or if the enzymes and their substrates 
are confined to the same compartment of the cell 

Proteins reversibly change their shape when ligands bind to their surface. The 
allosteric changes in protein conformation produced by one ligand affect the binding 
of a second ligand, and this linkage between two ligand-binding sites provides a cru- 
cial mechanism for regulating cell processes. Metabolic pathways, for example, are 
controlled by feedback regulation: some small molecules inhibit and other small 
molecules activate enzymes early in a pathway. Enzymes controlled in this way gener- 
ally form symmetric assemblies, allowing cooperative conformational changes to cre- 
ate a steep response to changes in the concentrations of the ligands that regulate them. 

The expenditure of chemical energy can drive unidirectional changes in protein 
shape. By coupling allosteric shape changes to ATP hydrolysis, for example, proteins 
can do useful work, such as generating a mechanical force or moving for long dis- 
tances in a single direction. The three-dimensional structures of proteins, determined 
by x-ray crystallography, have revealed how a small local change caused by nucleo- 
side triphosphate hydrolysis is amplified to create major changes elsewhere in the 
protein. By such means, these proteins can serve as input-output devices that trans- 
mit information, as assembly factors, as motors, or as membrane-bound pumps. 
Highly efficient protein machines are formed by incorporating many different protein 
molecules into larger assemblies that coordinate the allosteric movements of the indi- 
vidual components. Such machines are now known to perform many of the most 
important reactions in cells. 

Proteins are subjected to many reversible post-translational modifications, such 
as the covalent addition of a phosphate or an acetyl group to a specific amino acid side 
chain. The addition of these modifying groups is used to regulate the activity of a pro- 
tein, changing its conformation, its binding to other proteins and its location inside 
the cell A typical protein in a cell will interact with more than five different partners. 
Using the new technologies ofproteomics, biologists can analyze thousands of proteins 
in one set of experiments. One important result is the production of detailed protein 
interaction maps, which aim at describing all of the binding interactions between the 
thousands of distinct proteins in a cell. 
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PROBLEMS 

Which statements are true? Explain why or why not. 

3-1 Each strand in a P sheet is a helix with two amino 
acids per turn. 

3-2 Loops of polypeptide that protrude from the surface 
of a protein often form the binding sites for other molecules. 

3-3 An enzyme reaches a maximum rate at high substrate 
concentration because it has a fixed number of active sites 
where substrate binds. 

3-4 Higher concentrations of enzyme give rise to a higher 
turnover number. 

3-5 Enzymes such as aspartate transcarbamoylase that 
undergo cooperative allosteric transitions invariably con- 
tain multiple identical subunits. 

3-6 Continual addition and removal of phosphates by 
protein kinases and protein phosphatases is wasteful of 
energy — since their combined action consumes ATP — but it 
is a necessary consequence of effective regulation by phos- 
phorylation. 

Discuss the following problems. 

3-7 Consider the following statement. "To produce one 
molecule of each possible kind of polypeptide chain, 300 
amino acids in length, would require more atoms than exist in 
the universe." Given the size of the universe, do you suppose 
this statement could possibly be correct? Since counting 
atoms is a tricky business, consider the problem from the 
standpoint of mass. The mass of the observable universe is 
estimated to be about 10 80 grams, give or take an order of 
magnitude or so. Assuming that the average mass of an amino 
acid is 1 10 daltons, what would be the mass of one molecule 
of each possible kind of polypeptide chain 300 amino acids in 
length? Is this greater than the mass of the universe? 

3-8 A common strategy for identifying distantly related 
proteins is to search the database using a short signature 
sequence indicative of the particular protein function. Why 
is it better to search with a short sequence than with a long 
sequence? Do you not have more chances for a 'hit' in the 
database with a long sequence? 

3-9 The so-called kelch motif consists of a four-stranded 
P sheet, which forms what is known as a P propeller. It is 
usually found to be repeated four to seven times, forming a 
kelch repeat domain in a multidomain protein. One such 
kelch repeat domain is shown in Figure Q3-1. Would you 
classify this domain as an 'in-line' or 'plug-in' type domain? 

3-10 Titin, which has a molecular weight of 3 x 10 6 dal- 
tons, is the largest polypeptide yet described. Titin 
molecules extend from muscle thick filaments to the Z disc; 
they are thought to act as springs to keep the thick filaments 
centered in the sarcomere. Titin is composed of a large num- 
ber of repeated immunoglobulin (Ig) sequences of 89 amino 
acids, each of which is folded into a domain about 4 nm in 
length (Figure Q3-2A). 

You suspect that the springlike behavior of titin is caused 
by the sequential unfolding (and refolding) of individual Ig 




Figure Q3-1 The kelch 

repeat domain of 
galactose oxidase from 
D. dendroides (Problem 
3-9). The seven 
individual P propellers 
are indicated. The N- and 
C-termini are indicated 
by N and C. 



domains. You test this hypothesis using the atomic force 
microscope, which allows you to pick up one end of a pro- 
tein molecule and pull with an accurately measured force. 
For a fragment of titin containing seven repeats of the Ig 
domain, this experiment gives the sawtooth force-versus- 
extension curve shown in Figure Q3-2B. When the experi- 
ment is repeated in a solution of 8 M urea (a protein denat- 
urant), the peaks disappear and the measured extension 
becomes much longer for a given force. If the experiment is 
repeated after the protein has been cross-linked by treat- 
ment with glutaraldehyde, once again the peaks disappear 
but the extension becomes much smaller for a given force. 

A. Are the data consistent with your hypothesis that titin's 
springlike behavior is due to the sequential unfolding of 
individual Ig domains? Explain your reasoning. 

B. Is the extension for each putative domain-unfolding 
event the magnitude you would expect? (In an extended 
polypeptide chain, amino acids are spaced at intervals of 
0.34 nm.) 

C. Why is each successive peak in Figure Q3-2B a little 
higher than the one before? 

D. Why does the force collapse so abruptly after each peak? 

3-1 1 It is often said that protein complexes are made from 
subunits (that is, individually synthesized proteins) rather 
than as one long protein because the former is more likely to 
give a correct final structure. 

A. Assuming that the protein synthesis machinery incor- 
porates one incorrect amino acid for each 10,000 it inserts, 
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Figure Q3-2 Springlike behavior of titin (Problem 3-10). (A) The 
structure of an individual Ig domain. (B) Force in piconewtons versus 
extension in nanometers obtained by atomic force microscopy. 
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calculate the fraction of bacterial ribosomes that would be 
assembled correctly if the proteins were synthesized as one 
large protein versus built from individual proteins? For the 
sake of calculation assume that the ribosome is composed 
of 50 proteins, each 200 amino acids in length, and that the 
subunits — correct and incorrect — are assembled with equal 
likelihood into the complete ribosome. [The probability that 
a polypeptide will be made correctly, Pq, equals the fraction 
correct for each operation, f C) raised to a power equal to the 
number of operations, n: Pq = ifc) n - For an error rate of 
1/10,000, /c = 0.9999.] 

B. Is the assumption that correct and incorrect subunits 
assemble equally well likely to be true? Why or why not? 
How would a change in that assumption affect the calcula- 
tion in part A? 

3- 1 2 Rous sarcoma virus (RSV) carries an oncogene called 
Src, which encodes a continuously active protein tyrosine 
kinase that leads to unchecked cell proliferation. Normally, 
Src carries an attached fatty acid (myristoylate) group that 
allows it to bind to the cytoplasmic side of the plasma mem- 
brane. A mutant version of Src that does not allow attach- 
ment of myristoylate does not bind to the membrane. Infec- 
tion of cells with RSV encoding either the normal or the 
mutant form of Src leads to the same high level of protein 
tyrosine kinase activity, but the mutant Src does not cause 
cell proliferation. 

A. Assuming that the normal Src is all bound to the plasma 
membrane and that the mutant Src is distributed through- 
out the cytoplasm, calculate their relative concentrations in 
the neighborhood of the plasma membrane. For the pur- 
poses of this calculation, assume that the cell is a sphere 
with a radius of 10 urn and that the mutant Src is distributed 
throughout, whereas the normal Src is confined to a 4-nm- 
thick layer immediately beneath the membrane. [For this 
problem, assume that the membrane has no thickness. The 
volume of a sphere is (4/3)7ir 3 .] 

B, The target (X) for phosphorylation by Src resides in the 
membrane. Explain why the mutant Src does not cause cell 
proliferation. 

3-13 An antibody binds to another protein with an equilib- 
rium constant, K, of 5 x 10 9 M _1 . When it binds to a second, 
related protein, it forms three fewer hydrogen bonds, reduc- 
ing its binding affinity by 2.8 kcal/mole. What is the Kfor its 
binding to the second protein? (Free-energy change is related 
to the equilibrium constant by the equation AG° = -2.3 RTlog 
K, where R is 1.98 x 1(T 3 kcal/(mole K) and Tis 310 K.) 

3-14 The protein SmpB binds to a special species of tRNA, 
tmRNA, to eliminate the incomplete proteins made from 
truncated mRNAs in bacteria. If the binding of SmpB to 
tmRNA is plotted as fraction tmRNA bound versus SmpB 
concentration, one obtains a symmetrical S-shaped curve as 
shown in Figure Q3-3. This curve is a visual display of a very 
useful relationship between K^ and concentration, which 
has broad applicability. The general expression for fraction 
of ligand bound is derived from the equation for K& {Ka = 
[Pr][L]/[Pr-L]) by substituting ([L] T ot- [L]) for [Pr-L] and re- 
arranging. Because the total concentration of ligand ([L] T ot) 
is equal to the free ligand ([L]) plus bound ligand ([Pr-L]), 

fraction bound = [L]/[L] TOT = [Pr]/([Pr] + Kd) 

For SmpB and tmRNA, the fraction bound = [tmRNA]/ 



[tmRNAlxoT = [SmpB] /([SmpB] + K$. Using this relation- 
ship, calculate the fraction of tmRNA bound for SmpB con- 
centrations equal to lO 4 ^, 10 3 Xd, 10 2 Kd, lO 1 ^, K& lO" 1 ^, 
10- 2 Ai, 10- 3 Kd, and lCT 4 ^. 
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Figure Q3-3 Fraction of tmRNA bound versus SmpB concentration 

(Problem 3-14). 



3-15 Many enzymes obey simple Michaelis-Menten 
kinetics, which are summarized by the equation 

rate=y max [S]/([S]+X m ) 

where V max = maximum velocity, [S] = concentration of sub- 
strate, and K m = the Michaelis constant. 

It is instructive to plug a few values of [S] into the equation 
to see how rate is affected. What are the rates for [S] equal to 
zero, equal to K m , and equal to infinite concentration? 

3-16 The enzyme hexokinase adds a phosphate to D-glu- 
cose but ignores its mirror image, L-glucose. Suppose that 
you were able to synthesize hexokinase entirely from D- 
amino acids, which are the mirror image of the normal L- 
amino acids. 

A. Assuming that the 'D' enzyme would fold to a stable 
conformation, what relationship would you expect it to bear 
to the normal T enzyme? 

B. Do you suppose the 'D' enzyme would add a phosphate 
to L-glucose, and ignore D-glucose? 

3-17 How do you suppose that a molecule of hemoglobin 
is able to bind oxygen efficiently in the lungs, and yet release 
it efficiently in the tissues? 

3-18 Synthesis of the purine nucleotides AMP and GMP 
proceeds by a branched pathway starting with ribose 5- 
phosphate (R5P), as shown schematically in Figure Q3-4. 
Using the principles of feedback inhibition, propose a regu- 
latory strategy for this pathway that ensures an adequate 
supply of both AMP and GMP and minimizes the buildup of 
the intermediates {A-I) when supplies of AMP and GMP are 
adequate. 
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Figure Q3-4 Schematic diagram of the metabolic pathway for 
synthesis of AMP and GMP from R5P (Problem 3-18). 
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Life depends on the ability of cells to store, retrieve, and translate the genetic 
instructions required to make and maintain a living organism. This hereditary 
information is passed on from a cell to its daughter cells at cell division, and 
from one generation of an organism to the next through the organism's repro- 
ductive cells. These instructions are stored within every living cell as its genes, 
the information- containing elements that determine the characteristics of a 
species as a whole and of the individuals within it. 

As soon as genetics emerged as a science at the beginning of the twentieth 
century, scientists became intrigued by the chemical structure of genes. The 
information in genes is copied and transmitted from cell to daughter cell mil- 
lions of times during the life of a multicellular organism, and it survives the pro- 
cess essentially unchanged. What form of molecule could be capable of such 
accurate and almost unlimited replication and also be able to direct the devel- 
opment of an organism and the daily life of a cell? What kind of instructions does 
the genetic information contain? How can the enormous amount of information 
required for the development and maintenance of an organism fit within the 
tiny space of a cell? 

The answers to several of these questions began to emerge in the 1940s. At 
this time, researchers discovered, from studies in simple fungi, that genetic 
information consists primarily of instructions for making proteins. Proteins are 
the macromolecules that perform most cell functions: they serve as building 
blocks for cell structures and form the enzymes that catalyze the cell's chemical 
reactions (Chapter 3), they regulate gene expression (Chapter 7), and they 
enable cells to communicate with each other (Chapter 15) and to move (Chap- 
ter 16). The properties and functions of a cell are determined largely by the pro- 
teins that it is able to make. With hindsight, it is hard to imagine what other type 
of instructions the genetic information could have contained. 

Painstaking observations of cells and embryos in the late 19th century had 
led to the recognition that the hereditary information is carried on chromo- 
somes, threadlike structures in the nucleus of a eucaryotic cell that become vis- 
ible by light microscopy as the cell begins to divide (Figure 4-1). Later, as bio- 
chemical analysis became possible, chromosomes were found to consist of both 
deoxyribonucleic acid (DNA) and protein. For many decades, the DNA was 
thought to be merely a structural element. However, the other crucial advance 
made in the 1940s was the identification of DNA as the likely carrier of genetic 
information. This breakthrough in our understanding of cells came from studies 



Figure 4-1 Chromosomes in cells. (A) Two adjacent plant cells 
photographed through a light microscope. The DNA has been stained with 
a fluorescent dye (DAPI) that binds to it. The DNA is present in 
chromosomes, which become visible as distinct structures in the light 
microscope only when they become compact, sausage-shaped structures 
in preparation for cell division, as shown on the left. The cell on the right, 
which is not dividing, contains identical chromosomes, but they cannot be 
clearly distinguished in the light microscope at this phase in the cell's life 
cycle, because they are in a more extended conformation. (B) Schematic 
diagram of the outlines of the two cells along with their chromosomes. 
(A, courtesy of Peter Shaw.) 
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of inheritance in bacteria (Figure 4-2). But as the 1950s began, both how pro- 
teins could be specified by instructions in the DNA and how this information 
might be copied for transmission from cell to cell seemed completely mysteri- 
ous. The mystery was suddenly solved in 1953, when the structure of DNA was 
correctly predicted by James Watson and Francis Crick. As outlined in Chapter 1 , 
the double-helical structure of DNA immediately solved the problem of how the 
information in this molecule might be copied, or replicated. It also provided the 
first clues as to how a molecule of DNA might use the sequence of its subunits to 
encode the instructions for making proteins. Today the fact that DNA is the 
genetic material is so fundamental to biological thought that it is difficult to 
appreciate the enormous intellectual gap that was filled. 

In this chapter we begin by describing the structure of DNA. We see how, 
despite its chemical simplicity, the structure and chemical properties of DNA 
make it ideally suited as the raw material of genes. We then consider how the 
many proteins in chromosomes arrange and package this DNA. The packing has 
to be done in an orderly fashion so that the chromosomes can be replicated and 
apportioned correctly between the two daughter cells at each cell division. It 
must also allow access to chromosomal DNA for the enzymes that repair it when 
it is damaged and for the specialized proteins that direct the expression of its 
many genes. We shall also see how the packaging of DNA differs along the length 
of each chromosome in eucaryotes, and how it can store a valuable record of the 
cell's developmental history 

In the past two decades, there has been a revolution in our ability to deter- 
mine the exact sequence of subunits in DNA molecules. As a result, we now 
know the order of the 3 billion DNA subunits that provide the information for 
producing a human adult from a fertilized egg, as well as the DNA sequences of 
thousands of other organisms. Detailed analyses of these sequences have pro- 
vided exciting insights into the process of evolution, and it is with this subject 
that the chapter ends. 

This is the first of four chapters that deal with basic genetic mechanisms — 
the ways in which the cell maintains, replicates, expresses, and occasionally 
improves the genetic information carried in its DNA. This chapter presents a 
broad overview of DNA and how it is packaged into chromosomes. In the fol- 
lowing chapter (Chapter 5) we discuss the mechanisms by which the cell accu- 
rately replicates and repairs DNA; we also describe how DNA sequences can be 
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Figure 4-2 The first experimental 
demonstration that DNA is the genetic 
material. These experiments, carried out 
in the 1940s, showed that adding purified 
DNA to a bacterium changed its 
properties and that this change was 
faithfully passed on to subsequent 
generations. Two closely related strains of 
the bacterium Streptococcus pneumoniae 
differ from each other in both their 
appearance under the microscope and 
their pathogenicity. One strain appears 
smooth (S) and causes death when 
injected into mice, and the other appears 
rough (R) and is nonlethal. (A) An initial 
experiment shows that a substance 
present in the S strain can change (or 
transform) the R strain into the S strain 
and that this change is inherited by 
subsequent generations of bacteria. 
(B)This experiment, in which the R strain 
has been incubated with various classes 
of biological molecules purified from the 
S strain, identifies the substance as DNA. 
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rearranged through the process of genetic recombination. Gene expression — 
the process through which the information encoded in DNA is interpreted by 
the cell to guide the synthesis of proteins — is the main topic of Chapter 6. In 
Chapter 7, we describe how this gene expression is controlled by the cell to 
ensure that each of the many thousands of proteins and RNA molecules 
encrypted in its DNA are manufactured only at the proper time and place in the 
life of the cell. 



THE STRUCTURE AND FUNCTION OF DNA 

Biologists in the 1940s had difficulty in conceiving how DNA could be the 
genetic material because of the apparent simplicity of its chemistry. DNA was 
known to be a long polymer composed of only four types of subunits, which 
resemble one another chemically. Early in the 1950s, DNA was examined by x- 
ray diffraction analysis, a technique for determining the three-dimensional 
atomic structure of a molecule (discussed in Chapter 8). The early x-ray diffrac- 
tion results indicated that DNA was composed of two strands of the polymer 
wound into a helix. The observation that DNA was double-stranded was of cru- 
cial significance and provided one of the major clues that led to the 
Watson-Crick model for DNA structure. But only when this model was proposed 
in 1953 did DNAs potential for replication and information encoding become 
apparent. In this section we examine the structure of the DNA molecule and 
explain in general terms how it is able to store hereditary information. 



A DNA Molecule Consists of Two Complementary Chains of 
Nucleotides 

A deoxyribonucleic acid (DNA) molecule consists of two long polynucleotide 
chains composed of four types of nucleotide subunits. Each of these chains is 
known as a DNA chain, or a DNA strand. Hydrogen bonds between the base por- 
tions of the nucleotides hold the two chains together (Figure 4-3). As we saw in 
Chapter 2 (Panel 2-6, pp. 116-117), nucleotides are composed of a five-carbon 
sugar to which are attached one or more phosphate groups and a nitrogen-con- 
taining base. In the case of the nucleotides in DNA, the sugar is deoxyribose 
attached to a single phosphate group (hence the name deoxyribonucleic acid), 
and the base maybe either adenine (A), cytosine (C), guanine (G), or thymine (T). 
The nucleotides are covalently linked together in a chain through the sugars and 
phosphates, which thus form a "backbone" of alternating sugar-phosphate- 
sugar-phosphate. Because only the base differs in each of the four types of sub- 
units, each polynucleotide chain in DNA is analogous to a necklace (the back- 
bone) strung with four types of beads (the four bases A, C, G, and T). These same 
symbols (A, C, G, and T) are also commonly used to denote the four different 
nucleotides — that is, the bases with their attached sugar and phosphate groups. 

The way in which the nucleotide subunits are linked together gives a DNA 
strand a chemical polarity. If we think of each sugar as a block with a protruding 
knob (the 5' phosphate) on one side and a hole (the 3' hydroxyl) on the other (see 
Figure 4-3) , each completed chain, formed by interlocking knobs with holes, will 
have all of its subunits lined up in the same orientation. Moreover, the two ends 
of the chain will be easily distinguishable, as one has a hole (the 3' hydroxyl) and 
the other a knob (the 5' phosphate) at its terminus. This polarity in a DNA chain 
is indicated by referring to one end as the 3 end and the other as the 5' end. 

The three-dimensional structure of DNA — the double helix — arises from 
the chemical and structural features of its two polynucleotide chains. Because 
these two chains are held together by hydrogen bonding between the bases on 
the different strands, all the bases are on the inside of the double helix, and the 
sugar-phosphate backbones are on the outside (see Figure 4-3). In each case, 
a bulkier two-ring base (a purine; see Panel 2-6, pp. 116-117) is paired with a 
single-ring base (a pyrimidine); A always pairs with T, and G with C (Figure 
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4-4). This complementary base-pairing enables the base pairs to be packed in 
the energetically most favorable arrangement in the interior of the double helix. 
In this arrangement, each base pair is of similar width, thus holding the sugar- 
phosphate backbones an equal distance apart along the DNA molecule. To max- 
imize the efficiency of base-pair packing, the two sugar-phosphate backbones 



Figure 4-3 DNA and its building blocks. 

<CAGA> DNA is made of four types of 
nucleotides, which are linked covalently 
into a polynucleotide chain (a DNA 
strand) with a sugar-phosphate 
backbone from which the bases (A, C, G, 
and T) extend. A DNA molecule is 
composed of two DNA strands held 
together by hydrogen bonds between 
the paired bases. The arrowheads at the 
ends of the DNA strands indicate the 
polarities of the two strands, which run 
antiparallel to each other in the DNA 
molecule. In the diagram at the bottom 
left of the figure, the DNA molecule is 
shown straightened out; in reality, it is 
twisted into a double helix, as shown on 
the right. For details, see Figure 4-5. 



sugar-phosphate 
backbone 




Figure 4-4 Complementary base pairs in 
the DNA double helix. The shapes and 
chemical structure of the bases allow 
hydrogen bonds to form efficiently only 
between A and T and between G and C, 
where atoms that are able to form hydrogen 
bonds (see Panel 2-3, pp. 110-111) can be 
brought close together without distorting 
the double helix. As indicated, two 
hydrogen bonds form between A and T, 
while three form between G and C.The 
bases can pair in this way only if the two 
polynucleotide chains that contain them 
are antiparallel to each other. 
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wind around each other to form a double helix, with one complete turn every 
ten base pairs (Figure 4-5). 

The members of each base pair can fit together within the double helix only 
if the two strands of the helix are antiparallel — that is, only if the polarity of one 
strand is oriented opposite to that of the other strand (see Figures 4-3 and 4-4). 
A consequence of these base-pairing requirements is that each strand of a DNA 
molecule contains a sequence of nucleotides that is exactly complementary to 
the nucleotide sequence of its partner strand. 



Figure 4-5 The DNA double helix. 

(A) A space-filling model of 1 .5 turns of the 
DNA double helix. Each turn of DNA is 
made up of 1 0.4 nucleotide pairs, and the 
center-to-center distance between 
adjacent nucleotide pairs is 3.4 nm.The 
coiling of the two strands around each 
other creates two grooves in the double 
helix: the wider groove is called the major 
groove, and the smaller the minor groove. 

(B) A short section of the double helix 
viewed from its side, showing four base 
pairs. The nucleotides are linked together 
covalently by phosphodiester bonds that 
join the 3'-hydroxyl (-OH) group of one 
sugar to the S'-hydroxyl group of the next 
sugar. Thus, each polynucleotide strand 
has a chemical polarity; that is, its two 
ends are chemically different. The 5' end of 
the DNA polymer is by convention often 
illustrated carrying a phosphate group, 
while the 3'-end is shown with a hydroxyl. 



The Structure of DNA Provides a Mechanism for Heredity 

Genes carry biological information that must be copied accurately for transmis- 
sion to the next generation each time a cell divides to form two daughter cells. 
Two central biological questions arise from these requirements: how can the 
information for specifying an organism be carried in chemical form, and how is 
it accurately copied? The discovery of the structure of the DNA double helix was 
a landmark in twentieth- century biology because it immediately suggested 
answers to both questions, thereby providing a molecular explanation for the 
problem of heredity We discuss these answers briefly in this section, and we 
shall examine them in much more detail in subsequent chapters. 

DNA encodes information through the order, or sequence, of the 
nucleotides along each strand. Each base — A, C, T, or G — can be considered as a 
letter in a four-letter alphabet that spells out biological messages in the chemi- 
cal structure of the DNA. As we saw in Chapter 1, organisms differ from one 
another because their respective DNA molecules have different nucleotide 
sequences and, consequently, carry different biological messages. But how is the 
nucleotide alphabet used to make messages, and what do they spell out? 

As discussed above, it was known well before the structure of DNA was 
determined that genes contain the instructions for producing proteins. The 
DNA messages must therefore somehow encode proteins (Figure 4-6). This rela- 
tionship immediately makes the problem easier to understand. As discussed in 
Chapter 3, the properties of a protein, which are responsible for its biological 
function, are determined by its three-dimensional structure. This structure is 
determined in turn by the linear sequence of the amino acids of which it is com- 
posed. The linear sequence of nucleotides in a gene must therefore somehow 
spell out the linear sequence of amino acids in a protein. The exact correspon- 
dence between the four-letter nucleotide alphabet of DNA and the twenty-letter 
amino acid alphabet of proteins — the genetic code — is not obvious from the 
DNA structure, and it took over a decade after the discovery of the double helix 
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Figure 4-6 The relationship between 
genetic information carried in DNA and 
proteins (discussed in Chapter 1). 
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before it was worked out. In Chapter 6 we will describe this code in detail in the 
course of elaborating the process, known as gene expression, through which a cell 
converts the nucleotide sequence of a gene first into the nucleotide sequence of 
an RNA molecule, and then into the amino acid sequence of a protein. 

The complete set of information in an organism's DNA is called its genome, 
and it carries the information for all the proteins and RNA molecules that the 
organism will ever synthesize. (The term genome is also used to describe the 
DNA that carries this information.) The amount of information contained in 
genomes is staggering: for example, a typical human diploid cell contains 2 
meters of DNA double helix. Written out in the four-letter nucleotide alphabet, 
the nucleotide sequence of a very small human gene occupies a quarter of a 
page of text (Figure 4-7), while the complete sequence of nucleotides in the 
human genome would fill more than a thousand books the size of this one. In 
addition to other critical information, it carries the instructions for roughly 
24,000 distinct proteins. 

At each cell division, the cell must copy its genome to pass it to both daugh- 
ter cells. The discovery of the structure of DNA also revealed the principle that 
makes this copying possible: because each strand of DNA contains a sequence 
of nucleotides that is exactly complementary to the nucleotide sequence of its 
partner strand, each strand can act as a template, or mold, for the synthesis of a 
new complementary strand. In other words, if we designate the two DNA 
strands as S and S', strand S can serve as a template for making a new strand S', 
while strand S' can serve as a template for making a new strand S (Figure 4-8). 
Thus, the genetic information in DNA can be accurately copied by the beauti- 
fully simple process in which strand S separates from strand S', and each sepa- 
rated strand then serves as a template for the production of a new complemen- 
tary partner strand that is identical to its former partner. 

The ability of each strand of a DNA molecule to act as a template for pro- 
ducing a complementary strand enables a cell to copy, or replicate, its genome 
before passing it on to its descendants. In the next chapter we shall describe the 
elegant machinery the cell uses to perform this enormous task. 



In Eucaryotes, DNA Is Enclosed in a Cell Nucleus 

As described in Chapter 1, nearly all the DNA in a eucaryotic cell is sequestered 
in a nucleus, which in many cells occupies about 10% of the total cell volume. 
This compartment is delimited by a nuclear envelope formed by two concentric 
lipid bilayer membranes (Figure 4-9). These membranes are punctured at inter- 
vals by large nuclear pores, which transport molecules between the nucleus and 
the cytosol. The nuclear envelope is directly connected to the extensive mem- 
branes of the endoplasmic reticulum, which extend out from it into the cyto- 
plasm. And it is mechanically supported by a network of intermediate filaments 
called the nuclear lamina, which forms a thin sheetlike meshwork just beneath 
the inner nuclear membrane (see Figure 4-9B). 

The nuclear envelope allows the many proteins that act on DNA to be con- 
centrated where they are needed in the cell, and, as we see in subsequent 

Figure 4-7 The nucleotide sequence of the human p-globin gene. By 

convention, a nucleotide sequence is written from its 5' end to its 
3' end, and it should be read from left to right in successive lines down the 
page as though it were normal English text. This gene carries the 
information for the amino acid sequence of one of the two types of 
subunits of the hemoglobin molecule, the protein that carries oxygen in 
the blood. A different gene, the cc-globin gene, carries the information for 
the other type of hemoglobin subunit (a hemoglobin molecule has four 
subunits, two of each type). Only one of the two strands of the DNA double 
helix containing the p-globin gene is shown; the other strand has the exact 
complementary sequence. The DNA sequences highlighted in yellow show 
the three regions of the gene that specify the amino acid sequence for the 
(3-gJobin protein. We shall see in Chapter 6 how the cell splices these three 
sequences together at the level of messenger RNA in order to synthesize a 
full-length 3-qlobin protein. 



CCCTGTGGAGCCACACCCTAGGGTTGGCCA 

ATCTACTCCCAGGAGCAGGGAGGGCAGGAG 

CCAGGGCTGGGCATAAAAGTCAGGGCAGAG 

C CATC TAT TGCTTACATTTGCTTCTGACAC 

AACTGTGTTCACTAGCAACTCAAACAGACA 

CCATGGTGCACCTGACTCCTGAGGAGAAGT 

CTGCCGTTACTGCCCTGTGGGGCAAGGTGA 

ACGTGGATGAAGTTGGTGGTGAGGCCCTGG 

GCAGGTTGGTATCAAGGTTACAAGACAGGT 

TTAAGGAGACCAATAGAAACTGGGCATGTG 

GAGACAGAGAAGACTCTTGGGTTTCTGATA 

GGCACTGACTCTCTCTGCCTATTGGTCTAT 

TTTCCCACCCTTAGGCTGCTGGTGGTCTAC 

CCTTGGACCCAGAGGTTCTTTGAGTCCTTT 

GGGGATCTGTCCACTCCTGATGCTGTTATG 

GGCAACCCTAAGGTGAAGGCTCATGGCAAG 

AAAGTGCTCGGTGCCTTTAGTGATGGCCTG 

GCTCACCTGGACAACCTCAAGGGCACCTTT 

GCCACACTGAGTGAGCTGCACTGTGACAAG 

CTGCACGTGGATCCTGAGAACTTCAGGGTG 

AGTCTATGGGACCCTTGATGTTTTCTTTCC 

CCTTCTTTTCTATGGTTAAGTTCATGTCAT 

AGGAAGGGGAGAAGTAACAGGGTACAGTTT 

AGAATGGGAAACAGACGAATGATTGCATCA 

GTGTGGAAGTCTCAGGATCGTTTTAGTTTC 

TTTTATTTGCTGTTCATAACAATTGTTTTC 

TTTTGTTTAATTCTTGCTTTCTTTTTTTTT 

CTTCTCCGCAATTTTTACTATTATACTTAA 

TGCCTTAACATTGTGTATAACAAAAGGAAA 

TATCTCTGAGATACATTAAGTAACTTAAAA 

AAAAACTTTACACAGTCTGCCTAGTACATT 

ACTATTTGGAATATATGTGTGCTTATTTGC 

ATATTCATAATCTCCCTACTTTATTTTCTT 

TTATTTTTAATTGATACATAATCATTATAC 

ATATTTATGGGTTAAAGTGTAATGTTTTAA 

TATGTGTACACATATTGACCAAATCAGGGT 

AATTTTGCATTTGTAATTTTAAAAAATGCT 

TTCTTCTTTTAATATACTTTTTTGTTTATC 

TTATTTCTAATACTTTCCCTAATCTCTTTC 

TTTCAGGGCAATAATGATACAATGTATCAT 

GCCTCTTTGCACCATTCTAAAGAATAACAG 

T GATAAT T TC T GGGTT AAGGC AAT AGC AAT 

ATTTCTGCATATAAATATTTCTGCATATAA 

ATTGTAACTGATGTAAGAGGTTTCATATTG 

CTAATAGCAGCTACAATCCAGCTACCATTC 

TGCTTTTATTTTATGGTTGGGATAAGGCTG 

GATTATTCTGAGTCCAAGCTAGGCCCTTTT 

GCTAATCATGTTCATACCTCTTATCTTCCT 

CCCACAGCTCCTGGGCAACGTGCTGGTCTG 

TGTGCTGGCCCATCACTTTGGCAAAGAATT 

CACCCCACCAGTGCAGGCTGCCTATCAGAA 

AGTGGTGGCTGGTGTGGCTAATGCCCTGGC 

CCACAAGTATCACTAAGCTCGCTTTCTTGC 

TGTCCAATTTCTATTAAAGGTTCCTTTGTT 

CCCTAAGTCCAACTACTAAACTGGGGGATA 

TTATGAAGGGCCTTGAGCATCTGGATTCTG 

CCTAATAAAAAACATTTATTTTCATTGCAA 

TGATGTATTTAAATTATTTCTGAATATTTT 

ACTAAAAAGGGAATGTGGGAGGTCAGTGCA 

TTTAAAACATAAAGAAATGATGAGCTGTTC 

AAACCTTGGGAAAATACACTATATCTTAAA 

CTCCATGAAAGAAGGTGAGGCTGCAACCAG 

CTAATGCACATTGGCAACAGCCCCTGATGC 

CTATGCCTTATTCATCCCTCAGAAAAGGAT 

TCTTGTAGAGGCTTGATTTGCAGGTTAAAG 

TTTTGCTATGCTGTATTTTACATTACTTAT 
TGTTTTAGCTGTCCTCATGAATGTGTTTTC 
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chapters, it also keeps nuclear and cytosolic enzymes separate, a feature that is 
crucial for the proper functioning of eucaryotic cells. Compartmentalization, of 
which the nucleus is an example, is an important principle of biology; it serves 
to establish an environment in which biochemical reactions are facilitated by 
the high concentration of both substrates and the enzymes that act on them. 
Compartmentalization also prevents enzymes needed in one part of the cell 
from interfering with the orderly biochemical pathways in another. 



Summary 

Genetic information is carried in the linear sequence of nucleotides in DNA. Each 
molecule of DNA is a double helix formed from two complementary strands of 
nucleotides held together by hydrogen bonds between G-C and A-T base pairs. Dupli- 
cation of the genetic information occurs by the use of one DNA strand as a template for 
the formation of a complementary strand. The genetic information stored in an organ- 
ism's DNA contains the instructions for all the proteins the organism will ever synthe- 
size and is said to comprise its genome. In eucaryotes, DNA is contained in the cell 
nucleus, a large membrane-bound compartment. 



Figure 4-8 DNA as a template for its 
own duplication. As the nucleotide A 
successfully pairs only with T, and G with 
C, each strand of DNA can act as a 
template to specify the sequence of 
nucleotides in its complementary strand. 
In this way, double-helical DNA can be 
copied precisely, with each parental DNA 
helix producing two identical daughter 
DNA helices. 
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Figure 4-9 A cross-sectional view of a typical cell nucleus. (A) Electron micrograph of a thin section through the nucleus 
of a human fibroblast. (B) Schematic drawing, showing that the nuclear envelope consists of two membranes, the outer 
one being continuous with the endoplasmic reticulum membrane (see also Figure 1 2-8). The space inside the endoplasmic 
reticulum (the ER lumen) is colored yellow; it is continuous with the space between the two nuclear membranes. The lipid 
bilayers of the inner and outer nuclear membranes are connected at each nuclear pore. A sheet-like network of 
intermediate filaments (brown) inside the nucleus provides mechanical support for the nuclear envelope, forming a special 
supporting structure called the nuclear lamina (for details, see Chapter 1 2). The heterochromatin near the lamina contains 
specially condensed regions of DNA that will be discussed later. 
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CHROMOSOMAL DNA AND ITS PACKAGING IN THE 
CHROMATIN FIBER 

The most important function of DNA is to carry genes, the information that 
specifies all the proteins and RNA molecules that make up an organism — 
including information about when, in what types of cells, and in what quantity 
each protein is to be made. The genomes of eucaryotes are divided up into chro- 
mosomes, and in this section we see how genes are typically arranged on each 
chromosome. In addition, we describe the specialized DNA sequences that are 
required for a chromosome to be accurately duplicated and passed on from one 
generation to the next. 

We also confront the serious challenge of DNA packaging. If the double 
helices comprising all 46 chromosomes in a human cell could be laid end-to- 
end, they would reach approximately 2 meters; yet the nucleus, which contains 
the DNA, is only about 6 jim in diameter. This is geometrically equivalent to 
packing 40 km (24 miles) of extremely fine thread into a tennis ball! The complex 
task of packaging DNA is accomplished by specialized proteins that bind to and 
fold the DNA, generating a series of coils and loops that provide increasingly 
higher levels of organization, preventing the DNA from becoming an unmanage- 
able tangle. Amazingly, although the DNA is very tightly folded, it is compacted in 
a way that keeps it available to the many enzymes in the cell that replicate it, 
repair it, and use its genes to produce RNA molecules and proteins. 

Eucaryotic DNA Is Packaged into a Set of Chromosomes 

In eucaryotes, the DNA in the nucleus is divided between a set of different chro- 
mosomes. For example, the human genome — approximately 3.2 x 10 9 
nucleotides — is distributed over 24 different chromosomes. Each chromosome 
consists of a single, enormously long linear DNA molecule associated with pro- 
teins that fold and pack the fine DNA thread into a more compact structure. The 
complex of DNA and protein is called chromatin (from the Greek chroma, 
"color," because of its staining properties). In addition to the proteins involved 
in packaging the DNA, chromosomes are also associated with many proteins 
and RNA molecules required for the processes of gene expression, DNA replica- 
tion, and DNA repair. 

Bacteria carry their genes on a single DNA molecule, which is often circular 
(see Figure 1-29). This DNA is associated with proteins that package and con- 
dense the DNA, but they are different from the proteins that perform these func- 
tions in eucaryotes. Although often called the bacterial "chromosome," it does 
not have the same structure as eucaryotic chromosomes, and less is known 
about how the bacterial DNA is packaged. Therefore, our discussion of chromo- 
some structure will focus almost entirely on eucaryotic chromosomes. 

With the exception of the germ cells (discussed in Chapter 21) and a few 
highly specialized cell types that cannot multiply and lack DNA altogether (for 
example, red blood cells), each human cell contains two copies of each chromo- 
some, one inherited from the mother and one from the father. The maternal and 
paternal chromosomes of a pair are called homologous chromosomes 
(homologs). The only nonhomologous chromosome pairs are the sex chromo- 
somes in males, where a Y chromosome is inherited from the father and an X 
chromosome from the mother. Thus, each human cell contains a total of 46 chro- 
mosomes— 22 pairs common to both males and females, plus two so-called sex 
chromosomes (X and Y in males, two Xs in females) . DNA hybridization is a tech- 
nique in which a labeled nucleic acid strand serves as a "probe" that localizes a 
complementary strand, as will be described in detail in Chapter 8. This tech- 
nique can be used to distinguish these human chromosomes by "painting" each 
one a different color (Figure 4-10). Chromosome painting is typically done at 
the stage in the cell cycle called mitosis, when chromosomes are especially com- 
pacted and easy to visualize (see below). 

Another more traditional way to distinguish one chromosome from another 
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along each mitotic chromosome (Figure 4-11). The structural bases for these 
banding patterns are not well understood. Nevertheless, the pattern of bands on 
each type of chromosome is unique, and it is these patterns that initially allowed 
each human chromosome to be identified and numbered. 

The display of the 46 human chromosomes at mitosis is called the human 
karyotype. If parts of chromosomes are lost or are switched between chromo- 
somes, these changes can be detected by changes in the banding patterns or by 
changes in the pattern of chromosome painting (Figure 4-12). Cyto geneticists 
use these alterations to detect chromosome abnormalities that are associated 
with inherited defects, as well as to characterize cancers that are associated with 
specific chromosome rearrangements in somatic cells (discussed in Chapter 20). 
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Figure 4-10 The complete set of human 
chromosomes. These chromosomes, from 
a male, were isolated from a cell 
undergoing nuclear division (mitosis) and 
are therefore highly compacted. Each 
chromosome has been "painted" a 
different color to permit its unambiguous 
identification under the light microscope. 
Chromosome painting is performed by 
exposing the chromosomes to a collection 
of human DNA molecules that have been 
coupled to a combination of fluorescent 
dyes. For example, DNA molecules derived 
from chromosome 1 are labeled with one 
specific dye combination, those from 
chromosome 2 with another, and so on. 
Because the labeled DNA can form base 
pairs, or hybridize, only to the 
chromosome from which it was derived 
(discussed in Chapter 8), each 
chromosome is differently labeled. For 
such experiments, the chromosomes are 
subjected to treatments that separate the 
double-helical DNA into individual strands, 
designed to permit base-pairing with the 
single-stranded labeled DNA while 
keeping the chromosome structure 
relatively intact. (A) The chromosomes 
visualized as they originally spilled from 
the lysed cell. (B) The same chromosomes 
artificially lined up in their numerical order. 
This arrangement of the full chromosome 
set is called a karyotype. (From E. Schrock 
et al., Science 273:494-497, 1 996. With 
permission from AAAS.) 



Figure 4-1 1 The banding patterns of 
human chromosomes. Chromosomes 
1-22 are numbered in approximate order 
of size. A typical human somatic (non- 
germ-line) cell contains two of each of 
these chromosomes, plus two sex 
chromosomes — two X chromosomes in a 
female, one X and one Y chromosome in a 
male. The chromosomes used to make 
these maps were stained at an early stage 
in mitosis, when the chromosomes are 
incompletely compacted. The horizontal 
red line represents the position of the 
centromere (see Figure 4-21), which 
appears as a constriction on mitotic 
chromosomes. The red knobs on 
chromosomes 13, 14, 15, 21, and 22 
indicate the positions of genes that code 
for the large ribosomal RNAs (discussed in 
Chapter 6). These patterns are obtained by 
staining chromosomes with Giemsa stain, 
and they can be observed under the light 
microscope. (For micrographs, see Figure 
21-18; adapted from U. Franke, Cytogenet 
Cell Genet 31:24-32, 1981. With 
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Figure 4-12 An aberrant human chromosome. (A) Two pairs of 
chromosomes, stained with Giemsa (see Figure 4-1 1 ), from a patient with 
ataxia, a disease characterized by progressive deterioration of motor skills. 
The patient has a normal pair of chromosome 4s (left-hand pair), but one 
normal chromosome 1 2 and one aberrant chromosome 1 2, as seen by its 
greater length (right-hand pair). The additional material contained on the 
aberrant chromosome 12 {red bracket) was deduced, from its pattern of 
bands, as a copy of part of chromosome 4 that had become attached to 
chromosome 1 2 through an abnormal recombination event, called a 
chromosomal translocation. (B) Drawing of the same two chromosome 
pairs, "painted" red for chromosome 4 DNA and blue for chromosome 1 2 
DNA. The two techniques give rise to the same conclusion regarding the 
nature of the aberrant chromosome 1 2, but chromosome painting 
provides better resolution, allowing the clear identification of even short 
pieces of chromosomes that have become translocated. However, Giemsa 
staining is easier to perform. (Adapted from E. Schrock et al., Science 
273:494-497, 1996. With permission from AAAS.) 
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Chromosomes Contain Long Strings of Genes 

Chromosomes carry genes — the functional units of heredity. A gene is usually 
defined as a segment of DNA that contains the instructions for making a partic- 
ular protein (or a set of closely related proteins). Although this definition holds 
for the majority of genes, several percent of genes produce an RNA molecule, 
instead of a protein, as their final product. Like proteins, these RNA molecules 
perform a diverse set of structural and catalytic functions in the cell, and we dis- 
cuss them in detail in subsequent chapters. 

As might be expected, some correlation exists between the complexity of an 
organism and the number of genes in its genome (see Table 1-1, p. 18). For 
example, some simple bacteria have only 500 genes, compared to about 25,000 
for humans. Bacteria and some single-celled eucaryotes, such as yeast, have 
especially concise genomes; the complete nucleotide sequence of their 
genomes reveals that the DNA molecules that make up their chromosomes are 
little more than strings of closely packed genes (Figure 4-13). However, chro- 
mosomes from many eucaryotes (including humans) contain, in addition to 
genes, a large excess of interspersed DNA that does not seem to carry critical 
information. Sometimes called "junk DNA" to signify that its usefulness to the 
cell has not been demonstrated, the particular nucleotide sequence of most of 
this DNA may not be important. However, some of this DNA is crucial for the 
proper expression of certain genes, as we discuss elsewhere. 

Because of differences in the amount of DNA interspersed between genes, 
genome sizes can vary widely (see Figure 1-37). For example, the human 
genome is 200 times larger than that of the yeast S. cerevisiae, but 30 times 
smaller than that of some plants and amphibians and 200 times smaller than 
that of a species of amoeba. Moreover, because of differences in the amount of 
excess DNA, the genomes of similar organisms (bony fish, for example) can vary 
several hundredfold in their DNA content, even though they contain roughly the 
same number of genes. Whatever the excess DNA may do, it seems clear that it 
is not a great handicap for a eucaryotic cell to carry a large amount of it. 

How the genome is divided into chromosomes also differs from one eucary- 
otic species to the next. For example, compared with 46 for humans, somatic 
cells from a species of small deer contain only 6 chromosomes, while those from 
a species of carp contain over 100. Even closely related species with similar 
genome sizes can have very different numbers and sizes of chromosomes (Fig- 
ure 4-14). Thus, there is no simple relationship between chromosome number, 



Figure 4-1 3 The arrangement of genes 
in the genome of S. cerevisiae. 

S. cerevisiae is a budding yeast widely 
used for brewing and baking. The 
genome of this yeast cell is distributed 
over 16 chromosomes. A small region of 
one chromosome has been arbitrarily 
selected to show the high density of 
genes characteristic of this species. As 
indicated by the light red shading, some 
genes are transcribed from the lower 
strand, while others are transcribed from 
the upper strand. There are about 
6300 genes in the complete genome, 
which contains somewhat more than 
1 2 million nucleotide pairs. (For the 
closely packed genes of a bacterium 
whose genome is 4.6 million nucleotides 
long, see Figure 1-29). 
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species complexity, and total genome size. Rather, the genomes and chromo- 
somes of modern-day species have each been shaped by a unique history of 
seemingly random genetic events, acted on by selection pressures over long evo- 
lutionary times. 



The Nucleotide Sequence of the Human Genome Shows How Our 
Genes Are Arranged 

In Chapter 1 we discussed, in general terms, how the information in DNA is read 
out and used, through RNA intermediates, to make proteins (see Figure 1-4). In 
1999, it became possible for the first time to see exactly how genes are arranged 
along an entire vertebrate chromosome (Figure 4-15). Today, with the publica- 
tion of the "first draft" of the entire human genome in 2001 and the "finished 
DNA sequence" in 2004, the genetic information in all human chromosomes is 
available. The sheer quantity of information provided by the Human Genome 
Project is staggering (Figure 4-16 and Table 4-1). At its peak, the Project gener- 
ated raw nucleotide sequences at a rate of 1000 nucleotides per second around 
the clock. It will be many decades before this information is fully analyzed, but 
it has already stimulated new experiments that have had major effects on the 
content of every chapter in this book. 

The first striking feature of the human genome is how little of it (only a few 
percent) codes for proteins (Figure 4-17). Much of the remaining chromosomal 

(A) human chromosome 22 in its mitotic conformation, 

composed of two DNA molecules, each 48 x 10 6 nucleotide pairs long 
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Figure 4-1 4 Two closely related species of 
deer with very different chromosome 
numbers. In the evolution of the Indian 
muntjac, initially separate chromosomes 
fused, without having a major effect on the 
animal. These two species contain a similar 
number of genes. (Adapted from 
M.W. Strickberger, Evolution, 3rd ed. Sudbury, 
MA: Jones & Bartlett Publishers, 2000.) 



Figure 4-1 5 The organization of genes on 
a human chromosome. (A) Chromosome 
22, one of the smallest human 
chromosomes, contains 48 x 10 6 nucleotide 
pairs and makes up approximately 1 .5% of 
the entire human genome. Most of the left 
arm of chromosome 22 consists of short 
repeated sequences of DNA that are 
packaged in a particularly compact form of 
chromatin (heterochromatin), which is 
discussed later in this chapter. (B) A tenfold 
expansion of a portion of chromosome 22, 
with about 40 genes indicated. Those in 
dark brown are known genes and those in 
red are predicted genes. (C) An expanded 
portion of (B) shows the entire length of 
several genes. (D)The intron-exon 
arrangement of a typical gene is shown 
after a further tenfold expansion. Each exon 
(red) codes for a portion of the protein, 
while the DNA sequence of the introns 
(gray) is relatively unimportant, as discussed 
in detail in Chapter 6. 

The human genome (3.2 x 1 9 nucleotide 
pairs) is the totality of genetic information 
belonging to our species. Almost all of this 
genome is distributed over the 22 
autosomes and 2 sex chromosomes (see 
Figures 4-1 and 4-1 1 ) found within the 
nucleus. A minute fraction of the human 
genome (16,569 nucleotide pairs— in 
multiple copies per cell) is found in the 
mitochondria (introduced in Chapter 1, and 
discussed in detail in Chapter 14). The term 
human genome sequence refers to the 
complete nucleotide sequence of DNA in 
the 24 nuclear chromosomes and the 
mitochondria. Being diploid, a human 
somatic cell nucleus contains roughly twice 
the haploid amount of DNA, or 6.4 x 1 9 
nucleotide pairs when not duplicating its 
chromosomes in preparation for division. 
(Adapted from International Human 
Genome Sequencing Consortium, Nature 
409:860-921, 2001. With permission from 
Macmillan Publishers Ltd.) 
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DNA is made up of short, mobile pieces of DNA that have gradually inserted 
themselves in the chromosome over evolutionary time. We discuss these trans- 
posable elements in detail in later chapters. 

A second notable feature of the human genome is the large average gene size 
of 27,000 nucleotide pairs. As discussed above, a typical gene carries in its linear 
sequence of nucleotides the information for the linear sequence of the amino 
acids of a protein. Only about 1300 nucleotide pairs are required to encode a pro- 
tein of average size (about 430 amino acids in humans). Most of the remaining 
DNA in a gene consists of long stretches of noncoding DNA that interrupt the rel- 
atively short segments of DNA that code for protein. As will be discussed in detail 
in Chapter 6, the coding sequences are called exons; the intervening (noncoding) 
sequences in genes are called introns (see Figure 4-15 and Table 4-1). The major- 
ity of human genes thus consist of a long string of alternating exons and introns, 
with most of the gene consisting of introns. In contrast, the majority of genes from 
organisms with concise genomes lack introns. This accounts for the much smaller 
size of their genes (about one-twentieth that of human genes), as well as for the 
much higher fraction of coding DNA in their chromosomes. 

In addition to introns and exons, each gene is associated with regulatory 
DNA sequences, which are responsible for ensuring that the gene is turned on or 
off at the proper time, expressed at the appropriate level, and only in the proper 
type of cell. In humans, the regulatory sequences for a typical gene are spread 
out over tens of thousands of nucleotide pairs. As would be expected, these reg- 
ulatory sequences are more compressed in organisms with concise genomes. We 
discuss in Chapter 7 how regulatory DNA sequences work. 

Finally, the nucleotide sequence of the human genome has revealed that the 
critical information needed to produce a human seems to be in an alarming state 
of disarray. As one commentator described our genome, "In some ways it may 
resemble your garage/bedroom/refrigerator/life: highly individualistic, but 
unkempt; little evidence of organization; much accumulated clutter (referred to 
by the uninitiated as 'junk'); virtually nothing ever discarded; and the few patently 
valuable items indiscriminately, apparently carelessly, scattered throughout." 



(A) 




Figure 4-16 Scale of the human 
genome. If drawn with a 1 mm space 
between each nucleotide, as in (A), the 
human genome would extend 3200 km 
(approximately 2000 miles), far enough to 
stretch across the center of Africa, the site 
of our human origins (red line in B). At this 
scale, there would be, on average, a 
protein-coding gene every 130 m. An 
average gene would extend for 30 m, but 
the coding sequences in this gene would 
add up to only just over a meter. 



Table 4-1 Some Vital Statistics for the Human Genome 



DNA length 

Number of genes 

Largest gene 

Mean gene size 

Smallest number of exons per gene 

Largest number of exons per gene 

Mean number of exons per gene 

Largest exon size 

Mean exon size 

Number of pseudogenes** 

Percentage of DNA sequence in exons (protein 

coding sequences) 
Percentage of DNA in other highly conserved 

sequences*** 
Percentage of DNA in high-copy repetitive elements 



HUMAN GENOME 

3.2 x 10 9 nucleotide pairs* 

approximately 25,000 

2.4 x 10 6 nucleotide pairs 

27,000 nucleotide pairs 

1 

178 

10.4 

17,106 nucleotide pairs 

145 nucleotide pairs 

more than 20,000 

1.5% 

3.5% 
approximately 50% 



* The sequence of 2.85 billion nucleotides is known precisely (error rate of only about one in 
100,000 nucleotides). The remaining DNA primarily consists of short highly repeated sequences 
that are tandemly repeated, with repeat numbers differing from one individual to the next 

** A pseudogene is a nucleotide sequence of DNA closely resembling that of a functional gene, 
but containing numerous mutations that prevent its proper expression. Most pseudogenes arise 
from the duplication of a functional gene followed by the accumulation of damaging mutations 
in one copy. 

*** Preserved functional regions; these include DNA encoding 5' and 3' UTRs (untranslated 
regions), structural and functional RNAs, and conserved protein-binding sites on the DNA. 
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Genome Comparisons Reveal Evolutionary Conserved DNA 
Sequences 

A major obstacle in interpreting the nucleotide sequences of human chromo- 
somes is the fact that much of the sequence is probably unimportant. More- 
over, the coding regions of the genome (the exons) are typically found in short 
segments (average size about 145 nucleotide pairs) floating in a sea of DNA 
whose exact nucleotide sequence is of little consequence. This arrangement 
makes it very difficult to identify all the exons in a stretch of DNA sequence. 
Even harder is the determination of where a gene begins and ends and exactly 
how many exons it spans. 

Accurate gene identification requires approaches that extract information 
from the inherently low signal-to-noise ratio of the human genome. We shall 
describe some of them in Chapter 8. Here we discuss only one general approach, 
which is based on the observation that sequences that have a function are rela- 
tively conserved during evolution, whereas those without a function are free to 
mutate randomly. The strategy is therefore to compare the human sequence 
with that of the corresponding regions of a related genome, such as that of the 
mouse. Humans and mice are thought to have diverged from a common mam- 
malian ancestor about 80 x 10 6 years ago, which is long enough for the majority 
of nucleotides in their genomes to have been changed by random mutational 
events. Consequently, the only regions that will have remained closely similar in 
the two genomes are those in which mutations would have impaired function 
and put the animals carrying them at a disadvantage, resulting in their elimina- 
tion from the population by natural selection. Such closely similar regions are 
known as conserved regions. The conserved regions include both functionally 
important exons and regulatory DNA sequences. In contrast, nonconserved 
regions represent DNA whose sequence is unlikely to be critical for function. 

The power of this method can be increased by comparing our genome with 
the genomes of additional animals whose genomes have been completely 
sequenced, including the rat, chicken, chimpanzee, and dog. By revealing in this 
way the results of a very long natural "experiment," lasting for hundreds of mil- 
lions of years, such comparative DNA sequencing studies have highlighted the 
most interesting regions in these genomes. The comparisons reveal that roughly 
5% of the human genome consists of "multi-species conserved sequences," as 
discussed in detail near the end of this chapter. Unexpectedly, only about one- 
third of these sequences code for proteins. Some of the conserved noncoding 
sequences correspond to clusters of protein-binding sites that are involved in 
gene regulation, while others produce RNA molecules that are not translated 
into protein. But the function of the majority of these sequences remains 
unknown. This unexpected discovery has led scientists to conclude that we 
understand much less about the cell biology of vertebrates than we had previ- 
ously imagined. Certainly, there are enormous opportunities for new discover- 
ies, and we should expect many surprises ahead. 

Comparative studies have revealed not only that humans and other mammals 
share most of the same genes, but also that large blocks of our genomes contain 
these genes in the same order, a feature called conserved synteny. As a result, large 
blocks of our chromosomes can be recognized in other species. This allows the 
chromosome painting technique to be used to reconstruct the recent evolution- 
arv historv of human chromosomes (Fieure 4-18). 



Figure 4-1 7 Representation of the 
nucleotide sequence content of the 
completely sequenced human genome. 

The LINEs, SINEs, retroviral-like elements, 
and DNA-only transposons are mobile 
genetic elements that have multiplied in 
our genome by replicating themselves 
and inserting the new copies in different 
positions. These mobile genetic elements 
are discussed in Chapter 5 (see Table 5-3, 
p. 318). Simple sequence repeats are 
short nucleotide sequences (less than 14 
nucleotide pairs) that are repeated again 
and again for long stretches. Segmental 
duplications are large blocks of the 
genome (1000-200,000 nucleotide pairs) 
that are present at two or more locations 
in the genome. The most highly repeated 
blocks of DNA in heterochromatin have 
not yet been completely sequenced; 
therefore about 10% of human DNA 
sequences are not represented in this 
diagram. (Data courtesy of 
E. Margulies.) 
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Chromosomes Exist in Different States Throughout the Life of a Cell 

We have seen how genes are arranged in chromosomes, but to form a functional 
chromosome, a DNA molecule must be able to do more than simply carry genes: 
it must be able to replicate, and the replicated copies must be separated and reli- 
ably partitioned into daughter cells at each cell division. This process occurs 
through an ordered series of stages, collectively known as the cell cycle, which 
provides for a temporal separation between the duplication of chromosomes 
and their segregation into two daughter cells. The cell cycle is briefly summa- 
rized in Figure 4-19, and it is discussed in detail in Chapter 17. Only certain 



Figure 4-18 A proposed evolutionary 
history of human chromosome 3 and its 
relatives in other mammals. (A) The 

order of chromosome 3 segments 
hypothesized to be present on a 
chromosome of a mammalian ancestor is 
shown {yellow box). The minimum 
changes in this ancestral chromosome 
necessary to account for the appearance 
of each of the three modem 
chromosomes are indicated. (The 
present-day chromosomes of humans 
and African apes are identical at this 
resolution.) The small circles depicted in 
the modern chromosomes represent the 
positions of centromeres. A fission and 
inversion that leads to a change in 
chromosome organization is thought to 
occur once every 5-10x1 6 years in 
mammals. (B) Some of the chromosome 
painting experiments that led to the 
diagram in (A). Each image shows the 
chromosome most closely related to 
human chromosome 3, painted green by 
hybridization with different segments of 
DNA, lettered a, b, c, and d along the 
bottom of the figure. These letters 
correspond to the colored segments of 
the diagrams in (A), as indicated on the 
ancestral chromosome. (From S. Muller et 
al., Proc Natl Acad. Sci. U.S.A. 97:206-21 1, 
2000. With permission from National 
Academy of Sciences.) 
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Figure 4-19 A simplified view of the eucaryotic cell cycle. During interphase, the cell is actively expressing its genes and 
is therefore synthesizing proteins. Also, during interphase and before cell division, the DNA is replicated and each 
chromosome is duplicated to produce two closely paired daughter chromosomes (a cell with only two chromosomes is 
illustrated here). Once DNA replication is complete, the cell can enter M phase, when mitosis occurs and the nucleus is 
divided into two daughter nuclei. During this stage, the chromosomes condense, the nuclear envelope breaks down, and 
the mitotic spindle forms from microtubules and other proteins. The condensed mitotic chromosomes are captured by the 
mitotic spindle, and one complete set of chromosomes is then pulled to each end of the eel! by separating each daughter 
chromosome pair. A nuclear envelope re-forms around each chromosome set, and in the final step of M phase, the cell 
divides to produce two daughter cells. Most of the time in the cell cycle is spent in interphase; M phase is brief in 
comparison, occupying only about an hour in many mammalian cells. 
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Figure 4-20 A comparison of extended 
interphase chromatin with the 
chromatin in a mitotic chromosome. 

(A) A scanning electron micrograph of a 
mitotic chromosome: a condensed 
duplicated chromosome in which the 
two new chromosomes are still linked 
together (see Figure 4-21 ). The 
constricted region indicates the position 
of the centromere, described in Figure 
4-21. (B) An electron micrograph 
showing an enormous tangle of 
chromatin spilling out of a lysed 
interphase nucleus. Note the difference in 
scales. (A, courtesy of Terry D. Allen; 
B, courtesy of Victoria Foe.) 
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parts of the cycle concern us in this chapter. During interphase chromosomes 
are replicated, and during mitosis they become highly condensed and then are 
separated and distributed to the two daughter nuclei. The highly condensed 
chromosomes in a dividing cell are known as mitotic chromosomes (Figure 
4-20A). This is the form in which chromosomes are most easily visualized; in 
fact, the images of chromosomes shown so far in the chapter are of chromo- 
somes in mitosis. During cell division, this condensed state is important for the 
accurate separation of the duplicated chromosomes by the mitotic spindle, as 
discussed in Chapter 17. 

During the portions of the cell cycle when the cell is not dividing, the chro- 
mosomes are extended and much of their chromatin exists as long, thin tangled 
threads in the nucleus so that individual chromosomes cannot be easily distin- 
guished (Figure 4-20B). We shall refer to chromosomes in this extended state as 
interphase chromosomes. Since cells spend most of their time in interphase, and 
this is where their genetic information is being read out, chromosomes are of 
greatest interest to cell biologists when they are least visible. 



Each DNA Molecule That Forms a Linear Chromosome Must 
Contain a Centromere, Two Telomeres, and Replication Origins 

A chromosome operates as a distinct structural unit: for a copy to be passed on 
to each daughter cell at division, each chromosome must be able to replicate, 
and the newly replicated copies must subsequently be separated and parti- 
tioned correctly into the two daughter cells. These basic functions are controlled 
by three types of specialized nucleotide sequences in the DNA, each of which 
binds specific proteins that guide the machinery that replicates and segregates 
chromosomes (Figure 4-21). 

Experiments in yeasts, whose chromosomes are relatively small and easy to 
manipulate, have identified the minimal DNA sequence elements responsible 
for each of these functions. One type of nucleotide sequence acts as a DNA repli- 
cation origin, the location at which duplication of the DNA begins. Eucaryotic 
chromosomes contain many origins of replication to ensure that the entire chro- 
mosome can be replicated rapidly, as discussed in detail in Chapter 5. 

After replication, the two daughter chromosomes remain attached to one 
another and, as the cell cycle proceeds, are condensed further to produce 
mitotic chromosomes. The presence of a second specialized DNA sequence, 
called a centromere, allows one copy of each duplicated and condensed chro- 
mosome to be pulled into each daughter cell when a cell divides. A protein 
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Figure 4-21 The three DNA sequences 
required to produce a eucaryotic 
chromosome that can be replicated and 
then segregated at mitosis. Each 
chromosome has multiple origins of 
replication, one centromere, and two 
telomeres. Shown here is the sequence of 
events that a typical chromosome follows 
during the cell cycle. The DNA replicates 
in interphase, beginning at the origins of 
replication and proceeding 
bidirectionally from the origins across the 
chromosome. In M phase, the centromere 
attaches the duplicated chromosomes to 
the mitotic spindle so that one copy is 
distributed to each daughter cell during 
mitosis. The centromere also helps to 
hold the duplicated chromosomes 
together until they are ready to be 
moved apart. The telomeres form special 
caps at each chromosome end. 



complex called a kinetochore forms at the centromere and attaches the dupli- 
cated chromosomes to the mitotic spindle, allowing them to be pulled apart 
(discussed in Chapter 17). 

The third specialized DNA sequence forms telomeres, the ends of a chromo- 
some. Telomeres contain repeated nucleotide sequences that enable the ends of 
chromosomes to be efficiently replicated. Telomeres also perform another func- 
tion: the repeated telomere DNA sequences, together with the regions adjoining 
them, form structures that protect the end of the chromosome from being mis- 
taken by the cell for a broken DNA molecule in need of repair. We discuss both 
this type of repair and the structure and function of telomeres in Chapter 5. 

In yeast cells, the three types of sequences required to propagate a chromo- 
some are relatively short (typically less than 1000 base pairs each) and therefore 
use only a tiny fraction of the information-carrying capacity of a chromosome. 
Although telomere sequences are fairly simple and short in all eucaryotes, the 
DNA sequences that form centromeres and replication origins in more complex 
organisms are much longer than their yeast counterparts. For example, experi- 
ments suggest that human centromeres contain up to 100,000 nucleotide pairs 
and may not require a stretch of DNA with a defined nucleotide sequence. 
Instead, as we shall discuss later in this chapter, they seem to consist of a large, 
regularly repeating protein-nucleic acid structure that can be inherited when a 
chromosome replicates. 



DNA Molecules Are Highly Condensed in Chromosomes 

All eucaryotic organisms have special ways of packaging DNA into chromo- 
somes. For example, if the 48 million nucleotide pairs of DNA in human chro- 
mosome 22 could be laid out as one long perfect double helix, the molecule 
would extend for about 1.5 cm if stretched out end to end. But chromosome 22 
measures only about 2 jam in length in mitosis (see Figures 4-10 and 4-11), rep- 
resenting an end-to-end compaction ratio of nearly 10,000-fold. This remark- 
able feat of compression is performed by proteins that successively coil and fold 
the DNA into higher and higher levels of organization. Although much less con- 
densed than mitotic chromosomes, the DNA of human interphase chromo- 
somes is still tightly packed, with an overall compaction ratio of approximately 
500-fold (the length of a chromosome's DNA helix divided by the end-to-end 
length of that chromosome). 

In reading these sections it is important to keep in mind that chromosome 
structure is dynamic. We have seen that each chromosome condenses to an 
unusual degree in the M phase of the cell cycle. Much less visible, but of enor- 
mous interest and importance, specific regions of interphase chromosomes 
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decondense as the cells gain access to specific DNA sequences for gene 
expression, DNA repair, and replication — and then recondense when these 
processes are completed. The packaging of chromosomes is therefore accom- 
plished in a way that allows rapid localized, on-demand access to the DNA. In 
the next sections we discuss the specialized proteins that make this type of 
packaging possible. 



Nucleosomes Are a Basic Unit of Eucaryotic Chromosome 
Structure 

The proteins that bind to the DNA to form eucaryotic chromosomes are tradi- 
tionally divided into two general classes: the histones and the nonhistone chro- 
mosomal proteins. The complex of both classes of protein with the nuclear DNA 
of eucaryotic cells is known as chromatin. Histones are present in such enor- 
mous quantities in the cell (about 60 million molecules of each type per human 
cell) that their total mass in chromatin is about equal to that of the DNA. 

Histones are responsible for the first and most basic level of chromosome 
packing, the nucleosome, a protein-DNA complex discovered in 1974. When 
interphase nuclei are broken open very gently and their contents examined 
under the electron microscope, most of the chromatin is in the form of a fiber 
with a diameter of about 30 nm (Figure 4-22A). If this chromatin is subjected to 
treatments that cause it to unfold partially, it can be seen under the electron 
microscope as a series of "beads on a string" (Figure 4-22B). The string is DNA, 
and each bead is a "nucleosome core particle" that consists of DNA wound 
around a protein core formed from histones. <ACTC> 

The structural organization of nucleosomes was determined after first iso- 
lating them from unfolded chromatin by digestion with particular enzymes 
(called nucleases) that break down DNA by cutting between the nucleosomes. 
After digestion for a short period, the exposed DNA between the nucleosome 
core particles, the linker DNA, is degraded. Each individual nucleosome core 
particle consists of a complex of eight histone proteins — two molecules each of 
histones H2A, H2B, H3, and H4— and double-stranded DNA that is 147 
nucleotide pairs long. The histone octamer forms a protein core around which 
the double-stranded DNA is wound (Figure 4-23). 

Each nucleosome core particle is separated from the next by a region of 
linker DNA, which can vary in length from a few nucleotide pairs up to about 80. 
(The term nucleosome technically refers to a nucleosome core particle plus one 
of its adjacent DNA linkers, but it is often used synonymously with nucleosome 
core particle.) On average, therefore, nucleosomes repeat at intervals of about 
200 nucleotide pairs. For example, a diploid human cell with 6.4 x 10 9 nucleotide 
pairs contains approximately 30 million nucleosomes. The formation of nucleo- 
somes converts a DNA molecule into a chromatin thread about one-third of its 
initial length. 
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Figure 4-22 Nucleosomes as seen in the 
electron microscope. (A) Chromatin 
isolated directly from an interphase 
nucleus appears in the electron 
microscope as a thread 30 nm thick. 
(B) This electron micrograph shows a 
length of chromatin that has been 
experimentally unpacked, or 
decondensed, after isolation to show the 
nucleosomes. (A, courtesy of Barbara 
Hamkalo; B, courtesy of Victoria Foe.) 
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Figure 4-23 Structural organization of the nucleosome. A nucleosome 
contains a protein core made of eight histone molecules. In biochemical 
experiments, the nucleosome core particle can be released from isolated 
chromatin by digestion of the linker DNA with a nuclease, an enzyme that 
breaks down DNA. (The nuclease can degrade the exposed linker DNA but 
cannot attack the DNA wound tightly around the nucleosome core.) After 
dissociation of the isolated nucleosome into its protein core and DNA, the 
length of the DNA that was wound around the core can be determined. 
This length of 147 nucleotide pairs is sufficient to wrap 1 .7 times around 
the histone core. 



The Structure of the Nucleosome Core Particle Reveals How DNA 
Is Packaged 

The high-resolution structure of a nucleosome core particle, solved in 1997, 
revealed a disc-shaped histone core around which the DNA was tightly wrapped 
1.7 turns in a left-handed coil (Figure 4-24). All four of the histones that make 
up the core of the nucleosome are relatively small proteins (102-135 amino 
acids), and they share a structural motif, known as the histone fold, formed from 
three a helices connected by two loops (Figure 4-25). In assembling a nucleo- 
some, the histone folds first bind to each other to form H3-H4 and H2A-H2B 
dimers, and the H3-H4 dimers combine to form tetramers. An H3-H4 tetramer 
then further combines with two H2A-H2B dimers to form the compact octamer 
core, around which the DNA is wound (Figure 4-26). 

The interface between DNA and histone is extensive: 142 hydrogen bonds 
are formed between DNA and the histone core in each nucleosome. Nearly half 
of these bonds form between the amino acid backbone of the histones and the 
phosphodiester backbone of the DNA. Numerous hydrophobic interactions and 
salt linkages also hold DNA and protein together in the nucleosome. For exam- 
ple, more than one-fifth of the amino acids in each of the core histones are either 
lysine or arginine (two amino acids with basic side chains), and their positive 
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Figure 4-24 The structure of a nucleosome 
core particle, as determined by x-ray 
diffraction analyses of crystals. Each 
histone is colored according to the scheme 
in Figure 4-23, with the DNA double helix in 
light gray. (From K. Luger et al., Nature 
389:251-260, 1997. With permission from 
Macmillan Publishers Ltd.) 
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Figure 4-25 The overall structural organization of the core histones. 

(A) Each of the core histones contains an N-terminal tail, which is subject to 
several forms of covalent modification, and a histone fold region, as 
indicated. (B) The structure of the histone fold, which is formed by all four 
of the core histones. (C) Histones 2A and 2B form a dimer through an 
interaction known as the "handshake." Histones H3 and H4 form a dimer 
through the same type of interaction. 
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charges can effectively neutralize the negatively charged DNA backbone. These 
numerous interactions explain in part why DNA of virtually any sequence can be 
bound on a histone octamer core. The path of the DNA around the histone core 
is not smooth; rather, several kinks are seen in the DNA, as expected from the 
nonuniform surface of the core. The bending requires a substantial compression 
of the minor groove of the DNA helix. Certain dinucleotides in the minor groove 
are especially easy to compress, and some nucleotide sequences bind the nucle- 
osome more tightly than others (Figure 4-27). This probably explains some 
striking, but unusual, cases of very precise positioning of nucleosomes along a 
stretch of DNA. For most of the DNA sequences found in chromosomes, how- 
ever, the sequence preference of nucleosomes must be small enough to allow 
other factors to dominate, inasmuch as nucleosomes can occupy any one of a 
number of positions relative to the DNA sequence in most chromosomal 
regions. 

In addition to its histone fold, each of the core histones has an N-terminal 
amino acid "tail", which extends out from the DNA-histone core (see Figure 
4-26). These histone tails are subject to several different types of covalent mod- 
ifications that in turn control critical aspects of chromatin structure and func- 
tion, as we shall discuss shortly. 

As a reflection of their fundamental role in DNA function through control- 
ling chromatin structure, the histones are among the most highly conserved 
eucaryotic proteins. For example, the amino acid sequence of histone H4 from a 
pea and from a cow differ at only 2 of the 102 positions. This strong evolutionary 
conservation suggests that the functions of histones involve nearly all of their 
amino acids, so that a change in any position is deleterious to the cell. This sug- 
gestion has been tested directly in yeast cells, in which it is possible to mutate a 
given histone gene in vitro and introduce it into the yeast genome in place of the 
normal gene. As might be expected, most changes in histone sequences are 
lethal; the few that are not lethal cause changes in the normal pattern of gene 
expression, as well as other abnormalities. 

Despite the high conservation of the core histones, eucaryotic organisms 
also produce smaller amounts of specialized variant core histones that differ in 
amino acid sequence from the main ones. As we shall see, these variants, com- 
bined with a surprisingly large variety of covalent modifications that can be 
added to the histones in nucleosomes, make possible the many different chro- 
matin structures that are required for DNA function in higher eucaryotes. 
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Figure 4-26 The assembly of a histone 
octamer on DNA. The histone H3-H4 
dimer and the H2A-H2B dinner are 
formed from the handshake interaction. 
An H3-H4 tetramer forms and binds to 
the DNA. Two H2A-H2B dimers are then 
added, to complete the nucleosome. The 
histones are colored as in Figures 4-24 
and 4-25. Note that all eight N-terminal 
tails of the histones protrude from the 
disc-shaped core structure. Their 
conformations are highly flexible. 
Inside the cell, the nucleosome 
assembly reactions shown here are 
mediated by histone chaperone proteins, 
some specific for H3-H4 and others 
specific for H2A-H2B. (Adapted from 
figures by J. Waterborg.) 
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Figure 4-27 The bending of DNA in a 
nucleosome. The DNA helix makes 
1 .7 tight turns around the histone 
octamer. This diagram illustrates how the 
minor groove is compressed on the 
inside of the turn. Owing to certain 
structural features of the DNA molecule, 
the indicated dinucleotides are 
preferentially accommodated in such a 
narrow minor groove, which helps to 
explain why certain DNA sequences will 
bind more tightly than others to the 
nucleosome core. 
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Figure 4-28 Dynamic nucleosomes. 

Kinetic measurements show that the DNA 
in an isolated nucleosome is surprisingly 
dynamic, rapidly uncoiling and then 
rewrapping around its nucleosome core. 
As indicated, this makes most of its bound 
DNA sequence accessible to other DNA- 
binding proteins. (Data from G. Li and 
J. Widom, Nat. Struct Mol. Biol. 1 1 :763-769, 
2004. With permission from Macmillan 
Publishers Ltd.) 



Nucleosomes Have a Dynamic Structure, and Are Frequently 
Subjected to Changes Catalyzed by ATP-Dependent Chromatin- 
Remodeling Complexes 

For many years biologists thought that, once formed in a particular position on 
DNA, a nucleosome remains fixed in place because of the very tight association 
between its core histones and DNA. If true, this would pose problems for genetic 
readout mechanisms, which in principle require rapid access to many specific 
DNA sequences, as well as for the rapid passage of the DNA transcription and 
replication machinery through chromatin. But kinetic experiments show that the 
DNA in an isolated nucleosome unwraps from each end at rate of about 4 times 
per second, remaining exposed for 10 to 50 milliseconds before the partially 
unwrapped structure recloses. Thus, most of the DNA in an isolated nucleosome 
is in principle available for binding other proteins (Figure 4-28). 

For the chromatin in a cell, a further loosening of DNA-histone contacts is 
clearly required, because eucaryotic cells contain a large variety of ATP-depen- 
dent chromatin remodeling complexes. The subunit in these complexes that 
hydrolyzes ATP is evolutionary related to the DNA helicases (discussed in 
Chapter 5), and it binds both to the protein core of the nucleosome and to the 
double-stranded DNA that winds around it. By using the energy of ATP hydroly- 
sis to move this DNA relative to the core, this subunit changes the structure of a 
nucleosome temporarily, making the DNA less tightly bound to the histone core. 
Through repeated cycles of ATP hydrolysis, the remodeling complexes can cat- 
alyze nucleosome sliding, and by pulling the nucleosome core along the DNA 
double helix in this way, they make the nucleosomal DNA available to other pro- 
teins in the cell (Figure 4-29). In addition, by cooperating with negatively 
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Figure 4-29 The nucleosome sliding 
catalyzed by ATP-dependent chromatin 
remodeling complexes. Using the 
energy of ATP hydrolysis, the remodeling 
complex is thought to push on the DNA 
of its bound nucleosome and loosen its 
attachment to the nucleosome core. Each 
cycle of ATP binding, ATP hydrolysis, and 
release of the ADP and Pj products 
thereby moves the DNA with respect to 
the histone octamer in the direction of 
the arrow in this diagram. It requires 
many such cycles to produce the 
nucleosome sliding shown. (See also 
Figure 4-46B.) 
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Figure 4-30 Nucleosome removal and histone 
exchange catalyzed by ATP-dependent 
chromatin remodeling complexes. By 

cooperating with specific histone chaperones, 
some chromatin remodeling complexes can 
remove the H2A-H2B dimers from a 
nucleosome (top series of reactions) and 
replace them with dimers that contain a variant 
histone, such as the H2AZ-H2B dimer (see 
Figure 4-41). Other remodeling complexes are 
attracted to specific sites on chromatin to 
remove the histone octamer completely and/or 
to replace it with a different nucleosome core 
(bottom series of reactions). 
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charged proteins that serve as histone chaperones, some remodeling complexes 
are able to remove either all or part of the nucleosome core from a nucleo- 
some— catalyzing either an exchange of its H2A-H2B histones, or the complete 
removal of the octameric core from the DNA (Figure 4-30). 

Cells contain dozens of different ATP-dependent chromatin remodeling com- 
plexes that are specialized for different roles. Most are large protein complexes 
that can contain 10 or more subunits. The activity of these complexes is carefully 
controlled by the cell. As genes are turned on and off, chromatin remodeling com- 
plexes are brought to specific regions of DNA where they act locally to influence 
chromatin structure (discussed in Chapter 7; see also Figure 4-46, below). 

As pointed out previously, for most of the DNA sequences found in chro- 
mosomes, experiments show that a nucleosome can occupy any one of a num- 
ber of positions relative to the DNA sequence. The most important influence on 
nucleosome positioning appears to be the presence of other tightly bound pro- 
teins on the DNA. Some bound proteins favor the formation of a nucleosome 
adjacent to them. Others create obstacles that force the nucleosomes to move 
to positions between them. The exact positions of nucleosomes along a stretch 
of DNA therefore depends mainly on the presence and nature of other proteins 
bound to the DNA. Due to the presence of ATP-dependent remodeling com- 
plexes, the arrangement of nucleosomes on DNA can be highly dynamic, 
changing rapidly according to the needs of the cell. 
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Nucleosomes Are Usually Packed Together into a Compact 
Chromatin Fiber 

Although enormously long strings of nucleosomes form on the chromosomal 
DNA, chromatin in a living cell probably rarely adopts the extended "beads on a 
string" form. Instead, the nucleosomes are packed on top of one another, gen- 
erating regular arrays in which the DNA is even more highly condensed. Thus, 
when nuclei are very gently lysed onto an electron microscope grid, most of the 
chromatin is seen to be in the form of a fiber with a diameter of about 30 nm, 
which is considerably wider than chromatin in the "beads on a string" form (see 
Figure 4-22). 
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Figure 4-31 A zigzag model for the 30-nm chromatin fiber. (A) The conformation of two of the four 
nucleosomes in a tetranucleosome, from a structure determined by x-ray crystallography. (B) Schematic of 
the entire tetranucleosome; the fourth nucleosome is not visible, being stacked on the bottom nucleosome 
and behind it in this diagram. (C) Diagrammatic illustration of a possible zigzag structure that could account 
for the 30-nm chromatin fiber. (Adapted from C.L Woodcock, Nat Struct Mol. Biol. 12:639-640, 2005. With 
permission from Macmillan Publishers Ltd.) 



How are nucleosomes packed in the 30-nm chromatin fiber? This question 
has not yet been answered definitively, but important information concerning 
the structure has been obtained. In particular, high-resolution structural analy- 
ses have been performed on homogeneous short strings of nucleosomes, pre- 
pared from purified histones and purified DNA molecules. The structure of a 
tetranucleosome, obtained by X-ray crystallography has been used to support a 
zigzag model for the stacking of nucleosomes in the 30-nm fiber (Figure 4-31). 
But cryoelectron microscopy of longer strings of nucleosomes supports a very 
different solenoidal structure with intercalated nucleosomes (Figure 4-32). 

What causes the nucleosomes to stack so tightly on each other in a 30-nm 
fiber? The nucleosome to nucleosome linkages formed by histone tails, most 
notably the H4 tail (Figure 4-33) constitute one important factor. Another 
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Figure 4-32 An interdigitated solenoid model for the 30-nm chromatin fiber. (A) Drawings in which strings of color- 
coded nucleosomes are used to illustrate how the solenoid is generated. (B) Schematic diagram of final structure in (A). 
(C) Structural model. The model is derived from high-resolution cryoelectron microscopy images of nucleosome arrays 
reconstituted from purified histones and DNA molecules of specific length and sequence. Both nucleosome octamers and 
a linker histone (discussed below) were used to produce regularly repeating arrays containing up to 72 nucleosomes. 
(Adapted from P. Robinson, L. Fairall, V. Huynh and D. Rhodes, Proc. Natl Acad Sci. U.S.A. 103:6506-651 1, 2006. With 
permission from National Academy of Sciences.) 
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important factor is an additional histone that is often present in a 1-to-l ratio 
with nucleosome cores, known as histone HI. This so-called linker histone is 
larger than the individual core histones and it has been considerably less well 
conserved during evolution. A single histone HI molecule binds to each nucle- 
osome, contacting both DNA and protein, and changing the path of the DNA as 
it exits from the nucleosome. Although it is not understood in detail how HI 
pulls nucleosomes together into the 30-nm fiber, a change in the exit path in 
DNA seems crucial for compacting nucleosomal DNA so that it interlocks to 
form the 30-nm fiber (Figure 4-34). Most eucaryotic organisms make several 
histone HI proteins of related but quite distinct amino acid sequences. 

It is possible that the 30-nm structure found in chromosomes is a fluid 
mosaic of several different variations. For example, a linker histone in the HI 
family was present in the nucleosomal arrays studied in Figure 4-32 but was 
missing from the tetranucleosome in Figure 4-31. Moreover, we saw earlier that 
the linker DNA that connects adjacent nucleosomes can vary in length; these 
differences in linker length probably introduce local perturbations into the 
structure. And the presence of many other DNA-binding proteins, as well as pro- 
teins that bind directly to histones, will certainly add important additional fea- 
tures to any array of nucleosomes. 



Figure 4-33 A speculative model for the 
role played by histone tails in the 
formation of the 30-nm fiber. (A) This 
schematic diagram shows the 
approximate exit points of the eight 
histone tails, one from each histone 
protein, that extend from each 
nucleosome. The actual structure is shown 
to its right. In the high-resolution structure 
of the nucleosome, the tails are largely 
unstructured, suggesting that they are 
highly flexible. (B) A speculative model 
showing how the histone tails may help to 
pack nucleosomes together into the 
30-nm fiber. This model is based on (1) 
experimental evidence that histone tails 
aid in the formation of the 30-nm fiber, 
and (2) the x-ray crystal structure of the 
nucleosome, in which the tails of one 
nucleosome contact the histone core of an 
adjacent nucleosome in the crystal lattice. 



Summary 

A gene is a nucleotide sequence in a DNA molecule that acts as afunctional unit for the 
production of a protein, a structural RNA, or a catalytic or regulatory RNA molecule. In 
eucaryotes, protein- coding genes are usually composed of a string of alternating introns 
and exons associated with regulatory regions of DNA. A chromosome is formed from a 
single, enormously long DNA molecule that contains a linear array of many genes. The 
human genome contains 3.2 x 10 9 DNA nucleotide pairs, divided between 22 different 
autosomes and 2 sex chromosomes. Only a small percentage of this DNA codes for pro- 
teins or functional RNA molecules. A chromosomal DNA molecule also contains three 
other types of functionally important nucleotide sequences: replication origins and 
telomeres allow the DNA molecule to be efficiently replicated, while a centromere 
attaches the daughter DNA molecules to the mitotic spindle, ensuring their accurate 
segregation to daughter cells during the M phase of the cell cycle. 
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Figure 4-34 How the linker histone 
binds to the nucleosome. The position 
and structure of the globular region of 
histone HI are shown. As indicated, this 
region constrains an additional 
20 nucleotide pairs of DNA where it exits 
from the nucleosome core. This type of 
binding by HI is thought to be important 
for forming the 30-nm chromatin fiber. 
The long C-terminal tail of histone H1 is 
also required for the high-affinity binding 
of H1 to chromatin, but neither its 
position or that of the N-terminal tail is 
known. (A) Schematic, (B) structure. 
(B, from D. Brown, T Izard and T. Misteli, 
Nat. Struct Moi. Biol. 1 3:250-255, 2006. 
With permission from Macmillan 
Publishers Ltd.) 
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The DNA in eucaryotes is tightly bound to an equal mass of histones, which form 
repeated arrays of DNA-protein particles called nucleosomes. The nucleosome is com- 
posed of an octameric core ofhistone proteins around which the DNA double helix is 
wrapped. Nucleosomes are spaced at intervals of about 200 nucleotide pairs, and they 
are usually packed together (with the aid ofhistone HI molecules) into quasi-regular 
arrays to form a 30-nm chromatin fiber Despite the high degree of compaction in 
chromatin, its structure must be highly dynamic to allow access to the DNA. There is 
some spontaneous DNA unwrapping and rewrapping in the nucleosome itself; how- 
ever, the general strategy for reversibly changing local chromatin structure features 
ATP-driven chromatin remodeling complexes. Cells contain a large set of such com- 
plexes, which are targeted to specific regions of chromatin at appropriate times. The 
remodeling complexes collaborate with histone chaperones to allow nucleosome cores 
to be repositioned, reconstituted with different histones, or completely removed to 
expose the underlying DNA. 



THE REGULATION OF CHROMATIN STRUCTURE 

Having described how DNA is packaged into nucleosomes to create a chromatin 
fiber, we now turn to the mechanisms that create different chromatin structures 
in different regions of a cell's genome. We now know that mechanisms of this type 
are used to control many genes in eucaryotes. Most importantly, certain types of 
chromatin structure can be inherited; that is, the structure can be directly passed 
down from a cell to its decendents. Because the cell memory that results is based 
on an inherited protein structure rather than on a change in DNA sequence, this 
is a form of epigenetic inheritance. The prefix epi is Greek for "on"; this is appro- 
priate, because epigenetics represents a form of inheritance that is superimposed 
on the genetic inheritance based on DNA (Figure 4-35). 

In Chapter 7, we shall introduce the many different ways in which the 
expression of genes is regulated. There we discuss epigenetic inheritance in 
detail and present several distinct mechanisms that can produce it. Here, we are 
concerned with only one, that based on chromatin structure. We begin this sec- 
tion with an introduction to inherited chromatin structures and then describe 
the basis for them — the covalent modification of histones in nucleosomes. We 
shall see that these modifications serve as recognition sites for protein modules 
that bring specific protein complexes to the appropriate regions of chromatin, 
thereby producing specific effects on gene expression or inducing other biolog- 
ical functions. Through such mechanisms, chromatin structure plays a central 
role in the development, growth, and maintenance of eucaryotic organisms, 
including ourselves. 
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Figure 4-35 A comparison of genetic 
inheritance with an epigenetic 
inheritance based on chromatin 
structures. Genetic inheritance is based 
on the direct inheritance of DNA 
nucleotide sequences during DNA 
replication. DNA sequence changes are 
not only transmitted faithfully from a 
somatic cell to all of its descendents, but 
also through germ cells from one 
generation to the next. The field of 
genetics, reviewed in Chapter 8, is based 
on the inheritance of these changes 
between generations. The type of 
epigenetic inheritance shown here is 
based on other molecules bound to the 
DNA, and it is therefore less permanent 
than a change in DNA sequence; in 
particular, epigenetic information is 
usually (but not always) erased during 
the formation of eggs and sperm. 

Only one epigenetic mechanism, that 
based on an inheritance of chromatin 
structures, is discussed in this chapter. 
Other epigenetic mechanisms are 
presented in Chapter 7, which focuses on 
the control of gene expression (see 
Figure 7-86). 
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Some Early Mysteries Concerning Chromatin Structure 

Thirty years ago, histories were viewed as relatively uninteresting proteins. 
Nucleosomes were known to cover all of the DNA in chromosomes, and they 
were thought to exist to allow the enormous amounts of DNA in many eucary- 
otic cells to be packaged into compact chromosomes. Extrapolating from what 
was known in bacteria, many scientists believed that gene regulation in eucary- 
otes would simply bypass nucleosomes, treating them as uninvolved 
bystanders. 

But there were reasons to challenge this view Thus, for example, bio- 
chemists had determined that mammalian chromatin consists of an approxi- 
mately equal mass of histone and non-histone proteins. This would mean that, 
on average, every 200 nucleotide pairs of DNA in our cells is associated with 
more than 1000 amino acids of non-histone proteins (that is, a mass of protein 
equivalent to the total mass of the histone octamer plus histone HI). We now 
know that many of these proteins bind to nucleosomes, and their abundance 
might suggest that histones are more than just packaging proteins. 

A second reason to challenge the view that histones were inconsequential to 
gene regulation was based on the amazingly slow rate of evolutionary change in 
the sequences of the four core histones. The previously mentioned fact that 
there are only two amino acid differences in the sequence of mammalian and 
pea histone H4 implies that a change in almost any one of the 102 amino acids 
in H4 must be deleterious to these organisms. What type of process could make 
the life of an organism so sensitive to the exact structure of the nucleosome core 
that only two amino acids had changed in more than 500 million years of ran- 
dom variation followed by natural selection? 

Last but not least, a combination of genetics and cytology had revealed that 
a particular form of chromatin silences the genes that it packages without regard 
to nucleotide sequence — and does so in a manner that is directly inherited by 
both daughter cells when a cell divides. It is to this subject that we turn next. 



Heterochromatin Is Highly Organized and Unusually Resistant to 
Gene Expression 

Light-microscope studies in the 1930s distinguished two types of chromatin in 
the interphase nuclei of many higher eucaryotic cells: a highly condensed form, 
called heterochromatin, and all the rest, which is less condensed, called 
euchromatin. Heterochromatin represents an especially compact form of chro- 
matin (see Figure 4-9), and we are finally beginning to understand important 
aspects of its molecular properties. Although present in many locations along 
chromosomes, it is also highly concentrated in specific regions, most notably at 
the centromeres and telomeres introduced previously (see Figure 4-21). In a typ- 
ical mammalian cell, more than ten percent of the genome is packaged in this 
way 

The DNA in heterochromatin contains very few genes, and those euchro- 
matic genes that become packaged into heterochromatin are turned off by this 
type of packaging. However, we know now that the term heterochromatin 
encompasses several distinct types of chromatin structures whose common fea- 
ture is an especially high degree of compaction. Thus, heterochromatin should 
not be thought of as encapsulating "dead" DNA, but rather as creating different 
types of compact chromatin with distinct features that make it highly resistant 
to gene expression for the vast majority of genes. 

When a gene that is normally expressed in euchromatin is experimentally 
relocated into a region of heterochromatin, it ceases to be expressed, and the 
gene is said to be silenced. These differences in gene expression are examples of 
position effects, in which the activity of a gene depends on its position relative 
to a nearby region of heterochromatin on a chromosome. First recognized in 
Drosophila, position effects have now been observed in many eucaryotes, 
including yeasts, plants, and humans. 
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Figure 4-36 The cause of position effect variegation in Drosophilo. (A) Heterochromatin (green) is normally prevented 
from spreading into adjacent regions of euchromatin (red) by special barrier DNA sequences, which we shall discuss shortly. 
In flies that inherit certain chromosomal rearrangements, however, this barrier is no longer present. (B) During the early 
development of such flies, heterochromatin can spread into neighboring chromosomal DNA, proceeding for different 
distances in different cells. This spreading soon stops, but the established pattern of heterochromatin is inherited, so that 
large clones of progeny cells are produced that have the same neighboring genes condensed into heterochromatin and 
thereby inactivated (hence the "variegated" appearance of some of these flies; see Figure 4-37). Although "spreading" is 
used to describe the formation of new heterochromatin close to previously existing heterochromatin, the term may not be 
wholly accurate. There is evidence that during expansion, heterochromatin can "skip over" some regions of chromatin, 
sparing the genes that lie within them from repressive effects. 



The position effects associated with heterochromatin exhibit a feature 
called position effect variegation, which in retrospect provided critical clues con- 
cerning chromatin function. In Drosophila, chromosome breakage events that 
directly connect a region of heterochromatin to a region of euchromatin tend to 
inactivate the nearby euchromatic genes. The zone of inactivation spreads a dif- 
ferent distance in different early cells in the fly embryo, but once the heterchro- 
matic condition is established on a gene, it tends to be stably inherited by all of 
the cell's progeny (Figure 4-36). This remarkable phenomenon was first recog- 
nized through a detailed genetic analysis of the mottled loss of red pigment in 
the fly eye (Figure 4-37), but it shares many features with the extensive spread 
of heterochromatin that inactivates of one of the two X chromosomes in female 
mammals (see p. 473). 

Extensive genetic screens have been carried out in Drosophila, as well as in 
fungi, in a search for gene products that either enhance or suppress the spread 
of heterochromatin and its stable inheritance — that is, for genes that when 
mutated serve as either enhancers or suppressors of position effect variegation. 
In this way, more than 50 genes have been identified that play a critical role in 
these processes. In recent years, the detailed characterization of the proteins 
produced by these genes has revealed that many are nonhistone chromosomal 
proteins that underlie a remarkable mechanism for eucaryotic gene control, one 
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Figure 4-37 The discovery of position 
effects on gene expression. The White 
gene in the fruit fly Drosophila controls 
eye pigment production and is named 
after the mutation that first identified it. 
Wild-type flies with a normal White gene 
(White + ) have normal pigment production, 
which gives them red eyes, but if the White 
gene is mutated and inactivated, the 
mutant flies (White') make no pigment 
and have white eyes. In flies in which a 
normal White + gene has been moved near 
a region of heterochromatin, the eyes are 
mottled, with both red and white patches. 
The white patches represent cell lineages 
in which the White + gene has been 
silenced by the effects of the 
heterochromatin. In contrast, the red 
patches represent cell lineages in which 
the White + gene is expressed. Early in 
development, when the heterochromatin 
is first formed, it spreads into neighboring 
euchromatin to different extents in 
different embryonic cells (see Figure 
4-36). The presence of large patches of red 
and white cells reveals that the state of 
transcriptional activity, as determined by 
the packaging of this gene into chromatin 
in those ancestor cells, is inherited by all 
daughter cells. 
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Figure 4-38 Some prominent types of covalent amino acid side-chain 
modifications found on nucleosomal histones. (A) Three different levels of 
lysine methylation are shown; each can be recognized by a different 
binding protein and thus each can have a different significance for the cell. 
Note that acetylation removes the plus charge on lysine, and that, most 
importantly, an acetylated lysine cannot be methylated, and vice versa. 
(B) Serine phosphorylation adds a negative charge to a histone. 
Modifications not shown here are the mono- or di-methylation of an 
arginine, the phosphorylation of a threonine, the addition of ADP-ribose to 
a glutamic acid, and the addition of a ubiquityl, sumoyl, or biotin group to 
a lysine. 

that requires the precise amino acid sequences of the core histones. This mech- 
anism of gene control therefore helps to explain the remarkably slow change in 
the histones over time. 



(B) SERINE PHOSPHORYLATION 
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The Core Histones Are Covalently Modified at Many Different 
Sites 

The amino acid side chains of the four histones in the nucleosome core are sub- 
jected to a remarkable variety of covalent modifications, including the acetyla- 
tion of lysines, the mono-, di-, and tri-methylation of lysines, and the phospho- 
rylation of serines (Figure 4-38). A large number of these side-chain modifica- 
tions occur on the eight relatively unstructured N-terminal "histone tails" that 
protrude from the nucleosome (Figure 4-39). However, there are also specific 
side-chain modifications on the nucleosome's globular core (Figure 4-40). 

All of the above types of modifications are reversible. The modification of a 
particular amino acid side chain in a nucleosome is created by a specific 
enzyme, with most of these enzymes acting only on one or a few sites. A differ- 
ent enzyme is responsible for removing each side chain modification. Thus, for 
example, acetyl groups are added to specific lysines by a set of different histone 
acetyl transferases (HATs) and removed by a set of histone deacetylase com- 
plexes (HDACs). Likewise, methyl groups are added to lysine side chains by a set 
of different histone methyl transferases and removed by a set of histone 
demethylases. Each enzyme is recruited to specific sites on the chromatin at 
defined times in each cell's life history. For the most part, the initial recruitment 
of these enzymes depends on gene regulatory proteins that bind to specific DNA 
sequences along chromosomes, and these are produced at different times in the 
life of an organism, as described in Chapter 7. But in at least some cases, the 
covalent modifications on nucleosomes can persist long after the gene regula- 
tory proteins that first induced them have disappeared, thereby carrying a mem- 
ory in the cell of its developmental history. Very different patterns of covalent 
modifications are therefore found on different groups of nucleosomes, accord- 
ing to their exact position on a chromosome and the status of the cell. 
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Figure 4-39 The covalent modification of core histone tails. (A) The structure of the nucleosome highlighting the location 
of the first 30 amino acids in each of its eight N-terminal histone tails (green). (B) Well-documented modifications of the 
four histone core proteins are indicated. Although only a single symbol is used for methylation here (M), each lysine (K) or 
arginine (ff) can be methylated in several different ways. Note also that some positions (e.g., lysine 9 of H3) can be modified 
either by methylation or by acetylation, but not both. Most of the modifications shown add a relatively small molecule 
onto the histone tails; the exception is ubiquitin, a 76 amino acid protein also used for other cell processes (see Figure 
6-92). (Adapted from H. Santos-Rosa and C. Caldas, Eur. J. Cancer 41:2381 -2402, 2005. With permission from Elsevier.) 

The modifications of the histones are carefully controlled, and they have 
important consequences. The acetylation of lysines on the N-terminal tails 
tends to loosen chromatin structure, in part because adding an acetyl group to 
lysine removes its positive charge, thereby reducing the affinity of the tails for 
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Figure 4-40 A map of histone modifications 
on the surface of the nucleosome core 

particle. As noted, the histone tails have 
been omitted here (compare with Figure 
4-39). The functions of most of these core 
modifications are not yet known. (Adapted 
from M.S. Cosgrove, J.D. Boeke and 
C. Wolberger, Nat. Struct. Mot. Biol. 
1 1:1037-1043, 2004. With permission from 
Macmillan Publishers Ltd.) 
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adjacent nucleosomes (see Figure 4-33). However, the most profound effect of 
the histone modifications is their ability to attract specific proteins to a stretch 
of chromatin that has been appropriately modified. These new proteins deter- 
mine how and when genes will be expressed, as well as other biological func- 
tions. In this way, the precise structure of a domain of chromatin determines the 
expression of the genes packaged in it, and thereby the structure and function of 
the eucaryotic cell. 



Chromatin Acquires Additional Variety Through the Site-Specific 
Insertion of a Small Set of Histone Variants 

Despite the tight conservation of the amino acid sequences of the four core his- 
tones over hundreds of millions of years, eucaryotes also contain a few variant 
histones that assemble into nucleosomes. These histones are present in much 
smaller amounts than the major histones, and they have been less well con- 
served over long evolutionary times. Except for histone H4, variants exist for 
each of the core histones; some examples are shown in Figure 4-41. 

The major histones are synthesized primarily during the S phase of the cell 
cycle (see Figure 17-4) and assembled into nucleosomes on the daughter DNA 
helices just behind the replication fork (see Figure 5-38). In contrast, most his- 
tone variants are synthesized throughout interphase. They are often inserted 
into already-formed chromatin, which requires a histone-exchange process cat- 
alyzed by the ATP-dependent chromatin remodeling complexes discussed pre- 
viously. These remodeling complexes contain subunits that cause them to bind 
both to specific sites on chromatin and to histone chaperones that carry a par- 
ticular variant. As a result, each histone variant is inserted into chromatin in a 
highly selective manner (see Figure 4-30). 



The Covalent Modifications and the Histone Variants Act in 
Concert to Produce a "Histone Code'That Helps to Determine 
Biological Function 

The number of possible distinct markings on an individual nucleosome is enor- 
mous. Even with the recognition that some of the covalent modifications are 
mutually exclusive (for example, it is not possible for a lysine to be both acetylated 
and methylated at the same time), and that other modifications are created 
together as a set, it is clear that thousands of combinations can exist. In addition, 
there is the further diversity created by nucleosomes that contain histone variants. 
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Figure 4-41 The structure of some 
histone variants compared with the 
major histone that they replace. These 
histones are inserted into nucleosomes at 
specific sites on chromosomes by 
ATP-dependent chromatin remodeling 
enzymes that act in concert with histone 
chaperones (see Figure 4-30). The 
CENP-A variant of histone H3 is discussed 
later in this chapter (see Figures 4-48 to 
4-51 ); other variants are discussed in 
Chapter 7. The sequences that are colored 
differently in each variant are different 
from the corresponding sequence of the 
major histone. (Adapted from K. Sarma 
and D. Rein berg, Nat. Rev. Mol. Cell. Biol. 
6:139-149, 2005. With permission from 
Macmillan Publishers Ltd.) 
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Many of the combinations appear to have a specific meaning for the cell 
because they determine how and when the DNA packaged in the nucleosomes 
is accessed, leading to the histone code hypothesis. For example, one type of 
marking signals that a stretch of chromatin has been newly replicated, another 
signals that the DNA in that chromatin has been damaged and needs repair, 
while many others signal when and how gene expression should take place. 
Small protein modules bind to specific marks, recognizing for example a tri- 
methylated lysine 4 on histone H3 (Figure 4-42). These modules are thought to 
act in concert with other modules as part of a code-reader complex, so as to 
allow particular combinations of markings on chromatin to attract additional 
protein complexes that execute an appropriate biological function at the right 
time (Figure 4-43). 
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Figure 4-42 How each mark on a 
nucleosome is read. The structure of a 
protein module that specifically 
recognizes histone H3 trimethylated on 
lysine 4 is shown. (A) Space-filling model 
of an ING PHD domain bound to a 
histone tail {green, with the trimethyl 
group highlighted in yellow). (B) A ribbon 
model showing how the N-terminal six 
amino acids in the H3 tail are recognized. 
The dashed lines represent hydrogen 
bonds. This is one of many PHD domains 
that recognize methylated lysines on 
histones; different domains bind tightly 
to lysines located at different positions, 
and they can discriminate between a 
mono-, di-, and tri-methylated lysine. In a 
similar way, other small protein modules 
recognize specific histone side chains 
that have been marked with acetyl 
groups, phosphate groups, and so on. 
(Adapted from P.V. Pena et al., Nature 
442:1 00-1 03, 2006. With permission from 
Macmillan Publishers Ltd.) 



Figure 4-43 Schematic diagram showing 
how the histone code could be read by a 
code-reader complex. A large protein 
complex that contains a series of protein 
modules, each of which recognizes a 
specific histone mark, is schematically 
illustrated (green). This "code-reader 
complex" will bind tightly only to a region 
of chromatin that contains several of the 
different histone marks that it recognizes. 
Therefore, only a specific combination of 
marks will cause the complex to bind to 
chromatin and attract additional protein 
complexes (purple) that catalyze a 
biological function. 
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Figure 4-44 Some specific meanings of 
the histone code. (A) The modifications 
on the histone H3 N-terminal tail are 
shown, repeated from Figure 4-39. 
(B) The H3 tail can be marked by different 
combinations of modifications that 
convey a specific meaning to the stretch 
of chromatin where this combination 
occurs. Only a few of the meanings are 
known, including the four examples 
shown. To focus on just one example, the 
trimethylation of lysine 9 attracts the 
heterochromatin-specific protein HP1, 
which induces a spreading wave of 
further lysine 9 trimethylation followed 
by further HP1 binding, according to the 
general scheme that will be illustrated 
shortly (see Figure 4-46), Not shown is 
the fact that, as just implied (see Figure 
4-43), reading the histone code generally 
involves the joint recognition of marks at 
other sites on the nucleosome along with 
the indicated H3 tail recognition. In 
addition, specific levels of methylation 
(mono-, di-, or tri-methyl groups) are 
required, as in Figure 4-42. 



The marks on nucleosomes due to covalent additions to histones are 
dynamic, being constantly removed and added at rates that depend on their 
chromosomal locations. Because the histone tails extend outward from the 
nucleosome core and are likely to be accessible even when chromatin is con- 
densed, they would seem to provide an especially suitable format for creating 
marks in a form that can be readily altered as a cell's needs change. Although 
much remains to be learned about the meaning of the many different histone 
code combinations, a few well-studied examples of the information that can be 
encoded in the histone H3 tail are listed in Figure 4-44. 



A Complex of Code-reader and Code-writer Proteins Can Spread 
Specific Chromatin Modifications for Long Distances Along a 
Chromosome 

The phenomenon of position effect variegation described previously requires 
that at least some modified forms of chromatin have the ability to spread for 
substantial distances along a chromosomal DNA molecule (see Figure 4-36). 
How is this possible? 

The enzymes that modify (or remove modifications from) the histones in 
nucleosomes are part of multisubunit complexes. They can initially be brought 
to a particular region of chromatin by one of the sequence-specific DNA-bind- 
ing proteins (gene regulatory proteins) discussed in Chapters 6 and 7 (for a spe- 
cific example, see Figure 7-87). But after a modifying enzyme "writes" its mark 
on one or a few neighboring nucleosomes, events that resemble a chain reaction 
can ensue. In this case, the "code-writer" enzyme works in concert with a code- 
reader protein located in the same protein complex. This second protein con- 
tains a code-reader module that recognizes the mark and binds tightly to the 
newly modified nucleosome (see Figure 4-42), positioning its attached writer 
enzyme near an adjacent nucleosome. Through many such read-write cycles, 
the reader protein can carry the writer enzyme along the DNA— spreading the 
mark in a hand-over-hand manner along the chromosome (Figure 4-45). 

In reality, the process is more complicated than the scheme just described. 
Both readers and writers are part of a protein complex that is likely to contain 



THE REGULATION OF CHROMATIN STRUCTURE 



gene regulatory protein 



227 




hi stone modifying * 
enzyme ("writer") 




code-reader protein 




histone modification (mark) 

multiple readers and writers, and to require multiple marks on the nucleosome 
to spread. Moreover, many of these reader-writer complexes also contain an 
ATP-dependent chromatin remodeling protein, and the reader, writer, and 
remodeling proteins work in concert to either decondense or condense long 
stretches of chromatin as the reader moves progressively along the nucleosome- 
packaged DNA (Figure 4-46). 

Some idea of the complexity of the processes just described can be derived 
from the results of genetic screens for mutant genes that either enhance or sup- 
press the spreading and stability of heterochromatin in tests for position effect 
variegation in Drosophila (see Figure 4-37). As pointed out previously, more 
than 50 such genes are known, and most of them are likely to function as sub- 
units in one or more reader-writer-remodeling protein complexes. 



Figure 4-45 How the recruitment of a 
code-reader-writer complex can spread 
chromatin changes along a 
chromosome. The code-writer is an 
enzyme that creates a specific 
modification on one or more of the four 
nucleosomal histones. After its 
recruitment to a specific site on a 
chromosome by a gene regulatory 
protein, the writer collaborates with a 
code-reader protein to spread its mark 
from nucleosome to nucleosome by 
means of the indicated reader-writer 
complex. For this mechanism to work, 
the reader must recognize the same 
histone modification mark that the writer 
produces (see also Figure 4-43). 



Barrier DNA Sequences Block the Spread of Reader-Writer 
Complexes and thereby Separate Neighboring Chromatin Domains 

The above mechanism for spreading chromatin structures raises a potential 
problem. Inasmuch as each chromosome consists of one continuous, very long 
DNA molecule, what prevents a cacophony of confusing cross-talk between 
adjacent chromatin domains of different structure and function? Early studies of 
position effect variegation had suggested an answer: the existence of specific 
DNA sequences that separate one chromatin domain from another (see Figure 
4-37). Several such barrier sequences have now been identified and character- 
ized through the use of genetic engineering techniques that allow specific 
regions of DNA sequence to be deleted or added to chromosomes. 

For example, a sequence called HS4 normally separates the active chro- 
matin domain that contains the p-globin locus from an adjacent region of 
silenced, condensed chromatin in erythrocytes (see Figure 7-61). If this 
sequence is deleted, the P-globin locus is invaded by condensed chromatin. This 
chromatin silences the genes it covers, and it spreads to a different extent in dif- 
ferent cells, causing a pattern of position effect variegation similar to that 
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observed in Drosophila. As described in Chapter 7, this invasion has dire conse- 
quences: the globin genes are poorly expressed, and individuals who carry such 
a deletion have a severe form of anemia. 

The HS4 sequence is often added to both ends of a gene that is experiment 
tally inserted into a mammalian genome, in order to protect that gene from the 
silencing caused by spreading heterochrony tin, Analysis of this barrier sequence 
reveals that it contains a cluster of binding sites for histone acetylase enzymes. 
Since the acetylation of a lysine side chain is incompatible with the metiiylation 
of the same side chain, histone aeetylases and histone deacetylases are logical 
candidates for die formation of barriers on the DNA that block the spread of dif- 
ferent forms of chromatin (Figure 4-47). However, several other types of chro- 
matin modifications are known that can also protect genes from silencing. 




10 nm 



Figure 4-46 How a complex containing 
reader-writer and ATP-dependent 
chromatin remodeling proteins can 
spread chromatin changes along a 
chromosome. (A) A spreading wave of 
chromatin condensation. This mechanism 
is identical to that in Figure 4-45, except 
that the reader-writer complex 
collaborates with an ATP-dependent 
chromatin remodeling protein (see Figure 
4-29) to reposition nucleosomes and 
pack them into highly condensed arrays. 
This is a highly simplified view of the 
mechanism known to be able to spread a 
major form of heterochromatin for long 
distances along chromosomes (see 
Figure 4-36). The heterochromatin- 
specific protein HP! plays a major role in 
that process. HP1 binds to trimethyl 
lysine 9 on histone H3, and it remains 
associated with the condensed 
chromatin as one of the readers in a 
reader-writer-remodeling complex that, 
while incompletely understood, is 
considerably more intricate than that 
shown here. (B) The actual structure of a 
chromatin reader-remodeling complex, 
showing how it is thought to interact 
with a nucleosome. Modeled in gray is 
the yeast RSC complex, which contains 
15 subunits — including an ATP- 
dependent chromatin remodeling 
protein and at least 4 subunits with code- 
reader domains. (B, from A.E. Leschziner 
et al., Proc Natl Acad. 5c/. USA. 
1 04:491 3-491 8, 2007. With permission 
from National Academy of Sciences.) 



The Chromatin in Centromeres Reveals How Histone Variants Can 
Create Special Structures 

The presence of nucleosomes carrying histone variants is thought to produce 
marks in chromatin that are unusually long lasting. Consider, for example, the 
formation and inheritance of the chromatin that forms on centromeres, the 
DNA region of each chromosome required for the orderly segregation of the 
chromosomes into daughter cells each time a cell divides (see Figure 4-21). In 
many complex organisms, including humans, each centromere is embedded in 
a stretch of special centric heterochromatin that persists throughout interphase, 
even though the centromere-mediated movement of DNA occurs only during 
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mitosis. This chromatin contains a centromere-specific variant H3 histone, 
known as CENP-A (see Figure 4-41), plus additional proteins that pack the 
nucleosomes into particularly dense arrangements and form the kinetechore, 
the special structure required for attachment of the mitotic spindle. 

A specific DNA sequence of approximately 125 nucleotide pairs is sufficient 
to serve as a centromere in the yeast S. cerevisiae. Despite its small size, more 
than a dozen different proteins assemble on this DNA sequence; the proteins 
include the CENP-A histone H3 variant, which, along with the three other core 
histones, forms a centromere-specific nucleosome. The additional proteins at 
the yeast centromere attach this nucleosome to a single microtubule from the 
yeast mitotic spindle (Figure 4-48). 

The centromeres in more complex organisms are considerably larger than 
those in budding yeasts. For example, fly and human centromeres extend over 
hundreds of thousands of nucleotide pairs and do not seem to contain a cen- 
tromere-specific DNA sequence. These centromeres largely consist of short, 
repeated DNA sequences, known as alpha satellite DNA in humans. But the 
same repeat sequences are also found at other (non-centromeric) positions on 



Figure 4-47 Some mechanisms of barrier 
action. These models are derived from 
different analyses of barrier action, and a 
combination of several of them may 
function at any one site. (A) The tethering 
of a region of chromatin to a large fixed 
site, such as the nuclear pore complex 
illustrated here, can form a barrier that 
stops the spread of heterochromatin. 
(B) The tight binding of barrier proteins to 
a group of nucleosomes can compete with 
heterochromatin spreading. (C) By 
recruiting a group of highly active histone- 
modifying enzymes, barriers can erase the 
histone marks that are required for 
heterochromatin to spread. For example, a 
potent acetylation of lysine 9 on histone 
H3 will compete with lysine 9 methylation, 
thereby preventing the HP1 protein 
binding needed to form some forms of 
heterochromatin (see Figure 4-46). (Based 
on A.G. West and P. Fraser, Hum. Mol. Genet 
14:R101-R11 1,2005. With permission from 
Oxford University Press.) 
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Figure 4-48 A model for the structure of 
a simple centromere. In the yeast 
Saccharomyces cerevisiae, a special 
centromeric DNA sequence assembles a 
single nucleosome in which two copies of 
an H3 variant histone (called CENP-A in 
most organisms) replaces the normal H3. 
Peptide sequences unique to this variant 
histone (see Figure 4-41) then help to 
assemble additional proteins, some of 
which form a kinetochore. This 
kinetochore is unusual in capturing only a 
single microtubule; humans have much 
larger centromeres and form kinetochores 
that can capture 20 or more microtubules 
(see Figure 4-50). The kinetochore is 
discussed in detail in Chapter 17. 
(Adapted from A. Joglekar et al., Nat. Ceil 
Biol 8:381-383, 2006. With permission 
from Macmillan Publishers Ltd.) 
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Figure 4-49 Evidence for the plasticity of human centromere formation. (A) A series of A-T-rich alpha satellite DNA 
sequences are repeated many thousands of times at each human centromere (red), surrounded by pericentric 
heterochromatin (brown). However, due to an ancient chromosome breakage and rejoining event, some human 
chromosomes contain two blocks of alpha satellite DNA, each of which presumably functioned as a centromere in its 
original chromosome. Usually, these dicentric chromosomes are not stably propagated because they attach improperly to 
the spindle and are broken apart during mitosis. In chromosomes that do survive, however, one of the centromeres has 
somehow inactivated, even though it contains all the necessary DNA sequences. This allows the chromosome to be stably 
propagated. (B) In a small fraction (1/2000) of human births, extra chromosomes are observed in cells of the offspring. 
Some of these extra chromosomes, which have formed from a breakage event, lack alpha satellite DNA altogether, yet new 
centromeres (neocentromeres) have arisen from what was originally euchromatic DNA. 

chromosomes, indicating that they are not sufficient to direct centromere for- 
mation. Most strikingly, in some unusual cases, new human centromeres (called 
neocentromeres) have been observed to form spontaneously on fragmented 
chromosomes. Some of these new positions were originally euchromatic and 
lack alpha satellite DNA altogether (Figure 4-49). 

It therefore seems that centromeres in complex organisms are defined by an 
assembly of proteins, instead of by a specific DNA sequence. When antibodies 
that stain specific modified nucleosomes are used to examine the stretched 
chromosome fibers from centromeres, one observes striking alternation of two 
modified forms of chromatin (Figure 4-50). It appears that this arrangement 
allows the centric heterochromatin to fold so as to position the CENP-A-con- 
taining nucleosomes on the outside of the mitotic chromosome, where they 
bind the set of proteins that form the kinetechore plates. These plates in turn 
capture a group of microtubules from the mitotic spindle in order to partition 
the chromosomes accurately, as described in Chapter 17. 

Chromatin Structures Can Be Directly Inherited 

To explain the above observations, it has been proposed that de novo cen- 
tromere formation requires an initial seeding event, involving the formation of a 
specialized DNA-protein structure that contains nucleosomes formed with the 
CENP-A variant of histone H3. In humans, this seeding event happens more 
readily on arrays of alpha satellite DNA than on other DNA sequences. The 
H3-H4 tetramers from each nucleosome on the parental DNA helix are directly 
inherited by the daughter DNA helices at a replication fork (see Figure 5-38). 
Therefore, once a set of CENP-A-containing nucleosomes has been assembled 
on a stretch of DNA, it is easy to understand how a new centromere could be 
generated in the same place on both daughter chromosomes following each 
round of cell division (Figure 4-51). 

The plasticity of centromeres may provide an important evolutionary advan- 
tage. We have seen that chromosomes evolve in part by breakage and rejoining 
events (see Figure 4-18). Many of these events produce chromosomes with two 
centromeres, or chromosome fragments with no centromeres at all. Although 
rare, both the inactivation of centromeres and their ability to be activated de novo 
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may occasionally allow newly formed chromosomes to be maintained stably 
thereby facilitating the process of chromosome evolution. 

There are some striking similarities between the formation and mainte- 
nance of centromeres and the formation and maintenance of other regions of 
heterochromatin. In particular, the entire centromere forms as an all-or-none 
entity, suggesting a highly cooperative addition of proteins after a seeding event. 
Moreover, once formed, the structure seems to be directly inherited on the DNA 
as part of each round of chromosome replication. 



Chromatin Structures Add Unique Features to Eucaryotic 
Chromosome Function 

Although a great deal remains to be learned about the functions of different 
chromatin structures, the packaging of DNA into nucleosomes was probably 
crucial for the evolution of eucaryotes like ourselves. Complex multicellular 
organisms would appear to be possible only if the cells in different lineages can 
specialize by changing the accessibility and responsiveness of many hundreds of 
genes to genetic readout. As described in Chapter 22, each cell has a stored 
memory of its past developmental history in the regulatory circuits that control 
its many genes. 

Although bacteria also require cell memory mechanisms, the complexity of 
the memory circuits required by higher eucaryotes is unprecedented. The pack- 
aging of selected regions of eucaryotic genomes into different forms of chro- 
matin makes possible a type of cell memory mechanism that is not available to 
bacteria. The crucial feature of this uniquely eucaryotic form of gene regulation 
is the storage of the memory of the state of a gene on a gene-by-gene basis— in 
the form of local chromatin structures that can persist for various lengths of 
time. At one extreme are structures like centric heterochromatin that, once 
established, are stably inherited from one cell generation to the next (see Figure 
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Figure 4-50 The organization and 
function of the chromatin that forms 
human centromeres. (A) By staining 
stretched chromatin fibers with 
fluorescently labeled antibodies, the alpha 
satellite DNA that forms centric 
heterochromatin in humans is seen to be 
packaged into alternating blocks of 
chromatin. One block is formed from a 
long string of nucleosomes containing the 
CENP-A H3 variant histone (green); the 
other block contains nucleosomes that are 
specially marked with a dimethyl lysine 4 
(red). Each block is more than a thousand 
nucleosomes long. (B) A model for the 
organization of the two types of centric 
heterochromatin. As in yeast, the 
nucleosomes that contain the H3 variant 
histone form the kinetochore. (C) The 
arrangement of the centric and pericentric 
heterochromatin on a human metaphase 
chromosome, as determined by 
fluorescence microscopy using the same 
antibodies as in (A). (Adapted from 
B.A. Sullivan and G.H. Karpen, Nat Struct 
MoL Biol. 1 1:1076-1 083, 2004. With 
permission from Macmillan Publishers Ltd.) 
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Figure 4-5 1 A model for the direct inheritance of centromeric heterochromatin. (A) The normal assembly 
of chromatin on the two daughter DNA helices produced at a replication fork requires the deposition of 
H2A-H2B dimers onto directly inherited H3-H4 tetramers, as well as the assembly of new histone octamers 
(see Figure 5-38 for details). (B) At a centromere, the inheritance of H3 variant-H4 tetramers seeds the 
formation of new histone octamers that likewise contain the variant H3 histone. A similar seeding process 
could cause the adjacent blocks of centric heterochromatin (containing H3 modified at dimethyl lysine 4; 
see Figure 4-50) to be inherited. Although the details are not known, the seeding process is likely to 
involve other centromeric proteins that are inherited along with the nucleosomes (see Figure 4-52). 



4-51). Closely related mechanisms that are likewise based on the direct inheri- 
tance of parental forms of chromatin by the daughter DNA helices behind the 
replication fork are thought to be responsible for other types of condensed 
chromatin (Figure 4-52). For example, the permanently silenced, classical type 
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Figure 4-52 How the packaging of DNA 
in chromatin can be inherited during 
chromosome replication. In this model, 
some of the specialized chromatin 
components are distributed to each 
daughter chromosome after DNA 
duplication, along with the specially 
marked nucleosomes that they bind. 
After DNA replication, the inherited 
nucleosomes that are specially modified, 
acting in concert with the inherited 
chromatin components, change the 
pattern of histone modification on the 
newly formed daughter nucleosomes 
nearby. This creates new binding sites for 
the same chromatin components, which 
then assemble to complete the structure. 
The latter process is likely to involve code 
reader-writer-remodeling complexes 
operating in a manner similar to that 
previously illustrated in Figure 4-46. 
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of heterochromatin contains the HP1 protein, whereas the condensed chro- 
matin that coats important developmental regulatory genes is maintained by 
the polycomb group of proteins. The latter type of heterochromatin silences a 
large number of genes that encode gene regulatory proteins early in embryonic 
development, covering a total of about 2 percent of the human genome, and it 
is removed only when each individual gene is needed by the developing organ- 
ism (discussed in Chapter 22). Although other types of inherited chromatin 
structures exist, it is not yet clear how many different types there are: the num- 
ber could certainly exceed 10 (see p. 238). The fundamental importance of this 
mechanism for distinguishing different genes is schematically represented in 
(Figure 4-53). 

Other forms of chromatin can have a shorter lifetime, much less than the 
division time of the cell; however, many have a built-in persistence that helps to 
mediate biological function. 



Summary 

Despite the uniform assembly of chromosomal DNA into nucleosomes, a large variety 
of different chromatin structures are possible in eucaryotic organisms. This variety is 
based on a large set of reversible covalent modifications of the four histones in the 
nucleosome core. These modifications include the mono-, di-, and tri-methylation of 
many different lysine side chains, an important reaction that is incompatible with the 
acetylation of the same lysines. Specific combinations of the modifications mark each 
nucleosome with a histone code. The histone code is read when protein modules that 
are part of a larger protein complex bind to the modified nucleosomes in a region of 
chromatin. These code-reader proteins then attract additional proteins that catalyze 
biologically relevant functions. 

Some code-reader protein complexes contain a histone-modifying enzyme, such 
as a histone methylase, that "writes" the same mark that the code-reader recognizes. A 
reader-writer-remodeling complex of this type can spread a specific form of chromatin 
for long distances along a chromosome. In particular, large regions of condensed hete- 
rochromatin are thought to be formed in this way. Heterochromatin is commonly 
found around centromeres and near telomeres, but it is also present at many other 
positions in chromosomes. The tight packaging of DNA into heterochromatin usually 
silences the genes within it. 

The phenomenon of position effect variegation provides good evidence for the 
direct inheritance of condensed forms of chromatin by the daughter DNA helices 
formed at a replication fork, and a similar mechanism appears to be responsible for 
maintaining the specialized chromatin at centromeres. More generally, the ability to 
transmit specific chromatin structures from one cell generation to the next provides 
the basis for an epigenetic cell memory process that is likely to be critical for main- 
taining the complex set of different cell states required by complex multicellular 
organisms. 
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Figure 4-53 Schematic itiustration of 
cell memory stored as chromatin-based 
epigenetic information in the genes of 
eucaryotes. Genes in eucaryotic cells can 
be packaged into a large variety of 
different chromatin structures, indicated 
here by different colors. At least some of 
these chromatin structures have a special 
effect on gene expression that can be 
directly inherited as epigenetic 
information when a cell divides. This 
allows some of the gene regulatory 
proteins that create different gene states 
to act only once, inasmuch as the state 
can be remembered after the regulatory 
protein is gone. Epigenetic information 
can also be stored in networks of 
signaling molecules that control gene 
expression (see Figure 7-86). 



THE GLOBAL STRUCTURE OF CHROMOSOMES 

Having discussed the DNA and protein molecules from which the 30-nm chro- 
matin fiber is made, we now turn to the organization of the chromosome on a 
more global scale. As a 30-nm fiber, the typical human chromosome would still 
be 0.1 cm in length and able to span the nucleus more than 100 times. Clearly, 
there must be a still higher level of folding, even in interphase chromosomes. 
Although its molecular basis is still largely a mystery, this higher-order packag- 
ing almost certainly involves the folding of the 30-nm fiber into a series of loops 
and coils. This chromatin packing is fluid, frequently changing in response to 
the needs of the cell. 

We shall begin by describing some unusual interphase chromosomes that 
can be easily visualized, inasmuch as certain features of these exceptional cases 
are thought to be representative of all interphase chromosomes. Moreover, they 
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provide a unique means for investigating some fundamental aspects of chro- 
matin structure raised in the previous section. Next we describe how a typical 
interphase chromosome is arranged in the cell nucleus, focusing on human 
cells. Finally, we conclude by discussing the additional tenfold compaction that 
interphase chromosomes undergo during the process of mitosis. 



Figure 4-54 Lampbrush chromosomes. 

(A) A light micrograph of lampbrush 
chromosomes in an amphibian oocyte. 
Early in oocyte differentiation, each 
chromosome replicates to begin meiosis, 
and the homologous replicated 
chromosomes pair to form this highly 
extended structure containing a total of 
four replicated DNA molecules, or 
chromatids. The lampbrush chromosome 
stage persists for months or years, while 
the oocyte builds up a supply of 
materials required for its ultimate 
development into a new individual. 

(B) An enlarged region of a similar 
chromosome, stained with a fluorescent 
reagent that makes the loops active in 
RNA synthesis clearly visible. (Courtesy of 
Joseph G.Gail.) 



Chromosomes Are Folded into Large Loops of Chromatin 

Insight into the structure of the chromosomes in interphase cells has been 
obtained from studies of the stiff and extended meiotically paired chromosomes 
in growing amphibian oocytes (immature eggs). These very unusual lampbrush 
chromosomes (the largest chromosomes known) are clearly visible even in the 
light microscope, where they are seen to be organized into a series of large chro- 
matin loops emanating from a linear chromosomal axis (Figure 4-54). 

The organization of a lampbrush chromosome is illustrated in Figure 4-55. 
A given loop always contains the same DNA sequence, and it remains extended 
in the same manner as the oocyte grows. These chromosomes are producing 
large amounts of RNA for the oocyte, and most of the genes present in the DNA 
loops are being actively expressed. The majority of the DNA, however, is not in 
loops but remains highly condensed in the chromomeres on the axis, where 
genes are generally not expressed. 

It is thought that the interphase chromosomes of all eucaryotes are similarly 
arranged in loops. Although these loops are normally too small and fragile to be 
easily observed in a light microscope, other methods can be used to infer their 
presence. For example, it has become possible to assess the frequency with 
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Figure 4-55 A model for the structure of 
a lampbrush chromosome. The set of 

lampbrush chromosomes in many 
amphibians contains a total of about 
10,000 chromatin loops, although most of 
the DNA in each chromosome remains 
highly condensed in the chromomeres. 
Each loop corresponds to a particular DNA 
sequence. Four copies of each loop are 
present in each cell, since each of the two 
major units shown at the top consists of 
two closely apposed, newly replicated 
chromosomes. This four-stranded 
structure is characteristic of this stage of 
development of the oocyte, the diplotene 
stage of meiosis; see Figure 21-9. 



which any two loci along an interphase chromosome are paired with each 
other, thus revealing likely candidates for the sites on chromatin that form the 
closely apposed bases of loop structures (Figure 4-56). These experiments and 
others suggest that the DNA in human chromosomes is organized into loops of 
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Figure 4-56 A method for determining the position of loops in interphase chromosomes. In this technique, known as 
the chromosome conformation capture (3C) method, cells are treated with formaldehyde to create the indicated covalent 
DNA-protein and DNA-DNA cross-links. The DNA is then treated with a restriction nuclease that chops the DNA into many 
pieces, cutting at strictly defined nucleotide sequences and forming sets of identical "cohesive ends" (see Figure 8-34). The 
cohesive ends can be made to join through their complementary base-pairing. Importantly, prior to the ligation step 
shown, the DNA is diluted so that the fragments that have been kept in close proximity to each other (through cross- 
linking) are the ones most likely to join. Finally, the cross-links are reversed and the newly ligated fragments of DNA are 
identified and quantified by PCR (the polymerase chain reaction, described in Chapter 8). By combining the frequency-of- 
association information generated by the 3C technique with DNA sequence information, structural models can be 
produced for the interphase conformation of chromosomes. 
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Figure 4-57 A model for the organization of an interphase chromosome. A section of an interphase chromosome is 
shown folded into a series of looped domains, each containing perhaps 50,000-200,000 nucleotide pairs of double-helical 
DNA condensed into a 30-nm fiber. The chromatin in each individual loop is further condensed through poorly understood 
folding processes that are reversed when the cell requires direct access to the DNA packaged in the loop. Neither the 
composition of the postulated chromosomal axis nor how the folded 30-nm fiber is anchored to it is clear. However, in 
mitotic chromosomes the bases of the chromosomal loops are enriched both in condensins and in DNA topoisomerase II 
enzymes, two proteins that may form much of the axis at metaphase (see Figure 4-74). 



different lengths. A typical loop might contain between 50,000 and 200,000 
nucleotide pairs of DNA, although loops of a million nucleotide pairs have also 
been suggested (Figure 4-57). 



Polytene Chromosomes Are Uniquely Useful for Visualizing 
Chromatin Structures 

Certain giant insect cells have grown to their enormous size through multiple 
cycles of DNA synthesis without cell division. Such cells with more than the nor- 
mal DNA complement are said to be polyploid when they contain increased 
numbers of standard chromosomes. But tn several types of secretory cells in fly 
larvae, all the homologous chromosome copies are held side by side, like drink- 
ing straws in a box, creating single large polytene chromosomes. Because poly- 
tene chromosomes can disperse to form a conventional polyploid cell in some 
cases, these two chromosomal states are closely related- The underlying struc- 
ture of a polytene chromosome must therefore be similar to that of a normal 
chromosome. 

Polyteny has been most studied in the larval salivary gland cells of the fruit 
fly Drosophila. When these polytene chromosomes are viewed in the light 
microscope, distinct alternating dark bands and light imerlmnds are visible (Fig- 
ure 4-58), each formed from a thousand identical DNA sequences arranged 
side- by-side in register. About 95% of the DNA in polytene chromosomes is in 
bands, and 5% is in interbands. A very thin band can contain 3000 nucleotide 
pairs, while a thick band may contain 200,000 nucleotide pairs in each of its 
chromatin strands. The chromatin in each band appears dark because the DNA 
is more condensed than the DNA in interbands; it may also contain a higher pro- 
portion of proteins (Figure 4-59). 

There are approximately 3700 bands and 3700 interbands in the complete 
set of Drosophila polytene chromosomes. The bands can be recognized by their 
different thicknesses and spacings, and each one has been given a number to 
generate a chromosome "map" that has been indexed to the finished genome 
sequence of this fly. 

The Drosophila polytene chromosomes provide a good starting point for 
examining how chromatin is organized on a large scale. In the previous section, 
we saw that there are many forms of chromatin, each of which contains micro- 
somes with a different combination of modified histones. By reading this his- 
tone code, specific sets of non-histone proteins assemble on the nucleosomes to 
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Figure 4-58 The entire set of polytene 
chromosomes in one Drosophila 
salivary cell. In this drawing of a light 
micrograph, the giant chromosomes 
have been spread out for viewing by 
squashing them against a microscope 
slide. Drosophila has four chromosomes, 
and there are four different chromosome 
pairs present. But each chromosome is 
tightly paired with its homolog (so that 
each pair appears as a single structure), 
which is not true in most nuclei (except 
in meiosis). Each chromosome has 
undergone multiple rounds of 
replication, and the homologues and all 
their duplicates have remained in exact 
register with each other, resulting in 
huge chromatin cables many DNA 
strands thick. 

The four polytene chromosomes are 
normally linked together by 
heterochromatic regions near their 
centromeres that aggregate to create a 
single large chromocenter (pink region). 
In this preparation, however, the 
chromocenter has been split into two 
halves by the squashing procedure used. 
(Adapted from T.S. Painter, J. Hered. 
25:465-476, 1934. With permission from 
Oxford University Press.) 
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Figure 4-59 Micrographs of polytene 
chromosomes from Drosophila salivary 
glands. (A) Light micrograph of a portion 
of a chromosome. The DNA has been 
stained with a fluorescent dye, but a 
reverse image is presented here that 
renders the DNA black rather than white; 
the bands are clearly seen to be regions 
of increased DNA concentration. This 
chromosome has been processed by a 
high pressure treatment so as to show its 
distinct pattern of bands and interbands 
more clearly. (B) An electron micrograph 
of a small section of a Drosophila 
polytene chromosome seen in thin 
section. Bands of very different thickness 
can be readily distinguished, separated 
by interbands, which contain less 
condensed chromatin. (A, adapted from 
D.V. Novikov, I. Kireev and A.S. Belmont, 
Nat Methods 4:483-485, 2007. With 
permission from Macmillan Publishers 
Ltd. B, courtesy of Veikko Sorsa.) 
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affect biological function in different ways. Some of these non-histone proteins 
can spread for long distances along the DNA, imparting a similar chromatin 
structure to contiguous regions of the genome (see Figure 4-46). Thus, in some 
regions, all of the chromatin has a similar structure and is separated from neigh- 
boring domains by barrier proteins (see Figure 4-47). At low resolution, the 
interphase chromosome can therefore be considered as a mosaic of chromatin 
structures, each containing particular nucleosome modifications associated 
with a particular set of non-histone proteins. (At a higher level of resolution one 
would also emphasize the many sequence-specific DNA-binding proteins that 
will be described in Chapter 7). 

This view of an interphase chromosome helps us to interpret the results 
obtained from studies of polytene chromosomes. By staining with highly spe- 
cific antibodies, one can show that differently modified histones (Figure 4-60), 
as well as distinct sets of non-histone proteins, are located on different polytene 
chromosome bands. This suggests a powerful general strategy. By employing 
combinations of antibodies that bind tightly to each of the many different his- 
tone modifications that create the histone code (see Figure 4-39), it may be pos- 
sible to determine which combinations of modifications specify particular types 
of chromatin domains. And by carrying out similar experiments with antibodies 
that recognize each of the hundreds of different non-histone proteins in chro- 
matin, one can attempt to decipher the many different meanings encoded in 
histone modifications. 



Figure 4-60 The pattern of histone 
modifications on Drosophila polytene 
chromosomes. Antibodies that 
specifically recognize different histone 
modifications can reveal where each 
modification is found with reference to 
the many bands and interbands on these 
chromosomes. In the two preparations 
shown here, the positions of two 
different markings on histone H3 tails are 
compared. In both cases, the antibody 
labeling the modified histone is green, 
and the DNA is stained red. Only a small 
region surrounding each chromocenter is 
shown. (A) Dimethyl Lys 9 (green) is a 
histone modification associated with the 
pericentric heterochromatin. It is seen to 
be associated with the chromocenter. 
(B) Acetylated Lys 9 (green) is a 
modification that is concentrated in 
histones associated with active genes. It 
is seen to be present in numerous bands 
on the chromosome arms, but not in the 
heterochromatic chromocenter. Similar 
experiments can be carried out to 
position many other modified histones, 
as well as the non-histone proteins (see, 
for example, Figure 22-45 for 
chromosomes stained for Polycomb). 
(Adapted from A. Ebert, S. Lein, 
G. Schotta and G. Reuter, Chromosome 
Res. 14:377-392, 2006. With permission 
from Springer.) 
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There Are Multiple Forms of Heterochromatin 

Molecular studies have led to a reevaluation of our view of heterochromatin. For 
many decades, heterochromatin was thought to be a single entity defined by its 
highly condensed structure and its ability to silence genes permanently. But if 
we define heterochromatin as a form of compact chromatin that can silence 
genes, be epigenetically inherited, and spread along chromosomes to cause 
position effect variegation (see Figure 4-36), it is clear that different types of het- 
erochromatin exist. In fact, we have already considered three of these types in 
discussing the human centromere (see Figure 4-50). 

Each domain of heterochromatin is thought to be formed by the cooperative 
assembly of a set of non-histone proteins. For example, classical pericentromeric 
heterochromatin contains more than six such proteins, including heterochro- 
matin protein 1 (HP1), whereas the so-called Polycomb form of heterochromatin 
contains a similar number of proteins in a non- overlapping set (PcG proteins). 
There are hundreds of small blocks of heterochromatin spread across the arms 
of Drosophila polytene chromosomes, as identified by their late replication (dis- 
cussed in Chapter 5). Antibody staining of these regions of heterochromatin sug- 
gests that the known forms of heterochromatin can account for no more than 
half of the heterchromatic polytene bands (Figure 4-61). Thus, other types of 
heterochromatin must exist whose protein composition is not known. It is likely 
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Figure 4-61 Evidence for multiple forms 
of heterochromatin. In this study, 240 
late-replicating sites on the Drosophila 
polytene chromosome arms were 
examined for the presence of two non- 
histone proteins. These proteins are 
known to help compact two different 
forms of heterochromatin (see text). As 
indicated, antibody staining suggests 
that roughly half of the sites are 
packaged into forms of heterochromatin 
that are different from either of these 
two. Experiments such as these 
demonstrate that we have a great deal 
more to learn about the packaging of 
DNA in eucaryotes. (Data from 
I.F. Zhimulev and E.S. Belyaeva, BioEssays 
25:1 040-1 05 1 , 2003. With permission 
from John Wiley & Sons.) 
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that each of these types of heterochromatin is differently regulated and has dif- 
ferent roles in the cell. 

The chromatin structure in each domain ultimately depends on the proteins 
that bind to specific DNA sequences, and these are known to vary depending on 
the cell type and its stage of development in a multicellular organism. Thus, both 
the pattern of chromatin domains and their individual compositions (nucleo- 
some modifications plus non-histone proteins) can vary between tissues. These 
differences make different genes accessible for genetic readout, helping to 
explain the cell diversification that accompanies embryonic development 
(described in Chapter 22). Comparisons of the polytene chromosomes in two dif- 
ferent tissues of a fly lend support to this general idea: although the patterns of 
bands and interbands are largely the same, there are reproducible differences. 



Chromatin Loops Decondense When the Genes Within Them Are 
Expressed 

When an insect progresses from one developmental stage to another, distinctive 
chromosome puffs arise and old puffs recede in its polytene chromosomes as 
new genes become expressed and old ones are turned off (Figure 4-62). From 
inspection of each puff when it is relatively small and the banding pattern is still 
discernible, it seems that most puffs arise from the decondensation of a single 
chromosome band. 

The individual chromatin fibers that make up a puff can be visualized with 
an electron microscope. In favorable cases, loops are seen, much like those 
observed in the amphibian lampbrush chromosomes discussed above. When 
not expressed, the loop of DNA assumes a thickened structure, possibly a folded 
30-nm fiber, but when gene expression is occurring, the loop becomes more 
extended. In electron micrographs, the chromatin located on either side of the 
decondensed loop appears considerably more compact, suggesting that a loop 
constitutes a distinct functional domain of chromatin structure. 

Observations made in human cells also suggest that highly folded loops of 
chromatin expand to occupy an increased volume when a gene within them is 
expressed. For example, quiescent chromosome regions from 0.4 to 2 million 
nucleotide pairs in length appear as compact dots in an interphase nucleus 
when visualized by fluorescence microscopy using FISH or other technologies. 
However, the same DNA is seen to occupy a larger territory when its genes are 
expressed, with elongated, punctate structures replacing the original dot. 



Chromatin Can Move to Specific Sites Within the Nucleus to Alter 
Gene Expression 

New ways of visualizing individual chromosomes have shown that each of the 46 
interphase chromosomes in a human cell tends to occupy its own discrete terri- 
tory within the nucleus (Figure 4-63). However, pictures such as these present 
only an average view of the DNA in each chromosome. Experiments that specifi- 
cally localize the heterochromatic regions of a chromosome reveal that they are 
often closely associated with the nuclear lamina, regardless of the chromosome 
examined. And DNA probes that preferentially stain gene-rich regions of human 



Figure 4-63 Simultaneous visualization of the chromosome territories 
for all of the human chromosomes in a single interphase nucleus. A FISH 
analysis using a different mixture of fluorochromes for marking the DNA of 
each chromosome, detected with seven color channels in a fluoresence 
microscope, allows each chromosome to be distinguished in three- 
dimensional reconstructions. Below the micrograph, each chromosome is 
identified in a schematic of the actual image. Note that the two 
homologous chromosomes (e.g., the two copies of chromosome 9), are not 
in general co-located. (From M.R. Speicher and N.P. Carter, Not. Rev. Genet. 
6:782-792, 2005. With permission from Macmillan Publishers Ltd.) 
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Figure 4-62 RNA synthesis in polytene 
chromosome puffs. An autoradiograph 
of a single puff in a polytene 
chromosome from the salivary glands of 
the freshwater midge C tentans. As 
outlined in Chapter 1 and described in 
detail in Chapter 6, the first step in gene 
expression is the synthesis of an RNA 
molecule using the DNA as a template. 
The decondensed portion of the 
chromosome is undergoing RNA 
synthesis and has become labeled with 
3 H-uridine (see p. 603), an RNA precursor 
molecule that is incorporated into 
growing RNA chains. (Courtesy of Jose 
Bonner.) 
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Figure 4-64 The distribution of gene-rich regions of the human genome 
in an interphase nucleus. Gene-rich regions have been visualized with a 
fluorescent probe that hybridizes to the Alu interspersed repeat, which is 
present in more than a million copies in the human genome (see Figure 
5-75). For unknown reasons, these sequences cluster in chromosomal 
regions rich in genes. In this representation, regions enriched for the Alu 
sequence are green, regions depleted for these sequences are red, while 
the average regions are yellow. The gene-rich regions are seen to be 
depleted in the DNA near the nuclear envelope. (From A. Bolzer et al., PLoS 
Biol. 3:826-842, 2005. With permission from Public Library of Science.) 

chromosomes produce a striking picture of the interphase nucleus that presum- 
ably reflects different average positions for active and inactive genes (Figure 
4-64). 

A variety of different types of experiments have led to the conclusion that 
the position of a gene in the interior of the nucleus changes when it becomes 
highly expressed. Thus, a region that becomes very actively transcribed is often 
found to extend out of its chromosome territory, as if in an extended loop (Fig- 
ure 4-65). We will see in Chapter 6 that the initiation of transcription— the first 
step in gene expression— requires the assembly of over 100 proteins, and it 
makes sense that this would occur most rapidly in regions of the nucleus partic- 
ularly rich in these proteins. 

More generally it is clear that the nucleus is very heterogeneous, with func- 
tionally different regions to which portions of chromosomes can move as they 
are subjected to different biochemical processes— such as when their gene 
expression changes (Figure 4-66). There is evidence that some of these nuclear 
regions are marked with different inositol phospholipids, reminiscent of the way 
that the same lipids are used to distinguish different membranes in the cyto- 
plasm (see Figure 13-11). But what these lipids are attached to in the interior of 
the nucleus is a mystery, as the only known lipid-rich environments are the lipid 
bilayers of the nuclear envelope. 
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Figure 4-65 An effect of high levels of 
gene expression on the intranuclear 
location of chromatin. (A) Fluorescence 
micrographs of human nuclei showing 
how the position of a gene changes 
when it becomes highly transcribed. The 
region of the chromosome adjacent to 
the gene (red) is seen to leave its 
chromosomal territory (green) only when 
it is highly active. (B) Schematic 
representation of a large loop of 
chromatin that expands when the gene is 
on, and contracts when the gene is off. 
Other genes that are less actively 
expressed can be shown by the same 
methods to remain inside their 
chromosomal territory when transcribed. 
(From J.R. Chubb and W.A. Bickmore, Cell 
1 1 2:403-406, 2003. With permission from 
Elsevier.) 
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Networks of Macromolecules Form a Set of Distinct Biochemical 
Environments inside the Nucleus 

In Chapter 6, we describe the function of a variety of subcompartments that are 
present within the nucleus. The largest and most obvious of these is the nucleo- 
lus, a structure well known to microscopists even in the 19th century (see Figure 
4-9). Nucleolar regions consist of networks of RNAs and proteins surrounding 
transcribing ribosomal RNA genes, often existing as multiple nucleoli. The 
nucleolus is the cell's site of ribosome assembly and maturation, as well as the 
place where many other specialized reactions occur. 

A variety of less obvious organelles are also present inside the nucleus. For 
example, spherical structures called Cajal bodies and interchromatin granule 
clusters are present in most plant and animal cells (Figure 4-67). Like the nucle- 
olus, these organelles are composed of selected protein and RNA molecules that 
bind together to create networks that are highly permeable to other protein and 
RNA molecules in the surrounding nucleoplasm (Figure 4-68). 

Structures such as these can create distinct biochemical environments by 
immobilizing select groups of macromolecules, as can other networks of pro- 
teins and RNA molecules associated with nuclear pores and with the nuclear 
envelope. In principle, this allows the molecules that enter these spaces to be 
processed with great efficiency through complex reaction pathways. Highly per- 
meable, fibrous networks of this sort can thereby impart many of the kinetic 
advantages of compartmentalization (see p. 186) to reactions that take place in 
the nucleus (Figure 4-69A). However, unlike the membrane-bound compart- 
ments in the cytoplasm (discussed in Chapter 12), these nuclear subcompart- 
ments — lacking a lipid bilayer membrane— can neither concentrate nor exclude 
specific small molecules. 

The cell has a remarkable ability to construct distinct biochemical environ- 
ments inside the nucleus. Those thus far mentioned facilitate various aspects of 
gene expression to be discussed in Chapter 6 (see Figure 6-49). Like the nucleo- 
lus, these subcompartments appear to form only as needed, and they create a 
high local concentration of the many different enzymes and RNA molecules 
needed for a particular process. In an analogous way when DNA is damaged by 
irradiation, the set of enzymes needed to carry out DNA repair are observed to 
congregate in discrete foci inside the nucleus, creating "repair factories" (see 
Figure 5-60). And nuclei often contain hundreds of discrete foci representing 
factories for DNA or RNA synthesis. 

It seems likely that all of these entities make use of the type of tethering illus- 
trated in Figure 4-69B, where long flexible lengths of polypeptide chain (or some 
other polymer) are interspersed with binding sites that concentrate the multiple 
proteins and/or RNA molecules that are needed to catalyze a particular process. 
Not surprisingly, tethers are similarly used to help to speed biological processes 



Figure 4-66 The movement of genes to 
different regions of the nucleus when 
their expression changes. The interior of 
the nucleus is very heterogeneous, and 
different nuclear neighborhoods are 
known to have distinct effects on gene 
expression. Movements such as those 
indicated here presumably reflect 
changes in the binding affinities that the 
chromatin and RNA molecules 
surrounding a gene have for different 
nuclear neighborhoods. It is thought that 
the movement is driven by diffusion and 
does not require a directed movement 
process, inasmuch as each region of a 
chromosome can be seen to undergo 
constant random motion when marked 
in a way that allows its position to be 
followed in a living cell. 
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Figure 4-67 Electron micrograph showing 
two very common fibrous nuclear 

subcompartments. The large sphere here 
is a Cajal body. The smaller darker sphere is 
an interchromatin granule cluster, also 
known as a spreckle (see also Figure 6-49). 
These "subnuclear organelles" are from the 
nucleus of a Xenopus oocyte. (From 
K.E. Handwerger and J.G. Gall, Trends Cell 
Biol. 16:19-26, 2006. With permission from 
Elsevier.) 
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in the cytoplasm, increasing specific reaction rates (for example, see Figure 
16-38). 

Is there also is an intranuclear framework, analogous to the cytoskeleton, on 
which chromosomes and other components of the nucleus are organized? The 
nuclear matrix, or scaffold, has been defined as the insoluble material left in the 
nucleus after a series of biochemical extraction steps. Many of the proteins and 
RNA molecules that form this insoluble material are likely to be derived from the 
fibrous subcompartments of the nucleus just discussed, while others seem to be 
proteins that help to form the base of chromosomal loops or to attach chromo- 
somes to other structures in the nucleus. Whether or not the nucleus also contains 



Figure 4-68 An experiment showing 
that the subnuclear organelles are 
highly permeable to macromolecules. In 

these micrographs of a living oocyte 
nucleus, the top row compares the 
fluorescence of the interiors of nucleoli, 
Cajal bodies, and spreckles to the 
fluorescence of the surrounding 
nucleoplasm, 1 2 hours after fluorescent 
dextrans of the indicated molecular 
weight had been injected into the 
nucleoplasm. The brightness of each 
organelle reflects its permeability, with 
the most permeable organelle being the 
brightest. For comparison, the bottom 
row presents normal light micrographs of 
the same microscope fields, with the 
nucleolus in each field of view marked 
brown to distinguish it. Cajal bodies can 
be seen to be more permeable than 
nucleoli. However, quantitation shows 
that a great deal of dextran enters each 
organelle, even for the largest dextran 
tested. {From K.E. Handwerger, 
JA Cordero and J.G. Gall, MoL Biol. Cell 
1 6:202-2 1 1 , 2005. With permission from 
American Society of Cell Biology.) 
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Figure 4-69 Effective compartmentalization without a bilayer membrane. (A) Schematic 
illustration of the organization of a spherical subnuclear organelle [left) and of a postulated 
similarly organized subcompartment just beneath the nuclear envelope (right). In both cases, 
RNAs and/or proteins {gray) associate to form highly porous, gel-like structures that contain 
binding sites for other specific proteins and RNA molecules {colored objects). (B) How the 
tethering of a selected set of proteins and RNA molecules to long flexible polymer chains, as in 
A, could create "staging areas" that greatly speed the rates of reactions in subcompartments of 
the nucleus. The reactions catalyzed will depend on the particular macromolecules that are 
localized by the tethering. The same type of rate accelerations are of course expected for similar 
subcompartments established elsewhere in the cell (see also Figure 3-80C). 
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Figure 4-70 A typical mitotic chromosome at metaphase. Each sister 
chromatid contains one of two identical daughter DNA molecules 
generated earlier in the cell cycle by DNA replication (see also Figure 1 7-26). 



long filaments that form organized tracks on which nuclear components can 
move, analogous to some of the filaments in the cytoplasm, is still disputed. 



Mitotic Chromosomes Are Formed from Chromatin in Its Most 
Condensed State 

Having discussed the dynamic structure of interphase chromosomes, we now 
turn to mitotic chromosomes. The chromosomes from nearly all eucaryotic cells 
become readily visible by light microscopy during mitosis, when they coil up to 
form highly condensed structures. This condensation reduces the length of a 
typical interphase chromosome only about tenfold, but it produces a dramatic 
change in chromosome appearance. 

Figure 4-70 depicts a typical mitotic chromosome at the metaphase stage 
of mitosis (for the stages of mitosis, see Figure 17-3). The two daughter DNA 
molecules produced by DNA replication during interphase of the cell-division 
cycle are separately folded to produce two sister chromosomes, or sister chro- 
matids, held together at their centromeres (see also Figure 4-50). These chro- 
mosomes are normally covered with a variety of molecules, including large 
amounts of RNA-protein complexes. Once this covering has been stripped away, 
each chromatid can be seen in electron micrographs to be organized into loops 
of chromatin emanating from a central scaffolding (Figure 4-71). Experiments 
using DNA hybridization to detect specific DNA sequences demonstrate that the 
order of visible features along a mitotic chromosome at least roughly reflects the 
order of genes along the DNA molecule. Mitotic chromosome condensation can 
thus be thought of as the final level in the hierarchy of chromosome packaging 
(Figure 4-72). 

The compaction of chromosomes during mitosis is a highly organized and 
dynamic process that serves at least two important purposes. First, when con- 
densation is complete (in metaphase), sister chromatids have been disentan- 
gled from each other and lie side by side. Thus, the sister chromatids can easily 
separate when the mitotic apparatus begins pulling them apart. Second, the 
compaction of chromosomes protects the relatively fragile DNA molecules from 
being broken as they are pulled to separate daughter cells. 

The condensation of interphase chromosomes into mitotic chromosomes 
begins in early M phase, and it is intimately connected with the progression of 
the cell cycle, as discussed in detail in Chapter 17. During M phase, gene expres- 
sion shuts down, and specific modifications are made to histones that help to 
reorganize the chromatin as it compacts. The compaction is aided by a class of 
proteins called condensins, which use the energy of ATP hydrolysis to help coil 
the two DNA molecules in an interphase chromosome to produce the two chro- 
matids of a mitotic chromosome. Condensins are large protein complexes built 
from SMC protein dimers: these dimers form when two stiff, elongated protein 
monomers join at their tails to form a hinge, leaving two globular head domains 
at the other end that bind DNA and hydrolyze ATP (Figure 4-73). When added 
to purified DNA, condensins can make large right-handed loops in DNA 
molecules in a reaction that requires ATP. Although it is not yet known how they 
act on chromatin, the coiling model shown in Figure 4-73C is based on the fact 
that condensins are a major structural component that end up at the core of 
metaphase chromosomes, with about one molecule of condensin for every 

Figure 4-71 A scanning electron micrograph of a region near one end of 
a typical mitotic chromosome. Each knoblike projection is believed to 
represent the tip of a separate looped domain. Note that the two identical 
paired chromatids (drawn in Figure 4-70) can be clearly distinguished. 
{From M.P. Marsden and U.K. Laemmli, Cell 17:849-858, 1979. With 
permission from Elsevier.) 
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NET RESULT: EACH DNA MOLECULE HAS BEEN 

PACKAGED INTO A MITOTIC CHROMOSOME THAT 

IS 10,000-FOLD SHORTER THAN ITS EXTENDED LENGTH 



Figure 4-72 Chromatin packing. This 
model shows some of the many levels of 
chromatin packing postulated to give rise 
to the highly condensed mitotic 
chromosome. 
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Figure 4-73 The SMC proteins in condensins. (A) Electron micrographs of 
a purified SMC dimer. (B) The structure of a SMC dimer. The long central 
region of this protein is an antiparallel coiled-coil (see Figure 3-9) with a 
flexible hinge in its middle. (C) A model for the way in which the SMC 
proteins in condensins might compact chromatin. In reality, SMC proteins 
are components of a much larger condensin complex. It has been 
proposed that, in the cell, condensins coil long strings of looped chromatin 
domains (see Figure 4-57). In this way, the condensins could form a 
structural framework that maintains the DNA in a highly organized state 
during metaphase of the cell cycle. (A, courtesy of H.P. Erickson; B and C, 
adapted from T. Hirano, Nat. Rev. Mol. Cell Biol. 7:31 1-322, 2006. With 
permission from Macmillan Publishers Ltd.) 
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Figure 4-74 The location of condensin in condensed mitotic 
chromosomes. (A) Fluorescence micrograph of a human chromosome at 
mitosis, stained with an antibody that localizes condensin. In chromosomes 
that are this highly condensed, the condensin is seen to be concentrated in 
punctate structures along the chromosome axis. Similar experiments show 
a similar location for DNA topoisomerase II, an enzyme that makes 
reversible double-strand breaks in DNA that allow one DNA double helix to 
pass through another (see Figure 5-23). (B) immunogold electron 
microscopy reveals localization of condensin {black dots). Here a chromatid 
is seen in cross section, with the chromosome axis perpendicular to the 
plane of the paper. (A, from K. Maeshima and U.K. Laemmli, Dev. Cell 
4:467-480, 2003. With permission from Elsevier. B, courtesy of U.K. Laemmli, 
from K. Maeshima, M. Eltsov and U.K. Laemmli, Chromosoma 1 14:365-375, 
2005. With permission from Springer.) 

10,000 nucleotides of DNA (Figure 4-74). When condensins are experimentally 
depleted from a cell, chromosome condensation still occurs, but the process is 
abnormal. 



Summary 

Chromosomes are generally decondensed during interphase, so that the details of their 
structure are difficult to visualize. Notable exceptions are the specialized lampbrush 
chromosomes of vertebrate oocytes and the polytene chromosomes in the giant secre- 
tory cells of insects. Studies of these two types of interphase chromosomes suggest that 
each long DNA molecule in a chromosome is divided into a large number of discrete 
domains organized as loops of chromatin, each loop probably consisting of a 30-nm 
chromatin fiber that is compacted by further folding. When genes contained in a loop 
are expressed, the loop unfolds and allows the cell's machinery access to the DNA. 

Interphase chromosomes occupy discrete territories in the cell nucleus; that is, they 
are not extensively intertwined. Euchromatin makes up most of interphase chromo- 
somes and, when not being transcribed, it probably exists as tightly folded 30-nm 
fibers. However, euchromatin is interrupted by stretches of heterochromatin, in which 
the 30-nm fibers are subjected to additional packing that usually renders it resistant to 
gene expression. Heterochromatin exists in several forms, some of which are found in 
large blocks in and around centromeres and near telomeres. But heterochromatin is 
also present at many other positions on chromosomes, where it can serve to regulate 
developmentally important genes. 

The interior of the nucleus is highly dynamic, with heterochromatin often posi- 
tioned near the nuclear envelope and loops of chromatin moving away from their 
chromosome territory when genes are very highly expressed. This reflects the existence 
of nuclear subcompartments, where different sets of biochemical reactions are facili- 
tated by an increased concentration of selected proteins and RNAs. The components 
involved informing a subcompartment can self-assemble into discrete organelles such 
as nucleoli or Cajal bodies; they can also be tethered to fixed structures such as the 
nuclear envelope. 

During mitosis, gene expression shuts down and all chromosomes adopt a highly 
condensed conformation in a process that begins early in M phase to package the two 
DNA molecules of each replicated chromosome as two separately folded chromatids. 
This process is accompanied by histone modifications that facilitate chromatin pack- 
ing. However, satisfactory completion of this orderly process, which reduces the end-to- 
end distance of each DNA molecule from its interphase length by an additional factor 
often, requires condensin proteins. 
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HOW GENOMES EVOLVE 

In this chapter, we have discussed the structure of genes and the ways that they 
are packaged and arranged in chromosomes. In this final section, we provide an 
overview of some of the ways that genes and genomes have evolved over time 
to produce the vast diversity of modern-day life forms on our planet. Genome 
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sequencing has revolutionized our view of the process of molecular evolution, 
uncovering an astonishing wealth of information about specific family rela- 
tionships among organisms, as well as illuminating evolutionary mechanisms 
more generally. 

It is perhaps not surprising that genes with similar functions can be found in 
a diverse range of living things. But the great revelation of the past 25 years has 
been the discovery that the actual nucleotide sequences of many genes are suf- 
ficiently well conserved that homologous genes — that is, genes that are similar 
in both their nucleotide sequence and function because of a common ances- 
try — can often be recognized across vast phylogenetic distances. For example, 
unmistakable homologs of many human genes are easy to detect in such organ- 
isms as nematode worms, fruit flies, yeasts, and even bacteria. In many cases, 
the resemblance is so close that the protein-coding portion of a yeast gene can 
be substituted with its human homolog — even though we and yeast are sepa- 
rated by more than a billion years of evolutionary history. 

As emphasized in Chapter 3, the recognition of sequence similarity has 
become a major tool for inferring gene and protein function. Although finding a 
sequence match does not guarantee similarity in function, it has proven to be an 
excellent clue. Thus, it is often possible to predict the function of genes in humans 
for which no biochemical or genetic information is available simply by comparing 
their nucleotide sequences with the sequences of genes in other organisms. 

In general, gene sequences are more tightly conserved than is overall 
genome structure. As we saw earlier, other features of genome organization 
such as genome size, number of chromosomes, order of genes along chromo- 
somes, abundance and size of introns, and amount of repetitive DNA are found 
to differ greatly among organisms, as does the number of genes that an organ- 
ism contains. 

The number of genes is only very roughly correlated with the phenotypic 
complexity of an organism (see Table 1-1). Much of the increase in gene number 
observed with increasing biological complexity involves the expansion of fami- 
lies of closely related genes, an observation that establishes gene duplication 
and divergence as major evolutionary processes. Indeed, it is likely that all pre- 
sent-day genes are descendants— via the processes of duplication, divergence, 
and reassortment of gene segments— of a few ancestral genes that existed in 
early life forms. 

Genome Alterations are Caused by Failures of the Normal 
Mechanisms for Copying and Maintaining DNA 

Cells in the germline do not have specialized mechanisms for creating changes 
in the structures of their genomes: evolution depends instead on accidents and 
mistakes followed by nonrandom survival. Most of the genetic changes that 
occur result simply from failures in the normal mechanisms by which genomes 
are copied or repaired when damaged, although the movement of transposable 
DNA elements also plays an important role. As we will discuss in Chapter 5, the 
mechanisms that maintain DNA sequences are remarkably precise— but they 
are not perfect. For example, because of the elaborate DNA-replication and 
DNA-repair mechanisms that enable DNA sequences to be inherited with 
extraordinary fidelity, along a given line of descent only about one nucleotide 
pair in a thousand is randomly changed in the germline every million years. 
Even so, in a population of 10,000 diploid individuals, every possible nucleotide 
substitution will have been "tried out" on about 20 occasions in the course of a 
million years — a short span of time in relation to the evolution of species. 

Errors in DNA replication, DNA recombination, or DNA repair can lead either 
to simple changes in DNA sequence— such as the substitution of one base pair 
for another— or to large-scale genome rearrangements such as deletions, dupli- 
cations, inversions, and translocations of DNA from one chromosome to another. 
In addition to these failures of the genetic machinery, the various mobile DNA 
elements that will be described in Chapter 5 are an important source of genomic 
change (see Table 5-3, p. 318). These transposable DNA elements [transvosons] 
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are parasitic DNA sequences that colonize genomes and can spread within 
them. In the process, they often disrupt the function or alter the regulation of 
existing genes. On occasion, they can even create altogether novel genes 
through fusions between transposon sequences and segments of existing genes. 
Over long periods of evolutionary time, transposons have profoundly affected 
the structure of genomes. In fact, nearly half of the DNA in the human genome 
has recognizable sequence similarity with known transposon sequences, 
thereby indicating that these sequences are remnants of past transposition 
events (see Figure 4-17). Even more of our genome is no doubt derived from 
transposition events that occurred so long ago (>10 8 years) that the sequences 
can no longer be traced to transposons. 



The Genome Sequences of Two Species Differ in Proportion to 
the Length of Time That They Have Separately Evolved 

The differences between the genomes of species alive today have accumulated 
over more than 3 billion years. Lacking a direct record of changes over time, we 
can nevertheless reconstruct the process of genome evolution from detailed 
comparisons of the genomes of contemporary organisms. 

The basic tool of comparative genomics is the phylogenetic tree. A simple 
example is the tree describing the divergence of humans from the great apes 
(Figure 4-75). The primary support for this tree comes from comparisons of 
gene or protein sequences. For example, comparisons between the sequences of 
human genes or proteins and those of the great apes typically reveal the fewest 
differences between human and chimpanzee and the most between human and 
orangutan. 

For closely related organisms such as humans and chimpanzees, it is rela- 
tively easy to reconstruct the gene sequences of the extinct, last common ances- 
tor of the two species (Figure 4-76). The close similarity between human and 
chimpanzee genes is mainly due to the short time that has been available for the 
accumulation of mutations in the two diverging lineages, rather than to func- 
tional constraints that have kept the sequences the same. Evidence for this view 
comes from the observation that even DNA sequences whose nucleotide order 
is functionally unconstrained — such as the sequences that code for the fib- 
rinopeptides (see p. 264) or the third position of "synonymous" codons (codons 
specifying the same amino acid — see Figure 4-76) — are nearly identical in 
humans and chimpanzees. 

For much less closely related organisms, such as humans and chickens 
(which have evolved separately for about 300 million years), the sequence con- 
servation found in genes is largely due to purifying selection (that is, selection 
that eliminates individuals carrying mutations that interfere with important 
genetic functions), rather than to an inadequate time for mutations to occur. As 
a result, protein-coding, RNA-coding, and regulatory sequences in the DNA are 
often remarkably conserved. In contrast, most DNA sequences in the human 
and chicken genomes have diverged so far due to multiple mutations that it is 
often impossible to align them with one another. 




human chimpanzee gorilla 



orangutan 



Figure 4-75 A phylogenetic tree showing 
the relationship between the human and 
the great apes based on nucleotide 
sequence data. As indicated, the sequences 
of the genomes of all four species are 
estimated to differ from the sequence of 
the genome of their last common ancestor 
by a little over 1 .5%. Because changes occur 
independently on both diverging lineages, 
pairwise comparisons reveal twice the 
sequence divergence from the last 
common ancestor. For example, 
human-orangutan comparisons typically 
show sequence divergences of a little over 
3%, while human-chimpanzee 
comparisons show divergences of 
approximately 1.2%. (Modified from 
F.C. Chen and W.H. Li, Am. J. Hum. Genet. 
68:444-456, 2001. With permission from 
University of Chicago Press.) 



248 Chapter 4: DNA, Chromosomes, and Genomes 



gorilla CAA 

Q 
human gtgcccatccaaaaagtccaagatgacaccaaaaccctcatcaagacaattgtcaccagg 
I I I I ! I I I I I I M I I II I! I I I I I I i I i I I I I ! I 1 I I I I I II I I I I I I I I I I I II I I 

Chtmp GTGCCCATCCAAAAAGTCCAGGATGACACCAAAACCCTCATCAAGACAATTGTCACCAGG 

protein vpiqkvqddtktliktivtr 



human ATCAATGACATTTCACACACGCAGTCAGTCTCCTCCAAACAGAAAGTCACCGGTTTGGAC 

II II III II II Mill II III II II II lllllll IIIIMIMI Mill II II II III I 

Chimp ATCAATGACATTTCACACACGCAGTCAGTCTCCTCCAAACAGAAGGTCACCGGTTTGGAC 

protein indi shtosvsskqkvtgld 

gorilla aag 

gorilla CCC 

P 
human ttcattcctgggctccaccccatcctgaccttatccaagatggaccagacactggcagtc 
I I I I I M I I I I I I I I I I I I I I I I I I I I M I II M I I I I I I I I I II I I I I I I I i II I I 
chimp ttcattcctgggctccaccctatcctgaccttatccaagatggaccagacactggcagtc 
protein fipglhpiltlskmdqtlav 



human taccaacagatcctcaccagtatgccttccagaaacgtgatccaaatatccaacgacctg 
II MMI III II II II II III II II lllllll MM MUM Ml II MM MM I I 
chimp taccaacagatcctcaccagtatgccttccagaaacatgatccaaatatccaacgacctg 
protein yqqiltsmpsrnmiqisndl 

gorilla ATG 



human GAGAACCTCCGGGATCTTCTTCAGGTGCTGGCCTTCTCTAAGAGCTGCCACTTGCCCTGG 

I I I I I ; I I I I I I I MMIIMMM I I I I i I I I I I I I I ( I I I I I I M I I I I I I I 1 1 I I 

Chimp GAGAACCTCCGGGACCTTCTTCAGGTGCTGGCCTTCTCTAAGAGCTGCCACTTGCCCTGG 

protein enlrdllhvlafskschlpw 
gorilla GAC 



Phylogenetic Trees Constructed from a Comparison of DNA 
Sequences Trace the Relationships of All Organisms 

Integration of phylogenetic trees based on molecular sequence comparisons 
with the fossil record has led to the best available view of the evolution of mod- 
ern life forms. The fossil record remains important as a source of absolute dates 
based on the decay of radioisotopes in the rock formations in which fossils are 
found. However, precise divergence times between species are difficult to estab- 
lish from the fossil record, even for species that leave good fossils with distinc- 
tive morphology. 

Such integrated phylogenetic trees suggest that changes in the sequences of 
particular genes or proteins tend to occur at a nearly constant rate, although 
rates that differ from the norm by as much as twofold are observed in particular 
lineages. As discussed above and in Chapter 5, this "molecular clock" runs most 
rapidly and regularly in sequences that are not subject to purifying selection — 
such as intergenic regions, portions of introns that lack splicing or regulatory 
signals, and genes that have been irreversibly inactivated by mutation (the so- 
called pseudogenes). The clock runs most slowly for sequences that are subject 
to strong functional constraints — for example, the amino acid sequences of pro- 
teins such as actin that engage in specific interactions with large numbers of 
other proteins and whose structure is therefore highly constrained (see, for 
example, Figure 16-18). 

Occasionally, rapid change is seen in a previously highly conserved 
sequence. As discussed later in this chapter, such episodes are especially inter- 
esting because they are thought to reflect periods of strong positive selection for 
mutations that conferred a selective advantage in the particular lineage where 
the rapid change occurred. 

Molecular clocks run at rates that are determined both by mutation rates and 
by the degree of purifying selection on particular sequences. Therefore, a com- 
pletely different calibration is required for those genes replicated and repaired by 
different systems within cells. Most notably, in animals, although not in plants, 
clocks based on functionally unconstrained mitochondrial DNA sequences run 



Figure 4-76 Tracing the ancestral 
sequence from a sequence comparison 
of the coding regions of human and 
chimpanzee leptin genes. Leptin is a 
hormone that regulates food intake and 
energy utilization in response to the 
adequacy of fat reserves. As indicated by 
the codons boxed in green, only 
5 nucleotides (of 441 total) differ 
between these two sequences. Moreover, 
when the amino acids encoded by both 
the human and chimpanzee sequences 
are examined, in only one of the 5 
positions does the encoded amino acid 
differ. For each of the 5 variant nucleotide 
positions, the corresponding sequence in 
the gorilla is also indicated. In two cases, 
the gorilla sequence agrees with the 
human sequence, while in three cases it 
agrees with the chimpanzee sequence. 
What was the sequence of the leptin 
gene in the last common ancestor? An 
evolutionary model that seeks to 
minimize the number of mutations 
postulated to have occurred during the 
evolution of the human and chimpanzee 
genes would assume that the leptin 
sequence of the last common ancestor 
was the same as the human and 
chimpanzee sequences when they agree; 
when they disagree, it would use the 
gorilla sequence as a tie-breaker. For 
convenience, only the first 
300 nucleotides of the leptin coding 
sequences are given. The remaining 
141 are identical between humans and 
chimpanzees. 
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Figure 4-77 A phylogenetic tree 
highlighting some of the mammals 
whose genomes are being extensively 
studied. The length of each line is 
proportional to the number of "neutral 
substitutions"— representing the 
nucleotide changes observed in the 
absence of purifying selection. {Adapted 
from G.M. Cooper et al., Genome Res. 
15:901-913, 2005. With permission from 
Cold Spring Harbor Laboratory Press.) 



much faster than clocks based on functionally unconstrained nuclear sequences, 
due to an unusually high mutation rate in animal mitochondria. 

Molecular clocks have a finer time resolution than the fossil record and are 
a more reliable guide to the detailed structure of phylogenetic trees than are 
classical methods of tree construction, which are based on comparisons of the 
morphology and development of different species. For example, the precise rela- 
tionship among the great-ape and human lineages was not settled until suffi- 
cient molecular-sequence data accumulated in the 1980s to produce the tree 
that was shown in Figure 4-75. And with huge amounts of DNA sequence now 
determined from a variety of mammals, much better estimates of our relation- 
ship to them are being obtained (Figure 4-77). 



A Comparison of Human and Mouse Chromosomes Shows How 
The Structures of Genomes Diverge 

As would be expected, the human and chimpanzee genomes are much more 
alike than are the human and mouse genomes. Although the size of the human 
and mouse genomes are roughly the same and they contain nearly identical sets 
of genes, there has been a much longer time period over which changes have 
had a chance to accumulate — approximately 80 million years versus 6 million 
years. In addition, as indicated in Figure 4-77, rodent lineages (represented by 
the rat and the mouse) have unusually fast molecular clocks. Hence, these lin- 
eages have diverged from the human lineage more rapidly than otherwise 
expected. 

As indicated by the DNA sequence comparison in Figure 4-78, mutation has 
led to extensive sequence divergence between humans and mice at all sites that 
are not under selection — such as most nucleotide sequences in introns. In con- 
trast, in human-chimpanzee comparisons, nearly all sequence positions are the 
same simply because not enough time has elapsed since the last common 
ancestor for large numbers of changes to have occurred. 

In contrast to the situation for humans and chimpanzees, local gene order 
and overall chromosome organization have diverged greatly between humans 



Figure 4-78 Comparison of a portion of 
the mouse and human leptin genes. 

Positions where the sequences differ by a 
single nucleotide substitution are boxed 
in green, and positions that differ by the 
addition or deletion of nucleotides are 
boxed in yellow. Note that the coding 
sequence of the exon is much more 
conserved than is the adjacent intron 
sequence. 
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GTGCCCATCCAAAAAGTCCAAGATGACACCAAAACCCTCATCAAGACAATTGTCACCAGGATCAATGACATTTCACACACGpTAAGGAGAGT 

human 



-GGAGTCTCATGGGGGGACAAAGATGTAGGACTAGA 
-ATGC-^GGGACAAA GTAGAACTGCA 



mouse 

ACCAGAGTCTGAGAAACATGTCATGCACCTCCTAGAAGCTGAGAGTTTAT-AAGCCTCGAGTGTACAT-TATTTCTGGTCATGGCTCTTGTCACTGCTGCCTGCTGAAATACAGGGCTGA 
GCCAG--CCC-AGCACTGGCTCCTAGTGGCACTGGACCCAGATAGTCCAAGAAACATTTATTGAACGCCTCCTGAATGCCAGGCACCTACTGGAAGCTGA--GAAGGATTTGAAAGCACA 

human 



250 Chapter 4: DNA, Chromosomes, and Genomes 



human chromosome 14 















mouse chromosome 12 



200,000 bases 



and mice. According to rough estimates, a total of about 180 break-and-rejoin 
events have occurred in the human and mouse lineages since these two species 
last shared a common ancestor. In the process, although the number of chro- 
mosomes is similar in the two species (23 per haploid genome in the human ver- 
sus 20 in the mouse), their overall structures differ greatly. Nonetheless, even 
after the extensive genomic shuffling, there are many large blocks of DNA in 
which the gene order is the same in the human and the mouse. These stretches 
of conserved gene order in chromosomes are referred to as regions of synteny. 

An unexpected conclusion from a detailed comparison of the complete 
mouse and human genome sequences, confirmed from subsequent compar- 
isons between the genomes of other vertebrates, is that small blocks of 
sequences are being deleted from and added to genomes at a surprisingly rapid 
rate. Thus, if we assume that our common ancestor had a genome of human size 
(about 3 billion nucleotide pairs), mice would have lost a total of about 45 per- 
cent of that genome from accumulated deletions during the past 80 million 
years, while humans would have lost about 25 percent. However, substantial 
sequence gains from many small chromosome duplications and from the mul- 
tiplication of transposons have compensated for these deletions. As a result, our 
genome size is unchanged from that of the last common ancestor for humans 
and mice, while the mouse genome is smaller by only 0.3 billion nucleotides. 

Good evidence for the loss of DNA sequences in small blocks during evolu- 
tion can be obtained from a detailed comparison of most regions of synteny in 
the human and mouse genomes. The comparative shrinkage of the mouse 
genome can be clearly seen from such comparisons, with the net loss of 
sequences scattered throughout the long stretches of DNA that are otherwise 
homologous (Figure 4-79). 

DNA is added to genomes both by the spontaneous duplication of chromo- 
somal segments that contain tens of thousands of nucleotide pairs (as will be 
discussed shortly), and by active transposition (most transposition events are 
duplicative, because the original copy of the transposon stays where it was when 
a copy inserts at the new site; for example, see Figure 5-74). Comparison of the 
DNA sequences derived from transposons in the human and the mouse there- 
fore readily reveals some of the sequence additions (Figure 4-80). 

For unknown reasons, all mammals have genome sizes of about 3 billion 
nucleotide pairs that contain nearly identical sets of genes, even though only on 
the order of 150 million nucleotide pairs appear to be under sequence-specific 
functional constraints. 
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Figure 4-79 Comparison of a syntenic 
portion of mouse and human genomes. 

About 90 percent of the two genomes can 
be aligned in this way. Note that while 
there is an identical order of the matched 
index sequences {red marks), there has 
been a net loss of DNA in the mouse 
lineage that is interspersed throughout the 
entire region. This type of net loss is typical 
for all such regions, and it accounts for the 
fact that the mouse genome contains 14 
percent less DNA than does the human 
genome. (Adapted from Mouse 
Sequencing Consortium, Nature 
420:520-573, 2002. With permission from 
Macmillan Publishers Ltd.) 



Figure 4-80 A comparison of the 
(3-globin gene cluster in the human and 
mouse genomes, showing the location 
of transposable elements. This stretch of 
human genome contains five functional 
(3-globin-like genes (orange); the 
comparable region from the mouse 
genome has only four. The positions of 
the human Alu sequence are indicated by 
green circles, and the human LI 
sequences by red circles. The mouse 
genome contains different but related 
transposable elements: the positions of 
B1 elements (which are related to the 
human Alu sequences) are indicated by 
blue triangles, and the positions of the 
mouse L1 elements (which are related to 
the human LI sequences) are indicated 
by orange triangles. The absence of 
transposable elements from the globin 
structural genes can be attributed to 
purifying selection, which would have 
eliminated any insertion that 
compromised gene function. (Courtesy 
of Ross Hardison and Webb Miller.) 
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The Size of a Vertebrate Genome Reflects the Relative Rates of 
DNA Addition and DNA Loss in a Lineage 

Now that we know the complete sequence of a number of vertebrate genomes, 
we see that genome size can vary considerably, apparently without a drastic 
effect on the organism or its number of genes. Thus, while the mouse and dog 
genomes are both in the typical mammalian size range, the chicken has a 
genome that is only about one-third human size (one billion nucleotide pairs). 
A particularly notable example of an organism with a genome of anomalous size 
is the puffer fish, Fugu rubripes (Figure 4-81), which has a tiny genome for a ver- 
tebrate (0.4 billion nucleotide pairs compared to 1 billion or more for many 
other fish). The small size of the Fugu genome is largely due to the small size of 
its introns. Specifically Fugu introns, as well as other noncoding segments of the 
Fugu genome, lack the repetitive DNA that makes up a large portion of the 
genomes of most well-studied vertebrates. Nevertheless, the positions of Fugu 
introns are nearly perfectly conserved relative to their positions in mammalian 
genomes (Figure 4-82). 

While initially a mystery, we now have a simple explanation for such large 
differences in genome size between similar organisms: because all vertebrates 
experience a continuous process of DNA loss and DNA addition, the size of a 
genome merely depends on the balance between these opposing processes act- 
ing over millions of years. Suppose, for example, that in the lineage leading to 
Fugu, the rate of DNA addition happened to slow greatly. Over long periods of 
time, this would result in a major "cleansing" from this fish genome of those 
DNA sequences whose loss could be tolerated. In retrospect, the process of puri- 
fying selection in the Fugu lineage has partitioned those vertebrate DNA 
sequences most likely to be functional into only 400 million nucleotide pairs of 
DNA, providing a major resource for scientists. 




Figure 4-81 The puffer fish, Fugu 
rubripes. (Courtesy of Byrappa Venkatesh.) 



We Can Reconstruct the Sequence of Some Ancient Genomes 

The genomes of ancestral organisms can be inferred, but never directly 
observed: there are no ancient organisms alive today. Although a modern organ- 
ism such as the horseshoe crab looks remarkably similar to fossil ancestors that 
lived 200 million years ago, there is every reason to believe that the horseshoe- 
crab genome has been changing during all that time at a rate similar to that 
occurring in other evolutionary lineages. Selection constraints must have main- 
tained key functional properties of the horseshoe-crab genome to account for 
the morphological stability of the lineage. However, genome sequences reveal 
that the fraction of the genome subject to purifying selection is small; hence the 
genome of the modern horseshoe crab must differ greatly from that of its extinct 
ancestors, known to us only through the fossil record. 

Is there any way around this problem? Can we ever hope to decipher large 
sections of the genome sequence of the extinct ancestors of organisms that are 
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Figure 4-82 Comparison of the genomic 
sequences of the human and Fugu genes 
encoding the protein huntingtin. Both 
genes (indicated in red) contain 
67 short exons that align in 1 :1 
correspondence to one another; these 
exons are connected by curved lines. The 
human gene is 7.5 times larger than the 
Fugu gene (1 80,000 versus 27,000 
nucleotide pairs). The size difference is 
entirely due to larger introns in the human 
gene. The larger size of the human introns 
is due in part to the presence of 
retrotransposons, whose positions are 
represented by green vertical lines; the 
Fugu introns lack retrotransposons. In 
humans, mutation of the huntingtin gene 
causes Huntington's disease, an inherited 
neurodegenerative disorder. (Adapted 
from S. Baxendale et al., Nat. Genet. 
10:67-76, 1995. With permission from 
Macmillan Publishers Ltd.) 
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alive today? For organisms that are as closely related as human and chimp, we 
saw that this may not be difficult. In that case, reference to the gorilla sequence 
can be be used to sort out which of the few differences between human and 
chimp DNA sequences was inherited from our common ancestor some 6 million 
years ago (see Figure 4-76). For an ancestor that has produced a large number of 
different organisms alive today the DNA sequences of many species can be 
compared simultaneously to unscramble the ancestral sequence, allowing sci- 
entists to trace DNA sequences much farther back in time. For example, from 
the complete genome sequences of 20 modern mammals that will soon be 
obtained, it should be possible to decipher most of the genome sequence of the 
100 million year-old Boreoeutherian mammal that gave rise to species as diverse 
as dog, mouse, rabbit, armadillo and human (see Figure 4-77). 



Multispecies Sequence Comparisons Identify Important DNA 
Sequences of Unknown Function 

The massive quantity of DNA sequence now in databases (more than a hundred 
billion nucleotide pairs) provides a rich resource that scientists can mine for 
many purposes. We have already discussed how this information can be used to 
unscramble the evolutionary pathways that have led to modern organisms. But 
sequence comparisons also provide many insights into how cells and organisms 
function. Perhaps the most remarkable discovery in this realm has been the 
observation that, although only about 1 .5% of the human genome codes for pro- 
teins, about three times this amount (in total, 5% of the genome — see Table 4-1, 
p. 206) has been strongly conserved during mammalian evolution. This mass of 
conserved sequence is most clearly revealed when we align and compare DNA 
synteny blocks from many different species. In this way, so-called multispecies 
conserved sequences can be readily identified (Figure 4-83). Most of the non- 
coding conserved sequences discovered in this way turn out to be relatively 
short, containing between 50 and 200 nucleotide pairs. The strict conservation 
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Figure 4-83 The detection of 
multispecies conserved sequences. In 

this example, genome sequences for 
each of the organisms shown have been 
compared with the indicated region of 
the human CFTR gene, scanning in 
25 nucleotide blocks. For each organism, 
the percent identity across its syntenic 
sequences is plotted in green. In addition, 
a computational algorithm has been 
used to detect the sequences within this 
region that are most highly conserved 
when the sequences from all of the 
organisms are taken into account. 
Besides the exon, three other blocks of 
multispecies conserved sequences are 
shown. The function of most such 
sequences in the human genome is not 
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implies that they have important functions that have been maintained by puri- 
fying selection. The puzzle is to unravel what those functions are. Some of the 
conserved sequence that does not code for protein codes for untranslated RNA 
molecules that are known to have important functions, as we shall see in later 
chapters. Another fraction of the noncoding conserved DNA is certainly 
involved in regulating the transcription of adjacent genes, as discussed in Chap- 
ter 7. But we do not yet know how much of the conserved DNA can be accounted 
for in these ways, and the bulk of it is still a deep mystery. The solution to this 
mystery is bound to have profound consequences for medicine, and it reveals 
how much more we need to learn about the biology of vertebrate organisms. 

How can cell biologists tackle this problem? The first step is to distinguish 
between the conserved regions that code for protein and those that do not, and 
then, among the latter, to focus on those that do not already have some other 
identified function, in coding for structural RNA molecules, for example. The 
next task is to discover what proteins or RNA molecules bind to these mysteri- 
ous DNA sequences, how they are packaged into chromatin, and whether they 
ever serve as templates for RNA synthesis. Most of this task still lies before us, 
but a start has been made, and some remarkable insights have been obtained. 
One of the most intriguing concerns the evolutionary changes that have made 
us humans different from other animals — changes, that is, in sequences that 
have been conserved among our close relatives but have undergone sudden 
rapid change in the human sublineage. 

Accelerated Changes in Previously Conserved Sequences Can 
Help Decipher Critical Steps in Human Evolution 

As soon as both the human and the chimpanzee genome sequences became 
available, scientists began searching for DNA sequence changes that might 
account for the striking differences between us and them. With 3 billion 
nucleotide pairs to compare in the two species, this might seem an impossible 
task. But the job was made much easier by confining the search to 35,000 clearly 
defined multispecies conserved sequences (a total of about 5 million nucleotide 
pairs), representing parts of the genome that are most likely to be functionally 
important. Though these sequences are conserved strongly they are not con- 
served perfectly, and when the version in one species is compared with that in 
another they are generally found to have drifted apart by a small amount corre- 
sponding simply to the time elapsed since the last common ancestor. In a small 
proportion of cases, however, one sees signs of a sudden evolutionary spurt. For 
example, some DNA sequences that have been highly conserved in other mam- 
malian species are found to have changed exceptionally fast during the six mil- 
lion years of human evolution since we diverged from the chimpanzees. Such 
human accelerated regions (HARs) are thought to reflect functions that have 
been especially important in making us different in some useful way. 

About 50 such sites were identified in one study, one-fourth of which were 
located near genes associated with neural development. The sequence exhibit- 
ing the most rapid change (18 changes between human and chimp, compared to 
only two changes between chimp and chicken) was examined further and found 
to encode a 118-nucleotide noncoding RNA molecule that is produced in the 
human cerebral cortex at a critical time during brain development (Figure 
4-84). Although the function of this HAR1F RNA is not yet known, this exciting 
finding is stimulating further studies that will hopefully shed light on crucial fea- 
tures of the human brain. 

Gene Duplication Provides an Important Source of Genetic 
Novelty During Evolution 

Evolution depends on the creation of new genes, as well as on the modification 
of those that already exist. How does this occur? When we compare organisms 
that seem very different — a primate with a rodent, for example, or a mouse with 



254 Chapter 4: DNA, Chromosomes, and Genomes 






CRESYL VIOLET STAIN 




outer 
surface 
of cortex ] 




-,.' 



1 


gs- 


* 


?.*r 


* 




» 


% 


> 




It 




> 


$ 


A 


▲ 


(B) 








1 mm 



Figure 4-84 Initial characterization of a 
new gene detected as a previously 
conserved DNA sequence that evolved 
rapidly in humans. (A) Drawing by Ramon 
y Cajal of the outer surface of the human 
neocortex, highlighting the Cajal-Retzius 
neurons. (B) Tissue slices from an 
embryonic human brain showing part of 
the cortex, with the region containing the 
Cajal-Retzius neurons highlighted in 
yellow. Upper photograph: cresyl violet 
stain. Lower photograph: in situ 
hybridization. The red arrows indicate the 
cells that produce HAR1 F RNA as detected 
by in situ hybridization (blue). HAR1 F is a 
novel noncoding RNA that has evolved 
rapidly in the human lineage leading from 
the great apes. The Cajal-Retzius neurons 
make this RNA at the time when the 
neocortex is developing. The results are 
intriguing, because a large neocortex is 
special to humans; for the behavior of cells 
in forming this cortex, see Figure 22-99. 
(Adapted from K.S. Pollard et al., Nature 
443:1 67-1 72, 2006. With permission from 
Macmillan Publishers Ltd.) 



a fish — we rarely encounter genes in the one species that have no homolog in 
the other. Genes without homologous counterparts are relatively scarce even 
when we compare such divergent organisms as a mammal and a worm. On the 
other hand, we frequently find gene families that have different numbers of 
members in different species. To create such families, genes have been repeat- 
edly duplicated, and the copies have then diverged to take on new functions that 
often vary from one species to another. 

The genes encoding nuclear hormone receptors in humans, a nematode 
worm, and a fruit fly illustrate this point (Figure 4-85). Many of the subtypes of 
these nuclear receptors (also called intracellular receptors) have close homologs 
in all three organisms that are more similar to each other than they are to other 
family subtypes present in the same species. Therefore, much of the functional 
divergence of this large gene family must have preceded the divergence of these 
three evolutionary lineages. Subsequently, one major branch of the gene family 
underwent an enormous expansion in the worm lineage only. Similar, but 
smaller, lineage-specific expansions of particular subtypes are evident through- 
out the gene family tree. 

Gene duplication occurs at high rates in all evolutionary lineages, con- 
tributing to the vigorous process of DNA addition discussed previously. In a 
detailed study of spontaneous duplications in yeast, duplications of 50,000 to 
250,000 nucleotide pairs were commonly observed, most of which were 
tandemly repeated. These appeared to result from DNA replication errors that 
led to the inexact repair of double-strand chromosome breaks. A comparison of 
the human and chimpanzee genomes reveals that, since the time that these two 
organisms diverged, segmental duplications have added about 5 million 
nucleotide pairs to each genome every million years, with an average duplica- 
tion size being about 50,000 nucleotide pairs (however, there are duplications 
five times larger, as in yeast). In fact, if one counts nucleotides, duplication 
events have created more differences between our two species than have single 
nucleotide substitutions. 



Duplicated Genes Diverge 

A major question in genome evolution concerns the fate of newly duplicated 
genes. In most cases, there is presumed to be little or no selection — at least ini- 
tially — to maintain the duplicated state since either copy can provide an equiv- 
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alent function. Hence, many duplication events are likely to be followed by loss- 
of-function mutations in one or the other gene. This cycle would functionally 
restore the one-gene state that preceded the duplication. Indeed, there are many 
examples in contemporary genomes where one copy of a duplicated gene can be 
seen to have become irreversibly inactivated by multiple mutations. Over time, 
the sequence similarity between such a pseudogene and the functional gene 
whose duplication produced it would be expected to be eroded by the accumu- 
lation of many mutations in the pseudogene — the homologous relationship 
eventually becoming undetectable. 

An alternative fate for gene duplications is for both copies to remain func- 
tional, while diverging in their sequence and pattern of expression, thus taking 
on different roles. This process of "duplication and divergence" almost certainly 
explains the presence of large families of genes with related functions in biolog- 
ically complex organisms, and it is thought to play a critical role in the evolution 
of increased biological complexity. An examination of many different eucaryotic 
genomes suggests that the probability that any particular gene will undergo a 
duplication event that spreads to most or all individuals in a species is approxi- 
mately 1% every million years. 

Whole-genome duplications offer particularly dramatic examples of the 
duplication-divergence cycle. A whole-genome duplication can occur quite 
simply: all that is required is one round of genome replication in a germline cell 
lineage without a corresponding cell division. Initially, the chromosome number 
simply doubles. Such abrupt increases in the ploidy of an organism are com- 
mon, particularly in fungi and plants. After a whole-genome duplication, all 
genes exist as duplicate copies. However, unless the duplication event occurred 
so recently that there has been little time for subsequent alterations in genome 
structure, the results of a series of segmental duplications — occurring at differ- 
ent times — are very hard to distinguish from the end product of a whole- 
genome duplication. In mammals, for example, the role of whole -genome dupli- 
cations versus a series of piecemeal duplications of DNA segments is quite 
uncertain. Nevertheless, it is clear that a great deal of gene duplication has 
ocurred in the distant past. 

Analysis of the genome of the zebrafish, in which either a whole-genome 
duplication or a series of more local duplications occurred hundreds of millions 
of years ago, has cast some light on the process of gene duplication and diver- 
gence. Although many duplicates of zebrafish genes appear to have been lost by 
mutation, a significant fraction — perhaps as many as 30-50% — have diverged 
functionally while both copies have remained active. In many cases, the most 
obvious functional difference between the duplicated genes is that they are 
expressed in different tissues or at different stages of development (see Figure 
22-46). One attractive theory to explain such an end result imagines that differ- 
ent, mildly deleterious mutations occur quickly in both copies of a duplicated 
gene set. For example, one copy might lose expression in a particular tissue as a 
result of a regulatory mutation, while the other copy loses expression in a second 
tissue. Following such an occurrence, both gene copies would be required to 
provide the full range of functions that were once supplied by a single gene; 
hence, both copies would now be protected from loss through inactivating 
mutations. Over a longer period, each copy could then undergo further changes 
through which it could acquire new, specialized features. 
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Figure 4-85 A phylogenetic tree based 
on the inferred protein sequences for all 
nuclear hormone receptors encoded in 
the genomes of human {H. sapiens), a 
nematode worm (C elegans), and a fruit 
fly (D. melanogaster). Triangles represent 
protein subfamilies that have expanded 
within individual evolutionary lineages; 
the width of these triangles indicates the 
number of genes encoding members of 
these subfamilies. Colored vertical bars 
represent a single gene. There is no 
simple pattern to the historical 
duplications and divergences that have 
created the gene families encoding 
nuclear receptors in the three 
contemporary organisms. The family of 
nuclear hormone receptors is described 
in Figure 15-1 4. These proteins function 
in cell signaling and gene regulation. 
(Adapted from International Human 
Genome Sequencing Consortium, Nature 
409:860-921 , 2001 . With permission from 
Macmillan Publishers Ltd.) 
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Figure 4-86 A comparison of the structure of one-chain and four-chain 
globins. The four-chain globin shown is hemoglobin, which is a complex of 
two ot-globin and two p-globin chains. The one-chain globin in some 
primitive vertebrates forms a dimer that dissociates when it binds oxygen, 
representing an intermediate in the evolution of the four-chain globin. 



The Evolution of the Globin Gene Family Shows How DNA 
Duplications Contribute to the Evolution of Organisms 

The globin gene family provides an especially good example of how DNA dupli- 
cation generates new proteins, because its evolutionary history has been worked 
out particularly well. The unmistakable similarities in amino acid sequence and 
structure among the present-day globins indicate that they all must derive from 
a common ancestral gene, even though some are now encoded by widely sepa- 
rated genes in the mammalian genome. 

We can reconstruct some of the past events that produced the various types 
of oxygen-carrying hemoglobin molecules by considering the different forms of 
the protein in organisms at different positions on the phylogenetic tree of life. A 
molecule like hemoglobin was necessary to allow multicellular animals to grow 
to a large size, since large animals could no longer rely on the simple diffusion of 
oxygen through the body surface to oxygenate their tissues adequately Conse- 
quently hemoglobin-like molecules are found in all vertebrates and in many 
invertebrates. The most primitive oxygen- carrying molecule in animals is a 
globin polypeptide chain of about 150 amino acids, which is found in many 
marine worms, insects, and primitive fish. The hemoglobin molecule in more 
complex vertebrates, however, is composed of two kinds of globin chains. It 
appears that about 500 million years ago, during the continuing evolution of 
fish, a series of gene mutations and duplications occurred. These events estab- 
lished two slightly different globin genes, coding for the a- and p-globin chains, 
in the genome of each individual. In modern vertebrates, each hemoglobin 
molecule is a complex of two a chains and two (3 chains (Figure 4-86). The four 
oxygen-binding sites in the (X2P2 molecule interact, allowing a cooperative 
allosteric change in the molecule as it binds and releases oxygen, which enables 
hemoglobin to take up and release oxygen more efficiently than the single-chain 
version. 

Still later, during the evolution of mammals, the [3-chain gene apparently 
underwent duplication and mutation to give rise to a second p-like chain that 
is synthesized specifically in the fetus. The resulting hemoglobin molecule has 
a higher affinity for oxygen than adult hemoglobin and thus helps in the trans- 
fer of oxygen from the mother to the fetus. The gene for the new [Mike chain 
subsequently duplicated and mutated again to produce two new genes, £ and 
y, the £ chain being produced earlier in development (to form a 2 £2) than the 
fetal y chain, which forms CX2Y2. A duplication of the adult P-chain gene 
occurred still later, during primate evolution, to give rise to a 5-globin gene 
and thus to a minor form of hemoglobin (a 2 5 2 ) that is found only in adult pri- 
mates (Figure 4-87). 

Each of these duplicated genes has been modified by point mutations that 
affect the properties of the final hemoglobin molecule, as well as by changes in 
regulatory regions that determine the timing and level of expression of the gene. 
As a result, each globin is made in different amounts at different times of human 
development (see Figure 7-64B). 

The end result of the gene duplication processes that have given rise to the 
diversity of globin chains is seen clearly in the human genes that arose from 



Figure 4-87 An evolutionary scheme for the globin chains that carry 
oxygen in the blood of animals. The scheme emphasizes the p-like globin 
gene family. A relatively recent gene duplication of the y-chain gene 
produced f 3 and -y\ which are fetal p-like chains of identical function. The 
location of the globin genes in the human genome is shown at the top of 
the figure (see also Figure 7-64). 



single-chain globin binds 
one oxygen molecule 




oxygen- 
binding site 
on heme 



EVOLUTION OF A 
SECOND GLOBIN 
CHAIN BY 
GENE DUPLICATION 
FOLLOWED BY 
f MUTATION 




four-chain globin binds four 
oxygen molecules in a 
cooperative manner 



chromosome 
16 

i ; i i 

various 
a genes 



chromosome 
11 



ft 



5 p 



100 
inn 


fetal\ 


/ adult 

1 




* 




J 


3UU 

tran 
sep; 
and 
700 


slocation 

irattng a 


singte-chain 


p genes 


globin 



HOW GENOMES EVOLVE 



257 



the original p gene, which are arranged as a series of homologous DNA 
sequences located within 50,000 nucleotide pairs of one another. A similar 
cluster of a-globin genes is located on a separate human chromosome. 
Because the a- and P-globin gene clusters are on separate chromosomes in 
birds and mammals but are together in the frog Xenopus, it is believed that a 
chromosome translocation event separated the two gene clusters about 300 
million years ago (see Figure 4-87). 

There are several duplicated globin DNA sequences in the a- and P-globin 
gene clusters that are not functional genes but pseudogenes. These have a close 
sequence similarity to the functional genes but have been disabled by mutations 
that prevent their expression. The existence of such pseudogenes makes it clear 
that, as expected, not every DNA duplication leads to a new functional gene. We 
also know that nonfunctional DNA sequences are not rapidly discarded, as indi- 
cated by the large excess of noncoding DNA that is found in mammalian 
genomes. 



Genes Encoding New Proteins Can Be Created by the 
Recombination of Exons 

The role of DNA duplication in evolution is not confined to the expansion of 
gene families. It can also act on a smaller scale to create single genes by string- 
ing together short duplicated segments of DNA. The proteins encoded by genes 
generated in this way can be recognized by the presence of repeating similar 
protein domains, which are covalently linked to one another in series. The 
immunoglobulins (Figure 4-88) and albumins, for example, as well as most 
fibrous proteins (such as collagens) are encoded by genes that have evolved by 
repeated duplications of a primordial DNA sequence. 

In genes that have evolved in this way, as well as in many other genes, each 
separate exon often encodes an individual protein folding unit, or domain. It is 
believed that the organization of DNA coding sequences as a series of such 
exons separated by long introns has greatly facilitated the evolution of new pro- 
teins. The duplications necessary to form a single gene coding for a protein with 
repeating domains, for example, can often occur by breaking and rejoining the 
DNA anywhere in the long introns on either side of an exon; without introns 
there would be only a few sites in the original gene at which a recombinational 
exchange between DNA molecules could duplicate the domain. By enabling the 
duplication to occur by recombination at many potential sites rather than just a 
few, introns increase the probability of a favorable duplication event. 

More generally, we know from genome sequences that the various parts of 
genes — both their individual exons and their regulatory elements — have served 
as modular elements that have been duplicated and moved about the genome 
to create the great diversity of living things. Thus, for example, many present-day 
proteins are formed as a patchwork of domains from different origins, reflecting 
their long evolutionary history (see Figure 3-19). 
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Figure 4-88 Schematic view of an 
antibody (immunoglobulin) molecule. 

This molecule is a complex of two 
identical heavy chains and two identical 
light chains. Each heavy chain contains 
four similar, covalently linked domains, 
while each light chain contains two such 
domains. Each domain is encoded by a 
separate exon, and all of the exons are 
thought to have evolved by the serial 
duplication of a single ancestral exon. 



Neutral Mutations Often Spread to Become Fixed in a Population, 
with a Probability that Depends on Population Size 

In comparisons between two species that have diverged from one another by 
millions of years, it makes little difference which individuals from each species 
are compared. For example, typical human and chimpanzee DNA sequences 
differ from one another by about 1%. In contrast, when the same region of the 
genome is sampled from two different humans, the differences are typically less 
than 0.1%. For more distantly related organisms, the inter-species differences 
overshadow intra-species variation even more dramatically. However, each 
"fixed difference" between the human and the chimpanzee (in other words, each 
difference that is now characteristic of all or nearly all individuals of each 
species) started out as a new mutation in a single individual. If the size of the 
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interbreeding population in which the mutation occurred is N, the initial allele 
frequency of a new mutation would be 1 / (2N) for a diploid organism. How does 
such a rare mutation become fixed in the population, and hence become a char- 
acteristic of the species rather than of a particular individual genome? 

The answer to this question depends on the functional consequences of the 
mutation. If the mutation has a significantly deleterious effect, it will simply be 
eliminated by purifying selection and will not become fixed. (In the most 
extreme case, the individual carrying the mutation will die without producing 
progeny.) Conversely, the rare mutations that confer a major reproductive 
advantage on individuals who inherit them can spread rapidly in the popula- 
tion. Because humans reproduce sexually and genetic recombination occurs 
each time a gamete is formed (discussed in Chapter 5), the genome of each indi- 
vidual who has inherited the mutation will be a unique recombinational mosaic 
of segments inherited from a large number of ancestors. The selected mutation 
along with a modest amount of neighboring sequence — ultimately inherited 
from the individual in which the mutation occurred — will simply be one piece of 
this huge mosaic. 

The great majority of mutations that are not harmful are not beneficial 
either. These selectively neutral mutations can also spread and become fixed in 
a population, and they make a large contribution to the evolutionary change in 
genomes. Their spread is not as rapid as the spread of the rare strongly advanta- 
geous mutations. The process by which such neutral genetic variation is passed 
down through an idealized interbreeding population can be described mathe- 
matically by equations that are surprisingly simple. The idealized model that has 
proven most useful for analyzing human genetic variation assumes a constant 
population size and random mating, as well as selective neutrality for the muta- 
tions. While neither of the first two assumptions is a good description of human 
population history, they nonetheless provide a useful starting point for analyz- 
ing intra-species variation. 

When a new neutral mutation occurs in a constant population of size N that 
is undergoing random mating, the probability that it will ultimately become 
fixed is approximately 1/(2N). For those mutations that do become fixed, the 
average time to fixation is approximately 4N generations. A detailed analysis of 
data on human genetic variation suggests an ancestral population size of 
approximately 10,000 during the period when the current pattern of genetic 
variation was largely established. With a population that has reached this size, 
the probability that a new, selectively neutral mutation would become fixed is 
small (5 x 10~ 5 ), while the average time to fixation would be on the order of 
800,000 years (assuming a 20-year generation time). Thus, while we know that 
the human population has grown enormously since the development of agri- 
culture approximately 15,000 years ago, most of the present-day set of common 
human genetic variants reflects the mixture of variants that was already present 
long before this time, when the human population was still small enough to 
allow their widespread dissemination. 



A Great Deal Can Be Learned from Analyses of the Variation 
Among Humans 

Even though most of the variation among modern humans originates from vari- 
ation present in a comparatively tiny group of ancestors, the number of varia- 
tions encountered is very large. One important source of variation, which was 
missed for many years, is the presence of many duplications and deletions of 
large blocks of DNA. According to one estimate, when any individual human is 
compared with the standard reference genome in the database, one should 
expect to find roughly 100 differences involving long sequence blocks. Some of 
these "copy number variations" will be very common (Figure 4-89), while oth- 
ers will be present in only a minority of humans (Figure 4-90). From an initial 
sampling, nearly half will contain known genes. In retrospect this type of varia- 
tion is not surprising, given the extensive history of DNA addition and DNA loss 
in vertebrate genomes (for example, see Figure 4-79). 
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Figure 4-89 Visualization of a frequent type of variation among humans. About 
half of the humans tested had nine copies of the amylase gene {left), which produces 
an important enzyme that digests starch. In other humans, there has been either 
DNA loss or DNA addition to produce an altered chromosome, resulting from the 
deletion (loss) or the duplication (addition) of a part of this region. To obtain these 
images, stretched chromatin fibers have been hybridized with differently colored 
probes to the two ends of the amylase gene, as indicated. The blue lines mark the 
general paths of the chromatin. They have been determined by a second stain and 
displaced to one side for clarity. (Adapted from A.J. lafrate et al., Nat Genet 
36:949-951, 2004. With permission from Macmillan Publishers Ltd.) 
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The intra-species variations that have been most extensively characterized 
are single-nucleotide polymorphisms (SNPs). These are simply points in the 
genome sequence where one large fraction of the human population has one 
nucleotide, while another substantial fraction has another. Two human 
genomes sampled from the modern world population at random will differ at 
approximately 2.5 x 10 6 such sites (1 per 1300 nucleotide pairs). As will be 
described in the overview of genetics in Chapter 8, mapped sites in the human 
genome that are polymorphic — meaning that there is a reasonable probability 
(generally more than 1%) that the genomes of two individuals will differ at that 
site — are extremely useful for genetic analyses, in which one attempts to associ- 
ate specific traits (phenotypes) with specific DNA sequences for medical or sci- 
entific purposes (see p. 560). 

Against the background of ordinary SNPs inherited from our prehistoric 
ancestors, certain sequences with exceptionally high mutation rates stand out. 
A dramatic example is provided by CA repeats, which are ubiquitous in the 
human genome and in the genomes of other eucaryotes. Sequences with the 
motif (CA) n are replicated with relatively low fidelity because of a slippage that 
occurs between the template and the newly synthesized strands during DNA 
replication; hence, the precise value of n can vary over a considerable range 
from one genome to the next. These repeats make ideal DNA-based genetic 
markers, since most humans are heterozygous — carrying two values of n at any 
particular CA repeat, having inherited one repeat length (n) from their mother 
and a different repeat length from their father. While the value of n changes suf- 
ficiently rarely that most parent-child transmissions propagate CA repeats faith- 
fully, the changes are sufficiently frequent to maintain high levels of heterozy- 
gosity in the human population. These and some other simple repeats that dis- 
play exceptionally high variability therefore provide the basis for identifying 
individuals by DNA analysis in crime investigations, paternity suits, and other 
forensic applications (see Figure 8-47). 

While most of the SNPs and copy number variations in the human genome 
sequence are thought to have no effect on phenotype, a subset of them must be 
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Figure 4-90 Detection of copy number 
variants on human chromosome 17. 

When 100 individuals were tested by a 
DNA microarray analysis that detects the 
copy number of DNA sequences 
throughout the entire length of this 
chromosome, the indicated distributions 
of DNA additions {green bars) and DNA 
losses {red bars) were observed compared 
with an arbitrary human reference 
sequence. The shortest red and green bars 
represent a single occurrence among the 
200 chromosomes examined, whereas 
the longer bars indicate that the addition 
or loss was correspondingly more 
frequent. The results show preferred 
regions where the variations occur, and 
these tend to be in or near regions that 
already contain blocks of segmental 
duplications. Many of the changes 
include known genes. (Adapted from 
J.L. Freeman et al., Genome Res. 
16:949-961, 2006. With permission from 
Cold Spring Harbor Laboratory Press.) 
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responsible for nearly all the heritable aspects of human individuality. We know 
that even a single nucleotide change that alters one amino acid in a protein can 
cause a serious disease, as for example in sickle cell anemia, which is caused by 
such a mutation in hemoglobin. <TTTT> We also know that gene dosage— a 
doubling or halving of the copy number of some genes — can have a profound 
effect on human development by altering the level of gene product. There is 
therefore every reason to suppose that some of the many differences between 
any two human beings will have substantial effects on human health, physiol- 
ogy, and behavior, whether they be SNPs or copy number variations. The major 
challenge in human genetics is to learn to recognize those relatively few varia- 
tions that are functionally important against a large background of neutral vari- 
ation in the genomes of different humans. 



Summary 

Comparisons of the nucleotide sequences of present-day genomes have revolutionized 
our understanding of gene and genome evolution. Because of the extremely high 
fidelity of DNA replication and DNA repair processes, random errors in maintaining 
the nucleotide sequences in genomes occur so rarely that only about one nucleotide in 
1000 is altered every million years in any particular line of descent. Not surprisingly 
therefore, a comparison of human and chimpanzee chromosomes — which are sepa- 
rated by about 6 million years of evolution — reveals very few changes. Not only are our 
genes essentially the same, but their order on each chromosome is almost identical. 
Although a substantial number of segmental duplications and segmental deletions 
have occurred in the past 6 million years, even the positions of the transposable ele- 
ments that make up a major portion of our noncodingDNA are mostly unchanged. 

When one compares the genomes of two more distantly related organisms—such 
as a human and a mouse, separated by about 80 million years — one finds many more 
changes. Now the effects of natural selection can be clearly seen: through purifying 
selection, essential nucleotide sequences — both in regulatory regions and in coding 
sequences (exon sequences)— have been highly conserved. In contrast, nonessential 
sequences (for example, much of the DNA in introns) have been altered to such an 
extent that an accurate alignment according to ancestry is frequently not possible. 

Because of purifying selection, the comparison of the genome sequences of multiple 
related species is an especially powerful way to find DNA sequences with important 
functions. Although about 5% of the human genome has been conserved as a result of 
purifying selection, the function of the majority of this DNA (tens of thousands ofmul- 
tispecies conserved sequences) remains mysterious. Future experiments characterizing 
their functions should teach us a great deal about vertebrate biology. 

Other sequence comparisons show that a great deal of the genetic complexity of 
present-day organisms is due to the expansion of ancient gene families. DNA duplica- 
tion followed by sequence divergence has clearly been a major source of genetic novelty 
during evolution. The genomes of any two humans will differ from each other both 
because of nucleotide substitutions (single nucleotide polymorphisms, or SNPs) and 
because of inherited DNA gains and DNA losses that cause copy number variants. 
Understanding these differences will improve both medicine and our understanding of 
human biology. 



PROBLEMS 4 ~ 3 Tne f° ur core mstones are relatively small proteins 

with a very high proportion of positively charged amino 

Which statements are true? Explain why or why not. acids; the positive charge helps the histones bind tightly to 

„ - tt r i i ™ i.rr i DNA, regardless of its nucleotide sequence. 
4-1 Human females have 23 different chromosomes, 

whereas human males have 24. 4-4 Nucleosomes bind DNA so tightly that they cannot 

a -> Tr, o ™ m • u + +u ™ T a r i move from the positions where they are first assembled. 
4-2 In a comparison between the DNAs of related organ- 

isms such as humans and mice, identifying the conserved 4-5 Gene duplication and divergence is thought to have 

DNA sequences facilitates the search for functionally played a critical role in the evolution of increased biological 

important regions. complexity. 
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Figure Q4-1 Three nucleotides from the 
interior of a single strand of DNA (Problem 
4-7). Arrows at the ends of the DNA strand 
indicate that the structure continues in 
both directions. 

Discuss the following problems. 

4-6 DNA isolated from the bacterial 
virus M13 contains 25% A, 33% T, 22% C, 
and 20% G. Do these results strike you as 
peculiar? Why or why not? How might you 
explain these values? 

4-7 A segment of DNA from the interior 
of a single strand is shown in Figure Q4-1 . 
What is the polarity of this DNA from top 
to bottom? 

4-8 Human DNA contains 20% C on a 
molar basis. What are the mole percents of 
A,G, andT? 

4-9 Chromosome 3 in orangutans dif- 
fers from chromosome 3 in humans by 
two inversion events (Figure Q4-2). Draw 
the intermediate chromosome that 
resulted from the first inversion and 
explicitly indicate the segments included 
in each inversion. 
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Figure Q4-2 Chromosome 3 in 
orangutans and humans (Problem 
4-9). Differently colored blocks 
indicate segments of the 
chromosomes that were derived by 
previous fusions. 
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human 



4-10 Assuming that the 30-nm chromatin fiber contains 
about 20 nucleosomes (200 bp/nucleosome) per 50 nm of 
length, calculate the degree of compaction of DNA associ- 
ated with this level of chromatin structure. What fraction of 
the 10,000-fold condensation that occurs at mitosis does 
this level of DNA packing represent? 

4-11 In contrast to histone acetylation, which always cor- 
relates with gene activation, histone methylation can lead to 
either transcriptional activation or repression. How do you 
suppose that the same modification — methylation — can 
mediate different biological outcomes? 

4-12 Why is a chromosome with two centromeres (a 
dicentric chromosome) unstable? Would a back-up cen- 
tromere not be a good thing for a chromosome, giving it two 
chances to form a kinetochore and attach to microtubules 
during mitosis? Would that not help to ensure that the chro- 
mosome did not get left behind at mitosis? 

4-13 HP1 proteins, a family of proteins found in hete- 
rochromatin, are implicated in gene silencing and chromatin 
structure. The three proteins in humans — HP la, HPip, and 
HPly — share a highly conserved chromodomain, which is 
thought to direct chromatin localization. To determine 
whether these proteins could bind to the histone H3 N-ter- 
minus, you have covalently attached to separate beads vari- 
ous versions of the H3 N-terminal peptide— unmodified, 



HA Figure Q4-3 Pull-down assays 
unmodK9-Me S10-P unmod to determine binding 

i n ii ii i specificity of HP 1 proteins 

, UB iub IUBIUB (problem Each protein at 
■*'- -~" «**-.»* the left was detected by 
* •* mm. «• «* immunoblotting using a 
««.».. «* * mm* mm, specific antibody after 

separation by SDS-polyacryl- 
amide gel electrophoresis. For 
- ~~~ ;~~ each histone N-terminal 
**«. ** «•*--* \mm«» peptide the total input protein 
(I), the unbound protein (U), 
and the bound protein (B) are 
indicated. (Adapted from 
M. Lachner et al., Nature 
410:1 16-120, 2001. With 
permission from Macmillan 
Publishers Ltd.) 



Lys-9-dimethylated (K9-Me), and Ser-10-phosphorylated 
(Sio-P)— along with an unmodified tail from histone H4. 
This arrangement allows you to incubate the beads with var- 
ious proteins, wash away unbound proteins, and then elute 
bound proteins for assay by Western blotting. The results of 
your 'pull-down' assay for the HP1 proteins are shown in 
Figure Q4-3, along with the results from several control pro- 
teins, including Pax5, a gene regulatory protein, polycomb 
protein Pel, which is known to bind to histones, and 
Suv39hl, a histone methyltransf erase. 

Based on these results, which of the proteins tested bind 
to the unmodified tails of histones? Do any of the HP1 pro- 
teins and control proteins selectively bind to the modified 
histone N-terminal peptides? What histone modification 
would you predict would be found in hetero chromatin? 

4-14 Mobile pieces of DNA — transposable elements — that 
insert themselves into chromosomes and accumulate during 
evolution make up more than 40% of the human genome. 
Transposable elements of four types— long interspersed ele- 
ments (LINEs), short interspersed elements (SINEs), LTR 
retrotransposons, and DNA transposons— are inserted more 
or less randomly throughout the human genome. These ele- 
ments are conspicuously rare at the four homeobox gene 
clusters, HoxA, HoxB, HoxC, and HoxD, as illustrated for 
HoxD in Figure Q4-4, along with an equivalent region of 
chromosome 22, which lacks a Hox cluster. Each Hox cluster 
is about 100 kb in length and contains 9 to 11 genes, whose 
differential expression along the anteroposterior axis of the 
developing embryo establishes the basic body plan for 
humans (and for other animals). Why do you suppose that 
transposable elements are so rare in the Hox clusters? 



chromosome 22 




100 kb 



HoxD cluster 



Figure Q4-4 Transposable elements and genes in 1 Mb regions of 
chromosomes 2 and 22 (Problem 4-1 4). Lines that project upward 
indicate exons of known genes. Lines that project downward 
indicate transposable elements; they are so numerous (constituting 
more than 40% of the human genome) that they merge into nearly 
a solid block outside the Hox clusters. (Adapted from E. Lander et al., 
Nature 409:860-921, 2001. With permission from Macmillan 
Publishers Ltd.) 
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DNA Replication, Repair, 
and Recombination 




The ability of cells to maintain a high degree of order in a chaotic universe 
depends upon the accurate duplication of vast quantities of genetic informa- 
tion carried in chemical form as DNA. This process, called DNA replication, 
must occur before a cell can produce two genetically identical daughter cells. 
Maintaining order also requires the continued surveillance and repair of this 
genetic information because DNA inside cells is repeatedly damaged by chem- 
icals and radiation from the environment, as well as by thermal accidents and 
reactive molecules generated inside the cell. In this chapter, we describe the 
protein machines that replicate and repair the cell's DNA. These machines cat- 
alyze some of the most rapid and accurate processes that take place within 
cells, and their mechanisms clearly demonstrate the elegance and efficiency of 
cell chemistry. 

While the short-term survival of a cell can depend on preventing changes in 
its DNA, the long-term survival of a species requires that DNA sequences be 
changeable over many generations. Despite the great efforts that cells make to 
protect their DNA, occasional changes in DNA sequences do occur. Over time, 
these changes provide the genetic variation upon which selection pressures act 
during the evolution of organisms. 

We begin this chapter with a brief discussion of the changes that occur in 
DNA as it is passed down from generation to generation. Next, we discuss the 
cell mechanisms — DNA replication and DNA repair — that are responsible for 
minimizing these changes. Finally, we consider some of the most intriguing 
pathways that alter DNA sequences — those of DNA recombination, including 
the movement of special DNA sequences in chromosomes called transposable 
elements. 
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THE MAINTENANCE OF DNA SEQUENCES 

Although, as just pointed out, occasional genetic changes enhance the long- 
term survival of a species, the survival of the individual demands a high degree 
of genetic stability. Only rarely do the cell's DNA-maintenance processes fail, 
resulting in permanent change in the DNA. Such a change is called a mutation, 
and it can destroy an organism if it occurs in a vital position in the DNA 
sequence. 



Mutation Rates Are Extremely Low 

The mutation rate, the rate at which observable changes occur in DNA 
sequences, can be determined directly from experiments carried out with a bac- 
terium such as Escherichia coli — a resident of our intestinal tract and a com- 
monly used laboratory organism (discussed in Chapter 1). Under laboratory con- 
ditions, E. coli divides about once every 40 minutes, and a single cell can gener- 
ate a very large population — several billion — in less than a day. In such a popula- 
tion, it is possible to detect the small fraction of bacteria that have suffered a 
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damaging mutation in a particular gene, if that gene is not required for the bac- 
terium's survival. For example, the mutation rate of a gene specifically required 
for cells to use the sugar lactose as an energy source can be determined when 
the cells are grown in the presence of a different sugar, such as glucose. The frac- 
tion of damaged genes underestimates the actual mutation rate because many 
mutations are silent (for example, those that change a codon but not the amino 
acid it specifies, or those that change an amino acid without affecting the activ- 
ity of the protein coded for by the gene). After correcting for these silent muta- 
tions, one finds that a single gene that encodes an average-sized protein (~10 3 
coding nucleotide pairs) accumulates a mutation (not necessarily one that 
would inactivate the protein) about once in about 10 6 bacterial cell generations. 
Stated differently bacteria display a mutation rate of about 1 nucleotide change 
per 10 9 nucleotides per cell generation. 

Recently it has become possible to measure the germ line mutation rate 
directly in more complex, sexually reproducing organisms such as the nemo- 
tode C. elegans. These worms, whose generation time is 4 days, were grown for 
many generations using their self-fertilization mode of reproduction (discussed 
in Chapter 22). The DNA sequence of a large region of the genome was then 
determined for many different descendent worms and compared with that of 
the progenitor worm. This analysis showed, on average, two new mutations 
(mostly short insertions and deletions) arise in the haploid genome each gener- 
ation. When the number of cell divisions needed to produce sperm and eggs is 
taken into account, the mutation rate is roughly 1 mutation per 10 9 nucleotides 
per cell division, a rate remarkably similar to that in the asexually reproducing E. 
coli described above. 

Direct measurement of the germ-line mutation rate in mammals is more 
difficult, but indirect estimates can be obtained. One way is to compare the 
amino acid sequences of the same protein in several species. The fraction of 
the amino acids that differ between any two species can then be compared 
with the estimated number of years since that pair of species diverged from a 
common ancestor, as determined from the fossil record. Using this method, 
one can calculate the number of years that elapse, on average, before an inher- 
ited change in the amino acid sequence of a protein becomes fixed in an 
organism. Because each such change usually reflects the alteration of a single 
nucleotide in the DNA sequence of the gene encoding that protein, we can use 
this value to estimate the average number of years required to produce a sin- 
gle, stable mutation in the gene. 

These calculations will nearly always substantially underestimate the actual 
mutation rate, because many mutations will spoil the function of the protein 
and vanish from the population because of natural selection— that Is. by the 
preferential death of the organisms that contain them. But the sequence of one 
family of protein fragments does not seem to matter, allowing the genes that 
encode them to accumulate mutations without being selected against .These are 
the fibrinopeptides, fragments twenty amino acids long that are discarded when 
the protein fibrinogen is activated to form fibrin during blood clotting. Since the 
function of fibrinopeptides apparently does not depend on their amino acid 
sequence, fibrinopeptides can tolerate almost any amino acid change. Sequence 
comparisons of the fibrinopeptides can therefore be used to estimate the muta- 
tion rate in the germ line. As determined from these studies, a typical protein of 
400 amino acids will suffer an amino acid alteration roughly once every 200,000 
years. 

Another way to estimate mutation rates in humans is to use DNA sequenc- 
ing to compare corresponding nucleotide sequences directly from closely 
related species in regions of the genome that do not appear to carry critical 
information. As expected, such comparisons produce estimates of the mutation 
rate that agree with those obtained from the fibrinopeptide studies. 

E. coli, worms and humans differ greatly in their modes of reproduction and 
in their generation times. Yet, when the mutation rates of each are normalized to 
a single round of DNA replication, they are found to be similar: roughly 1 
nucleotide change per 10 9 nucleotides each time that DNA is replicated. 
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Figure 5-1 Germ-line cells and somatic 
cells carry out fundamentally different 
functions. In sexually reproducing 
organisms, the germ-line cells (red) 
propagate genetic information into the 
next generation. Somatic cells (blue), 
which form the body of the organism, are 
necessary for the survival of germ-line 
cells but do not themselves leave any 
progeny. 



Low Mutation Rates Are Necessary for Life as We Know It 

Since many mutations are deleterious, no species can afford to allow them to 
accumulate at a high rate in its germ cells. Although the observed mutation fre- 
quency is low, it is nevertheless thought to limit the number of essential proteins 
that any organism can encode to perhaps 50,000. By an extension of the same 
argument, a mutation frequency tenfold higher would limit an organism to 
about 5000 essential genes. In this case, evolution would have been limited to 
organisms considerably less complex than a fruit fly. 

The cells of a sexually reproducing organism are of two types: germ cells and 
somatic cells. The germ cells transmit genetic information from parent to off- 
spring; the somatic cells form the body of the organism (Figure 5-1). We have 
seen that germ cells must be protected against high rates of mutation to main- 
tain the species. However, the somatic cells of multicellular organisms must also 
be protected from genetic change to safeguard each individual. Nucleotide 
changes in somatic cells can give rise to variant cells, some of which, through 
natural selection, proliferate rapidly at the expense of the rest of the organism. 
In an extreme case, the result is an uncontrolled cell proliferation known as can- 
cer, a disease that causes more than 20% of the deaths each year in Europe and 
North America. These deaths are due largely to an accumulation of changes in 
the DNA sequences of somatic cells (discussed in Chapter 23). A significant 
increase in the mutation frequency would presumably cause a disastrous 
increase in the incidence of cancer by accelerating the rate at which somatic cell 
variants arise. Thus, both for the perpetuation of a species with a large number 
of genes (germ cell stability) and for the prevention of cancer resulting from 
mutations in somatic cells (somatic cell stability), multicellular organisms like 
ourselves depend on the remarkably high fidelity with which their DNA 
sequences are replicated and maintained. 



Summary 

In all cells, DNA sequences are maintained and replicated with high fidelity. The 
mutation rate, approximately 1 nucleotide change per 10 9 nucleotides each time the 
DNA is replicated, is roughly the same for organisms as different as bacteria and 
humans. Because of this remarkable accuracy the sequence of the human genome 
(approximately 3x 10 9 nucleotide pairs) is changed by only about 3 nucleotides each 
time a cell divides. This allows most humans to pass accurate genetic instructions 
from one generation to the next, and also to avoid the changes in somatic cells that 
lead to cancer. 
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DNA REPLICATION MECHANISMS 

All organisms must duplicate their DNA with extraordinary accuracy before 
each cell division. In this section, we explore how an elaborate "replication 
machine" achieves this accuracy while duplicating DNA at rates as high as 1000 
nucleotides per second. 



Base-Pairing Underlies DNA Replication and DNA Repair 

As introduced in Chapter 1, DNA templating is the mechanism the cell uses to 
copy the nucleotide sequence of one DNA strand into a complementary DNA 
sequence (Figure 5-2). This process entails the recognition of each nucleotide in 
the DNA template strandby a free (unpolymerized) complementary nucleotide, 
and it requires the separation of the two strands of the DNA helix. This separa- 
tion exposes the hydrogen-bond donor and acceptor groups on each DNA base 
for base-pairing with the appropriate incoming free nucleotide, aligning it for its 
enzyme-catalyzed polymerization into a new DNA chain. 

The first nucleotide-polymerizing enzyme, DNA polymerase, was discov- 
ered in 1957. The free nucleotides that serve as substrates for this enzyme were 
found to be deoxyribonucleoside triphosphates, and their polymerization into 
DNA required a single-stranded DNA template. Figure 5-3 and Figure 5-4 illus- 
trate the stepwise mechanism of this reaction. 



The DNA Replication Fork Is Asymmetrical 

During DNA replication inside a cell, each of the two original DNA strands 
serves as a template for the formation of an entire new strand. Because each of 
the two daughters of a dividing cell inherits a new DNA double helix containing 
one original and one new strand (Figure 5-5), the DNA double helix is said to be 
replicated "semiconservatively" by DNA polymerase. How is this feat accom- 
plished? 

Analyses carried out in the early 1960s on whole replicating chromosomes 
revealed a localized region of replication that moves progressively along the 
parental DNA double helix. Because of its Y-shaped structure, this active region 
is called a replication fork (Figure 5-6 J. At the replication fork, a mult [enzyme 
complex that contains the DNA polymerase synthesizes the DNA of both new 
daughter strands. 
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Figure 5-2 The DNA double helix acts as a template for its own 
duplication. Because the nucleotide A will pair successfully only with T, 
and G only with C, each strand of DNA can serve as a template to specify 
the sequence of nucleotides in its complementary strand by DNA base- 
pairing. In this way, a double-helical DNA molecule can be copied precisely. 
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Figure 5-3 The chemistry of DNA synthesis. The addition of a 
deoxyribonucleotide to the 3' end of a polynucleotide chain {the primer 
strand) is the fundamental reaction by which DNA is synthesized. As 
shown, base-pairing between an incoming deoxyribonucleoside 
triphosphate and an existing strand of DNA (the template strand) guides 
the formation of the new strand of DNA and causes it to have a 
complementary nucleotide sequence. 



Initially, the simplest mechanism of DNA replication seemed to be the con- 
tinuous growth of both new strands, nucleotide by nucleotide, at the replication 
fork as it moves from one end of a DNA molecule to the other. But because of the 
antiparallel orientation of the two DNA strands in the DNA double helix (see Fig- 
ure 5-2), this mechanism would require one daughter strand to polymerize in 
the 5'-to-3' direction and the other in the 3'-to-5' direction. Such a replication 
fork would require two distinct types of DNA polymerase enzymes. However, all 
of the many DNA polymerases that have been discovered can synthesize only in 
the 5'-to-3' direction. 

How, then, can a DNA strand grow in the 3'-to-5' direction? The answer was 
first suggested by the results of an experiment performed in the late 1960s. 
Researchers added highly radioactive 3 H-thymidine to dividing bacteria for a 
few seconds, so that only the most recently replicated DNA— that just behind 
the replication fork— became radiolabeled. This experiment revealed the tran- 
sient existence of pieces of DNA that were 1000-2000 nucleotides long, now 
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Figure 5-4 DNA synthesis catalyzed by DNA polymerase. (A) As indicated, DNA polymerase catalyzes the 
stepwise addition of a deoxyribonucleotide to the 3'-OH end of a polynucleotide chain, the primer strand 
that is paired to a second template strand. The newly synthesized DNA strand therefore polymerizes in the 
5'-to-3' direction as shown in the previous figure. Because each incoming deoxyribonucleoside 
triphosphate must pair with the template strand to be recognized by the DNA polymerase, this strand 
determines which of the four possible deoxyribonucleotides (A, C, G, orT) will be added. The reaction is 
driven by a large, favorable free-energy change, caused by the release of pyrophosphate and its 
subsequent hydrolysis to two molecules of inorganic phosphate. (B) The shape of a DNA polymerase 
molecule, as determined by x-ray crystallography. Roughly speaking DNA polymerases resemble a right 
hand in which the palm, fingers, and thumb grasp the DNA and form the active site. In the sequence 
shown, the correct positioning of an incoming deoxynucleoside triphosphate causes the fingers of the 
polymerase to tighten, thereby initiating the nucleotide addition reaction. Dissociation of pyrophosphate 
causes release of the fingers and translocation of the DNA by one nucleotide so the active site of the 
polymerase is ready to receive the next deoxynucleoside triphosphate. 



commonly known as Okazaki fragments, at the growing replication fork. (Simi- 
lar replication intermediates were later found in eucaryotes, where they are only 
100-200 nucleotides long.) The Okazaki fragments were shown to be polymer- 
ized only In the 5'-to-3' chain direction and to be joined together after their syn- 
thesis to create long DNA chains. 

A replication fork therefore has an asymmetric structure (Figure 5-7}. The 
DNA daughter strand that is synthesized continuously is known as the leading 
strand, its synthesis slightly precedes the synthesis of the daughter strand that 
is synthesized disco minuously, known as the lagging strand, hbr the lagging 
strand, the direction of nucleotide polymerization is opposite to the overall 
direction ol DNA chain growth. The synthesis of this strand by a discontinuous 
"backstitching" mechanism means that DNA replication requires only the S'-tO- 
3' type of DNA polymerase, 

The High Fidelity of DNA Replication Requires Several 
Proofreading Mechanisms 

As discussed above, the fidelity of copying DNA during replication is such that 
only about 1 mistake occurs for every 10 9 nucleotides copied. This fidelity is 
much higher than one would expect from the accuracy of complementary base- 



Figure 5-5 The semiconservative nature of DNA replication. In a round of 
replication, each of the two strands of DNA is used as a template for the 
formation of a complementary DNA strand. The original strands therefore 
remain intact through many cell generations. 
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pairing. The standard complementary base pairs (see Figure 4-4) are not the 
only ones possible. For example, with small changes in helix geometry, two 
hydrogen bonds can form between G and T in DNA. In addition, rare tautomeric 
forms of the four DNA bases occur transiently in ratios of 1 part to 10 4 or 10 5 . 
These forms mispair without a change in helix geometry: the rare tautomeric 
form of C pairs with A instead of G, for example. 

If the DNA polymerase did nothing special when a mispairing occurred 
between an incoming deoxyribonucleoside triphosphate and the DNA template, 
the wrong nucleotide would often be incorporated into the new DNA chain, pro- 
ducing frequent mutations. The high fidelity of DNA replication, however, 
depends not only on the initial base-pairing but also on several "proofreading" 
mechanisms that act sequentially to correct any initial mispairings that might 
have occurred. 

DNA polymerase performs the first proofreading step just before a new 
nucleotide is added to the growing chain. Our knowledge of this mechanism 
comes from studies of several different DNA polymerases, including one pro- 
duced by a bacterial virus, T7, that replicates inside E. coll The correct 
nucleotide has a higher affinity for the moving polymerase than does the incor- 
rect nucleotide, because the correct pairing is more energetically favorable. 
Moreover, after nucleotide binding, but before the nucleotide is covalently 
added to the growing chain, the enzyme must undergo a conformational change 
in which its "fingers" tighten around the active site (see Figure 5-4). Because this 
change occurs more readily with correct than incorrect base-pairing, it allows 
the polymerase to "double-check" the exact base-pair geometry before it cat- 
alyzes the addition of the nucleotide. 

The next error-correcting reaction, known as exonucleolytic proofreading, 
takes place immediately after those rare instances in which an incorrect 
nucleotide is covalently added to the growing chain. DNA polymerase enzymes 



Figure 5-6 Two replication forks moving 
in opposite directions on a circular 
chromosome. An active zone of DNA 
replication moves progressively along a 
replicating DNA molecule, creating a 
Y-shaped DNA structure known as a 
replication fork: the two arms of each Y 
are the two daughter DNA molecules, 
and the stem of the Y is the parental DNA 
helix. In this diagram, parental strands are 
orange; newly synthesized strands are 
red. (Micrograph courtesy of Jerome 
Vinograd.) 
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Figure 5-7 The structure of a DNA 
replication fork. Because both daughter 
DNA strands are polymerized in the Site- 
s' direction, the DNA synthesized on the 
lagging strand must be made initially as a 
series of short DNA molecules, called 
Okazaki fragments. On the lagging strand, 
the Okazaki fragments are synthesized 
sequentially, with those nearest the fork 
being the most recently made. 
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Figure 5-8 Exonucleolytic proofreading by DNA polymerase during DNA 
replication. In this example, the mismatch is due to the incorporation of a 
rare, transient tautomeric form of C, indicated by an asterisk. But the same 
proofreading mechanism applies to any misincorporation at the growing 
3'-OH end. The part of DNA polymerase that removes the misincorporated 
nucleotide is a specialized member of a large class of enzymes, known as 
exonudeases, that cleave nucleotides one at a time from the ends of 
polynucleotides. 



are highly discriminating in the types of DNA chains they will elongate: they 
absolutely require a previously formed base-paired 3'-OH end of a primer strand 
(see Figure 5-4). Those DNA molecules with a mismatched (improperly base- 
paired) nucleotide at the 3'-OH end of the primer strand are not effective as tem- 
plates because the polymerase cannot extend such a strand. DNA polymerase 
molecules correct such a mismatched primer strand by means of a separate cat- 
alytic site (either in a separate subunit or in a separate domain of the polymerase 
molecule, depending on the polymerase). This 3-to-S proofreading exonuclease 
clips off any unpaired residues at the primer terminus, continuing until enough 
nucleotides have been removed to regenerate a correctly base-paired 3'-OH ter- 
minus that can prime DNA synthesis. In this way, DNA polymerase functions as 
a "self-correcting" enzyme that removes its own polymerization errors as it 
moves along the DNA (Figure 5-8 and Figure 5-9). 

The self-correcting properties of the DNA polymerase depend on its require- 
ment for a perfectly base-paired primer terminus, and it is apparently not pos- 
sible for such an enzyme to start synthesis de novo. By contrast, the RNA poly- 
merase enzymes involved in gene transcription do not need such an efficient 
exonucleolytic proofreading mechanism: errors in making RNA are not passed 
on to the next generation, and the occasional defective RNA molecule that is 
produced has no long-term significance. RNA polymerases are thus able to start 
new polynucleotide chains without a primer. 

There is an error frequency of about 1 mistake for every 10 4 polymerization 
events both in RNA synthesis and in the separate process of translating mRNA 
sequences into protein sequences. This error rate is 100,000 times greater than 
that in DNA replication, where a series of proofreading processes makes the pro- 
cess unusually accurate (Table 5-1). 
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Figure 5-9 Editing by DNA polymerase. Outline of the structures of DNA 
polymerase complexed with the DNA template in the polymerizing mode 
(left) and the editing mode (right). The catalytic site for the exonucleolytic (E) 
and the polymerization (P) reactions are indicated. In the editing mode, the 
newly synthesized DNA transiently unpairs from the template and the 
polymerase undergoes a conformational change, moving the editing catalytic 
site into place to remove the most recently added nucleotide. <GATT> 
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Table 5-1 The Three Steps That Give Rise to High-Fidelity DNA Synthesis 



REPLICATION STEP 


ERRORS PER NUCLEOTIDE 


5' — » 3' polymerization 


1 in 10 5 


3' —> 5' exonucleolytic proofreading 


1 in 10 2 


Strand-directed mismatch repair 


1 inIO 2 


Combined 


linTO* 



The third step, strand-directed mismatch repair, is described later in this chapter. 



Only DNA Replication in the 5'-to-3' Direction Allows Efficient 
Error Correction 

The need for accuracy probably explains why DNA replication occurs only in the 
5'-to-3' direction. If there were a DNA polymerase that added deoxyribonucleo- 
side triphosphates in the 3'-to-5' direction, the growing 5'-chain end, rather than 
the incoming mononucleotide, would provide the activating triphosphate 
needed for the covalent linkage. In this case, the mistakes in polymerization 
could not be simply hydrolyzed away, because the bare 5'-chain end thus cre- 
ated would immediately terminate DNA synthesis (Figure 5-10). It is therefore 
possible to correct a mismatched base only if it has been added to the 3' end of 
a DNA chain. Although the backstitching mechanism for DNA replication (see 
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Figure 5-10 An explanation for the 
5'-to-3' direction of DNA chain growth. 

Growth in the 5'-to-3' direction, shown on 
the right, allows the chain to continue to 
be elongated when a mistake in 
polymerization has been removed by 
exonucleolytic proofreading (see Figure 
5-8). In contrast, exonucleolytic 
proofreading in the hypothetical 3'-to-5' 
polymerization scheme, shown on the left, 
would block further chain elongation. For 
convenience, only the primer strand of the 
DNA double helix is shown. 
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Figure 5-7) seems complex, it preserves the 5'-to-3' direction of polymerization 
that is required for exonucleolytic proofreading. 

Despite these safeguards against DNA replication errors, DNA polymerases 
occasionally make mistakes. However, as we shall see later, cells have yet 
another chance to correct these errors by a process called strand-directed mis- 
match repair. Before discussing this mechanism, however, we describe the other 
types of proteins that function at the replication fork. 
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A Special Nucleotide-Polymerizing Enzyme Synthesizes Short 
RNA Primer Molecules on the Lagging Strand 

For the leading strand, a special primer is needed only at the start of replication: 
once a replication fork is established, the DNA polymerase is continuously pre- 
sented with a base-paired chain end on which to add new nucleotides. On the 
lagging side of the fork, however, every time the DNA polymerase completes a 
short DNA Okazaki fragment (which takes a few seconds), it must start synthe- 
sizing a completely new fragment at a site further along the template strand (see 
Figure 5-7). A special mechanism produces the base-paired primer strand 
required by this DNA polymerase molecule. The mechanism involves an enzyme 
called DNA primase, which uses ribonucleoside triphosphates to synthesize 
short RNA primers on the lagging strand (Figure 5-11). In eucaryotes, these 
primers are about 10 nucleotides long and are made at intervals of 100-200 
nucleotides on the lagging strand. 

The chemical structure of RNA was introduced in Chapter 1 and is described 
in detail in Chapter 6. Here, we note only that RNA is very similar in structure to 
DNA. A strand of RNA can form base pairs with a strand of DNA, generating a 
DNA/RNA hybrid double helix if the two nucleotide sequences are complemen- 
tary. Thus the same templating principle used for DNA synthesis guides the syn- 
thesis of RNA primers. Because an RNA primer contains a properly base-paired 
nucleotide with a 3'-OH group at one end, it can be elongated by the DNA poly- 
merase at this end to begin an Okazaki fragment. The synthesis of each Okazaki 
fragment ends when this DNA polymerase runs into the RNA primer attached to 
the 5' end of the previous fragment. To produce a continuous DNA chain from 
the many DNA fragments made on the lagging strand, a special DNA repair sys- 
tem acts quickly to erase the old RNA primer and replace it with DNA. An 
enzyme called DNA ligase then joins the 3' end of the new DNA fragment to the 
5' end of the previous one to complete the process (Figures 5-12 and 5-13). 

Why might an erasable RNA primer be preferred to a DNA primer that would 
not need to be erased? The argument that a self-correcting polymerase cannot 
start chains de novo also implies its converse: an enzyme that starts chains anew 
cannot be efficient at self-correction. Thus, any enzyme that primes the synthe- 
sis of Okazaki fragments will of necessity make a relatively inaccurate copy (at 
least 1 error in 10 5 ). Even if the copies retained in the final product constituted 
as little as 5% of the total genome (for example, 10 nucleotides per 200- 
nucleotide DNA fragment), the resulting increase in the overall mutation rate 
would be enormous. It therefore seems likely that the use of RNA rather than 
DNA for priming brings a powerful advantage to the cell: the ribonucleotides in 
the primer automatically mark these sequences as "suspect copy" to be effi- 
ciently removed and replaced. 



Figure 5-12 The synthesis of one of many DNA fragments on the lagging 
strand. In eucaryotes, RNA primers are made at intervals spaced by about 
200 nucleotides on the lagging strand, and each RNA primer is 
approximately 1 nucleotides long. This primer is erased by a special DNA 
repair enzyme (an RNAse H) that recognizes an RNA strand in an RNA/DNA 
helix and fragments it; this leaves gaps that are filled in by DNA polymerase 
and DNA ligase. 
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Figure 5-1 1 RNA primer synthesis. 

A schematic view of the reaction 
catalyzed by DNA primase, the enzyme 
that synthesizes the short RNA primers 
made on the lagging strand using DNA as 
a template. Unlike DNA polymerase, this 
enzyme can start a new polynucleotide 
chain by joining two nucleoside 
triphosphates together. The primase 
synthesizes a short polynucleotide in the 
5'-to-3' direction and then stops, making 
the 3' end of this primer available for the 
DNA polymerase. 
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Special Proteins Help to Open Up the DNA Double Helix in Front 
of the Replication Fork 

For DNA synthesis to proceed, the DNA double helix must be opened up ahead 
of the replication fork so that the incoming deoxyribonucleoside triphosphates 
can form base pairs with the template strand. However, the DNA double helix is 
very stable under physiological conditions; the base pairs are locked in place so 
strongly that it requires temperatures approaching that of boiling water to sep- 
arate the two strands in a test tube. For this reason, two additional types of repli- 
cation proteins — DNA helicases and single-strand DNA-binding proteins — are 
needed to open the double helix and thus provide the appropriate single- 
stranded DNA template for the DNA polymerase to copy. 

DNA helicases were first isolated as proteins that hydrolyze ATP when they 
are bound to single strands of DNA. As described in Chapter 3, the hydrolysis of 
ATP can change the shape of a protein molecule in a cyclical manner that allows 
the protein to perform mechanical work. DNA helicases use this principle to 
propel themselves rapidly along a DNA single strand. When they encounter a 
region of double helix, they continue to move along their strand, thereby prying 
apart the helix at rates of up to 1000 nucleotide pairs per second (Figures 5-14 
and 5-15). 

The two strands of DNA have opposite polarities, and, in principle, a heli- 
case could unwind the DNA double helix by moving in the 5' to 3' direction along 
one strand or in the 3' to 5' direction along the other. In fact, both types of DNA 
helicase exist. In the best understood replication systems in bacteria, a helicase 
moving 5' to 3' along the lagging- strand template appears to have the predomi- 
nant role, for reasons that will become clear shortly 

Single-strand DNA-binding (SSB) proteins, also called helix- destabilizing 
proteins, bind tightly and cooperatively to exposed single- stranded DNA with- 
out covering the bases, which therefore remain available for templating. These 
proteins are unable to open a long DNA helix directly, but they aid helicases by 
stabilizing the unwound, single-stranded conformation. In addition, their coop- 
erative binding coats and straightens out the regions of single -stranded DNA on 
the lagging- strand template, thereby preventing the formation of the short hair- 
pin helices that readily form in single-strand DNA (Figures 5-16 and 5-17). 
These hairpin helices can impede the DNA synthesis catalyzed by DNA poly- 
merase. 



A Sliding Ring Holds a Moving DNA Polymerase onto the DNA 

On their own, most DNA polymerase molecules will synthesize only a short string 
of nucleotides before falling off the DNA template. The tendency to dissociate 



Figure 5-1 4 An assay used to test for DNA helicase enzymes. 

A short DNA fragment is annealed to a long DNA single strand to form a 
region of DNA double helix. The double helix is melted as the helicase runs 
along the DNA single strand, releasing the short DNA fragment in a 
reaction that requires the presence of both the helicase protein and ATP. 
The rapid stepwise movement of the helicase is powered by its ATP 
hydrolysis (see Figure 3-77). As indicated, many DNA helicases are 
composed of six subunits. 



Figure 5-1 3 The reaction catalyzed by 
DNA ligase. This enzyme seals a broken 
phosphodiester bond. As shown, DNA 
ligase uses a molecule of ATP to activate 
the 5' end at the nick (step 1) before 
forming the new bond (step 2). In this 
way, the energetically unfavorable nick- 
sealing reaction is driven by being 
coupled to the energetically favorable 
process of ATP hydrolysis. 
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quickly from a DNA molecule allows a DNA polymerase molecule that has just 
finished synthesizing one Okazaki fragment on the lagging strand to be recycled 
quickly, so as to begin the synthesis of the next Okazaki fragment on the same 
strand. This rapid dissociation, however, would make it difficult for the poly- 
merase to synthesize [he long DNA strands produced at a replication fork were 
il not for an accessory protein that functions as a regulated sliding clamp. This 
clamp keeps the polymerase firmly on the DNA when it is moving, but releases 
it as soon as the polymerase runs into a double-stranded region of DNA. 

How can a sliding clamp prevent the polymerase from dissociating without 
at the same time impeding the polymerase's rapid movement along the DNA 
molecule? The three-dimensional structure of the clamp protein, determined by 
x-ray diffraction, reveals that it forms a large ring around the DNA double helix. 
One side of the ring binds to the back of the DNA polymerase, and the whole ring 
slides freely along the DNA as the polymerase moves. The assembly of the clamp 
around the DNA requires ATP hydrolysis by a special protein complex. I he 
clamp loader, which hydrolyzes ATP as it loads the clamp on to a primer-tem- 
plate junction (Figure 5^18). 

On the leading- strand template, the moving DNA polymerase is tightly 
bound to the clamp, and the two remain associated for a very long time. The 
DNA polymerase on die lagging- strand template also makes use of the clamp, 
but each time the polymerase reaches the 5' end of the preceding Okazaki frag- 
ment, the polymerase releases itself from the clamp and dissociates from the 



Figure 5-1 5 The structure of a DNA 
helicase. <TGCC> (A) A schematic 
diagram of the protein as a hexameric 
ring. (B) Schematic diagram showing a 
DNA replication fork and helicase to 
scale. (C) Detailed structure of the 
bacteriophage T7 replicative helicase, as 
determined by x-ray diffraction. Six 
identical subunits bind and hydrolyze 
ATP in an ordered fashion to propel this 
molecule like a rotary engine along a 
DNA single strand that passes through 
the central hole. Red indicates bound ATP 
molecules in the structure. (B, courtesy of 
Edward H. Egelman; C, from 
M.R. Singleton et al„ Cell 101 :589-600, 
2000. With permission from Elsevier.) 
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Figure 5-16 The effect of single-strand 
DNA-binding proteins (SSB proteins) on 
the structure of single-stranded DNA. 

Because each protein molecule prefers to 
bind next to a previously bound 
molecule, long rows of this protein form 
on a DNA single strand. This cooperative 
binding straightens out the DNA template 
and facilitates the DNA polymerization 
process. The "hairpin helices" shown in 
the bare, single-stranded DNA result from 
a chance matching of short regions of 
complementary nucleotide sequence; 
they are similar to the short helices that 
typically form in RNA molecules (see 
Figure 1-6). 
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Figure 5-1 7 The structure of the human 
single-strand binding protein bound to 
DNA. (A) A front view of the two DNA- 
binding domains of RPA protein, which 
cover a total of eight nucleotides. Note 
that the DNA bases remain exposed in 
this protein-DNA complex. (B) A diagram 
showing the three-dimensional structure, 
with the DNA strand (red) viewed end-on. 
(B, from A. Bochkarev et al., Nature 
385:176-181, 1997. With permission from 
Macmillan Publishers Ltd.) 



template. This polymerase molecule then associates with a new clamp that is 
assembled on the RNA primer of the next Okazaki fragment. 



The Proteins at a Replication Fork Cooperate to Form a 
Replication Machine 

Although we have discussed DNA replication as though it were performed by a 
mix of proteins all acting independently, in reality most of the proteins are held 
together in a large and orderly multienzyme complex that rapidly synthesizes 
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Figure 5-1 8 The regulated 
sliding clamp that holds DNA 
polymerase on the DNA. (A) The 
structure of the clamp protein 
from E. coli, as determined by 
X-ray crystallography, with a 
DNA helix added to indicate how 
the protein fits around DNA. 
(B)The structure of the five- 
subunit clamp loader resembles 
a screw nut with its threads 
matching the grooves of DNA. It 
appears to tighten around the 
primer junction until its further 
progress is blocked by the 3' end 
of the primer, at which point the 
loader hydrolyzes ATP and 
releases the clamp. (C) Schematic 
illustration showing how the 
clamp is assembled to hold a 
moving DNA polymerase 
molecule on the DNA. In the 
simplified reaction shown here, 
the clamp loader dissociates into 
solution once the clamp has 
been assembled. At a true 
replication fork, the clamp loader 
remains close to the lagging- 
strand polymerase, ready to 
assemble a new clamp at the 
start of each new Okazaki 
fragment (see Figure 5-19). 
(A, from X.P Kong et al., Cell 
69:425-437, 1992. With 
permission from Elsevier; C, from 
G.D. Bowman, M. O'Donnell and 
J. Kuriyan, Nature 429:708-709, 
2004. With permission from 
Macmillan Publishers Ltd.) 
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DNA. This complex can be likened to a tiny sewing machine composed of pro- 
tein pans and powered by nucleoside triphosphate hydrolyses. Like a sewing 
machine, the replication complex probably remains stationary with respect to 
its immediate surroundings; the DNA can be thought of as a long strip of cloth 
being rapidly threaded through it. Although the replication complex has been 
most intensively studied in E. coli and several of its viruses, a very similar com- 
plex also operates in eucaryotes, as we see below. 

Figure 5-19A summarizes the functions of the subunits of the replication 
machine. At the front of the replication fork, DNA helicase opens the DNA helix. 
Two DNA polymerase molecules work at the fork, one on the leading strand and 
one on the lagging strand. Whereas the DNA polymerase molecule on the lead- 
ing strand can operate in a continuous fashion, the DNA polymerase molecule 
on the lagging strand must restart at short intervals, using a short RNA primer 
made by a DNA primase molecule. The close association of all these protein 
components increases the efficiency of replication and is made possible by a 
folding back of the lagging strand as shown in Figure 5-19A. This arrangement 
also facilitates the loading of the polymerase clamp each time that an Okazaki 
fragment is synthesized: the clamp loader and the lagging-strand DNA poly- 
merase molecule are kept in place as a part of the protein machine even when 
they detach from tiieir DNA template. The replication proteins are thus linked 
together into a single large unit (total molecular weight >10^ daltonsL enabling 
DNA to be synthesized on both sides of the replication fork in a coordinated and 
efficient manner. 

On the lagging strand, the DNA replication machine leaves behind a series 
of unsealed Okazaki fragments, which still contain the RNA that primed their 
synthesis at their 5' ends. This RNA is removed and the resulting gap is filled in 
by DNA repair enzymes that operate behind the replication fork (see Figure 
5-12). 



A Strand-Directed Mismatch Repair System Removes Replication 
Errors That Escape from the Replication Machine 

As stated previously, bacteria such as E. coli are capable of dividing once every 
40 minutes, making it relatively easy to screen large populations to find a rare 
mutant cell that is altered in a specific process. One interesting class of mutants 
contains alterations in so-called mutator genes, which greatly increase the rate 
of spontaneous mutation. Not surprisingly, one such mutant makes a defective 
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Figure 5-19 An active replication fork. 

(A) This schematic diagram shows a 
current view of the arrangement of 
replication proteins at a replication fork 
when DNA is being synthesized. The 
lagging-strand DNA has been folded to 
bring the lagging-strand DNA 
polymerase molecule into a complex 
with the leading-strand DNA polymerase 
molecule. This folding also brings the 
3' end of each completed Okazaki 
fragment close to the start site for the 
next Okazaki fragment. Because the 
lagging-strand DNA polymerase 
molecule remains bound to the rest of 
the replication proteins, it can be reused 
to synthesize successive Okazaki 
fragments. In this diagram, it is about to 
let go of its completed DNA fragment 
and move to the RNA primer that will be 
synthesized nearby as required to start 
the next DNA fragment. Additional 
proteins (not shown) help to hold the 
different protein components of the fork 
together, enabling them to function as a 
well-coordinated protein machine. 
<AATA> <CCCG> (B) An electron 
micrograph showing the replication 
machine from the bacteriophage T4 as it 
moves along a template synthesizing 
DNA behind it. (C) An interpretation of 
the micrograph is given in the sketch: 
note especially the DNA loop on the 
lagging strand. Apparently the 
replication proteins became partly 
detached from the very front of the 
replication fork during the preparation of 
this sample for electron microscopy. 
(B, courtesy of Jack Griffith; see 
P.D. Chastain et al., J. Biol. Chem. 
278:21276-21285,2003.) 
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form of the 3'-to-5' proofreading exonuclease that is a part of the DNA poly- 
merase enzyme (see Figures 5-8 and 5-9). The mutant DNA polymerase no 
longer proofreads effectively, and many replication errors that would otherwise 
have been removed accumulate in the DNA. 

The study of other E, coli mutants exhibiting abnormally high mutation 
rates has uncovered a proofreading system that removes replication errors made 
by the polymerase that have been missed by the proofreading exonuclease. This 
strand-directed mismatch repair system detects the potential for distortion in 
the DNA helix from the misfit between noncomplementary base pairs. 

If the proofreading system simply recognized a mismatch in newly repli- 
cated DNA and randomly corrected one of the two mismatched nucleotides, it 
would mistakenly "correct" the original template strand to match the error 
exactly half the time, thereby failing to lower the overall error rate. To be effec- 
tive, such a proofreading system must be able to distinguish and remove the 
mismatched nucleotide only on the newly synthesized strand, where the repli- 
cation error occurred. 

The strand-distinction mechanism used by the mismatch proofreading sys- 
tem in E. coli depends on the methylation of selected A residues in the DNA. 
Methyl groups are added to all A residues in the sequence GATC, but not until 
some time after the A has been incorporated into a newly synthesized DNA 
chain. As a result, the only GATC sequences that have not yet been methylated 
are in the new strands just behind a replication fork. The recognition of these 
unmethylated GATCs allows the new DNA strands to be transiently distin- 
guished from old ones, as required if their mismatches are to be selectively 
removed. The three-step process involves recognition of a mismatch, excision of 
the segment of DNA containing the mismatch from the newly synthesized 
strand, and resynthesis of the excised segment using the old strand as a tem- 
plate. This strand-directed mismatch repair system reduces the number of 
errors made during DNA replication by an additional factor of 100 (see Table 
5-1, p. 271). 

A similar mismatch proofreading system functions in human cells (Figure 
5-20). The importance of this system in humans is seen in individuals who 
inherit one defective copy of a mismatch repair gene (along with a functional 
gene on the other copy of the chromosome). These people have a marked pre- 
disposition for certain types of cancers. For example, in a type of colon cancer 
called hereditary nonpolyposis colon cancer (HNPCC), spontaneous mutation of 
the remaining functional gene produces a clone of somatic cells that, because 
they are deficient in mismatch proofreading, accumulate mutations unusually 
rapidly. Most cancers arise in cells that have accumulated multiple mutations 
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Figure 5-20 A model for strand-directed 
mismatch repair in eucaryotes. (A) The 

two proteins shown are present in both 
bacteria and eucaryotic cells: MutS binds 
specifically to a mismatched base pair, 
while MutL scans the nearby DNA for a 
nick. Once MutL finds a nick, it triggers 
the degradation of the nicked strand all 
the way back through the mismatch. 
Because nicks are largely confined to 
newly replicated strands in eucaryotes, 
replication errors are selectively removed. 
In bacteria, the mechanism is the same, 
except that an additional protein in the 
complex (MutH) nicks unmethylated (and 
therefore newly replicated) GATC 
sequences, thereby beginning the 
process illustrated here. (B)The structure 
of the MutS protein bound to a DNA 
mismatch. This protein is a dimer, which 
grips the DNA double helix as shown, 
kinking the DNA at the mismatched base 
pair. It seems that the MutS protein scans 
the DNA for mismatches by testing for 
sites that can be readily kinked, which are 
those without a normal complementary 
base pair. {B, from G. Obmolova et al., 
Nature 407:703-710, 2000. With 
permission from Macmillan 
Publishers Ltd.) 
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(see Figure 20-11), and cells deficient in mismatch proofreading therefore have 
a greatly enhanced chance of becoming cancerous. Fortunately, most of us 
inherit two good copies of each gene that encodes a mismatch proofreading 
protein; this protects us, because it is highly unlikely for both copies to become 
mutated in the same cell. 

In eucaryotes, the mechanism for distinguishing the newly synthesized 
strand from the parental template strand at the site of a mismatch does not 
depend on DNA methylation. Indeed, some eucaryotes — including yeasts and 
Drosophila— do not methylate their DNA. Newly synthesized lagging- strand 
DNA transiently contains nicks (before they are sealed by DNA ligase) and bio- 
chemical experiments reveal that such nicks (also called single-strand breaks) 
provide the signal that directs the mismatch proofreading system to the appro- 
priate strand (see Figure 5-20). This idea also requires that the newly synthe- 
sized DNA on the leading strand be transiently nicked; how this occurs is uncer- 
tain. 



DNATopoisomerases Prevent DNA Tangling During Replication 

As a replication fork moves along double-stranded DNA, it creates what has 
been called the "winding problem." Every 10 base pairs replicated at the fork 
corresponds to one complete turn about the axis of the parental double helix. 
Therefore, for a replication fork to move, the entire chromosome ahead of the 
fork would normally have to rotate rapidly (Figure 5-21). This would require 
large amounts of energy for long chromosomes, and an alternative strategy is 
used instead: a swivel is formed in the DNA helix by proteins known as DNA 
topoisomerases. 

A DNA topoisomerase can be viewed as a reversible nuclease that adds itself 
covalently to a DNA backbone phosphate, thereby breaking a phosphodiester 
bond in a DNA strand. This reaction is reversible, and the phosphodiester bond 
re-forms as the protein leaves. 

One type of topoisomerase, called topoisomerase I, produces a transient sin- 
gle-strand break (or nick); this break in the phosphodiester backbone allows the 
two sections of DNA helix on either side of the nick to rotate freely relative to 
each other, using the phosphodiester bond in the strand opposite the nick as a 
swivel point (Figure 5-22). Any tension in the DNA helix will drive this rotation 
in the direction that relieves the tension. As a result, DNA replication can occur 
with the rotation of only a short length of helix— the part just ahead of the fork. 
Because die covalent linkage that joins die DNA topoisomerase protein to a 
DNA phosphate retains the energy of the cleaved phosphodiester bond, reseal- 
ing is rapid and does not require additional energy input. In this respect, the 
rejoining mechanism differs from that catalyzed by the enzyme DNA ligase, dis- 
cussed previously (see Figure 5-13). 

A second type of DNA topoisomerase, topoisomerase II, forms a covalent 
linkage to both strands of the DNA helix at the same time, making a transient 
double-strand break in the helix. These enzymes are activated by sites on chro- 
mosomes where two double helices cross over each otiier. Once a topoiso- 
merase II molecule binds to such a crossing site, the protein uses ATP hydroly- 
sis to perform the following set of reactions efficiently: (1) it breaks one double 
helix reversibly to create a DNA "gate;" (2) it causes the second, nearby double 
helix to pass through this break: and (3) it then reseals the break and dissociates 
from the DNA (Figure 5-23). In this way, type II DNA topoisomerases can effi- 
ciently separate two interlocked DNA circles (Figure 5-24). 

The same reaction also prevents the severe DNA tangling problems that 
would otherwise arise during DNA replication. This role is nicely illustrated by 



Figure 5-21 The "winding problem" that arises during DNA replication. 

For a bacterial replication fork moving at 500 nucleotides per second, the 
parental DNA helix ahead of the fork must rotate at 50 revolutions per 
second. 



rapid rotation 
of the DNA 
helix needed 
here 




leading -strand 
template 



newJy synthesized DNA chain 






DNA REPLICATION MECHANISMS 



279 



mutant yeast cells that produce, in place of the normal topoisomerase II, a ver- 
sion that is inactive above 37°C. When the mutant cells are warmed to this tem- 
perature, their daughter chromosomes remain intertwined after DNA replica- 
tion and are unable to separate. The enormous usefulness of topoisomerase II 
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Figure 5-22 The reversible DNA nicking 
reaction catalyzed by a eucaryotic DNA 
topoisomerase I enzyme. As indicated, 
these enzymes transiently form a single 
covalent bond with DNA; this allows free 
rotation of the DNA around the covalent 
backbone bonds linked to the blue 
phosphate. 
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Figure 5-23 A model for topoisomerase II action. As indicated, ATP binding to the two ATPase domains 
causes them to dimerize and drives the reactions shown. Because a single cycle of this reaction can occur 
in the presence of a non-hydrolyzable ATP analog, ATP hydrolysis is thought to be needed only to reset the 
enzyme for each new reaction cycle. This model is based on the structure of enzyme in combination with 
biochemical experiments. (Modified from J.M. Berger, Curr. Opin. Struct. Biol. 8:26-32, 1998. With permission 
from Elsevier.) 



for untangling chromosomes can readily be appreciated by anyone who has 
struggled to remove a tangle from a fishing line without the aid of scissors. 

DNA Replication Is Fundamentally Similar in Eucaryotes and 
Bacteria 

Much of what we know about DNA replication was first derived from studies of 
purified bacterial and bacteriophage multi enzyme systems capable of DNA 
replication in vitro. The development of these systems in the 1970s was greatly 
facilitated by the prior isolation of mutants in a variety of replication genes; 
these mutants were exploited to identify and purify the corresponding replica- 
tion proteins The first mammalian replication system that accurately replicated 
DNA in vitro was described in the mid-1980s, and mutations in genes encoding 
nearly all of the replication components have now been isolated and analyzed in 
the yeast Saccharomyces cerevisiae. As a result, much is known about the 
detailed enzymology of DNA replication in eucaryotes, and it is clear that the 
fundamental features of DNA replication— including replication fork geometry 
and the use of a multiprotein replication machine— have been conserved during 
the long evolutionary process that separated bacteria from eucaryotes. 

There are more protein components in eucaryotic replication machines 
than there are in the bacterial analogs, even though the basic functions are the 
same. Thus, for example, the eucaryotic single-strand binding (SSB) protein is 
formed from three subunits, whereas only a single subunit is found in bacteria. 
Similarly the eucaryotic DNA primase is incorporated into a multisubunit 
enzyme that also contains a DNA polymerase called DNA polymerase oc-pri- 
mase. This protein complex begins each Okazaki fragment on the lagging strand 
with RNA and then extends the RNA primer with a short length of DNA. At this 
point, the two main eucaryotic replicative polymerases, 6 and e, come into play 
and complete each Okazaki fragment while simultaneously extending the lead- 
ing strand. Exactly how the tasks of leading and lagging strand synthesis are dis- 
tributed between these two DNA polymerases is not yet understood. 

As we see in the next section, the eucaryotic replication machinery has the 
added complication of having to replicate through nucleosomes, the repeating 
structural unit of chromosomes discussed in Chapter 4. Nucleosomes are 
spaced at intervals of about 200 nucleotide pairs along the DNA, which may 
explain why new Okazaki fragments are synthesized on the lagging strand at 
intervals of 100-200 nucleotides in eucaryotes, instead of 1 000-2000 nucleotides 
as in bacteria. Nucleosomes may also act as barriers that slow down the move- 
ment of DNA polymerase molecules, which may be why eucaryotic replication 
forks move only about one-tenth as fast as bacterial replication forks. 
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Figure 5-24 The DNA-helix-passing reaction catalyzed by DNA 
topoisomerase II. Identical reactions are used to untangle DNA inside the 
cell. Unlike type I topoisomerases, type II enzymes use ATP hydrolysis and 
some of the bacterial versions can introduce superhelical tension into DNA. 
Type II topoisomerases are largely confined to proliferating cells in 
eucaryotes; partly for that reason, they have been popular targets for 
anticancer drugs. 



Summary 

DNA replication takes place at a Y-shaped structure called a replication fork. A self- 
correcting DNA polymerase enzyme catalyzes nucleotide polymerization in a 5'-to-3 
direction, copying a DNA template strand with remarkable fidelity. Since the two 
strands of a DNA double helix are antiparallel, this 5'-to-3 DNA synthesis can take 
place continuously on only one of the strands at a replication fork (the leading strand). 
On the lagging strand, short DNA fragments must be made by a "backstitching" process. 
Because the self-correcting DNA polymerase cannot start a new chain, these lagging- 
strand DNA fragments are primed by short RNA primer molecules that are subsequently 
erased and replaced with DNA. 

DNA replication requires the cooperation of many proteins. These include (1) DNA 
polymerase and DNA primase to catalyze nucleoside triphosphate polymerization; (2) 
DNA helicases and single-strand DNA-binding (SSB) proteins to help in opening up 
the DNA helix so that it can be copied; (3) DNA ligase and an enzyme that degrades 
RNA primers to seal together the discontinuously synthesized lagging- strand DNA 
fragments; and (4) DNA topoisomerases to help to relieve helical winding and DNA 
tangling problems. Many of these proteins associate with each other at a replication 
fork to form a highly efficient Replication machine" through which the activities and 
spatial movements of the individual components are coordinated. 



THE INITIATION AND COMPLETION OF DNA 
REPLICATION IN CHROMOSOMES 

We have seen how a set of replication proteins rapidly and accurately generates 
two daughter DNA double helices behind a replication fork. But how is this repli- 
cation machinery assembled in the first place, and how are replication forks cre- 
ated on a double-stranded DNA molecule? In this section, we discuss how cells 
initiate DNA replication and how they carefully regulate this process to ensure 
that it takes place not only at the proper positions on the chromosome but also 
at the appropriate time in the life of the cell. We also discuss a few of the special 
problems that the replication machinery in eucaryotic cells must overcome. 
These include the need to replicate the enormously long DNA molecules found 
in eucaryotic chromosomes, as well as the difficulty of copying DNA molecules 
that are tightly complexed with histones in nucleosomes. 

DNA Synthesis Begins at Replication Origins 

As discussed previously, the DNA double helix is normally very stable: the two 
DNA strands are locked together firmly by many hydrogen bonds formed 
between the bases on each strand. To be used as a template, the double helix 
must be opened up and the two strands separated to expose unpaired bases. As 
we shall see, the process of DNA replication is begun by special initiator proteins 
that bind to double-stranded DNA and pry the two strands apart, breaking the 
hydrogen bonds between the bases. 

The positions at which the DNA helix is first opened are called replication 
origins (Figure 5-25). In simple cells like those of bacteria or yeast, origins are 
specified by DNA sequences several hundred nucleotide pairs in length. This 
DNA contains both short sequences that attract initiator proteins and stretches 
of DNA that are especially easy to open. We saw in Figure 4-4 that an A-T base 
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pair is held together by fewer hydrogen bonds than a G-C base pair. Therefore, 
DNA rich in A-T base pairs is relatively easy to pull apart, and regions of DNA 
enriched in A-T pairs are typically found at replication origins. 

Although the basic process of replication fork initiation depicted in Figure 
5-25 is fundamentally the same for bacteria and eucaryotes, the detailed way in 
which this process is performed and regulated differs between these two groups 
of organisms. We first consider the simpler and better-understood case in bac- 
teria and then turn to the more complex situation found in yeasts, mammals, 
and other eucaryotes. 



Bacterial Chromosomes Typically Have a Single Origin of DNA 
Replication 

The genome of E. coliis contained in a single circular DNA molecule of 4.6 x 10 6 
nucleotide pairs. DNA replication begins at a single origin of replication, and the 
two replication forks assembled there proceed (at approximately 500-1000 
nucleotides per second) in opposite directions until they meet up roughly 
halfway around the chromosome (Figure 5-26). The only point at which E. coli 
can control DNA replication is initiation: once the forks have been assembled at 
the origin, they synthesize DNA at relatively constant speed until replication is 
finished. Therefore, it is not surprising that the initiation of DNA replication is 
highly regulated. The process begins when initiator proteins bind in multiple 
copies to specific sites in the replication origin, wrapping the DNA around the 
proteins to form a large protein-DNA complex. This complex then attracts a 
DNA hriicase bound to a helix loader, and the helicase is placed around an adja- 
cent DNA single strand whose bases have been exposed by the assembly of the 
initiator protein-DNA complex. The helicase loader is analogous to the clamp 
loader we encountered above; it has the additional job of keeping the helicase in 
an inactive form until it is properly loaded onto a nascent replication fork. Once 
the helicase is loaded, it begins to unwind DNA, exposing enough single- 
stranded DNA for primase to synthesize the RNA primer that begins the leading 
strand (Figure 5-27). This quickly leads to the assembly of the remaining pro- 
teins to create two replication forks, nidi protein complexes that move, with 
respect to the replication origin, in opposite directions. These protein machines 
continue to synthesize DNA until all of the DNA template downstream of each 
fork has been replicated. 

In E. coli, the interaction of the initiator protein with the replication origin is 
carefully regulated, with initiation occurring only when sufficient nutrients are 
available for the bacterium to complete an entire round of replication. Not only 
is the activity of the initiator protein controlled, but an origin of replication that 
has just been used experiences a "refractory period/' caused by a delay in the 
methylation of newly synthesized A nucleotides. Further initiation of replication 
is blocked until these As are methylated {Figure 5-28]. 

Eucaryotic Chromosomes Contain Multiple Origins of Replication 

We have seen how two replication forks begin at a single replication origin in 
bacteria and proceed in opposite directions, moving away from the origin until 
all of the DNA in the single circular chromosome is replicated. The bacterial 
genome is sufficiently small for these two replication forks to duplicate the 
genome in about 40 minutes. Because of the much greater size of most eucary- 
otic chromosomes, a different strategy is required to allow their replication in a 
timely manner. 

A method for determining the general pattern of eucaryotic chromosome 
replication was developed in the early 1960s. Human cells growing in culture are 
labeled for a short time with 3 H-thymidine so that the DNA synthesized during 
this period becomes highly radioactive. The cells are then gently lysed, and the 
DNA is streaked on the surface of a glass slide coated with a photographic emul- 
sion. Development of the emulsion reveals the pattern of labeled DNA through 
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Figure 5-25 A replication bubble 
formed by replication fork initiation. 

This diagram outlines the major steps in 
the initiation of replication forks at 
replication origins. The structure formed 
at the last step, in which both strands of 
the parental DNA helix have been 
separated from each other and serve as 
templates for DNA synthesis, is called a 
replication bubble. 
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Figure 5-26 DNA replication of a bacterial genome. It takes E. coli about 
40 minutes to duplicate its genome of 4.6 x 1 6 nucleotide pairs. For 
simplicity, no Okazaki fragments are shown on the lagging strand. What 
happens as the two replication forks approach each other and collide at 
the end of the replication cycle is not well understood, although the 
replication machines are disassembled as part of the process. 



a technique known as autoradiography. The time allotted for radioactive label- 
ing is chosen to allow each replication fork to move several micrometers along 
the DNA, so that the replicated DNA can be detected in the light microscope as 
lines of silver grains, even though the DNA molecule itself is too thin to be visible. 
In this way, both the rate and the direction of replication-fork movement can be 
determined (Figure 5-29). From the rate at which tracks of replicated DNA 
increase in length with increasing labeling time, the replication forks are esti- 
mated to travel at about 50 nucleotides per second. This is approximately one- 
tenth of the rate at which bacterial replication forks move, possibly reflecting the 
increased difficulty of replicating DNA that is packaged tightly in chromatin. 

An average-size human chromosome contains a single linear DNA molecule 
of about 150 million nucleotide pairs. It would take 0.02 seconds/nucleotide x 
150 x 10 6 nucleotides = 3.0 x 10 6 seconds (about 800 hours) to replicate such a 
DNA molecule from end to end with a single replication fork moving at a rate of 
50 nucleotides per second. As expected, therefore, the autoradiographic experi- 
ments just described reveal that many forks are moving simultaneously on each 
eucaryotic chromosome. 

Further experiments of this type have shown the following: (1) Replication 
origins tend to be activated in clusters, called replication units, of perhaps 20-80 
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Figure 5-27 The proteins that initiate 
DNA replication in bacteria. The 
mechanism shown was established by 
studies in vitro with mixtures of highly 
purified proteins. For £ coli DNA 
replication, the major initiator protein, 
the helicase, and the primase are the 
DnaA, DnaFJ, and DnaG proteins, 
respectively. In the first step, several 
molecules of the initiator protein bind to 
specific DNA sequences in the replication 
origin and form a compact structure in 
which the DNA is wrapped around the 
protein. Next the helicase is brought in 
by a helicase loading protein (the DnaC 
protein), which inhibits the helicase until 
it is properly loaded at the replication 
origin. The helicase loading protein 
thereby prevents the helicase from 
inappropriately entering other single- 
stranded stretches of DNA in the 
bacterial genome. Aided by single-strand 
binding protein (not shown), the loaded 
helicase opens up the DNA thereby 
enabling primase to enter and synthesize 
the primer for the first DNA chain. 
Subsequent steps (not shown) result in 
the initiation of three additional DNA 
chains and the final assembly of two 
complete replication forks. 
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origins. (2) New replication units seem to be activated at different times during 
the cell cycle until all of the DNA is replicated, a point that we return to below. 
(3) Within a replication unit, individual origins are spaced at intervals of 
30,000-250,000 nucleotide pairs from one another. (4) As in bacteria, replication 
forks are formed in pairs and create a replication bubble as they move in oppo- 
site directions away from a common point of origin, stopping only when they 
collide head-on with a replication fork moving in the opposite direction (or 
when they reach a chromosome end). In this way many replication forks oper- 
ate independently on each chromosome and yet form two complete daughter 
DNA helices. 



Figure 5-28 Methylation of the £ coli 
replication origin creates a refractory 
period for DNA initiation. DNA 

methylation occurs at GATC sequences, 
1 1 of which are found in the origin of 
replication (spanning approximately 250 
nucleotide pairs). In its hemimethylated 
state, the origin of replication is bound by 
an inhibitor protein (Seq A, not shown), 
which blocks access of the origin to 
initiator proteins. Eventually (about 
20 minutes after replication is initiated), 
the hemimethylated origins become fully 
methylated by a DNA methylase enzyme; 
Seq A then dissociates. 

A single enzyme, the Dam methylase, is 
responsible for methylating all E. coli 
GATC sequences. A lag in methylation 
after the replication of GATC sequences is 
also used by the E. coli mismatch 
proofreading system to distinguish the 
newly synthesized DNA strand from the 
parental DNA strand; in that case, the 
relevant GATC sequences are scattered 
throughout the chromosome, and they 
are not bound by Seq A. 



In Eucaryotes DNA Replication Takes Place During Only One Part 
of the Cell Cycle 

When growing rapidly, bacteria replicate their DNA continually, and they can 
begin a new round before the previous one is complete. In contrast, DNA repli- 
cation in most eucaryotic cells occurs only during a specific part of the cell divi- 
sion cycle, called the DNA synthesis phase or S phase (Figure 5-30). In a mam- 
malian cell, the S phase typically lasts for about 8 hours; in simpler eucaryotic 
cells such as yeasts, the S phase can be as short as 40 minutes. By its end, each 
chromosome has been replicated to produce two complete copies, which 
remain joined together at their centromeres until the M phase (M for mitosis), 
which soon follows. In Chapter 17, we describe the control system that runs the 
cell cycle and explain why entry into each phase of the cycle requires the cell to 
have successfully completed the previous phase. 

In the following sections, we explore how chromosome replication is coor- 
dinated within the S phase of the cell cycle. 
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Figure 5-29 The experiments that 
demonstrated the pattern in which 
replication forks are formed and move 
on eucaryotic chromosomes. The new 

DNA made in human cells in culture was 
labeled briefly with a pulse of highly 
radioactive thymidine ( 3 H-thymidine). 
(A) In this experiment, the cells were 
lysed, and the DNA was stretched out on 
a glass slide that was subsequently 
covered with a photographic emulsion. 
After several months the emulsion was 
developed, revealing a line of silver 
grains over the radioactive DNA. The 
brown DNA in this figure is shown only to 
help with the interpretation of the 
autoradiograph; the unlabeled DNA is 
invisible in such experiments. (B)This 
experiment was the same except that a 
further incubation in unlabeled medium 
allowed additional DNA, with a lower 
level of radioactivity, to be replicated. The 
pairs of dark tracks in (B) were found to 
have silver grains tapering off in opposite 
directions, demonstrating bidirectional 
fork movement from a central replication 
origin where a replication bubble forms 
(see Figure 5-25). A replication fork is 
thought to stop only when it encounters 
a replication fork moving in the opposite 
direction or when it reaches the end of 
the chromosome; in this way, all the DNA 
is eventually replicated. 
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Figure 5-30 The four successive phases of a standard eucaryotic cell 
cycle. During the Gi, S, and G 2 phases, the cell grows continuously. During 
M phase growth stops, the nucleus divides, and the cell divides in two. 
DNA replication is confined to the part of the cell cycle known as 
S phase. Gi is the gap between M phase and S phase; G2 is the gap 
between S phase and M phase. 

Different Regions on the Same Chromosome Replicate at Distinct 
Times in S Phase 

In mammalian cells, the replication of DNA in the region between one replica- 
tion origin and the next should normally require only about an hour to com- 
plete, given the rate at which a replication fork moves and the largest distances 
measured between replication origins. Yet S phase usually lasts for about 8 hours 
in a mammalian cell. This implies that the replication origins are not all acti- 
vated simultaneously and that the DNA in each replication unit (which, as we 
noted above, contains a cluster of about 20-80 replication origins) is replicated 
during only a small part of the total S-phase interval. 

Are different replication units activated at random, or are different regions of 
the genome replicated in a specified order? One way to answer this question is 
to use the thymidine analog bromodeoxyuridine (BrdU) to label the newly syn- 
thesized DNA in synchronized cell populations, adding it for different short 
periods throughout S phase. Later, during M phase, those regions of the mitotic 
chromosomes that have incorporated BrdU into their DNA can be recognized by 
their altered staining properties or by means of anti-BrdU antibodies. The 
results show that different regions of each chromosome are replicated in a 
reproducible order during S phase (Figure 5-31). Moreover, as one would expect 
from the clusters of replication forks seen in DNA autoradiographs (see Figure 
5-29), the timing of replication is coordinated over large regions of the chromo- 
some. 

Much more sophisticated methods now exist for monitoring DNA replica- 
tion initiation and tracking the movement of DNA replication forks in cells. 
These approaches use DNA microarrays — grids the size of a postage stamp stud- 
ded with tens of thousands of fragments of known DNA sequence. As we will see 
in detail in Chapter 8, each different DNA fragment is placed at a unique posi- 
tion on the microarray, and whole genomes can thereby be represented in an 
orderly manner. If a DNA sample from a group of cells in S phase is broken up 
and hybridized to a microarray representing that organism's genome, the 
amount of each DNA sequence can be determined. Because a segment of a 
genome that has been replicated will contain twice as much DNA as an unrepli- 
cated segment, replication fork initiation and fork movement can be accurately 
monitored in this way (Figure 5-32). Although this method provides much 
greater precision, it leads to many of the same conclusions reached from the ear- 
lier studies. 



Highly Condensed Chromatin Replicates Late, While Genes in Less 
Condensed Chromatin Tend to Replicate Early 

It seems that the order in which replication origins are activated depends, in 
part, on the chromatin structure in which the origins reside. We saw in Chapter 
4 that heterochromatin is a particularly condensed state of chromatin, while 
euchromatin chromatin has a less condensed conformation that is apparently 
required to allow transcription. Heterochromatin tends to be replicated very late 
in S phase, suggesting that the timing of replication is related to the packing of 
the DNA in chromatin. This suggestion is supported by an examination of the 
two X chromosomes in a female mammalian cell. While these two chromosomes 
contain essentially the same DNA sequences, one is active for DNA transcription 
and the other is not (discussed in Chapter 7). Nearly all of the inactive X chro- 
mosome is condensed into heterochromatin, and its DNA replicates late in S 
phase. Its active homolog is less condensed and replicates throughout S phase. 
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Figure 5-31 Different regions of a 
chromosome are replicated at different 
times in S phase. These light 
micrographs show stained mitotic 
chromosomes in which the replicating 
DNA has been differentially labeled 
during different defined intervals of the 
preceding S phase. In these experiments, 
cells were first grown in the presence of 
BrdU (a thymidine analog) and absence 
of thymidine to label the DNA uniformly. 
The cells were then briefly pulsed with 
thymidine in the absence of BrdU during 
early, middle, or late S phase. Because the 
DNA made during the thymidine pulse is 
a double helix with thymidine on one 
strand and BrdU on the other, it stains 
more darkly than the remaining DNA 
(which has BrdU on both strands) and 
shows up as a bright band (arrows) on 
these negatives. Broken lines connect 
corresponding positions on the three 
identical copies of the chromosome 
shown. (Courtesy of Elton Stubblefield.) 
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These findings suggest that regions of the genome whose chromatin is least 
condensed are replicated first. Replication forks seem to move at comparable 
rates throughout S phase, so the extent of chromosome condensation seems to 
influence the time at which replication forks are initiated, rather than their 
speed once formed. 



Figure 5-32 Use of DNA microarrays to 
monitor the formation and progress of 
replication forks in the budding yeast 
genome. For this experiment, a 
population of cells is sychronized so that 
they all begin replication at the same 
time. DNA is collected and hybridized to 
the microarray; DNA that has been 
replicated once gives a hybridization 
signal (dark green squares) twice as high 
as that of unreplicated DNA (light green 
squares). The spots on these microarrays 
represent consecutive sequences along a 
segment of a yeast chromosome 
arranged left to right, top to bottom. 
Only 81 spots are shown here, but the 
actual arrays contain tens of thousands 
of sequences that span the entire yeast 
genome. As can be seen, replication 
begins at an origin and proceeds 
bidirectionally. For simplicity only one 
origin is shown here. In yeast cells, 
replication begins at hundreds of origins 
located throughout the genome. 



Well-defined DNA Sequences Serve as Replication Origins in a 
Simple Eucaryote, the Budding Yeast 

Having seen that a eucaryotic chromosome is replicated using many origins of 
replication, each of which "fires" at a characteristic time in S phase of the cell 
cycle, we turn to the nature of these origins of replication. We saw earlier in this 
chapter that replication origins have been precisely defined in bacteria as spe- 
cific DNA sequences that attract initiator proteins, which then assemble the 
DNA replication machinery. By analogy, one would expect the replication ori- 
gins in eucaryotic chromosomes to be specific DNA sequences too. 

The search for eucaryotic DNA sequences that carry all the information nec- 
essary to specify a replication origin has been most productive in the budding 
yeast S. cerevisiae. Powerful selection methods to find them have been devised 
that make use of mutant yeast cells defective for an essential gene. These cells 
can survive in a selective medium only if they are provided with DNA that car- 
ries a Functional copy of the missing gene. If a circular bacterial plasmid con- 
raining this gene is introduced into the mutant yeast ceils directly the plasmid 
will not be able to replicate because it lacks a functional origin. If random pieces 
of yeast DNA arc inserted into tiiis plasmid, however, only the rare plasmid DNA 
molecules that happen to contain a yeast replication origin can replicate. The 
yeast ceils that carry such plasmids are able to proliferate because they have 
been provided with the essential gene in a form that can be replicated and 
passed on to progeny cells (Figure 5-33). A DNA sequence identified by its pres- 
ence in a plasmid isolated from these surviving yeast cells is called an 
autonomously replicating sequence (ARS). Most ARSs have been shown to be 
authentic chromosomal origins of replication, thereby validating the strategy 
used to obtain them. 

For budding yeast, the location of every origin of replication on each chro- 
mosome has been determined. The particular chromosome shown in Figure 
5-34— chromosome III from the yeast S. cerevisiae— is one of the smallest 
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chromosomes known, with a length less than 1/100 that of a typical human 
chromosome. Its major origins are spaced an average of 30,000 nucleotide pairs 
apart; this density of origins should permit this chromosome to be replicated in 
about 10 minutes if all the origins fire at once. As we have previously discussed, 
mammalian origins are spaced further apart, typically every 100,000 to 250,000 
nucleotide pairs. 

Genetic experiments in S. cerevisiaehave tested the effect of deleting various 
replication origins on chromosome III. Removing a few origins has little effect, 
because replication forks that begin at neighboring origins of replication can 
continue into the regions that lack their own origins. The deletion of more repli- 
cation origins, however, results in the loss of the chromosome as the cells divide, 
because it is replicated too slowly. Many eucaryotes carry an excess of origins of 
replication, presumably to ensure that the complete genome can still be repli- 
cated in a timely manner if some of the origins fail to function. 



Figure 5-33 A strategy used to identify 
DNA sequences that are sufficient for 
initiating DNA replication. Each of the 
yeast DNA sequences identified in this 
way was called an autonomously 
replicating sequence (Ars), since it 
enables a plasmid that contains it to 
replicate in the host cell without having 
to be incorporated into a host cell 
chromosome. 



A Large Multisubunit Complex Binds to Eucaryotic Origins of 
Replication 

The minimal DNA sequence required for directing DNA replication initiation in 
the yeast S. cerevisiae has been determined by testing smaller and smaller DNA 
fragments in the experiment shown in Figure 5-33. Each DNA sequence that 
can serve as an origin of replication is found to contain (1) a binding site for a 
large, multisubunit initiator protein called ORC, for origin recognition com- 
plex, (2) a stretch of DNA that is rich in As and Ts and therefore easy to unwind, 
and (3) at least one binding site for proteins that help attract ORC to the origin 
DNA (Figure 5-35). In bacteria, once the initiator protein is properly bound to 
the single origin of replication the assembly of the replication forks follows 
more or less automatically. In eucaryotes, the situation is significantly different 
because of a profound problem eucaryotes have in replicating chromosomes 
with so many origins of replication (an estimated 400 in yeast and 10,000 in 
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Figure 5-34 The origins of DNA 
replication on chromosome III of the 
yeast S. cerevisiae. This chromosome, 
one of the smallest eucaryotic 
chromosomes known, carries a total of 
1 80 genes. As indicated, it contains 1 9 
replication origins, although they are 
used with different efficiencies. Those in 
red are typically used less than 10% of 
the time, while those in green are used in 
90% of S phases. 
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humans, for example) . With so many places to begin replication, how is the pro- 
cess regulated to ensure that all the DNA is copied once and only once? 

The answer lies in the way that the ORC complex, once bound to an origin 
of replication, is sequentially activated and deactivated. This matter is discussed 
in detail in Chapter 17, when we consider the cellular machinery that underlies 
the cell division cycle. The ORC-origin interaction persists throughout the entire 
cell cycle, dissociating only briefly immediately following replication of the ori- 
gin DNA, and other proteins that bind to it regulate origin activity These include 
the DNA helicase and two helicase loading proteins, Cdc6 and Cdtl, which are 
assembled onto an ORC-DNA complex to form a prereplicative complex at each 
origin during d phase (Figure 5-36). The passage of a cell from Gi to S phase is 
triggered by the activation of protein kinases (Cdks) that lead to dissociation of 
the helicase loading proteins, activation of the helicase, unwinding of the origin 
DNA, and loading of the remaining replication proteins including DNA poly- 
merases (see Figure 5-36). 

The protein kinases that trigger DNA replication simultaneously prevent all 
assembly of new prereplicative complexes until the next M phase resets the 
entire cycle (for details, see pp. 1067-1069). This strategy provides a single win- 
dow of opportunity for prereplicative complexes to form (Gi phase, when Cdk 
activity is low) and a second window for them to be activated and subsequently 
disassembled (S phase, when Cdk activity is high). Because these two phases of 
the cell cycle are mutually exclusive and occur in a prescribed order, each origin 
of replication can fire once and only once a cell cycle. 



Figure 5-35 An origin of replication in 
yeast. Comprising about 1 50 nucleotide 
pairs, this yeast origin (identified by the 
procedure shown in Figure 5-33) has a 
binding site for ORC, and one for Abf 1 , an 
auxiliary protein that facilitates ORC 
binding. All origins contain binding sites 
for ORC but the auxiliary proteins are 
different from one origin to the next. 
Most origins, like the one depicted, also 
contain a stretch of DNA that is especially 
easy to unwind. 



The Mammalian DNA Sequences That Specify the Initiation of 
Replication Have Been Difficult to Identify 

Compared with the situation in budding yeasts, DNA sequences that specify 
replication origins in other eucaryotes have been more difficult to define. 
Recently, however, it has been possible to identify specific human DNA 
sequences, each several thousand nucleotide pairs in length, that are sufficient 
to serve as replication origins. These origins continue to function when moved 
to a different chromosomal region by recombinant DNA methods, as long as 
they are placed in a region where the chromatin is relatively uncondensed. One 
of these origins is from the P-globin gene cluster. At its normal position in the 
genome, the function of this origin depends critically upon distant DNA 
sequences (Figure 5-37). As discussed in Chapter 7, this distant DNA is needed 
for expression of all genes in the P-globin cluster, and its effects on both tran- 
scription and origin function may reflect its long-range decondensation of 
chromatin structure. 

A human ORC homologous to that in yeast cells is required for replication 
initiation. Many of the other proteins that function in the initiation process in 
yeast likewise have central roles in humans. It therefore seems likely that the 
yeast and human initiation mechanisms will turn out to be similar in outline. 
However, the binding sites for the ORC protein seem to be less specific in 
humans than they are in yeast, which may explain why the replication origins of 
humans are less sharply defined. In fact, chromatin structure, rather than DNA 
sequences, may have the central role in defining mammalian origins of replica- 
tion. Thus, as is true in many other areas of cell biology, the mechanism of DNA 
replication initiation in yeast may vividly highlight the core process, while the 
situation in humans represents an elaborate variation on the theme. 
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Figure 5-36 The mechanism of DNA 
replication initiation in eucaryotes.This 
mechanism ensures that each origin of 
replication is activated only once per cell 
cycle. An origin of replication can be used 
only if a prereplicative complex forms 
there in G^ phase. At the beginning of 
S phase, cyclin-dependent kinases (Cdks) 
phosphorylate various replication 
proteins, causing both disassembly of the 
prereplicative complex and initiation of 
DNA replication. A new prereplicative 
complex cannot form at the origin until 
the cell progresses to the next Gi phase. 



New Nucleosomes Are Assembled Behind the Replication Fork 

There are several additional aspects of DNA replication that are specific to 
eucaryotes. As discussed in Chapter 4, eucaryotic chromosomes are composed 
of roughly equal mixtures of DNA and protein. Chromosome duplication there- 
fore requires not only the replication of DNA, but also the synthesis and assem- 
bly of new chromosomal proteins onto the DNA behind each replication fork. 
Although we are far from understanding this process in detail, we are beginning 
to learn how the fundamental unit of chromatin packaging, the nucleosome, is 
duplicated. The cell requires a large amount of new histone protein, approxi- 
mately equal in mass to the newly synthesized DNA, to make the new nucleo- 
somes in each cell cycle. For this reason, most eucaryotic organisms possess 
multiple copies of the gene for each histone. Vertebrate cells, for example, have 
about 20 repeated gene sets, most containing the genes that encode all five his- 
tones (HI, H2A, H2B, H3, and H4). 

Unlike most proteins, which are made continuously throughout interphase, 
histones are synthesized mainly in S phase, when the level of histone mRNA 
increases about fiftyfold as a result of both increased transcription and 
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decreased mRNA degradation. The major histone mRNAs are degraded within 
minutes when DNA synthesis stops at the end of S phase. The mechanism 
depends on special properties of the 3' ends of these mRNAs, as discussed in 
Chapter 7. In contrast, the histone proteins themselves are remarkably stable 
and may survive for the entire life of a cell. The tight linkage between DNA syn- 
thesis and histone synthesis appears to reflect a feedback mechanism that mon- 
itors the level of free histone to ensure that the amount of histone made exactly 
matches the amount of new DNA synthesized. 

As a replication fork advances, it must somehow pass through the parental 
nucleosomes. In vitro studies show that the replication apparatus has a poorly 
understood intrinsic ability to pass through parental nucleosomes without dis- 
placing them from the DNA. However, to replicate chromosomes efficiently in 
the cell, chromatin-remodeling proteins (discussed in Chapter 4), which desta- 
bilize the DNA-histone interface, are required. Aided by such complexes, repli- 
cation forks can transit even highly condensed heterochromatin efficiently. 

As the replication fork passes through chromatin, most of the old histones 
remain DNA-bound and are distributed to the daughter DNA helices behind a 
replication fork. But since the amount of DNA has doubled, an equal amount of 
new histones is also needed to complete the packaging of DNA into chromatin. 
The old and the new histones are combined in an intriguing way. When a nucle- 
osome is transited by a replication fork, the histone octamer appears to be bro- 
ken into an H3-H4 tetramer and two H2A-H2B dimers (see Figure 4-26). The H3- 
H4 tetramer remains associated with DNA and is distributed at random to one 
or the other daughter duplexes, but the H2A-H2B dimers are released from DNA. 
Freshly made H3-H4 tetramers are added to the newly synthesized DNA to fill in 
the "spaces," and H2AB dimers— half of which are old and half new— are then 
added at random to complete the nucleosomes (Figure 5-38). 

The orderly and rapid addition of new H3-H4 tetramers and H2A-H2B 
dimers behind a replication fork requires histone chaperones (also called chro- 
matin assembly factors). These multisubunit complexes bind the highly basic 
histones and release them for assembly only in the appropriate context. These 
histone chaperones, along with their cargoes, are directed to newly replicated 
DNA through a specific interaction with the eucaryotic sliding clamp, called 
PCNA (see Figure 5-38B). These clamps are left behind moving replication forks 
and remain on the DNA long enough for the histone chaperones to complete 
their tasks. 



Figure 5-37 Deletions that inactivate an 
origin of replication in humans. These 
two deletions are found separately in two 
individuals who suffer from thalassemia, a 
disorder caused by the failure to express 
one or more of the genes in the (3-globin 
gene cluster shown. In both of these 
deletion mutants, the DNA in this region 
is replicated by forks that begin at 
replication origins outside the fi-globin 
gene cluster. 



The Mechanisms of Eucaryotic Chromosome Duplication Ensure 
That Patterns of Histone Modification Can Be Inherited 

We saw in Chapter 4 that histones are subject to many types of covalent modifi- 
cations and that the patterns of these modifications can carry important infor- 
mation regarding the fate of underlying DNA. It makes little intuitive sense for 
these patterns to be erased each time a cell divides, but since this information is 
encoded in the histone proteins rather than the DNA, special mechansims are 
needed to preserve and duplicate it. We have seen that histone H3-H4 tetramers 
are distributed randomly to the two daughter chromosomes that emerge behind 
a moving replication fork. The tails as well as other regions of H3 and H4 can be 
extensively modified (see Figure 4-39), and thus each daughter chromosome is 
seeded with the memory of the parental pattern of H3 and H4 modification. 
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Figure 5-38 Distribution of parental and 
newly synthesized histones behind a 
eucaryotic replication fork. (A) The 

distribution of parental H3-H4 tetramers 
to the daughter DNA molecules is 
apparently random, with roughly equal 
numbers inherited by each daughter. In 
contrast H2A-H2B dimers are released 
from the DNA as the replication fork 
passes. (B) Histone chaperones {NAP1 and 
CAF1) restore the full complement of 
histones to daughter molecules. Although 
some daughter nucleosomes contain only 
parental histones or only newly 
synthesized histones, most are hybrids of 
old and new. (Adapted from J.D. Watson et 
al., Molecular Biology of the Gene, 5th ed. 
Cold Spring Harbor: Cold Spring Harbor 
Laboratory Press, 2004.) 
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Once the nucleosome assembly behind a replication fork has been completed, 
the parental patterns of H3-H4 modification can be reinforced through histone 
modification enzymes in reader-writer complexes that recognize the same type 
of modification they create (Figure 5-39). 

The faithful duplication of patterns of histone modification may be respon- 
sible for many examples of epigenetic inheritance, in which a heritable change in 
a cell's phenotype occurs without a change in the nucleotide sequence of DNA. 
We shall revisit the topic of epigenetics in Chapter 7 when we consider how deci- 
sions made by a cell are "remembered" by its progeny cells many generations 
later. 
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Figure 5-39 Strategy through which 
parental patterns of histone H3 and H4 
modification can be inherited by 
daughter chromosomes. Although it is 
unlikely that this mechanism applies to 
all histone modifications, it does pertain 
to some (see Figure 4-51 ). For example, a 
number of histone methylase complexes 
specifically recognize N-terminal histone 
tails that have been previously 
methylated at the same site that the 
methylase modifies. 
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Telomerase Replicates the Ends of Chromosomes 

We saw earlier that synthesis of the lagging strand at a replication fork must 
occur discontinuously through a backstitching mechanism that produces short 
DNA fragments. This mechanism encounters a special problem when the repli- 
cation fork reaches an end of a linear chromosome: there is no place to produce 
the RNA primer needed to start the last Okazaki fragment at the very tip of a lin- 
ear DNA molecule. 

Bacteria solve this "end-replication" problem by having circular DNA 
molecules as chromosomes (see Figure 5-27). Eucaryotes solve it in an inge- 
nious way: they have specialized nucleotide sequences at the ends of their chro- 
mosomes that are incorporated into structures called telomeres (see Chapter 4). 
Telomeres contain many tandem repeats of a short sequence that is similar in 
organisms as diverse as protozoa, fungi, plants, and mammals. In humans, the 
sequence of the repeat unit is GGGTTA, and it is repeated roughly a thousand 
times at each telomere. 

Telomere DNA sequences are recognized by sequence-specific DNA-bind- 
ing proteins that attract an enzyme, called telomerase, that replenishes these 
sequences each time a cell divides. Telomerase recognizes the tip of an existing 
telomere DNA repeat sequence and elongates it in the 5'-to-3' direction, using 
an RNA template that is a component of the enzyme itself to synthesize new 
copies of the repeat (Figure 5-40). The enzymatic portion of telomerase resem- 
bles other reverse transcriptases, enzymes that synthesize DNA using an RNA 
template (see Figure 5-72). After extension of the parental DNA strand by telom- 
erase, replication of the lagging strand at the chromosome end can be com- 
pleted by the conventional DNA polymerases, using these extensions as a tem- 
plate to synthesize the complementary strand (Figure 5-41). 

The mechanism just described, aided by a nuclease that chews back the 5' 
end, ensures that the 3' DNA end at each telomere is always longer than the 5' 
end with which it is paired, leaving a protruding single-stranded end (see Figure 
5-41). This protruding end has been shown to loop back and tuck its single- 
stranded terminus into the duplex DNA of the telomeric repeat sequence to 
form a t-loop (Figure 5-42). In broad outline this reaction resembles strand 
invasion during homologous recombination, discussed below, and it may have 
evolved from these ancient recombination systems. T-loops provide the normal 
ends of chromosomes with a unique structure, which protects them from 
degradative enzymes and clearly distinguishes them from the ends of the bro- 
ken DNA molecules that the cell rapidly repairs (see Figure 5-51). 
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Figure 5-40 The structure of a portion 
of telomerase. Telomerase is a large 
protein-RNA complex. The RNA (blue) 
contains a templating sequence for 
synthesizing new DNA telomere repeats. 
The synthesis reaction itself is carried out 
by the reverse transcriptase domain of 
the protein, shown in green. A reverse 
transcriptase is a special form of 
polymerase enzyme that uses an RNA 
template to make a DNA strand; 
telomerase is unique in carrying its own 
RNA template with it at all times. 
Telomerase also has several additional 
protein domains (not shown) that are 
needed to assemble the enzyme at the 
ends of chromosomes properly. 
(Modified from J. Lingner and T.R. Cech, 
Curr. Opin. Genet. Dev. 8:226-232, 1998. 
With permission from Elsevier.) 
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Figure 5-41 Telomere replication. 

Shown here are the reactions that 
synthesize the repeating G-rich 
sequences that form the ends of the 
chromosomes (telomeres) of diverse 
eucaryotic organisms. The 3' end of the 
parental DNA strand is extended by 
RNA-templated DNA synthesis; this 
allows the incomplete daughter DNA 
strand that is paired with it to be 
extended in its 5' direction. This 
incomplete, lagging strand is presumed 
to be completed by DNA polymerase a, 
which carries a DNA primase as one of its 
subunits.The telomere sequence 
illustrated is that of the ciliate 
Tetrahymena, in which these reactions 
were first discovered. 



Telomere Length Is Regulated by Cells and Organisms 

Because the processes that grow and shrink each telomere sequence are only 
approximately balanced, a chromosome end contains a variable number of 
telomeric repeats. Not surprisingly, experiments show that cells that proliferate 
indefinitely (such as yeast cells) have homeostatic mechanisms that maintain 
the number of these repeats within a limited range (Figure 5-43). 

In the somatic cells of humans, the telomere repeats have been proposed to 
provide each cell with a counting mechanism that helps prevent the unlimited 
proliferation of wayward cells in adult tissues. The simplest form of this idea 
holds that our somatic cells are born with a full complement of telomeric 
repeats. Some stem cells, notably those in tissues that must be replenished 
throughout life — such as those of bone marrow or skin — retain full telomerase 
activity. However, in many other types of cells, the level of telomerase is turned 
down so that the enzyme cannot quite keep up with chromosome duplication. 
Such cells lose 100-200 nucleotides from each telomere every time they divide. 
After many cell generations, the descendent cells will inherit defective chromo- 
somes (because their tips cannot be replicated completely) and consequently 
will withdraw permanently from the cell cycle and cease dividing — a process 
called replicative cell senescence (discussed in Chapter 17). In theory, such a 
mechanism could provide a safeguard against the uncontrolled cell proliferation 
of abnormal cells in somatic tissues, thereby helping to protect us from cancer. 

The idea that telomere length acts as a "measuring stick" to count cell divi- 
sions and thereby regulate the cell's lifetime has been tested in several ways. For 
certain types of human cells grown in tissue culture, the experimental results 
support such a theory. Human fibroblasts normally proliferate for about 60 cell 
divisions in culture before undergoing replicative senescence. Like most other 
somatic cells in humans, fibroblasts produce only low levels of telomerase, and 
their telomeres gradually shorten each time they divide. When telomerase is 
provided to the fibroblasts by inserting an active telomerase gene, telomere 
length is maintained and many of the cells now continue to proliferate indefi- 
nitely It therefore seems clear that telomere shortening can count cell divisions 
and trigger replicative senescence in some human cells. 

It has been proposed that this type of control on cell proliferation is impor- 
tant for the maintenance of tissue architecture and that it is also somehow 
responsible for the aging of animals like ourselves. These ideas have been tested 
by producing transgenic mice that lack telomerase entirely. The telomeres in 
mouse chromosomes are about five times longer than human telomeres, and 
the mice must therefore be bred through three or more generations before their 
telomeres have shrunk to the normal human length. It is therefore perhaps not 
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Figure 5-42 A t-loop at the end of a 
mammalian chromosome. Electron 
micrograph of the DNA at the end of an 
interphase human chromosome. The 
chromosome was fixed, deproteinated, 
and artificially thickened before viewing. 
The loop seen here is approximately 
15,000 nucleotide pairs in length. The 
insertion of the single-stranded 3' end 
into the duplex repeats to form a t-loop is 
thought to be carried out and 
maintained by specialized proteins. 
(From J.D. Griffith et al., Cell 97:503-514, 
1999. With permission from Elsevier.) 
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Figure 5-43 A demonstration that yeast 
cells control the length of their 
telomeres. In this experiment, the 
telomere at one end of a particular 
chromosome is artificially made either 
longer (left) or shorter (right) than 
average. After many cell divisions, the 
chromosome recovers, showing an 
average telomere length and a length 
distribution that is typical of the other 
chromosomes in the yeast cell. A similar 
feedback mechanism for controlling 
telomere length has been proposed for 
the cells in the germ-line cells of animals. 
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surprising that the first generations of mice develop normally. However, the 
mice in later generations develop progressively more defects in some of their 
highly proliferative tissues. In addition, these mice show signs of premature 
aging and have a pronounced tendency to develop tumors. In these and other 
respects these mice resemble humans with the genetic disease dyskeratosis con- 
genita. Individuals afflicted with this disease carry one functional and one non- 
functional copy of the telomerase RNA gene; they have prematurely shortened 
telomeres and typically die of progressive bone marrow failure. They also 
develop lung scarring and liver cirrhosis and show abnormalities in various epi- 
dermal structures including skin, hair follicles, and nails. 

The above observations demonstrate that controlling cell proliferation by 
telomere shortening poses a risk to an organism, because not all of the cells that 
begin losing the ends of their chromosomes will stop dividing. Some apparently 
become genetically unstable, but continue to divide giving rise to variant cells 
that can lead to cancer. Thus, one can question whether the observed down-reg- 
ulation of telomerase in most human somatic cells provides an evolutionary 
advantage, as suggested by those who postulate that telomere shortening pro- 
tects us from cancer and other proliferative diseases. 



Summary 

The proteins that initiate DNA replication bind to DNA sequences at a replication ori- 
gin to catalyze the formation of a replication bubble with two outward- moving repli- 
cation forks. The process begins when an initiator protein-DNA complex is formed 
that subsequently loads a DNA helicase onto the DNA template. Other proteins are 
then added to form the multienzyme "replication machine'' that catalyzes DNA syn- 
thesis at each replication fork. 

In bacteria and some simple eucaryotes, replication origins are specified by spe- 
cific DNA sequences that are only several hundred nucleotide pairs long. In other 
eucaryotes, such as humans, the sequences needed to specify an origin of DNA replica- 
tion seem to be less well defined, and the origin can span several thousand nucleotide 
pairs. 

Bacteria typically have a single origin of replication in a circular chromosome. 
With fork speeds of up to 1000 nucleotides per second, they can replicate their genome 
in less than an hour. Eucaryotic DNA replication takes place in only one part of the cell 
cycle, the S phase. The replication fork in eucaryotes moves about 10 times more slowly 
than the bacterial replication fork, and the much longer eucaryotic chromosomes each 
require many replication origins to complete their replication in an S phase, which 
typically lasts for 8 hours in human cells. The different replication origins in these 
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eucaryotic chromosomes are activated in a sequence, determined in part by the struc- 
ture of the chromatin, with the most condensed regions of chromatin typically begin- 
ning their replication last. After the replication fork has passed, chromatin structure is 
re-formed by the addition of new histones to the old histones that are directly inherited 
by each daughter DNA molecule. The mechanism of chromosome duplication allows 
the parental patterns of histone modification to be passed on to daughter chromo- 
somes, thus providing a means ofepigenetic inheritance. 

Eucaryotes solve the problem of replicating the ends of their linear chromosomes 
with a specialized end structure, the telomere, maintained by a special nucleotide 
polymerizing enzyme called telomerase. Telomerase extends one of the DNA strands at 
the end of a chromosome by using an RNA template that is an integral part of the 
enzyme itself, producing a highly repeated DNA sequence that typically extends for 
thousands of nucleotide pairs at each chromosome end. 



DNA REPAIR 

Maintaining the genetic stability that an organism needs for its survival requires 
not only an extremely accurate mechanism for replicating DNA, but also mech- 
anisms for repairing the many accidental lesions that occur continually in DNA. 
Most such spontaneous changes in DNA are temporary because they are imme- 
diately corrected by a set of processes that are collectively called DNA repair. Of 
the thousands of random changes created every day in the DNA of a human cell 
by heat, metabolic accidents, radiation of various sorts, and exposure to sub- 
stances in the environment, only a few accumulate as mutations in the DNA 
sequence. For example, we now know that fewer than one in 1000 accidental 
base changes in DNA results in a permanent mutation; the rest are eliminated 
with remarkable efficiency by DNA repair. 

The importance of DNA repair is evident from the large investment that cells 
make in DNA repair enzymes. For example, analysis of the genomes of bacteria 
and yeasts has revealed that several percent of the coding capacity of these 
organisms is devoted solely to DNA repair functions. The importance of DNA 
repair is also demonstrated by the increased rate of mutation that follows the 
inactivation of a DNA repair gene. Many DNA repair proteins and the genes that 
encode them — which we now know operate in a wide range of organisms, 
including humans — were originally identified in bacteria by the isolation and 
characterization of mutants that displayed an increased mutation rate or an 
increased sensitivity to DNA-damaging agents. 

Recent studies of the consequences of a diminished capacity for DNA repair 
in humans have linked many human diseases with decreased repair (Table 5-2). 



Table 5-2 Some Inherited Syndromes with Defects in DNA Repair 



NAME 


PHENOTYPE 


ENZYME OR PROCESS AFFECTED 


MSH2,3,6,MLH1,PMS2 


colon cancer 


mismatch repair 


Xeroderma pigmentosum (XP) 


skin cancer, UV sensitivity, neurological 


nucleotide excision-repair 


groups A-G 


abnormalities 




XP variant 


UV sensitivity, skin cancer 


translesion synthesis by DNA polymerase rj 


Ataxia telangiectasia (AT) 


leukemia, lymphoma, y-ray sensitivity, 


ATM protein, a protein kinase activated by 




genome instability 


double-strand breaks 


BRCA2 


breast, ovarian, and prostate cancer 


repair by homologous recombination 


Werner syndrome 


premature aging, cancer at several sites, 
genome instability 


accessory 3'-exonuclease and DNA helicase 


Bloom syndrome 


cancer at several sites, stunted growth, 
genome instability 


accessory DNA helicase for replication 


Fanconi anemia groups A-G 


congenital abnormalities, leukemia, 
genome instability 


DNA interstrand cross-link repair 


46 BR patient 


hypersensitivity to DNA-damaging 
agents, genome instability 


DNA ligase 1 
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Thus, we saw previously that defects in a human gene that normally functions to 
repair the mismatched base pairs resulting from DNA replication errors can lead 
to an inherited predisposition to certain cancers, reflecting an increased muta- 
tion rate. In another human disease, xeroderma pigmentosum (XP), the afflicted 
individuals have an extreme sensitivity to ultraviolet radiation because they are 
unable to repair certain DNA photoproducts. This repair defect results in an 
increased mutation rate that leads to serious skin lesions and an increased sus- 
ceptibility to certain cancers. 



Without DNA Repair, Spontaneous DNA Damage Would Rapidly 
Change DNA Sequences 

Although DNA is a highly stable material, as required for the storage of genetic 
information, it is a complex organic molecule that is susceptible, even under nor- 
mal cell conditions, to spontaneous changes that would lead to mutations if left 
unrepaired (Figure 5-44). For example, the DNA of each human cell loses about 
5000 purine bases (adenine and guanine) every day because their N-glycosyl 
linkages to deoxyribose hydrolyze, a spontaneous reaction called depurination. 
Similarly, a spontaneous deamination of cytosine to uracil in DNA occurs at a 
rate of about 100 bases per cell per day (Figure 5-45). DNA bases are also occa- 
sionally damaged by an encounter with reactive metabolites produced in the 
cell (including reactive forms of oxygen) or by exposure to chemicals in the envi- 
ronment. Likewise, ultraviolet radiation from the sun can produce a covalent 
linkage between two adjacent pyrimidine bases in DNA to form, for example, 
thymine dimers (Figure 5-46). If left uncorrected when the DNA is replicated, 
most of these changes would be expected to lead either to the deletion of one or 
more base pairs or to a base-pair substitution in the daughter DNA chain (Fig- 
ure 5-47). The mutations would then be propagated throughout subsequent cell 
generations. Such a high rate of random changes in the DNA sequence would 
have disastrous consequences for an organism. 



The DNA Double Helix Is Readily Repaired 

The double-helical structure of DNA is ideally suited for repair because it carries 
two separate copies of all the genetic information — one in each of its two 
strands. Thus, when one strand is damaged, the complementary strand retains 
an intact copy of the same information, and this copy is generally used to restore 
the correct nucleotide sequences to the damaged strand. 
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Figure 5-44 A summary of spontaneous alterations likely to require DNA repair. The sites on each 
nucleotide that are known to be modified by spontaneous oxidative damage (red arrows), hydrolytic attack 
(blue arrows), and uncontrolled methylation by the methyl group donor S-adenosylmethionine (green 
arrows) are shown, with the width of each arrow indicating the relative frequency of each event. (After 
T. Lindahl, Nature 362:709-715, 1993. With permission from Macmillan Publishers Ltd.) 



DNA REPAIR 



297 



DEPURINATION 



juger phosphate after 
depynnati'on 



DEAMINATION 




DNA 
strand 



DNA 
strand 



Figure 5-45 Depurination and examination. These two reactions are the most frequent spontaneous 
chemical reactions known to create serious DNA damage in cells. Depurination can release guanine (shown 
here), as well as adenine, from DNA. The major type of deamination reaction converts cytosine to an 
altered DNA base, uracil (shown here), but deamination occurs on other bases as well. These reactions 
normally take place in double-helical DNA; for convenience, only one strand is shown. 



An indication of the importance of a double-stranded helix to the safe stor- 
age of genetic information is that all cells use it; only a few small viruses use sin- 
gle-stranded DNA or RNA as their genetic material. The types of repair processes 
described in this section cannot operate on such nucleic acids, and once dam- 
aged, the chance of a permanent nucleotide change occurring in these single- 
stranded genomes of viruses is thus very high. It seems that only organisms with 
tiny genomes (and therefore tiny targets for DNA damage) can afford to encode 
their genetic information in any molecule other than a DNA double helix. 



DNA Damage Can Be Removed by More Than One Pathway 

Cells have multiple pathways to repair their DNA using different enzymes that 
act upon different kinds of lesions. Figure 5-48 shows two of the most common 
pathways. In both, the damage is excised, the original DNA sequence is restored 
by a DNA polymerase that uses the undamaged strand as its template, and a 
remaining break in the double helix is sealed by DNA ligase (see Figure 5-13). 

The two pathways differ in the way in which they remove the damage from 
DNA. The first pathway, called base excision repair, involves a battery of 
enzymes called DNA glycosylases, each of which can recognize a specific type of 
altered base in DNA and catalyze its hydrolytic removal. There are at least six 
types of these enzymes, including those that remove deaminated Cs, deami- 
nated As, different types of alkylated or oxidized bases, bases with opened rings, 
and bases in which a carbon-carbon double bond has been accidentally con- 
verted to a carbon-carbon single bond. How is an altered base detected within 
the context of the double helix? A key step is an enzyme-mediated "flipping- out" 



Figure 5-46 The most common type of thymine dimer. This type of 
damage occurs in the DNA of cells exposed to ultraviolet irradiation (as in 
sunlight). A similar dimer will form between any two neighboring 
pyrimidine bases (C orT residues) in DNA. 
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of the altered nucleotide from the helix, which allows the DNA glycosylase to 
probe all faces of the base for damage (Figure 5-49). It is thought that these 
enzymes travel along DNA using base-flipping to evaluate the status of each 
base. Once an enzyme finds the damaged base that it recognizes, it removes the 
base from its sugar. 

The "missing tooth" created by DNA glycosylase action is recognized by an 
enzyme called AP endonuclease (AP for apurinic or apyrimidinic, endo to signify 
that the nuclease cleaves within the polynucleotide chain), which cuts the phos- 
phodiester backbone, after which the damage is removed and the resulting gap 
repaired (see Figure 5-48A). Depurination, which is by far the most frequent 
type of damage suffered by DNA, also leaves a deoxyribose sugar with a missing 
base. Depurinations are directly repaired beginning with AP endonuclease, fol- 
lowing the bottom half of the pathway in Figure 5-48A. 

The second major repair pathway is called nucleotide excision repair. This 
mechanism can repair the damage caused by almost any large change in the 
structure of the DNA double helix. Such "bulky lesions" include those created by 
the covalent reaction of DNA bases with large hydrocarbons (such as the car- 
cinogen benzopyrene), as well as the various pyrimidine dimers (T-T, T-C, and C- 
C) caused by sunlight. In this pathway, a large multienzyme complex scans the 
DNA for a distortion in the double helix, rather than for a specific base change. 
Once it finds a bulky lesion, it cleaves the phosphodiester backbone of the 
abnormal strand on both sides of the distortion, and a DNA helicase peels away 
a single-strand oligonucleotide containing the lesion. The large gap produced in 
the DNA helix is then repaired by DNA polymerase and DNA ligase (Figure 
5-48B). 

An alternative to base and nucleotide excision-repair processes is direct 
chemical reversal of DNA damage, and this strategy is employed for the rapid 
removal of certain highly mutagenic or cytotoxic lesions. For example, the allevi- 
ation lesion O^methylguanine has its methyl group removed by direct transfer 
to a cysteine residue in the repair protein itself, which is destroyed in the reac- 
tion. In another example, methyl groups in the alkylation lesions 1-methylade- 
nine and 3-methylcytosine are "burnt off" by an iron-dependent demethylase, 
with release of formaldehyde from the methylated DNA and regeneration of the 
native base. 
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Figure 5-47 How chemical modifications of nucleotides produce mutations. (A) Deamination of cytosine, if uncorrected, 
results in the substitution of one base for another when the DNA is replicated. As shown in Figure 5-45, deamination of 
cytosine produces uracil. Uracil differs from cytosine in its base-pairing properties and preferentially base pairs with 
adenine. The DNA replication machinery therefore adds an adenine when it encounters a uracil on the template strand. 
(B) Depurination can lead to the loss of a nucleotide pair. When the replication machinery encounters a missing purine on 
the template strand, it may skip to the next complete nucleotide as illustrated here, thus producing a nucleotide deletion 
in the newly synthesized strand. Many other types of DNA damage (see Figure 5-44), if left uncorrected, also produce 
mutations when the DNA is replicated. 
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Figure 5-48 A comparison of two major DNA repair pathways. (A) Base excision repair. This pathway starts with a DNA 
glycosylase. Here the enzyme uracil DNA glycosylase removes an accidentally deaminated cytosine in DNA. After the action 
of this glycosylase (or another DNA glycosylase that recognizes a different kind of damage), the sugar phosphate with the 
missing base is cut out by the sequential action of AP endonuclease and a phosphodiesterase. (These same enzymes begin 
the repair of depurinated sites directly.) The gap of a single nucleotide is then filled by DNA polymerase and DNA ligase. 
The net result is that the U that was created by accidental deamination is restored to a C. AP endonuclease is so-named 
because it recognizes any site in the DNA helix that contains a deoxyribose sugar with a missing base; such sites can arise 
either by the loss of a purine fapurinic sites) or by the loss of a pyrimidine fapyrimidinic sites). (B) Nucleotide excision repair. 
In bacteria, after a multienzyme complex has recognized a lesion such as a pyrimidine dimer (see Figure 5-46), one cut is 
made on each side of the lesion, and an associated DNA helicase then removes the entire portion of the damaged strand. 
The excision repair machinery in bacteria leaves the gap of 1 2 nucleotides shown. In humans, once the damaged DNA is 
recognized, a helicase is recruited to unwind the DNA duplex locally. Next, the excision nuclease enters and cleaves on 
either side of the damage, leaving a gap of about 30 nucleotides. The nucleotide excision repair machinery in both bacteria 
and humans can recognize and repair many different types of DNA damage. 



Coupling DNA Repair to Transcription Ensures That the Cell's Most 
Important DNA Is Efficiently Repaired 

All of a cell's DNA is under constant surveillance for damage, and the repair 
mechanisms we have described act on all parts of the genome. However, cells 
have a way of directing DNA repair to the DNA sequences that are most urgently 
needed. They do this by linking RNA polymerase, the enzyme that transcribes 
DNA into RNA as the first step in gene expression, to the repair of DNA damage. 
RNA polymerase stalls at DNA lesions and, through the use of coupling proteins, 
directs the repair machinery to these sites. In bacteria, where genes are relatively 
short, the stalled RNA polymerase can be dissociated from the DNA, the DNA is 
repaired, and the gene is transcribed again from the beginning. In eucaryotes, 
where genes can be enormously long, a more complex reaction is used to "back 
up" the RNA polymerase, repair the damage, and then restart the polymerase. 
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Figure 5-49 The recognition of an 
unusual nucleotide in DNA by base- 
flipping. The DNA glycosylase family of 
enzymes recognizes specific bases in the 
conformation shown. Each of these 
enzymes cleaves the glycosyl bond that 
connects a particular recognized base 
(yellow) to the backbone sugar, removing 
it from the DNA. (A) Stick model; 
(B) space-filling model. 



Transcription- coupled repair works with base excision, nucleotide excision, 
and other repair machinery to direct repair immediately to the cell's most 
important DNA sequences, namely those being expressed when the damage 
occurs. Remarkably this type of repair is specific for the template strand of tran- 
scribed DNA; the other strand is repaired with the same speed and efficiency as 
DNA that is not being transcribed at all. Transcription-coupled repair is particu- 
larly advantageous in humans, because only a small fraction of our genome is 
transcribed at any given time. Its importance is seen in individuals with Cock- 
ayne's syndrome, which is caused by a defect in transcription-coupled repair. 
These individuals suffer from growth retardation, skeletal abnormalities, pro- 
gressive neural retardation, and severe sensitivity to sunlight. Most of these 
problems are thought to arise from RNA polymerase molecules that become 
permanently stalled at sites of DNA damage that lie in important genes. 



The Chemistry of the DNA Bases Facilitates Damage Detection 

The DNA double helix seems to be optimally constructed for repair. As noted 
above, it contains a backup copy of all genetic information. Equally impor- 
tantly, the nature of the four bases in DNA makes the distinction between 
undamaged and damaged bases very clear. For example, every possible deam- 
ination event in DNA yields an "unnatural" base, which can be directly recog- 
nized and removed by a specific DNA glycosylase. Hypoxanthine, for example, 
is the simplest purine base capable of pairing specifically with C, but hypox- 
anthine is the direct deamination product of A (Figure 5-50A). The addition of 
a second amino group to hypoxanthine produces G, which cannot be formed 
from A by spontaneous deamination, and whose deamination product (xan- 
thine) is likewise unique. 

As discussed in Chapter 6, RNA is thought, on an evolutionary time-scale, to 
have served as the genetic material before DNA, and it seems likely that the 
genetic code was initially carried in the four nucleotides A, C, G, and U. This 
raises the question of why the U in RNA was replaced in DNA by T (which is 5- 
methyl U). We have seen that the spontaneous deamination of C converts it to 
U, but that this event is rendered relatively harmless by uracil DNA glycosylase. 
However, if DNA contained U as a natural base, the repair system would find it 
difficult to distinguish a deaminated C from a naturally occurring U. 

A special situation occurs in vertebrate DNA, in which selected C 
nucleotides are methylated at specific C-G sequences that are associated with 
inactive genes (discussed in Chapter 7). The accidental deamination of these 
methylated C nucleotides produces the natural nucleotide T (Figure 5-50B) in a 
mismatched base pair with a G on the opposite DNA strand. To help in repairing 
deaminated methylated C nucleotides, a special DNA glycosylase recognizes a 
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mismatched base pair involving T in the sequence T-G and removes the T. This 
DNA repair mechanism must be relatively ineffective, however, because methy- 
lated C nucleotides are common sites for mutations in vertebrate DNA. It is 
striking that, even though only about 3% of the C nucleotides in human DNA are 
methylated, mutations in these methylated nucleotides account for about one- 
third of the single-base mutations that have been observed in inherited human 
diseases. 
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Figure 5-50 The deamination of DNA 
nucleotides. In each case, the oxygen 
atom that is added in this reaction with 
water is colored red. (A) The spontaneous 
deamination products of A and G are 
recognizable as unnatural when they 
occur in DNA and thus are readily 
recognized and repaired. The 
deamination of C to U was also illustrated 
in Figure 5-45; T has no amino group to 
remove. (B) About 3% of the 
C nucleotides in vertebrate DNAs are 
methylated to help in controlling gene 
expression (discussed in Chapter 7). 
When these 5-methyl C nucleotides are 
accidentally deaminated, they form the 
natural nucleotide T. However, this Twill 
be paired with a G on the opposite 
strand, forming a mismatched base pair. 
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Special DNA Polymerases Are Used in Emergencies to Repair DNA 

If a cell's DNA is heavily damaged, the repair mechanisms that we have dis- 
cussed are often insufficient to cope with it. In these cases, a different strategy is 
called into play one that entails some risk to the cell. The highly accurate 
replicative DNA polymerases stall when they encounter damaged DNA, and in 
emergencies cells employ versatile but less accurate back-up polymerases to 
replicate through the DNA damage. 

Human cells contain more than 10 such DNA polymerases, some of which 
can recognize a specific type of DNA damage and specifically add the nucleotide 
required to restore the initial sequence. Others make only "good guesses," espe- 
cially when the template base has been extensively damaged. These enzymes are 
not as accurate as the normal replicative polymerases when they copy a normal 
DNA sequence. For one thing, the backup polymerases lack exonucleolytic 
proofreading activity; in addition, many are much less discriminating than the 
replicative polymerase in choosing which nucleotide to initially incorporate. 
Presumably for this reason, each such polymerase molecule is given a chance to 
add only one or a few nucleotides. Although the details of these fascinating reac- 
tions are still being worked out, they provide elegant testimony to the care with 
which organisms maintain the integrity of their DNA. 



Double-Strand Breaks Are Efficiently Repaired 

An especially dangerous type of DNA damage occurs when both strands of the 
double helix are broken, leaving no intact template strand to enable accurate 
repair. Ionizing radiation, replication errors, oxidizing agents, and other 
metabolites produced in the cell cause breaks of this type. If these lesions were 
left unrepaired, they would quickly lead to the breakdown of chromosomes into 
smaller fragments and to the loss of genes when the cell divides. However, two 
distinct mechanisms have evolved to ameliorate this type of damage (Figure 
5-51). The simplest to understand is nonhomologous end -joining, in which the 
broken ends are simply brought together and rejoined by DNA ligation, gener- 
ally with the loss of one or more nucleotides at the site of joining (Figure 5-52). 
This end-joining mechanism, which might be seen as a "quick and dirty" solu- 
tion to the repair of double-strand breaks, is common in mammalian somatic 
cells. Although a change in the DNA sequence (a mutation) results at the site of 
breakage, so little of the mammalian genome codes for proteins that this mech- 
anism is apparently an acceptable solution to the problem of rejoining broken 



(A) NONHOMOLOGOUS END JOINING (B) HOMOLOGOUS RECOMBINATION 

accidental double-strand break 







Dtides 
dation 




1 


loss of nude 
due to degra 

from ends 

■ 


1 


loss of nucleotides 
due to degradation 
from ends 






^^™ ^^^^^^^^^^ m 


1 


end joining 


• 


end processing and 

homologous 

recombination 


A 




^^^^^ 


deletion of DNA sequence 


damage repai 
information f 


red accurately using 
rom sister chromatid 



Figure 5-51 Two different ways to repair 
double-strand breaks. (A) Nonhomologous 
end-joining alters the original DNA 
sequence when repairing a broken 
chromosome. These alterations can be 
either deletions (as shown) or short 
insertions. (B) Repairing double-strand 
breaks by homologous recombination is 
more difficult to accomplish, but this type 
of repair restores the original DNA 
sequence. It typically takes place after the 
DNA has been duplicated but before the 
cell has divided. Details of the homologous 
recombination pathway will be presented 
later (see Figure 5-61). 
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chromosomes. By the time a human reaches the age of 70, the typical somatic 
cell contains over 2000 such "scars", distributed throughout its genome, repre- 
senting places where DNA has been inaccurately repaired by nonhomologous 
end-joining. As previously discussed, the specialized structure of telomeres pre- 
vents the natural ends of chromosomes from being mistaken for broken DNA 
and repaired. 

A much more accurate type of double-strand break repair occurs in newly 
replicated DNA (Figure 5-51B). Here, the DNA is repaired using the sister chro- 
matid as a template. This reaction is an example of homologous recombination, 
and we consider its mechanism later in this chapter. Most organisms employ 
both nonhomologous end-joining and homologous recombination to repair 
double-strand breaks in DNA. Nonhomologous end-joining predominates in 
humans; homologous recombination is used only during and shortly after DNA 
replication (in S and G2 phases), when sister chromatids are available to serve as 
templates. 



DNA Damage Delays Progression of the Cell Cycle 

We have just seen that cells contain multiple enzyme systems that can recognize 
and repair many types of DNA damage. Because of the importance of maintain- 
ing intact, undamaged DNA from generation to generation, eucaryotic cells 
have an additional mechanism that maximizes the effectiveness of their DNA 
repair enzymes: they delay progression of the cell cycle until DNA repair is com- 
plete. As discussed in detail in Chapter 17, the orderly progression of the cell 
cycle is maintained through the use of checkpoints that ensure the completion 
of one step before the next step can begin. At several of these cell-cycle check- 
points, the cycle stops if damaged DNA is detected. Thus, in mammalian cells, 
the presence of DNA damage can block entry from d into S phase, it can slow S 
phase once it has begun, and it can block the transition from S phase to M phase. 
These delays facilitate DNA repair by providing the time needed for the repair to 
reach completion. 
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Figure 5-52 Nonhomologous end- 
joining. (A) A central role is played 
by the Ku protein, a heterodimer 
that grasps the broken chromosome 
ends. The additional proteins shown 
are needed to hold the broken ends 
together while they are processed 
and eventually joined covalently. 
(B) Three-dimensional structure of a 
Ku heterodimer bound to the end of 
a duplex DNA fragment. The 
Ku protein is also essential forV(D)J 
joining, a specific recombination 
process through which antibody 
and T cell receptor diversity is 
generated in developing B and 
T cells (discussed in Chapter 25). 
V(D)J joining and nonhomologous 
end-joining show many similarities 
in mechanism but the former relies 
on specific double-strand breaks 
produced deliberately by the cell. 
(B, from J.R. Walker, R.A. Corpina and 
J. Goldberg, Nature 41 2:607-61 4, 
2001 . With permission from 
Macmillan Publishers Ltd.) 
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DNA damage also results in an increased synthesis of some DNA repair 
enzymes. The importance of special signaling mechanisms that respond to DNA 
damage is indicated by the phenotype of humans who are born with defects in 
the gene that encodes the ATM protein. These individuals have the disease 
ataxia telangiectasia (AT), the symptoms of which include neurodegeneration, a 
predisposition to cancer, and genome instability. The ATM protein is a large 
kinase needed to generate the intracellular signals that produce a response to 
many types of spontaneous DNA damage, and individuals with defects in this 
protein therefore suffer from the effects of unrepaired DNA lesions. 



Summary 

Genetic information can be stored stably in DNA sequences only because a large set of 
DNA repair enzymes continuously scan the DNA and replace any damaged 
nucleotides. Most types of DNA repair depend on the presence of a separate copy of the 
genetic information in each of the two strands of the DNA double helix. An accidental 
lesion on one strand can therefore be cut out by a repair enzyme and a corrected strand 
resynthesized by reference to the information in the undamaged strand. 

Most of the damage to DNA bases is excised by one of two major DNA repair path- 
ways. In base excision repair, the altered base is removed by a DNA glycosylase enzyme, 
followed by excision of the resulting sugar phosphate. In nucleotide excision repair, a 
small section of the DNA strand surrounding the damage is removed from the DNA 
double helix as an oligonucleotide. In both cases, the gap left in the DNA helix is filled 
in by the sequential action of DNA polymerase and DNA ligase, using the undamaged 
DNA strand as the template. Some types of DNA damage can be repaired by a different 
strategy— the direct chemical reversal of the damage— which is carried out by special- 
ized repair proteins. 

Other critical repair systems— based on either nonhomologous end-joining or 
homologous recombination— reseal the accidental double-strand breaks that occur in 
the DNA helix. In most cells, an elevated level of DNA damage causes a delay in the cell 
cycle via checkpoint mechanisms, which ensure that DNA damage is repaired before a 
cell divides. 



HOMOLOGOUS RECOMBINATION 

In the two preceding sections, we discussed the mechanisms that allow the DNA 
sequences in cells to be maintained from generation to generation with very lit- 
tle change. In this section, we further explore one of these mechanisms, homol- 
ogous recombination. Although crucial for accurately repairing double-strand 
breaks (see Figure 5-51B) and other types of DNA damage, homologous recom- 
bination, as we shall see, can also rearrange DNA sequences. These rearrange- 
ments often alter the particular versions of genes present in an individual 
genome, as well as the timing and the level of their expression. In a population, 
the type of genetic variation produced by this and other types of genetic recom- 
bination is crucial for facilitating the evolution of organisms in response to a 
changing environment. 



Homologous Recombination Has Many Uses in the Cell 

In homologous recombination (also known as general recombination), genetic 
exchange takes place between a pair of homologous DNA sequences, that is, 
DNA sequences similar or identical in nucleotide sequence. Homologous 
recombination has many uses in the cell, but three are of paramount impor- 
tance. The most widespread use is in accurately repairing double-strand breaks, 
as introduced in the previous section (Figure 5-51B). Although double-strand 
breaks can result from radiation and reactive chemicals, many arise from DNA 
replication forks that become stalled or broken. This application of homologous 
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Figure 5-53 Repair of a broken replication fork by homologous 
recombination. When a moving replication fork encounters a single-strand 
break, it will collapse but can be repaired by homologous recombination. 
As shown, the initial strand invasion requires a free 3' end generated by a 
nuclease that degrades the 5' end of the complementary strand. 
Recombination then begins with strand invasion, as described in detail in 
subsequent figures. Arrowheads represent 3' ends of DNA strands. Green 
strands represent the new DNA synthesis that takes place after the 
replication fork has broken. Note that, in this mechanism, the fork moves 
past the site that was nicked on the original template by using an 
undamaged copy of the site as its template. (Adapted from M.M. Cox, Proc. 
Natl Acad. Sci. USA. 98:81 73-81 80, 2001 . With permission from National 
Academy of Sciences.) 



recombination is essential for every proliferating cell, because accidents occur 
during nearly every round of DNA replication. 

Many types of events can cause the replication fork to break during the repli- 
cation process. Consider just one example: a single-strand nick or gap in the 
parental DNA helix just ahead of a replication fork. When the fork reaches this 
lesion, it falls apart — resulting in one broken and one intact daughter chromo- 
some. However, a series of recombination reactions, which can begin with a 
strand invasion process that triggers DNA synthesis by DNA polymerase, can 
flawlessly repair the broken chromosome (Figure 5-53). 

In addition, homologous recombination is used to exchange bits of genetic 
information between two different chromosomes to create new combinations of 
DNA sequences in each chromosome. The potential evolutionary benefit of this 
type of gene mixing is that it creates an array of new, perhaps beneficial, combi- 
nations of genes. During meiosis in fungi, plants, and animals, homologous 
recombination also plays an important mechanical role in assuring accurate 
chromosome segregation. In this section, we consider only the universal roles of 
homologous recombination, those of repairing DNA damage and of mediating 
genetic exchange. Its more specialized mechanical role in chromosome segrega- 
tion during meiosis will be discussed in Chapter 21. 



Homologous Recombination Has Common Features in All Cells 

The current view of homologous recombination as a critical DNA repair mecha- 
nism in all cells evolved slowly from its initial discovery as a key component in 
the specialized process of meiosis in plants and animals. The subsequent recog- 
nition that homologous recombination also occurs in less complex unicellular 
organisms made it much more amenable to molecular analyses. Thus, most of 
what we know about the biochemistry of genetic recombination was originally 
derived from studies of bacteria, especially of E. coli and its viruses, as well as 
from experiments with simple eucaryotes such as yeasts. For these organisms 
with short generation times and relatively small genomes, it was possible to iso- 
late a large set of mutants with defects in their recombination processes. The 
protein altered in each mutant was then identified and, ultimately, studied bio- 
chemically. More recently, close relatives of these proteins have been discovered 
and extensively characterized in Drosophila, mice, and humans. These studies 
reveal that the fundamental processes that catalyze homologous recombination 
are common to all cells, as we shall now discuss. 
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DNA Base-Pairing Guides Homologous Recombination 

The hallmark of homologous recombination is that it takes place only between 
DNA duplexes that have extensive regions of sequence similarity (homology). 
Not surprisingly, base-pairing underlies this requirement, and two DNA 
duplexes that are undergoing homologous recombination "sample" each other's 
DNA sequence by engaging in extensive base-pairing between a single strand 
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from one DNA duplex and the complementary single strand from the other. The 
match need not be perfect, but it must be very close for homologous recombi- 
nation to succeed. 

In its simplest form, this type of base-pairing interaction can be mimicked 
in a test tube by allowing a DNA double helix to re-form from its separated sin- 
gle strands. This process, called DNA renaturation or hybridization, occurs 
when a rare random collision juxtaposes complementary nucleotide sequences 
on two matching DNA single strands, allowing the formation of a short stretch 
of double helix between them. This relatively slow helix nucleation step is fol- 
lowed by a very rapid "zippering" step, as the region of double helix is extended 
to maximize the number of base-pairing interactions (Figure 5-54). 

Formation of a new double helix in this way requires that the annealing 
strands be in an open, unfolded conformation. For this reason, in vitro 
hybridization reactions are performed either at high temperature or in the pres- 
ence of an organic solvent such as formamide; these conditions "melt out" the 
short hairpin helices that result from the base-pairing interactions that occur 
within a single strand that folds back on itself. Most cells cannot survive such 
harsh conditions and instead use single-strand DNA-binding proteins (see 
p. 273) to melt out the hairpin helices. Single-strand DNA-binding proteins are 
essential for DNA replication (as described earlier) as well as for homologous 
recombination; they bind tightly and cooperatively to the sugar-phosphate 
backbone of all single-stranded DNA regions of DNA, holding them in an 
extended conformation with the bases exposed (see Figures 5-16 and 5-17). In 
this extended conformation, a DNA single strand can base-pair efficiently either 
with a nucleoside triphosphate molecule (in DNA replication) or with a comple- 
mentary section of another DNA single strand (as part of a genetic recombina- 
tion process). 

DNA hybridization creates a region of DNA helix formed from strands that 
originate from two different DNA molecules. The formation of such a region, 
known as a heteroduplex, is an essential step in any homologous recombination 
process. Because the vast majority of DNA inside the cell is double-stranded, the 
"test tube" model of DNA hybridization cannot fully explain how this process 
occurs in the cell. Indeed, special mechanisms are required to start homologous 
recombination between two double-stranded DNA molecules of similar 
nucleotide sequence. Crucial to these mechanisms are proteins that allow DNA 
hybridization to occur in the cell through strand invasion— the pairing of a 
region of single-stranded DNA with a complementary strand in a different DNA 
double helix — as we describe next. 



Figure 5-54 DNA hybridization. DNA double helices re-form from their separated strands in a reaction that 
depends on the random collision of two complementary DNA strands. The vast majority of such collisions 
are not productive, as shown on the left, but a few result in a short region where complementary base pairs 
have formed (helix nucleation). A rapid zippering then leads to the formation of a complete double helix. 
Through this trial-and-error process, a DNA strand will find its complementary partner even in the midst of 
millions of nonmatching DNA strands. 
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The RecA Protein and its Homologs Enable a DNA Single Strand 
to Pair with a Homologous Region of DNA Double Helix 

Because extensive base-pair interactions cannot occur between two intact DNA 
double helices, the DNA hybridization that is critical for homologous recombi- 
nation can begin only after a DNA strand from one DNA helix is freed from pair- 
ing with its complementary strand, thereby making its nucleotide available for 
pairing with a second DNA helix. In the example that was previously illustrated 
in Figure 5-53, this free single strand is formed when a replication fork encoun- 
ters a DNA nick, falls apart (creating a new double-stranded end), and an exonu- 
clease degrades the 5' end at the break, leaving an unpaired single strand at its 
3' end. In other applications of homologous recombination, single stranded 
regions are generated in similar ways, as we discuss later. 

The single-strand at the 3' DNA end is acted upon by several specialized 
proteins that direct it to invade a homologous DNA duplex. Of central impor- 
tance is the RecA protein, its name in E.coli, and its homolog Rad51, its name 
in virtually all eucaryotic organisms (Figure 5-55). Like a single-strand DNA- 
binding protein, the RecA type of protein binds tightly and in long cooperative 
clusters to single-stranded DNA forming a nucleoprotein filament. Because 
each RecA monomer has more than one DNA-binding site, a RecA filament can 
hold a single strand and a double helix together (Figure 5-56). This arrange- 
ment allows RecA to catalyze a multistep DNA synapsis reaction that occurs 
between a DNA double helix and a homologous region of single-stranded DNA. 
In the first step, the RecA protein intertwines the DNA single strand and the 
DNA duplex in a sequence-independent manner. Next, the DNA single strand 
"searches" the duplex for homologous sequences. Exactly how this searching 
and eventual recognition occurs is not understood, but it may involve transient 
base pairs formed between the single strand and bases that flip out from the 
duplex DNA (see Figure 5-49). Once a homologous sequence has been located, 
a strand invasion occurs: the single strand displaces one strand of the duplex as 
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Figure 5-55 The structure of the RecA 
and Rad51 protein-DNA filaments. 

(A) The Rad51 protein, a human homolog 
of the bacterial RecA protein, is bound to 
a DNA single strand. Three successive 
protein monomers in this helical filament 
are colored. (B) A short section of the 
RecA filament, with the three- 
dimensional structure of the protein 
fitted to the image of the filament 
determined by electron microscopy. 
There are about six RecA monomers per 
turn of the helix, holding 18 nucleotides 
of single-stranded DNA that is stretched 
out by the protein. The exact path of the 
DNA in this structure is not known. 
{A, courtesy of Edward Egelman; B, from 
X. Yu et al., J. Mol. Biol. 283:985-992, 1 998. 
With permission from Academic Press.) 
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Figure 5-56 DNA synapsis catalyzed by 
the RecA protein. In vitro experiments 
show that several types of complex are 
formed between a DNA single strand 
(red) covered with RecA protein (blue) 
and a DNA double helix (green). First a 
non-base-paired complex is formed, 
which is converted into a "joint molecule" 
as a homologous sequence is found. This 
complex is dynamic and spins out a DNA 
heteroduplex (one strand green and the 
other strand red) plus a displaced single 
strand from the original helix (green). 
Thus, the structure shown in this diagram 
migrates to the left, reeling in the "input 
DNAs" while producing the "output 
DNAs." (Adapted from S.C. West, Annu. 
Rev. Biochem. 61 :603-640, 1 992. With 
permission from Annual Reviews.) 
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it forms conventional base pairs with the other strand. The result is a heterodu- 
plex— a region of DNA double helix formed by the pairing of two DNA strands 
that were initially part of two different DNA molecules (see Figure 5-56). 

The search for homology and the invasion of a single strand into a DNA 
duplex are the critical reactions that initiate homologous recombination. They 
require, in addition to RecA-like proteins and single-strand binding proteins, 
several proteins with specialized functions. For example, Rad52 protein dis- 
places the single-strand binding proteins allowing the binding of Rad51 
molecules, and in addition, promotes the annealing of complementary single- 
strands (Figure 5-57). 

The short heteroduplex region formed, where the invading single strand has 
paired with its complementary strand in the DNA duplex, is often greatly 
enlarged by a process called branch migration, as we now discuss. 



Branch Migration Can either Enlarge Heteroduplex Regions or 
Release Newly Synthesized DNA as a Single Strand 

Once a strand invasion reaction has occurred, the point of strand exchange (the 
"branch point") can move through a process called branch migration (Figure 
5-58). In this reaction, an unpaired region of one of the single strands displaces 
a paired region of the other single strand, moving the branch point without 
changing the total number of DNA base pairs. Although spontaneous branch 
migration can occur, it proceeds equally in both directions, so it makes little net 
progress (Figure 5-58A). Specialized DNA helicases, however, can catalyze uni- 
directional branch migration, readily producing a region of heteroduplex DNA 
that can be thousands of base pairs long (Figure 5-58B). 

In a related reaction, DNA synthesis catalyzed by DNA polymerase can drive 
a unidirectional branch migration process through which the newly synthesized 
DNA is displaced as a single strand, mimicking the way that a newly synthesized 
RNA chain is released by RNA polymerase. This form of DNA synthesis appears 
to be used in several homologous recombination processes, including the dou- 
ble-strand break repair processes to be described next. 



Homologous Recombination Can Flawlessly Repair Double- 
Stranded Breaks in DNA 

Earlier in this chapter, we discussed the dire problems created by double -strand 
breaks in DNA, and we saw that cells can repair these breaks in two ways. Non- 
homologous end-joining (see Figure 5-51) occurs without a template and creates 




Figure 5-57 Structure of a portion of the 
Rad52 protein. (A) This doughnut-shaped 
structure is composed of 1 1 subunits. 
(B) Single-stranded DNA has been 
modeled into the deep groove running 
along the protein surface. The bases of the 
DNA are exposed, a configuration that is 
proposed to mediate the annealing of two 
complementary single strands. (From 
M.R. Singleton et al., Proc. Not! Acad. Sci. 
U.S.A. 99:1 3492-1 3497, 2002. With 
permission from the National Academy 
of Sciences.) 
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(B) PROTEIN-DIRECTED BRANCH 
MIGRATION 



Figure 5-58 Two types of DNA branch 
migration observed in experiments in 
vitro. (A) Spontaneous branch migration 
is a back-and-forth, random-walk process, 
and it therefore makes little progress over 
long distances. (B) Protein-directed 
branch migration requires energy and it 
moves the branch point at a uniform rate 
in one direction. 



HOMOLOGOUS RECOMBINATION 



309 



a mutation at the site at which two DNA duplexes are joined. It can also inad- 
vertently join together segments from two different chromosomes creating 
chromosome translocations, many of which have serious consequences for the 
cell. In contrast to nonhomologous end-joining, homologous recombination 
can repair double-stranded breaks accurately, without any loss or alteration of 
nucleotides at the site of repair (Figure 5-59). In most cells, recombination- 
mediated double-strand break repair occurs only after the cell has replicated its 
DNA, when one nearby daughter DNA duplex can serve as the template for 
repair of the other. 

Homologous recombination can also be used to repair many other types of 
DNA damage, making it perhaps the most versatile DNA repair mechanism 
available to the cell; the "all-purpose" nature of recombinational repair probably 
explains why its mechanism and the proteins that carry it out have been con- 
served in virtually all cells on Earth. 
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Figure 5-59 Mechanism of double-strand 
break repair by homologous 
recombination. This Is the preferred 
method for repairing DNA double-strand 
breaks that arise shortly after the DNA has 
been replicated and the two sister 
chromatids are still held together. This 
reaction uses many of the same proteins 
as that of Figure 5-53 and proceeds 
through the same basic intermediate 
steps. In general, homologous 
recombination can be regarded as a 
flexible series of reactions, with the exact 
pathway differing from one case to the 
next. For example, the length of the repair 
"patch" can vary considerably depending 
on the extent of 5' processing, branch 
migration, and new DNA synthesis. (See 
M. McVey, J. LaRocque, M.D. Adams, and 
J. Sekelsky, Proc. Not! Acad Sci. U.S.A. 
101:15694-15699,2004.) 



310 Chapter 5: DNA Replication, Repair, and Recombination 



Figure 5-60 Experiment demonstrating the rapid localization of repair 
proteins to DNA double-strand breaks. Human fibroblasts were 
x-irradiated to produce DNA double-strand breaks. Before the x-rays struck 
the cells, they were passed through a microscopic grid with x-ray- 
absorbing "bars" spaced 1 |um apart. This produced a striped pattern of 
DNA damage, allowing a comparison of damaged and undamaged DNA in 
the same nucleus. (A) Total DNA in a fibroblast nucleus stained with the 
dye DAPI. (B) Sites of new DNA synthesis indicated by incorporation of 
BudR (a thymine analog) and subsequent staining with FITC-coupled 
antibodies to BudR. (C) Localization of the Mre1 1 complex to damaged 
DNA as visualized by Texas-red complexed to antibodies against the Mrel 1 
subunit. It has been proposed that the Mre1 1 complex initially recognizes 
double-strand breaks in the cell and then mobilizes additional proteins to 
repair the breaks through homologous recombination (see Figure 5-59). 
(A), (B) and (C) were processed 30 minutes after x-irradiation. (From 
B.E. Nelms et al., Science 280:590-592, 1998. With permission from AAAS.) 



Cells Carefully Regulate the Use of Homologous Recombination 
in DNA Repair 

Although homologous recombination neatly solves the problem of accurately 
repairing double-strand breaks and other types of DNA damage, it does present 
some dangers to the cell and therefore must be tightly regulated. For example, 
the DNA sequence in one chromosomal homolog can be rendered non-func- 
tional by "repairing" it using the other chromosomal homolog as the template. A 
loss of heterozygosity of this type is frequently a critical step in the formation of 
cancers (as discussed in Chapter 20), and cells have poorly understood mecha- 
nisms to minimize it. Although relatively rare in normal cells, loss of heterozy- 
gosity can be viewed as an unfortunate side effect of the versatility of homolo- 
gous recombination. 

Another type of control over recombination repair, which is found in nearly 
all eucaryotic cells, prevents recombination-based "repair" in the absence of 
DNA damage. The enzymes that caialyze recombination repair are normally 
made at relatively high levels hi eucaryotes and are dispersed throughout the 
nucleus. In response to DNA damage, they rapidly converge on the sites of dam- 
age and eventually form "repair factories' 1 where many DNA lesions are appar- 
ently brought together and repaired (Figure 5-60). Tins rapid mobilization of 
repair proteins to DNA damage is tightly controlled by the cell and requires a 
series of additional proteins. Two of these, the Brcal and Brca2 proteins, were 
first discovered because mutations in their genes lead to a greatly increased fre- 
quency of breast cancer. Whereas the removal of a protein essential for homolo- 
gous recombination (such as the human Rad51 protein) will kill a cell, an alter- 
ation in an accessory protein can lead to inefficient repair The subsequent accu- 
mulation of DNA damage can, in a small proportion of cells, give rise to a can- 
cer. Brca2 binds to the RadSl protein preventing its polymerization on DNA and 
(hereby maintaining it in an inactive form, it is thought that Brca2 helps to bring 
RadSl protein rapidly to sites of damage and, once in place, to release it in its 
active form. 

In Chapter 20, we shall see that both too much and too little homologous 
recombination can lead to cancer in humans, the former through an enhanced 
loss of heterozygosity and the latter through an increased mutation rate caused 
by inefficient DNA repair. Clearly; a delicate balance has evolved that keeps this 
process in check on undamaged DNA. while still allowing it to act efficiently and 
rapidly on DNA lesions as soon as they arise. 

Bacteria also carefully regulate their recombination and other DNA repair 
enzymes, but this occurs largely by controlling their intracellular levels. In 
response to severe DNA damage, E. coli increases the transcription of many DNA 
repair enzymes as part of the so-called SOS response. These include nucleotide 
excision repair enzymes, error-prone DNA polymerases that can use damaged 
DNA as a template, and proteins that mediate homologous recombination. 
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Studies of mutant bacteria deficient in different parts of the SOS response 
demonstrate that the newly synthesized proteins have two effects. First, as 
would be expected, the induction of these additional DNA repair enzymes 
increases cell survival after DNA damage. Second, several of the induced pro- 
teins transiently increase the mutation rate by increasing the number of errors 
made in copying DNA sequences. The errors are caused by the production of 
low-fidelity DNA polymerases that can efficiently use damaged DNA as a tem- 
plate for DNA synthesis. While this "error-prone" DNA repair can be harmful to 
individual bacterial cells, it is presumed to be advantageous in the long term 
because it produces a burst of genetic variability in the bacterial population that 
increases the likelihood of a mutant cell arising that is better able to survive in 
the altered environment. 



Holliday Junctions Are Often Formed During Homologous 
Recombination Events 

Homologous recombination can be viewed as a group of related reactions that 
use single-strand invasion, branch migration, and limited DNA synthesis to 
exchange DNA between two double helices of similar nucleotide sequence. Hav- 
ing discussed its role in accurately repairing damaged DNA, we now turn to 
homologous recombination as a means to generate DNA molecules of novel 
sequence. During this process a special DNA intermediate often forms that con- 
tains four DNA strands shared between two DNA helices. In this key intermedi- 
ate, known as a Holliday junction, or cross-strand exchange, two DNA strands 
switch partners between two double helices. The Holliday junction can adopt 
multiple conformations, and a special set of recombination proteins binds to, 
and thereby stabilizes, the open, symmetric isomer (Figure 5-61). By using the 
energy of ATP hydrolysis to coordinate two branch migration reactions, these 
proteins can move the point at which the two DNA helices are joined rapidly 
along the two helices (Figure 5-62). 

The four-stranded DNA structures produced by homologous recombination 
are only transiently present in cells. Thus, to regenerate two separate DNA 
helices, and thus end the recombination process, the strands connecting the 
two helices in a Holliday junction must eventually be cut, a process referred to 
as resolution. In bacteria, where we understand this process the best, a special- 
ized endonuclease (called RuvC) cleaves the Holliday junctions leaving nicks in 
the DNA that DNA ligase can seal easily However, during the meiotic processes 
that produce germ cells in eucaryotes (sperm and egg in animals), the resolution 
mechanisms appear to be much more complicated. 

As we discuss in Chapter 21, extensive homologous recombination occurs as 
an integral part of the process whereby chromosomes are parceled out to germ 
cells during meiosis. Both chromosome crossing over and gene conversion result 
from these recombination events, producing hybrid chromosomes that contain 
genetic information from both the maternal and paternal homologs (Figure 
5-63A). 

As we shall see next, in meiosis the crossing over and gene conversion are 
both generated by homologous recombination mechanisms that, at their core, 
resemble those used to repair double-strand breaks. 




Figure 5-61 A Holliday junction. The 

initially formed structure is usually drawn 
with two crossing (inside) strands and 
two noncrossing (outside) strands. An 
isomerization of the Holliday junction 
produces an open, symmetrical structure 
<CTAG>. This is the form that is bound by 
the RuvA and RuvB proteins (see Figure 
5-62). The Holliday junction is named for 
the scientist who first proposed its 
formation. 
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Figure 5-62 Enzyme-catalyzed double 
branch migration at a Holliday junction. 

In £ coli, a tetramer of the RuvA protein 
(green) and two hexamers of the RuvB 
protein (pale gray) bind to the open form 
of the junction. The RuvB protein, which 
resembles the hexameric helicases used 
in DNA replication (Figure 5-15), uses the 
energy of ATP hydrolysis to move the 
crossover point rapidly along the paired 
DNA helices, extending the heteroduplex 
region as shown. (Image courtesy of 
P. Artymiuk; modified from S.C. West, Ceil 
94:699-701, 1998. With permission from 
Elsevier.) 



MOVES IN 

Meiotic Recombination Begins with a Programmed Double 
Strand Break 

Homologous recombination in meiosis starts with a bold stroke: a specialized 
protein (called Spoil in budding yeast) breaks both strands of the DNA double 
helix in one of the recombining chromosomes. Like a topoisomerase, the reac- 
tion of Spoil with DNA leaves the protein covalently bound to the broken DNA 
(see Figure 5-22). A specialized nuclease then rapidly processes the ends bound 
by Spoil, removing the protein and leaving protruding 3' single-strand ends. At 
this point, a series of strand invasions and branch migrations take place that fre- 
quently produce an intermediate consisting of two closely spaced Holliday junc- 
tions, often called a double Holliday junction (Figure 5-64). 

Although some of the same proteins that function in double-strand break 
repair are used in meiosis, these proteins are directed by several meiosis-specific 
proteins to perform their tasks somewhat differently resulting in the different 
DNA intermediates formed (compare Figure 5-59 with Figure 5-64). Another 
important difference is that, in meiosis, recombination occurs preferentially 
between maternal and paternal chromosomal homologs rather than between the 
newly replicated, identical DNA duplexes that pair in double-strand break repair. 

There are two different ways to resolve the double Holliday intermediate 
shown in Figure 5-64. In the conceptually simplest resolution ("non- 
crossover"), the original pairs of crossing strands are cut at both Holliday junc- 
tions in the same way which causes the two original helices to separate from 
one another in a form unaltered except for the region between the two junc- 
tions (see Figure 5-64, left; in that region, each helix contains a short region of 
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Figure 5-63 Chromosome crossing over 
occurs in meiosis. Meiosis is the process 
by which a diploid cell gives rise to four 
haploid germ cells, as described in detail 
in Chapter 21. Meiosis produces germ 
cells in which the paternal and maternal 
genetic information (red and blue) has 
been reassorted through chromosome 
crossovers. In addition, many short 
regions of gene conversion occur, as 
indicated. 
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Figure 5-64 Homologous recombination 
in meiosis can generate crossovers. Once 
the meiosis-specific protein Spol 1 and the 
Mrel 1 complex break the duplex DNA and 
process the ends, homologous 
recombination proceeds through a double 
Holliday junction. Many of the steps that 
produce chromosome crossovers in 
meiosis resemble those used to repair 
double-strand breaks (Figure 5-59). 
However, in meiosis, the process is tightly 
coupled to other meiotic events and it is 
directed by proteins, such as Spo1 1, that 
are only produced in meiotic cells. 



chromosomes without crossover 



chromosomes with crossover 



heteroduplex adjacent to a region of homoduplex produced by DNA synthesis). 
If, on the other hand, the two Holliday junctions are resolved oppositely (one 
cleaved on the original pair of crossing strands and the other on the non-cross- 
ing strands), the outcome is much more profound. In this type of resolution 
("crossover"), the portions of each chromosome upstream and downstream 
from the two Holliday junctions are swapped, creating two chromosomes that 
have crossed over (see Figure 5-64, right). 

Relatively few of the Spoil-mediated double-strand breaks become 
crossovers; the majority (90% in humans, for example) are resolved as non- 
crossovers. It is not understood how this choice is made, but it apparently happens 
early in the recombination process, before the Holliday junctions are formed. The 
relatively few crossovers that do form are distributed along chromosomes such 
that the presence of a crossover in one position somehow inhibits crossing over in 
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heteroduplex 



neighboring regions. Termed crossover control this fascinating but poorly under- 
stood regulatory mechanism presumably ensures the roughly even distribution of 
crossover points along chromosomes. For many organisms, roughly two 
crossovers per chromosome occur during each meiosis, one on each arm. As dis- 
cussed in detail in Chapter 21, these crossovers play an important mechanical role 
in the proper segregation of chromosomes during meiosis. 

Whether a meiotic recombination event is resolved as a crossover or a non- 
crossover, the recombination machinery leaves behind a heteroduplex region 
where a strand from the parental homolog is base-paired with a strand from the 
maternal homolog (Figure 5-65). These heteroduplex regions can tolerate a 
small percentage of mismatched base pairs, and they often extend for thousands 
of nucleotide pairs. Because of the many non-crossover events in meiosis, they 
produce scattered sites in the germ cells where short DNA sequences from one 
homolog have been pasted into the other homolog. And in all cases, they mark 
sites of potential gene conversion—that is, sites where the four haploid chromo- 
somes produced by meiosis contain three copies of a short DNA sequence from 
one homolog and only one copy of this sequence from the other homolog (see 
Figure 5-63), as will now be explained. 



heteroduplex 



Figure 5-65 Heteroduplexes formed 
during meiosis. Heteroduplex DNA is 
present at sites of recombination that are 
resolved either as crossovers or non- 
crossovers. 



Homologous Recombination Often Results in Gene Conversion 

In sexually reproducing organisms, it is a fundamental law of genetics that each 
parent makes an equal genetic contribution to an offspring, which inherits one 
complete set of nuclear genes from the father and one complete set from the 
mother. Underlying this law is the highly accurate parceling out of chromo- 
somes to the germ cells (eggs and sperm) that takes place during meiosis. Thus, 
when a diploid cell undergoes meiosis to produce four haploid germ cells (dis- 
cussed in Chapter 21), exactly half of the genes distributed among these four 
cells should be maternal (genes that the diploid cell inherited from its mother) 
and the other half paternal (genes that the diploid cell inherited from its father) 
In some organisms (fungi, for example), it is possible to recover and analyze all 
four of the haploid gametes produced from a single cell by meiosis. Studies in 
such organisms have revealed rare cases in which the parceling out of genes vio- 
lates the standard rules of genetics. Occasionally, for example, meiosis yields 
three copies of the maternal version of a gene and only one copy of the paternal 
allele (see Figure 5-63). Alternative versions of the same gene are called alleles, 
and the divergence from their expected distribution during meiosis is known as 
gene conversion. Genetic studies show that only small sections of DNA typically 
undergo gene conversion, and in many cases only a part of a gene is changed. 

Several pathways in the cell can lead to gene conversion. First, the DNA syn- 
thesis that accompanies the early steps of homologous recombination will pro- 
duce regions of the double Holliday junction where three copies of the sequence 
on one homolog are present (see green strands at the bottom of Figure 5-64); 
these will produce sites of gene conversion once the Holliday junction is resolved. 
In addition, if the two strands that make up a heteroduplex region do not have 
identical nucleotide sequences, mismatched pairs will result. These can be 
repaired by the cell's mismatch repair system, described earlier (see Figure 5-20). 
When used during recombination, however, the mismatch repair system makes 
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Figure 5-66 Gene conversion caused by mismatch correction. In this 
process, heteroduplex DNA is formed at the sites of homologous 
recombination between maternal and paternal chromosomes. If the 
maternal and paternal DNA sequences are slightly different, the 
heteroduplex region will include some mismatched base pairs, which may 
then be corrected by the DNA mismatch repair machinery (see Figure 5-20). 
Such repair can "erase" nucleotides on either the paternal or the maternal 
strand. The consequence of this mismatch repair is gene conversion, 
detected as a deviation from the segregation of equal copies of maternal 
and paternal alleles that normally occurs in meiosis. 

no distinction between the paternal and maternal strand and will randomly 
choose which strand to repair. As a consequence of this repair, one allele will be 
"lost" and the other duplicated (Figure 5-66), resulting in net "conversion" of one 
allele to the other. Thus, gene conversion, originally regarded as a mysterious 
deviation from the rules of genetics, can be seen as a straightforward conse- 
quence of the mechanisms of homologous recombination and DNA repair. 
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Mismatch Proofreading Prevents Promiscuous Recombination 
Between Two Poorly Matched DNA Sequences 

We have seen that homologous recombination relies on the pairing of comple- 
mentary (or nearly complementary) DNA strands that initially come from sepa- 
rate DNA duplexes. But what controls how precise the matching must be? This is 
particularly crucial for recombination events that lead to crossovers. For exam- 
ple, the human genome contains many sets of closely related DNA sequences, 
and if crossing over were permitted between all of them, it would create havoc 
in the cell. 

Although we do not completely understand how cells prevent inappropriate 
crossovers, we do know that components of the same mismatch proofreading 
system that removes replication errors (see Figure 5-20) and is responsible for 
some types of gene conversion (see Figure 5-66) have the additional role of 
interrupting genetic recombination between poorly matched DNA sequences. It 
is thought that the mismatch proofreading system normally recognizes the mis- 
paired bases in an initial strand exchange, and— if there are significant mis- 
matches—it prevents the subsequent steps (particularly branch migration) 
required to form a crossover. This type of recombinational proofreading is 
thought to prevent promiscuous recombination events that would otherwise 
scramble the human genome (Figure 5-67). Although controversial, it has also 
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Figure 5-67 The mechanism that 
prevents general recombination from 
destabilizing a genome that contains 
repeated sequences. Components of the 
mismatch proofreading system, 
diagrammed in Figure 5-20, have the 
additional role of recognizing 
mismatches and preventing 
inappropriate recombination. If allowed 
to proceed, such recombination would 
produce deletions (left) or gene 
conversions (right) where information 
from one of the original repeated 
sequences has been lost. 
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been proposed that recombinational proofreading helps to preserve speciation, 
particularly among bacteria, by blocking genetic exchange between closely 
related species. For example, the genomes of E. coli and Salmonella 
typhimurium are 80% identical in nucleotide sequences, yet this proofreading 
step blocks recombination between their genomes. 



Summary 

Homologous recombination (also called general recombination) results in the transfer 
of genetic information between two DNA duplex segments of similar nucleotide 
sequence. This process is essential for the error-free repair of chromosome damage in all 
cells, and it is also responsible for the crossing-over of chromosomes that occurs during 
meiosis. The recombination event is guided by a specialized set of proteins. Although it 
can occur anywhere on a DNA molecule, it always requires extensive base-pairing inter- 
actions between complementary strands of the two interacting DNA duplexes. 

In meiosis, homologous recombination is initiated by double-strand breaks that 
are intentionally produced along each chromosome. These breaks are processed into 
single-stranded 3' ends which, in a reaction catalyzed by RecA- family proteins, invade 
a homologous partner DNA duplex. Branch migration accompanied by limited DNA 
synthesis then leads to the formation of four- stranded structures known as Holliday 
junctions. Each recombination reaction ends when DNA cutting resolves these recom- 
bination intermediates. The result can either be two chromosomes that have crossed- 
over (that is, chromosomes in which the DNA on either side of the site of DNA pairing 
originates from two different homologs), or two non-crossover chromosomes. In the lat- 
ter case, the two chromosomes that result are identical to the original two homologs, 
except for relatively minor DNA sequence changes at the site of recombination. Unlike 
the situation in meiosis, the homologous recombination reactions that flawlessly repair 
double-strand breaks rarely produce crossover products. 



TRANSPOSITION AND CONSERVATIVE SITE- 
SPECIFIC RECOMBINATION 

ttfe have seen that, through homologous recombination, rearrangements occur 
between DNA segments thai can result in the exchange of DNA sequences 
between chromosomes. However, the order of genes on the interacting chromo- 
some typically remains the same following homologous recombination, inas- 
much as the recom bining sequences must be very similar for the process to 
occur, in this section, we describe two very different types of recombination— 
transposition (also called transpositional recombination) and conservative 
site-specific recombination— that do not require substantial regions of DNA 
homology. These two types of recombination events can alter gene order along 
a chromosome, and cause unusual types of mutations that add new information 
to genomes. 

Transposition and conservative site-specific recombination are largely dedi- 
cated to moving a wide variety of specialized segments of DNA, collectively 
termed mobile genetic elements, from one position in a genome to another. We 
Will see that mobile genetic elements can range in size from a few hundred to tens 
of thousands of nucleotide pairs, and each typically carries a unique set of genes. 
Often, one of these genes encodes a specialized enzyme thai catalyzes the move- 
ment of only that element, thereby making this type of recombination possible 

Virtually all cells contain mobile genetic elements (known informally as 
"jumping genes"). As explained in Chapter 4, over evolutionary timescales, they 
have had a profound effect on the shaping of modern genomes. For example, 
nearly half of the human genome can be traced to these elements (see Figure 
4-17). Over time, random mutation lias altered their nucleotide sequences, and. 
as a result, only a few of die many copies of these elements in our DNA are still 
active and capable of movement. The remainder are molecular fossils whose 
existence provides striking clues about our own evolutionary historv. 
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Mobile genetic elements are often considered to be molecular parasites 
(they are also termed "selfish DNA") that persist because cells cannot get rid of 
them; they certainly have come close to overrunning our own genome. However, 
mobile DNA elements can provide benefits to the cell. For example, the genes 
they carry are sometimes advantageous, as in the case of antibiotic resistance in 
bacterial cells discussed below. The movement of mobile genetic elements also 
produces many of the genetic variants upon which evolution depends, because, 
in addition to moving themselves, mobile genetic elements occasionally rear- 
range neighboring sequences of the host genome. Thus, spontaneous mutations 
observed in Drosophila, humans, and other organisms are often due to the 
movement of mobile genetic elements. While the vast majority of these muta- 
tions will be deleterious to the organism, some will result in increased fitness 
and tend to spread throughout the population. It is almost certain that much of 
the variety of life we see around us originally arose from the movement of 
mobile genetic elements. 

In this section, we introduce mobile genetic elements and describe the 
mechanisms that enable them to move around a genome. We shall see that some 
of these elements move through transposition mechanisms and others through 
conservative site-specific recombination. We begin with transposition, as there 
are many more examples of this type of movement known. 

Through Transposition, Mobile Genetic Elements Can Insert Into 
Any DNA Sequence 

Mobile elements that move by way of transposition are called transposons, or 
transposable elements. In transposition, a specific enzyme, usually encoded by 
the transposon itself and typically called a transposase, acts on a specific DNA 
sequence at each end of the transposon, causing it to insert into a new target 
DNA site. Most transposons are only modestly selective in choosing their target 
site, and they can therefore insert themselves into many different locations in 
the genome. In particular, there is no general requirement for homology 
between the ends of the element and the target sequence. Most transposons 
move only rarely. In bacteria, where it is possible to measure the frequency accu- 
rately, transposons typically move once every 10 5 cell divisions. In most cases, 
transposition appears to be a rare stochastic process, albeit one often linked to 
the passage of a replication fork. 

On the basis of their structure and transposition mechanism, transposons 
can be grouped into three large classes: DNA-only transposons, retroviral-like 
retrotransposons, and nonretroviral retrotransposons. Each class will be discussed 
in detail below. For reference purposes, the differences between them are briefly 
outlined in Table 5-3. 

DNA-Only Transposons Move by Both Cut-and-Paste and 
Replicative Mechanisms 

DNA-only transposons predominate in bacteria, and they are largely responsible 
for the spread of antibiotic resistance in bacterial strains. When antibiotics like 
penicillin and streptomycin first became widely available in the 1950s, most bac- 
teria that caused human disease were susceptible to them. Fifty years later, the 
situation has changed dramatically— antibiotics such as penicillin (and its mod- 
ern derivatives) are no longer effective against many modern bacterial strains, 
including those causing gonorrhea and bacterial pneumonia. The spread of 
antibiotic resistance is due largely to genes that encode antibiotic-inactivating 
enzymes that are carried on transposons (Figure 5-68). Although these mobile 
elements can transpose only within cells that already carry them, they can be 
moved from one cell to another through other mechanisms known collectively as 
horizontal gene transfer (Figure 1-23). Once introduced into a new cell, a trans- 
poson can insert itself into the genome and be faithfully passed on to all its 
progeny cells through the normal processes of DNA replication and cell division. 
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Table 5-3 Three Major Classes of Transposable Elements 

CLASS DESCRIPTION AND 
STRUCTURE 

DNA-only transposons 



SPECIALIZED ENZYMES 
REQUIRED FOR MOVEME 



ENT 



IODE OF MOVEMENT 



J.J 



short inverted repeats at 
each end 



troviral-fike retrotransposons 



transposase 



directly repeated long terminal 
repeats (LTRs) at each end 



Nonretroviral retrotransposons 



reverse transcriptase and 
integrase 



Poly A at 3' end of RNA 
transcript; 5' end is often 
truncated 



reverse transcriptase and 
endonuclease 



moves as DNA, either by 
cut-and-paste or 
replicative pathways 



moves via an RNA 
intermediate produced 
by a promoter in the LTR 



moves via an RNA 
intermediate that is 
often produced 
from a neighboring 
promoter 



EXAMPLES 




P element (Drosophila) 
Ac-Ds (maize) 
Tn3 and Tn10(£ co//) 
Tam3 (snapdragon) 



Copia (Drosophila) 
Tyl (yeast) 
THE! (human) 
Bsl (maize) 



F element (Drosophila) 
LI (human) 
Cin4 (maize) 



These elements range in length from 1 000 to about 1 2,000 nucleotide pairs Each family contains many members, only a few of which are listed 
here, in addition to transposable elements, some viruses can move in and out of host cell chromosomes by transpositional mechanisms. These 
viruses are related to the first two classes of transposons 



DNA-only transposons, so named because they exist only as DNA during 
their movement, can relocate from a donor site to a target site either by cut-and- 
paste transposition or by replicative transposition. Because it is conceptually sim- 
pler, we discuss the cut-and-paste mechanism first. The process begins when 
each of the special short DNA sequences that mark the two ends of the element 
binds a molecule of transposase. The two transposase molecules come together 
to form a multimeric "transpososome" that produces a DNA loop juxtaposing the 
two ends of the element (Figure 5-69). The transposase then introduces cuts at 
the base of the loop and removes the element completely from its original chro- 
mosome, forming the central intermediate in the transposition process (Figure 
5-70). To complete the DNA movement, the transposase catalyzes a direct attack 
of the element's two DNA ends on a target DNA molecule, breaking two phos- 
phodiester bonds in the target molecule and creating two new ones as it joins the 
element and target DNAs together. Because this DNA joining reaction begins and 
ends with the same number of phosphodiester bonds, it can occur without the 
input of additional energy. We will see in the next chapter that this same type of 
phosphodiester bond rearrangement (called transesterification) underlies 
another fundamental process in molecular biology, RNA splicing. 

Because the breaks made in the two target DNA strands are staggered (red 
arrowheads in Figure 5-69), the product DNA molecule initially contains two 
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Figure 5-68 Three of the many 
DNA-only transposons found in 
bacteria. Each of these mobile DNA 
elements contains a gene that encodes a 
transposase, an enzyme that conducts at 
least some of the DNA breakage and 
joining reactions needed for the element 
to move. Each transposon also carries 
short DNA sequences (indicated in red) 
that are recognized only by the 
transposase encoded by that element 
and are necessary for movement of the 
element. In addition, two of the three 
mobile elements shown carry genes that 
encode enzymes that inactivate the 
antibiotics ampicillin (AmpR) and 
tetracycline (TetR). The transposable 
element TnIO, shown in the bottom 
diagram, is thought to have evolved from 
the chance landing of two much shorter 
mobile elements on either side of a 
tetracycline-resistance gene: the wide 
use of tetracycline as an antibiotic has 
selected for the spread of this transposon 
through bacterial populations. 
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short, single-stranded gaps, one at each end of the inserted transposon. A host- 
cell DNA polymerase and DNA ligase fill in and seal these gaps to complete the 
recombination process. This produces a short duplication of the target DNA 
sequence at the insertion site; these flanking direct repeat sequences, whose 
length is different for different transposons, serve as convenient records of prior 
transposition events. 

When a cut-and-paste DNA-only transposon is excised from its original 
location, it leaves behind a "hole" in the chromosome. This lesion can be per- 
fectly healed by recombinational double-strand break repair (see Figure 5-59), 
provided that the chromosome has just been replicated and an identical copy 
of the damaged host sequence is available. In this case, the repair process will 
restore the transposon to its original position. Alternatively in diploid organ- 
isms, the damaged chromosome can be recombinationally repaired using the 
chromosomal homolog, in which case the transposon will not be restored but a 
loss of heterozygosity could occur at the site of repair. As the third possibility, a 
nonhomologous end-joining reaction can reseal the break; in this case, the 
DNA sequence that originally flanked the transposon is altered, producing a 
mutation at the chromosomal site from which the transposon was excised (see 
Figure 5-52). 

Remarkably, the same mechanism used to excise cut-and-paste transposons 
from DNA has been found to operate in developing immune systems of verte- 
brates, catalyzing the DNA rearrangments that produce antibody and T cell 
receptor diversity. Known as V(D)J recombination, this process will be discussed 
in Chapter 25. Found only in vertebrates, V(D)J recombination is a relatively 
recent evolutionary novelty but it is believed to have evolved from the much 
more ancient cut-and-paste transposons. 

Some DNA-only transposons move by a mechanism called replicative trans- 
position. In this case, the transposon DNA is replicated during transposition: 
one copy remains at the original site while the other is inserted at a new chro- 
mosomal location. Although the mechanism used is more complex, it is closely 
related to the cut-and-paste mechanism just described; indeed, some trans- 
posons can move by both pathways. 



Some Viruses Use a Transposition Mechanism to Move 
Themselves into Host Cell Chromosomes 

Certain viruses are considered mobile genetic elements because they use trans- 
position mechanisms to integrate their genomes into that of their host cell. How- 
ever, unlike transposons, these viruses encode proteins that package their 
genetic information into virus particles that can infect other cells. Many of the 
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Figure 5-69 Cut-and-paste 
transposition. DNA-only transposons can 

be recognized in chromosomes by the 
"inverted repeat DNA sequences" (red) 
present at their ends. These sequences, 
which can be as short as 20 nucleotides, 
are all that is necessary for the DNA 
between them to be transposed by the 
particular transposase enzyme associated 
with the element. The cut-and-paste 
movement of a DNA-only transposable 
element from one chromosomal site to 
another begins when the transposase 
brings the two inverted DNA sequences 
together, forming a DNA loop. Insertion 
into the target chromosome, catalyzed by 
the transposase, occurs at a random site 
through the creation of staggered breaks 
in the target chromosome {red 
arrowheads). Following the transposition 
reaction, the single-strand gaps created 
by the staggered break are repaired by 
DNA polymerase and ligase {purple). As a 
result, the insertion site is marked by a 
short direct repeat of the target DNA 
sequence, as shown. Although the break 
in the donor chromosome (green) is 
repaired, this process often alters the 
DNA sequence, causing a mutation at the 
original site of the excised transposable 
element (not shown). 
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Figure 5-70 The structure of the central 
intermediate formed by a cut-and-paste 
transposase. (A) Schematic view of the 
overall structure. (B)The detailed 
structure of a transposase holding the 
two DNA ends, whose 3'-OH groups are 
poised to attack a target chromosome. 
One domain of the transposase 
recognizes the DNA sequence at the 
transposon end while a different domain 
carries out the DNA breakage-joining 
chemistry (B, from D.R. Davies et al., 
Science 289:77-85, 2000. With permission 
from AAAS.) 
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viruses that insert themselves inro a host chromosome do so hy employing one of 
the top two mechanisms listed in Table 5-3, Indeed, much of our knowledge of 
these mechanisms has come from studies of particular viruses diat employ them. 

A virus that infects a bacterium is known as a bacteriophage, The bacteria- 
phagr Mu not only uses DNA-based transposition to integrate its genome into its 
host cell chromosome, it also uses replieative transposition to replicate its 
genome. Transposition also has a key role in the life cycle of many other viruses. 
Most notable are the retroviruses, which include the human AIDS virus, HIV. 
Outside the cell, a retrovirus exists as a single-stranded RNA genome packed into 
a protein capsid along with a virus-encoded reverse transcriptase enzyme. Dur- 
ing the infection process, the viral RNA enters a cell and is converted to a double- 
stranded DNA molecule by the action of this crucial enzyme, which is able to 
polymerize DNA on either an RNA or a DNA template (Figure 5-71 and Figure 
5-72). The term retrovirus refers to the virus's ability to reverse the usual flow of 
genetic information, which normally is from DNA to RNA (see Figure 1-5). 

Once the reverse transcriptase has produced a double-stranded DNA 
molecule, specific sequences near its two ends are held together by a virus- 
encoded transposase called integrase. Integrase creates activated 3'-OH viral 
DNA ends that can directly attack a target DNA molecule through a mechanism 
similar to that used by the cut-and-paste DNA-only transposons (Figure 5-73). 
In fact, detailed analyses of the three-dimensional structures of bacterial trans- 
posases and HIV integrase have revealed remarkable similarities in these 
enzymes, even though their amino acid sequences have diverged considerably. 



Retroviral-like Retrotransposons Resemble Retroviruses, but Lack 
a Protein Coat 

A large family of transposons called retfoviraj-!ikeretrotransposon§ (see Table 
5-3) move themselves in and out of chromosomes by a mechanism thar is iden- 
tical to that used by retroviruses. These elements are present in organisms as 
diverse as yeasts, files, and mammals; unlike viruses, they have no intrinsic abil- 
ity to leave their resident cell but are passed along to all descendants of that cell 
through the normal process of DNA replication and cell division. The first step 
in their transposition is the transcription of the entire transposon, producing an 
RNA copy of the element that is typically several thousand nucleotides long. This 
transcript, which is translated as a messenger RNA by the host cell, encodes a 
reverse transcriptase enzyme. This enzyme makes a double-stranded DNA copy 
of the RNA molecule via an RNA/ DNA hybrid intermediate, precisely mirroring 
the early stages of infection by a retrovirus (see Figure 5-71). Like retroviruses, 
the linear double-stranded DNA molecule then integrates into a site on the 
chromosome by using an integrase enzyme that is also encoded bv the element 
(see Figure 5-73). 
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Figure 5-71 The life cycle of a retrovirus. The retrovirus genome consists of an RNA molecule of about 8500 nucleotides; 
two such molecules are typically packaged into each viral particle. The enzyme reverse transcriptase first makes a DNA 
copy of the viral RNA molecule and then a second DNA strand, generating a double-stranded DNA copy of the RNA 
genome. The integration of this DNA double helix into the host chromosome is then catalyzed by a virus-encoded 
integrase enzyme (see Figure 5-73). This integration is required for the synthesis of new viral RNA molecules by the host 
cell RNA polymerase, the enzyme that transcribes DNA into RNA (discussed in Chapter 6). 

A Large Fraction of the Human Genome Is Composed of 
Nonretroviral Retrotransposons 

A significant fraction of many vertebrate chromosomes is made up of repeated 
DNA sequences. In human chromosomes, these repeats are mostly mutated and 
truncated versions of nonretroviral retrotransposons, the third major type of 
transposon (see Table 5-3). Although most of these transposons are immobile, a 
few retain the ability to move. Relatively recent movements of the LI element 
(sometimes referred to as a LINE or long interspersed nuclear element) have 
been identified, some of which result in human disease; for example, a particu- 
lar type of hemophilia results from an LI insertion into the gene encoding the 
blood clotting protein Factor VIII (see Figure 6-25). 
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Figure 5-72 Reverse transcriptase. 

(A) The three-dimensional structure of 
the enzyme from HIV (the human AIDS 
virus) determined by X-ray 
crystallography. (B) A model showing the 
enzyme's activity on an RNA template. 
Note that the polymerase domain (yellow 
in B) has a covalently attached RNAse H 
(H for "hybrid") domain (red) that 
degrades the RNA strand in an RNA/DNA 
helix. This activity helps the polymerase 
to convert the initial hybrid helix into a 
DNA double helix (A, courtesy of Tom 
Steitz; B, adapted from L.A. Kohlstaedt et 
al., Science 256:1 783-1 790, 1 992. With 
permission from AAAS.) 
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Nonretroviral retrotransposons are found in many organisms and move via a 
distinct mechanism that requires a complex of an endonuclease and a reverse 
transcriptase. As illustrated in Figure 5-74, the RNA and reverse transcriptase 
have a much more direct role in the recombination event than they do in the 
retroviral-like retrotransposons described above. 

Inspection of the human genome sequence has revealed that the bulk of 
nonretroviral retrotransposons— for example, the many copies of the Alu ele- 
ment, a member of the SINE (short interspersed nuclear element) family— do 
not carry their own endonuclease or reverse transcriptase genes. Nonetheless 
they have successfully amplified themselves to become major constituents of 
our genome, presumably by pirating enzymes encoded by other transposons. 

The LI and Alu elements seem to have multiplied in the human genome rel- 
atively recently (Figure 5-75). Thus, for example, the mouse contains sequences 
closely related to LJ and Alu, but their placement in mouse chromosomes dif- 
fers from that in human chromosomes (see Figure 4-80). Togei her the LINHs and 
SINEs make up about 40% of the human genome (see Figure 4-17). 

Different Transposable Elements Predominate in Different 
Organisms 

We have described several types of transposable elements: (1) DNA-only trans- 
posons, the movement pf which is based on DNA breaking and joining reac- 
tions; (2) retroviral-] ike retrotransposons, which also move via DNA breakage 
and joining, but where RNA has a key role as a template to generate the DNA 
recombination substrate; and (3) nonretroviral retrotransposons, in which an 
RNA copy of the element is central lo the incorporation of the element into the 
target DNA, acting as a direct template for a DNA target-primed reverse tran- 
scription event. 

Intriguingly, different types of transposons predominate in different organ- 
isms. For example, the vast majority of bacterial transposons are DNA-only 
types, with a few related to the nonretroviral retrotransposons also present. In 
yeasts, the main mobile elements that have been observed are retroviral-like 
retrotransposons. In Drosophila, DNA-based, retroviral, and nonretroviral 

Figure 5-74 Transposition by a nonretroviral retrotransposon. 
Transposition by the L 1 element (red) begins when an endonuclease 
attached to the L 1 reverse transcriptase {green} and the U RNA (blue) nicks 
the target DNA at the point at which insertion will occur. This cleavage 
releases a 3'OH DNA end in the target DNA, which is then used as a primer 
for the reverse transcription step shown. This generates a single-stranded 
DNA copy of the element that is directly linked to the target DNA. In 
subsequent reactions, further processing of the single-stranded DNA copy 
results in the generation of a new double-stranded DNA copy of the 
L I element that is inserted at the site of the initial nick 



Figure 5-73 Transposition by either a 
retrovirus (such as HIV) or a retroviral- 
like retrotransposon. The process begins 
with a double-stranded DNA molecule 
(orange) produced by reverse 
transcriptase (see Figure 5-71). In an 
initial step, the integrase enzyme forms a 
DNA loop and cuts one strand at each 
end of the viral DNA sequence, exposing 
new 3'-OH groups. Each of these 3'-OH 
ends then directly attack a 
phosphodiester bond on opposite 
strands of a randomly selected site on a 
target chromosome (red arrowheads on 
blue DNA). This inserts the viral DNA 
sequence into the target chromosome, 
leaving short gaps on each side that are 
filled in by DNA repair processes. Because 
of the gap filling, this mechanism (like 
that of cut-and-paste transposition) 
leaves short repeats of target DNA 
sequence (black) on each side of the 
integrated DNA segment. 
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transposons are all found. Finally, the human genome contains all three types of 
transposon, but as discussed below, their evolutionary histories are strikingly 
different. 



Genome Sequences Reveal the Approximate Times that 
Transposable Elements Have Moved 

The nucleotide sequence of the human genome provides a rich "fossil record" of 
the activity of transposons over evolutionary time spans. By carefully comparing 
the nucleotide sequences of the approximately 3 million transposable element 
remnants in the human genome, it has been possible to broadly reconstruct the 
movements of transposons in our ancestors' genomes over the past several hun- 
dred million years. For example, the DNA-only transposons appear to have been 
very active well before the divergence of humans and Old World monkeys (25-35 
million years ago); but, because they gradually accumulated inactivating muta- 
tions, they have been dormant in the human lineage since that time. Likewise, 
although our genome is littered with relics of retroviral -like transposons, none 
appear to be active today. Only a single family of retroviral- like retro transposons 
is believed to have transposed in the human genome since the divergence of 
human and chimpanzee approximately 6 million years ago. The nonretroviral 
retrotransposons are also ancient, but in contrast to other types, some are still 
moving in our genome, as mentioned previously. For example, it is estimated 
that de novo movement of an Alu element is seen once every 100-200 human 
births. The movement of nonretroviral retrotransposons is responsible for a 
small but significant fraction of new human mutations — perhaps two mutations 
out of every thousand. 

The situation in mice is significantly different. Although the mouse and 
human genomes contain roughly the same density of the three types of trans- 
posons, both types of retrotransposons are still actively transposing in the 
mouse genome, being responsible for approximately 10% of new mutations. 

Although we are only beginning to understand how the movements of trans- 
posons has shaped the genomes of present-day mammals, it has been proposed 
that bursts in transposition activity could have been responsible for critical spe- 
ciation events during the radiation of the mammalian lineages from a common 
ancestor, a process that began approximately 170 million years ago. At this 
point, we can only wonder how many of our uniquely human qualities arose 
from the past activity of the many mobile genetic elements whose remnants are 
found today scattered throughout our chromosomes. 
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Figure 5-75 The expansion of the 
abundant Alu and B1 sequences found 
in the human and mouse genomes, 
respectively. Both of these transposable 
DNA sequences are thought to have 
evolved from the 7SL RNA gene, which 
encodes the SRP RNA (see Figure 1 2-39). 
On the basis of their positions in the two 
genomes and the sequence similarity of 
these highly repeated elements, the 
major expansions in copy number seem 
to have occurred independently in mice 
and humans (see Figure 4-80). (Adapted 
from P.L Deininger and G.R. Daniels, 
Trends Genet. 2:76-80, 1986, with 
permission from Elsevier, and 
International Human Genome 
Sequencing Consortium, Nature 
409:860-921, 2001, with permission from 
Macmillan Publishers Ltd.) 



Conservative Site-specific Recombination Can Reversibly 
Rearrange DNA 

A different kind recombination mechanism, known as conservative site-specific 
recombination, mediates the rearrangements of other types of mobile DNA ele- 
ments. In this pathway, breakage and joining occur at two special sites, one on 
each participating DNA molecule. Depending on the positions and relative ori- 
entations of the two recombination sites, DNA integration, DNA excision, or DNA 
inversion can occur (Figure 5-76). Conservative site-specific recombination is 
carried out by specialized enzymes that break and rejoin two DNA double 
helices at specific sequences on each DNA molecule. The same enzyme system 
that joins two DNA molecules can often take them apart again, precisely restor- 
ing the sequence of the two original DNA molecules (see Figure 5-76A). 

Several features distinguish conservative site-specific recombination from 
transposition. First, conservative site-specific recombination requires special- 
ized DNA sequences on both the donor and recipient DNA (hence the term site- 
specific). These sequences contain recognition sites for the particular recombi- 
nase that will catalyze the rearrangement. In contrast, transposition requires 
only that the transposon have a specialized sequence; for most transposons, the 
recipient DNA can be of any sequence. Second, the reaction mechanisms are 
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fundamentally different- The recombinases that catalyze conservative site-spe- 
cific recombination resemble iopoisomerases in the sense that they form tran- 
sient high-energy covalent bonds with the DNA and use this energy to complete 
the DNA rearrangements. Thus all the phosphate bonds that are broken during a 
recombination event are restored upon its completion (hence the term conser- 
vative). Transposition, in contrast, uses a transesterification reaction that does 
not proceed through a covalently joined protein-DNA intermediate. This process 
leaves gaps in the DNA that must be resealed by DNA polymerase and ligase, both 
of which require the input of energy in the form nucleotide hydrolysis. 



Figure 5-76 Two types of DNA 
rearrangement produced by conservative 
site-specific recombination. The only 
difference between the reactions in (A) and 
(B) is the relative orientation of the two 
short DNA sites (indicated by arrows) at 
which a site-specific recombination event 
occurs. (A) Through an integration reaction, 
a circular DNA molecule can become 
incorporated into a second DNA molecule; 
by the reverse reaction (excision), it can 
exit to reform the original DNA circle. Many 
bacterial viruses move in and out of their 
host chromosomes in this way (see Figure 
5-77). (B) Conservative site-specific 
recombination can also invert a specific 
segment of DNA in a chromosome. A well- 
studied example of DNA inversion through 
site-specific recombination occurs in the 
bacterium Salmonella typhimurium, an 
organism that is a major cause of food 
poisoning in humans; the inversion of a 
DNA segment changes the type of 
flagellum that is produced by the 
bacterium (see Figure 7-64). 



Conservative Site-Specific Recombination Was Discovered in 
Bacteriophage X 

A bacterial virus, bacteriophage lambda, was the first mobile DNA element of 
any type to be understood in biochemical detail. When this virus enters a cell, it 
directs the synthesis of a virus-encoded recombinase enzyme called lambda 
integrase. This enzyme mediates the covalent joining of the viraJ DNA to the bac- 
terial chromosome, causing the virus to become part of this chromosome so 
that it is replicated automatically— as part of the host's DNA. The recombination 
process begins when several molecules of the integrase protein bind tightly to a 
specific DNA sequence on the circular bacteriophage chromosome, along with 
several host proteins. This DNA-protein complex can now bind to an attach- 
ment-site DNA sequence on the bacterial chromosome, bringing the bacterial 
and bacteriophage chromosomes together. The integrase then catalyzes the 
required cutting and resealing reactions that result in recombination. Because of 
a short region of sequence homology in the two joined sequences, a tiny het- 
eroduplex joint is formed at this point of exchange (Figure 5-77). 

The same type of site-specific recombination mechanism enables bacterio- 
phage lambda DNA to exit from its integration site in the E. coli chromosome in 
response to specific signals and multiply rapidly within the bacterial cell (Figure 
5-78). Excision is catalyzed by a complex of integrase enzyme and host factors 
with a second bacteriophage protein, excisionase, which is produced by the 
virus only when its host cell is stressed— in which case, it is in the bacterio- 
phage's interest to abandon the host cell and multiply again as a virus particle. 



Conservative Site-Specific Recombination Can Be Used to Turn 
Genes On or Off 

When the special sites recognized by a conservative site-specific recombination 
enzyme are inverted in their orientation, the DNA sequence between them is 
inverted rather than excised (see Figure 5-76). Many bacteria use such an inver- 
sion of a DNA sequence to control the gene expression of particular genes— for 
example, by assembling an active gene from separated coding segments, This 
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Figure 5-77 The insertion of a circular 
bacteriophage lambda DNA 
chromosome into the bacterial 
chromosome. In this example of 
conservative site-specific recombination, 
the lambda integrase enzyme binds to a 
specific attachment-site DNA sequence 
on each chromosome, where it makes 
cuts that bracket a short homologous 
DNA sequence. The integrase then 
switches the partner strands and reseals 
them to form a heteroduplex joint that is 
seven nucleotide pairs long. A total of 
four strand-breaking and strand-joining 
reactions are required; for each of them, 
the energy of the cleaved 
phosphodiester bond is stored in a 
transient covalent linkage between the 
DNA and the enzyme, so that DNA strand 
resealing occurs without a requirement 
for ATP or DNA ligase. 
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Figure 5-78 The life cycle of 
bacteriophage lambda. The double- 
stranded DNA lambda genome contains 
50,000 nucleotide pairs and encodes 50-60 
different proteins. When the lambda DNA 
enters the cell, its two ends join to form a 
circular DNA molecule. This bacteriophage 
can multiply in £ coli by a lytic pathway, 
which destroys the cell, or it can enter a 
latent prophage state. Damage to a cell 
carrying a lambda prophage induces the 
prophage to exit from the host 
chromosome and shift to lytic growth (red 
arrows). Both the entrance of the lambda 
DNA into, and its exit from, the bacterial 
chromosome are accomplished by 
conservative site-specific recombination 
catalyzed by the lambda integrase enzyme 
{see Figure 5-77). 
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Figure 5-79 How a conservative site- 
specific recombination enzyme from 
bacteria can be used to delete specific 
genes from particular mouse tissues. 

This approach requires the insertion of 
two specially engineered DNA molecules 
into the animal's germ line. The first 
contains the gene for a recombinase (in 
this case the Cre recombinase from the 
bacteriophage P1) under the control of a 
tissue-specific promoter, which ensures 
that the recombinase is expressed only in 
that tissue. The second DNA molecule 
contains the gene of interest flanked by 
recognition sites (in this case loxP sites) 
for the recombinase. The mouse is 
engineered so that this is the only copy 
of this gene. Therefore, if the 
recombinase is expressed only in the 
liver, the gene of interest will be deleted 
there, and only there. 

As described in Chapter 7, many tissue- 
specific promoters are known; moreover, 
many of these promoters are active only 
at specific times in development. Thus, it 
is possible to study the effect of deleting 
specific genes at many different times 
during the development of each tissue. 



type of gene control has the advantage of being directly inheritable, since the 
new DNA arrangement is transferred to daughter chromosomes automatically 
when a cell divides. We will encounter a specific example of this use of conser- 
vative site-specific recombination in Chapter 7 (see Figure 7-64). 

Bacterial conservative site- specific recombinases have also become power- 
ful tools for cell and developmental biologists. To decipher the roles of specific 
genes and proteins in complex multicellular organisms, genetic engineering 
techniques are used to produce mice carrying a gene encoding a site-specific 
recombination enzyme plus a carefully designed target DNA with the DNA sites 
that this enzyme recognizes. At an appropriate time, the gene encoding the 
enzyme can be activated to rearrange the target DNA sequence. Such a rear- 
rangement is widely used to delete a specific gene in a particular tissue of the 
mouse (Figure 5-79). h is particularly useful when the gene of interest plays a 
key role in the early development of many tissues, and a complete deletion of 
the gene from the germ line would cause death very early in embryogenesis. The 
same strategy can also be used to inappropriately express any specific gene in a 
tissue of interest; here, the triggered deletion joins a strong transcriptional pro- 
moter to the gene of interest. With this tool one can in principle determine the 
influence of any protein in any desired tissue of an intact animal. 



Summary 

The genomes of nearly all organisms contain mobile genetic elements that can move 
from one position in the genome to another by either transpositional or conservative 
site-specific recombination processes. In most cases, this movement is random and 
happens at a very low frequency Mobile genetic elements include transposons, which 
move within a single cell (and its descendants), plus those viruses whose genomes can 
integrate into the genomes of their host cells. 

There are three classes of transposons: the DNA-only transposons, the retroviral- 
like retro? ransposons, and the non retroviral retrotmmposons. All but the last have 
close relative!; among the viruses. Although viruses and tmnsposahte elements can be 
viewed as parasites, many of the new arrangements of DNA sequences that their site- 
specific recombination events produce have created the genetic variation crucial for 
the evolution of cells and organisms. 
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PROBLEMS 

Which statements are true? Explain why or why not. 

5-1 No two cells in your body have the identical 
nucleotide sequence. 

5-2 In E. coli, where the replication fork travels at 500 
nucleotide pairs per second, the DNA ahead of the fork must 
rotate at nearly 3000 revolutions per minute. 

5-3 When bidirectional replication forks from adjacent 
origins meet, a leading strand always runs into a lagging 
strand. 

5-4 DNA repair mechanisms all depend on the existence 
of two copies of the genetic information, one in each of the 
two homologous chromosomes. 

Discuss the following problems. 

5-5 To determine the reproducibility of mutation fre- 
quency measurements, you do the following experiment. You 
inoculate each of 10 cultures with a single E. coli bacterium, 
allow the cultures to grow until each contains 10 6 cells, and 
then measure the number of cells in each culture that carry a 
mutation in your gene of interest. You were so surprised by 
the initial results that you repeated the experiment to confirm 
them. Both sets of results display the same extreme variabil- 
ity, as shown in Table Q5-1. Assuming that the rate of muta- 
tion is constant, why do you suppose there is so much varia- 
tion in the frequencies of mutant cells in different cultures? 

Table Q5-1 Frequencies of mutant cells in multiple cultures 
(Problem 5-5). 
■■ ■■■ 
EXPERIMENT 1 2 

1 4 

2 128 



CULTURE (mutant cells/10 6 cells] 



3 

257 1 
1 4 
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5-6 DNA repair enzymes preferentially repair mis- 
matched bases on the newly synthesized DNA strand, using 
the old DNA strand as a template. If mismatches were 
repaired instead without regard for which strand served as 
template, would mismatch repair reduce replication errors? 
Would such an indiscriminate mismatch repair result in 
fewer mutations, more mutations, or the same number of 
mutations as there would have been without any repair at 
all? Explain your answers. 

5-7 If DNA polymerase requires a perfectly paired primer 
in order to add the next nucleotide, how is it that any mis- 
matched nucleotides "escape" the polymerase and become 
substrates for mismatch repair enzymes? 

5-8 The laboratory you joined is studying the life cycle of 
an animal virus that uses a circular, double -stranded DNA as 
its genome. Your project is to define the location of the ori- 
gin (s) of replication and to determine whether replication 
proceeds in one or both directions away from an origin (uni- 
directional or bidirectional replication). To accomplish your 
goal, you isolated replicating molecules, cleaved them with 
a restriction nuclease that cuts the viral genome at one site 
to produce a linear molecule from the circle, and examined 
the resulting molecules in the electron microscope. Some of 



the molecules you observed are illustrated schematically in 
Figure Q5-1. (Note that it is impossible to distinguish the 
orientation of one DNA molecule from another in the elec- 
tron microscope.) 

You must present your conclusions to the rest of the lab 
tomorrow How will you answer the two questions your 
advisor posed for you? Is there a single, unique origin of 
replication or several origins? Is replication unidirectional or 
bidirectional? 



original molecule 



bubbles 



Figure Q5-1 Parental and 
replicating forms of an animal 
virus (Problem 5-8). 
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5-9 If you compare the frequency of the sixteen possible 
dinucleotide sequences in E. coli and human cells, there are 
no striking differences except for one dinucleotide, 5'-CG-3'. 
The frequency of CG dinucleotides in the human genome is 
significantly lower than in K coli and significantly lower 
than expected by chance. Why do you suppose that CG din- 
ucleotides are underrepresented in the human genome? 

5-10 With age, somatic cells are thought to accumulate 
genomic "scars" as a result of the inaccurate repair of dou- 
ble-strand breaks by nonhomologous end-joining (NHEJ). 
Estimates based on the frequency of breaks in primary 
human fibroblasts suggest that by age 70 each human 
somatic cell may carry some 2000 NHEJ-induced mutations 
due to inaccurate repair. If these mutations were distributed 
randomly around the genome, how many genes would you 
expect to be affected? Would you expect cell function to be 
compromised? Why or why not? (Assume that 2% of the 
genome— 1.5% coding and 0.5% regulatory— is crucial 
information.) 

5-1 1 Draw the structure of the double Holliday junction 
that would result from strand invasion by both ends of the 
broken duplex into the intact homologous duplex shown in 
Figure Q5-2. Label the left end of each strand in the Holli- 
day junction 5' or 3' so that the relationship to the parental 
and recombinant duplexes is clear. Indicate how DNA syn- 
thesis would be used to fill in any single-strand gaps in your 
double Holliday junction. 

Figure Q5-2 A broken duplex with 
single-strand tails ready to invade 
an intact homologous duplex 
(Problem 5-11). 
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5-12 Why is it that recombination between similar, but 
nonidentical, repeated sequences poses a problem for 
human cells? How does the mismatch- repair system protect 
against such recombination events? 
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Chapter 6 



How Cells Read the Genome: 
From DNA to Protein 




Only when the structure of DNA was discovered in the early 1950s did it become 
clear how the hereditary information in cells is encoded in DNA's sequence of 
nucleotides. The progress since then has been astounding. Within fifty years we 
knew the complete genome sequences for many organisms, including humans. 
We therefore know the maximum amount of information that is required to pro- 
duce a complex organism like ourselves. The limits on the hereditary informa- 
tion needed for life constrain the biochemical and structural features of cells 
and make it clear that biology is not infinitely complex. 

In this chapter, we explain how cells decode and use the information in their 
genomes. Much has been learned about how the genetic instructions written in 
an alphabet of just four "letters" — the four different nucleotides in DNA — direct 
the formation of a bacterium, a fruit fly, or a human. Nevertheless, we still have 
a great deal to discover about how the information stored in an organism's 
genome produces even the simplest unicellular bacterium with 500 genes, let 
alone how it directs the development of a human with approximately 25,000 
genes. An enormous amount of ignorance remains; many fascinating challenges 
therefore await the next generation of cell biologists. 

The problems that cells face in decoding genomes can be appreciated by 
considering a small portion of the genome of the fruit fly Drosophila 
melanogaster (Figure 6-1). Much of the DNA-encoded information present in 
this and other genomes specifies the linear order — the sequence — of amino 
acids for every protein the organism makes. As described in Chapter 3, the 
amino acid sequence in turn dictates how each protein folds to give a molecule 
with a distinctive shape and chemistry. When a cell makes a particular protein, 
it must decode accurately the corresponding region of the genome. Additional 
information encoded in the DNA of the genome specifies exactly when in the life 
of an organism and in which cell types each gene is to be expressed into protein. 
Since proteins are the main constituents of cells, the decoding of the genome 
determines not only the size, shape, biochemical properties, and behavior of 
cells, but also the distinctive features of each species on Earth. 

One might have predicted that the information present in genomes would 
be arranged in an orderly fashion, resembling a dictionary or a telephone direc- 
tory. Although the genomes of some bacteria seem fairly well organized, the 
genomes of most multicellular organisms, such as our Drosophila example, are 
surprisingly disorderly. Small bits of coding DNA (that is, DNA that codes for 
protein) are interspersed with large blocks of seemingly meaningless DNA. 
Some sections of the genome contain many genes and others lack genes alto- 
gether. Proteins that work closely with one another in the cell often have their 
genes located on different chromosomes, and adjacent genes typically encode 
proteins that have little to do with each other in the cell. Decoding genomes is 
therefore no simple matter. Even with the aid of powerful computers, it is still 
difficult for researchers to locate definitively the beginning and end of genes in 
the DNA sequences of complex genomes, much less to predict when each gene 
is expressed in the life of the organism. Although the DNA sequence of the 
human genome is known, it will probably take at least a decade to identify every 
gene and determine the precise amino acid sequence of the protein it produces. 
Yet the cells in our body do this thousands of times a second. 
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Figure 6-1 (opposite page) Schematic depiction of a portion of chromosome 2 from the genome of the fruit fly 
Drosophila melanogaster. This figure represents approximately 3% of the total Drosophila genome, arranged as six 
contiguous segments. As summarized in the key, the symbolic representations are: black vertical lines of various 
thicknesses: locations of transposable elements, with thicker bars indicating clusters of elements; colored boxes: genes 
(both known and predicted) coded on one strand of DNA (boxes above the midline) and genes coded on the other strand 
(boxes below the midline). The length of each gene box includes both its exons (protein-coding DNA) and its introns 
(noncoding DNA) (see Figure 4-15); its height is proportional to the number of known cDNAs that match the gene. (As 
described in Chapter 8, cDNAs are DNA copies of mRNA molecules, and large collections of the nucleotide sequences of 
cDNAs have been deposited in a variety of databases, the more matches, the higher the confidence that the predicted 
gene is transcribed into RNA and is thus a genuine gene.) The color of each gene box indicates whether a closely related 
gene is known to occur in other organisms. For example, MWY means the gene has close relatives in mammals, in the 
nematode worm Caenorhabditis elegans, and in the yeast Saccharomyces cerevisiae. MW indicates the gene has close 
relatives in mammals and the worm but not in yeast. The rainbow-colored bar indicates percent G-C base pairs; across 
many different genomes, this percentage shows a striking regional variation, whose origin and significance are uncertain. 
(From M.D. Adams et al., Science 287:2185-2195, 2000. With permission from AAAS.) 



The DNA in genomes does not direct protein synthesis itself, but instead 
uses RNA as an intermediary. When the cell needs a particular protein, the 
nucleotide sequence of the appropriate portion of the immensely long DNA 
molecule in a chromosome is first copied into RNA (a process called transcrip- 
tion). It is these RNA copies of segments of the DNA that are used directly as 
templates to direct the synthesis of the protein (a process called translation). 
The flow of genetic information in cells is therefore from DNA to RNA to protein 
(Figure 6-2). All cells, from bacteria to humans, express their genetic informa- 
tion in this way — a principle so fundamental that it is termed the central dogma 
of molecular biology. 

Despite the universality of the central dogma, there are important variations 
in the way in which information flows from DNA to protein. Principal among 
these is that RNA transcripts in eucaryotic cells are subject to a series of pro- 
cessing steps in the nucleus, including RNA splicing, before they are permitted 
to exit from the nucleus and be translated into protein. These processing steps 
can critically change the "meaning" of an RNA molecule and are therefore cru- 
cial for understanding how eucaryotic cells read their genomes. Finally, 
although we focus on the production of the proteins encoded by the genome in 
this chapter, we see that for many genes RNA is the final product. Like proteins, 
many of these RNAs fold into precise three-dimensional structures that have 
structural, catalytic, and regulatory roles in the cell. 

We begin this chapter with the first step in decoding a genome: the process 
of transcription by which an RNA molecule is produced from the DNA of a gene. 
We then follow the fate of this RNA molecule through the cell, finishing when a 
correctly folded protein molecule has been formed. At the end of the chapter, we 
consider how the present quite complex scheme of information storage, tran- 
scription, and translation might have arisen from simpler systems in the earliest 
stages of cell evolution. 



FROM DNA TO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of pro- 
tein rapidly when necessary. But each gene can also be transcribed and trans- 
lated with a different efficiency, allowing the cell to make vast quantities of some 
proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in the 
next chapter, a cell can change (or regulate) the expression of each of its genes 
according to the needs of the moment — most commonly by controlling the pro- 
duction of its RNA. 
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Figure 6-2 The pathway from DNA to 
protein. The flow of genetic information 
from DNA to RNA (transcription) and 
from RNA to protein (translation) occurs 
in all living cells. 



332 Chapter 6: How Cells Read the Genome: From DNA to Protein 



gene A 



gene B 



DNA 



TRANSCRIPTION 
^^^m RNA 



TRANSLATION 

V& \A, ^ v& S& 
V£> \£s K> ^£s VC/ 
K& K& K& K& K& 
\A, ^^ . yA, vA, 

v6/ v6/ v6/ vA/ v6/ 



TRANSCRIPTION 
^^M RNA 



TRANSLATION 



Figure 6-3 Genes can be expressed with 
different efficiencies. In this example, 
gene A is transcribed and translated 
much more efficiently than gene B. This 
allows the amount of protein A in the cell 
to be much greater than that of protein B. 



Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence — a gene — into an 
RNA nucleotide sequence. The information in RNA, although copied into 
another chemical form, is still written in essentially the same language as it is in 
DNA — the language of a nucleotide sequence. Hence the name transcription. 

Like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from 
DNA chemically in two respects: (1) the nucleotides in RNA are 
ribonucleotides — that is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like T, can base-pair by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U). We also find other types of base pairs in RNA: for exam- 
ple, G occasionally pairs with U. 

Figure 6-4 The chemical structure of RNA. (A) RNA contains the sugar 
ribose, which differs from deoxyribose, the sugar used in DNA, by the 
presence of an additional -OH group. (B) RNA contains the base uracil, 
which differs from thymine, the equivalent base in DNA, by the absence of 
a -CH 3 group. (C) A short length of RNA. The phosphodiester chemical 
linkage between nucleotides in RNA is the same as that in DNA. 
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Although these chemical differences are slight, DNA and RNA differ quite 
dramatically in overall structure. Whereas DNA always occurs in cells as a dou- 
ble-stranded helix, RNA is single-stranded. An RNA chain can therefore fold up 
into a particular shape, just as a polypeptide chain folds up to form the final 
shape of a protein (Figure 6-6) . As we see later in this chapter, the ability to fold 
into complex three-dimensional shapes allows some RNA molecules to have 
precise structural and catalytic functions. 



Transcription Produces RNA Complementary to One Strand of 
DNA 

The RNA in a cell is made by DNA transcription, a process that has certain simi- 
larities to the process of DNA replication discussed in Chapter 5. Transcription 
begins with the opening and unwinding of a small portion of the DNA double helix 
to expose the bases on each DNA strand. One of the two strands of the DNA dou- 
ble helix then acts as a template for the synthesis of an RNA molecule. As in DNA 
replication, the nucleotide sequence of the RNA chain is determined by the com- 
plementary base-pairing between incoming nucleotides and the DNA template. 
When a good match is made, the incoming ribonucleotide is covalently linked to 
the growing RNA chain in an enzymatically catalyzed reaction. The RNA chain 
produced by transcription — the transcript— is therefore elongated one nucleotide 
at a time, and it has a nucleotide sequence that is exactly complementary to the 
strand of DNA used as the template (Figure 6-7). 

Transcription, however, differs from DNA replication in several crucial ways. 
Unlike a newly formed DNA strand, the RNA strand does not remain hydrogen- 
bonded to the DNA template strand. Instead, just behind the region where the 
ribonucleotides are being added, the RNA chain is displaced and the DNA helix 
re-forms. Thus, the RNA molecules produced by transcription are released from 
the DNA template as single strands. In addition, because they are copied from 
only a limited region of the DNA, RNA molecules are much shorter than DNA 
molecules. A DNA molecule in a human chromosome can be up to 250 million 
nucleotide-pairs long; in contrast, most RNAs are no more than a few thousand 
nucleotides long, and many are considerably shorter. 
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Figure 6-5 Uracil forms base pairs with 
adenine. The absence of a methyl group 
in U has no effect on base-pairing; thus, 
U-A base pairs closely resemble T-A base 
pairs (see Figure 4-4). 




(B) (C) 

Figure 6-6 RNA can fold into specific structures. RNA is largely single-stranded, but it often contains short stretches of 
nucleotides that can form conventional base pairs with complementary sequences found elsewhere on the same molecule. 
These interactions, along with additional "nonconventional" base-pair interactions, allow an RNA molecule to fold into a 
three-dimensional structure that is determined by its sequence of nucleotides. <AATC> (A) Diagram of a folded RNA 
structure showing only conventional base-pair interactions. (B) Structure with both conventional (red) and nonconventional 
(green) base-pair interactions. (C) Structure of an actual RNA, a portion of a group I intron (see Figure 6-36). Each 
conventional base-pair interaction is indicated by a "rung" in the double helix. Bases in other configurations are indicated by 
broken rungs. 
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The enzymes that perform transcription are called RNA polymerases. Like 
the DNA polymerase that catalyzes DNA replication (discussed in Chapter 5), 
RNA polymerases catalyze the formation of the phosphodiester bonds that link 
the nucleotides together to form a linear chain. The RNA polymerase moves 
stepwise along the DNA, unwinding the DNA helix just ahead of the active site 
for polymerization to expose a new region of the template strand for comple- 
mentary base -pairing. In this way, the growing RNA chain is extended by one 
nucleotide at a time in the 5'-to-3' direction (Figure 6-8). The substrates are 
nucleoside triphosphates (ATP, CTP, UTP, and GTP); as in DNA replication, the 
hydrolysis of high-energy bonds provides the energy needed to drive the reac- 
tion forward (see Figure 5-4). 

The almost immediate release of the RNA strand from the DNA as it is syn- 
thesized means that many RNA copies can be made from the same gene in a rel- 
atively short time, with the synthesis of additional RNA molecules being started 
before the first RNA is completed (Figure 6-9). When RNA polymerase 
molecules follow hard on each other's heels in this way, each moving at about 20 
nucleotides per second (the speed in eucaryotes), over a thousand transcripts 
can be synthesized in an hour from a single gene. 

Although RNA polymerase catalyzes essentially the same chemical reaction 
as DNA polymerase, there are some important differences between the activities 
of the two enzymes. First, and most obviously, RNA polymerase catalyzes the 
linkage of ribonucleotides, not deoxyribonucleotides. Second, unlike the DNA 
polymerases involved in DNA replication, RNA polymerases can start an RNA 
chain without a primer. This difference may exist because transcription need 
not be as accurate as DNA replication (see Table 5-1, p. 271). Unlike DNA, RNA 
does not permanently store genetic information in cells. RNA polymerases make 
about one mistake for every 10 4 nucleotides copied into RNA (compared with an 
error rate for direct copying by DNA polymerase of about one in 10 7 
nucleotides), and the consequences of an error in RNA transcription are much 
less significant than that in DNA replication. 

Although RNA polymerases are not nearly as accurate as the DNA poly- 
merases that replicate DNA, they nonetheless have a modest proofreading 
mechanism. If an incorrect ribonucleotide is added to the growing RNA chain, 
the polymerase can back up, and the active site of the enzyme can perform an 
excision reaction that resembles the reverse of the polymerization reaction, 
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Figure 6-7 DNA transcription produces 
a single-stranded RNA molecule that is 
complementary to one strand of DNA. 
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Figure 6-8 DNA is transcribed by the enzyme RNA polymerase. (A) The RNA polymerase (pale blue) moves stepwise along the DNA, 
unwinding the DNA helix at its active site. As it progresses, the polymerase adds nucleotides (represented as small "T" shapes) one by one to 
the RNA chain at the polymerization site, using an exposed DNA strand as a template. The RNA transcript is thus a complementary copy of one 
of the two DNA strands. A short region of DNA/RNA helix (approximately nine nucleotide pairs in length) is therefore formed only transiently, 
and a "window" of DNA/RNA helix therefore moves along the DNA with the polymerase. The incoming nucleotides are in the form of 
ribonucleoside triphosphates (ATP, UTP, CTP, and GTP), and the energy stored in their phosphate-phosphate bonds provides the driving force 
for the polymerization reaction (see Figure 5-4). (B) The structure of a bacterial RNA polymerase, as determined by x-ray crystallography. Four 
different subunits, indicated by different colors, comprise this RNA polymerase. The DNA strand used as a template is red, and the nontemplate 
strand is yellow. (A, adapted from a figure courtesy of Robert Landick; B, courtesy of Seth Darst.) 
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except that water instead of pyrophosphate is used and a nucleoside monophos- 
phate is released. 

Given that DNA and RNA polymerases both carry out template-dependent 
nucleotide polymerization, it might be expected that the two types of enzymes 
would be structurally related. However, x-ray crystallographic studies of both 
types of enzymes reveal that, other than containing a critical Mg 2+ ion at the cat- 
alytic site, they are virtually unrelated to each other; indeed template-depen- 
dent nucleotide polymerizing enzymes seem to have arisen independently twice 
during the early evolution of cells. One lineage led to the modern DNA poly- 
merases and reverse transcriptases discussed in Chapter 5, as well as to a few 
single -subunit RNA polymerases from viruses. The other lineage formed all of 
the modern cellular RNA polymerases (Figure 6-10), which we discuss in this 
chapter. 



Figure 6-9 Transcription of two genes as 
observed under the electron 
microscope. The micrograph shows 
many molecules of RNA polymerase 
simultaneously transcribing each of two 
adjacent genes. Molecules of RNA 
polymerase are visible as a series of dots 
along the DNA with the newly 
synthesized transcripts (fine threads) 
attached to them. The RNA molecules 
(ribosomal RNAs) shown in this example 
are not translated into protein but are 
instead used directly as components of 
ribosomes, the machines on which 
translation takes place. The particles at 
the 5' end (the free end) of each rRNA 
transcript are believed to reflect the 
beginnings of ribosome assembly. From 
the lengths of the newly synthesized 
transcripts, it can be deduced that the 
RNA polymerase molecules are 
transcribing from left to right. (Courtesy 
of Ulrich Scheer.) 



Cells Produce Several Types of RNA 

The majority of genes carried in a cell's DNA specify the amino acid sequence of 
proteins; the RNA molecules that are copied from these genes (which ultimately 
direct the synthesis of proteins) are called messenger RNA (mRNA) molecules. 
The final product of a minority of genes, however, is the RNA itself. Careful anal- 
ysis of the complete DNA sequence of the genome of the yeast S. cerevisiae has 
uncovered well over 750 genes (somewhat more than 10% of the total number of 
yeast genes) that produce RNA as their final product. These RNAs, like proteins, 
serve as enzymatic and structural components for a wide variety of processes in 



loop 




• dimerization 

• acquisition of lysines K 

• acquisition of aspartic acids D 

• acquisition of loops 



ancient double-psi barrel 



Figure 6-1 The evolution of modern cellular RNA polymerases. 

According to this hypothesis, RNA polymerases evolved from an ancient 
protein domain, called the double-psi barrel. Crucial evolutionary steps are 
thought to include the dimerization of the domain, the insertion of large 
polypeptide "loops," the acquisition of two critical lysines needed to 
position the template, and the acquisition of three aspartic acids required 
to chelate a magnesium at the active site. This scheme depicts the 
evolution of the two largest subunits of RNA polymerase, which form the 
active site of the enzyme. The structures shown are those of the (3 and 
p' subunits of the E. coli enzyme, but the corresponding subunits of the 
eucaryotic enzyme are closely related. (Adapted from LM. Iyer, E.V. Koonin 
and L Aravind, BMC Struct Biol. 3:1, 2003.) 
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the cell. In Chapter 5 we encountered one of those RNAs, the template carried 
by the enzyme telomerase. Although many of these noncoding RNAs are still 
mysterious, we shall see in this chapter that small nuclear RNA (snRNA) 
molecules direct the splicing of pre-mRNA to form mRNA, that ribosomal RNA 
(rRNA) molecules form the core of ribosomes, and that transfer RNA (tRNA) 
molecules form the adaptors that select amino acids and hold them in place on 
a ribosome for incorporation into protein. Finally, we shall see in Chapter 7 that 
microRNA (miRNA) molecules and small interfering RNA (siRNA) molecules 
serve as key regulators of eucaryotic gene expression (Table 6-1). 

Each transcribed segment of DNA is called a transcription unit. In eucary- 
otes, a transcription unit typically carries the information of just one gene, and 
therefore codes for either a single RNA molecule or a single protein (or group of 
related proteins if the initial RNA transcript is spliced in more than one way to 
produce different mRNAs). In bacteria, a set of adjacent genes is often tran- 
scribed as a unit; the resulting mRNA molecule therefore carries the information 
for several distinct proteins. 

Overall, RNA makes up a few percent of a cell's dry weight. Most of the RNA 
in cells is rRNA; mRNA comprises only 3-5% of the total RNA in a typical mam- 
malian cell. The mRNA population is made up of tens of thousands of different 
species, and there are on average only 10-15 molecules of each species of mRNA 
present in each cell. 



Signals Encoded in DNA Tell RNA Polymerase Where to Start and 
Stop 

To transcribe a gene accurately, RNA polymerase must recognize where on the 
genome to start and where to finish. The way in which RNA polymerases per- 
form these tasks differs somewhat between bacteria and eucaryotes. Because 
the processes in bacteria are simpler, we discuss them first. 

The initiation of transcription is an especially important step in gene expres- 
sion because it is the main point at which the cell regulates which proteins are 
to be produced and at what rate. The bacterial RNA polymerase core enzyme is 
a multisubunit complex that sythesizes RNA using a DNA template as a guide. A 
detachable subunit called sigma (a) factor associates with the core enzyme and 
assists it in reading the signals in the DNA that tell it where to begin transcribing 
(Figure 6-11). Together, a factor and core enzyme are known as the RNA poly- 
merase holoenzyme; this complex adheres only weakly to bacterial DNA when 



Table 6-1 Principal Types 


of RNAs Produced in Cells 


TYPE OF RNA 


FUNCTION 


mRNAs 


messenger RNAs, code for proteins 


rRNAs 


ribosomal RNAs, form the basic structure of the 
ribosome and catalyze protein synthesis 


tRNAs 


transfer RNAs, central to protein synthesis as adaptors 
between mRNA and amino acids 


snRNAs 


small nuclear RNAs, function in a variety of nuclear 
processes, including the splicing of pre-mRNA 


snoRNAs 


small nucleolar RNAs, used to process and chemically 
modify rRNAs 


scaRNAs 


small cajal RNAs, used to modify snoRNAs and snRNAs 


miRNAs 


microRNAs, regulate gene expression typically by 
blocking translation of selective mRNAs 


siRNAs 


small interfering RNAs, turn off gene expression by 
directing degradation of selective mRNAs and the 
establishment of compact chromatin structures 


Other noncoding 
RNAs 


function in diverse cell processes, including telomere 
synthesis, X-chromosome inactivation, and the 
transport of proteins into the ER 
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the two collide, and a holoenzyme typically slides rapidly along the long DNA 
molecule until it dissociates again. However, when the polymerase holoenzyme 
slides into a region on the DNA double helix called a promoter, a special 
sequence of nucleotides indicating the starting point for RNA synthesis, the 
polymerase binds tightly to this DNA. The polymerase holoenzyme, through its 

factor, recognizes the promoter DNA sequence by making specific contacts 
with the portions of the bases that are exposed on the outside of the helix (step 

1 in Figure 6-11). 

After the RNA polymerase holoenzyme binds tightly to the promoter DNA 
in this way, it opens up the double helix to expose a short stretch of nucleotides 
on each strand (step 2 in Figure 6-11). Unlike a DNA helicase reaction (see Fig- 
ure 5-14), this limited opening of the helix does not require the energy of ATP 
hydrolysis. Instead, the polymerase and DNA both undergo reversible struc- 
tural changes that result in a state more energetically favorable than that of the 
initial binding. With the DNA unwound, one of the two exposed DNA strands 
acts as a template for complementary base-pairing with incoming ribonu- 
cleotides, two of which are joined together by the polymerase to begin an RNA 
chain (step 3 in Figure 6-11). After the first ten or so nucleotides of RNA have 
been synthesized (a relatively inefficient process during which polymerase syn- 
thesizes and discards short RNA oligomers), the core enzyme breaks its inter- 
actions with the promoter DNA, weakens its interactions with o factor, and 
begins to move down the DNA, synthesizing RNA (steps 4 and 5 in Figure 6-11). 
Chain elongation continues (at a speed of approximately 50 nucleotides/sec for 
bacterial RNA polymerases) until the enzyme encounters a second signal in the 
DNA, the terminator (described below), where the polymerase halts and 
releases both the newly made RNA chain and the DNA template (step 7 in Fig- 
ure 6-11). After the polymerase core enzyme has been released at a terminator, 
it reassociates with a free o factor to form a holoenzyme that can begin the pro- 
cess of transcription again. 
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Figure 6-1 1 The transcription cycle of 
bacterial RNA polymerase. In step 1, the 
RNA polymerase holoenzyme 
(polymerase core enzyme plus o factor) 
assembles and then locates a promoter 
(see Figure 6-1 2). The polymerase 
unwinds the DNA at the position at 
which transcription is to begin (step 2) 
and begins transcribing (step 3). This 
initial RNA synthesis (sometimes called 
"abortive initiation") is relatively 
inefficient. However, once RNA 
polymerase has managed to synthesize 
about 10 nucleotides of RNA, it breaks its 
interactions with the promoter DNA and 
weakens, and eventually breaks, its 
interaction with o.The polymerase now 
shifts to the elongation mode of RNA 
synthesis (step 4), moving rightward 
along the DNA in this diagram. During 
the elongation mode (step 5), 
transcription is highly processive, with 
the polymerase leaving the DNA 
template and releasing the newly 
transcribed RNA only when it encounters 
a termination signal (steps 6 and 7). 
Termination signals are typically encoded 
in DNA, and many function by forming an 
RNA structure that destabilizes the 
polymerase's hold on the RNA (step 7). In 
bacteria, all RNA molecules are 
synthesized by a single type of RNA 
polymerase and the cycle depicted in the 
figure therefore applies to the production 
of mRNAs as well as structural and 
catalytic RNAs. (Adapted from a figure 
courtesy of Robert Landick.) 
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The process of transcription initiation is complex and requires that the RNA 
polymerase holoenzyme and the DNA undergo a series of conformational 
changes. We can view these changes as opening up and positioning the DNA in 
the active site followed by a successive tightening of the enzyme around the 
DNA and RNA to ensure that it does not dissociate before it has finished tran- 
scribing a gene. If an RNA polymerase does dissociate prematurely, it cannot 
resume synthesis but must start over again at the promoter. 

How do the termination signals in the DNA stop the elongating polymerase? 
For most bacterial genes a termination signal consists of a string of A-T 
nucleotide pairs preceded by a two-fold symmetric DNA sequence, which, when 
transcribed into RNA, folds into a "hairpin" structure through Watson-Crick 
base-pairing (see Figure 6-11). As the polymerase transcribes across a termina- 
tor, the formation of the hairpin may help to "pull" the RNA transcript from the 
active site. The DNA-RNA hybrid in the active site, which is held together at ter- 
minators predominantly by U-A base pairs (which are less stable than G-C base 
pairs because they form two rather than three hydrogen bonds per base pair), is 
not strong enough to hold the RNA in place, and it dissociates causing the 
release of the polymerase from the DNA (step 7 in Figure 6-11). Thus, in some 
respects, transcription termination seems to involve a reversal of the structural 
transitions that happen during initiation. The process of termination also is an 
example of a common theme in this chapter: the folding of RNA into specific 
structures affects many steps in decoding the genome. 



Transcription Start and Stop Signals Are Heterogeneous in 
Nucleotide Sequence 

As we have just seen, the processes of transcription initiation and termination 
involve a complicated series of structural transitions in protein, DNA, and RNA 
molecules. The signals encoded in DNA that specify these transitions are often 
difficult for researchers to recognize. Indeed, a comparison of many different 
bacterial promoters reveals a surprising degree of variation. Nevertheless, they 
all contain related sequences, reflecting in part aspects of the DNA that are rec- 
ognized directly by the a factor. These common features are often summarized 
in the form of a consensus sequence (Figure 6-12). A consensus nucleotide 
sequence is derived by comparing many sequences with the same basic func- 
tion and tallying up the most common nucleotide found at each position. It 
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Figure 6-1 2 Consensus sequence for the 
major class of E coli promoters. (A) The 

promoters are characterized by two 
hexameric DNA sequences, the -35 
sequence and the -1 sequence named 
for their approximate location relative to 
the start point of transcription 
(designated +1). For convenience, the 
nucleotide sequence of a single strand of 
DNA is shown; in reality the RNA 
polymerase recognizes the promoter as 
double-stranded DNA. On the basis of a 
comparison of 300 promoters, the 
frequencies of the four nucleotides at 
each position in the -35 and -1 
hexamers are given. The consensus 
sequence, shown below the graph, 
reflects the most common nucleotide 
found at each position in the collection of 
promoters. The sequence of nucleotides 
between the -35 and -1 hexamers 
shows no significant similarities among 
promoters. (B) The distribution of spacing 
between the -35 and -1 hexamers 
found in E. coli promoters. 

The information displayed in these two 
graphs applies to E coli promoters that 
are recognized by RNA polymerase and 
the major o factor (designated o 70 ). As 
we shall see in the next chapter, bacteria 
also contain minor o factors, each of 
which recognizes a different promoter 
sequence. Some particularly strong 
promoters recognized by RNA 
polymerase and o 70 have an additional 
sequence, located upstream (to the left, 
in the figure) of the -35 hexamer, which 
is recognized by another subunit of RNA 
polymerase. 
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therefore serves as a summary or "average" of a large number of individual 
nucleotide sequences. 

The DNA sequences of individual bacterial promoters differ in ways that 
determine their strength (the number of initiation events per unit time of the 
promoter). Evolutionary processes have fine-tuned each to initiate as often as 
necessary and have thereby created a wide spectrum of promoters. Promoters 
for genes that code for abundant proteins are much stronger than those associ- 
ated with genes that encode rare proteins, and their nucleotide sequences are 
responsible for these differences. 

Like bacterial promoters, transcription terminators also have a wide range 
of sequences, with the potential to form a simple hairpin RNA structure being 
the most important common feature. Since an almost unlimited number of 
nucleotide sequences have this potential, terminator sequences are even more 
heterogeneous than promoter sequences. 

We have discussed bacterial promoters and terminators in some detail to 
illustrate an important point regarding the analysis of genome sequences. 
Although we know a great deal about bacterial promoters and terminators and 
can construct consensus sequences that summarize their most salient features, 
their variation in nucleotide sequence makes it difficult to definitively locate 
them simply by analysis of the nucleotide sequence of a genome. It is even more 
difficult to locate analogous sequences in eucaryotic genomes, due in part to the 
excess DNA carried in them. Often, we need additional information, some of it 
from direct experimentation, to locate and accurately interpret the short DNA 
signals contained in genomes. 

Since DNA is double -stranded, two different RNA molecules could in princi- 
ple be transcribed from any gene, using each of the two DNA strands as a tem- 
plate. However, a gene typically has only a single promoter, and because the pro- 
moter's nucleotide sequence is asymmetric (see Figure 6-12), the polymerase can 
bind in only one orientation. The polymerase synthesizes RNA in the 5'- to -3' 
direction, and it can therefore only transcribe one strand per gene (Figure 6-13). 
Genome sequences reveal that the DNA strand used as the template for RNA syn- 
thesis varies from gene to gene depending on the location and orientation of the 
promoter (Figure 6-14). 

Having considered transcription in bacteria, we now turn to the situation in 
eucaryotes, where the synthesis of RNA molecules is a much more elaborate 
affair. 
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Figure 6-1 3 The importance of RNA 
polymerase orientation. The DNA strand 
serving as template must be traversed in a 
3'-to-5' direction. Thus, the direction of 
RNA polymerase movement determines 
which of the two DNA strands is to serve 
as a template for the synthesis of RNA, as 
shown in (A) and (B). Polymerase direction 
is, in turn, determined by the orientation 
of the promoter sequence, the site at 
which the RNA polymerase begins 
transcription. 



Transcription Initiation in Eucaryotes Requires Many Proteins 

In contrast to bacteria, which contain a single type of RNA polymerase, eucary- 
otic nuclei have three: RNA polymerase I, RNA polymerase II, and RNA poly- 
merase III. The three polymerases are structurally similar to one another (and to 
the bacterial enzyme) and share some common subunits, but they transcribe 
different types of genes (Table 6-2). RNA polymerases I and III transcribe the 
genes encoding transfer RNA, ribosomal RNA, and various small RNAs. RNA 
polymerase II transcribes most genes, including all those that encode proteins, 
and our subsequent discussion therefore focuses on this enzyme. 

Although eucaryotic RNA polymerase II has many structural similarities to 
bacterial RNA polymerase (Figure 6-15), there are several important differences 
in the way in which the bacterial and eucaryotic enzymes function, two of which 
concern us immediately. 
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Figure 6-14 Directions of transcription 
along a short portion of a bacterial 
chromosome. Some genes are transcribed 
using one DNA strand as a template, while 
others are transcribed using the other 
DNA strand. The direction of transcription 
is determined by the promoter at the 
beginning of each gene (green 
arrowheads). This diagram shows 
approximately 0.2% (9000 base pairs) of 
the E. coli chromosome. The genes 
transcribed from left to right use the 
bottom DNA strand as the template; those 
transcribed from right to left use the top 
strand as the template. 
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Table 6-2 The Three 


RNA 


Polymerases in Eucaryotic Cells 


TYPE OF POLYMERASE 


GENES TRANSCRIBED 


RNA polymerase 1 




5.8S, 1 8S, and 28S rRNA genes 


RNA polymerase II 




all protein -coding genes, plus snoRNA genes, miRNA 
genes, siRNA genes, and most snRNA genes 


RNA polymerase III 




tRNA genes, 5S rRNA genes, some snRNA genes 
and genes for other small RNAs 



The rRNAs are named according to their "S" values, which refer to their rate of sedimentation in an 
ultracentrifuge. The larger the S value, the larger the rRNA. 



While bacterial RNA polymerase requires only a single additional protein 
(o factor) for transcription initiation to occur in vitro, eucaryotic RNA poly- 
merases require many additional proteins, collectively called the general 
transcription factors. 

Eucaryotic transcription initiation must deal with the packing of DNA into 
nucleosomes and higher- order forms of chromatin structure, features 
absent from bacterial chromosomes. 



RNA Polymerase II Requires General Transcription Factors 

The general transcription factors help to position eucaryotic RNA polymerase 
correctly at the promoter, aid in pulling apart the two strands of DNA to allow 
transcription to begin, and release RNA polymerase from the promoter into the 
elongation mode once transcription has begun. <CTAT> The proteins are "gen- 
eral" because they are needed at nearly all promoters used by RNA polymerase II; 
consisting of a set of interacting proteins, they are designated as TFII (for tran- 
scription factor for polymerase II), and are denoted arbitrarily as TFIIB, TFIID, 
and so on. In a broad sense, the eucaryotic general transcription factors carry out 
functions equivalent to those of the o factor in bacteria; indeed, portions of TFIIF 
have the same three-dimensional structure as the equivalent portions of a. 

Figure 6-16 illustrates how the general transcription factors assemble at 
promoters used by RNA polymerase II, and Table 6-3 summarizes their activi- 
ties. The assembly process begins when the general transcription factor TFIID 
binds to a short double-helical DNA sequence primarily composed of T and A 
nucleotides. For this reason, this sequence is known as the TATA sequence, or 
TATA box, and the subunit of TFIID that recognizes it is called TBP (for TATA- 
binding protein). The TATA box is typically located 25 nucleotides upstream 
from the transcription start site. It is not the only DNA sequence that signals the 
start of transcription (Figure 6-17), but for most polymerase II promoters it is 
the most important. The binding of TFIID causes a large distortion in the DNA 
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Figure 6-15 Structural similarity 
between a bacterial RNA polymerase 
and a eucaryotic RNA polymerase II. 

Regions of the two RNA polymerases that 
have similar structures are indicated in 
green. The eucaryotic polymerase is larger 
than the bacterial enzyme (12 subunits 
instead of 5), and some of the additional 
regions are shown in gray. The blue 
spheres represent Zn atoms that serve as 
structural components of the 
polymerases, and the red sphere 
represents the Mg atom present at the 
active site, where polymerization takes 
place. The RNA polymerases in all 
modern-day cells (bacteria, archaea, and 
eucaryotes) are closely related, indicating 
that the basic features of the enzyme 
were in place before the divergence of 
the three major branches of life. 
(Courtesy of P. Cramer and R. Kornberg.) 
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Figure 6-16 Initiation of transcription of a eucaryotic gene by RNA 
polymerase II. To begin transcription, RNA polymerase requires several 
general transcription factors. (A) The promoter contains a DNA 
sequence called the TATA box, which is located 25 nucleotides away 
from the site at which transcription is initiated. (B) Through its subunit 
TBP, TFIID recognizes and binds the TATA box, which then enables the 
adjacent binding of TFIIB (C). For simplicity the DNA distortion 
produced by the binding of TFIID (see Figure 6-1 8) is not shown. 
(D) The rest of the general transcription factors, as well as the RNA 
polymerase itself, assemble at the promoter. (E) TFIIH then uses ATP to 
pry apart the DNA double helix at the transcription start point, locally 
exposing the template strand. TFIIH also phosphorylates RNA 
polymerase II, changing its conformation so that the polymerase is 
released from the general factors and can begin the elongation phase 
of transcription. As shown, the site of phosphorylation is a long 
C-terminal polypeptide tail, also called the C-terminal domain (CTD), 
that extends from the polymerase molecule. The assembly scheme 
shown in the figure was deduced from experiments performed in vitro, 
and the exact order in which the general transcription factors assemble 
on promoters may vary from gene to gene in vivo. The general 
transcription factors have been highly conserved in evolution; some of 
those from human cells can be replaced in biochemical experiments by 
the corresponding factors from simple yeasts. 



start of transcription 



TATA box 



(A) 




TBP TFIID 



(B) 




TFIIB 



(C) 



TFIIE 



TFIIH 



CTD 



TFIIF other factors 



Iwlflf 



RNA polymerase II 



of the TATA box (Figure 6-18). This distortion is thought to serve as a physical 
landmark for the location of an active promoter in the midst of a very large 
genome, and it brings DNA sequences on both sides of the distortion together to 
allow for subsequent protein assembly steps. Other factors then assemble, along 
with RNA polymerase II, to form a complete transcription initiation complex 
(see Figure 6-16). The most complicated of the general transcription factors is 
TFIIH. Consisting of 9 subunits, it is nearly as large as RNA polymerase II itself 
and, as we shall see shortly, performs several enzymatic steps needed for the ini- 
tiation of transcription. 

After forming a transcription initiation complex on the promoter DNA, RNA 
polymerase II must gain access to the template strand at the transcription start 
point. TFIIH, which contains a DNA helicase as one of its subunits, makes this 
step possible by hydrolyzing ATP and unwinding the DNA, thereby exposing the 
template strand. Next, RNA polymerase II, like the bacterial polymerase, 
remains at the promoter synthesizing short lengths of RNA until it undergoes a 
series of conformational changes that allow it to move away from the promoter 
and enter the elongation phase of transcription. A key step in this transition is 
the addition of phosphate groups to the "tail" of the RNA polymerase (known as 
the CTD or C-terminal domain). In humans, the CTD consists of 52 tandem 
repeats of a seven- amino -acid sequence, which extend from the RNA poly- 
merase core structure. During transcription initiation, the serine located at the 
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Table 6-3 The General Transcription Factors Needed for Transcription Initiation by Eucaryotic RNA Polymerase II 

NAME NUMBER OF ROLES IN TRANSITION INITIATE 

BUNITS 




TFIIB 

TFIIF 
TFIIE 
TFIIH 




TFIID 
TBP subunit 
TAF subunits 



1 recog n izes TATA box 

-1 1 recognizes other DNA sequences near the transcription start point; regulates DNA-binding 

by TBP 

1 recognizes BRE element in promoters; accurately positions RNA polymerase at the start site 

of transcription 
3 stabilizes RNA polymerase interaction with TBP and TFIIB; helps attract TFIIE and TFIIH 

2 attracts and regulates TFIIH 

9 unwinds DNA at the transcription start point, phosphorylates Ser5 of the RNA polymerase 

CTD; releases RNA polymerase from the promoter 



TFIID is composed of TBP and ~1 1 additional subunits called TAFs (TBP-associated factors); CTD, C-terminal domain. 
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fifth position in the repeat sequence (Ser5) is phosphorylated by TFIIH, which 
contains a protein kinase in another of its subunits (see Figure 6-16D and E). 
The polymerase can then disengage from the cluster of general transcription 
factors. During this process, it undergoes a series of conformational changes 
that tighten its interaction with DNA, and it acquires new proteins that allow it 
to transcribe for long distances, and in some cases for many hours, without dis- 
sociating from DNA. 

Once the polymerase II has begun elongating the RNA transcript, most of 
the general transcription factors are released from the DNA so that they are 
available to initiate another round of transcription with a new RNA polymerase 
molecule. As we see shortly, the phosphorylation of the tail of RNA polymerase 
II also causes components of the RNA-processing machinery to load onto the 
polymerase and thus be positioned to modify the newly transcribed RNA as it 
emerges from the polymerase. 



Figure 6-1 7 Consensus sequences 
found in the vicinity of eucaryotic RNA 
polymerase II start points. The name 
given to each consensus sequence (first 
column) and the general transcription 
factor that recognizes it (last column) are 
indicated. N indicates any nucleotide, and 
two nucleotides separated by a slash 
indicate an equal probability of either 
nucleotide at the indicated position. In 
reality, each consensus sequence is a 
shorthand representation of a histogram 
similar to that of Figure 6-1 2. 
For most RNA polymerase II 
transcription start points, only two or 
three of the four sequences are present. 
For example, many polymerase II 
promoters have a TATA box sequence, but 
those that do not typically have a "strong" 
INR sequence. Although most of the DNA 
sequences that influence transcription 
initiation are located upstream of the 
transcription start point, a few, such as 
the DPE shown in the figure, are located 
in the transcribed region. 



Polymerase II Also Requires Activator, Mediator, and Chromatin- 
Modifying Proteins 

Studies of the behavior of RNA polymerase II and its general transcription fac- 
tors on purified DNA templates in vitro established the model for transcription 
initiation just described. However, as discussed in Chapter 4, DNA in eucaryotic 
cells is packaged into nucleosomes, which are further arranged in higher-order 




Figure 6-18 Three-dimensional 
structure of TBP (TATA-binding protein) 
bound to DNA. The TBP is the subunit of 
the general transcription factor TFIID that 
is responsible for recognizing and 
binding to the TATA box sequence in the 
DNA (red). The unique DNA bending 
caused by TBP — two kinks in the double 
helix separated by partly unwound 
DNA — may serve as a landmark that 
helps to attract the other general 
transcription factors. TBP is a single 
polypeptide chain that is folded into two 
very similar domains (blue and green). 
(Adapted from J.L. Kim et al., Nature 
365:520-527, 1993. With permission from 
Macmillan Publishers Ltd.) 
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chromatin structures. As a result, transcription initiation in a eucaryotic cell is 
more complex and requires even more proteins than it does on purified DNA. 
First, gene regulatory proteins known as transcriptional activators must bind to 
specific sequences in DNA and help to attract RNA polymerase II to the start 
point of transcription (Figure 6- 19). We discuss the role of activators in Chapter 
7, because they are one of the main ways in which cells regulate expression of 
their genes. Here we simply note that their presence on DNA is required for tran- 
scription initiation in a eucaryotic cell. Second, eucaryotic transcription initia- 
tion in vivo requires the presence of a protein complex known as Mediator, 
which allows the activator proteins to communicate properly with the poly- 
merase II and with the general transcription factors. Finally, transcription initi- 
ation in a eucaryotic cell typically requires the local recruitment of chromatin- 
modifying enzymes, including chromatin remodeling complexes and histone- 
modifying enzymes. As discussed in Chapter 4, both types of enzymes can allow 
greater access to the DNA present in chromatin, and by doing so, they facilitate 
the assembly of the transcription initiation machinery onto DNA. We will revisit 
the role of these enzymes in transcription initiation in Chapter 7. 

As illustrated in Figure 6-19, many proteins (well over 100 individual sub- 
units) must assemble at the start point of transcription to initiate transcription 
in a eucaryotic cell. The order of assembly of these proteins does not seem to fol- 
low a prescribed pathway; rather, the order differs from gene to gene. Indeed, 
some of these different protein complexes may interact with each other away 
from the DNA and be brought to DNA as preformed subassemblies. To begin 
transcribing, RNA polymerase II must be released from this large complex of 
proteins, and, in addition to the steps described in Figure 6-16, this often 
requires the in situ proteolysis of the activator protein. We return to some of 
these issues in Chapter 7, where we discuss how eucaryotic cells can regulate the 
process of transcription initiation. 
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Figure 6-1 9 Transcription initiation by 
RNA polymerase II in a eucaryotic cell. 

Transcription initiation in vivo requires 
the presence of transcriptional activator 
proteins. As described in Chapter 7, these 
proteins bind to specific short sequences 
in DNA. Although only one is shown here, 
a typical eucaryotic gene has many 
activator proteins, which together 
determine its rate and pattern of 
transcription. Sometimes acting from a 
distance of several thousand nucleotide 
pairs (indicated by the dashed DNA 
molecule), these gene regulatory 
proteins help RNA polymerase, the 
general transcription factors, and the 
mediator all to assemble at the promoter. 
In addition, activators attract ATP- 
dependent chromatin remodeling 
complexes and histone acetylases. 

As discussed in Chapter 4, the "default" 
state of chromatin is probably the 30-nm 
filament (see Figure 4-22), and this is 
likely to be a form of DNA upon which 
transcription is initiated. For simplicity, it 
is not shown in the figure. 



Transcription Elongation Produces Superhelical Tension in DNA 

Once it has initiated transcription, RNA polymerase does not proceed smoothly 
along a DNA molecule; rather, it moves jerkily, pausing at some sequences and 
rapidly transcribing through others. Elongating RNA polymerases, both bacte- 
rial and eucaryotic, are associated with a series of elongation factors, proteins 
that decrease the likelihood that RNA polymerase will dissociate before it 
reaches the end of a gene. These factors typically associate with RNA polymerase 
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Figure 6-20 Superhelical tension in DNA 
causes DNA supercoiling. (A) For a DNA 
molecule with one free end (or a nick in 
one strand that serves as a swivel), the 
DNA double helix rotates by one turn for 
every 10 nucleotide pairs opened. 
(B) If rotation is prevented, superhelical 
tension is introduced into the DNA by 
helix opening. One way of 
accommodating this tension would be to 
increase the helical twist from 10 to 1 1 
nucleotide pairs per turn in the double 
helix that remains; the DNA helix, 
however, resists such a deformation in a 
springlike fashion, preferring to relieve 
the superhelical tension by bending into 
supercoiled loops. As a result, one DNA 
supercoil forms in the DNA double helix 
for every 10 nucleotide pairs opened. The 
supercoil formed in this case is a positive 
supercoil. (C) Supercoiling of DNA is 
induced by a protein tracking through 
the DNA double helix. The two ends of 
the DNA shown here are unable to rotate 
freely relative to each other, and the 
protein molecule is assumed also to be 
prevented from rotating freely as it 
moves. Under these conditions, the 
movement of the protein causes an 
excess of helical turns to accumulate in 
the DNA helix ahead of the protein and a 
deficit of helical turns to arise in the DNA 
behind the protein, as shown. 



shortly after initiation and help polymerases to move through the wide variety of 
different DNA sequences that are found in genes. Eucaryotic RNA polymerases 
must also contend with chromatin structure as they move along a DNA tem- 
plate, and they are typically aided by ATP-dependent chromatin remodeling 
complexes (see pp. 215-216). These complexes may move with the polymerase 
or may simply seek out and rescue the occasional stalled polymerase. In addi- 
tion, some elongation factors associated with eucaryotic RNA polymerase facil- 
itate transcription through nucleosomes without requiring additional energy. It 
is not yet understood in detail how this is accomplished, but these proteins can 
transiently dislodge H2A-H2B dimers from the nucleosome core, replacing 
them as the polymerase moves through the nucleosome. 

There is yet another barrier to elongating polymerases, both bacterial and 
eucaryotic. To discuss this issue, we need first to consider a subtle property 
inherent in the DNA double helix called DNA supercoiling. DNA supercoiling 
represents a conformation that DNA adopts in response to superhelical tension; 
conversely, creating various loops or coils in the helix can create such tension. 
Figure 6-20 illustrates the topological constraints that cause DNA supercoiling. 
There are approximately 10 nucleotide pairs for every helical turn in a DNA dou- 
ble helix. Imagine a helix whose two ends are fixed with respect to each other (as 
they are in a DNA circle, such as a bacterial chromosome, or in a tightly clamped 
loop, as is thought to exist in eucaryotic chromosomes). In this case, one large 
DNA supercoil will form to compensate for each 10 nucleotide pairs that are 
opened (unwound). The formation of this supercoil is energetically favorable 
because it restores a normal helical twist to the base-paired regions that remain, 
which would otherwise need to be overwound because of the fixed ends. 

RNA polymerase also creates superhelical tension as it moves along a stretch 
of DNA that is anchored at its ends (see Figure 6-20C) . As long as the polymerase 
is not free to rotate rapidly (and such rotation is unlikely given the size of RNA 
polymerases and their attached transcripts), a moving polymerase generates 
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positive superhelical tension in the DNA in front of it and negative helical ten- 
sion behind it. For eucaryotes, this situation is thought to provide a bonus: the 
positive superhelical tension ahead of the polymerase makes the DNA helix 
more difficult to open, but this tension should facilitate the unwrapping of DNA 
in nucleosomes, as the release of DNA from the histone core helps to relax pos- 
itive superhelical tension. 

Any protein that propels itself alone along a DNA strand of a double helix 
tends to generate superhelical tension. In eucaryotes, DNA topoisomerase 
enzymes rapidly remove this superhelical tension (see p. 278). But in bacteria a 
specialized topoisomerase called DNA gyrase uses the energy of ATP hydrolysis 
to pump supercoils continuously into the DNA, thereby maintaining the DNA 
under constant tension. These are negative supercoils, having the opposite 
handedness from the positive supercoils that form when a region of DNA helix 
opens (see Figure 6-20B). Whenever a region of helix opens, it removes these 
negative supercoils from bacterial DNA, reducing the superhelical tension. DNA 
gyrase therefore makes the opening of the DNA helix in bacteria energetically 
favorable compared with helix opening in DNA that is not supercoiled. For this 
reason, it usually facilitates those genetic processes in bacteria, including the 
initiation of transcription by bacterial RNA polymerase, that require helix open- 
ing (see Figure 6-11). 



Figure 6-21 Summary of the steps 
leading from gene to protein in 
eucaryotes and bacteria. The final level 
of a protein in the cell depends on the 
efficiency of each step and on the rates of 
degradation of the RNA and protein 
molecules. (A) In eucaryotic cells the RNA 
molecule resulting from transcription 
contains both coding (exon) and 
noncoding (intron) sequences. Before it 
can be translated into protein, the two 
ends of the RNA are modified, the introns 
are removed by an enzymatically 
catalyzed RNA splicing reaction, and the 
resulting mRNA is transported from the 
nucleus to the cytoplasm. Although the 
steps in this figure are depicted as 
occurring one at a time, in a sequence, in 
reality they can occur concurrently. For 
example, the RNA cap is added and 
splicing typically begins before 
transcription has been completed. 
Because of the coupling between 
transcription and RNA processing, 
primary transcripts — the RNAs that 
would, in theory, be produced if no 
processing had occurred — are found only 
rarely. (B) In procaryotes the production 
of mRNA is much simpler. The 5' end of 
an mRNA molecule is produced by the 
initiation of transcription, and the 3' end 
is produced by the termination of 
transcription. Since procaryotic cells lack 
a nucleus, transcription and translation 
take place in a common compartment. In 
fact, the translation of a bacterial mRNA 
often begins before its synthesis has 
been completed. 



Transcription Elongation in Eucaryotes Is Tightly Coupled to RNA 
Processing 

We have seen that bacterial mRNAs are synthesized solely by the RNA poly- 
merase starting and stopping at specific spots on the genome. The situation in 
eucaryotes is substantially different. In particular, transcription is only the first 
of several steps needed to produce an mRNA. Other critical steps are the cova- 
lent modification of the ends of the RNA and the removal of intron sequences 
that are discarded from the middle of the RNA transcript by the process of RNA 
splicing (Figure 6-21). 
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Both ends of eucaryotic mRNAs are modified: by capping on the 5' end and 
by polyadenylation of the 3' end (Figure 6-22). These special ends allow the cell 
to assess whether both ends of an mRNA molecule are present (and the message 
is therefore intact) before it exports the RNA sequence from the nucleus and 
translates it into protein. RNA splicing joins together the different portions of a 
protein coding sequence, and it provides higher eucaryotes with the ability to 
synthesize several different proteins from the same gene. 

An ingenious mechanism couples all of the above RNA processing steps to 
transcription elongation. As discussed previously, a key step in transcription ini- 
tiation by RNA polymerase II is the phosphorylation of the RNA polymerase II 
tail, called the CTD (C-terminal domain). This phosphorylation proceeds grad- 
ually as the RNA polymerase initiates transcription and moves along the DNA. It 
not only helps dissociate the RNA polymerase II from other proteins present at 
the start point of transcription, but also allows a new set of proteins to associate 
with the RNA polymerase tail that function in transcription elongation and RNA 
processing. As discussed next, some of these processing proteins seem to "hop" 
from the polymerase tail onto the nascent RNA molecule to begin processing it 
as it emerges from the RNA polymerase. Thus, we can view RNA polymerase II in 
its elongation mode as an RNA factory that both transcribes DNA into RNA and 
processes the RNA it produces (Figure 6-23). Fully extended, the CTD is nearly 
10 times longer than the remainder of RNA polymerase and, in effect, it serves as 
a tether, holding a variety of proteins close by until they are needed. This strat- 
egy, which speeds up the rate of subsequent reactions, is one commonly 
observed in the cell (see Figures 4-69 and 16-38). 



RNA Capping Is the First Modification of Eucaryotic Pre-mRNAs 

As soon as RNA polymerase II has produced about 25 nucleotides of RNA, the 5' 
end of the new RNA molecule is modified by addition of a cap that consists of a 
modified guanine nucleotide (see Figure 6-22B). Three enzymes, acting in suc- 
cession, perform the capping reaction: one (a phosphatase) removes a phos- 
phate from the 5' end of the nascent RNA, another (a guanyl transferase) adds a 
GMP in a reverse linkage (5' to 5' instead of 5' to 3'), and a third (a methyl trans- 
ferase) adds a methyl group to the guanosine (Figure 6-24). Because all three 
enzymes bind to the RNA polymerase tail phosphorylated at serine-5 position, 
the modification added by TFIIH during transcription initiation, they are poised 
to modify the 5' end of the nascent transcript as soon as it emerges from the 
polymerase. 

The 5'-methyl cap signifies the 5' end of eucaryotic mRNAs, and this land- 
mark helps the cell to distinguish mRNAs from the other types of RNA molecules 
present in the cell. For example, RNA polymerases I and III produce uncapped 



CH 2 - P- P- P-CH 2 
5' 



N© 
I 
CH 3 



5'-to-5' 
triphosphate ^ 
bridge 




Figure 6-22 A comparison of the 
structures of procaryotic and eucaryotic 
mRNA molecules. (A) The 5' and 3' ends 
of a bacterial mRNA are the unmodified 
ends of the chain synthesized by the RNA 
polymerase, which initiates and 
terminates transcription at those points, 
respectively. The corresponding ends of a 
eucaryotic mRNA are formed by adding a 
5' cap and by cleavage of the pre-mRNA 
transcript and the addition of a poly-A 
tail, respectively. The figure also illustrates 
another difference between the 
procaryotic and eucaryotic mRNAs: 
bacterial mRNAs can contain the 
instructions for several different proteins, 
whereas eucaryotic mRNAs nearly always 
contain the information for only a single 
protein. (B) The structure of the cap at the 
5' end of eucaryotic mRNA molecules. 
Note the unusual 5'-to-5' linkage of the 
7-methyl G to the remainder of the RNA. 
Many eucaryotic mRNAs carry an 
additional modification: the 2 / -hydroxyl 
group on the second ribose sugar in the 
mRNA is methylated (not shown). 
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Figure 6-23 Eucaryotic RNA polymerase II as an "RNA factory/' As the 

polymerase transcribes DNA into RNA, it carries pre-mRNA-processing 
proteins on its tail that are transferred to the nascent RNA at the 
appropriate time. The tail, known as the CTD, contains 52 tandem repeats 
of a seven amino acid sequence, and there are two serines in each repeat. 
The capping proteins first bind to the RNA polymerase tail when it is 
phosphorylated on Ser5 of the heptad repeat late in the process of 
transcription initiation (see Figure 6-1 6). This strategy ensures that the RNA 
molecule is efficiently capped as soon as its 5' end emerges from the RNA 
polymerase. As the polymerase continues transcribing, its tail is extensively 
phosphorylated on the Ser2 positions by a kinase associated with the 
elongating polymerase and is eventually dephosphorylated at Ser5 
positions. These further modifications attract splicing and 3'-end 
processing proteins to the moving polymerase, positioning them to act on 
the newly synthesized RNA as it emerges from the RNA polymerase. There 
are many RNA-processing enzymes, and not all travel with the polymerase. 
For RNA splicing, for example, the tail carries only a few critical 
components; once transferred to an RNA molecule, they serve as a 
nucleation site for the remaining components. 

When RNA polymerase II finishes transcribing a gene, it is released from 
DNA, soluble phosphatases remove the phosphates on its tail, and it can 
reinitiate transcription. Only the dephosphorylated form of RNA 
polymerase II is competent to begin RNA synthesis at a promoter. 



RNAs during transcription, in part because these polymerases lack a CTD. In the 
nucleus, the cap binds a protein complex called CBC (cap-binding complex), 
which, as we discuss in subsequent sections, helps the RNA to be properly pro- 
cessed and exported. The 5'-methyl cap also has an important role in the trans- 
lation of mRNAs in the cytosol, as we discuss later in the chapter. 
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RNA Splicing Removes Intron Sequences from Newly Transcribed 
Pre-mRNAs 



As discussed in Chapter 4, the protein coding sequences of eucaryotic genes are 
typically interrupted by noncoding intervening sequences (introns). Discovered 
in 1977, this feature of eucaryotic genes came as a surprise to scientists, who had 
been, until that time, familiar only with bacterial genes, which typically consist 
of a continuous stretch of coding DNA that is directly transcribed into mRNA. In 
marked contrast, eucaryotic genes were found to be broken up into small pieces 
of coding sequence {expressed sequences or exons) interspersed with much 
longer intervening sequences or introns; thus, the coding portion of a eucaryotic 
gene is often only a small fraction of the length of the gene (Figure 6-25). 

Both intron and exon sequences are transcribed into RNA. The intron 
sequences are removed from the newly synthesized RNA through the process of 
RNA splicing. The vast majority of RNA splicing that takes place in cells func- 
tions in the production of mRNA, and our discussion of splicing focuses on this 
so-called precursor-mRNA (or pre-mRNA) splicing. Only after 5' and 3' end pro- 
cessing and splicing have taken place is such RNA termed mRNA. 

Each splicing event removes one intron, proceeding through two sequen- 
tial phosphoryl-transfer reactions known as transesterifications; these join two 
exons while removing the intron as a "lariat" (Figure 6-26). Since the number 
of high-energy phosphate bonds remains the same, these reactions could in 



Figure 6-24 The reactions that cap the 5' end of each RNA molecule 
synthesized by RNA polymerase II. The final cap contains a novel 5'-to-5' 
linkage between the positively charged 7-methyl G residue and the 5' end 
of the RNA transcript (see Figure 6-22B).The letter N represents any one of 
the four ribonucleotides, although the nucleotide that starts an RNA chain 
is usually a purine (an A or a G). (After A.J. Shatkin, BioEssays 7:275-277, 
1987. With permission from ICSU Press.) 
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Figure 6-25 Structure of two human genes showing the arrangement of exons and introns. (A) The relatively small 
(3-globin gene, which encodes one of the subunits of the oxygen-carrying protein hemoglobin, contains 3 exons (see also 
Figure 4-7). (B) The much larger Factor VIII gene contains 26 exons; it codes for a protein (Factor VIII) that functions in the 
blood-clotting pathway. The most prevalent form of hemophilia results from mutations in this gene. 

principle take place without nucleoside triphosphate hydrolysis. However, the 
machinery that catalyzes pre-mRNA splicing is complex, consisting of 5 addi- 
tional RNA molecules and as many as 200 proteins, and it hydrolyzes many ATP 
molecules per splicing event. This additional complexity ensures that splicing is 
accurate, while at the same time being flexible enough to deal with the enor- 
mous variety of introns found in a typical eucaryotic cell. 

It may seem wasteful to remove large numbers of introns by RNA splicing. In 
attempting to explain why it occurs, scientists have pointed out that the 
exon-intron arrangement would seem to facilitate the emergence of new and 
useful proteins over evolutionary time scales. Thus, the presence of numerous 
introns in DNA allows genetic recombination to readily combine the exons of 
different genes (see p. 140), enabling genes for new proteins to evolve more eas- 
ily by the combination of parts of preexisting genes. The observation, described 
in Chapter 3, that many proteins in present-day cells resemble patchworks com- 
posed from a common set of protein domains, supports this idea. 

RNA splicing also has a present-day advantage. The transcripts of many 
eucaryotic genes (estimated at 75% of genes in humans) are spliced in more 
than one way, thereby allowing the same gene to produce a corresponding set of 
different proteins (Figure 6-27). Rather than being the wasteful process it may 
have seemed at first sight, RNA splicing enables eucaryotes to increase the 
already enormous coding potential of their genomes. We shall return to this idea 
again in this chapter and the next, but we first need to describe the cellular 
machinery that performs this remarkable task. 
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Figure 6-26 The pre-mRNA splicing 
reaction. (A) In the first step, a specific 
adenine nucleotide in the intron 
sequence (indicated in red) attacks the 
5' splice site and cuts the sugar- 
phosphate backbone of the RNA at this 
point. The cut 5' end of the intron 
becomes covalently linked to the adenine 
nucleotide, as shown in detail in (B), 
thereby creating a loop in the RNA 
molecule. The released free 3'-OH end of 
the exon sequence then reacts with the 
start of the next exon sequence, joining 
the two exons together and releasing the 
intron sequence in the shape of a lariat 
The two exon sequences thereby become 
joined into a continuous coding 
sequence; the released intron sequence is 
eventually degraded. 
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Nucleotide Sequences Signal Where Splicing Occurs 

The mechanism of pre -mRNA splicing shown in Figure 6-26 implies that the 
splicing machinery must recognize three portions of the precursor RNA 
molecule: the 5' splice site, the 3' splice site, and the branch point in the intron 
sequence that forms the base of the excised lariat. Not surprisingly, each site has 
a consensus nucleotide sequence that is similar from intron to intron and pro- 
vides the cell with cues for where splicing is to take place (Figure 6-28) . How- 
ever, these consensus sequences are relatively short and can accommodate a 
high degree of sequence variability; as we shall see shortly, the cell incorporates 
additional types of information to ultimately choose exactly where, on each RNA 
molecule, splicing is to take place. 

The high variability of the splicing consensus sequences presents a special 
challenge for scientists attempting to decipher genome sequences. Introns 
range in size from about 10 nucleotides to over 100,000 nucleotides, and choos- 
ing the precise borders of each intron is a difficult task even with the aid of pow- 
erful computers. The possibility of alternative splicing compounds the problem 
of predicting protein sequences solely from a genome sequence. This difficulty 
is one of the main barriers to identifying all of the genes in a complete genome 
sequence, and it is one of the primary reasons why we know only the approxi- 
mate number of genes in the human genome. 



Figure 6-27 Alternative splicing of the 
oe-tropomyosin gene from rat. 

a-Tropomyosin is a coiled-coil protein 
(see Figure 3-9) that regulates 
contraction in muscle cells. The primary 
transcript can be spliced in different 
ways, as indicated in the figure, to 
produce distinct mRNAs, which then give 
rise to variant proteins. Some of the 
splicing patterns are specific for certain 
types of cells. For example, the 
a-tropomyosin made in striated muscle is 
different from that made from the same 
gene in smooth muscle. The arrowheads 
in the top part of the figure mark the 
sites where cleavage and poly-A addition 
form the 3' ends of the mature mRNAs. 



RNA Splicing Is Performed by the Spliceosome 

Unlike the other steps of mRNA production we have discussed, key steps in RNA 
splicing are performed by RNA molecules rather than proteins. Specialized RNA 
molecules recognize the nucleotide sequences that specify where splicing is to 
occur and also participate in the chemistry of splicing. These RNA molecules are 
relatively short (less than 200 nucleotides each), and there are five of them (Ul, 
U2, U4, U5, and U6) involved in the major form of pre-mRNA splicing. Known as 
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Figure 6-28 The consensus nucleotide 
sequences in an RNA molecule that 
signal the beginning and the end of most 
introns in humans. Only the three blocks 
of nucleotide sequences shown are 
required to remove an intron sequence; 
the rest of the intron can be occupied by 
any nucleotides. Here A, G, U, and C are 
the standard RNA nucleotides; R stands for 
purines (A or G); and Y stands for 
pyrimidines (C or U).The A highlighted in 
red forms the branch point of the lariat 
produced by splicing. Only the GU at the 
start of the intron and the AG at its end are 
invariant nucleotides in the splicing 
consensus sequences. Several different 
nucleotides can occupy the remaining 
positions (even the branch point A), 
although the indicated nucleotides are 
preferred. The distances along the RNA 
between the three splicing consensus 
sequences are highly variable; however, 
the distance between the branch point 
and 3' splice junction is typically much 
shorter than that between the 5' splice 
junction and the branch point. 
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snRNAs (small nuclear RNAs), each is complexed with at least seven protein 
subunits to form a snRNP (small nuclear ribonucleoprotein). These snRNPs 
form the core of the spliceosome, the large assembly of RNA and protein 
molecules that performs pre-mRNA splicing in the cell. 

The spliceosome is a complex and dynamic machine. When studied in vitro, 
a few components of the spliceosome assemble on pre-mRNA and, as the splic- 
ing reaction proceeds, new components enter as those that have already per- 
formed their tasks are jettisoned (Figure 6-29) . However, many scientists believe 
that, inside the cell, the spliceosome is a preexisting, loose assembly of all the 
components — capturing, splicing and releasing RNA as a coordinated unit, and 
undergoing extensive rearrangements each time a splice is made. During the 
splicing reaction, recognition of the 5' splice junction, the branch-point site, and 
the 3' splice junction is performed largely through base-pairing between the 
snRNAs and the consensus RNA sequences in the pre-mRNA substrate (Figure 
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The U1 snRNP forms base pairs with the 5' 
splice junction (see Figure 6-30A) and the 
BBP (branch-point binding protein) and 
U2AF (U2 auxilliary factor) recognize the 
branch-point site. 



The U2 snRNP displaces BBP and U2AF and 
forms base pairs with the branch-point site 
consensus sequence (see Figure 6-30B). 



The U4/U6»U5 "triple" snRNP enters the 
reaction. In this triple snRNP, the U4 and U6 
snRNAs are held firmly together by base-pair 
interactions. Subsequent rearrangements 
create the active site of the spliceosome and 
position the appropriate portions of the 
pre-mRNA substrate for the first 
phosphoryl-transferase reaction. 



Several more RNA-RNA rearrangements 
occur that break apart the U4/U6 base pairs 
and allow the U6 snRNP to displace U1 at 
the 5' splice junction (see Figure 6-30A) to 
form the active site for the second 
phosphoryl-transferase reaction, which 
completes the splice. 
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Figure 6-29 The pre-mRNA splicing 
mechanism. RNA splicing is catalyzed by 
an assembly of snRNPs (shown as colored 
circles) plus other proteins (most of which 
are not shown), which together 
constitute the spliceosome. The 
spliceosome recognizes the splicing 
signals on a pre-mRNA molecule, brings 
the two ends of the intron together, and 
provides the enzymatic activity for the 
two reaction steps (see Figure 6-26). 
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6-30). In the course of splicing, the spliceosome undergoes several shifts in 
which one set of base-pair interactions is broken and another is formed in its 
place. For example, Ul is replaced by U6 at the 5' splice junction (see Figure 
6-30A). This type of RNA-RNA rearrangement (in which the formation of one 
RNA-RNA interaction requires the disruption of another) occurs several times 
during the splicing reaction. It permits the checking and rechecking of RNA 
sequences before the chemical reaction is allowed to proceed, thereby increas- 
ing the accuracy of splicing. 



The Spliceosome Uses ATP Hydrolysis to Produce a Complex 
Series of RNA-RNA Rearrangements 

Although ATP hydrolysis is not required for the chemistry of RNA splicing per se, 
it is required for the assembly and rearrangements of the spliceosome. Some of 
the additional proteins that make up the spliceosome use the energy of ATP 
hydrolysis to break existing RNA-RNA interactions to allow the formation of new 
ones. In fact, all the steps shown previously in Figure 6-29 — except the associa- 
tion of BBP with the branch-point site and Ul snRNP with the 5' splice site — 
require ATP hydrolysis and additional proteins. Each successful splice requires 
as many as 200 proteins, if we include those that form the snRNPs. 

The ATP- requiring RNA-RNA rearrangements that take place in the spliceo- 
some occur within the snRNPs themselves and between the snRNPs and the pre- 
mRNA substrate. One of the most important functions of these rearrangements 
is the creation of the active catalytic site of the spliceosome. The strategy of cre- 
ating an active site only after the assembly and rearrangement of splicing com- 
ponents on a pre-mRNA substrate is a particularly effective way to prevent way- 
ward splicing. 
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Figure 6-30 Several of the 
rearrangements that take place in the 
spliceosome during pre-mRNA splicing. 

Shown here are the details for the yeast 
Saccharomyces cerevisiae, in which the 
nucleotide sequences involved are 
slightly different from those in human 
cells. (A) The exchange of Ul snRNP for 
U6 snRNP occurs before the first 
phosphoryl-transfer reaction (see Figure 
6-29). This exchange requires the 
5' splice site to be read by two different 
snRNPs, thereby increasing the accuracy 
of 5' splice site selection by the 
spliceosome. (B)The branch-point site is 
first recognized by BBP and subsequently 
by U2 snRNP; as in (A), this "check and 
recheck" strategy provides increased 
accuracy of site selection. The binding of 
U2 to the branch point forces the 
appropriate adenine (in red) to be 
unpaired and thereby activates it for the 
attack on the 5' splice site (see Figure 
6-29). This, in combination with 
recognition by BBP, is the way in which 
the spliceosome accurately chooses the 
adenine that is ultimately to form the 
branch point. (C) After the first 
phosphoryl-transfer reaction (left) has 
occurred, a series of rearrangements 
brings the two exons into close proximity 
for the second phosphoryl-transfer 
reaction (right). The snRNAs both position 
the reactants and provide (either all or in 
part) the catalytic sites for the two 
reactions. The U5 snRNP is present in the 
spliceosome before this rearrangement 
occurs; for clarity it has been omitted 
from the left panel. As discussed in the 
text, all of the RNA-RNA rearrangements 
shown in this figure (as well as others 
that occur in the spliceosome but are not 
shown) require the participation of 
additional proteins and ATP hydrolysis. 
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Perhaps the most surprising feature of the spliceosome is the nature of the 
catalytic site itself: it is largely (if not exclusively) formed by RNA molecules 
instead of proteins. In the last section of this chapter we discuss in general terms 
the structural and chemical properties of RNA that allow it to perform catalysis; 
here we need only consider that the U2 and U6 snRNAs in the spliceosome form 
a precise three-dimensional RNA structure that juxtaposes the 5' splice site of 
the pre-mRNA with the branch-point site and probably performs the first trans- 
esterification reaction (see Figure 6-30C). In a similar way, the 5' and 3' splice 
junctions are brought together (an event requiring the U5 snRNA) to facilitate 
the second transesterification. 

Once the splicing chemistry is completed, the snRNPs remain bound to the 
lariat. The disassembly of these snRNPs from the lariat (and from each other) 
requires another series of RNA-RNA rearrangements that require ATP hydroly- 
sis, thereby returning the snRNAs to their original configuration so that they can 
be used again in a new reaction. At the completion of a splice, the spliceosome 
directs a set of proteins to bind to the mRNA near the position formerly occu- 
pied by the intron. Called the exon junction complex (EJC), these proteins mark 
the site of a successful splicing event and, as we shall see later in this chapter, 
influence the subsequent fate of the mRNA. 



Other Properties of Pre-mRNA and Its Synthesis Help to Explain 
the Choice of Proper Splice Sites 

As we have seen, intron sequences vary enormously in size, with some being in 
excess of 100,000 nucleotides. If splice-site selection were determined solely by 
the snRNPs acting on a preformed, protein-free RNA molecule, we would expect 
splicing mistakes — such as exon skipping and the use of "cryptic" splice sites — 
to be very common (Figure 6-31). The fidelity mechanisms built into the 
spliceosome, however, are supplemented by two additional strategies that 
increase the accuracy of splicing. The first is simply a consequence of the early 
stages of splicing occurring while the pre-mRNA molecules are being synthe- 
sized by RNA polymerase II. As transcription proceeds, the phosphorylated tail 
of RNA polymerase carries several components of the spliceosome (see Figure 
6-23), and these components are transferred directly from the polymerase to the 
RNA as RNA is synthesized. This strategy helps the cell keep track of introns and 
exons: for example, the snRNPs that assemble at a 5' splice site are initially pre- 
sented with only a single 3' splice site since the sites further downstream have 
not yet been synthesized. The coordination of transcription with splicing is 
especially important in preventing inappropriate exon skipping. 

A strategy called "exon definition" is another way cells choose the appropri- 
ate splice sites. Exon size tends to be much more uniform than intron size, aver- 
aging about 150 nucleotide pairs across a wide variety of eucaryotic organisms 
(Figure 6-32). According to the exon definition idea, the splicing machinery ini- 
tially seeks out the relatively homogenously sized exon sequences. As RNA syn- 
thesis proceeds, a group of additional components (most notably SR proteins, so- 
named because they contain a domain rich in serines and arginines) assemble on 
exon sequences and help to mark off each 3' and 5' splice site starting at the 5' 
end of the RNA (Figure 6-33). These proteins, in turn, recruit Ul snRNA, which 
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Figure 6-31 Two types of splicing errors. 

(A) Exon skipping. (B) Cryptic splice-site 
selection. Cryptic splicing signals are 
nucleotide sequences of RNA that closely 
resemble true splicing signals. 
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marks the downstream exon boundary, and U2AF, which specifies the upstream 
one. By specifically marking the exons in this way and thereby taking advantage 
of the relatively uniform size of exons, the cell increases the accuracy with which 
it deposits the initial splicing components on the nascent RNA and thereby helps 
to avoid cryptic splice sites. How the SR proteins discriminate exon sequences 
from intron sequences is not understood in detail; however, it is known that some 
of the SR proteins bind preferentially to specific RNA sequences in exons, termed 
splicing enhancers. In principle, since any one of several different codons can be 
used to code for a given amino acid, there is freedom to adjust the exon 
nucleotide sequence so as to form a binding site for an SR protein, without nec- 
essarily affecting the amino acid sequence that the exon specifies. 

Both the marking of exon and intron boundaries and the assembly of the 
spliceosome begin on an RNA molecule while it is still being elongated by RNA 
polymerase at its 3' end. However, the actual chemistry of splicing can take place 
much later. This delay means that intron sequences are not necessarily removed 
from a pre-mRNA molecule in the order in which they occur along the RNA 
chain. It also means that, although spliceosome assembly is co -transcriptional, 
the splicing reactions sometimes occur posttranscriptionally — that is, after a 
complete pre-mRNA molecule has been made. 
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Figure 6-32 Variation in intron and exon 
lengths in the human, worm, and fly 
genomes. (A) Size distribution of exons. 
(B) Size distribution of introns. Note that 
exon length is much more uniform than 
intron length. (Adapted from 
International Human Genome 
Sequencing Consortium, Nature 
409:860-921, 2001. With permission from 
Macmillan Publishers Ltd.) 



A Second Set of snRNPs Splice a Small Fraction of Intron 
Sequences in Animals and Plants 

Simple eucaryotes such as yeasts have only one set of snRNPs that perform all 
pre-mRNA splicing. However, more complex eucaryotes such as flies, mammals, 
and plants have a second set of snRNPs that direct the splicing of a small frac- 
tion of their intron sequences. This minor form of spliceosome recognizes a dif- 
ferent set of RNA sequences at the 5' and 3' splice junctions and at the branch 
point; it is called the U12-type spliceosome because of the involvement of the 
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Figure 6-33 The exon definition idea. 

According to one proposal, SR proteins 
bind to each exon sequence in the pre- 
mRNA and thereby help to guide the 
snRNPs to the proper intron/exon 
boundaries. This demarcation of exons by 
the SR proteins occurs co-transcriptionally, 
beginning at the CBC (cap-binding 
complex) at the 5' end. As indicated, the 
intron sequences in the pre-mRNA, which 
can be extremely long, are packaged into 
hnRNP (heterogeneous nuclear 
ribonucleoprotein) complexes that 
compact them into more manageable 
structures and perhaps mask cryptic splice 
sites. It has been proposed that hnRNP 
proteins may preferentially associate with 
intron sequences and that this preference 
may also help the spliceosome distinguish 
introns from exons. However, as shown, at 
least some hnRNP proteins also bind to 
exon sequences. (Adapted from R. Reed, 
Curr. Opin. Cell Biol. 1 2:340-345, 2000. With 
permission from Elsevier.) 
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U12 SnRNP (Figure 6-34A). Despite recognizing different nucleotide sequences, 
the snRNPs in this spliceosome make the same types of RNA-RNA interactions 
with the pre-mRNA and with each other as do the major snRNPs (Figure 6-34B). 
Although, as we have seen, components of the major spliceosomes travel with 
RNA polymerase II as it transcribes genes, this may not be the case for the U12 
spliceosome. It is possible that U12-mediated splicing is thereby delayed, and 
this presents the cell with a way to co -regulate splicing of the several hundred 
genes whose expression requires this spliceosome. A number of mammalian 
mRNAs contain a mixture of introns, some removed by the major spliceosome 
and others by the minor spliceosome, and it has been proposed that this arrange- 
ment permits particularly complex patterns of alternative splicing to occur. 

A few eucaryotic organisms exhibit a particular variation on splicing, called 
trans-splicing. These organisms include the single-celled trypanosomes — pro- 
tozoans that cause African sleeping sickness in humans — and the model multi- 
cellular organism, the nematode worm. In trans -splicing, exons from two sep- 
arate RNA transcripts are spliced together to form a mature mRNA molecule 
(see Figure 6-34A). Trypanosomes produce all of their mRNAs in this way, 
whereas trans-splicing accounts for only about 1% of nematode mRNAs. In 
both cases, a single exon is spliced onto the 5' end of many different RNA tran- 
scripts produced by the cell; in this way, all of the products of trans- splicing 
have the same 5' exon and different 3' exons. Many of the same snRNPs that 
function in conventional splicing are used in this reaction, although trans- 
splicing uses a unique snRNP (called the SL RNP) that brings in the common 
exon (see Figure 6-34). 



Figure 6-34 Outline of the mechanisms 
used for three types of RNA splicing. 

(A) Three types of spliceosomes. The 
major spliceosome (left), the U12-type 
spliceosome {middle), and the 
trans-spliceosome (right) are each shown 
at two stages of assembly. Introns 
removed by the U12-type spliceosome 
have a different set of consensus 
nucleotide sequences from those 
removed by the major spliceosome. In 
humans, it is estimated that 0.1% of 
introns are removed by the U1 2-type 
spliceosome. In trans-splicing, which 
does not occur in humans, the SL snRNP 
is consumed in the reaction because a 
portion of the SL snRNA becomes the 
first exon of the mature mRNA. (B) The 
major U6 snRNP and the U6 snRNP 
specific to the U1 2-type spliceosome 
both recognize the 5' splice junction, but 
they do so through a different set of 
base-pair interactions. The sequences 
shown are from humans. (Adapted from 
Y.T. Yu et al., The RNA World, pp. 487-524. 
Cold Spring Harbor, New York: Cold 
Spring Harbor Laboratory Press, 1999.) 
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Figure 6-35 Abnormal processing of the p-globin primary RNA transcript 
in humans with the disease p thalassemia. In the examples shown, the 
disease is caused by splice-site mutations {black arrowheads) found in the 
genomes of affected patients. The dark blue boxes represent the three 
normal exon sequences; the red lines indicate the 5' and 3' splice sites. The 
light blue boxes depict new nucleotide sequences included in the final 
mRNA molecule as a result of the mutation. Note that when a mutation 
leaves a normal splice site without a partner, an exon is skipped or one or 
more abnormal cryptic splice sites nearby is used as the partner site, as in 
(C) and (D). (Adapted in part from S.H. Orkin, in The Molecular Basis of 
Blood Diseases [G. Stamatoyannopoulos et al., eds.], pp. 106-126. 
Philadelphia: Saunders, 1987.) 
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We do not know why even a few organisms use trans- splicing; however, it is 
thought that the common 5' exon may aid in the translation of the mRNA. Thus, 
the mRNAs produced by trans -splicing in nematodes seem to be translated with 
especially high efficiency 
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RNA Splicing Shows Remarkable Plasticity 

We have seen that the choice of splice sites depends on such features of the pre- 
mRNA transcript as the affinity of the three signals on the RNA (the 5' and 3' 
splice junctions and the branch point) for the splicing machinery, the co-tran- 
scriptional assembly of the spliceosome, and the "bookkeeping" that underlies 
exon definition. We do not know how accurate splicing normally is because, as 
we see later, there are several quality control systems that rapidly destroy 
mRNAs whose splicing goes awry However, we do know that, compared with 
other steps in gene expression, splicing is unusually flexible. For example, a 
mutation in a nucleotide sequence critical for splicing of a particular intron does 
not necessarily prevent splicing of that intron altogether. Instead, the mutation 
typically creates a new pattern of splicing (Figure 6-35). Most commonly, an 
exon is simply skipped (Figure 6-35B). In other cases, the mutation causes a 
cryptic splice junction to be efficiently used (Figure 6-35C). Apparently, the 
splicing machinery has evolved to pick out the best possible pattern of splice 
junctions, and if the optimal one is damaged by mutation, it will seek out the 
next best pattern, and so on. This flexibility in the process of RNA splicing sug- 
gests that changes in splicing patterns caused by random mutations have been 
an important pathway in the evolution of genes and organisms. 

The plasticity of RNA splicing also means that the cell can regulate the pat- 
tern of RNA splicing. Earlier in this section we saw that alternative splicing can 
give rise to different proteins from the same gene. Some examples of alternative 
splicing are constitutive; that is, the alternatively spliced mRNAs are produced 
continuously by cells of an organism. However, in many cases, the cell regulates 
the splicing patterns so that different forms of the protein are produced at dif- 
ferent times and in different tissues (see Figure 6-27). In Chapter 7 we return to 
this issue to discuss some specific examples of regulated RNA splicing. 
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Spliceosome-Catalyzed RNA Splicing Probably Evolved from 
Self-splicing Mechanisms 

When the spliceosome was first discovered, it puzzled molecular biologists. Why 
do RNA molecules instead of proteins perform important roles in splice site 
recognition and in the chemistry of splicing? Why is a lariat intermediate used 
rather than the apparently simpler alternative of bringing the 5' and 3' splice 
sites together in a single step, followed by their direct cleavage and rejoining? 
The answers to these questions reflect the way in which the spliceosome is 
believed to have evolved. 
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As discussed briefly in Chapter 1 (and in more detail in the final section of 
this chapter), it is likely that early cells used RNA molecules rather than proteins 
as their major catalysts and that they stored their genetic information in RNA 
rather than in DNA sequences. RNA- catalyzed splicing reactions presumably 
had important roles in these early cells. As evidence, some self- splicing RNA 
introns (that is, intron sequences in RNA whose splicing out can occur in the 
absence of proteins or any other RNA molecules) remain today — for example, in 
the nuclear rRNA genes of the ciliate Tetrahymena, in a few bacteriophage T4 
genes, and in some mitochondrial and chloroplast genes. 

A self-splicing intron sequence can be identified in a test tube by incubating 
a pure RNA molecule that contains the intron sequence and observing the splic- 
ing reaction. Two major classes of self-splicing intron sequences can be distin- 
guished in this way. Group I intron sequences begin the splicing reaction by bind- 
ing a G nucleotide to the intron sequence; this G is thereby activated to form the 
attacking group that will break the first of the phosphodiester bonds cleaved 
during splicing (the bond at the 5' splice site). In group II intron sequences, an 
especially reactive A residue in the intron sequence is the attacking group, and a 
lariat intermediate is generated. Otherwise the reaction pathways for the two 
types of self- splicing intron sequences are the same. Both are presumed to rep- 
resent vestiges of very ancient mechanisms (Figure 6-36). 

For both types of self-splicing reactions, the nucleotide sequence of the 
intron is critical; the intron RNA folds into a specific three-dimensional struc- 
ture, which brings the 5' and 3' splice junctions together and provides precisely 
positioned reactive groups to perform the chemistry (see Figure 6-6C). Because 
the chemistries of their splicing reactions are so similar, it has been proposed 
that the pre-mRNA splicing mechanism of the spliceosome evolved from group 
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Figure 6-36 The two known classes of 
self-splicing intron sequences. The 

figure emphasizes the similarities 
between the two mechanisms. Both are 
normally aided by proteins in the cell that 
speed up the reaction, but the catalysis is 
nevertheless mediated by the RNA in the 
intron sequence. The group I intron 
sequences bind a free G nucleotide to a 
specific site on the RNA to initiate 
splicing, while the group II intron 
sequences use an especially reactive A 
nucleotide in the intron sequence itself 
for the same purpose. Both types of self- 
splicing reactions require the intron to be 
folded into a highly specific three- 
dimensional structure that provides the 
catalytic activity for the reaction (see 
Figure 6-6). The mechanism used by 
group II intron sequences releases the 
intron as a lariat structure and closely 
resembles the pathway of pre-mRNA 
splicing catalyzed by the spliceosome 
(compare with Figure 6-29). The 
spliceosome performs most RNA splicing 
in eucaryotic cells, and self-splicing RNAs 
represent unusual cases. (Adapted from 
T.R. Cech, Cell 44:207-2 1 0, 1 986. With 
permission from Elsevier.) 
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Figure 6-37 Consensus nucleotide 
sequences that direct cleavage and 
polyadenylation to form the 3' end of a 
eucaryotic mRNA. These sequences are 
encoded in the genome; specific proteins 
recognize them after they are transcribed 
into RNA.The hexamer AAUAAA is bound 
by CPSF, the GU-rich element beyond the 
cleavage site is bound by CstF (see Figure 
6-38), and the CA sequence is bound by 
a third factor required for the cleavage 
step. Like other consensus nucleotide 
sequences discussed in this chapter (see 
Figure 6-1 2), the sequences shown in the 
figure represent a variety of individual 
cleavage and polyadenylation signals. 



II self- splicing. According to this idea, when the spliceosomal snRNPs took over 
the structural and chemical roles of the group II introns, the strict sequence con- 
straints on intron sequences would have disappeared, thereby permitting a vast 
expansion in the number of different RNAs that could be spliced. 



RNA-Processing Enzymes Generate the 3' End of Eucaryotic 
mRNAs 

As previously explained, the 5' end of the pre-mRNA produced by RNA poly- 
merase II is capped almost as soon as it emerges from the RNA polymerase. 
Then, as the polymerase continues its movement along a gene, the spliceosome 
assembles on the RNA and delineates the intron and exon boundaries. The long 
C-terminal tail of the RNA polymerase coordinates these processes by transfer- 
ring capping and splicing components directly to the RNA as it emerges from the 
enzyme. We see in this section that, as RNA polymerase II reaches the end of a 
gene, a similar mechanism ensures that the 3' end of the pre-mRNA is appropri- 
ately processed. 

As might be expected, the position of the 3' end of each mRNA molecule is 
ultimately specified by a signal encoded in the genome (Figure 6-37). These sig- 
nals are transcribed into RNA as the RNA polymerase II moves through them, 
and they are then recognized (as RNA) by a series of RNA-binding proteins and 
RNA-processing enzymes (Figure 6-38). Two multisubunit proteins, called CstF 
(cleavage stimulation factor) and CPSF (cleavage and polyadenylation speci- 
ficity factor), are of special importance. Both of these proteins travel with the 
RNA polymerase tail and are transferred to the 3'-end processing sequence on 
an RNA molecule as it emerges from the RNA polymerase. 

Once CstF and CPSF bind to specific nucleotide sequences on the emerging 
RNA molecule, additional proteins assemble with them to create the 3' end of 
the mRNA. First, the RNA is cleaved (see Figure 6-38). Next an enzyme called 
poly-A polymerase (PAP) adds, one at a time, approximately 200 A nucleotides to 
the 3' end produced by the cleavage. The nucleotide precursor for these addi- 
tions is ATP, and the same type of 5'-to-3' bonds are formed as in conventional 
RNA synthesis (see Figure 6-4). Unlike the usual RNA polymerases, poly-A poly- 
merase does not require a template; hence the poly-A tail of eucaryotic mRNAs 



Figure 6-38 Some of the major steps in generating the 3' end of a 
eucaryotic mRNA. This process is much more complicated than the 
analogous process in bacteria, where the RNA polymerase simply stops at a 
termination signal and releases both the 3' end of its transcript and the 
DNA template (see Figure 6-11). 
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is not directly encoded in the genome. As the poly-A tail is synthesized, proteins 
called poly-A-binding proteins assemble onto it and, by a poorly understood 
mechanism, determine the final length of the tail. Some poly-A-binding proteins 
remain bound to the poly-A tail as the mRNA travels from the nucleus to the 
cytosol and they help to direct the synthesis of a protein on the ribosome, as we 
see later in this chapter. 

After the 3' end of a eucaryotic pre-mRNA molecule has been cleaved, the 
RNA polymerase II continues to transcribe, in some cases for hundreds of 
nucleotides. But the polymerase soon releases its grip on the template and tran- 
scription terminates. After 3'- end cleavage has occurred, the newly synthesized 
RNA that emerges from the polymerases lacks a 5' cap; this unprotected RNA is 
rapidly degraded by a 5' -^ 3' exonuclease, which is carried along on the poly- 
merase tail. Apparently, it is this RNA degradation that eventually causes the 
RNA polymerase to dissociate from the DNA. 



Mature Eucaryotic mRNAs Are Selectively Exported from the 
Nucleus 

We have seen how eucaryotic pre-mRNA synthesis and processing take place in 
an orderly fashion within the cell nucleus. However, these events create a special 
problem for eucaryotic cells, especially those of complex organisms where the 
introns are vastly longer than the exons. Of the pre-mRNA that is synthesized, 
only a small fraction — the mature mRNA — is of further use to the cell. The rest — 
excised introns, broken RNAs, and aberrantly processed pre-mRNAs — is not only 
useless but potentially dangerous. How, then, does the cell distinguish between 
the relatively rare mature mRNA molecules it wishes to keep and the overwhelm- 
ing amount of debris from RNA processing? 

The answer is that, as an RNA molecule is processed, it loses certain proteins 
and acquires others, thereby signifying the successful completion of each of the 
different steps. For example, we have seen that acquisition of the cap-binding 
complexes, the exon junction complexes, and the poly-A-binding proteins mark 
the completion of capping, splicing, and poly-A addition, respectively. A properly 
completed mRNA molecule is also distinguished by the proteins it lacks. For 
example, the presence of a snRNP would signify incomplete or aberrant splicing. 
Only when the proteins present on an mRNA molecule collectively signify that 
processing was successfully completed is the mRNA exported from the nucleus 
into the cytosol, where it can be translated into protein. Improperly processed 
mRNAs, and other RNA debris are retained in the nucleus, where they are even- 
tually degraded by the nuclear exosome, a large protein complex whose interior 
is rich in 3'-to-5' RNA exonucleases. Eucaryotic cells thus export only useful RNA 
molecules to the cytoplasm, while debris is disposed of in the nucleus. 

Of all the proteins that assemble on pre-mRNA molecules as they emerge 
from transcribing RNA polymerases, the most abundant are the hnRNPs (het- 
erogeneous nuclear ribonuclear proteins) (see Figure 6-33). Some of these pro- 
teins (there are approximately 30 of them in humans) unwind the hairpin helices 
from the RNA so that splicing and other signals on the RNA can be read more 
easily. Others preferentially package the RNA contained in the very long intron 
sequences typically found in genes of complex organisms. They may therefore 
play an important role in distinguishing mature mRNA from the debris left over 
from RNA processing. 

Successfully processed mRNAs are guided through the nuclear pore com- 
plexes (NPCs) — aqueous channels in the nuclear membrane that directly con- 
nect the nucleoplasm and cytosol (Figure 6-39). Small molecules (less than 
50,000 daltons) can diffuse freely through these channels. However, most of the 
macromolecules in cells, including mRNAs complexed with proteins, are far too 
large to pass through the channels without a special process. The cell uses 
energy to actively transport such macromolecules in both directions through 
the nuclear pore complexes. 

As explained in detail in Chapter 12, macromolecules are moved through 
nuclear pore complexes by nuclear transport receptors, which, depending on the 
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Figure 6-39 Transport of a large mRNA molecule through the nuclear pore complex. (A) The maturation of an mRNA 
molecule as it is synthesized by RNA polymerase and packaged by a variety of nuclear proteins. This drawing of an 
unusually abundant RNA, called the Balbiani Ring mRNA, is based on EM micrographs such as that shown in (B). Balbiani 
Rings are found in the cells of certain insects. (A, adapted from B. Daneholt, Cell 88:585-588, 1 997. With permission from 
Elsevier; B, from BJ. Stevens and H. Swift, J. Cell Biol. 31:55-77, 1966. With permission from The Rockefeller University Press.) 



identity of the macromolecule, escort it from the nucleus to the cytoplasm or 
vice versa. For mRNA export to occur, a specific nuclear transport receptor must 
be loaded onto the mRNA, a step that, at least in some organisms, takes place in 
concert with 3' cleavage and polyadenylation. Once it helps to move an RNA 
molecule through the nuclear pore complex, the transport receptor dissociates 
from the mRNA, re-enters the nucleus, and exports a new mRNA molecule (Fig- 
ure 6-40). 

The export of mRNA-protein complexes from the nucleus can be observed 
with the electron microscope for the unusually abundant mRNA of the insect 
Balbiani Ring genes. As these genes are transcribed, the newly formed RNA is 
seen to be packaged by proteins, including hnRNPs, SR proteins, and compo- 
nents of the spliceosome. This protein-RNA complex undergoes a series of 
structural transitions, probably reflecting RNA processing events, culminating 
in a curved fiber (see Figure 6-39). This curved fiber moves through the nucleo- 
plasm and enters the nuclear pore complex (with its 5' cap proceeding first), and 
it then undergoes another series of structural transitions as it moves through the 
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Figure 6-40 Schematic illustration of an "export-ready 7 ' mRNA molecule and its transport through the nuclear pore. As 

indicated, some proteins travel with the mRNA as it moves through the pore, whereas others remain in the nucleus. The 
nuclear export receptor for mRNAs is a complex of proteins that is deposited when the mRNA has been correctly spliced 
and polyadenylated. When the mRNA is exported to the cytosol, the export receptor dissociates from the mRNA and is 
re-imported into the nucleus, where it can be used again. Just after it leaves the nucleus, and before it loses the cap- 
binding complex (CBC) the mRNA is subjected to a final check, called nonsense-mediated decay, which is described later in 
the chapter. Once it passes this test the mRNA continues to shed previously bound proteins and acquire new ones before it 
is efficiently translated into protein. EJC, exon junction complex. 
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pore. These and other observations reveal that the pre-mRNA-protein and 
mRNA-protein complexes are dynamic structures that gain and lose numerous 
specific proteins during RNA synthesis, processing, and export (see Figure 6-40). 

As we have seen, some of these proteins mark the different stages of mRNA 
maturation; other proteins deposited on the mRNA while it is still in the nucleus 
can affect the fate of the RNA after it is transported to the cytosol. Thus, the sta- 
bility of an mRNA in the cytosol, the efficiency with which it is translated into 
protein, and its ultimate destination in the cytosol can all be determined by pro- 
teins acquired in the nucleus that remain bound to the RNA after it leaves the 
nucleus. We will discuss these issues in Chapter 7 when we turn to the post-tran- 
scriptional control of gene expression. 

We have seen that RNA synthesis and processing are closely coupled in the 
cell, and it might be expected that export from the nucleus is somehow inte- 
grated with these two processes. Although the Balbiani Ring RNAs can be seen 
to move through the nucleoplasm and out through the nuclear pores, other 
mRNAs appear to be synthesized and processed in close proximity to nuclear 
pore complexes. In these cases, which may represent the majority of eucaryotic 
genes, mRNA synthesis, processing, and transport all appear to be tightly cou- 
pled; the mRNA can thus be viewed as emerging from the nuclear pore as a 
newly manufactured car might emerge from an assembly line. Later in this 
chapter, we will see that the cell performs an additional quality- control check on 
each mRNA before it is allowed to be efficiently translated into protein. 

Before discussing what happens to mRNAs after they leave the nucleus, we 
briefly consider how the synthesis and processing of noncoding RNA molecules 
occurs. Although there are many other examples, our discussion focuses on the 
rRNAs that are critically important for the translation of mRNAs into protein. 



Many Noncoding RNAs Are Also Synthesized and Processed in the 
Nucleus 

A few percent of the dry weight of a mammalian cell is RNA; of that, only about 
3-5% is mRNA. A fraction of the remainder represents intron sequences before 
they have been degraded, but the bulk of the RNA in cells performs structural 
and catalytic functions (see Table 6-1, p. 336). The most abundant RNAs in cells 
are the ribosomal RNAs (rRNAs), constituting approximately 80% of the RNA in 
rapidly dividing cells. As discussed later in this chapter, these RNAs form the 
core of the ribosome. Unlike bacteria — in which a single RNA polymerase syn- 
thesizes all RNAs in the cell — eucaryotes have a separate, specialized poly- 
merase, RNA polymerase I, that is dedicated to producing rRNAs. RNA poly- 
merase I is similar structurally to the RNA polymerase II discussed previously; 
however, the absence of a C-terminal tail in polymerase I helps to explain why 
its transcripts are neither capped nor polyadenylated. As mentioned earlier, this 
difference helps the cell distinguish between noncoding RNAs and mRNAs. 

Because multiple rounds of translation of each mRNA molecule can provide 
an enormous amplification in the production of protein molecules, many of the 
proteins that are very abundant in a cell can be synthesized from genes that are 
present in a single copy per haploid genome. In contrast, the RNA components 
of the ribosome are final gene products, and a growing mammalian cell must 
synthesize approximately 10 million copies of each type of ribosomal RNA in 
each cell generation to construct its 10 million ribosomes. The cell can produce 
adequate quantities of ribosomal RNAs only because it contains multiple copies 
of the rRNA genes that code for ribosomal RNAs (rRNAs). Even E. coli needs 
seven copies of its rRNA genes to meet the cell's need for ribosomes. Human 
cells contain about 200 rRNA gene copies per haploid genome, spread out in 
small clusters on five different chromosomes (see Figure 4-11), while cells of the 
frog Xenopus contain about 600 rRNA gene copies per haploid genome in a sin- 
gle cluster on one chromosome (Figure 6-41). 

There are four types of eucaryotic rRNAs, each present in one copy per ribo- 
some. Three of the four rRNAs (18S, 5.8S, and 28S) are made by chemically mod- 
ifying and cleaving a single large precursor rRNA (Figure 6-42); the fourth (5S 
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RNA) is synthesized from a separate cluster of genes by a different polymerase, 
RNA polymerase III, and does not require chemical modification. 

Extensive chemical modifications occur in the 13,000-nucleotide-long pre- 
cursor rRNA before the rRNAs are cleaved out of it and assembled into ribo- 
somes. These include about 100 methylations of the 2 / -OH positions on 
nucleotide sugars and 100 isomerizations of uridine nucleotides to pseudouri- 
dine (Figure 6-43A). The functions of these modifications are not understood in 
detail, but many probably aid in the folding and assembly of the final rRNAs and 
some may subtly alter the function of ribosomes. Each modification is made at 
a specific position in the precursor rRNA. These positions are specified by about 
150 "guide RNAs," which position themselves through base-pairing to the pre- 
cursor rRNA and thereby bring an RNA- modifying enzyme to the appropriate 
position (Figure 6-43B). Other guide RNAs promote cleavage of the precursor 
rRNAs into the mature rRNAs, probably by causing conformational changes in 
the precursor rRNA that expose these sites to nucleases. All of these guide RNAs 
are members of a large class of RNAs called small nucleolar RNAs (or snoRNAs), 
so named because these RNAs perform their functions in a sub compartment of 
the nucleus called the nucleolus. Many snoRNAs are encoded in the introns of 
other genes, especially those encoding ribosomal proteins. They are therefore 
synthesized by RNA polymerase II and processed from excised intron 
sequences. 

Recently several snoRNA-like RNAs have been identified that are synthe- 
sized only in cells of the brain. These are believed to direct the modification of 
mRNAs, instead of rRNAs, and are likely to represent a new, but poorly under- 
stood, type of gene regulatory mechanism. 



Figure 6-41 Transcription from tandemly 
arranged rRNA genes, as seen in the 
electron microscope. The pattern of 
alternating transcribed gene and 
nontranscribed spacer is readily seen. 
A higher-magnification view of rRNA 
genes is shown in Figure 6-9. (From 
V.E. Foe, Cold Spring Harbor Symp. Quant. 
Biol. 42:723-740, 1978. With permission 
from Cold Spring Harbor Laboratory Press.) 
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Figure 6-42 The chemical modification 
and nucleolytic processing of a 
eucaryotic 45S precursor rRNA molecule 
into three separate ribosomal RNAs. 

Two types of chemical modifications 
(color-coded as indicated in Figure 6-43) 
are made to the precursor rRNA before it 
is cleaved. Nearly half of the nucleotide 
sequences in this precursor rRNA are 
discarded and degraded in the nucleus. 
The rRNAs are named according to their 
"S" values, which refer to their rate of 
sedimentation in an ultracentrifuge.The 
larger the S value, the larger the rRNA. 
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The Nucleolus Is a Ribosome-Producing Factory 

The nucleolus is the most obvious structure seen in the nucleus of a eucaryotic 
cell when viewed in the light microscope. Consequently, it was so closely scruti- 
nized by early cytologists that an 1898 review could list some 700 references. We 
now know that the nucleolus is the site for the processing of rRNAs and their 
assembly into ribosome subunits. Unlike many of the major organelles in the 
cell, the nucleolus is not bound by a membrane (Figure 6-44); instead, it is a 
large aggregate of macromolecules, including the rRNA genes themselves, pre- 
cursor rRNAs, mature rRNAs, rRNA-processing enzymes, snoRNPs, ribosomal 
proteins and partly assembled ribosomes. The close association of all these 
components presumably allows the assembly of ribosomes to occur rapidly and 
smoothly. 
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Figure 6-43 Modifications of the 
precursor rRNA by guide RNAs. (A) Two 

prominent covalent modifications occur 
after rRNA synthesis; the differences from 
the initially incorporated nucleotide are 
indicated by red atoms. Pseudouridine is 
an isomer of uridine; the base has been 
"rotated" relative to the sugar. (B) As 
indicated, snoRNAs determine the sites of 
modification by base-pairing to 
complementary sequences on the 
precursor rRNA. The snoRNAs are bound 
to proteins, and the complexes are called 
snoRNPs. snoRNPs contain both the 
guide sequences and the enzymes that 
modify the rRNA. 



Figure 6-44 Electron micrograph of a 
thin section of a nucleolus in a human 
fibroblast showing its three distinct 
zones. (A) View of entire nucleus. 
(B) High-power view of the nucleolus. It is 
believed that transcription of the rRNA 
genes takes place between the fibrillar 
center and the dense fibrillar component 
and that processing of the rRNAs and 
their assembly into the two subunits of 
the ribosome proceeds outward from the 
dense fibrillar component to the 
surrounding granular components. 
(Courtesy of E.G. Jordan and 
J. McGovern.) 
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Figure 6-45 Changes in the appearance of the nucleolus in a human cell 
during the cell cycle. Only the cell nucleus is represented in this diagram. 
In most eucaryotic cells the nuclear envelope breaks down during mitosis, 
as indicated by the dashed circles. 



Various types of RNA molecules play a central part in the chemistry and 
structure of the nucleolus, suggesting that it may have evolved from an ancient 
structure present in cells dominated by RNA catalysis. In present-day cells, the 
rRNA genes also have an important role in forming the nucleolus. In a diploid 
human cell, the rRNA genes are distributed into 10 clusters, located near the tips 
of five different chromosome pairs (see Figure 4-11). During interphase these 10 
chromosomes contribute DNA loops (containing the rRNA genes) to the nucle- 
olus; in M-phase, when the chromosomes condense, the nucleolus disappears. 
Finally, in the telophase part of mitosis, as chromosomes return to their semi- 
dispersed state, the tips of the 10 chromosomes coalesce and the nucleolus 
reforms (Figure 6-45 and Figure 6-46) . The transcription of the rRNA genes by 
RNA polymerase I is necessary for this process. As might be expected, the size of 
the nucleolus reflects the number of ribosomes that the cell is producing. Its size 
therefore varies greatly in different cells and can change in a single cell, occupy- 
ing 25% of the total nuclear volume in cells that are making unusually large 
amounts of protein. 

Ribosome assembly is a complex process, the most important features of 
which are outlined in Figure 6-47. In addition to its important role in ribosome 
biogenesis, the nucleolus is also the site where other RNAs are produced and 
other RNA-protein complexes are assembled. For example, the U6 snRNP, which 
functions in pre-mRNA splicing (see Figure 6-29), is composed of one RNA 
molecule and at least seven proteins. The U6 snRNA is chemically modified by 
snoRNAs in the nucleolus before its final assembly there into the U6 snRNP. 
Other important RNA-protein complexes, including telomerase (encountered 
in Chapter 5) and the signal recognition particle (which we discuss in Chapter 
12), are also believed to be assembled at the nucleolus. Finally, the tRNAs (trans- 
fer RNAs) that carry the amino acids for protein synthesis are processed there as 
well; like the rRNA genes, those encoding tRNAs are clustered in the nucleolus. 
Thus, the nucleolus can be thought of as a large factory at which many different 
noncoding RNAs are transcribed, processed, and assembled with proteins to 
form a large variety of ribonucleoprotein complexes. 



The Nucleus Contains a Variety of Subnuclear Structures 

Although the nucleolus is the most prominent structure in the nucleus, several 
other nuclear bodies have been observed and studied (Figure 6-48). These 
include Cajal bodies (named for the scientist who first described them in 1906), 
GEMS (Gemini of Cajal bodies), and interchromatin granule clusters (also called 



nuclear ^ 
envelope 

nucleolus 



> 



preparation 
for mitosis 



• 



nucleolar 
dissociation 



:x 



•■• 



prophase 

^ metaphase 
anaphase 

telophase 



nucleolar 


• :• 




assoc 


ation 


•• 








•: 

Gi 


preparation 
for DNA 






• 

• 


replication 


1 












■ 



DNA 
replication 




10|um 



Figure 6-46 Nucleolar fusion. These light 
micrographs of human fibroblasts grown 
in culture show various stages of 
nucleolar fusion. After mitosis, each of 
the 1 human chromosomes that carry a 
cluster of rRNA genes begins to form a 
tiny nucleolus, but these rapidly coalesce 
as they grow to form the single large 
nucleolus typical of many interphase 
cells. (Courtesy of E.G. Jordan and 
J. McGovern.) 
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"speckles"). Like the nucleolus, these other nuclear structures lack membranes 
and are highly dynamic; their appearance is probably the result of the tight 
association of protein and RNA components involved in the synthesis, assem- 
bly, and storage of macromolecules involved in gene expression. Cajal bodies 
and GEMS resemble one another and are frequently paired in the nucleus; it is 
not clear whether they truly represent distinct structures. These are likely to be 
the locations in which snoRNAs and snRNAs undergo covalent modifications 
and final assembly with proteins. A group of guide RNAs, termed small Cajal 
RNAs (scaRNAs), selects the sites of these modifications through base pairing. 
Cajal bodies/ GEMS may also be sites where the snRNPs are recycled and their 
RNAs are "reset" after the rearrangements that occur during splicing (see p. 352). 
In contrast, the interchromatin granule clusters have been proposed to be stock- 
piles of fully mature snRNPs and other RNA processing components that are 
ready to be used in the production of mRNA (Figure 6-49). 

Scientists have had difficulties in working out the function of these small 
subnuclear structures, in part because their appearances differ between organ- 
isms and can change dramatically as cells traverse the cell cycle or respond to 
changes in their environment. Much of the progress now being made depends 
on genetic tools — examination of the effects of designed mutations in model 




Figure 6-47 The function of the 
nucleolus in ribosome and other 
ribonucleoprotein synthesis. The 45S 
precursor rRNA is packaged in a large 
ribonucleoprotein particle containing 
many ribosomal proteins imported from 
the cytoplasm. While this particle remains 
at the nucleolus, selected pieces are 
added and others discarded as it is 
processed into immature large and small 
ribosomal subunits.The two ribosomal 
subunits are thought to attain their final 
functional form only as each is 
individually transported through the 
nuclear pores into the cytoplasm. Other 
ribonucleoprotein complexes, including 
telomerase shown here, are also 
assembled in the nucleolus. 
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Figure 6-48 Visualization of some prominent nuclear bodies. (A)-(D) Micrographs of the same human cell nucleus, each 
processed to show a particular set of nuclear structures. (E) All four images enlarged and superimposed. (A) shows the 
location of the protein fibrillarin (a component of several snoRNPs), which is present at both nucleoli and Cajal bodies, the 
latter indicated by arrows. (B) shows interchromatin granule clusters or "speckles" detected by using antibodies against a 
protein involved in pre-mRNA splicing. (C) is stained to show bulk chromatin. (D) shows the location of the protein coilin, 
which is present at Cajal bodies (arrows; see also Figure 4-67). (From J.R. Swedlow and A.I. Lamond, Gen. Biol. 2:1-7, 2001. 
With permission from BioMed Central. Micrographs courtesy of Judith Sleeman.) 



organisms or of spontaneous mutations in humans. As one example, GEMS con- 
tain the SMN (survival of motor neurons) protein. Certain mutations of the gene 
encoding this protein are the cause of inherited spinal muscular atrophy, a 
human disease characterized by a wasting away of the muscles. The disease 
seems to be caused by a defect in snRNP production. A more complete loss of 
snRNPs would be expected to be lethal. 
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Figure 6-49 Schematic view of 
subnuclear structures. A typical 
vertebrate nucleus has several Cajal 
bodies, which are proposed to be the 
sites where snRNPs and snoRNPs 
undergo their final modifications. 
Interchromatin granule clusters are 
proposed to be storage sites for fully 
mature snRNPs. A typical vertebrate 
nucleus has 20-50 interchromatin 
granule clusters. 

After their initial synthesis, snRNAs are 
exported from the nucleus to undergo 
5' and 3' end-processing and assemble 
with the seven common snRNP proteins 
(called Sm proteins). These complexes are 
reimported into the nucleus and the 
snRNPs undergo their final modification 
by scaRNAs at Cajal bodies. In addition, 
snoRNAs chemically modify the U6 
snRNP at the nucleolus. The sites of active 
transcription and splicing (approximately 
2000-3000 sites per vertebrate nucleus) 
correspond to the"perichromatin fibers" 
seen under the electron microscope. 
(Adapted from J.D. Lewis and D.Tollervey, 
Science 288:1 385-1 389, 2000. With 
permission from AAAS.) 
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Given the importance of nuclear subdomains in RNA processing, it might 
have been expected that pre-mRNA splicing would occur in a particular location 
in the nucleus, as it requires numerous RNA and protein components. However, 
the assembly of splicing components on pre-mRNA is co-transcriptional; thus, 
splicing must occur at many locations along chromosomes. Although a typical 
mammalian cell may be expressing on the order of 15,000 genes, transcription 
and RNA splicing may be localized to only several thousand sites in the nucleus. 
These sites themselves are highly dynamic and probably result from the associ- 
ation of transcription and splicing components to create small "assembly lines" 
with a high local concentration of these components. Interchromatin granule 
clusters — which contain stockpiles of RNA-processing components — are often 
observed next to sites of transcription, as though poised to replenish supplies. 
Thus, the nucleus seems to be highly organized into subdomains, with snRNPs, 
snoRNPs, and other nuclear components moving between them in an orderly 
fashion according to the needs of the cell (see Figure 6-48; also see Figure 4-69). 



Summary 

Before the synthesis of a particular protein can begin, the corresponding mRNA 
molecule must be produced by transcription. Bacteria contain a single type of RNA 
polymerase (the enzyme that carries out the transcription of DNA into RNA). An mRNA 
molecule is produced when this enzyme initiates transcription at a promoter, synthe- 
sizes the RNA by chain elongation, stops transcription at a terminator, and releases 
both the DNA template and the completed mRNA molecule. In eucaryotic cells, the 
process of transcription is much more complex, and there are three RNA polymerases — 
polymerase I, II, and III — that are related evolutionary to one another and to the bac- 
terial polymerase. 

RNA polymerase II synthesizes eucaryotic mRNA. This enzyme requires a series of 
additional proteins, the general transcription factors, to initiate transcription on a 
purified DNA template, and still more proteins (including chromatin- remodeling 
complexes and histone- modifying enzymes) to initiate transcription on its chromatin 
templates inside the cell. 

During the elongation phase of transcription, the nascent RNA undergoes three 
types of processing events: a special nucleotide is added to its & end (capping), intron 
sequences are removed from the middle of the RNA molecule (splicing), and the 3 end 
of the RNA is generated (cleavage and polyadenylation). Each of these processes is ini- 
tiated by proteins that travel along with RNA polymerase II by binding to sites on its 
long, extended C- terminal tail. Splicing is unusual in that many of its key steps are car- 
ried out by specialized RNA molecules rather than proteins. Properly processed mRNAs 
are passed through nuclear pore complexes into the cytosol, where they are translated 
into protein. 

For some genes, RNA is the final product. In eucaryotes, these genes are usually 
transcribed by either RNA polymerase I or RNA polymerase III. RNA polymerase I 
makes the ribosomal RNAs. After their synthesis as a large precursor, the rRNAs are 
chemically modified, cleaved, and assembled into the two ribosomal subunits in the 
nucleolus — a distinct subnuclear structure that also helps to process some smaller 
RNA-protein complexes in the cell. Additional subnuclear structures (including Cajal 
bodies and interchromatin granule clusters) are sites where components involved in 
RNA processing are assembled, stored, and recycled. 



FROM RNA TO PROTEIN 

In the preceding section we have seen that the final product of some genes is an 
RNA molecule itself, such as those present in the snRNPs and in ribosomes. 
However, most genes in a cell produce mRNA molecules that serve as interme- 
diaries on the pathway to proteins. In this section we examine how the cell con- 
verts the information carried in an mRNA molecule into a protein molecule. This 
feat of translation was a focus of attention of biologists in the late 1950s, when it 
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was posed as the "coding problem": how is the information in a linear sequence 
of nucleotides in RNA translated into the linear sequence of a chemically quite 
different set of units — the amino acids in proteins? This fascinating question 
stimulated great excitement among scientists at the time. Here was a cryp- 
togram set up by nature that, after more than 3 billion years of evolution, could 
finally be solved by one of the products of evolution — human beings. And 
indeed, not only has the code been cracked step by step, but in the year 2000 the 
structure of the elaborate machinery by which cells read this code — the ribo- 
some — was finally revealed in atomic detail. 



An mRNA Sequence Is Decoded in Sets of Three Nucleotides 

Once an mRNA has been produced by transcription and processing, the infor- 
mation present in its nucleotide sequence is used to synthesize a protein. Tran- 
scription is simple to understand as a means of information transfer: since DNA 
and RNA are chemically and structurally similar, the DNA can act as a direct 
template for the synthesis of RNA by complementary base-pairing. As the term 
transcription signifies, it is as if a message written out by hand is being con- 
verted, say, into a typewritten text. The language itself and the form of the mes- 
sage do not change, and the symbols used are closely related. 

In contrast, the conversion of the information in RNA into protein represents 
a translation of the information into another language that uses quite different 
symbols. Moreover, since there are only 4 different nucleotides in mRNA and 20 
different types of amino acids in a protein, this translation cannot be accounted 
for by a direct one-to-one correspondence between a nucleotide in RNA and an 
amino acid in protein. The nucleotide sequence of a gene, through the interme- 
diary of mRNA, is translated into the amino acid sequence of a protein by rules 
that are known as the genetic code. This code was deciphered in the early 1960s. 

The sequence of nucleotides in the mRNA molecule is read in consecutive 
groups of three. RNA is a linear polymer of four different nucleotides, so there 
are 4 x 4 x 4 = 64 possible combinations of three nucleotides: the triplets AAA, 
AUA, AUG, and so on. However, only 20 different amino acids are commonly 
found in proteins. Either some nucleotide triplets are never used, or the code is 
redundant and some amino acids are specified by more than one triplet. The 
second possibility is, in fact, the correct one, as shown by the completely deci- 
phered genetic code in Figure 6-50. Each group of three consecutive 
nucleotides in RNA is called a codon, and each codon specifies either one amino 
acid or a stop to the translation process. 

This genetic code is used universally in all present-day organisms. Although 
a few slight differences in the code have been found, these are chiefly in the DNA 
of mitochondria. Mitochondria have their own transcription and protein syn- 
thesis systems that operate quite independently from those of the rest of the cell, 
and it is understandable that their small genomes have been able to accommo- 
date minor changes to the code (discussed in Chapter 14). 
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Figure 6-50 The genetic code. The standard one-letter abbreviation for each amino acid is presented below its three-letter 
abbreviation (see Panel 3-1 , pp. 1 28-1 29, for the full name of each amino acid and its structure). By convention, codons are 
always written with the 5'-terminal nucleotide to the left. Note that most amino acids are represented by more than one 
codon, and that there are some regularities in the set of codons that specifies each amino acid. Codons for the same amino 
acid tend to contain the same nucleotides at the first and second positions, and vary at the third position. Three codons do 
not specify any amino acid but act as termination sites (stop codons), signaling the end of the protein-coding sequence. 
One codon — AUG — acts both as an initiation codon, signaling the start of a protein-coding message, and also as the codon 
that specifies methionine. 
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In principle, an RNA sequence can be translated in any one of three differ- 
ent reading frames, depending on where the decoding process begins (Figure 
6-51). However, only one of the three possible reading frames in an mRNA 
encodes the required protein. We see later how a special punctuation signal at 
the beginning of each RNA message sets the correct reading frame at the start of 
protein synthesis. 



tRNA Molecules Match Amino Acids to Codons in mRNA 

The codons in an mRNA molecule do not directly recognize the amino acids 
they specify: the group of three nucleotides does not, for example, bind directly 
to the amino acid. Rather, the translation of mRNA into protein depends on 
adaptor molecules that can recognize and bind both to the codon and, at 
another site on their surface, to the amino acid. These adaptors consist of a set 
of small RNA molecules known as transfer RNAs (tRNAs), each about 80 
nucleotides in length. 

We saw earlier in this chapter that RNA molecules can fold into precise 
three-dimensional structures, and the tRNA molecules provide a striking exam- 
ple. Four short segments of the folded tRNA are double-helical, producing a 
molecule that looks like a cloverleaf when drawn schematically (Figure 6-52). 
For example, a 5'-GCUC-3' sequence in one part of a polynucleotide chain can 
form a relatively strong association with a 5'-GAGC-3' sequence in another 
region of the same molecule. The cloverleaf undergoes further folding to form a 
compact L-shaped structure that is held together by additional hydrogen bonds 
between different regions of the molecule. 

Two regions of unpaired nucleotides situated at either end of the L-shaped 
molecule are crucial to the function of tRNA in protein synthesis. One of these 
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Figure 6-51 The three possible reading 
frames in protein synthesis. In the 

process of translating a nucleotide 
sequence (blue) into an amino acid 
sequence (red), the sequence of 
nucleotides in an mRNA molecule is read 
from the 5' end to the 3' end in 
consecutive sets of three nucleotides. In 
principle, therefore, the same RNA 
sequence can specify three completely 
different amino acid sequences, 
depending on the reading frame. In 
reality, however, only one of these 
reading frames contains the actual 
message. 
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5' GCGGAUUUAGCUCAGDDGGGAGAGCGCCAGACUGAAYA^FCUGGAGGUCCUGUGT^CGAUCCACAGAAUUCGCACCA 3' 
(D) anticodon 

Figure 6-52 A tRNA molecule. A tRNA specific for the amino acid phenylalanine (Phe) is depicted in various ways. (A) The cloverleaf structure 
showing the complementary base-pairing (red lines) that creates the double-helical regions of the molecule. The anticodon is the sequence of 
three nucleotides that base-pairs with a codon in mRNA. The amino acid matching the codon/anticodon pair is attached at the 3' end of the 
tRNA. tRNAs contain some unusual bases, which are produced by chemical modification after the tRNA has been synthesized. For example, the 
bases denoted \\f (pseudouridine — see Figure 6-43) and D (dihydrouridine — see Figure 6-55) are derived from uracil. (B and C) Views of the 
L-shaped molecule, based on x-ray diffraction analysis. Although this diagram shows the tRNA for the amino acid phenylalanine, all other tRNAs 
have similar structures. <CGCA> (D) The linear nucleotide sequence of the molecule, color-coded to match (A), (B), and (C). (E) The tRNA icon we 
sue in this book. 
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regions forms the anticodon, a set of three consecutive nucleotides that pairs 
with the complementary codon in an mRNA molecule. The other is a short sin- 
gle-stranded region at the 3' end of the molecule; this is the site where the amino 
acid that matches the codon is attached to the tRNA. 

We have seen in the previous section that the genetic code is redundant; that 
is, several different codons can specify a single amino acid (see Figure 6-50). 
This redundancy implies either that there is more than one tRNA for many of the 
amino acids or that some tRNA molecules can base-pair with more than one 
codon. In fact, both situations occur. Some amino acids have more than one 
tRNA and some tRNAs are constructed so that they require accurate base-pair- 
ing only at the first two positions of the codon and can tolerate a mismatch (or 
wobble) at the third position (Figure 6-53). This wobble base-pairing explains 
why so many of the alternative codons for an amino acid differ only in their third 
nucleotide (see Figure 6-50). In bacteria, wobble base-pairings make it possible 
to fit the 20 amino acids to their 61 codons with as few as 31 kinds of tRNA 
molecules. The exact number of different kinds of tRNAs, however, differs from 
one species to the next. For example, humans have nearly 500 tRNA genes but, 
among them, only 48 different anticodons are represented. 



tRNAs Are Covalently Modified Before They Exit from the Nucleus 

Like most other eucaryotic RNAs, tRNAs are covalently modified before they are 
allowed to exit from the nucleus. Eucaryotic tRNAs are synthesized by RNA poly- 
merase III. Both bacterial and eucaryotic tRNAs are typically synthesized as 
larger precursor tRNAs, which are then trimmed to produce the mature tRNA. In 
addition, some tRNA precursors (from both bacteria and eucaryotes) contain 
introns that must be spliced out. This splicing reaction differs chemically from 
pre-mRNA splicing; rather than generating a lariat intermediate, tRNA splicing 
uses a cut-and-paste mechanism that is catalyzed by proteins (Figure 6-54). 
Trimming and splicing both require the precursor tRNA to be correctly folded in 
its cloverleaf configuration. Because misfolded tRNA precursors will not be pro- 
cessed properly, the trimming and splicing reactions are thought to act as qual- 
ity-control steps in the generation of tRNAs. 

All tRNAs are modified chemically — nearly 1 in 10 nucleotides in each 
mature tRNA molecule is an altered version of a standard G, U, C, or A ribonu- 
cleotide. Over 50 different types of tRNA modifications are known; a few are 
shown in Figure 6-55. Some of the modified nucleotides — most notably inosine, 
produced by the deamination of adenosine — affect the conformation and base- 
pairing of the anticodon and thereby facilitate the recognition of the appropri- 
ate mRNA codon by the tRNA molecule (see Figure 6-53) . Others affect the accu- 
racy with which the tRNA is attached to the correct amino acid. 



Figure 6-53 Wobble base-pairing between codons and anticodons. If the 

nucleotide listed in the first column is present at the third, or wobble, 
position of the codon, it can base-pair with any of the nucleotides listed in 
the second column. Thus, for example, when inosine (I) is present in the 
wobble position of the tRNA anticodon, the tRNA can recognize any one of 
three different codons in bacteria and either of two codons in eucaryotes. 
The inosine in tRNAs is formed from the deamination of guanine (see 
Figure 6-55), a chemical modification that takes place after the tRNA has 
been synthesized. The nonstandard base pairs, including those made with 
inosine, are generally weaker than conventional base pairs. Note that 
codon-anticodon base pairing is more stringent at positions 1 and 2 of the 
codon: here only conventional base pairs are permitted. The differences in 
wobble base-pairing interactions between bacteria and eucaryotes 
presumably result from subtle structural differences between bacterial and 
eucaryotic ribosomes, the molecular machines that perform protein 
synthesis. (Adapted from C. Guthrie and J. Abelson, in The Molecular 
Biology of the Yeast Saccharomyces: Metabolism and Gene Expression, 
pp. 487-528. Cold Spring Harbor, New York: Cold Spring Harbor Laboratory 
Press, 1982.) 
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Figure 6-54 Structure of a tRNA-splicing 
endonuclease docked to a precursor 
tRNA. The endonuclease (a four-subunit 
enzyme) removes the tRNA intron (blue). 
A second enzyme, a multifunctional tRNA 
ligase (not shown), then joins the two 
tRNA halves together. (Courtesy of Hong 
Li, Christopher Trotta and John Abelson.) 



Specific Enzymes Couple Each Amino Acid to Its Appropriate 
tRNA Molecule 

We have seen that, to read the genetic code in DNA, cells make a series of dif- 
ferent tRNAs. We now consider how each tRNA molecule becomes linked to the 
one amino acid in 20 that is its appropriate partner. Recognition and attach- 
ment of the correct amino acid depends on enzymes called aminoacyl-tRNA 
synthetases, which covalently couple each amino acid to its appropriate set of 
tRNA molecules (Figure 6-56 and Figure 6-57). Most cells have a different syn- 
thetase enzyme for each amino acid (that is, 20 synthetases in all); one attaches 
glycine to all tRNAs that recognize codons for glycine, another attaches alanine 
to all tRNAs that recognize codons for alanine, and so on. Many bacteria, how- 
ever, have fewer than 20 synthetases, and the same synthetase enzyme is 
responsible for coupling more than one amino acid to the appropriate tRNAs. 
In these cases, a single synthetase places the identical amino acid on two dif- 
ferent types of tRNAs, only one of which has an anticodon that matches the 
amino acid. A second enzyme then chemically modifies each "incorrectly" 
attached amino acid so that it now corresponds to the anticodon displayed by 
its covalently linked tRNA. 
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deamination of A 
(inosine) 



Figure 6-55 A few of the unusual 
nucleotides found in tRNA molecules. 

These nucleotides are produced by 
covalent modification of a normal 
nucleotide after it has been incorporated 
into an RNA chain. Two other types of 
modified nucleotides are shown in Figure 
6-43. In most tRNA molecules about 1 0% 
of the nucleotides are modified (see 
Figure 6-52). 
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The synthetase-catalyzed reaction that attaches the amino acid to the 3' end 
of the tRNA is one of many reactions coupled to the energy- releasing hydrolysis 
of ATP (see pp. 79-81), and it produces a high-energy bond between the tRNA 
and the amino acid. The energy of this bond is used at a later stage in protein 
synthesis to link the amino acid covalently to the growing polypeptide chain. 

The aminoacyl-tRNA synthetase enzymes and the tRNAs are equally impor- 
tant in the decoding process (Figure 6-58). This was established by an experi- 
ment in which one amino acid (cysteine) was chemically converted into a dif- 
ferent amino acid (alanine) after it already had been attached to its specific 
tRNA. When such "hybrid" aminoacyl-tRNA molecules were used for protein 
synthesis in a cell-free system, the wrong amino acid was inserted at every point 
in the protein chain where that tRNA was used. Although, as we shall see, cells 
have several quality control mechanisms to avoid this type of mishap, the exper- 
iment establishes that the genetic code is translated by two sets of adaptors that 
act sequentially. Each matches one molecular surface to another with great 
specificity, and it is their combined action that associates each sequence of three 
nucleotides in the mRNA molecule — that is, each codon — with its particular 
amino acid. 



Figure 6-56 Amino acid activation. 

An amino acid is activated for protein 
synthesis by an aminoacyl-tRNA 
synthetase enzyme in two steps. As 
indicated, the energy of ATP hydrolysis is 
used to attach each amino acid to its 
tRNA molecule in a high-energy linkage. 
The amino acid is first activated through 
the linkage of its carboxyl group directly 
to an AMP moiety, forming an adenylated 
amino acid; the linkage of the AMP, 
normally an unfavorable reaction, is 
driven by the hydrolysis of the ATP 
molecule that donates the AMP. Without 
leaving the synthetase enzyme, the AMP- 
linked carboxyl group on the amino acid 
is then transferred to a hydroxyl group on 
the sugar at the 3' end of the tRNA 
molecule. This transfer joins the amino 
acid by an activated ester linkage to the 
tRNA and forms the final aminoacyl-tRNA 
molecule. The synthetase enzyme is not 
shown in this diagram. 



Editing by tRNA Synthetases Ensures Accuracy 

Several mechanisms working together ensure that the tRNA synthetase links the 
correct amino acid to each tRNA. The synthetase must first select the correct 
amino acid, and most synthetases do so by a two-step mechanism. First, the cor- 
rect amino acid has the highest affinity for the active-site pocket of its synthetase 
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Figure 6-57 The structure of the 
aminoacyl-tRNA linkage. The carboxyl 
end of the amino acid forms an ester 
bond to ribose. Because the hydrolysis of 
this ester bond is associated with a large 
favorable change in free energy, an 
amino acid held in this way is said to be 
activated. (A) Schematic drawing of the 
structure. The amino acid is linked to the 
nucleotide at the 3' end of the tRNA (see 
Figure 6-52). (B) Actual structure 
corresponding to the boxed region in (A). 
There are two major classes of synthetase 
enzymes: one links the amino acid 
directly to the 3'-OH group of the ribose, 
and the other links it initially to the 2 / -OH 
group. In the latter case, a subsequent 
transesterification reaction shifts the 
amino acid to the 3' position. As in Figure 
6-56, the "R group" indicates the side 
chain of the amino acid. 
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and is therefore favored over the other 19. In particular, amino acids larger than 
the correct one are effectively excluded from the active site. However, accurate 
discrimination between two similar amino acids, such as isoleucine and valine 
(which differ by only a methyl group), is very difficult to achieve by a one-step 
recognition mechanism. A second discrimination step occurs after the amino 
acid has been covalently linked to AMP (see Figure 6-56). When tRNA binds the 
synthetase, it tries to force the amino acid into a second pocket in the syn- 
thetase, the precise dimensions of which exclude the correct amino acid but 
allow access by closely related amino acids. Once an amino acid enters this edit- 
ing pocket, it is hydrolyzed from the AMP (or from the tRNA itself if the aminoa- 
cyl-tRNA bond has already formed), and is released from the enzyme. This 
hydrolytic editing, which is analogous to the exonucleolytic proofreading by 
DNA polymerases (Figure 6-59), raises the overall accuracy of tRNA charging to 
approximately one mistake in 40,000 couplings. 
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Figure 6-58 The genetic code is 
translated by means of two adaptors 
that act one after another. The first 
adaptor is the aminoacyl-tRNA 
synthetase, which couples a particular 
amino acid to its corresponding tRNA; 
the second adaptor is the tRNA molecule 
itself, whose anticodon forms base pairs 
with the appropriate codon on the mRNA. 
An error in either step would cause the 
wrong amino acid to be incorporated 
into a protein chain. In the sequence of 
events shown, the amino acid tryptophan 
(Trp) is selected by the codon UGG on 
the mRNA. 
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Figure 6-59 Hydrolytic editing. (A) tRNA 
synthetases remove their own coupling 
errors through hydrolytic editing of 
incorrectly attached amino acids. As 
described in the text, the correct amino 
acid is rejected by the editing site. (B) The 
error-correction process performed by 
DNA polymerase shows some similarities; 
however, it differs in so far as the removal 
process depends strongly on a mispairing 
with the template (see Figure 5-8). 
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Figure 6-60 The recognition of a tRNA molecule by its aminoacyl-tRNA 
synthetase. For this tRNA (tRNA Gln ), specific nucleotides in both the 
anticodon (bottom) and the amino acid-accepting arm allow the correct tRNA 
to be recognized by the synthetase enzyme (blue). A bound ATP molecule is 
yellow. (Courtesy of Tom Steitz.) 



The tRNA synthetase must also recognize the correct set of tRNAs, and 
extensive structural and chemical complementarity between the synthetase and 
the tRNA allows the synthetase to probe various features of the tRNA (Figure 
6-60). Most tRNA synthetases directly recognize the matching tRNA anticodon; 
these synthetases contain three adjacent nucleotide-binding pockets, each of 
which is complementary in shape and charge to a nucleotide in the anticodon. 
For other synthetases, the nucleotide sequence of the acceptor stem is the key 
recognition determinant. In most cases, however, the synthetase "reads" the 
nucleotides at several different positions on the tRNA. 



Amino Acids Are Added to the C-terminal End of a Growing 
Polypeptide Chain 

Having seen that amino acids are first coupled to tRNA molecules, we now turn 
to the mechanism that joins amino acids together to form proteins. The funda- 
mental reaction of protein synthesis is the formation of a peptide bond between 
the carboxyl group at the end of a growing polypeptide chain and a free amino 
group on an incoming amino acid. Consequently, a protein is synthesized step- 
wise from its N-terminal end to its C-terminal end. Throughout the entire pro- 
cess the growing carboxyl end of the polypeptide chain remains activated by its 
covalent attachment to a tRNA molecule (forming a peptidyl-tRNA). Each addi- 
tion disrupts this high- energy covalent linkage, but immediately replaces it with 
an identical linkage on the most recently added amino acid (Figure 6-61) . In this 
way, each amino acid added carries with it the activation energy for the addition 
of the next amino acid rather than the energy for its own addition — an example 
of the "head growth" type of polymerization described in Figure 2-68. 



The RNA Message Is Decoded in Ribosomes 

The synthesis of proteins is guided by information carried by mRNA molecules. 
To maintain the correct reading frame and to ensure accuracy (about 1 mistake 
every 10,000 amino acids), protein synthesis is performed in the ribosome, a 
complex catalytic machine made from more than 50 different proteins (the 
ribosomal proteins) and several RNA molecules, the ribosomal RNAs (rRNAs). 
<CGCC> A typical eucaryotic cell contains millions of ribosomes in its cyto- 
plasm (Figure 6-62). Eucaryotic ribosome subunits are assembled at the nucle- 
olus, when newly transcribed and modified rRNAs associate with ribosomal 




Figure 6-61 The incorporation of an 
amino acid into a protein. A polypeptide 
chain grows by the stepwise addition of 
amino acids to its C-terminal end. The 
formation of each peptide bond is 
energetically favorable because the 
growing C-terminus has been activated by 
the covalent attachment of a tRNA 
molecule. The peptidyl-tRNA linkage that 
activates the growing end is regenerated 
during each addition. The amino acid side 
chains have been abbreviated as Ri, R2, R3, 
and R4; as a reference point, all of the atoms 
in the second amino acid in the 
polypeptide chain are shaded gray. The 
figure shows the addition of the fourth 
amino acid (red) to the growing chain. 
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400 nm 



proteins, which have been transported into the nucleus after their synthesis in 
the cytoplasm. The two ribosomal subunits are then exported to the cytoplasm, 
where they join together to synthesize proteins. 

Eucaryotic and procaryotic ribosomes have similar designs and functions. 
Both are composed of one large and one small subunit that fit together to form 
a complete ribosome with a mass of several million daltons (Figure 6-63). The 
small subunit provides the framework on which the tRNAs can be accurately 



Figure 6-62 Ribosomes in the 
cytoplasm of a eucaryotic cell. This 
electron micrograph shows a thin section 
of a small region of cytoplasm. The 
ribosomes appear as black dots (red 
arrows). Some are free in the cytosol; 
others are attached to membranes of the 
endoplasmic reticulum. (Courtesy of 
Daniel S. Friend.) 
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Figure 6-63 A comparison of procaryotic and eucaryotic ribosomes. Despite differences in the number and size of their 
rRNA and protein components, both procaryotic and eucaryotic ribosomes have nearly the same structure and they 
function similarly. Although the 18S and 28S rRNAs of the eucaryotic ribosome contain many nucleotides not present in 
their bacterial counterparts, these nucleotides are present as multiple insertions that form extra domains and leave the 
basic structure of each rRNA largely unchanged. 
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matched to the codons of the mRNA (see Figure 6-58), while the large subunit 
catalyzes the formation of the peptide bonds that link the amino acids together 
into a polypeptide chain (see Figure 6-61). 

When not actively synthesizing proteins, the two subunits of the ribosome 
are separate. They join together on an mRNA molecule, usually near its 5' end, 
to initiate the synthesis of a protein. The mRNA is then pulled through the ribo- 
some; as its codons enter the core of the ribosome, the mRNA nucleotide 
sequence is translated into an amino acid sequence using the tRNAs as adaptors 
to add each amino acid in the correct sequence to the end of the growing 
polypeptide chain. When a stop codon is encountered, the ribosome releases 
the finished protein, and its two subunits separate again. These subunits can 
then be used to start the synthesis of another protein on another mRNA 
molecule. 

Ribosomes operate with remarkable efficiency: in one second, a single ribo- 
some of a eucaryotic cell adds about 2 amino acids to a polypeptide chain; the 
ribosomes of bacterial cells operate even faster, at a rate of about 20 amino acids 
per second. How does the ribosome choreograph the many coordinated move- 
ments required for efficient translation? A ribosome contains four binding sites 
for RNA molecules: one is for the mRNA and three (called the A-site, the P-site, 
and the E-site) are for tRNAs (Figure 6-64) . A tRNA molecule is held tightly at the 
A- and P-sites only if its anticodon forms base pairs with a complementary 
codon (allowing for wobble) on the mRNA molecule that is threaded through the 
ribosome (Figure 6-65). The A- and P-sites are close enough together for their 
two tRNA molecules to be forced to form base pairs with adjacent codons on the 
mRNA molecule. This feature of the ribosome maintains the correct reading 
frame on the mRNA. 

Once protein synthesis has been initiated, each new amino acid is added to 
the elongating chain in a cycle of reactions containing four major steps: tRNA 



Figure 6-64 The RNA-binding sites in 
the ribosome. Each ribosome has one 
binding site for mRNA and three binding 
sites for tRNA: the A-, P-, and E-sites 
(short for aminoacyl-tRNA, peptidyl-tRNA, 
and exit, respectively). (A) A bacterial 
ribosome viewed with the small subunit 
in the front (dark green) and the large 
subunit in the back (light green). Both the 
rRNAs and the ribosomal proteins are 
shown. tRNAs are shown bound in the 
E-site (red), the P-site (orange) and the 
A-site (yellow). Although all three tRNA 
sites are shown occupied here, during the 
process of protein synthesis not more 
than two of these sites are thought to 
contain tRNA molecules at any one time 
(see Figure 6-66). (B) Large and small 
ribosomal subunits arranged as though 
the ribosome in (A) were opened like a 
book. (C) The ribosome in (A) rotated 
through 90° and viewed with the large 
subunit on top and small subunit on the 
bottom. (D) Schematic representation of 
a ribosome (in the same orientation as C), 
which will be used in subsequent figures. 
(A, B, and C, adapted from M.M. Yusupov 
et al., Science 292:883-896, 2001 . With 
permission from AAAS; courtesy of 
Albion Baucom and Harry Noller.) 
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binding, peptide bond formation, large subunit and small subunit translocation. 
As a result of the two translocation steps, the entire ribosome moves three 
nucleotides along the mRNA and is positioned to start the next cycle. (Figure 
6-66) . Our description of the chain elongation process begins at a point at which 
some amino acids have already been linked together and there is a tRNA 
molecule in the P-site on the ribosome, covalently joined to the end of the grow- 
ing polypeptide. In step 1, a tRNA carrying the next amino acid in the chain 
binds to the ribosomal A-site by forming base pairs with the mRNA codon posi- 
tioned there, so that the P-site and the A-site contain adjacent bound tRNAs. In 
step 2, the carboxyl end of the polypeptide chain is released from the tRNA at the 
P-site (by breakage of the high-energy bond between the tRNA and its amino 
acid) and joined to the free amino group of the amino acid linked to the tRNA at 
the A-site, forming a new peptide bond. This central reaction of protein synthe- 
sis is catalyzed by a peptidyl transferase contained in the large ribosomal sub- 
unit. In step 3, the large subunit moves relative to the mRNA held by the small 
subunit, thereby shifting the acceptor stems of the two tRNAs to the E- and P- 
sites of the large subunit. In step 4, another series of conformational changes 
moves the small subunit and its bound mRNA exactly three nucleotides, reset- 
ting the ribosome so it is ready to receive the next aminoacyl-tRNA. Step 1 is 
then repeated with a new incoming aminoacyl-tRNA, and so on. <CGTT> 

This four-step cycle is repeated each time an amino acid is added to the 
polypeptide chain, as the chain grows from its amino to its carboxyl end. 



Figure 6-65 The path of mRNA (blue) 
through the small ribosomal subunit. 

The orientation is the same as that in the 

right-hand panel of Figure 6-64B. 

(Courtesy of Harry F. Noller, based on 

data in G.Z. Yusopova et al., Cell 

1 06:233-241 , 2001 . With permission from 

Elsevier.) 
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Figure 6-66 Translating an mRNA molecule. Each amino acid added to the 
growing end of a polypeptide chain is selected by complementary base-pairing 
between the anticodon on its attached tRNA molecule and the next codon on the 
mRNA chain. Because only one of the many types of tRNA molecules in a cell can 
base-pair with each codon, the codon determines the specific amino acid to be 
added to the growing polypeptide chain. The four-step cycle shown is repeated 
over and over during the synthesis of a protein. In step 1, an aminoacyl-tRNA 
molecule binds to a vacant A-site on the ribosome and a spent tRNA molecule 
dissociates from the E-site. In step 2, a new peptide bond is formed. In step 3, the 
large subunit translocates relative to the small subunit, leaving the two tRNAs in 
hybrid sites: P on the large subunit and A on the small, for one; E on the large 
subunit and P on the small, for the other. In step 4, the small subunit translocates 
carrying its mRNA a distance of three nucleotides through the ribosome. This 
"resets" the ribosome with a fully empty A-site, ready for the next aminoacyl-tRNA 
molecule to bind. As indicated, the mRNA is translated in the 5'-to-3' direction, and 
the N-terminal end of a protein is made first, with each cycle adding one amino 
acid to the C-terminus of the polypeptide chain. 
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Elongation Factors Drive Translation Forward and Improve Its 
Accuracy 

The basic cycle of polypeptide elongation shown in outline in Figure 6-66 has an 
additional feature that makes translation especially efficient and accurate. Two 
elongation factors enter and leave the ribosome during each cycle, each 
hydrolyzing GTP to GDP and undergoing conformational changes in the pro- 
cess. These factors are called EF-Tu and EF-G in bacteria, and EF1 and EF2 in 
eucaryotes. Under some conditions in vitro, ribosomes can be forced to synthe- 
size proteins without the aid of these elongation factors and GTP hydrolysis, but 
this synthesis is very slow, inefficient, and inaccurate. Coupling the GTP hydrol- 
ysis-driven changes in the elongation factors to transitions between different 
states of the ribosome speeds up protein synthesis enormously. Although these 
ribosomal states are not yet understood in detail, they almost certainly involve 
RNA structure rearrangements in the ribosome core. The cycles of elongation 
factor association, GTP hydrolysis, and dissociation ensure that all such changes 
occur in the "forward" direction so that translation can proceed efficiently (Fig- 
ure 6-67). 

As shown previously, EF-Tu simultaneously binds GTP and aminoacyl-tRNAs 
(see Figure 3-74). In addition to helping move translation forward, EF-Tu (EF1 in 
eucaryotes) increases the accuracy of translation in several ways. First, as it 
escorts an incoming aminoacyl-tRNA to the ribosome, EF-Tu checks whether the 
tRNA-amino acid match is corrrect. Exactly how this is accomplished is not well 
understood. According to one idea, correct tRNA-amino acid matches have a nar- 
rowly defined affinity for EF-Tu, which allows EF-Tu to discriminate, albeit 
crudely, among many different amino acid-tRNA combinations, selectively bring- 
ing the correct ones with it into the ribosome. Second, EF-Tu monitors the initial 
interaction between the anticodon of an incoming aminoacyl-tRNA and the 
codon of the mRNA in the A-site. Aminoacyl-tRNAs are "bent" when bound to the 
GTP-form of EF-Tu; this bent conformation allows codon pairing but prevents 
incorporation of the amino acid into the growing polypeptide chain. However, if 
the codon-anticodon match is correct, the ribosome rapidly triggers the hydroly- 
sis of the GTP molecule, whereupon EF-Tu releases its grip on the tRNA and dis- 
sociates from the ribosome, allowing the tRNA to donate its amino acid for pro- 
tein synthesis. But how is the "correctness" of the codon-anticodon match 
assessed? This feat is carried out by the ribosome itself through an RNA-based 
mechanism. The rRNA in the small subunit of the ribosome forms a series of 
hydrogen bonds with the codon-anticodon pair that allows determination of its 
correctness (Figure 6-68). In essence, the rRNA folds around the codon-anti- 
codon pair, and its final closure — which occurs only when the correct anticodon 
is in place — triggers GTP hydrolysis. Remarkably, this induced fit mechanism can 
distinguish correct from incorrect codon-anticodon interactions despite the rules 
for wobble base-pairing summarized in Figure 6-53. From this example, as for 
RNA splicing, one gets a sense of the highly sophisticated forms of molecular 
recognition that can be achieved solely by RNA. 

The interactions of EF-Tu, tRNA, and the ribosome just described introduce 
critical proofreading steps into protein synthesis at the initial tRNA selection 
stage. But after GTP is hydrolyzed and EF-Tu dissociates from the ribosome, 
there is an additional opportunity for the ribosome to prevent an incorrect 
amino acid from being added to the growing chain. Following GTP hydrolysis, 
there is a short time delay as the amino acid carried by the tRNA moves into 
position on the ribosome. This time delay is shorter for correct than incorrect 
codon-anticodon pairs. Moreover, incorrectly matched tRNAs dissociate more 

Figure 6-67 Detailed view of the translation cycle. The outline of 
translation presented in Figure 6-66 has been expanded to show the roles 
of two elongation factors EF-Tu and EF-G, which drive translation in the 
forward direction. As explained in the text, EF-Tu also provides two 
opportunities for proofreading of the codon-anticodon match. In this way, 
incorrectly paired tRNAs are selectively rejected, and the accuracy of 
translation is improved. 
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rapidly than those correctly bound because their interaction with the codon is 
weaker. Thus, most incorrectly bound tRNA molecules (as well as a significant 
number of correctly bound molecules) will leave the ribosome without being 
used for protein synthesis. All of these proofreading steps, taken together, are 
largely responsible for the 99.99% accuracy of the ribosome in translating RNA 
into protein. 



The Ribosome Is a Ribozyme 

The ribosome is a large complex composed of two-thirds RNA and one-third 
protein. The determination, in 2000, of the entire three-dimensional conforma- 
tion of its large and small subunits is a major triumph of modern structural biol- 
ogy. The findings confirm earlier evidence that rRNAs — and not proteins — are 
responsible for the ribosome's overall structure, its ability to position tRNAs on 
the mRNA, and its catalytic activity in forming covalent peptide bonds. The ribo- 
somal RNAs are folded into highly compact, precise three-dimensional struc- 
tures that form the compact core of the ribosome and determine its overall 
shape (Figure 6-69). 

In marked contrast to the central positions of the rRNAs, the ribosomal pro- 
teins are generally located on the surface and fill in the gaps and crevices of the 
folded RNA (Figure 6-70). Some of these proteins send out extended regions of 
polypeptide chain that penetrate short distances into holes in the RNA core (Fig- 
ure 6-71). The main role of the ribosomal proteins seems to be to stabilize the 
RNA core, while permitting the changes in rRNA conformation that are necessary 
for this RNA to catalyze efficient protein synthesis. The proteins probably also aid 
in the initial assembly of the rRNAs that make up the core of the ribosome. 

Not only are the A-, P-, and E-binding sites for tRNAs formed primarily by 
ribosomal RNAs, but the catalytic site for peptide bond formation is also 
formed by RNA, as the nearest amino acid is located more than 1.8 nm away. 
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Figure 6-68 Recognition of correct 
codon-anticodon matches by the small 
subunit rRNA of the ribosome. Shown is 
the interaction between a nucleotide of 
the small subunit rRNA and the first 
nucleotide pair of a correctly paired 
codon-anticodon; similar interactions 
occur between other nucleotides of the 
rRNA and the second and third positions 
of the codon-anticodon pair. The small- 
subunit rRNA can form this network of 
hydrogen bonds only with correctly 
matched codon-anticodon pairs. As 
explained in the text, this 
codon-anticodon monitoring bythesmall- 
subunit rRNA increases the accuracy of 
protein synthesis. (From J.M. Ogle et al., 
Science 292:897-902, 2001 . With 
permission from AAAS.) 
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Figure 6-69 Structure of the rRNAs in the large subunit of a bacterial ribosome, as determined by x-ray crystallography. 

(A) Three-dimensional conformations of the large-subunit rRNAs (5S and 23S) as they appear in the ribosome. One of the 
protein subunits of the ribosome (L1) is also shown as a reference point, since it forms a characteristic protrusion on the 
ribosome. (B) Schematic diagram of the secondary structure of the 23S rRNA, showing the extensive network of base- 
pairing. The structure has been divided into six "domains" whose colors correspond to those in (A). The secondary-structure 
diagram is highly schematized to represent as much of the structure as possible in two dimensions. To do this, several 
discontinuities in the RNA chain have been introduced, although in reality the 23S RNA is a single RNA molecule. For 
example, the base of Domain III is continuous with the base of Domain IV even though a gap appears in the diagram. 
(Adapted from N. Ban et al., Science 289:905-920, 2000. With permission from AAAS.) 
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Figure 6-70 Location of the protein components of the bacterial large ribosomal subunit.The rRNAs (5S and 23S) are shown in gray and the 
large-subunit proteins (27 of the 31 total) in gold. For convenience, the protein structures depict only the polypeptide backbones. (A) Interface 
with the small subunit, the same view shown in Figure 6-64B. (B) Side opposite to that shown in (A), obtained by rotating (A) by 1 80° around a 
vertical axis. (C) Further slight rotation of (B) through a diagonal axis, allowing a view into the peptide exit channel in the center of the 
structure. (From N. Ban et al., Science 289:905-920, 2000. With permission from AAAS.) 



This discovery came as a surprise to biologists because, unlike proteins, RNA 
does not contain easily ionizable functional groups that can be used to catalyze 
sophisticated reactions like peptide bond formation. Moreover, metal ions, 
which are often used by RNA molecules to catalyze chemical reactions (as dis- 
cussed later in the chapter), were not observed at the active site of the ribosome. 
Instead, it is believed that the 23S rRNA forms a highly structured pocket that, 
through a network of hydrogen bonds, precisely orients the two reactants (the 
growing peptide chain and an aminoacyl-tRNA) and thereby greatly accelerates 
their covalent joining. In addition, the tRNA in the P site contributes to the active 
site, perhaps supplying a functional OH group that participates directly in the 
catalysis. This mechanism may ensure that catalysis occurs only when the tRNA 
is properly positioned in the ribosome. 

RNA molecules that possess catalytic activity are known as ribozymes. We 
saw earlier in this chapter how other ribozymes function in self-splicing reac- 
tions (for example, see Figure 6-36). In the final section of this chapter, we con- 
sider what the ability of RNA molecules to function as catalysts for a wide vari- 
ety of different reactions might mean for the early evolution of living cells. For 
now, we merely note that there is good reason to suspect that RNA rather than 
protein molecules served as the first catalysts for living cells. If so, the ribosome, 
with its RNA core, may be a relic of an earlier time in life's history — when protein 
synthesis evolved in cells that were run almost entirely by ribozymes. 



Nucleotide Sequences in mRNA Signal Where to Start Protein 
Synthesis 

The initiation and termination of translation share features of the translation 
elongation cycle described above. The site at which protein synthesis begins on 
the mRNA is especially crucial, since it sets the reading frame for the whole 
length of the message. An error of one nucleotide either way at this stage would 
cause every subsequent codon in the message to be misread, resulting in a non- 
functional protein with a garbled sequence of amino acids. The initiation step is 
also important because for most genes it is the last point at which the cell can 
decide whether the mRNA is to be translated and the protein synthesized; the 
rate of initiation is thus one determinant of the rate at which any protein is syn- 
thesized. We shall see in Chapter 7 that cells use several mechanisms to regulate 
translation initiation. 




Figure 6-71 Structure of the LI 5 protein 
in the large subunit of the bacterial 
ribosome. The globular domain of the 
protein lies on the surface of the 
ribosome and an extended region 
penetrates deeply into the RNA core of 
the ribosome. The L15 protein is shown in 
yellow and a portion of the ribosomal 
RNA core is shown in red. (From D. Klein, 
RB. Moore and T.A. Steitz, J. Mol. Biol. 
340:1 41 -1 47, 2004. With permission from 
Academic Press.) 
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The translation of an mRNA begins with the codon AUG, and a special tRNA 
is required to start translation. This initiator tRNA always carries the amino acid 
methionine (in bacteria, a modified form of methionine — formylmethionine — 
is used), with the result that all newly made proteins have methionine as the first 
amino acid at their N-terminus, the end of a protein that is synthesized first. This 
methionine is usually removed later by a specific protease. The initiator tRNA 
can be specially recognized by initiation factors because it has a nucleotide 
sequence distinct from that of the tRNA that normally carries methionine. 

In eucaryotes, the initiator tRNA-methionine complex (Met-tRNAi) is first 
loaded into the small ribosomal subunit along with additional proteins called 
eucaryotic initiation factors, or elFs (Figure 6-72). Of all the aminoacyl-tRNAs 
in the cell, only the methionine -charged initiator tRNA is capable of tightly 
binding the small ribosome subunit without the complete ribosome being pre- 
sent and it binds directly to the P-site. Next, the small ribosomal subunit binds 
to the 5' end of an mRNA molecule, which is recognized by virtue of its 5' cap 
and its two bound initiation factors, eIF4E (which directly binds the cap) and 
eIF4G (see Figure 6-40). The small ribosomal subunit then moves forward (5' to 
30 along the mRNA, searching for the first AUG. Additional initiation factors that 
act as ATP-powered helicases facilitate the ribosome's movement through RNA 
secondary structure. In 90% of mRNAs, translation begins at the first AUG 
encountered by the small subunit. At this point, the initiation factors dissociate, 
allowing the large ribosomal subunit to assemble with the complex and com- 
plete the ribosome. The initiator tRNA is still bound to the P-site, leaving the A- 
site vacant. Protein synthesis is therefore ready to begin (see Figure 6-72). 

The nucleotides immediately surrounding the start site in eucaryotic 
mRNAs influence the efficiency of AUG recognition during the above scanning 
process. If this recognition site differs substantially from the consensus recogni- 
tion sequence (5'-ACCMJGG-3'), scanning ribosomal subunits will sometimes 
ignore the first AUG codon in the mRNA and skip to the second or third AUG 
codon instead. Cells frequently use this phenomenon, known as "leaky scan- 
ning," to produce two or more proteins, differing in their N- termini, from the 
same mRNA molecule. It allows some genes to produce the same protein with 
and without a signal sequence attached at its N-terminus, for example, so that 
the protein is directed to two different compartments in the cell. 

The mechanism for selecting a start codon in bacteria is different. Bacterial 
mRNAs have no 5' caps to signal the ribosome where to begin searching for the 
start of translation. Instead, each bacterial mRNA contains a specific ribosome- 
binding site (called the Shine-Dalgarno sequence, named after its discoverers) 
that is located a few nucleotides upstream of the AUG at which translation is to 
begin. This nucleotide sequence, with the consensus 5'-AGGAGGU-3', forms 
base pairs with the 16S rRNA of the small ribosomal subunit to position the ini- 
tiating AUG codon in the ribosome. A set of translation initiation factors orches- 
trates this interaction, as well as the subsequent assembly of the large ribosomal 
subunit to complete the ribosome. 

Unlike a eucaryotic ribosome, a bacterial ribosome can therefore readily 
assemble directly on a start codon that lies in the interior of an mRNA molecule, 
so long as a ribosome-binding site precedes it by several nucleotides. As a result, 
bacterial mRNAs are often polycistronic — that is, they encode several different 
proteins, each of which is translated from the same mRNA molecule (Figure 
6-73). In contrast, a eucaryotic mRNA generally encodes only a single protein. 



Figure 6-72 The initiation of protein synthesis in eucaryotes. Only three 
of the many translation initiation factors required for this process are 
shown. Efficient translation initiation also requires the poly-A tail of the 
mRNA bound by poly-A-binding proteins which, in turn, interact with 
elF4G. In this way, the translation apparatus ascertains that both ends of 
the mRNA are intact before initiating protein synthesis (see Figure 6-40). 
Although only one GTP hydrolysis event is shown in the figure, a second is 
known to occur just before the large and small ribosomal subunits join. 
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Figure 6-73 Structure of a typical bacterial mRNA molecule. Unlike 
eucaryotic ribosomes, which typically require a capped 5' end, procaryotic 
ribosomes initiate transcription at ribosome-binding sites (Shine-Dalgarno 
sequences), which can be located anywhere along an mRNA molecule. This 
property of ribosomes permits bacteria to synthesize more than one type 
of protein from a single mRNA molecule. 



Stop Codons Mark the End of Translation 

The end of the protein- coding message is signaled by the presence of one of 
three stop codons (UAA, UAG, or UGA) (see Figure 6-50). These are not recog- 
nized by a tRNA and do not specify an amino acid, but instead signal to the ribo- 
some to stop translation. Proteins known as release factors bind to any ribosome 
with a stop codon positioned in the A site, forcing the peptidyl transferase in the 
ribosome to catalyze the addition of a water molecule instead of an amino acid 
to the peptidyl-tRNA (Figure 6-74). This reaction frees the carboxyl end of the 
growing polypeptide chain from its attachment to a tRNA molecule, and since 
only this attachment normally holds the growing polypeptide to the ribosome, 
the completed protein chain is immediately released into the cytoplasm. The 
ribosome then releases the mRNA and separates into the large and small sub- 
units, which can assemble on this or another mRNA molecule to begin a new 
round of protein synthesis. 

Release factors are an example of molecular mimicry, whereby one type of 
macromolecule resembles the shape of a chemically unrelated molecule. In this 
case, the three-dimensional structure of release factors (made entirely of pro- 
tein) resembles the shape and charge distribution of a tRNA molecule (Figure 
6-75). This shape and charge mimicry helps them enter the A-site on the ribo- 
some and cause translation termination. 

During translation, the nascent polypeptide moves through a large, water- 
filled tunnel (approximately 10 nm x 1.5 nm) in the large subunit of the ribo- 
some (see Figure 6-70C). The walls of this tunnel, made primarily of 23S rRNA, 
are a patchwork of tiny hydrophobic surfaces embedded in a more extensive 
hydrophilic surface. This structure is not complementary to any peptide, and 
thus provides a "Teflon" coating through which a polypeptide chain can easily 
slide. The dimensions of the tunnel suggest that nascent proteins are largely 
unstructured as they pass through the ribosome, although some a-helical 
regions of the protein can form before leaving the ribosome tunnel. As it leaves 
the ribosome, a newly synthesized protein must fold into its proper three- 
dimensional conformation to be useful to the cell, and later in this chapter we 
discuss how this folding occurs. First, however, we describe several additional 
aspects of the translation process itself. 



Proteins Are Made on Polyribosomes 

The synthesis of most protein molecules takes between 20 seconds and several 
minutes. During this very short period, however, it is usual for multiple initia- 
tions to take place on each mRNA molecule being translated. As soon as the pre- 
ceding ribosome has translated enough of the nucleotide sequence to move out 

Figure 6-74 The final phase of protein synthesis. The binding of a release 
factor to an A-site bearing a stop codon terminates translation. The 
completed polypeptide is released and, in a series of reactions that 
requires additional proteins and GTP hydrolysis (not shown), the ribosome 
dissociates into its two separate subunits. 
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Figure 6-75 The structure of a human 
translation release factor (eRFI) and its 
resemblance to a tRNA molecule. The 

protein is on the left and the tRNA on the 
right (From H. Song et al., Cell 100:31 1-321, 
2000. With permission from Elsevier.) 



of the way, the 5' end of the mRNA is threaded into a new ribosome. The mRNA 
molecules being translated are therefore usually found in the form of polyribo- 
somes (or polysomes): large cytoplasmic assemblies made up of several ribo- 
somes spaced as close as 80 nucleotides apart along a single mRNA molecule 
(Figure 6-76). These multiple initiations allow the cell to make many more pro- 
tein molecules in a given time than would be possible if each had to be com- 
pleted before the next could start. <GAAG> 

Both bacteria and eucaryotes use polysomes, and both employ additional 
strategies to speed up the overall rate of protein synthesis even further. Because 
bacterial mRNA does not need to be processed and is accessible to ribosomes 
while it is being made, ribosomes can attach to the free end of a bacterial mRNA 
molecule and start translating it even before the transcription of that RNA is com- 
plete, following closely behind the RNA polymerase as it moves along DNA. In 
eucaryotes, as we have seen, the 5' and 3' ends of the mRNA interact (see Figures 
6-40 and 6-76A); therefore, as soon as a ribosome dissociates, its two subunits are 
in an optimal position to reinitiate translation on the same mRNA molecule. 



There Are Minor Variations in the Standard Genetic Code 



As discussed in Chapter 1, the genetic code (shown in Figure 6-50) applies to all 
three major branches of life, providing important evidence for the common 
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Figure 6-76 A polyribosome. 

(A) Schematic drawing showing how a 
series of ribosomes can simultaneously 
translate the same eucaryotic mRNA 
molecule. (B) Electron micrograph of a 
polyribosome from a eucaryotic cell. 
(B, courtesy of John Heuser.) 
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ancestry of all life on Earth. Although rare, there are exceptions to this code. For 
example, Candida albicans, the most prevalent human fungal pathogen, trans- 
lates the codon CUG as serine, whereas nearly all other organisms translate it as 
leucine. Mitochondria (which have their own genomes and encode much of 
their translational apparatus) often deviate from the standard code. For exam- 
ple, in mammalian mitochondria AUA is translated as methionine, whereas in 
the cytosol of the cell it is translated as isoleucine (see Table 14-3, p. 862). This 
type of deviation in the genetic code is "hardwired" into the organisms or the 
organelles in which it occurs. 

A different type of variation, sometimes called translation recoding, occurs 
in many cells. In this case, other nucleotide sequence information present in an 
mRNA can change the meaning of the genetic code at a particular site in the 
mRNA molecule. The standard code allows cells to manufacture proteins using 
only 20 amino acids. However, bacteria, archaea, and eucaryotes have available 
to them a twenty- first amino acid that can be incorporated directly into a grow- 
ing polypeptide chain through translation recoding. Selenocysteine, which is 
essential for the efficient function of a variety of enzymes, contains a selenium 
atom in place of the sulfur atom of cysteine. Selenocysteine is enzymatically 
produced from a serine attached to a special tRNA molecule that base-pairs with 
the UGA codon, a codon normally used to signal a translation stop. The mRNAs 
for proteins in which selenocysteine is to be inserted at a UGA codon carry an 
additional nucleotide sequence in the mRNA nearby that causes this recoding 
event (Figure 6-77). 

Another form of recoding, translational frameshifting, allows more than one 
protein to be synthesized from a single mRNA. Retroviruses, members of a large 
group of eucaryotic-infecting pathogens, commonly use translational 
frameshifting to make both the capsid proteins (Gag proteins) and the viral 
reverse transcriptase and integrase (Pol proteins) from the same RNA transcript 
(see Figure 5-73). The virus needs many more copies of the Gag proteins than it 
does of the Pol proteins. This quantitative adjustment is achieved by encoding 
the Pol genes just after the Gag genes but in a different reading frame. Small 
amounts of the Pol gene products are made because, on occasion, an upstream 
translational frameshift allows the Gag protein stop codon to be bypassed. This 
frameshift occurs at a particular codon in the mRNA and requires a specific 
recoding signal, which seems to be a structural feature of the RNA sequence 
downstream of this site (Figure 6-78). 



Figure 6-77 Incorporation of 
selenocysteine into a growing 
polypeptide chain. A specialized tRNA is 
charged with serine by the normal seryl- 
tRNA synthetase, and the serine is 
subsequently converted enzymatically to 
selenocysteine. A specific RNA structure 
in the mRNA (a stem and loop structure 
with a particular nucleotide sequence) 
signals that selenocysteine is to be 
inserted at the neighboring UGA codon. 
As indicated, this event requires the 
participation of a selenocysteine-specific 
translation factor. 



Inhibitors of Procaryotic Protein Synthesis Are Useful as 
Antibiotics 

Many of the most effective antibiotics used in modern medicine are compounds 
made by fungi that inhibit bacterial protein synthesis. Fungi and bacteria com- 
pete for many of the same environmental niches, and millions of years of coevo- 
lution has resulted in fungi producing potent bacterial inhibitors. Some of these 
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drugs exploit the structural and functional differences between bacterial and 
eucaryotic ribosomes so as to interfere preferentially with the function of bacte- 
rial ribosomes. Thus humans can take high dosages of some of these com- 
pounds without undue toxicity Many antibiotics lodge in pockets in the riboso- 
mal RNAs and simply interfere with the smooth operation of the ribosome (Fig- 
ure 6-79) . Table 6-4 lists some of the more common antibiotics of this kind 
along with several other inhibitors of protein synthesis, some of which act on 
eucaryotic cells and therefore cannot be used as antibiotics. 

Because they block specific steps in the processes that lead from DNA to 
protein, many of the compounds listed in Table 6-4 are useful for cell biological 
studies. Among the most commonly used drugs in such investigations are chlo- 
ramphenicol, cycloheximide, and puromycin, all of which specifically inhibit 
protein synthesis. In a eucaryotic cell, for example, chloramphenicol inhibits 
protein synthesis on ribosomes only in mitochondria (and in chloroplasts in 
plants), presumably reflecting the procaryotic origins of these organelles (dis- 
cussed in Chapter 14). Cycloheximide, in contrast, affects only ribosomes in the 
cytosol. Puromycin is especially interesting because it is a structural analog of a 
tRNA molecule linked to an amino acid and is therefore another example of 
molecular mimicry; the ribosome mistakes it for an authentic amino acid and 
covalently incorporates it at the C-terminus of the growing peptide chain, 
thereby causing the premature termination and release of the polypeptide. As 
might be expected, puromycin inhibits protein synthesis in both procaryotes 
and eucaryotes. 



Figure 6-78 The translational 
frameshifting that produces the reverse 
transcriptase and integrase of a 
retrovirus. The viral reverse transcriptase 
and integrase are produced by proteolytic 
processing of a large protein (the Gag-Pol 
fusion protein) consisting of both the Gag 
and Pol amino acid sequences. Proteolytic 
processing of the more abundant Gag 
protein produces the viral capsid proteins. 
Both the Gag and the Gag-Pol fusion 
proteins start with identical mRNA, but 
whereas the Gag protein terminates at a 
stop codon downstream of the sequence 
shown, translation of the Gag-Pol fusion 
protein bypasses this stop codon, allowing 
the synthesis of the longer Gag-Pol fusion 
protein. The stop-codon-bypass is made 
possible by a controlled translational 
frameshift, as illustrated. Features in the 
local RNA structure (including the RNA 
loop shown) cause the tRNA Leu attached 
to the C-terminus of the growing 
polypeptide chain occasionally to slip 
backward by one nucleotide on the 
ribosome, so that it pairs with a UUU 
codon instead of the UUA codon that had 
initially specified its incorporation; the 
next codon (AGG) in the new reading 
frame specifies an arginine rather than a 
glycine. This controlled slippage is due in 
part to a pseudoknot that forms in the viral 
mRNA (see Figure 6-1 02). The sequence 
shown is from the human AIDS virus, HIV. 
(Adapted from T. Jacks et al., Nature 
331:280-283, 1988. With permission from 
Macmillan Publishers Ltd.) 
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Figure 6-79 Binding sites for antibiotics 
on the bacterial ribosome. The small (left) 
and large (right) subunits of the ribosome 
are arranged as though the ribosome has 
been opened like a book; the bound tRNA 
molecules are shown in purple (see Figure 
6-64). Most of the antibiotics shown bind 
directly to pockets formed by the 
ribosomal RNA molecules. Hygromycin B 
induces errors in translation, 
spectinomycin blocks the translocation of 
the peptidyl-tRNA from the A-site to the 
P-site, and streptogramin B prevents 
elongation of nascent peptides. Table 6-4 
lists the inhibitory mechanisms of the 
other antibiotics shown in the figure. 
(Adapted from J. Poehlsgaard and 
S. Douthwaite, Nat. Rev. Microbiol. 
3:870-881, 2005. With permission from 
Macmillan Publishers Ltd.) 
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Table 6-4 Inhibitors of Protein or RNA Synthesis 
INHIBITOR SPECIFIC EFFECT 

Acting only on bacteria 

Tetracycline blocks binding of aminoacyl-tRNA to A-site of ribosome 

Streptomycin prevents the transition from translation initiation to chain elongation and also causes miscoding 

Chloramphenicol blocks the peptidyl transferase reaction on ribosomes (step 2 in Figure 6-66) 

Erythromycin binds in the exit channel of the ribosome and thereby inhibits elongation of the peptide chain 

Rifamycin blocks initiation of RNA chains by binding to RNA polymerase (prevents RNA synthesis) 

Acting on bacteria and eucaryotes 

Puromycin causes the premature release of nascent polypeptide chains by its addition to the growing chain end 

Actinomycin D binds to DNA and blocks the movement of RNA polymerase (prevents RNA synthesis) 

Acting on eucaryotes but not bacteria 

Cycloheximide blocks the translocation reaction on ribosomes (step 3 in Figure 6-66) 

Anisomycin blocks the peptidyl transferase reaction on ribosomes (step 2 in Figure 6-66) 

oc-Amanitin blocks mRNA synthesis by binding preferentially to RNA polymerase II 

The ribosomes of eucaryotic mitochondria (and chloroplasts) often resemble those of bacteria in their sensitivity to inhibitors. Therefore, some of 
these antibiotics can have a deleterious effect on human mitochondria. 

Accuracy in Translation Requires the Expenditure of Free Energy 

Translation by the ribosome is a compromise between the opposing constraints 
of accuracy and speed. We have seen, for example, that the accuracy of transla- 
tion (1 mistake per 10 4 amino acids joined) requires time delays each time a new 
amino acid is added to a growing polypeptide chain, producing an overall speed 
of translation of 20 amino acids incorporated per second in bacteria. Mutant 
bacteria with a specific alteration in the small ribosomal subunit have longer 
delays and translate mRNA into protein with an accuracy considerably higher 
than this; however, protein synthesis is so slow in these mutants that the bacte- 
ria are barely able to survive. 

We have also seen that attaining the observed accuracy of protein synthesis 
requires the expenditure of a great deal of free energy; this is expected, since, as 
discussed in Chapter 2, there is a price to be paid for any increase in order in the 
cell. In most cells, protein synthesis consumes more energy than any other 
biosynthetic process. At least four high-energy phosphate bonds are split to 
make each new peptide bond: two are consumed in charging a tRNA molecule 
with an amino acid (see Figure 6-56), and two more drive steps in the cycle of 
reactions occurring on the ribosome during synthesis itself (see Figure 6-67). In 
addition, extra energy is consumed each time that an incorrect amino acid link- 
age is hydrolyzed by a tRNA synthetase (see Figure 6-59) and each time that an 
incorrect tRNA enters the ribosome, triggers GTP hydrolysis, and is rejected (see 
Figure 6-67). To be effective, these proofreading mechanisms must also allow an 
appreciable fraction of correct interactions to be removed; for this reason, proof- 
reading is even more costly in energy than it might seem. 

Quality Control Mechanisms Act to Prevent Translation of 
Damaged mRNAs 

In eucaryotes, mRNA production involves both transcription and a series of elab- 
orate RNA-processing steps; these take place in the nucleus, segregated from 
ribosomes, and only when the processing is complete are the mRNAs transported 
to the cytoplasm to be translated (see Figure 6-40). However, this scheme is not 
foolproof, and some incorrectly processed mRNAs are inadvertently sent to the 
cytoplasm. In addition, mRNAs that were flawless when they left the nucleus can 
become broken or otherwise damaged in the cytosol. The danger of translating 
damaged or incompletely processed mRNAs (which would produce truncated or 
otherwise aberrant proteins) is apparently so great that the cell has several 
backup measures to prevent this from happening. 
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To avoid translating broken mRNAs, the 5' cap and the poly-A tail are both 
recognized by the translation-initiation machinery before translation begins 
(see Figure 6-72). To help ensure that mRNAs are properly spliced before they 
are translated, the exon junction complex (EJC), which is deposited on the 
mRNA following splicing (see Figure 6-40), stimulates the subsequent transla- 
tion of the mRNA. 

But the most powerful mRNA surveillance system, called nonsense-medi- 
ated mRNA decay, eliminates defective mRNAs before they can be efficiently 
translated into protein. This mechanism is brought into play when the cell 
determines that an mRNA molecule has a nonsense (stop) codon (UAA, UAG, or 
UGA) in the "wrong" place — a situation likely to arise in an mRNA molecule that 
has been improperly spliced. Aberrant splicing will usually result in the random 
introduction of a nonsense codon into the reading frame of the mRNA, espe- 
cially in organisms, such as humans, that have a large average intron size (see 
Figure 6-32B). 

This surveillance mechanism begins as an mRNA molecule is being trans- 
ported from the nucleus to the cytosol. As its 5' end emerges from the nuclear 
pore, the mRNA is met by a ribosome, which begins to translate it. As translation 
proceeds, the exon junction complexes (EJC) bound to the mRNA at each splice- 
site are apparently displaced by the moving ribosome. The normal stop codon 
will be within the last exon, so by the time the ribosome reaches it and stalls, no 
more EJCs should be bound to the mRNA. If this is the case, the mRNA "passes 
inspection" and is released to the cytosol where it can be translated in earnest 
(Figure 6-80). However, if the ribosome reaches a premature stop codon and 
stalls, it senses that EJCs remain and the bound mRNA molecule is rapidly 
degraded. In this way, the first round of translation allows the cell to test the fit- 
ness of each mRNA molecule as it exits the nucleus. 

Nonsense-mediated decay may have been especially important in evolu- 
tion, allowing eucaryotic cells to more easily explore new genes formed by DNA 
rearrangements, mutations, or alternative patterns of splicing — by selecting 
only those mRNAs for translation that can produce a full-length protein. Non- 
sense-mediated decay is also important in cells of the developing immune sys- 
tem, where the extensive DNA rearrangements that occur (see Figure 25-36) 
often generate premature termination codons. The surveillance system 
degrades the mRNAs produced from such rearranged genes, thereby avoiding 
the potential toxic effects of truncated proteins. 



Figure 6-80 Nonsense-mediated mRNA 
decay. As shown on the right, the failure 
to correctly splice a pre-mRNA often 
introduces a premature stop codon into 
the reading frame for the protein. The 
introduction of such an "in -frame" stop 
codon is particularly likely to occur in 
mammals, where the introns tend to be 
very long. When translated, these 
abnormal mRNAs produce aberrant 
proteins, which could damage the cell. 
However, as shown at the bottom right of 
the figure, these abnormal RNAs are 
destroyed by the nonsense-mediated 
decay mechanism. According to one 
model, an mRNA molecule, bearing exon 
junction complexes (EJCs) to mark 
successfully completed splices, is first met 
by a ribosome that performs a "test" 
round of translation. As the mRNA passes 
through the tight channel of the 
ribosome, the EJCs are stripped off, and 
successful mRNAs are released to 
undergo multiple rounds of translation 
(leftside). However, if an in-frame stop 
codon is encountered before the final 
exon junction complex is reached {right 
side), the mRNA undergoes nonsense- 
mediated decay, which is triggered by 
the Upf proteins {green) that bind to each 
EJC. Note that, to trigger nonsense- 
mediated decay, the premature stop 
codon must be in the same reading 
frame as that of the normal protein. 
(Adapted from J. Lykke-Andersen et al., 
Cell 103:1 121-1 131, 2000. With 
permission from Elsevier.) 
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Figure 6-81 The rescue of a bacterial ribosome stalled on an incomplete 
mRNA molecule. The tmRNA shown is a 363-nucleotide RNA with both 
tRNA and mRNA functions, hence its name. It carries an alanine and can 
enter the vacant A-site of a stalled ribosome to add this alanine to a 
polypeptide chain, mimicking a tRNA although no codon is present to 
guide it. The ribosome then translates 1 codons from the tmRNA, 
completing an 1 1 amino acid tag on the protein. Proteases recognize this 
tag and degrade the entire protein. Although the example shown in the 
figure is from bacteria, eucaryotes can employ a similar strategy. 



Finally, the nonsense-mediated surveillance pathway plays an important 
role in mitigating the symptoms of many inherited human diseases. As we have 
seen, inherited diseases are usually caused by mutations that spoil the function 
of a key protein, such as hemoglobin or one of the blood clotting factors. 
Approximately one-third of all genetic disorders in humans result from non- 
sense mutations or mutations (such as frameshift mutations or splice-site muta- 
tions) that place nonsense mutations into the gene's reading frame. In individu- 
als that carry one mutant and one functional gene, nonsense-mediated decay 
eliminates the aberrant mRNA and thereby prevents a potentially toxic protein 
from being made. Without this safeguard, individuals with one functional and 
one mutant "disease gene" would likely suffer much more severe symptoms. 

We saw earlier in this chapter that bacteria lack the elaborate mRNA pro- 
cessing found in eucaryotes and that translation often begins before the synthe- 
sis of the RNA molecule is completed. Yet bacteria also have quality control 
mechanisms to deal with incompletely synthesized and broken mRNAs. When 
the bacterial ribosome translates to the end of an incomplete RNA it stalls and 
does not release the RNA. Rescue comes in the form of a special RNA (called 
tmRNA), which enters the A-site of the ribosome and is itself translated, releas- 
ing the ribosome. The special 1 1 amino acid tag thus added to the C-terminus of 
the truncated protein signals to proteases that the entire protein is to be 
degraded (Figure 6-81). 



Some Proteins Begin to Fold While Still Being Synthesized 

The process of gene expression is not over when the genetic code has been used 
to create the sequence of amino acids that constitutes a protein. To be useful to 
the cell, this new polypeptide chain must fold up into its unique three-dimen- 
sional conformation, bind any small-molecule cofactors required for its activity, 
be appropriately modified by protein kinases or other protein-modifying 
enzymes, and assemble correctly with the other protein subunits with which it 
functions (Figure 6-82) . 

The information needed for all of the steps listed above is ultimately con- 
tained in the sequence of linked amino acids that the ribosome produces when 
it translates an mRNA molecule into a polypeptide chain. As discussed in Chap- 
ter 3, when a protein folds into a compact structure, it buries most of its 
hydrophobic residues in an interior core. In addition, large numbers of nonco- 
valent interactions form between various parts of the molecule. It is the sum of 
all of these energetically favorable arrangements that determines the final fold- 
ing pattern of the polypeptide chain — as the conformation of lowest free energy 
(seep. 130). 

Through many millions of years of evolution, the amino acid sequence of 
each protein has been selected not only for the conformation that it adopts but 
also for an ability to fold rapidly. For some proteins, this folding begins immedi- 
ately, as the protein spins out of the ribosome, starting from the N-terminal end. 
In these cases, as each protein domain emerges from the ribosome, within a few 
seconds it forms a compact structure that contains most of the final secondary 
features (a helices and (3 sheets) aligned in roughly the right conformation (Fig- 
ure 6-83). For many protein domains, this unusually dynamic and flexible state 
called a molten globule, is the starting point for a relatively slow process in which 
many side-chain adjustments occur that eventually form the correct tertiary 
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Figure 6-82 Steps in the creation of a functional protein. As indicated, 
translation of an mRNA sequence into an amino acid sequence on the 
ribosome is not the end of the process of forming a protein. To function, 
the completed polypeptide chain must fold correctly into its three- 
dimensional conformation, bind any cofactors required, and assemble with 
its partner protein chains (if any). Noncovalent bond formation drives these 
changes. As indicated, many proteins also require covalent modifications of 
selected amino acids. Although the most frequent modifications are 
protein glycosylation and protein phosphorylation, more than 100 different 
types of covalent modifications are known (see, for example, Figure 3-81). 



structure. It takes several minutes to synthesize a protein of average size, and for 
some proteins much of the folding process is complete by the time the ribosome 
releases the C-terminal end of a protein (Figure 6-84). 



Molecular Chaperones Help Guide the Folding of Most Proteins 

Most proteins probably do not begin to fold during their synthesis. Instead, they 
are met at the ribosome by a special class of proteins called molecular chaper- 
ones. Molecular chaperones are useful for cells because there are many different 
paths that can be taken to convert an unfolded or partially folded protein to its 
final compact conformation. For many proteins, some of the intermediates 
formed along the way would aggregate and be left as off-pathway dead ends 
without the intervention of a chaperone (Figure 6-85). 

Many molecular chaperones are called heat-shock proteins (designated 
Hsp), because they are synthesized in dramatically increased amounts after a 
brief exposure of cells to an elevated temperature (for example, 42°C for cells 
that normally live at 37°C). This reflects the operation of a feedback system that 
responds to an increase in misfolded proteins (such as those produced by ele- 
vated temperatures) by boosting the synthesis of the chaperones that help these 
proteins refold. 

There are several major families of eucaryotic molecular chaperones, 
including the Hsp60 and Hsp70 proteins. Different family members function in 
different organelles. Thus, as discussed in Chapter 12, mitochondria contain 
their own Hsp60 and Hsp70 molecules that are distinct from those that function 
in the cytosol; and a special Hsp70 (called BIP) helps to fold proteins in the 
endoplasmic reticulum. 
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Figure 6-83 The structure of a molten 
globule. (A) A molten globule form of 
cytochrome b 56 2 is more open and less 
highly ordered than the final folded form 
of the protein, shown in (B). Note that the 
molten globule contains most of the 
secondary structure of the final form, 
although the ends of the a helices are 
unravelled and one of the helices is only 
partly formed. (Courtesy of Joshua Wand, 
from Y. Feng et a I., Nat. Struct Biol. 
1:30-35, 1994. With permission from 
Macmillan Publishers Ltd.) 
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Figure 6-84 Co-translational protein 
folding. A growing polypeptide chain is 
shown acquiring its secondary and 
tertiary structure as it emerges from a 
ribosome. The N-terminal domain folds 
first, while the C-terminal domain is still 
being synthesized. This protein has not 
achieved its final conformation at the 
time it is released from the ribosome. 
(Modified from A.N. Federov and 
TO. Baldwin, J. Biol. Chem. 
272:32715-32718,1997.) 
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The Hsp60 and Hsp70 proteins each work with their own small set of asso- 
ciated proteins when they help other proteins to fold. Hsps share an affinity for 
the exposed hydrophobic patches on incompletely folded proteins, and they 
hydrolyze ATP, often binding and releasing their protein substrate with each 
cycle of ATP hydrolysis. In other respects, the two types of Hsp proteins function 
differently. The Hsp70 machinery acts early in the life of many proteins, binding 
to a string of about seven hydrophobic amino acids before the protein leaves the 
ribosome (Figure 6-86). In contrast, Hsp60-like proteins form a large barrel- 
shaped structure that acts after a protein has been fully synthesized. This type of 
chaperone, sometimes called a chaperonin, forms an "isolation chamber" into 
which misfolded proteins are fed, preventing their aggregation and providing 
them with a favorable environment in which to attempt to refold (Figure 6-87). 
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Figure 6-85 A current view of protein 
folding. Each domain of a newly 
synthesized protein rapidly attains a 
"molten globule" state. Subsequent 
folding occurs more slowly and by 
multiple pathways, often involving the 
help of a molecular chaperone. Some 
molecules may still fail to fold correctly; 
as explained in the text, specific 
proteases recognize and degrade these 
molecules. 
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The chaperones shown in Figures 6-86 and 6-87 often use many cycles of 
ATP hydrolysis to fold a single polypeptide chain correctly Although some of this 
energy expenditure is used to perform mechanical work, probably much more is 
expended to ensure that protein folding is accurate. Just as we saw for transcrip- 
tion, splicing, and translation, the expenditure of free energy can be used by 
cells to improve the accuracy of a biological process. In the case of protein fold- 
ing, ATP hydrolysis allows chaperones to recognize a wide variety of misfolded 
structures, to halt any further misfolding and to recommence folding of a pro- 
tein in an orderly way. 

Although our discussion focuses on only two types of chaperones, the cell 
has a variety of others. The enormous diversity of proteins in cells presumably 
requires a wide range of chaperones with versatile surveillance and correction 
capabilities. 



Figure 6-86 The Hsp70 family of 
molecular chaperones. These proteins 
act early, recognizing a small stretch of 
hydrophobic amino acids on a protein's 
surface. Aided by a set of smaller Hsp40 
proteins (not shown), ATP-bound Hsp70 
molecules grasp their target protein and 
then hydrolyze ATP to ADP, undergoing 
conformational changes that cause the 
Hsp70 molecules to associate even more 
tightly with the target. After the Hsp40 
dissociates, the rapid rebinding of ATP 
induces the dissociation of the Hsp70 
protein after ADP release. In reality, 
repeated cycles of Hsp protein binding 
and release help the target protein to 
refold, as schematically illustrated in 
Figure 6-85. 



Exposed Hydrophobic Regions Provide Critical Signals for Protein 
Quality Control 

If radioactive amino acids are added to cells for a brief period, the newly syn- 
thesized proteins can be followed as they mature into their final functional form. 
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Figure 6-87 The structure and function of the Hsp60 family of molecular chaperones. (A) The 

catalysis of protein refolding. A misfolded protein is initially captured by hydrophobic interactions 
along one rim of the barrel. The subsequent binding of ATP plus a protein cap increases the 
diameter of the barrel rim, which may transiently stretch (partly unfold) the client protein. This 
also confines the protein in an enclosed space, where it has a new opportunity to fold. After 
about 15 seconds, ATP hydrolysis occurs, weakening the complex. Subsequent binding of another 
ATP molecule ejects the protein, whether folded or not, and the cycle repeats. This type of 
molecular chaperone is also known as a chaperonin; it is designated as Hsp60 in mitochondria, 
TCP1 in the cytosol of vertebrate cells, and GroEL in bacteria. As indicated, only half of the 
symmetrical barrel operates on a client protein at any one time. (B) The structure of GroEL bound 
to its GroES cap, as determined by X-ray crystallography. On the left is shown the outside of the 
barrel-like structure and on the right a cross section through its center. (B, adapted from B. Bukau 
and A.L Horwich, Cell 92:35 1-366, 1998. With permission from Elsevier.) 
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This type of experiment demonstrates that the Hsp70 proteins act first, begin- 
ning when a protein is still being synthesized on a ribosome, and the Hsp60-like 
proteins act only later to help fold completed proteins. But how does the cell dis- 
tinguish misfolded proteins, which require additional rounds of ATP- catalyzed 
refolding, from those with correct structures? 

Before answering, we need to pause to consider the post-translational fate 
of proteins more broadly Usually, if a protein has a sizable exposed patch of 
hydrophobic amino acids on its surface, it is abnormal: it has either failed to fold 
correctly after leaving the ribosome, suffered an accident that partly unfolded it 
at a later time, or failed to find its normal partner subunit in a larger protein 
complex. Such a protein is not merely useless to the cell, it can be dangerous. 
Many proteins with an abnormally exposed hydrophobic region can form large 
aggregates in the cell. We shall see that, in rare cases, such aggregates do form 
and cause severe human diseases. Normally, however, powerful protein quality 
control mechanisms prevent such disasters. 

Given this background, it is not surprising that cells have evolved elaborate 
mechanisms that recognize the hydrophobic patches on proteins and minimize 
the damage they cause. Two of these mechanisms depend on the molecular 
chaperones just discussed, which bind to the patch and attempt to repair the 
defective protein by giving it another chance to fold. At the same time, by cover- 
ing the hydrophobic patches, these chaperones transiently prevent protein 
aggregation. Proteins that very rapidly fold correctly on their own do not display 
such patches and the chaperones bypass them. 

Figure 6-88 outlines all of the quality control choices that a cell makes for a 
difficult-to-fold, newly synthesized protein. As indicated, when attempts to 
refold a protein fail, a third mechanism is called into play that completely 
destroys the protein by proteolysis. The proteolytic pathway begins with the 
recognition of an abnormal hydrophobic patch on a protein's surface, and it 
ends with the delivery of the entire protein to a protein destruction machine, a 
complex protease known as the proteasome. As described next, this process 
depends on an elaborate protein-marking system that also carries out other 
central functions in the cell by destroying selected normal proteins. 
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The Proteasome Is a Compartmentalized Protease with 
Sequestered Active Sites 

The proteolytic machinery and the chaperones compete with one another to 
reorganize a misfolded protein. If a newly synthesized protein folds rapidly, at 
most only a small fraction of it is degraded. In contrast, a slowly folding protein 
is vulnerable to the proteolytic machinery for a longer time, and many more of 
its molecules are destroyed before the remainder attain the proper folded state. 
Due to mutations or to errors in transcription, RNA splicing, and translation, 
some proteins never fold properly. It is particularly important that the cell 
destroy these potentially harmful proteins. 

The apparatus that deliberately destroys aberrant proteins is the protea- 
some, an abundant ATP-dependent protease that constitutes nearly 1% of cell 
protein. Present in many copies dispersed throughout the cytosol and the 
nucleus, the proteasome also destroys aberrant proteins of the endoplasmic 
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Figure 6-88 The processes that monitor 
protein quality following protein 
synthesis. A newly synthesized protein 
sometimes folds correctly and assembles 
on its own with its partner proteins, in 
which case the quality control 
mechanisms leave it alone. Incompletely 
folded proteins are helped to refold by 
molecular chaperones: first by a family of 
Hsp70 proteins, and then in some cases, 
by Hsp60-like proteins. For both types of 
chaperones, the client proteins are 
recognized by an abnormally exposed 
patch of hydrophobic amino acids on their 
surface. These "protein-rescue" processes 
compete with another mechanism that, 
upon recognizing an abnormally exposed 
patch, marks the protein for destruction 
by the proteasome. The combined activity 
of all of these processes is needed to 
prevent massive protein aggregation in a 
cell, which can occur when many 
hydrophobic regions on proteins clump 
together nonspecifically. 
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reticulum (ER). An ER-based surveillance system detects proteins that fail either 
to fold or to be assembled properly after they enter the ER, and retrotranslocates 
them back to the cytosol for degradation (discussed in Chapter 12). 

Each proteasome consists of a central hollow cylinder (the 20S core protea- 
some) formed from multiple protein subunits that assemble as a quasi- cylindri- 
cal stack of four heptameric rings (Figure 6-89). Some of the subunits are dis- 
tinct proteases whose active sites face the cylinder's inner chamber. The design 
prevents these highly efficient proteases from running rampant through the cell. 
Each end of the cylinder is normally associated with a large protein complex (the 
19S cap), which contains a six-subunit protein ring, through which target pro- 
teins are threaded into the proteasome core where they are degraded (Figure 
6-90). The threading reaction, driven by ATP hydrolysis, unfolds the target pro- 
teins as they move through the cap, exposing them to the proteases lining the 
proteasome core (Figure 6-91). The proteins that make up the ring structure in 
the proteasome cap belong to a large class of protein "unfoldases" known as AAA 
proteins. Many of them function as hexamers, and it is possible that they share 
mechanistic features with the ATP- dependent unwinding of DNA by DNA heli- 
cases (see Figure 5-15). 

A crucial property of the proteasome, and one reason for the complexity of 
its design, is the processivity of its mechanism: in contrast to a "simple" protease 
that cleaves a substrate's polypeptide chain just once before dissociating, the 
proteasome keeps the entire substrate bound until all of it is converted into 
short peptides. 

The 19S caps also act as regulated "gates" at the entrances to the inner pro- 
teolytic chamber, and they are responsible for binding a targeted protein sub- 
strate to the proteasome. With a few exceptions, the proteasomes act on pro- 
teins that have been specifically marked for destruction by the covalent 
attachment of a recognition tag formed from a small protein called ubiquitin 
(Figure 6-92A). Ubiquitin exists in cells either free or covalently linked to 
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Figure 6-89 The proteasome. (A) A cut- 
away view of the structure of the central 
20S cylinder, as determined by x-ray 
crystallography, with the active sites of the 
proteases indicated by red dots. 
(B) The entire proteasome, in which the 
central cylinder (yellow) is supplemented 
by a 1 9S cap (blue) at each end. The cap 
structure has been determined by 
computer processing of electron 
microscope images. The complex cap (also 
called the regulatory particle) selectively 
binds proteins that have been marked by 
ubiquitin for destruction; it then uses ATP 
hydrolysis to unfold their polypeptide 
chains and feed them through a narrow 
channel (see Figure 6-91) into the inner 
chamber of the 20S cylinder for digestion 
to short peptides. (B, from W. Baumeister 
et al., Cell 92:367-380, 1 998. With 
permission from Elsevier.) 



Figure 6-90 Processive protein 
digestion by the proteasome. The 

proteasome cap recognizes a substrate 
protein, in this case marked by a 
polyubiquitin chain (see Figure 6-92), 
and subsequently translocates it into the 
proteasome core, where it is digested. At 
an early stage, the ubiquitin is cleaved 
from the substrate protein and is 
recycled. Translocation into the core of 
the proteasome is mediated by a ring of 
ATP-dependent proteins that unfold the 
substrate protein as it is threaded 
through the ring and into the 
proteasome core (see Figure 6-91). (From 
S. Prakash and A. Matouschek, Trends 
Biochem. Sci. 29:593-600, 2004. With 
permission from Elsevier.) 
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Figure 6-91 A hexameric protein unfoldase. (A) The structure is formed from six subunits 
each belonging to the AAA family of proteins. (B) Model for the ATP-dependent unfoldase 
activity of AAA proteins. The ATP-bound form of a hexameric ring of AAA proteins binds a 
folded substrate protein that has been marked for unfolding (and eventual destruction) 
by a recognition tag such as a polyubiquitin chain (see below) or the peptide added to 
mark incompletely synthesized proteins (see Figure 6-81). A conformational change, 
made irreversible by ATP hydrolysis, pulls the substrate into the central core and strains 
the ring structure. At this point, the substrate protein, which is being tugged upon, can 
partially unfold and enter further into the pore or it can maintain its structure and 
dissociate. Very stable protein substrates may require hundreds of cycles of ATP hydrolysis 
and dissociation before they are successfully pulled into the AAA ring. Once unfolded, the 
substrate protein moves relatively quickly through the pore by successive rounds of ATP 
hydrolysis. (A, from X. Zhang et al., Mol. Cell 6:1473-1484, 2000, and A.N. Lupas and 
J. Martin, Cum Opin. Struct Biol. 1 2:746-753, 2002; B, from R.T. Sauer et al., Cell 1 1 9:9-1 8, 
2004. All with permission from Elsevier.) 
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many different intracellular proteins. For many proteins, tagging by ubiquitin 
results in their destruction by the proteasome. However, in other cases, ubiqui- 
tin tagging has an entirely different meaning. Ultimately, it is the number of 
ubiquitin molecules added and the way in which they are linked together that 
determines how the cell interprets the ubiquitin message (Figure 6-93). In the 
following sections, we emphasize the role of ubiquitylation in signifying protein 
degradation. 



An Elaborate Ubiquitin-Conjugating System Marks Proteins for 
Destruction 

Ubiquitin is prepared for conjugation to other proteins by the ATP-dependent 
ubiquitin- activating enzyme (El), which creates an activated, El-bound ubiqui- 
tin that is subsequently transferred to one of a set of ubiquitin- conjugating (E2) 
enzymes (Figure 6-92B). The E2 enzymes act in conjunction with accessory (E3) 
proteins. In the E2-E3 complex, called ubiquitin ligase, the E3 component binds 
to specific degradation signals, called degrons, in protein substrates, helping E2 
to form a polyubiquitin chain linked to a lysine of the substrate protein. In this 
chain, the C-terminal residue of each ubiquitin is linked to a specific lysine of 
the preceding ubiquitin molecule (see Figure 6-93), producing a linear series of 
ubiquitin-ubiquitin conjugates (Figure 6-92C). It is this polyubiquitin chain on 
a target protein that is recognized by a specific receptor in the proteasome. 

There are roughly 30 structurally similar but distinct E2 enzymes in mam- 
mals, and hundreds of different E3 proteins that form complexes with specific 
E2 enzymes. The ubiquitin-proteasome system thus consists of many distinct 
but similarly organized proteolytic pathways, which have in common both the 
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El enzyme at the "top" and the proteasome at the "bottom," and differ by the 
compositions of their E2-E3 ubiquitin ligases and accessory factors. Distinct 
ubiquitin ligases recognize different degradation signals, and therefore target 
distinct subsets of intracellular proteins for destruction. 

Denatured or otherwise misfolded proteins, as well as proteins containing 
oxidized or other abnormal amino acids, are recognized and destroyed because 
abnormal proteins tend to present on their surface amino acid sequences or 
conformational motifs that are recognized as degradation signals by a set of E3 
molecules in the ubiquitin-proteasome system; these sequences must of course 
be buried and therefore inaccessible in the normal counterparts of these pro- 
teins. However, a proteolytic pathway that recognizes and destroys abnormal 
proteins must be able to distinguish between completed proteins that have 
"wrong" conformations and the many growing polypeptides on ribosomes (as 
well as polypeptides just released from ribosomes) that have not yet achieved 
their normal folded conformation. This is not a trivial problem; the 
ubiquitin-proteasome system is thought to destroy many of the nascent and 
newly formed protein molecules not because these proteins are abnormal as 
such, but because they transiently expose degradation signals that are buried in 
their mature (folded) state. 
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Figure 6-92 Ubiquitin and the marking of proteins with polyubiquitin 
chains. (A) The three-dimensional structure of ubiquitin; this relatively 
small protein contains 76 amino acids. (B) The C-terminus of ubiquitin is 
initially activated through its high-energy thioester linkage to a cysteine 
side chain on the E1 protein. This reaction requires ATP, and it proceeds via 
a covalent AMP-ubiquitin intermediate. The activated ubiquitin on E1, also 
known as the ubiquitin-activating enzyme, is then transferred to the 
cysteines on a set of E2 molecules. These E2s exist as complexes with an 
even larger family of E3 molecules. (C) The addition of a polyubiquitin 
chain to a target protein. In a mammalian cell there are several hundred 
distinct E2-E3 complexes, many of which recognize a specific degradation 
signal on target proteins by means of the E3 component. The E2s are called 
ubiquitin-conjugating enzymes. The E3s have been referred to traditionally 
as ubiquitin ligases, but it is more accurate to reserve this name for the 
functional E2-E3 complex. The detailed structure of such a complex is 
presented in Figure 3-79. 
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Many Proteins Are Controlled by Regulated Destruction 

One function of intracellular proteolytic mechanisms is to recognize and elimi- 
nate misfolded or otherwise abnormal proteins, as just described. Yet another 
function of these proteolytic pathways is to confer short lifetimes on specific 
normal proteins whose concentrations must change promptly with alterations 
in the state of a cell. Some of these short-lived proteins are degraded rapidly at 
all times, while many others are conditionally short-lived, that is, they are 
metabolically stable under some conditions but become unstable upon a 
change in the cell's state. For example, mitotic cyclins are long-lived throughout 
the cell cycle until their sudden degradation at the end of mitosis, as explained 
in Chapter 17. 

How is such a regulated destruction of a protein controlled? Several mecha- 
nisms are illustrated through specific examples that appear later in this book. In 
one general class of mechanism (Figure 6-94A), the activity of a ubiquitin ligase 
is turned on either by E3 phosphorylation or by an allosteric transition in an E3 
protein caused by its binding to a specific small or large molecule. For example, 
the anaphase-promoting complex (APC) is a multisubunit ubiquitin ligase that 
is activated by a cell-cycle-timed subunit addition at mitosis. The activated APC 
then causes the degradation of mitotic cyclins and several other regulators of the 
metaphase-anaphase transition (see Figure 17-44). 

Alternatively, in response either to intracellular signals or to signals from the 
environment, a degradation signal can be created in a protein, causing its rapid 
ubiquitylation and destruction by the proteasome. One common way to create 
such a signal is to phosphorylate a specific site on a protein that unmasks a nor- 
mally hidden degradation signal. Another way to unmask such a signal is by the 
regulated dissociation of a protein subunit. Finally, powerful degradation signals 
can be created by cleaving a single peptide bond, provided that this cleavage 
creates a new N-terminus that is recognized by a specific E3 as a "destabilizing" 
N-terminal residue (Figure 6-94B). 

The N-terminal type of degradation signal arises because of the "N-end 
rule," which relates the lifetime of a protein in vivo to the identity of its N-ter- 
minal residue. There are 12 destabilizing residues in the N-end rule of the yeast 
S. cerevisiae (Arg, Lys, His, Phe, Leu, Tyr, Trp, He, Asp, Glu, Asn, and Gin), out of 
the 20 standard amino acids. The destabilizing N-terminal residues are recog- 
nized by a special ubiquitin ligase that is conserved from yeast to humans. 

As we have seen, all proteins are initially synthesized bearing methionine 
(or formylmethionine in bacteria), as their N-terminal residue, which is a sta- 
bilizing residue in the N-end rule. Special proteases, called methionine 
aminopeptidases, will often remove the first methionine of a nascent protein, 
but they will do so only if the second residue is also stabilizing according to N- 
end rule. Therefore, it was initially unclear how N-end rule substrates form in 
vivo. However, it is now understood that these substrates are formed by site- 
specific proteases. For example, a subunit of cohesin, a protein complex that 
holds sister chromatids together, is cleaved by a highly specific protease during 
the metaphase-anaphase transition. This cell- cycle -regulated cleavage allows 
separation of the sister chromatids and leads to the completion of mitosis (see 



Figure 6-93 The marking of proteins by 
ubiquitin. Each modification pattern 
shown can have a specific meaning to the 
cell. The two types of polyubiquitylation 
differ in the way the ubiquitin molecules 
are linked together. Linkage through Lys48 
signifies degradation by the proteasome 
whereas that through Lys63 has other 
meanings. Ubiquitin markings are "read" 
by proteins that specifically recognize 
each type of modification. 
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Figure 17-44). The C-terminal fragment of the cleaved subunit bears an N-ter- 
minal arginine, a destabilizing residue in the N-end rule. Mutant cells lacking 
the N-end rule pathway exhibit a greatly increased frequency of chromosome 
loss, presumably because a failure to degrade this fragment of the cohesin sub- 
unit interferes with the formation of new chromatid- associated cohesin com- 
plexes in the next cell cycle. 



Abnormally Folded Proteins Can Aggregate to Cause Destructive 
Human Diseases 

Many inherited human diseases (for example, sickle-cell anemia (see p. 1495) 
and a- 1 -antitrypsin deficiency, a condition that often leads to liver disease and 
emphysema) result from mutant proteins that escape the cell's quality controls, 
fold abnormally, and form aggregates. By absorbing critical macromolecules, 
these aggregates can severely damage cells and even cause cell death. Often, the 
inheritance of a single mutant allele of a gene can cause disease, since the nor- 
mal copy of the gene cannot protect the cell from the destructive properties of 
the aggregate. 

In normal humans, the gradual decline of the cell's protein quality controls 
can also cause disease by permitting normal proteins to form aggregates (Fig- 
ure 6-95). In some cases, the protein aggregates are released from dead cells 
and accumulate in the extracellular matrix that surrounds the cells in a tissue, 
and in extreme cases they can also damage tissues. Because the brain is com- 
posed of a highly organized collection of nerve cells, it is especially vulnerable. 
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Figure 6-94 Two general ways of 
inducing the degradation of a specific 
protein. (A) Activation of a specific E3 
molecule creates a new ubiquitin ligase. 
(B) Creation of an exposed degradation 
signal in the protein to be degraded. This 
signal binds a ubiquitin ligase, causing 
the addition of a polyubiquitin chain to a 
nearby lysine on the target protein. All six 
pathways shown are known to be used 
by cells to induce the movement of 
selected proteins into the proteasome. 
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Figure 6-95 Protein aggregates that cause human disease. (A) Schematic illustration of the type of conformational change in a 
protein that produces material for a cross-beta filament. (B) Diagram illustrating the self-infectious nature of the protein 
aggregation that is central to prion diseases. PrP (prion protein) is highly unusual because the misfolded version of the protein, 
called PrP*, induces the normal PrP protein it contacts to change its conformation, as shown. Most of the human diseases caused 
by protein aggregation are caused by the overproduction of a variant protein that is especially prone to aggregation, but the 
protein aggregate cannot spread from one animal to another. (C) Drawing of a cross-beta filament, a common type of protease- 
resistant protein aggregate found in many human neurological diseases. Because the hydrogen-bond interactions in a (3 sheet 
form between polypeptide backbone atoms (see Figure 3-9), a number of different abnormally folded proteins can produce this 
structure. (D) One of several possible models for the conversion of PrP to PrP*, showing the likely change of two a helices into four 
P strands. Although the structure of the normal protein has been determined accurately, the structure of the infectious form is not 
yet known with certainty because the aggregation has prevented the use of standard structural techniques. (C, courtesy of Louise 
Serpell, adapted from M. Sunde et a\.,J. Mol. Biol. 273:729-739, 1997. With permission from Academic Press; D, adapted from 
S.B. Prusiner, Trends Biochem. Sci. 21 :482-487, 1996. With permission from Elsevier.) 

Not surprisingly, therefore, protein aggregates primarily cause neurodegenera- 
tive diseases. Prominent among these are Huntington's disease and Alzheimer's 
disease — the latter causing age-related dementia in more than 20 million people 
in today's world. 

For a particular type of protein aggregate to survive, grow, and damage an 
organism, it must be highly resistant to proteolysis both inside and outside the 
cell. Many of the protein aggregates that cause problems form fibrils built from 
a series of polypeptide chains that are layered one over the other as a continu- 
ous stack of (3 sheets. This so-called cross-beta filament (Figure 6-95C), a struc- 
ture particularly resistant to proteolysis, is observed in many of the neurological 
disorders caused by protein aggregates, where it produces distinctly staining 
deposits known as amyloids. 

One particular variety of these pathologies has attained special notoriety. 
These are the prion diseases. Unlike Huntington's or Alzheimer's, prion diseases 
can spread from one organism to another, providing that the second organism 
eats a tissue containing the protein aggregate. A set of diseases — called scrapie 
in sheep, Creutzfeldt-Jacob disease (CJD) in humans, and bovine spongiform 
encephalopathy (BSE) in cattle — are caused by a misfolded, aggregated form of 
a protein called PrP (for prion protein). The PrP is normally located on the outer 
surface of the plasma membrane, most prominently in neurons. Its normal 
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function is not known. However, PrP has the unfortunate property of being con- 
vertible to a very special abnormal conformation (see Figure 6-95A). This con- 
formation not only forms protease-resistant, cross-beta filaments; it is also 
"infectious" because it converts normally folded molecules of PrP to the same 
pathological form. This property creates a positive feedback loop that propa- 
gates the abnormal form of PrP, called PrP* (see Figure 6-95B) and thereby 
allows the pathological conformation to spread rapidly from cell to cell in the 
brain, eventually causing death in both animals and humans. It can be danger- 
ous to eat the tissues of animals that contain PrP*, as witnessed by the spread of 
BSE (commonly referred to as "mad cow disease") from cattle to humans in 
Great Britain. Fortunately, in the absence of PrP*, PrP is extraordinarily difficult 
to convert to its abnormal form. 

Although very few proteins have the potential to misfold into an infectious 
conformation, another example causes an otherwise mysterious "protein- only 
inheritance" observed in yeast cells. The ability to study infectious proteins in 
yeast has clarified another remarkable feature of prions. These protein 
molecules can form several distinctively different types of aggregates from the 
same polypeptide chain. Moreover, each type of aggregate can be infectious, 
forcing normal protein molecules to adopt the same type of abnormal structure. 
Thus, several different "strains" of infectious particles can arise from the same 
polypeptide chain (Figure 6-96). How a single polypeptide sequence can adopt 
multiple aggregate forms is not fully understood; it is possible that all prion 
aggregates resemble cross-beta filaments (see Figure 6-95C) where the structure 
is held together predominantly with main peptide chain interactions. This 
would leave the amino acid side chains free to adopt different conformations 
and, if the structures are self- propagating, the existence of different strains 
could be explained. 

Finally, although prions were discovered because they cause disease, they 
also appear to have some positive roles in the cell. For example, some species of 
fungi use prion transformations to establish different types of cells. Although the 
idea is controversial, it has even been proposed that prions have a role in con- 
solidating memories in complex, multicellular organisms like ourselves. 



Figure 6-96 Creation of different prion 
strains in vitro. In this experiment, 
amyloid fibers were denatured and the 
components renatured at different 
temperatures. This treatment produced 
three distinctive types of amyloids, each 
of which could self-propagate when new 
subunits are added. 
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Figure 6-97 The production of a protein 
by a eucaryotic cell. The final level of 
each protein in a eucaryotic cell depends 
upon the efficiency of each step 
depicted. 
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There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reactions 
are required to produce a properly folded protein from the information contained 
in a gene (Figure 6-97) . The final level of a properly folded protein in a cell there- 
fore depends upon the efficiency with which each of the many steps is performed. 
In the following chapter, we shall see that cells have the ability to change the 
levels of their proteins according to their needs. In principle, any or all of the 
steps in Figure 6-97 could be regulated for each individual protein. As we shall 
see in Chapter 7, there are examples of regulation at each step from gene to pro- 
tein. However, the initiation of transcription is the most common point for a cell 
to regulate the expression of each of its genes. This makes sense, inasmuch as 
the most efficient way to keep a gene from being expressed is to block the very 
first step — the transcription of its DNA sequence into an RNA molecule. 



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribonucleoprotein assembly called a ribosome. The 
amino acids used for protein synthesis are first attached to a family oftRNA molecules, 
each of which recognizes, by complementary base-pair interactions, particular sets of 
three nucleotides in the mRNA (codons). The sequence of nucleotides in the mRNA is 
then read from one end to the other in sets of three according to the genetic code. 
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To initiate translation, a small ribosomal subunit binds to the mRNA molecule at 
a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A large 
ribosomal subunit binds to complete the ribosome and begin protein synthesis. Dur- 
ing this phase, aminoacyl-tRNAs — each bearing a specific amino acid — bind sequen- 
tially to the appropriate codons in mRNA through complementary base pairing 
between tRNA anticodons and mRNA codons. Each amino acid is added to the C- ter- 
minal end of the growing polypeptide in four sequential steps: aminoacyl-tRNA bind- 
ing, followed by peptide bond formation, followed by two ribosome translocation 
steps. Elongation factors use GTP hydrolysis to drive these reactions forward and to 
improve the accuracy of amino acid selection. The mRNA molecule progresses codon 
by codon through the ribosome in the 5 ' -to-3 direction until it reaches one of three 
stop codons. A release factor then binds to the ribosome, terminating translation and 
releasing the completed polypeptide. 

Eucaryotic and bacterial ribosomes are closely related, despite differences in the 
number and size of their rRNA and protein components. The rRNA has the dominant 
role in translation, determining the overall structure of the ribosome, forming the 
binding sites for the tRNAs, matching the tRNAs to codons in the mRNA, and creating 
the active site of the peptidyl transferase enzyme that links amino acids together dur- 
ing translation. 

In the final steps of protein synthesis, two distinct types of molecular chaperones 
guide the folding of polypeptide chains. These chaperones, known as Hsp60 and 
Hsp70, recognize exposed hydrophobic patches on proteins and serve to prevent the 
protein aggregation that would otherwise compete with the folding of newly synthe- 
sized proteins into their correct three-dimensional conformations. This protein folding 
process must also compete with an elaborate quality control mechanism that destroys 
proteins with abnormally exposed hydrophobic patches. In this case, ubiquitin is cova- 
lently added to a misfolded protein by a ubiquitin ligase, and the resulting polyubiq- 
uitin chain is recognized by the cap on a proteasome that moves the entire protein to 
the interior of the proteasome for proteolytic degradation. A closely related proteolytic 
mechanism, based on special degradation signals recognized by ubiquitin ligases, is 
used to determine the lifetimes of many normally folded proteins. By this method, 
selected normal proteins are removed from the cell in response to specific signals. 



THE RNA WORLD AND THE ORIGINS OF LIFE 

We have seen that the expression of hereditary information requires extraordi- 
narily complex machinery and proceeds from DNA to protein through an RNA 
intermediate. This machinery presents a central paradox: if nucleic acids are 
required to synthesize proteins and proteins are required, in turn, to synthesize 
nucleic acids, how did such a system of interdependent components ever arise? 
One view is that an RNA world existed on Earth before modern cells arose (Fig- 
ure 6-98). According to this hypothesis, RNA both stored genetic information 
and catalyzed the chemical reactions in primitive cells. Only later in evolution- 
ary time did DNA take over as the genetic material and proteins become the 
major catalyst and structural component of cells. If this idea is correct, then the 
transition out of the RNA world was never complete; as we have seen in this 
chapter, RNA still catalyzes several fundamental reactions in modern-day cells, 
which can be viewed as molecular fossils of an earlier world. 
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Figure 6-98 Time line for the universe, 
suggesting the early existence of an 
RNA world of living systems. 
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In this section we present some of the arguments in support of the RNA 
world hypothesis. We will see that several of the more surprising features of 
modern-day cells, such as the ribosome and the pre-mRNA splicing machinery, 
are most easily explained by viewing them as descendants of a complex network 
of RNA-mediated interactions that dominated cell metabolism in the RNA 
world. We also discuss how DNA may have taken over as the genetic material, 
how the genetic code may have arisen, and how proteins may have eclipsed RNA 
to perform the bulk of biochemical catalysis in modern-day cells. 



Life Requires Stored Information 

It has been proposed that the first "biological" molecules on Earth were formed 
by metal-based catalysis on the crystalline surfaces of minerals. In principle, an 
elaborate system of molecular synthesis and breakdown (metabolism) could 
have existed on these surfaces long before the first cells arose. Although contro- 
versial, many scientists believe that an extensive phase of "chemical evolution" 
took place on the prebiotic Earth, during which small molecules that could cat- 
alyze their own synthesis competed with each other for raw materials. 

But life requires much more than this. As described in Chapter 1, heredity 
is perhaps the central feature of life. Not only must a cell use raw materials to 
create a network of catalyzed reactions, it must do so according to an elaborate 
set of instructions encoded in the hereditary information. The replication of 
this information ensures that the complex metabolism of cells can accurately 
reproduce itself. Another crucial feature of life is the genetic variability that 
results from changes in the hereditary information. This variability, acted 
upon by selective pressures, is responsible for the great diversity of life on our 
planet. 

Thus, the emergence of life requires a way to store information, a way to 
duplicate it, a way to change it, and a way to convert the information through 
catalysis into favorable chemical reactions. But how could such a system begin 
to be formed? In present-day cells the most versatile catalysts are polypeptides, 
composed of many different amino acids with chemically diverse side chains 
and, consequently, able to adopt diverse three-dimensional forms that bristle 
with reactive chemical groups. Polypeptides also carry information, in the order 
of their amino acid subunits. But there is no known way in which a polypeptide 
can reproduce itself by directly specifying the formation of another of precisely 
the same sequence. 



Polynucleotides Can Both Store Information and Catalyze 
Chemical Reactions 

Polynucleotides have one property that contrasts with those of polypeptides: they 
can directly guide the formation of copies of their own sequence. This capacity 
depends on complementary base pairing of nucleotide subunits, which enables 
one polynucleotide to act as a template for the formation of another. As we have 
seen in this and the preceding chapter, such complementary templating mecha- 
nisms lie at the heart of DNA replication and transcription in modern-day cells. 

But the efficient synthesis of polynucleotides by such complementary tem- 
plating mechanisms requires catalysts to promote the polymerization reaction: 
without catalysts, polymer formation is slow, error-prone, and inefficient. Today, 
template-based nucleotide polymerization is rapidly catalyzed by protein 
enzymes — such as the DNA and RNA polymerases. How could such polymeriza- 
tion be catalyzed before proteins with the appropriate enzymatic specificity 
existed? The beginnings of an answer to this question came from the discovery 
in 1982 that RNA molecules themselves can act as catalysts. We have seen in this 
chapter, for example, that a molecule of RNA catalyzes one of the central reac- 
tions in the cell, the covalent joining of amino acids to form proteins. The unique 
potential of RNA molecules to act both as information carrier and as catalyst 
forms the basis of the RNA world hypothesis. 
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Figure 6-99 An RNA molecule that can catalyze its own synthesis. This 
hypothetical process would require catalysis of the production of both a 
second RNA strand of complementary nucleotide sequence and the use of 
this second RNA molecule as a template to form many molecules of RNA 
with the original sequence. The red rays represent the active site of this 
hypothetical RNA enzyme. 



RNA therefore has all the properties required of a molecule that could cat- 
alyze a variety of chemical reactions, including those that lead to its own syn- 
thesis (Figure 6-99). Although self- replicating systems of RNA molecules have 
not been found in nature, scientists are confident that they can be constructed 
in the laboratory While this demonstration would not prove that self- replicating 
RNA molecules were essential in the origin of life on Earth, it would certainly 
indicate that such a scenario is possible. 
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A Pre-RNA World May Predate the RNA World 

Although RNA seems well suited to form the basis for a self-replicating set of bio- 
chemical catalysts, it is not clear that RNA was the first kind of molecule to do so. 
From a purely chemical standpoint, it is difficult to imagine how long RNA 
molecules could be formed initially by purely nonenzymatic means. For one 
thing, the precursors of RNA, the ribonucleotides, are difficult to form nonenzy- 
matically Moreover, the formation of RNA requires that a long series of 3'-to-5' 
phosphodiester linkages assemble in the face of a set of competing reactions, 
including hydrolysis, 2'-to-5' linkages, and 5'-to-5' linkages. Given these prob- 
lems, it has been suggested that the first molecules to possess both catalytic 
activity and information storage capabilities may have been polymers that 
resemble RNA but are chemically simpler (Figure 6-100). We do not have any 
remnants of these compounds in present-day cells, nor do such compounds 
leave fossil records. Nonetheless, the relative simplicity of these "RNA-like poly- 
mers" suggests that one of them, rather than RNA itself, may have been the first 
biopolymer on Earth capable of both information storage and catalytic activity. 
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Figure 6-100 Structures of RNA and two 
related information-carrying polymers. 

In each case, B indicates a purine or 
pyrimidine base. The polymer TNA 
(threose nucleic acid) has a 4-carbon 
sugar unit in contrast to the 5-carbon 
ribose in RNA. In PNA (peptide nucleic 
acid), the ribose phosphate backbone of 
RNA has been replaced by the peptide 
backbone found in proteins. Like RNA, 
TNA and PNA can form double helices 
through complementary base-pairing, 
and each could therefore in principle 
serve as a template for its own synthesis. 
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If the pre-RNA world hypothesis is correct, then a transition to the RNA 
world must have occurred, presumably through the synthesis of RNA using one 
of these simpler polymers as both template and catalyst. While the details of the 
pre-RNA and RNA worlds will likely remain unknown, we know for certain that 
RNA molecules can catalyze a wide variety of chemical reactions, and we now 
turn to the properties of RNA that make this possible. 



Figure 6-101 Common elements of RNA 
secondary structure. Conventional, 
complementary base-pairing interactions 
are indicated by red "rungs" in double- 
helical portions of the RNA. 



Single-Stranded RNA Molecules Can Fold into Highly Elaborate 
Structures 

We have seen that complementary base-pairing and other types of hydrogen 
bonds can occur between nucleotides in the same chain, causing an RNA 
molecule to fold up in a unique way determined by its nucleotide sequence (see, 
for example, Figures 6-6, 6-52, and 6-69). Comparisons of many RNA structures 
have revealed conserved motifs, short structural elements that are used over and 
over again as parts of larger structures. Figure 6-101 shows some of these RNA 
secondary structural motifs, and Figure 6-102 shows a few common examples 
of more complex and often longer- range interactions, known as RNA tertiary 
interactions. 



Figure 6-102 Examples of RNA tertiary 
interactions. Some of these interactions 
can join distant parts of the same RNA 
molecule or bring two separate RNA 
molecules together. 
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Figure 6-103 A ribozyme. This simple RNA molecule catalyzes the 
cleavage of a second RNA at a specific site. This ribozyme is found 
embedded in larger RNA genomes — called viroids — which infect plants. 
The cleavage, which occurs in nature at a distant location on the same RNA 
molecule that contains the ribozyme, is a step in the replication of the 
viroid genome. Although not shown in the figure, the reaction requires a 
Mg molecule positioned at the active site. (Adapted from T.R. Cech and 
O.C. Uhlenbeck, Nature 372:39-40, 1 994. With permission from Macmillan 
Publishers Ltd.) 



Protein catalysts require a surface with unique contours and chemical prop- 
erties on which a given set of substrates can react (discussed in Chapter 3). In 
exactly the same way, an RNA molecule with an appropriately folded shape can 
serve as an enzyme (Figure 6-103). Like some proteins, many of these 
ribozymes work by positioning metal ions at their active sites. This feature gives 
them a wider range of catalytic activities than the limited chemical groups of the 
polynucleotide chain. 

Relatively few catalytic RNAs are known to exist in modern-day cells, how- 
ever, and much of our inference about the RNA world has come from experi- 
ments in which large pools of RNA molecules of random nucleotide sequences 
are generated in the laboratory Those rare RNA molecules with a property spec- 
ified by the experimenter are then selected out and studied (Figure 6-104) . Such 
experiments have created RNAs that can catalyze a wide variety of biochemical 
reactions (Table 6-5), with reaction rate enhancements approaching those of 
proteins. Given these findings, it is not clear why protein catalysts greatly out- 
number ribozymes in modern cells. Experiments have shown that RNA 
molecules may have more difficulty than proteins in binding to flexible, 
hydrophobic substrates; moreover, the availability of 20 types of amino acids 
over four types of bases may provide proteins with a greater number of catalytic 
strategies. 

Like proteins, RNAs can undergo conformational changes, either in 
response to small molecules or to other RNAs. We saw several examples of this 
in the ribosome and the spliceosome, and we will see others in Chapter 7 when 
we discuss riboswitches. One of the most dramatic RNA conformational changes 
has been observed with an artificial ribozyme which can exist in two entirely dif- 
ferent conformations, each with a different catalytic activity (Figure 6-105). 
Since the discovery of catalysis by RNA, it has become clear that RNA is an enor- 
mously versatile molecule, and it is therefore not unreasonable to contemplate 
the past existence of an RNA world with a very high level of biochemical sophis- 
tication. 



Self-Replicating Molecules Undergo Natural Selection 

The three-dimensional folded structure of a polynucleotide affects its stability, 
its actions on other molecules, and its ability to replicate. Therefore, certain 
polynucleotides will be especially successful in any self-replicating mixture. 
Because errors inevitably occur in any copying process, new variant sequences 
of these polynucleotides will be generated over time. 

Certain catalytic activities would have had a cardinal importance in the early 
evolution of life. Consider in particular an RNA molecule that helps to catalyze 
the process of templated polymerization, taking any given RNA molecule as a 
template (Figure 6-106). Such a molecule, by acting on copies of itself, can repli- 
cate. At the same time, it can promote the replication of other types of RNA 
molecules in its neighborhood (Figure 6-107). If some of these neighboring 
RNAs have catalytic actions that help the survival of RNA in other ways (catalyz- 
ing ribonucleotide production, for example), a set of different types of RNA 
molecules, each specialized for a different activity, could evolve into a coopera- 
tive system that replicates with unusually great efficiency 

But for any of these cooperative systems to evolve, they must be present 
together in a compartment. For example, a set of mutually beneficial RNAs (such 
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Figure 6-104 In vitro selection of a synthetic ribozyme. Beginning with a 
large pool of nucleic acid molecules synthesized in the laboratory, those 
rare RNA molecules that possess a specified catalytic activity can be 
isolated and studied. Although a specific example (that of an 
autophosphorylating ribozyme) is shown, variations of this procedure have 
been used to generate many of the ribozymes listed in Table 6-5. During 
the autophosphorylation step, the RNA molecules are kept sufficiently 
dilute to prevent the "cross"-phosphorylation of additional RNA molecules. 
In reality, several repetitions of this procedure are necessary to select the 
very rare RNA molecules with this catalytic activity. Thus, the material 
initially eluted from the column is converted back into DNA, amplified 
many fold (using reverse transcriptase and PCR as explained in Chapter 8), 
transcribed back into RNA, and subjected to repeated rounds of selection. 
(Adapted from J.R. Lorsch and J.W. Szostak, Nature 371:31-36, 1994. With 
permission from Macmillan Publishers Ltd.) 

as those of Figure 6-107) could replicate themselves only if all the RNAs 
remained in the neighborhood of the RNA that is specialized for templated poly- 
merization. Moreover, compartmentalization would bar parasitic RNA 
molecules from entering the system. Selection of a set of RNA molecules accord- 
ing to the quality of the self- replicating systems they generated could not there- 
fore occur efficiently until some form of compartment evolved to contain them. 

An early, crude form of compartmentalization may have been simple 
adsorption on surfaces or particles. The need for more sophisticated types of 
containment is easily fulfilled by a class of small molecules that has the simple 
physicochemical property of being amphiphilic, that is, consisting of one part 
that is hydrophobic (water insoluble) and another part that is hydrophilic (water 
soluble). When such molecules are placed in water, they aggregate, arranging 
their hydrophobic portions as much in contact with one another as possible and 
their hydrophilic portions in contact with the water. Amphiphilic molecules of 
appropriate shape aggregate spontaneously to form bilayers, creating small 
closed vesicles whose aqueous contents are isolated from the external medium 
(Figure 6-108). The phenomenon can be demonstrated in a test tube by simply 
mixing phospholipids and water together: under appropriate conditions, small 
vesicles will form. All present-day cells are surrounded by a plasma membrane 
consisting of amphiphilic molecules — mainly phospholipids — in this configura- 
tion; we discuss these molecules in detail in Chapter 10. 

The spontaneous assembly of a set of amphiphilic molecules, enclosing a 
self- replicating mixture of RNAs (or pre -RNAs) and other molecules (Figure 



Table 6-5 Some Biochemical Reactions That Can Be Catalyzed by Ribozymes 
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Ligase fold 



HDVfold 



Figure 6-1 05 An RNA molecule that folds into two different ribozymes. This 88-nucleotide 
RNA, created in the laboratory, can fold into a ribozyme that carries out a self-ligation reaction 
(left) or a self-cleavage reaction (right). The ligation reaction forms a 2 / ,5 / phosphodiester 
linkage with the release of pyrophosphate. This reaction seals the gap (gray shading), which was 
experimentally introduced into the RNA molecule. In the reaction carried out by the HDV fold, 
the RNA is cleaved at this same position, indicated by the arrowhead. This cleavage resembles 
that used in the life cycle of HDV, a hepatitis B satellite virus, hence the name of the fold. Each 
nucleotide is represented by a colored dot, with the colors used simply to clarify the two 
different folding patterns. The folded structures illustrate the secondary structures of the two 
ribozymes with regions of base-pairing indicated by close oppositions of the colored dots. Note 
that the two ribozyme folds have no secondary structure in common. (Adapted from 
E.A. Schultes and D.P. Bartel, Science 289:448-452, 2000. With permission from AAAS.) 



6-109), presumably formed the first membrane-bounded cells. Although it is 
not clear at what point in the evolution of biological catalysts this might have 
occurred, once RNA molecules were sealed within a closed membrane they 
could begin to evolve in earnest as carriers of genetic instructions: new variants 
could be selected not merely on the basis of their own structure, but also accord- 
ing to their effect on the other molecules in the same compartment. The 
nucleotide sequences of the RNA molecules could now be expressed in the char- 
acter of a unitary living cell. 
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Figure 6-106 A ribozyme created in the 
laboratory that can catalyze templated 
synthesis of RNA from nucleoside 
triphosphates. (A) Schematic diagram of 
the ribozyme showing one step of the 
templated polymerization reaction it 
catalyzes. (B) Nucleotide sequence of the 
ribozyme with base pairings indicated. 
Although relatively inefficient (it can only 
synthesize short lengths of RNA), this 
ribozyme adds the correct base, as 
specified by the template, over 95% of 
the time. (From W.K. Johnston et al., 
Science 292:1 31 9-1 325, 2001 . With 
permission from AAAS.) 
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Figure 6-1 07 A family of mutually 
supportive RNA molecules. One 

molecule is a ribozyme that replicates 
itself as well as the other RNA molecules. 
The other molecules would catalyze 
secondary tasks needed for the survival 
of the cooperative system, for example, 
by synthesizing ribonucleotides for RNA 
synthesis or phospholipids for 
compartmentalization. 
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How Did Protein Synthesis Evolve? 

The molecular processes underlying protein synthesis in present-day cells seem 
inextricably complex. Although we understand most of them, they do not make 
conceptual sense in the way that DNA transcription, DNA repair, and DNA repli- 
cation do. It is especially difficult to imagine how protein synthesis evolved 
because it is now performed by a complex interlocking system of protein and 
RNA molecules; obviously the proteins could not have existed until an early ver- 
sion of the translation apparatus was already in place. The RNA world hypothe- 
sis is especially appealing because the use of RNA in both information and catal- 
ysis seems both economic and conceptually simple. As attractive as this idea is 
for envisioning early life, it does not explain how the modern-day system of pro- 
tein synthesis arose. Although we can only speculate on the origins of modern 
protein synthesis and the genetic code, several experimental observations have 
provided plausible scenarios. 

In modern cells, some short peptides (such as antibiotics) are synthesized 
without the ribosome; peptide synthetase enzymes assemble these peptides, 
with their proper sequence of amino acids, without mRNAs to guide their syn- 
thesis. It is plausible that this non-coded, primitive version of protein synthesis 
first developed during the RNA world where it would have been catalyzed by 
RNA molecules. This idea presents no conceptual difficulties because, as we 
have seen, rRNA catalyzes peptide bond formation in present-day cells. We also 
know that ribozymes created in the laboratory can perform specific aminoacy- 
lation reactions; that is, they can match specific amino acids to specific tRNAs. 
It is therefore possible that tRNA-like adapters, each matched to a specific 
amino acid, could have arisen in the RNA world, marking the beginnings of a 
genetic code. 

In principle, other RNAs (the precursors to mRNAs) could have served as 
crude templates to direct the nonrandom polymerization of a few different 
amino acids. Any RNA that helped guide the synthesis of a useful polypeptide 
would have a great advantage in the evolutionary struggle for survival. We can 
envision a relatively nonspecific peptidyl transferase ribozyme, which, over 
time, grew larger and acquired the ability to position charged tRNAs accurately 
on RNA templates — leading eventually to the modern ribosome. Once protein 



Figure 6-108 Formation of membrane by phospholipids. Because these 
molecules have hydrophilic heads and lipophilic tails, they align 
themselves at an oil/water interface with their heads in the water and their 
tails in the oil. In the water they associate to form closed bilayer vesicles in 
which the lipophilic tails are in contact with one another and the 
hydrophilic heads are exposed to the water. 
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Figure 6-109 Encapsulation of RNA by simple amphiphilic molecules. 

For these experiments, the clay mineral montmorillonite was used to bring 
together RNA and fatty acids. (A) A montmorillonite particle, coated by RNA 
{red) has become trapped inside a fatty acid vesicle {green). (B) RNA (red) in 
solution has been encapsulated by fatty acids (green). These experiments 
show that montmorillonite can greatly accelerate the spontaneous 
generation of vesicles from amphiphilic molecules and trap RNA inside 
them. It has been hypothesized that conceptually similar actions may have 
led to the first primitive cells on Earth. (From M.M. Hanczyc et al., Science 
302:618-622, 2003. With permission from AAAS.) 
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synthesis evolved, the transition to a protein- dominated world could proceed, 
with proteins eventually taking over the majority of catalytic and structural tasks 
because of their greater versatility, with 20 rather than 4 different subunits. 
Although the scenarios just discussed are highly speculative, the known proper- 
ties of RNA molecules are consistent with these ideas. 



All Present-Day Cells Use DNA as Their Hereditary Material 

If the evolutionary speculations embodied in the RNA world hypothesis are cor- 
rect, early cells would have differed fundamentally from the cells we know today 
in having their hereditary information stored in RNA rather than in DNA (Figure 
6-110). Evidence that RNA arose before DNA in evolution can be found in the 
chemical differences between them. Ribose, like glucose and other simple car- 
bohydrates, can be formed from formaldehyde (HCHO), a simple chemical 
which is readily produced in laboratory experiments that attempt to simulate 
conditions on the primitive Earth. The sugar deoxyribose is harder to make, and 
in present-day cells it is produced from ribose in a reaction catalyzed by a pro- 
tein enzyme, suggesting that ribose predates deoxyribose in cells. Presumably, 
DNA appeared on the scene later, but then proved more suitable than RNA as a 
permanent repository of genetic information. In particular, the deoxyribose in 
its sugar-phosphate backbone makes chains of DNA chemically more stable 
than chains of RNA, so that much greater lengths of DNA can be maintained 
without breakage. 

The other differences between RNA and DNA — the double-helical structure 
of DNA and the use of thymine rather than uracil — further enhance DNA stabil- 
ity by making the many unavoidable accidents that occur to the molecule much 
easier to repair, as discussed in detail in Chapter 5 (see pp. 296-297 and 300-301). 



Summary 

From our knowledge of present-day organisms and the molecules they contain, it 
seems likely that the development of the directly autocatalytic mechanisms funda- 
mental to living systems began with the evolution of families of molecules that could 
catalyze their own replication. With time, a family of cooperating RNA catalysts prob- 
ably developed the ability to direct the synthesis of polypeptides. DNA is likely to have 
been a late addition: as the accumulation of additional protein catalysts allowed 
more efficient and complex cells to evolve, the DNA double helix replaced RNA as a 
more stable molecule for storing the increased amounts of genetic information 
required by such cells. 



Figure 6-1 1 The hypothesis that RNA preceded DNA and proteins in 
evolution. In the earliest cells, pre-RNA molecules would have had 
combined genetic, structural, and catalytic functions and RNA would have 
gradually taken over these functions. In present-day cells, DNA is the 
repository of genetic information, and proteins perform the vast majority 
of catalytic functions in cells. RNA primarily functions today as a 
go-between in protein synthesis, although it remains a catalyst for a small 
number of crucial reactions. 
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PROBLEMS 

Which statements are true? Explain why or why not. 

6-1 The consequences of errors in transcription are less 
than those of errors in RNA replication. 

6-2 Since introns are largely genetic "junk," they do not 
have to be removed precisely from the primary transcript 
during RNA splicing. 

6-3 Wobble pairing occurs between the first position in 
the codon and the third position in the anticodon. 

6-4 Protein enzymes are thought to greatly outnum- 
ber ribozymes in modern cells because they catalyze a 
much greater variety of reactions at much faster rates 
than ribozymes. 

Discuss the following problems. 

6-5 In which direction along the template must the RNA 
polymerase in Figure Q6-1 be moving to have generated the 
supercoiled structures that are shown? Would you expect 
supercoils to be generated if the RNA polymerase were free 
to rotate about the axis of the DNA as it progressed along the 
template? 



RNA polymerase 




positive supercoils 



negative supercoils 



Figure Q6-1 Supercoils around a moving RNA polymerase (Problem 
6-5). 

6-6 Phosphates are attached to the CTD (C-terminal 
domain) of RNA polymerasse II during transcription. What 
are the various roles of RNA polymerase II CTD phosphory- 
lation? 

6-7 The human a- tropomyosin gene is alternatively 
spliced to produce several forms of oc-tropomyosin mRNA in 
various cell types (Figure Q6-2). For all forms of the mRNA, 
the encoded protein sequence is the same for exons 1 and 
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Figure Q6-2 Alternatively spliced mRNAs from the human 
a-tropomyosin gene (Problem 6-7). (A) Exons in the human 
a-tropomyosin gene. The locations and relative sizes of exons are 
shown by the blue and red rectangles. (B) Splicing patterns for four 
a-tropomyosin mRNAs. Splicing is indicated by lines connecting the 
exons that are included in the mRNA. 



10. Exons 2 and 3 are alternative exons used in different 
mRNAs, as are exons 7 and 8. Which of the following state- 
ments about exons 2 and 3 is the most accurate? Is that 
statement also the most accurate one for exons 7 and 8? 
Explain your answers. 

A. Exons 2 and 3 must have the same number of 
nucleotides. 

B. Exons 2 and 3 must each contain an integral number of 
codons (that is, the number of nucleotides divided by 3 must 
be an integer). 

C. Exons 2 and 3 must each contain a number of 
nucleotides that when divided by 3 leaves the same remain- 
der (that is, 0, 1, or 2). 

6-8 After treating cells with a chemical mutagen, you iso- 
late two mutants. One carries alanine and the other carries 
methionine at a site in the protein that normally contains 
valine (Figure Q6-3) . After treating these two mutants again 
with the mutagen, you isolate mutants from each that now 
carry threonine at the site of the original valine (Figure 
Q6-3). Assuming that all mutations involve single 
nucleotide changes, deduce the codons that are used for 
valine, methionine, threonine, and alanine at the affected 
site. Would you expect to be able to isolate valine -to -threo- 
nine mutants in one step? 



first Ala second Figure Q6-3 Two rounds of 

treatment y treatment mutagenesis and the altered 

amino acids at a single position 
Thr in a protein (Problem 6-8). 
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6-9 The elongation factor EF-Tu introduces two short 
delays between codon-anticodon base-pairing and forma- 
tion of the peptide bond. These delays increase the accuracy 
of protein synthesis. Describe these delays and explain how 
they improve the fidelity of translation. 

6-10 Both Hsp60-like and Hsp70 molecular chaperones 
share an affinity for exposed hydrophobic patches on pro- 
teins, using them as indicators of incomplete folding. Why 
do you suppose hydrophobic patches serve as critical sig- 
nals for the folding status of a protein? 

6-1 1 Most proteins require molecular chaperones to assist 
in their correct folding. How do you suppose the chaperones 
themselves manage to fold correctly? 

6-12 What is so special about RNA that makes it such an 
attractive evolutionary precursor to DNA and protein? What 
is it about DNA that makes it a better material than RNA for 
storage of genetic information? 

6-13 If an RNA molecule could form a hairpin with a sym- 
metric internal loop, as shown in Figure Q6-4, could the 
complement of this RNA form a similar structure? If so, 
would there be any regions of the two structures that are 
identical? Which ones? 
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Figure Q6-4 An RNA hairpin with a 
symmetric internal loop (Problem 6-13). 
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Chapter 7 



Control of Gene Expression 




An organism's DNA encodes all of the RNA and protein molecules required to 
construct its cells. Yet a complete description of the DNA sequence of an organ- 
ism — be it the few million nucleotides of a bacterium or the few billion 
nucleotides of a human — no more enables us to reconstruct the organism than 
a list of English words enables us to reconstruct a play by Shakespeare. In both 
cases, the problem is to know how the elements in the DNA sequence or the 
words on the list are used. Under what conditions is each gene product made, 
and, once made, what does it do? 

In this chapter we discuss the first half of this problem — the rules and mech- 
anisms by which a subset of the genes is selectively expressed in each cell. The 
mechanisms that control the expression of genes operate at many levels, and we 
discuss the different levels in turn. We begin with an overview of some basic 
principles of gene control in multicellular organisms. 



AN OVERVIEW OF GENE CONTROL 

The different cell types in a multicellular organism differ dramatically in both 
structure and function. If we compare a mammalian neuron with a lymphocyte, 
for example, the differences are so extreme that it is difficult to imagine that the 
two cells contain the same genome (Figure 7-1). For this reason, and because 
cell differentiation is often irreversible, biologists originally suspected that genes 
might be selectively lost when a cell differentiates. We now know, however, that 
cell differentiation generally depends on changes in gene expression rather than 
on any changes in the nucleotide sequence of the cell's genome. 
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The Different Cell Types of a Multicellular Organism Contain the 
Same DNA 

The cell types in a multicellular organism become different from one another 
because they synthesize and accumulate different sets of RNA and protein 
molecules. Evidence that they generally do this without altering the sequence of 
their DNA comes from a classic set of experiments in frogs. When the nucleus of 
a fully differentiated frog cell is injected into a frog egg whose nucleus has been 
removed, the injected donor nucleus is capable of directing the recipient egg to 
produce a normal tadpole (Figure 7-2A). Because the tadpole contains a full 
range of differentiated cells that derived their DNA sequences from the nucleus 
of the original donor cell, it follows that the differentiated donor cell cannot have 
lost any important DNA sequences. A similar conclusion has been reached in 
experiments performed with various plants. Here differentiated pieces of plant 
tissue are placed in culture and then dissociated into single cells. Often, one of 
these individual cells can regenerate an entire adult plant (Figure 7-2B). Finally, 
this same principle has been demonstrated in mammals, including sheep, cat- 
tle, pigs, goats, dogs, and mice by introducing nuclei from somatic cells into 
enucleated eggs; when placed into surrogate mothers, some of these eggs (called 
reconstructed zygotes) develop into healthy animals (Figure 7-2C). 
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Figure 7-1 A mammalian neuron and a lymphocyte. The long branches of 
this neuron from the retina enable it to receive electrical signals from many 
cells and carry those signals to many neighboring cells. The lymphocyte is 
a white blood cell involved in the immune response to infection and 
moves freely through the body. Both of these cells contain the same 
genome, but they express different RNAs and proteins. (From 
B.B. Boycott Essays on the Nervous System [R. Bellairs and E.G. Gray, eds.]. 
Oxford, UK: Clarendon Press, 1974.) 



Further evidence that large blocks of DNA are not lost or rearranged during 
vertebrate development comes from comparing the detailed banding patterns 
detectable in condensed chromosomes at mitosis (see Figure 4-11). By this cri- 
terion the chromosome sets of differentiated cells in the human body appear to 
be identical. Moreover, comparisons of the genomes of different cells based on 
recombinant DNA technology have confirmed, as a general rule, that the 
changes in gene expression that underlie the development of multicellular 
organisms do not rely on changes in the DNA sequences of the corresponding 
genes. There are, however, a few cases where DNA rearrangements of the 
genome take place during the development of an organism— most notably, in 
generating the diversity of the immune system of mammals, which we discuss in 
Chapter 25. 



Different Cell Types Synthesize Different Sets of Proteins 

As a first step in understanding cell differentiation, we would like to know how 
many differences there are between any one cell type and another. Although we 
still do not have a detailed answer to this fundamental question, we can make 
certain general statements. 

1. Many processes are common to all cells, and any two cells in a single 
organism therefore have many proteins in common. These include the 
structural proteins of chromosomes, RNA polymerases, DNA repair 
enzymes, ribosomal proteins, enzymes involved in the central reactions of 
metabolism, and many of the proteins that form the cytoskeleton. 

2. Some proteins are abundant in the specialized cells in which they function 
and cannot be detected elsewhere, even by sensitive tests. Hemoglobin, for 
example, can be detected only in red blood cells. 

3. Studies of the number of different mRNAs suggest that, at any one time, a 
typical human cell expresses 30-60% of its approximately 25,000 genes. 
When the patterns of mRNAs in a series of different human cell lines are 
compared, it is found that the level of expression of almost every active 
gene varies from one cell type to another. A few of these differences are 
striking, like that of hemoglobin noted above, but most are much more 
subtle. Even genes that are expressed in all cell types vary in their level of 
expression from one cell type to the next. The patterns of mRNA abun- 
dance (determined using DNA microarrays, discussed in Chapter 8) are so 
characteristic of cell type that they can be used to type human cancer cells 
of uncertain tissue origin (Figure 7-3). 

Although the differences in mRNAs among specialized cell types are strik- 
ing, they nonetheless underestimate the full range of differences in the pat- 
tern of protein production. As we shall see in this chapter, Uiere are many 
steps after transcription at which gene expression can be regulated For 
example, alternative splicing can produce a whole family of proteins from 
a single gene. Finally proteins can be covalently modified after they are 
synthesized. Therefore a better way of appreciating the radical differences 
in gene expression between cell types is through methods that directly dis- 
play the levels of proteins and their post-translational modifications (Fig- 
ure 7-4). 
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Figure 7-2 Evidence that a differentiated cell contains all the genetic instructions necessary to direct 
the formation of a complete organism. (A) The nucleus of a skin cell from an adult frog transplanted into 
an enucleated egg can give rise to an entire tadpole. The broken arrow indicates that to give the 
transplanted genome time to adjust to an embryonic environment, a further transfer step is required in 
which one of the nuclei is taken from the early embryo that begins to develop and is put back into a 
second enucleated egg. (B) In many types of plants, differentiated cells retain the ability to "dedifferentiate/' 
so that a single cell can form a clone of progeny cells that later give rise to an entire plant. (C) A 
differentiated cell nucleus from an adult cow introduced into an enucleated egg from a different cow can 
give rise to a calf. Different calves produced from the same differentiated cell donor are genetically 
identical and are therefore clones of one another. (A, modified from J.B. Gurdon, Sci. Am. 219:24-35, 1968. 
With permission from Scientific American.) 



External Signals Can Cause a Cell to Change the Expression of Its 
Genes 

Most of the specialized cells in a multicellular organism are capable of altering 
their patterns of gene expression in response to extracellular cues. If a liver cell is 
exposed to a glucocorticoid hormone, for example, the production of several spe- 
cific proteins is dramatically increased. Glucocorticoids are released in the body 
during periods of starvation or intense exercise and signal the liver to increase the 
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Figure 7-3 Differences in mRNA expression 
patterns among different types of human 
cancer cells. This figure summarizes a very 
large set of measurements in which the 
mRNA levels of 1800 selected genes 
(arranged top to bottom) were determined 
for 142 different human tumor cell lines 
(arranged left to right), each from a different 
patient. Each small red bar indicates that the 
given gene in the given tumor is transcribed 
at a level significantly higher than the 
average across all the cell lines. Each small 
green bar indicates a less-than-average 
expression level, and each black bar denotes 
an expression level that is close to average 
across the different tumors. The procedure 
used to generate these data — mRNA 
isolation followed by hybridization to DNA 
microarrays— is described in Chapter 8 
(pp. 574-575). The figure shows that the 
relative expression levels of each of the 1800 
genes analyzed vary among the different 
tumors (seen by following a given gene 
from left to right across the figure). This 
analysis also shows that each type of tumor 
has a characteristic gene expression pattern. 
This information can be used to "type" 
cancer cells of unknown tissue origin by 
matching the gene expression profiles to 
those of known tumors. For example, the 
unknown sample in the figure has been 
identified as a lung cancer. (Courtesy of 
Patrick O. Brown, David Botstein, and the 
Stanford Expression Collaboration.) 
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Figure 7-4 Differences in the proteins expressed by two human tissues. In each panel, the 
proteins are displayed using two-dimensional polyacrylamide-gel electrophoresis (see 
pp. 521 -522), The proteins have been separated by molecular weight (top to bottom) and 
isoelectric point, the pH at which the protein has no net charge (right to left), The protein spots 
artificially colored red are common to both samples; those in blue are specific to one of the two 
tissues. The differences between the two tissue samples vastly outweigh their similarities: even 
for proteins that are shared between the two tissues, their relative abundances are usually 
different. Note that this technique separates proteins by both size and charge; therefore a protein 
that has, for example, several different phosphorylation states will appear as a series of horizontal 
spots (see upper right-hand portion of right panel). Only a small portion of the complete protein 
spectrum is shown for each sample. Although two dimensional gel electrophoresis provides a 
simple way to visualize the differences between two protein samples, methods based on mass 
spectrometry (see pp. 519-521) provide much more detailed information and are therefore more 
commonly used. (Courtesy of Tim Myers and Leigh Anderson, Large Scale Biology Corporation.) 
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production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids differently. Fat cells, for example, 
reduce the production of tyrosine aminotransferase, while some other cell types 
do not respond to glucocorticoids at all. These examples illustrate a general fea- 
ture of cell specialization: different cell types often respond differently to the 
same extracellular signal. Underlying such adjustments that occur in response 
to extracellular signals, there are features of the gene expression pattern that do 
not change and give each cell type its permanently distinctive character. 



Gene Expression Can Be Regulated at Many of the Steps in the 
Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the previous chapter, there are many steps in the path- 
way leading from DNA to protein. We now know that all of them can in principle 
be regulated. Thus a cell can control the proteins it makes by (1) controlling when 
and how often a given gene is transcribed (transcriptional control), (2) control- 
ling the splicing and processing of RNA transcripts (RNA processing control), (3) 
selecting which completed mRNAs are exported from the nucleus to the cytosol 
and determining where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
by ribosomes (translational control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or locating specific protein molecules after they 
have been made (protein activity control) (Figure 7-5). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of the chapter to the many additional ways of regulating 
gene expression. 



Figure 7-5 Six steps at which eucaryotic 
gene expression can be controlled. 

Controls that operate at steps 1 through 
5 are discussed in this chapter. Step 6, the 
regulation of protein activity, occurs 
largely through covalent post- 
radiational modifications including 
phosphorylation, acetylation, and 
ubiquitylation (see Table 3-3, p. 1 86) and 
is discussed in many chapters throughout 
the book. 



Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction of its genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern of 
genes they express in response to changes in their environment, such as signals from 
other cells. Although all of the steps involved in expressing a gene can in principle he 
regulated, for most genes the initiation of RNA transcription is the most important 
point of control 
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DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell determine which of its thousands of genes to transcribe? As out- 
lined in Chapter 6, the transcription of each gene is controlled by a regulatory 
region of DNA relatively near the site where transcription begins. Some regula- 
tory regions are simple and act as switches thrown by a single signal Many oth- 
ers are complex and resemble tiny microprocessors, responding to a variety of 
signals that they interpret and integrate in order to switch their neighboring 
gene on or off. Whether complex or simple, these switching devices are found in 
all cells and are composed of two types of fundamental components: (1) short 
stretches of DNA of defined sequence and (2) gene regulatory proteins that rec- 
ognize and bind to this DNA. 

We begin our discussion of gene regulatory proteins by describing how they 
were discovered. 



Gene Regulatory Proteins Were Discovered Using Bacterial 
Genetics 

Genetic analyses in bacteria carried out in the 1950s provided the first evidence 
for the existence of gene regulatory proteins (often loosely called "transcription 
factors") that turn specific sets of genes on or off. One of these regulators, the 
lambda repressor, is encoded by a bacterial virus, bacteriophage lambda. The 
repressor shuts off the viral genes that code for the protein components of new 
virus particles and thereby enables the viral genome to remain a silent passen- 
ger in the bacterial chromosome, multiplying with the bacterium when condi- 
tions are favorable for bacterial growth (see Figure 5-78). The lambda repressor 
was among the first gene regulatory proteins to be characterized, and it remains 
one of the best understood, as we discuss later. Other bacterial regulators 
respond to nutritional conditions by shutting off genes encoding specific sets of 
metabolic enzymes when they are not needed. The Lac repressor, the first of 
these bacterial proteins to be recognized, turns off the production of the pro- 
teins responsible for lactose metabolism when this sugar is absent from the 
medium. 

The first step toward understanding gene regulation was the isolation of 
mutant strains of bacteria and bacteriophage lambda that were unable to shut 
off specific sets of genes. It was proposed at the time, and later proven, that most 
of these mutants were deficient in proteins acting as specific repressors for these 
sets of genes. Because these proteins, like most gene regulatory proteins, are 
present in small quantities, it was difficult and time-consuming to isolate them. 
They were eventually purified by fractionating cell extracts. Once isolated, the 
proteins were shown to bind to specific DNA sequences close to the genes that 
they regulate. The precise DNA sequences that they recognized were then deter- 
mined by a combination of classical genetics and methods for studying pro- 
tein-DNA interactions discussed later in this chapter. 



The Outside of the DNA Helix Can Be Read by Proteins 

As discussed in Chapter 4, the DNA in a chromosome consists of a very long 
double helix (Figure 7H5K Gene regulatory proteins must recognize specific 
nucleotide sequences embedded within this structure. Ii was originally thought 
that these proteins might require direct access to the hydrogen bonds between 
base pairs fa the interior of the double helix to distinguish between one DNA 



Figure 7-6 Double-helical structure of DNA. A space-filling model of DNA 
showing the major and minor grooves on the outside of the double helix. 
The atoms are colored as follows: carbon, dark blue; nitrogen, light blue; 
hydrogen, white; oxygen, red; phosphorus, yellow. 
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Figure 7-7 How the different base pairs in DNA can be recognized from their edges without the need to open the 
double helix. The four possible configurations of base pairs are shown, with potential hydrogen bond donors indicated in 
blue, potential hydrogen bond acceptors in red, and hydrogen bonds of the base pairs themselves as a series of short 
parallel red lines. Methyl groups, which form hydrophobic protuberances, are shown in yellow, and hydrogen atoms that are 
attached to carbons, and are therefore unavailable for hydrogen bonding, are white. (From C. Branden and J.Tooze, 
Introduction to Protein Structure, 2nd ed. New York: Garland Publishing, 1 999.) 



sequence and another. It is now clear, however, that the outside of the double 
helix is studded with DNA sequence information that gene regulatory proteins 
can recognize without having to open the double helix. The edge of each base 
pair is exposed at the surface of the double helix, presenting a distinctive pattern 
of hydrogen bond donors, hydrogen bond acceptors, and hydrophobic patches 
for proteins to recognize in both the major and minor groove (Figure 7-7). But 
only in the major groove are the patterns markedly different for each of the four 
base-pair arrangements (Figure 7-8). For this reason, gene regulatory proteins 
generally make specific contacts with the major groove — as we shall see. 
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Figure 7-8 A DNA recognition code. The 

edge of each base pair, seen here looking 
directly at the major or minor groove, 
contains a distinctive pattern of 
hydrogen bond donors, hydrogen bond 
acceptors, and methyl groups. From the 
major groove, each of the four base-pair 
configurations projects a unique pattern 
of features. From the minor groove, 
however, the patterns are similar for G-C 
and C-G as well as for A-T and T-A. The 
color code is the same as that in Figure 
7-7. (From C. Branden and J.Tooze, 
Introduction to Protein Structure, 2nd ed. 
New York: Garland Publishing, 1999.) 
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Short DNA Sequences Are Fundamental Components of Genetic 
Switches 

A specific nucleotide sequence can be "read" as a pattern of molecular features 
on the surface of the DNA double helix. Particular nucleotide sequences, each 
typically less than 20 nucleotide pairs in length, function as fundamental com- 
ponents or genetic switches by serving as recognition sites for the binding of 
specific gene regulatory proteins. Thousands of such DNA sequences have been 
identified, each recognized by a different gene regulatory protein (or by a sel of 
related gene regulatory prmoiiisj. Some of the gene regulatory proteins that are 
discussed in the course of this chapier are listed in Table 7-1, along with the 
DNA sequences that they recognize. 

We now turn to the gene regulatory proteins themselves, the second funda- 
mental component of genetic switches. We begin with the structural features 
that allow these proteins to recognize short, specific DNA sequences contained 
in a much longer double helix. 



Gene Regulatory Proteins Contain Structural Motifs That Can 
Read DNA Sequences 

Molecular recognition in biology generally relies on an exact fit between the sur- 
faces of two molecules, and the study of gene regulatory proteins has provided 
some of the clearest examples of this principle. A gene regulatory protein recog- 
nizes a specific DNA sequence because the surface of the protein is extensively 



Table 7-1 Some Gene Regulatory Proteins and the DNA Sequences That They 
Recognize 



Bacteria 



Yeast 



Drosophiia 



Mammals 



Lac repressor 

CAP 

Lambda repressor 

Gal4 

Mata2 

Gcn4 

Kruppel 

Bicotd 

Spl 

Octl Pou domain 

GATA1 

MyoD 

p53 



>NA SEQUENCE RECOGNIi 



5' AA^G'KpAG^GGATAACAATT 
3' TTAACACTCGCCTATTGTTAA 

TGTGAGTTAGCTCACT 

I I I I I I II I I I f I I I I 
ACACTCAATCGAGTGA 

TATCACCGCCAGAGGT 

I I I I II I I I I I I I I I I 
ATAGTGGCGGTCTCCAT 

CGGAGGACTGTCCTCCG 

I I I I I I I I I I I I I I I I i 
GCCTCCTGACAGGAGGC 

CATGTAATT 

I I I I I I I I I 
GTACATTAA 

ATGACTCAT 

I I I I I II I I 
TACTGAGTA 

AACGGGTTAA 

I I I I I I I I I I 
TTGCCCAATT 

GGGATTAGA 

II I I I I I I i 
CCCTAATCT 

GGGCGG 

I I I I I I 
CCCGCC 

ATGCAAAT 

I I I I I I I I 
TACGTTTA 

TGATAG 

I I I I I I 
ACTATC 

CAAATG 

GTTTAC 

GGGCAAGTGT 

! I I I i IT I I i 
CCCGTTCAGA 



*For convenience, only one recognition sequence, rather than a consensus sequence (see Figure 
6-1 2), is given for each protein 
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complementary to the special surface features of the double helix in that region. 
In most cases the protein makes a series of contacts with the DNA, involving 
hydrogen bonds, ionic bonds, and hydrophobic interactions. Although each indi- 
vidual contact is weak, the 20 or so that are typically formed at the protein-DNA 
interface add together to ensure that the interaction is both highly specific and 
very strong (Figure 7-9). In fact, DNA-protein interactions include some of the 
tightest and most specific molecular interactions known in biology. 

Although each example of protein-DNA recognition is unique in detail, x- 
ray crystallographic and NMR spectroscopic studies of several hundred gene 
regulatory proteins have revealed that many of them contain one or another of 
a small set of DNA-binding structural motifs. These motifs generally use either a 
helices or p sheets to bind to the major groove of DNA; this groove, as we have 
seen, contains sufficient information to distinguish one DNA sequence from any 
other. The fit is so good that it has been suggested that the dimensions of the 
basic structural units of nucleic acids and proteins evolved together to permit 
these molecules to interlock. 



The Helix-Turn-Helix Motif Is One of the Simplest and Most 
Common DNA-Binding Motifs 

The first DNA-binding protein motif to be recognized was the helix-turn-helix. 
Originally identified in bacterial proteins, this motif has since been found in 
many hundreds of DNA-binding proteins from both eucaryotes and procary- 
otes. It is constructed from two a helices connected by a short extended chain of 
amino acids, which constitutes the "turn" (Figure 7-10). The two helices are held 
at a fixed angle, primarily through interactions between the two helices. The 
more C-terminal helix is called the recognition helix because it fits into the major 
groove of DNA; its amino acid side chains, which differ from protein to protein, 
play an important part in recognizing the specific DNA sequence to which the 
protein binds. 

Outside the helix-turn-helix region, the structure of the various proteins 
that contain this motif can vary enormously (Figure 7-11). Thus each protein 
"presents" its helix-turn-helix motif to the DNA in a unique way, a feature 
thought to enhance the versatility of the helix-turn-helix motif by increasing the 
number of DNA sequences that the motif can be used to recognize. Moreover, in 
most of these proteins, parts of the polypeptide chain outside the 
helix-turn-helix domain also make important contacts with the DNA, helping to 
fine-tune the interaction. 
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Figure 7-9 The binding of a gene 
regulatory protein to the major groove 
of DNA. Only a single contact is shown. 
Typically, the protein-DNA interface 
would consist of 10-20 such contacts, 
involving different amino acids, each 
contributing to the strength of the 
protein-DNA interaction. 
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Figure 7-10 The DNA-binding 
helix-turn-helix motif. The motif is 
shown in (A), where each white circle 
denotes the central carbon of an amino 
acid. The C-terminal a helix (red) is called 
the recognition helix because it 
participates in sequence-specific 
recognition of DNA. As shown in (B), this 
helix fits into the major groove of DNA, 
where it contacts the edges of the base 
pairs (see also Figure 7-7). The 
N-terminal a-helix (blue) functions 
primarily as a structural component that 
helps to position the recognition helix. 



The group of helix-turn-helix proteins shown in Figure 7-1 1 demonstrates a 
common feature of many sequence-specific DNA-binding proteins. They bind 
as symmetric dimers to DNA sequences that are composed of two very similar 
"half-sites/' which are also arranged symmetrically (Figure 7-12). This arrange- 
ment allows each protein monomer to make a nearly identical set of contacts 
and enormously increases the binding affinity: as a first approximation, dou- 
bling the number of contacts doubles the free energy of the interaction and 
thereby squares the affinity constant. 



Homeodomain Proteins Constitute a Special Class of 
Helix-Turn-Helix Proteins 

Not long after the first gene regulatory proteins were discovered in bacteria, 
genetic analyses in the fruit fly Drosophila led to the characterization of an 
important class of genes, the homeotic selector genes, that play a critical part in 
orchestrating fly development. As discussed in Chapter 22, they have since 
proved to have a fundamental role in the development of higher animals as well 
Mutations in these genes can cause one body part in the fly to be converted into 
another, showing that the proteins they encode control critical developmental 
decisions. 

When the nucleotide sequences of several homeotic selector genes were 
determined in the early 1980s, each proved to code for an almost identical 
stretch of 60 amino acids that defines this class of proteins and is termed the 
homeodomain. When the three-dimensional structure of the homeodomain 
was determined, it was seen to contain a helix-turn-helix motif related to that of 




tryptophan repressor 



lambda Cro 



lambda repressor 
fragment 



CAP fragment 



DNA 



Figure 7-1 1 Some helix-turn-helix DNA-binding proteins. All of the proteins bind DNA as dimers in which the two copies 
of the recognition helix (red cylinder) are separated by exactly one turn of the DNA helix (3.4 nm). The other helix of the 
helix-turn-helix motif is colored blue, as in Figure 7-10. The lambda repressor and Cro proteins control bacteriophage 
lambda gene expression, and the tryptophan repressor and the catabolite activator protein (CAP) control the expression of 
sets of E.coli genes. 
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Figure 7-12 A specific DNA sequence recognized by the bacteriophage 
lambda Cro protein. The nucleotides labeled in green in this sequence are 
arranged symmetrically, allowing each half of the DNA site to be 
recognized in the same way by each protein monomer, also shown in 
green. See Figure 7-1 1 for the actual structure of the protein. 



5 ' T A A C A C CG TGCGTGTTG 

i I I i i I r i i m i i i i M 

3' A T T G T G G C A G 6 C A C A A C 



the bacterial gene regulatory proteins, providing one of the first indications that 
the principles of gene regulation established in bacteria are relevant to higher 
organisms as well. More than 60 homeodomain proteins have now been discov- 
ered in Drosophila alone, and homeodomain proteins have been identified in vir- 
tually all eucaryotic organisms that have been studied, from yeasts to plants to 
humans. 

The structure of a homeodomain bound to its specific DNA sequence is 
shown in Figure 7-13. Whereas the helix-turn-helix motif of bacterial gene reg- 
ulatory proteins is often embedded in different structural contexts, the 
helix-turn-helix motif of homeodomains is always surrounded by the same 
structure (which forms the rest of the homeodomain), suggesting that the motif 
is always presented to DNA in the same way. Indeed, structural studies have 
shown that a yeast homeodomain protein and a Drosophila homeodomain pro- 
tein have very similar conformations and recognize DNA in almost exactly the 
same manner, although they are identical at only 17 of 60 amino acid positions 
(see Figure 3-13). 



There Are Several Types of DNA-Binding Zinc Finger Motifs 

The helix-turn-helix motif is composed solely of amino acids. A second impor- 
tant group of DNA-binding motifs includes one or more zinc atoms as structural 
components. Although all such zinc-coordinated DNA-binding motifs are called 
zinc fingers, this description refers only to their appearance in schematic draw- 
ings dating from their initial discovery (Figure 7-14A). Subsequent structural 
studies have shown that they fall into several distinct structural groups, two of 
which we consider here. The first type was initially discovered in the protein that 
activates the transcription of a eucaryotic ribosomal RNA gene. It has a simple 
structure, in which the zinc holds an a helix and a p sheet together (Figure 
7-14B). This type of zinc finger is often found in tandem clusters so that the a 
helix of each can contact the major groove of the DNA, forming a nearly contin- 
uous stretch of a helices along the groove. In this way, a strong and specific 
DNA-protein interaction is built up through a repeating basic structural unit 
(Figure 7-15). 

Another type of zinc finger is found in the large family of intracellular 
receptor proteins (discussed in detail in Chapter 15). It forms a different type of 




Figure 7-1 3 A homeodomain bound to its 
specific DNA sequence. Two different 
views of the same structure are shown. 

(A) The homeodomain is folded into three 
a helices, which are packed tightly 
together by hydrophobic interactions. The 
part containing helices 2 and 3 closely 
resembles the helix-turn-helix motif. 

(B) The recognition helix (helix 3, red) forms 
important contacts with the major groove 
of DNA. The asparagine (Asn) of helix 3, for 
example, contacts an adenine, as shown in 
Figure 7-9. A flexible arm attached to helix 
1 forms contacts with nucleotide pairs in 
the minor groove. The homeodomain 
shown here is from a yeast gene regulatory 
protein, but it closely resembles 
homeodomains from many eucaryotic 
organisms. <ACGT> (Adapted from 

C. Wolberger et al., Cell 67:5 1 7-528, 1 991 . 
With permission from Elsevier.) 
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Figure 7-14 One type of zinc finger 
protein. This protein belongs to the 
Cys-Cys-His-His family of zinc finger 
proteins, named after the amino acids that 
grasp the zinc. (A) Schematic drawing of 
the amino acid sequence of a zinc finger 
from a frog protein of this class. (B) The 
three-dimensional structure of this same 
type of zinc finger is constructed from an 
antiparallel (3 sheet (amino acids 1 to 1 0) 
followed by an a helix (amino acids 1 2 to 
24). The four amino acids that bind the zinc 
(Cys 3, Cys 6, His 19, and His 23) hold one 
end of the a helix firmly to one end of the 
P sheet. (Adapted from M.S. Lee et al., 
Science 245:635-637, 1 989. With 
permission from AAAS.) 



structure (similar in some respects to the helix-turn-helix motif) in which two a 
helices are packed together with zinc atoms (Figure 7-16). Like the helix-turn- 
helix proteins, these proteins usually form dimers that allow one of the two a 
helices of each subunit to interact with the major groove of the DNA. Although 
the two types of zinc finger structures discussed in this section are structurally 
distinct, they share two important features: both use zinc as a structural ele- 
ment, and both use an a helix to recognize the major groove of the DNA. 



p sheets Can Also Recognize DNA 

In the DNA-binding motifs discussed so far, a helices are the primary mecha- 
nism used to recognize specific DNA sequences. One large group of gene regu- 
latory proteins, however, has evolved an entirely different recognition strategy 
In this case, a two-si randed p sheet, with amino acid side chains extending from 
the sheet toward the DNA, reads the information on the surface of the major 
groove (Figure 7-17). As in the case of a recognition a helix, this p- sheet motif 
can be used to recognize many different DNA sequences; the exact DNA 
sequence recognized depends on the sequence of amino acids that make up the 
P sheet. 



COOH 



COOH 




Figure 7-15 DNA binding by a zinc finger 
protein. (A) The structure of a fragment of 
a mouse gene regulatory protein bound to 
a specific DNA site. This protein recognizes 
DNA by using three zinc fingers of the 
Cys-Cys-His-His type (see Figure 7-14) 
arranged as direct repeats. <ATCT> (B) The 
three fingers have similar amino acid 
sequences and contact the DNA in similar 
ways. In both (A) and <B) the zinc atom in 
each finger is represented by a small 
sphere. (Adapted from N. Pavletich and 
C. Pabo, Science 252:810-817, 1991. With 
permission from AAAS.) 
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Some Proteins Use Loops That Enter the Major and Minor 
Grooves to Recognize DNA 

A few DNA-binding proteins use protruding peptide loops to read nucleotide 
sequences, rather than a helices and [i sheets. For example, p53, a critical wmor 
suppressor in humans, recognizes nucleotide pairs from both the major and 
minor grooves using such loops [Figure 7-18}. The normal role of the p53 pro- 
tein is to lightly regulate cell growth and proliferation. Its importance can be 
appreciated by the fact that nearly half of all human cancers have acquired 
somatic mutations in die gene for p53; this step is key to the progression of 
many tumors, as we shall see in Chapter 20. Many of the p53 mutations 
observed in cancer cells destroy or alter its DNA-binding properties; indeed, Arg 
248, which contacts the minor groove of DNA (see Figure 7-18) is the most fre- 
quently mutated p53 residue in human cancers. 



Figure 7-16 A dimer of the zinc finger 
domain of the intracellular receptor 
family bound to its specific DNA 
sequence. Each zinc finger domain 
contains two atoms of Zn (indicated by 
the small gray spheres)) one stabilizes the 
DNA recognition helix (shown in brown in 
one subunit and red in the other), and 
one stabilizes a loop (shown in purple) 
involved in dimer formation. Each Zn 
atom is coordinated by four appropriately 
spaced cysteine residues. Like the 
helix-turn-helix proteins shown in Figure 
7-1 1 , the two recognition helices of the 
dimer are held apart by a distance 
corresponding to one turn of the DNA 
double helix. The specific example shown 
is a fragment of the glucocorticoid 
receptor. This is the protein through 
which cells detect and respond 
transcriptionally to the glucocorticoid 
hormones produced in the adrenal gland 
in response to stress. (Adapted from 
B.F. Luisi et al., Nature 352:497-505, 1 991 . 
With permission from Macmillan 
Publishers Ltd.) 



The Leucine Zipper Motif Mediates Both DNA Binding and Protein 
Dimerization 

Many gene regulatory proteins recognize DNA as homodimers, probably 
because, as we have seen, this is a simple way of achieving strong specific bind- 
ing (see Figure 7-12). Usually, the portion of the protein responsible for dimer- 
ization is distinct from the portion that is responsible for DNA binding. One 
motif, however, combines these two functions elegantly and economically. It is 
called the leucine zipper motif, so named because of the way the two a helices, 
one from each monomer, are joined together to form a short coiled-coil (see Fig- 
ure 3-9). The helices are held together by interactions between hydrophobic 
amino acid side chains (often on leucines) that extend from one side of each 
helix. Just beyond the dimerization interface the two a helices separate from 
each other to form a Y-shaped structure, which allows their side chains to con- 
tact the major groove of DNA. The dimer thus grips the double helix like a 
clothespin on a clothesline (Figure 7-19). 





Figure 7-1 7 The bacterial Met repressor 
protein. The bacterial Met repressor 
regulates the genes encoding the 
enzymes that catalyze methionine 
synthesis. When this amino acid is 
abundant, it binds to the repressor, 
causing a change in the structure of the 
protein that enables it to bind to DNA 
tightly, shutting off the synthesis of the 
enzyme. (A) In order to bind to DNA 
tightly, the Met repressor must be 
complexed with S-adenosyl methionine, 
outlined in red. One subunit of the 
dimeric protein is shown in green, while 
the other is shown in blue. The two- 
stranded p sheet that binds to DNA is 
formed by one strand from each subunit 
and is shown in dark green and dark blue. 
(B) Simplified diagram of the Met 
repressor bound to DNA, showing how 
the two-stranded p sheet of the repressor 
binds to the major groove of DNA. For 
clarity, the other regions of the repressor 
have been omitted. (A, adapted from 
S. Phillips, Curr. Opin. Struct Biol. 1:89-98, 
1 991 , with permission from Elsevier; 
B, adapted from W. Somers and 
S. Phillips, Nature 359:387-393, 1992, 
with permission from Macmillan 
Publishers Ltd.) 
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Figure 7-18 DNA recognition by the 
p53 protein. The most important DNA 
contacts are made by arginine 248 and 
lysine 120, which extend from the 
protruding loops entering the minor and 
major grooves. The folding of the p53 
protein requires a zinc atom {shown as a 
sphere), but the way in which the zinc is 
grasped by the protein is completely 
different from that of the zinc finger 
proteins, described previously. 



Heterodimerization Expands the Repertoire of DNA Sequences 
That Gene Regulatory Proteins Can Recognize 

Many of the gene regulatory proteins we have seen thus far bind DNA as homo- 
dLmers, that is. dimers made up of two identical subunits. However, many gene 
regulatory proteins can also associate with nonidentical partners to form het~ 
erodimers composed of two different subunits. Because heterodimers typically 
form from two proteins with distinct DNA-binding specificities, the mixing and 
matching of gene regulatory proteins in this way greatly expands the repertoire 
of DNA-binding specificities that these proteins can display As illustrated in Fig- 
ure 7-20, three distinct DNA-binding specificities could, in principle, be gener- 
ated from two types of leucine zipper monomers, while six could be created 
from three types of monomers, and so on. 
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DNA 




Figure 7-19 A leucine zipper dimer bound to DNA. Two a-helical DNA^ 
binding domains (bottom) dimerize through their a-helical leucine zipper 
region (top) to form an inverted V-shaped structure. Each arm of the Y is 
formed by a single </ helix, one from each monomer, that mediates 
binding to a specific DNA sequence in the major groove of DNA. <TGTT> 
Each a helix binds to one-half of a symmetric DNA structure. The 
structure shown is of the yeast Gcn4 protein, which regulates 
transcription in response to the availability of amino acids in the 
environment. (Adapted from T.E. Ellenberger et al., Cell 71 :1 223-1 237, 
1992. With permission from Elsevier.) 



Figure 7-20 Heterodimerization of leucine zipper 
proteins can alter their DNA-binding specificity. Leucine 
zipper homodimers bind to symmetric DNA sequences, as 
shown in the left-hand and center drawings. These two 
proteins recognize different DNA sequences, as indicated 
by the red and blue regions in the DNA. The two different 
monomers can combine to form a heterodimer, which now 
recognizes a hybrid DNA sequence, composed from one 
red and one blue region. 
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There are, however, limits to this promiscuity: for example, if all the many 
types of leucine zipper proteins in a typical eucaryotic cell formed heterodimers, 
the amount of "cross-talk" between the gene regulatory circuits of a cell would 
presumably be so great as to cause chaos. Whether or not a particular het- 
erodimer can form depends on how well the hydrophobic surfaces of the two 
leucine zipper a helices mesh with each other, which in turn depends on the 
exact amino acid sequences of the two zipper regions. Thus, each leucine zipper 
protein in the cell can form dimers with only a small set of other leucine zipper 
proteins. 

Heterodimerization is an example of combinatorial control, in which com- 
binations of different proteins, rather than individual proteins, control a cell 
process. Heterodimerization as a mechanism for combinatorial control of gene 
expression occurs in many different types of gene regulatory proteins (Figure 
7-21). Combinatorial control is a major theme that we shall encounter repeat- 
edly in this chapter, and the formation of heterodimeric gene regulatory com- 
plexes is only one of many ways in which proteins work in combinations to con- 
trol gene expression. 

Certain combinations of gene regulatory proteins have become "hardwired" 
in the cell; for example, two distinct DNA-binding domains can, through gene 
rearrangements occurring over evolutionary time scales, become joined into a 
single polypeptide chain that displays a novel DNA-binding specificity (Figure 
7-22). 



Figure 7-21 A heterodimer composed of 
two homeodomain proteins bound to its 
DNA recognition site. The yellow helix 4 of 
the protein on the right (Mata2) is 
unstructured in the absence of the protein 
on the left (Matal), forming a helix only 
upon heterodimerization. The DNA 
sequence is recognized jointly by both 
proteins; some of the protein-DNA 
contacts made by Matcc2 were shown in 
Figure 7-1 3. These two proteins are from 
budding yeast, where the heterodimer 
specifies a particular cell type (see Figure 
7-65). The helices are numbered in 
accordance with Figure 7-1 3. (Adapted 
from T. Li et al., Science 270:262-269, 1 995. 
With permission from AAA5.) 



The Helix-Loop-Helix Motif Also Mediates Dimerization and DNA 
Binding 

Another important DNA-binding motif, related to the leucine zipper, is the 
helix-loop-helix (HLH) motif, which differs from the helix-turn-helix motif 
discussed earlier. An HLH motif consists of a short a helix connected by a loop 
to a second, longer a helix. The flexibility of the loop allows one helix to fold back 
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Figure 7-22 Two DNA-binding domains 
covalently joined by a flexible 
polypeptide. The structure shown {called 
a Pou-domain) consists of both a 
homeodomain and a helix-turn-helix 
structure joined by a flexible polypeptide 
"leash" indicated by the broken lines. 
A single gene encodes the entire protein, 
which is synthesized as a continuous 
polypeptide chain. The covalent joining 
of two structures in this way results in a 
large increase in the affinity of the 
protein for its specific DNA sequence 
compared with the DNA affinity of either 
separate structure. The group of 
mammalian gene regulatory proteins 
exemplified by this structure regulate the 
production of growth factors, 
immunoglobulins, and other molecules 
involved in development. The particular 
example shown is from the Oct1 
protein. (Adapted from J.D. Klemm et al., 
Ceil 77:21 -32, 1994. With permission 
from Elsevier.) 
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and pack against the other. As shown in Figure 7-23, this two-helix structure 
binds both to DNA and to the HLH motif of a second HLH protein. The second 
HLH protein can be the same (creating a homodimer) or different (creating a 
heterodimer). In either case, two a helices that extend from the dimerization 
interface make specific contacts with the DNA. 

Several HLH proteins lack the a-helical extension responsible for binding to 
DNA. These truncated proteins can form heterodimers with full-length III II 
proteins, but the heterodimers are unable to bind DNA tightly because they 
Form only half ul ine necessary contacts. Thus, in addition to creating active 
dimers, heterodimerization provides cells with a widely used way to hold spe- 
cific gene regulatory proieins in check (Figure 7-24). 



Figure 7-23 A helix-loop-helix (HLH) 
dimer bound to DNA. The two 

monomers are held together in a four- 
helix bundle: each monomer contributes 
two a helices connected by a flexible 
loop of protein (red). A specific DNA 
sequence is bound by the two a helices 
that project from the four-helix bundle. 
(Adapted from A.R. Ferre-D'Amare et al., 
Nature 363:38-45, 1993. With permission 
from Macmillan Publishers Ltd.) 



It Is Not Yet Possible to Predict the DNA Sequences Recognized by 
All Gene Regulatory Proteins 

The various DNA-binding motifs that we have discussed provide structural 
frameworks from which specific amino acid side chains extend to contact spe- 
cific base pairs in the DNA. It is reasonable to ask, therefore, whether there is a 
simple amino acid-base pair recognition code: is a G-C base pair, for example, 
always contacted by a particular amino acid side chain? The answer is no. 
although certain types of amino acid-base interactions appear much more fre- 
quently than others (Figure 7-25), As we saw in Chapter 3. protein surfaces of 
virtually any shape and chemistry can be made from just 20 different amino 
acids, and a gene regulatory pmtein uses different patterns of these to create a 
sm lace that is precisely complementary to a particular DNA sequence. We know 
that the same base pair can thereby be recognized in many ways depending on 
its context (Figure 7-26). Nevertheless, molecular biologists are beginning to 
understand the principles of protein-DNA recognition well enough to design 
new proteins that will recognize a given DNA sequence. 

Having outlined the general features of gene regulatory proteins, we turn to 
some of the methods that are now used to study them. 
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Figure 7-24 Inhibitory regulation by 
truncated HLH proteins. The HLH motif 
is responsible for both dimerization and 
DNA binding. On the left, an HLH 
homodimer recognizes a symmetric DNA 
sequence. On the right, the binding of a 
full-length HLH protein (blue) to a 
truncated HLH protein (green) that lacks 
the DNA-binding a helix generates a 
heterodimer that is unable to bind DNA 
tightly. If present in excess, the truncated 
protein molecule blocks the 
homodimerization of the full-length HLH 
protein and thereby prevents it from 
binding to DNA. 
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Figure 7-25 One of the most common protein-DNA interactions. 

Because of its specific geometry of hydrogen-bond acceptors (see 
Figure 7-7), the side chain of arginine unambiguously recognizes guanine. 
Figure 7-9 shows another common protein-DNA interaction. 



A Gel-Mobility Shift Assay Readily Detects Sequence-Specific 
DNA-Binding Proteins 

Genetic analyses, which provided a route to the gene regulatory proteins of bac- 
teria, yeast, and Drosophila, are much more difficult in vertebrates. Therefore, 
the isolation of vertebrate gene regulatory proteins had to await the develop- 
ment of different approaches. Many of these approaches rely on the detection in 
a cell extract of a DNA-binding protein that specifically recognizes a DNA 
sequence known to control the expression of a particular gene. One of the most 
common ways to detect and study sequence-specific DNA-binding proteins is 
based on the effect of a bound protein on the migration of DNA molecules in an 
electric field. 
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Figure 7-26 Summary of sequence- 
specific interactions between six 
different zinc fingers and their DNA 
recognition sequences. Even though all 
six Zn fingers have the same overall 
structure (see Figure 7-14), each binds to 
a different DNA sequence. The numbered 
amino acids form the a helix that 
recognizes DNA (Figures 7-14 and 7-15), 
and those that make sequence-specific 
DNA contacts are green. Bases contacted 
by protein are orange. Although 
arginine-guanine contacts are common 
(see Figure 7-25), guanine can also be 
recognized by serine, histidine, and 
lysine, as shown. Moreover, the same 
amino acid (serine, in this example) can 
recognize more than one base. Two of 
the Zn fingers depicted are from the TTK 
protein (a Drosophila protein that 
functions in development); two are from 
the mouse protein (Zif268) that was 
shown in Figure 7-1 5; and two are from a 
human protein (GL1) whose aberrant 
forms can cause certain types of cancers. 
{Adapted from C. Branden and J.Tooze, 
Introduction to Protein Structure, 2nd ed. 
New York: Garland Publishing, 1999.) 
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A DNA molecule is highly negatively charged and will therefore move rapidly 
toward a positive electrode when it is subjected to an electric field. When ana- 
lyzed by polyacrylamide-gel electrophoresis (see p. 534), DNA molecules are 
separated according to their size because smaller molecules are able to pene- 
trate the fine gel meshwork more easily than large ones. Protein molecules 
bound to a DNA molecule will cause it to move more slowly through the gel; in 
general, the larger the bound protein, the greater the retardation of the DNA 
molecule. This phenomenon provides the basis for the gel-mobility shift assay, 
which allows even trace amounts of a sequence-specific DNA-binding protein to 
be readily detected. In this assay, a short DNA fragment of specific length and 
sequence (produced either by DNA cloning or by chemical synthesis, as dis- 
cussed in Chapter 8) is radioactively labeled and mixed with a cell extract; the 
mixture is then loaded onto a polyacrylamide gel and subjected to elec- 
trophoresis. If the DNA fragment corresponds to a chromosomal region where, 
for example, several sequence-specific proteins bind, autoradiography (see pp. 
602-603) will reveal a series of DNA bands, each retarded to a different extent 
and representing a distinct DNA-protein complex. The proteins responsible for 
each band on the gel can then be separated from one another by subsequent 
fractionations of the cell extract (Figure 7-27). Once a sequence-specific DNA 
protein has been purified, the gel-mobility shift assay can be used to study the 
strength and specificity of its interactions with different DNA sequences, the 
lifetime of DNA-protein complexes, and other properties critical to the func- 
tioning of the protein in the cell. 



DNA Affinity Chromatography Facilitates the Purification of 
Sequence-Specific DNA-Binding Proteins 

A particularly powerful protein-purification method called DNA affinity chro- 
matography can be used once the DNA sequence that a gene regulatory protein 
recognizes has been determined. A double- stranded oligonucleotide of the cor- 
rect sequence is synthesized by chemical methods and linked to an insoluble 
porous matrix such as agarose; the matrix with the oligonucleotide attached is 
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Figure 7-27 A gel-mobility shift assay. 

The principle of the assay is shown 
schematically in (A). In this example an 
extract of an antibody-producing cell line 
is mixed with a radioactive DNA fragment 
containing about 160 nucleotides of a 
regulatory DNA sequence from a gene 
encoding the light chain of the antibody 
made by the cell line. The effect of the 
proteins in the extract on the mobility of 
the DNA fragment is analyzed by 
polyacrylamide-gel electrophoresis 
followed by autoradiography. The free 
DNA fragments migrate rapidly to the 
bottom of the gel, while those fragments 
bound to proteins are retarded; the 
finding of six retarded bands suggests 
that the extract contains six different 
sequence-specific DNA-binding proteins 
(indicated as C1-C6) that bind to this DNA 
sequence. (For simplicity, any DNA 
fragments with more than one protein 
bound have been omitted from the 
figure.) In (B) a standard chromatographic 
technique (see pp. 512-513 ) was used to 
fractionate the extract (top), and each 
fraction was mixed with the radioactive 
DNA fragment, applied to one lane of a 
polyacrylamide gel, and analyzed as in (A). 
(B, modified from C. Scheidereit, A. Heguy 
and R.G. Roeder, Ceil 5 1:783-793, 1987. 
With permission from Elsevier.) 
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then used to construct a column that selectively binds proteins that recognize 
the particular DNA sequence (Figure 7-28). Purifications as great as 10,000-fold 
can be achieved by this means with relatively little effort. 

Although most gene regulatory proteins are present at very low levels in the 
cell, enough pure protein can usually be isolated by affinity chromatography to 
obtain a partial amino acid sequence by mass spectrometry or other means (dis- 
cussed in Chapter 8). If the complete genome sequence of the organism is 
known, the partial amino acid sequence can be used to identify the gene. The 
gene not only provides the complete amino acid sequence of the protein; it also 
provides the means to produce the protein in unlimited amounts through 
genetic engineering techniques, also discussed in Chapter 8. 



The DNA Sequence Recognized by a Gene Regulatory Protein Can 
Be Determined Experimentally 

Gene regulatory proteins can be discovered before the DNA sequence they rec- 
ognize is known. For example, many of the Drosophila homeodomain proteins 
were discovered through the isolation of mutations that altered fly development. 
This allowed the genes encoding the proteins to be identified, and the proteins 
could then be overexpressed in cultured cells and easily purified. DNA foot- 
printingis one method of determining the DNA sequences recognized by a gene 
regulatory protein once it has been purified. This strategy also requires a puri- 
fied fragment of duplex DNA that contains somewhere within it a recognition 
site for the protein. Short recognition sequences can occur by chance on any 
long DNA fragment, although it is often necessary to use DNA corresponding to 
a regulatory region for a gene known to be controlled by the protein of interest. 
DNA footprinting is based on nucleases or chemicals that randomly cleave DNA 
at every phosphodiester bond. A bound gene regulatory protein blocks the 
phosphodiester bonds from attack, thereby revealing the protein's precise recog- 
nition site as a protected zone, or footprint (Figure 7-29). 

A second way of determining the DNA sequences recognized by a gene reg- 
ulatory protein requires no prior knowledge of what genes the protein might 
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Figure 7-28 DNA affinity chromatography. 

In the first step, all the proteins that can 
bind DNA are separated from the remainder 
of the cell proteins on a column containing 
a huge number of different DNA sequences. 
Most sequence-specific DNA-binding 
proteins have a weak (nonspecific) affinity 
for bulk DNA and are therefore retained on 
the column. This affinity is due largely to 
ionic attractions, and the proteins can be 
washed off the DNA by a solution that 
contains a moderate concentration of salt. 
In the second step, the mixture of DNA- 
binding proteins is passed through a 
column that contains only DNA of a 
particular sequence. Typically, all the DNA- 
binding proteins will stick to the column, 
the great majority by nonspecific 
interactions. These are again eluted by 
solutions of moderate salt concentration, 
leaving on the column only those proteins 
(typically one or only a few) that bind 
specifically and therefore very tightly to the 
particular DNA sequence. These remaining 
proteins can be eluted from the column by 
solutions containing a very high 
concentration of salt. 
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Figure 7-29 DNA footprinting. 

(A) Schematic of the method. A DNA 
fragment is labeled at one end with 32 P, a 
procedure described in Figure 8-34; next, 
the DNA is cleaved with a nuclease or 
chemical that makes random, single- 
stranded cuts. After the DNA molecule is 
denatured to separate its two strands, the 
resultant fragments from the labeled 
strand are separated on a gel and 
detected by autoradiography (see Figure 
8-33). The pattern of bands from DNA cut 
in the presence of a DNA-binding protein 
is compared with that from DNA cut in its 
absence. When protein is present, it 
covers the nucleotides at its binding site 
and protects their phosphodiester bonds 
from cleavage. As a result, those labeled 
fragments that would otherwise 
terminate in the binding site are missing, 
leaving a gap in the gel pattern called a 
"footprint." In the example shown, the 
DNA-binding protein protects seven 
phosphodiester bonds from the DNA 
cleaving agent. (B) An actual footprint 
used to determine the binding site for a 
gene regulatory protein from humans. 
The cleaving agent was a small, iron- 
containing organic molecule that 
normally cuts at every phosphodiester 
bond with nearly equal frequency. 
(B, courtesy of Michele Sawadogo and 
Robert Roeder.) 
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regulate. Here, the purified protein is used to select, from a large, randomly gen- 
erated pool of different short DNA fragments, only those that bind tightly to it. 
After several rounds of such selection, the nucleotide sequences of the tightly 
bound DNAs are determined, and a consensus DNA recognition sequence for 
the gene regulatory protein can be formulated (Figure 7-30). Once the DNA 
sequence recognized by a gene regulatory protein is known, computerized 
genome searches can identify candidate genes whose transcription the gene 



Figure 7-30 A method for determining the DNA sequence recognized by 
a gene regulatory protein. A purified gene regulatory protein is mixed 
with millions of different short DNA fragments, each with a different 
sequence of nucleotides. A collection of such DNA fragments can be 
produced by programming a DNA synthesizer, a machine that chemically 
synthesizes DNA of any desired sequence (discussed in Chapter 8), For 
example, there are 4^ or approximately 4.2 million, possible sequences for 
a DNA fragment of 1 1 nucleotides. The double-stranded DNA fragments 
that bind tightly to the gene regulatory protein are then separated from 
the DNA fragments that fail to bind. One method for accomplishing this 
separation is through gel-mobility shifts, as illustrated in Figure 7-27. After 
separation of the DNA-protein complexes from the free DNA, the DNA 
fragments are removed from the protein and typically used for several 
additional rounds of the same selection process (not shown). The 
nucleotide sequences of those DNA fragments that remain through 
multiple rounds of binding and release can be determined, and a 
consensus DNA recognition sequence can thus be generated. 
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DNA sequences from five closely related yeast species 

1 TGATGACAGTCTTAATATCATCTGCAAC TCTTGAAATCTTGCTTTATAGTCAAAATTTACGTACGCTTTTCACTATATAATATGATTTGTCAAT 

2 CAACGGTAGTTTCGAGGTTGCATATAAT CCGGTGGA-CTGGCGTTAAAGTTAGAAGTCCACTTCACTTCT-TC--ATTG-TATTCTGTCTTATC 

3 TCTTGATGGCAGTCTTGATACCGTGTAAAAC CCACGTGGTCTAGTCTCATACTCAAAATT-ACGTCCACTTTCCCCTGTATATTATGTTTTGTCGAT 

4 CAAAACTGATCC-AGAAGCACTCCTGATTCACCTTTGATACCAATCGATTCATTCAAATCGGCCTGAACCTTG-AT ATA-TGTATCTTGCCCCT 

5 TAATACTGATCCCATAGGTGCTCTTAA--CACCCGCAGT-CTGGCCTTATGATTGAAAGTTAATCGAACTTTT-ATTGTATA-TAACGCTATGTATT 

binding site for gene regulatory protein 



1 GTGATGAGTGAATGTCTCCCTGTTACCCGGTT-TTCATGTTGATTTTTGTTTCAGGCTCTAA- ATGTT TGATGCAATATTTAACAAG|GAGAACAGAAA — 

2 AACATCCGTAJATCAATTCTTGATACCC{KTCTCGGCTCGTTGA 

3 GTC CGTAGAGCACTCTCTGTTACCCGGATATTCCTGTTATCTTTTGTTTCAAGCTTTAA-AAAT5 TGATGCAATATTTAACAAdCAGTACAGGAA- - 

4 ATCGTATCAGAATTTATTGG- ATTACCCGGGCCGACCCTTTTTTGCGTGTTTCAAGCTTCAA-AAAAC TGATGCAATTTTTAACAAGrGGTATATA 

5 GGCACAACCAAATATATTTTCGTTACCCGAACCAGCTTTTAATTATCTGTTTCAGG TGRTGCAATTCTTAAOU«^CAATACATAGA- - 



1 TGTTTTGTGACAGCACCTGTCAATTT-TAGGATAGTAGCAATCGCAAAC GTTCTCAATAATTCTAAGA m 

2 CTCTGCTCTATAGTAGCACTTCTAACTTCATTGAGAAACAATAAAGACAGAA CTACTTAACAGCTCTAGCA gene 

3 TGTTTTGTGATAGCACTTCTCAGTTT-TGAAATAACAGCAACCGCAGAC A CAAAACCTCTAAA I 

4 TGTTGTACGATAGCACCCTTGTGTTCGCTTGAAAACACCAAAGGAAGACAGCTAGCCCCATCCCCACGACTCCAGC ^equen 

5 TGTTTTGTGATAGCACTCTCAAGTTTACTTGAAAAGGACAAAAGAAGAA CCGCCCGACGCCTCCAAT ■ 

Figure 7-31 Phylogeneticfootprinting.This example compares DNA sequences upstream of the same gene from five 
closely related yeasts; identical nucleotides are highlighted in yellow. Phylogenetic footprinting reveals DNA recognition 
sites for regulatory proteins, as they are typically more conserved than surrounding sequences. Only the region upstream 
of a particular gene is shown in this example, but the approach is typically used to analyze entire genomes. The gene 
regulatory proteins that bind to the site outlined in red are shown in Figure 7-21 . Some of the shorter phylogenetic 
footprints in this example represent binding sites for additional gene regulatory proteins, not all of which have been 
identified. (From M. Kellis et al., Nature 423:241 -254, 2003, with permission from Macmillan Publishers Ltd., and 
DJ. Galgoczy et al., Proc. Natl Acad. Sci. U.S.A. 101:1 8069-1 8074, 2004, with permission from National Academy of Sciences.) 

regulatory protein of interest might control. However, this strategy is not fool- 
proof. For example, many organisms produce a set of closely related gene regu- 
latory proteins that recognize very similar DNA sequences, and this approach 
cannot resolve them. In most cases, predictions of the sites of action of gene reg- 
ulatory proteins obtained from searching genome sequences must, in the end, 
be tested experimentally. 

Phylogenetic Footprinting Identifies DNA Regulatory Sequences 
Through Comparative Genomics 

The widespread availability of complete genome sequences provides a surpris- 
ingly simple method for identifying important regulatory sites on DNA, even 
when the gene regulatory protein that binds them is unknown. In this approach, 
genomes from several closely related species are compared. If the species are 
chosen properly, the protein-coding portions of the genomes will be very simi- 
lar, but the regions between sequences that encode protein or RNA molecules 
will have diverged considerably as most of this sequence is functionally irrele- 
vant and therefore not constrained in evolution. Among the exceptions are the 
regulatory sequences that control gene transcription. These stand out as con- 
served islands in a sea of nonconserved nucleotides (Figure 7-31). Although the 
identity of the gene regulatory proteins that recognize the conserved DNA 
sequences must be determined by other means, phylogenetic footprinting is a 
powerful method for identifying many of the DNA sequences that control gene 
expression. 



Chromatin Immunoprecipitation Identifies Many of the Sites That 
Gene Regulatory Proteins Occupy in Living Cells 

A gene regulatory protein will not occupy all of its potential DNA-binding sites 
in the genome at a particular time. Under some conditions, the protein may not 
be synthesized, and so will be absent from the cell; it may be present but lacking 
a heterodimer partner; or it may be excluded from the nucleus until an appro- 
priate signal is received from the cell's environment. Even if the gene regulatory 
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Figure 7-32 Chromatin immunoprecipitation. This method allows the 
identification of all the sites in a genome that a gene regulatory protein 
occupies in vivo. For the amplification of DNA by a polymerase chain 
reaction (PCR), see Figure 8-45. The identities of the precipitated, amplified 
DNA fragments can be determined by hybridizing the mixture of 
fragments to DNA microarrays, as described in Chapter 8. 



protein is present in the nucleus and is competent to bind DNA, components of 
chromatin or other gene regulatory proteins that can bind to the same or over- 
lapping DNA sequences may occlude many of its potential binding sites on 
DNA. 

Chromatin immunoprecipitation provides one way of empirically deter- 
mining the sites on DNA that a given gene regulatory protein occupies under a 
particular set of conditions (Figure 7-32). In this approach, proteins are cova- 
lently cross-linked to DNA in living cells, the cells are broken open, and the DNA 
is mechanically sheared into small fragments. Antibodies directed against a 
given gene regulatory protein are then used to purify DNA that became cova- 
lently cross-linked to that protein in the cell. If this DNA is hybridized to 
microarrays that contain the entire genome displayed as a series of discrete DNA 
fragments (see Figure 8-73), the precise genomic location of each precipitated 
DNA fragment can be determined. In this way, all the sites occupied by the gene 
regulatory protein in the original cells can be mapped on the cell's genome (Fig- 
ure 7-33). 

Chromatin immunoprecipitation is also routinely used to identify the posi- 
tions along a genome that are packaged by the various types of modified his- 
tones (discussed in Chapter 4). In this case, antibodies specific to the particular 
histone modification of interest are employed. 
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Summary 

Gene regulatory proteins recognize short stretches of double- helical DNA of defined 
sequence and thereby determine which of the thousands of genes in a cell will be tran- 
scribed. Thousands of gene regulatory proteins have been identified in a wide variety 
of organisms. Although each of these proteins has unique features, most bind to DNA 
as homodimers or heterodimers and recognize DNA through one of a small number of 
structural motifs. The common motifs include the helix-turn-helix } the homeo- 
domain, the leucine zipper, the helix-loop-helix, and zinc fingers of several types. The 
precise amino acid sequence that is folded into a motif determines the particular DNA 
sequence that a gene regulatory protein recognizes. Heterodimerization increases the 
range of DNA sequences that can be recognized. Powerful techniques are now available 
for identifying and isolating these proteins, the genes that encode them, and the DNA 
sequences they recognize, and for mapping all of the genes that they regulate on a 
genome. 
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HOW GENETIC SWITCHES WORK 

In the previous section, we described the basic components of genetic switches: 
gene regulatory proteins and the specific DNA sequences that these proteins 
recognize. We shall now discuss how these components operate to turn genes on 
and off in response to a variety of signals. 

In the mid-twentieth century the idea that genes could be switched on and 
off was revolutionary. This concept was a major advance, and it came originally 
from the study of how E. coli bacteria adapt to changes in the composition of 
their growth medium. Parallel studies of the lambda bacteriophage led to many 
of the same conclusions and helped to establish the underlying mechanism. 
Many of the same principles apply to eucaryotic cells. However, the enormous 
complexity of gene regulation in higher organisms, combined with the packaging 
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of their DNA into chromatin, creates special challenges and some novel oppor- 
tunities for control — as we shall see. We begin with the simplest example — an 
on-off switch in bacteria that responds to a single signal. 



The Tryptophan Repressor Is a Simple Switch That Turns Genes 
On and Off in Bacteria 

The chromosome of the bacterium E. coli, a single-celled organism, is a single 
circular DNA molecule of about 4.6 x 10 6 nucleotide pairs. This DNA encodes 
approximately 4300 proteins, although the cell makes only a fraction of these at 
any one time. The expression of many genes is regulated according to the avail- 
able food in the environment. This is illustrated by the five E. coli genes that code 
for enzymes that manufacture the amino acid tryptophan. These genes are 
arranged as a single operon; that is, they are adjacent to one another on the 
chromosome and are transcribed from a single promoter as one long mRNA 
molecule (Figure 7-34). But when tryptophan is present in the growth medium 
and enters the cell (when the bacterium is in the gut of a mammal that has just 
eaten a meal of protein, for example), the cell no longer needs these enzymes 
and shuts off their production. 

The molecular basis for this switch is understood in considerable detail. As 
described in Chapter 6, a promoter is a specific DNA sequence that directs 
RNA polymerase to bind to DNA, to open the DNA double helix, and to begin 
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Figure 7-33 A gene regulatory circuit: the 
complete set of genes controlled by three 
key regulatory proteins in budding yeast, 
as deduced from the DNA sites where the 
regulatory proteins bind. The regulatory 
proteins — called Matal, Matal, and 
Mata2 — specify the two different haploid 
mating types (analogous to male and 
female gamates) of this unicellular 
organism. The 1 6 chromosomes in the yeast 
genome are shown (gray), with colored bars 
indicating sites where various combinations 
of the three regulatory proteins bind. 
Above each binding site is the name of the 
protein product of the regulated target 
gene. Matal, acting in a complex with 
another protein, Mcml, activates 
expression of the genes marked in red; 
Mata2, acting in a complex with Mcm1, 
represses the genes marked in blue; and 
Matal in a complex with Mata2 represses 
the genes marked in green (see Figures 
7-21 and 7-65). Double arrowheads 
represent divergently transcribed genes, 
which are controlled by the indicated gene 
regulatory proteins. This complete map of 
bound regulatory proteins was determined 
using a combination of genome-wide 
chromatin immunoprecipitation (see Figure 
7-32) and phylogeneticfootprinting (see 
Figure 7-29). Such determinations of 
complete transcriptional circuits show that 
transcriptional networks are not infinitely 
complex, although they may appear that 
way initially. This type of study also helps to 
reveal the overall logic of the 
transcriptional circuits used by modern 
cells. (From DJ. Galgoczy et al., Proc Natl 
Acad. Sci. U.S.A. 1 01 :1 8069-1 8074, 2004. 
With permission from National Academy of 
Sciences.) 
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Figure 7-34 The clustered genes in 
E. coli that code for enzymes that 
manufacture the amino acid 
tryptophan. These five genes of the Trp 
operon — denoted as TrpA, B, C, D, and 
E — are transcribed as a single mRNA 
molecule, which allows their expression 
to be controlled coordinately. Clusters of 
genes transcribed as a single mRNA 
molecule are common in bacteria. Each 
such cluster is called an operon. 
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synthesizing an RNA molecule. Within the promoter that directs transcription of 
the tryptophan biosynthetic genes lies a regulator element called an operator 
(see Figure 7-34). This is simply a short region of regulatory DNA of defined 
nucleotide sequence that is recognized by a repressor protein, in this case the 
tryptophan repressor, a member of the helix-turn-helix family (see Figure 
7-11). The promoter and operator are arranged so that when the tryptophan 
repressor occupies the operator, it blocks access to the promoter by RNA poly- 
merase, thereby preventing expression of the tryptophan-producing enzymes 
(Figure 7-35). 

The block to gene expression is regulated in an ingenious way: to bind to its 
operator DNA, the repressor protein has to have two molecules of the amino 
acid tryptophan bound to it. As shown in Figure 7-36, tryptophan binding tilts 
the hellx-turn-helix motif of the repressor so that ii is presented properly to the 
DNA major groove; without tryptophan, the motif swings inward and the pro- 
tein is unable to bind to the operator. Thus, the tryptophan repressor and oper- 
ator form a simple device that switches production of the tryptophan biosyn- 
thetic enzymes on and off according to the availability of free tryptophan. 



GENES ARE OFF 

Figure 7-35 Switching the tryptophan 
genes on and off. If the level of 
tryptophan inside the cell is low, RNA 
polymerase binds to the promoter and 
transcribes the five genes of the 
tryptophan (7rp) operon. If the level of 
tryptophan is high, however, the 
tryptophan repressor is activated to bind 
to the operator, where it blocks the 
binding of RNA polymerase to the 
promoter. Whenever the level of 
intracellular tryptophan drops, the 
repressor releases its tryptophan and 
becomes inactive, allowing the 
polymerase to begin transcribing these 
genes. The promoter includes two key 
blocks of DNA sequence information, the 
-35 and -10 regions highlighted in 
yellow (see Figure 6-12). 
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Figure 7-36 The binding of tryptophan to the tryptophan repressor protein changes its conformation. This structural 
change enables this gene regulatory protein to bind tightly to a specific DNA sequence (the operator), thereby blocking 
transcription of the genes encoding the enzymes required to produce tryptophan (the Trp operon). The three-dimensional 
structure of this bacterial helix-turn-helix protein, as determined by x-ray diffraction with and without tryptophan bound, 
is illustrated. Tryptophan binding increases the distance between the two recognition helices in the homodimer, allowing ' 
the repressor to fit snugly on the operator. (Adapted from R. Zhang et al„ Nature 327:591 -597, 1 987. With permission from 
Macmillan Publishers Ltd.) 
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Because the active, DNA-binding form of the protein serves to turn genes off, 
this mode of gene regulation is called negative control, and the gene regulatory 
proteins that function in this way are called transcriptional repressors or gene 
repressor proteins. 

Transcriptional Activators Turn Genes On 

We saw in Chapter 6 that purified E. coli RNA polymerase (including its a sub- 
unit) can bind to a promoter and initiate DNA transcription. Many bacterial pro- 
moters, however, are only marginally functional on their own, either because 
they are recognized poorly by RNA polymerase or because the polymerase has 
difficulty opening the DNA helix and beginning transcription. In either case 
these poorly functioning promoters can be rescued by gene regulatory proteins 
that bind to a nearby site on the DNA and contact the RNA polymerase in a way 
that dramatically increases the probability that a transcript will be initiated. 
Because the active, DNA-binding form of such a protein turns genes on, this 
mode of gene regulation is called positive control, and the gene regulatory pro- 
teins that function in this manner are known as transcriptional activators or 
gene activator proteins. In some cases, bacterial gene activator proteins aid RNA 
polymerase in binding to the promoter by providing an additional contact sur- 
face for the polymerase. In other cases, they contact RNA polymerase and facil- 
itate its transition from the initial DNA-bound conformation of polymerase to 
the actively transcribing form by stabilizing a transition state of the enzyme. Like 
repressors, gene activator proteins must be bound to DNA to exert their effects. 
In this way each regulatory protein acts selectively controlling only those genes 
that bear a DNA sequence recognized by it. 

DNA-bound activator proteins can increase the rate of transcription initia- 
tion up to 1000-fold, a value consistent with a relatively weak and nonspecific 
interaction between the activator and RNA polymerase. For example, a 1000- 
fold change in the affinity of RNA polymerase for its promoter corresponds to a 
change in AG of ~4 kcal/mole, which could be accounted for by just a few weak, 
noncovalent bonds. Thus gene activator proteins can work simply by providing 
a few favorable interactions that help to attract RNA polymerase to the pro- 
moter. 

As in negative control by a transcriptional repressor, a transcriptional acti- 
vator can operate as part of a simple on-off genetic switch. The bacterial activa- 
tor protein CAP (catabolite activator protein), for example, activates genes that 
enable E. coli to use alternative carbon sources when glucose, its preferred car- 
bon source, is unavailable. Falling levels of glucose cause an increase in the 
intracellular signaling molecule cyclic AMP, which binds to the CAP protein, 
enabling it to bind to its specific DNA sequence near target promoters and 
thereby turn on the appropriate genes. In this way the expression of a target 
gene is switched on or off, depending on whether cyclic AMP levels in the cell are 
high or low, respectively. Figure 7-37 summarizes the different ways that posi- 
tive and negative control can be used to regulate genes. 

Transcriptional activators and transcriptional repressors are similar in 
design. The tryptophan repressor and the transcriptional activator CAP, for exam- 
ple, both use a helix-turn-helix motif (see Figure 7-11) and both require a small 
cofactor in order to bind DNA. In fact, some bacterial proteins (including CAP 
and the bacteriophage lambda repressor) can act as either activators or repres- 
sors, depending on the exact placement of the DNA sequence they recognize in 
relation to the promoter: if the binding site for the protein overlaps the promoter, 
the polymerase cannot bind and the protein acts as a repressor (Figure 7-38). 

A Transcriptional Activator and a Transcriptional Repressor 
Control the Lac Operon 

More complicated types of genetic switches combine positive and negative con- 
trols. The Lac operon in E. coli, for example, unlike the Trp operon, is under both 
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Figure 7-37 Summary of the mechanisms by which specific gene regulatory proteins control gene transcription in 
procaryotes. (A) Negative regulation; (B) positive regulation. Note that the addition of an "inducing" ligand can turn on a 
gene either by removing a gene repressor protein from the DNA (upper left panel) or by causing a gene activator protein to 
bind (lower right panel). Likewise, the addition of an "inhibitory" ligand can turn off a gene either by removing a gene 
activator protein from the DNA (upper right panel) or by causing a gene repressor protein to bind (lower left panel). 



negative and positive transcriptional controls by the Lac repressor protein and 
CAR respectively. The Lac operon codes for proteins required to transport the 
disaccharide lactose into the cell and to break it down. CAP, as we have seen, 
enables bacteria to use alternative carbon sources such as lactose in the absence 
of glucose. It would be wasteful, however, for CAP to induce expression of the 
Lac operon if lactose is not present, and the Lac repressor ensures that the Lac 
operon is shut off in the absence of lactose. This arrangement enables the con- 
trol region of the Lac operon to respond to and integrate two different signals, so 
that the operon is highly expressed only when two conditions are met: lactose 
must be present and glucose must be absent. In any of the other three possible 
signal combinations, the cluster of genes is held in the off state (Figure 7-39). 

The simple logic of this genetic switch first attracted the attention of biolo- 
gists over 50 years ago. As explained above, the molecular basis of the switch was 
uncovered by a combination of genetics and biochemistry providing the first 
glimpse into how gene expression is controlled. 
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Figure 7-38 Some bacterial gene regulatory proteins can act as either a 
transcriptional activator or a repressor, depending on the precise 
placement of their DNA-binding sites. An example is the bacteriophage 
lambda repressor. For some genes, the protein acts as a transcriptional 
activator by providing a favorable contact for RNA polymerase (top). At 
other genes (bottom), the operator is located one base pair closer to the 
promoter, and, instead of helping polymerase, the repressor now competes 
with it for binding to the DNA. The lambda repressor recognizes its 
operator by a helix-turn-helix motif, as shown in Figure 7-1 1 . 
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DNA Looping Occurs During Bacterial Gene Regulation 

The control of the Lac operon as shown in Figure 7-39 is simple and economi- 
cal, but the continued study of this and other examples of bacterial gene regula- 
tion revealed a new feature of gene regulation, known as DNA looping. The Lac 
operon was originally thought to contain a single operator, but subsequent work 
revealed additional, secondary operators located nearby. A single tetrameric 
molecule of the Lac repressor can bind two operators simultaneously, looping 
out the intervening DNA. The ability to bind simultaneously to two operators 
strengthens the overall interaction of the Lac repressor with DNA and thereby 
leads to greater levels of repression in the cell (Figure 7-40). 

DNA looping also allows two different proteins bound along a DNA double 
helix to contact one another readily. The DNA can be thought of as a tether, help- 
ing one DNA-bound protein interact with another even though thousands of 
nucleotide pairs may separate the binding sites for the two proteins (Figure 
7-41 ) . We shall see below that DNA looping is especially important in eucaryotic 




Figure 7-39 Dual control of the Lac 
operon. Glucose and lactose levels 
control the initiation of transcription of 
the Lac operon through their effects on 
CAP and the Lac repressor protein, 
respectively. LacZ, the first gene of the 
Lac operon, encodes the enzyme 
(3-galactosidase, which breaks down the 
disaccharide lactose to galactose and 
glucose. Lactose addition increases the 
concentration of allolactose, an isomer of 
lactose, which binds to the repressor 
protein and removes it from the DNA. 
Glucose addition decreases the 
concentration of cyclic AMP; because 
cyclic AMP no longer binds to CAP, this 
gene activator protein dissociates from 
the DNA, turning off the operon. 

This figure summarizes the essential 
features of the Lac operon, but in reality 
the situation is more complex. There are 
several Lac repressor binding sites 
located at different positions along the 
DNA. Although the one illustrated exerts 
the greatest effect, the others are 
required for full repression (see Figure 
7-40). In addition, expression of the Lac 
operon never completely shuts down. 
A small amount of the enzyme 
(3-galactosidase is required to convert 
lactose to allolactose, thereby permitting 
the Lac repressor to be inactivated when 
lactose is added to the growth medium. 



Figure 7-40 DNA looping can stabilize 
protein-DNA interactions. The Lac 

repressor, a tetramer, can simultaneously 
bind to two operators. The Lac operon 
has a total of three operators, but for 
simplicity, only two are shown here, the 
main operator (O m ) and an auxiliary 
operator (O a ). The figure shows all the 
possible states of the Lac repressor 
bound to these two operators. At the 
concentrations of Lac repressor in the 
cell, and in the absence of lactose, the 
state in the lower right is the most stable, 
and to dissociate completely from the 
DNA, the Lac repressor must first pass 
through an intermediate where it is 
bound to only a single operator. In these 
states, the local concentration of the 
repressor is very high in relation to the 
free operator, and the reaction to the 
double-bound form is favored over the 
dissociation reaction. In this way, even a 
low-affinity site (O a ) can increase the 
occupancy of a high-affinity site (O m ) and 
give higher levels of gene repression in 
the cell. {Adapted from J.M.G. Vilar and 
S. LeibleU Mol. Biol. 331:981-989, 2003. 
With permission from Academic Press.) 
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Figure 7-41 Binding of two proteins to separate sites on the DNA double helix can greatly increase their probability of 
interacting. (A) The tethering of one protein to the other via an intervening DNA loop of 500 nucleotide pairs increases 
their frequency of collision. The intensity of the blue coloring at each point in space indicates the probability that the red 
protein will be located at that distance from the white protein. (B) The flexibility of DNA is such that an average sequence 
makes a smoothly graded 90° bend (a curved turn) about once every 200 nucleotide pairs. Thus, when only 100 nucleotide 
pairs tethers two proteins, the contact between those proteins is relatively restricted. In such cases the protein interaction 
is facilitated when the two protein-binding sites are separated by a multiple of about 1 nucleotide pairs, which places 
both proteins on the same side of the DNA helix (which has about 1 nucleotides per turn) and thus on the inside of the 
DNA loop, where they can best reach each other. (C) The theoretical effective concentration of the red protein at the site 
where the white protein is bound, as a function of their separation. Experiments suggest that the actual effective 
concentrations at short distances are greater than those predicted here. (C, courtesy of Gregory Bellomy, modified from 
M.C. Mossing and M.T. Record, Science 233:889-892, 1986. With permission from AAAS.) 

gene regulation. However, it also plays crucial roles in many examples of bacte- 
rial gene regulation in addition to that of the Lac operon. For example, DNA 
looping readily allows the bacterial gene activator protein NtrC to contact RNA 
polymerase directly even though the two proteins are bound several hundred 
nucleotide pairs apart (Figure 7-42). 

Bacteria Use Interchangeable RNA Polymerase Subunits to Help 
Regulate Gene Transcription 

We have seen the importance of gene regulatory proteins that bind to sequences 
of DNA and signal to RNA polymerase whether or not to start the synthesis of an 
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Figure 7-42 Gene activation at a 
distance. (A) NtrC is a bacterial gene 
regulatory protein that activates 
transcription by directly contacting RNA 
polymerase and causing a transition 
between the initial DNA-bound form of 
the polymerase and the transcriptionally 
competent form (discussed in Chapter 6). 
As indicated, the transition stimulated by 
NtrC requires the energy from ATP 
hydrolysis, although this requirement is 
unusual for bacterial transcription 
initiation. (B) The interaction of NtrC and 
RNA polymerase, with the intervening 
DNA looped out, can be seen in the 
electron microscope. (B, courtesy of 
Harrison Echols and Sydney Kustu.) 
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Table 7-2 Sigma Factors of E. coli 
SIGMA FACTOR PROMOTERS RECOGNIZED 



? 70 
^32 



I 28 
T 28 



t24 



most genes 

genes induced by heat shock 

genes for stationary phase and stress response 

genes involved in motility and chemotaxis 

genes for nitrogen metabolism 

genes dealing with misfoided proteins in the periplasm 



The sigma factor designations refer to their approximate molecular weights, in kilodaltons. 

RNA chain. Although this is one of the main ways in which both eucaryotes and 
procaryotes control transcription initiation, some bacteria and their viruses use 
an additional strategy based on interchangeable subunits of RNA polymerase. 
As described in Chapter 6, a sigma (c) subunit is required for the bacterial RNA 
polymerase to recognize a promoter. Most bacteria produce a whole range of 
sigma subunits, each of which can interact with the RNA polymerase core and 
direct it to a different set of promoters (Table 7-2). This scheme permits one 
large set of genes to be turned off and a new set to be turned on simply by replac- 
ing one sigma subunit with another; the strategy is efficient because it bypasses 
the need to deal with genes one by one. Indeed, some bacteria code for nearly 
one hundred different sigma subunits and therefore rely heavily on this form of 
gene regulation. Bacterial viruses often use it subversively to take over the host 
polymerase and activate several sets of viral genes rapidly and sequentially (Fig- 
ure 7-43). 



Complex Switches Have Evolved to Control Gene Transcription in 
Eucaryotes 

Bacteria and eucaryotes share many principles of gene regulation, including the 
key role played by gene regulatory proteins that bind tightly to short stretches of 
DNA, the importance of weak protein-protein actions in gene activation, and 
the versatility afforded by DNA looping. However, by comparison, gene regula- 
tion in eucaryotes involves many more proteins, much longer stretches of DNA, 
and often seems bewilderingly complex. This increased complexity provides the 
eucaryotic cell with an important advantage. Genetic switches in bacteria, as we 
have seen, typically respond to one or a few signals. But in eucaryotes it is com- 
mon for dozens of signals to converge on a single promoter, with the transcrip- 
tion machinery integrating all these different signals to produce the appropriate 
level of mRNA. We begin our description of eucaryotic gene regulation by out- 
lining the main features that distinguish it from gene regulation in bacteria. 
• As discussed in Chapter 6, eucaryotic RNA polymerase II, which tran- 
scribes all the protein-coding genes, requires five general transcription fac- 
tors (27 subunits in toto, see Table 6-3, p. 341), whereas bacterial RNA poly- 
merase needs only a single general transcription factor, the c subunit. As 
we have seen, the stepwise assembly of the general transcription factors at 
a eucaryotic promoter provides, in principle, multiple steps at which the 
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Figure 7-43 Interchangeable RNA 
polymerase subunits as a strategy to 
control gene expression in a bacterial 

virus. The bacterial virus SP01 , which 
infects the bacterium B. subtilis, uses the 
bacterial polymerase to transcribe its 
early genes immediately after the viral 
DNA enters the cell. One of the early 
genes, called 28, encodes a sigmalike 
factor that binds to RNA polymerase and 
displaces the bacterial sigma factor. This 
new form of polymerase specifically 
initiates transcription of the SP01 
"middle" genes. One of the middle genes 
encodes a second sigmalike factor, 34, 
that displaces the 28 product and directs 
RNA polymerase to transcribe the "late" 
genes. This last set of genes produces the 
proteins that package the virus 
chromosome into a virus coat and lyse 
the cell. By this strategy, sets of virus 
genes are expressed in the order in which 
they are needed; this ensures a rapid and 
efficient viral replication. 
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cell can speed up or slow down the rate of transcription initiation in 
response to gene regulatory proteins. 

• Eucaryotic cells lack operons — sets of related genes transcribed as a unit — 
and therefore must regulate each gene individually. 

• Each bacterial gene is typically controlled by one or only a few gene regu- 
latory proteins, but it is common in eucaryotes for genes to be controlled 
by many (sometimes hundreds) of different regulatory proteins. This com- 
plexity is possible because, as we shall see, many eucaryotic gene regula- 
tory proteins can act over very large distances (tens of thousands of 
nucleotide pairs) along DNA, allowing an almost unlimited number of 
them to influence the expression of a single gene. 

• A central component of gene regulation in eucaryotes is Mediatory a 24- 
subunit complex, which serves as an intermediary between gene regula- 
tory proteins and RNA polymerase (see Figure 6-19). Mediator provides an 
extended contact area for gene regulatory proteins compared to that pro- 
vided by RNA polymerase alone, as in bacteria. 

• The packaging of eucaryotic DNA into chromatin provides many opportu- 
nities for transcriptional regulation not available to bacteria. 

Having discussed the general transcription factors for RNA polymerase II in 
Chapter 6 (see pp. 340-343), we focus here on the last four of these features and 
how they are used to control eucaryotic gene expression. 

A Eucaryotic Gene Control Region Consists of a Promoter Plus 
Regulatory DNA Sequences 

Because the typical eucaryotic gene regulatory protein controls transcription 
when bound to DNA far away from the promoter, the DNA sequences that con- 
trol the expression of a gene are often spread over long stretches of DNA. We 
use the term gene control region to describe the whole expanse of DNA 
involved in regulating and initiating transcription of a gene, including the pro- 
moter, where the general transcription factors and the polymerase assemble, 
and all of the regulatory sequences to which gene regulatory proteins bind to 
control the rate of the assembly processes at the promoter (Figure 7-44). In 
animals and plants, it is not unusual to find the regulatory sequences of a gene 
dotted over distances as great as 50,000 nucleotide pairs. Much of this DNA 
serves as "spacer" sequences that gene regulatory proteins do not directly rec- 
ognize, but this DNA may provide the flexibility needed for efficient DNA loop- 
ing. In this context, it is important to remember that, like other regions of 
eucaryotic chromosomes, most of the DNA in gene control regions is packaged 
into nucleosomes and higher-order forms of chromatin, thereby compacting 
its length and altering its properties. 

In this chapter, we shall loosely use the term gene to refer only to a segment 
of DNA that is transcribed into RNA (see Figure 7-44). However, the classical 
view of a gene includes the gene control region as well, making most eucaryotic 
genes considerably larger. The discovery of alternative RNA splicing has further 
complicated the definition of a gene — a point we discussed briefly in Chapter 6 
and will return to later in this chapter. 

It is the gene regulatory proteins that allow the genes of an organism to be 
turned on or off individually. In contrast to the small number of general tran- 
scription factors, which are abundant proteins that assemble on the promoters 
of all genes transcribed by RNA polymerase II, there are thousands of different 
gene regulatory proteins. For example, of the roughly 25,000 human genes, an 
estimated 8% (-2000 genes) encode gene regulatory proteins. Most of these rec- 
ognize DNA sequences using one of the DNA-binding motifs described previ- 
ously. Not surprisingly, the eucaryotic cell regulates each of its many genes in a 
unique way. Given the sheer number of genes in eucaryotes and the complexity 
of their regulation, it has been difficult to formulate simple rules for gene regu- 
lation that apply in every case. We can, however, make some generalizations 
about how gene regulatory proteins, once bound to gene control regions on 
DNA, set in motion the train of events that lead to gene activation or repression. 
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Eucaryotic Gene Activator Proteins Promote the Assembly of RNA 
Polymerase and the General Transcription Factors at the 
Startpoint of Transcription 

The DNA sites to which eucaryotic gene activator proteins bind were originally 
called enhancers because their presence "enhanced" the rate of transcription 
initiation. It came as a surprise when it was first discovered that these activator 
proteins could be bound tens of thousands of nucleotide pairs away from the 
promoter, but, as we have seen, DNA looping provides at least one explanation 
for this initially puzzling observation. 

The simplest gene activator proteins have a modular design consisting of 
two distinct domains. One domain usually contains one of the structural motifs 
discussed previously that recognizes a specific DNA sequence. The second 
domain — sometimes called an activation domain — accelerates the rate of tran- 
scription initiation. This type of modular design was first revealed by experi- 
ments in which genetic engineering techniques were used to create a chimeric 
protein containing the activation domain of one protein fused to the DNA-bind- 
ing domain of a different protein (Figure 7-45). 

Once bound to DNA, how do eucaryotic gene activator proteins increase the 
rate of transcription initiation? As we will see shortly, there are several mecha- 
nisms by which this can occur, and, in many cases, these different mechanisms 
work in concert at a single promoter. But, regardless of the precise biochemical 
pathway, the ultimate function of activators is to attract, position, and modify 
the general transcription factors, Mediator, and RNA polymerase II at the pro- 
moter so that transcription can begin. They do this both by acting directly on 
these components and, indirectly, by changing the chromatin structure around 
the promoter. 

Some activator proteins bind directly to one or more of the general tran- 
scription factors, accelerating their assembly on a promoter that is linked 
through DNA to that activator. Others interact with Mediator and attract it to 
DNA where it can then facilitate assembly of RNA polymerase and the general 
transcription factors at the promoter (see Figure 7-44). In this sense, eucaryotic 



Figure 7-44 The gene control region for 
a typical eucaryotic gene. The promoter 
is the DNA sequence where the general 
transcription factors and the polymerase 
assemble (see Figure 6-1 6). The 
regulatory sequences serve as binding 
sites for gene regulatory proteins, whose 
presence on the DNA affects the rate of 
transcription initiation. These sequences 
can be located adjacent to the promoter, 
far upstream of it, or even within introns 
or downstream of the gene. As shown in 
the lower panel, DNA looping allows 
gene regulatory proteins bound at any of 
these positions to interact with the 
proteins that assemble at the promoter. 
Many gene regulatory proteins act 
through Mediator, while others influence 
the general transcription factors and RNA 
polymerase directly. Although not shown 
here, many gene regulatory proteins also 
influence the chromatin structure of the 
DNA control region thereby affecting 
transcription initiation indirectly (see 
Figure 4-45). As noted in the text, for 
simplicity, "gene X" refers here to the 
coding sequence within the gene. 

Whereas Mediator and the general 
transcription factors are the same for all 
polymerase II transcribed genes, the 
gene regulatory proteins and the 
locations of their binding sites relative to 
the promoter differ for each gene. 
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activators resemble those of bacteria in recruiting RNA polymerase to specific 
sites on DNA so it can begin transcribing. 



Eucaryotic Gene Activator Proteins Also Modify Local Chromatin 
Structure 

The general transcription factors, Mediator, and RNA polymerase seem unable 
on their own to assemble on a promoter that is packaged in standard nucleo- 
somes. Indeed, it has been proposed such packaging may have evolved to pre- 
vent "leaky" transcription. In addition to their direct actions in assembling the 
transcription machinery at the promoter, gene activator proteins also promote 
transcription initiation by changing the chromatin structure of the regulatory 
sequences and promoters of genes. 

As we saw in Chapter 4, four of the most important ways of locally altering 
chromatin are through covalent histone modifications, nucleosome remodel- 
ing, nucleosome removal, and nucleosome replacement. Gene activator pro- 
teins use all four of these mechanisms by attracting histone modification 
enzymes, ATP-dependent chromatin remodeling complexes, and histone chap- 
erones to alter the chromatin structure of promoters they control (Figure 7-46). 
In general terms, these local alterations in chromatin structure are believed to 
make the underlying DNA more accessible, thereby facilitating the assembly of 
the general transcription factors, Mediator, and RNA polymerase at the pro- 
moter. Local chromatin modification also allows additional gene regulatory pro- 
teins to bind to the control region of the gene. However, the most important role 
of covalent histone modifications in transcription is probably not in directly 
changing chromatin structure; rather, as discussed in Chapter 4, these modifi- 
cations provide favorable interactions for the binding of a large set of proteins 
that read a "histone code." For transcription initiation, these proteins include 
other histone-modifying enzymes (reader-writer complexes), chromatin 
remodeling complexes, and at least one of the general transcription factors (Fig- 
ure 7-47). 

The alterations of chromatin structure that occur during transcription initi- 
ation can persist for variable lengths of time. In some cases, as soon as the gene 
regulatory protein dissociates from DNA, the chromatin modifications are 
rapidly reversed, restoring the gene to its pre-activated state. This rapid reversal 
is especially important for genes that the cell must quickly switch on and off in 



Figure 7-45 The modular structure of a 
gene activator protein. Outline of an 
experiment that reveals the presence of 
independent DNA-binding and 
transcription-activating domains in the 
yeast gene activator protein Gal4. 
A functional activator can be reconstituted 
from the C-terminal portion of the yeast 
Gal4 protein if it is attached to the DNA- 
binding domain of a bacterial gene 
regulatory protein (the LexA protein) by 
genetic engineering techniques. When the 
resulting bacterial-yeast hybrid protein is 
produced in yeast cells, it will activate 
transcription from yeast genes provided 
that the specific DNA-binding site for the 
bacterial protein has been inserted next to 
them. (A) Gal4 is normally responsible for 
activating the transcription of yeast genes 
that code for the enzymes that convert 
galactose to glucose. (B) A chimeric gene 
regulatory protein, produced by genetic 
engineering techniques, requires a LexA 
recognition sequence to activate 
transcription. In the experiment shown, 
the control region for one of the genes 
controlled by LexA was fused to the £ coli 
LacZ gene, which codes for the enzyme 
pi-galactosidase (see Figure 7-39). 
(3-Galactosidase is very simple to detect 
biochemically and thus provides a 
convenient way to monitor the expression 
level specified by a gene control region. As 
used here, LacZ is said to serve as a 
reporter gene, since it "reports" the activity 
of a gene control region. 
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response to external signals, such as the glucocorticoid hormone discussed ear- 
lier in this chapter. However, in other cases, the altered chromatin structure 
seems to persist, even after the gene regulatory protein that directed its estab- 
lishment has dissociated from DNA. In principle, this memory can extend into 
the next cell generation because, as discussed in Chapter 4, chromatin structure 
can be self- renewing (see Figure 4-52). It is interesting to consider the possibil- 
ity that different histone modifications persist for different times in order to pro- 
vide the cell with a mechanism for long-, medium-, and short-term memory of 
gene expression patterns. 

A special type of chromatin modification occurs as RNA polymerase II tran- 
scribes through a gene. In most cases, the nucleosomes just ahead of the poly- 
merase are acetylated by writer complexes carried by the polymerase, removed 
by histone chaperones, and deposited behind the moving polymerase. They are 
then rapidly deacetylated and methylated, also by reader-writer complexes that 
are carried by the polymerase, leaving behind nucleosomes that are especially 
resistant to transcription. Although this remarkable process may seem counter- 
intuitive, it likely evolved to prevent spurious transcription re-initiation behind 
a moving polymerase, which is, in essence, clearing a path through chromatin. 
Later in this chapter, when we discuss RNA interference, the potential dangers to 
the cell of such inappropriate transcription will become especially obvious. 

We have just seen that gene activator proteins can profoundly influence 
chromatin structure. However, even before these activator proteins are brought 
into play, many genes are "poised" to become rapidly activated. For example, the 
regulatory regions for many genes are "marked" by a short, nucleosome-free 
region flanked by nucleosomes that contain the histone variant H2AZ. This 
arrangement, which is specified by DNA sequence, allows free access of gene 
regulatory proteins to the nucleosome-free region; in addition, the H2AZ-con- 
taining nucleosomes are thought to be easily disassembled, thus further facili- 
tating transcription initiation. 



Figure 7-46 Four ways eucaryotic 
activator proteins can direct local 
alterations in chromatin structure to 
stimulate transcription initiation. 

Although shown as separate pathways, 
these mechanisms often work together 
during the activation of a gene. For 
example, prior acetylation of histones 
makes it easier for histone chaperones to 
remove them from nucleosomes. A few 
patterns of histone modification that 
promote transcription initiation are listed 
in Figure 4-44, and a specific example is 
given in Figure 7-47. Nucleosome 
remodeling and histone removal favor 
transcription initiation by increasing the 
accessibility of DNA and thereby 
facilitating the binding of Mediator, RNA 
polymerase, and the general 
transcription factors as well as additional 
activator proteins. Transcription initiation 
and the formation of a compact 
chromatin structure can be regarded as 
competing biochemical assembly 
reactions, and enzymes that increase- 
even transiently — the accessibility of 
DNA in chromatin will tend to favor 
transcription initiation. 
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Figure 7-47 Writing and reading the histone code during transcription 
initiation. In this example, taken from the human interferon gene promoter, a 
gene activator protein binds to DNA packaged into chromatin and first 
attracts a histone acetyl transferase to acetylate lysine 9 of histone H3 and 
lysine 8 of histone H4. Next, a histone kinase, attracted by the gene activator 
protein, phosphorylates serine 10 of histone H3, but can only do so after 
lysine 9 has been acetylated. The serine modification then signals the histone 
acetyl transferase to acetylate position K14 of histone H3. At this point the 
histone code for transcription initiation, set into motion by the binding of the 
gene activator protein, has been written. Note that the writing is sequential, 
with each histone modification depending on a prior modification. 

The final reading of the code occurs when the general transcription factor 
TFIID and the chromatin remodeling complex SWI/SNF bind, both of which 
strongly promote the subsequent steps of transcription initiation. TFIID and 
SWI/SNF both recognize acetylated histone tails through a bromodomain, a 
protein domain specialized to read this particular mark on histones; a 
bromodomain is carried in a subunit of each protein complex. (Adapted from 
T. Agalioti, G. Chen and D. Thanos, Cell 1 1 1 :381 -392, 2002. With permission 
from Elsevier.) 



Gene Activator Proteins Work Synergistically 

We have seen that eucaryotic gene activator proteins can influence different 
steps in transcription initiation. In general, where several factors work together 
to enhance a reaction rate, the joint effect is not merely the sum of the enhance- 
ments that each factor alone contributes, but the product. If, for example, factor 
A lowers the free-energy barrier for a reaction by a certain amount and thereby 
speeds up the reaction 100-fold, and factor B, by acting on another aspect of the 
reaction, does likewise, then A and B acting in parallel will lower the barrier by a 
double amount and speed up the reaction 10,000-fold. Even if A and B work sim- 
ply by attracting the same protein, the affinity of that protein for the reaction site 
increases multiplicatively. Thus, gene activator proteins often exhibit transcrip- 
tional synergy, where several activator proteins working together produce a 
transcription rate that is much higher than that of the sum of the activators 
working alone (Figure 7-48). It is not difficult to see how multiple gene regula- 
tory proteins, each binding to a different regulatory DNA sequence, work 
together to control the final rate of transcription of a eucaryotic gene. 

Since gene activator proteins can influence many different steps on the 
pathway to transcriptional activation, it is worth considering whether these 
steps always occur in a prescribed order. For example, does histone modifica- 
tion always precede chromatin remodeling, as in the example of Figure 7-47? 
Does Mediator enter before or after RNA polymerase? The answers to these 
questions appear to be different for different genes— and even for the same 
gene under the influence of different gene regulatory proteins (Figure 7-49). 
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Figure 7-48 Transcriptional synergy. This experiment compares the rate of transcription 
produced by three experimentally constructed regulatory regions in a eucaryotic cell and reveals 
transcriptional synergy, the greater than additive effect of multiple activators. Transcriptional 
synergy is typically observed between different gene activator proteins from the same organism 
and even between activator proteins from different eucaryotic species when they are 
experimentally introduced into the same cell. This last observation reflects the high degree of 
conservation of the elaborate machinery responsible for eucaryotic transcription initiation. 
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Figure 7-49 An order of events leading to transcription initiation of a 
specific gene. In this well-studied example from the budding yeast 
S. cerevisiae, the steps toward transcription initiation occur in a particular 
order; however, this order differs from one gene to the next. For example, 
at another gene, histone modification occurs first, followed by RNA 
polymerase recruitment, followed by chromatin remodeling complex 
recruitment. Figure 7-47 illustrates yet another possible order of events. 



Whatever the precise mechanisms and the order in which they are carried out, a 
gene regulatory protein must be bound to DNA either directly or indirectly to 
influence transcription of its target gene, and the rate of transcription of a gene 
ultimately depends upon the spectrum of regulatory proteins bound upstream 
and downstream of its transcription start site. 



Eucaryotic Gene Repressor Proteins Can Inhibit Transcription in 
Various Ways 

Like bacteria, eucaryotes use gene repressor proteins in addition to activator 
proteins to regulate transcription of their genes. However, because of differences 
in the way that eucaryotes and bacteria initiate transcription, eucaryotic repres- 
sors have many more possible mechanisms of action. We saw in Chapter 4 that 
large regions of the genome can be shut down by the packaging of DNA into het- 
erochromatin. However, eucaryotic genes are rarely organized along the 
genome according to function, so this strategy is not generally useful for most 
examples of gene regulation. Instead, most eucaryotic repressors must work on 
a gene-by-gene basis. Unlike bacterial repressors, most eucaryotic repressors do 
not directly compete with the RNA polymerase for access to the DNA; rather 
they use a variety of other mechanisms, some of which are illustrated in Figure 
7-50. Like gene activator proteins, many eucaryotic repressor proteins act 
through more than one mechanism at a given target gene, thereby ensuring 
robust and efficient repression. 

Gene repression is especially important to animals and plants whose growth 
depends on elaborate and complex developmental programs. Misexpression of 
a single gene at a critical time can have disastrous consequences for the indi- 
vidual. For this reason, many of the genes encoding the most important devel- 
opmental regulatory proteins are kept tightly repressed when they are not 
needed. 
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Eucaryotic Gene Regulatory Proteins Often Bind DNA 
Cooperatively 

So far we have seen that when eucaryotic activator and repressor proteins bind 
to specific DNA sequences, they set in motion a complex series of events that 
culminate in transcription initiation or its opposite, repression. However, these 
proteins rarely recognize DNA as individual polypeptides. In reality, efficient 
DNA binding in the eucaryotic cell typically requires several sequence-specific 
DNA proteins acting together. For example, two gene regulatory proteins with a 
weak affinity for each other might cooperate to bind to a DNA sequence, neither 
protein having a sufficient affinity for DNA to bind to the DNA site on its own. In 
one well-studied case, the DNA-bound protein dimer creates a distinct surface 
that is recognized by a third protein that carries an activator domain that stim- 
ulates transcription. This example illustrates an important general point: pro- 
tein-protein interactions that are too weak to form complexes in solution can do 
so on DNA, with the DNA sequence acting as a "crystallization" site or seed for 
the assembly of a protein complex. 

As shown in Figure 7-51, an individual gene regulatory protein can often 
participate in more than one type of regulatory complex. A protein might func- 
tion, for example, in one case as part of a complex that activates transcription 
and in another case as part of a complex that represses transcription. Thus, 
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individual eucaryotic gene regulatory proteins are not necessarily dedicated 
activators or repressors; instead, they function as regulatory parts that are used 
to build complexes whose function depends on the final assembly of all of the 
individual components. This final assembly, in turn, depends both on the 
arrangement of control region DNA sequences and on the particular gene regu- 
latory proteins present in active form in the cell. Each eucaryotic gene is there- 
fore regulated by a "committee" of proteins, all of which must be present to 
express the gene at its proper level. 

In some cases, the precise DNA sequence to which a regulatory protein 
binds directly can affect the conformation of this protein and thereby influence 
its subsequent transcriptional activity. When bound to one type of DNA 
sequence, for example, a steroid hormone receptor protein interacts with a co- 
repressor and ultimately turns off transcription. When bound to a slightly differ- 
ent DNA sequence, it assumes a different conformation and interacts with a 
coactivator, thereby stimulating transcription. 
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Figure 7-50 Six ways in which eucaryotic gene repressor proteins can operate. (A) Gene activator proteins and gene 
repressor proteins compete for binding to the same regulatory DNA sequence. (B) Both proteins can bind DNA, but the 
repressor binds to the activation domain of the activator protein, thereby preventing it from carrying out its activation 
functions. In a variation of this strategy, the repressor binds tightly to the activator without having to be bound to DNA 
directly. (C)The repressor blocks assembly of the general transcription factors. Some repressors also act at late stages in 
transcription initiation, for example, by preventing the release of the RNA polymerase from the general transcription 
factors. (D) The repressor recruits a chromatin remodeling complex which returns the nucleosomal state of the promoter 
region to its pre-transcriptional form. (E) The repressor attracts a histone deacetylase to the promoter. As we have seen, 
histone acetylation can stimulate transcription initiation (Figure 7-47), and the repressor simply reverses this modification. 
(F) The repressor attracts a histone methyl transferase which modifies certain positions on histones which, in turn, are 
bound by proteins that maintain the chromatin in a transcriptionally silent form. For example, in Drosophila, the histone 
methyl transferase Suv39 methylates the K9 position of histone H3, a modification that is bound by the HP1 protein. In 
another example, E(z) methylates the K27 position of H3, and this modification is bound by the Polycomb protein. HP1 and 
Polycomb recognize methylated lysines through a chromodomain. They can act locally to turn off specific genes, as shown 
here, or can occupy a whole region of a chromosome to repress a cluster of genes. A seventh mechanism of negative 
control— inactivation of a transcriptional activator by heterodimerization— is illustrated in Figure 7-24. For simplicity, 
nucleosomes have been omitted from (A)-(C), and the scales of (D)-(F) have been reduced relative to (A)-(C). 



HOW GENETIC SWITCHES WORK 



447 



(A) IN SOLUTION 




(B) ON DNA 

coactivator 

_i ACTIVATES 

TRANSCRIPTION 



core pressor 



1 



REPRESSES 
TRANSCRIPTION 



1 



W V -- :o.'.o!.:.. 

GENE ON 



GENE OFF 



coactivator 
coactivator ACTIVATES \ ACTIVATES 

TRANSCRIPTION mh TRANSCRIPTION 



GENE ON 



GENE ON 



Typically, a few relatively short stretches of nucleotide sequence guide the 
assembly of a group of regulatory proteins on DNA (see Figure 7-51). However, 
in some extreme cases of regulation by committee a more elaborate 
protein-DNA structure is formed (Figure 7-52). Since the final assembly 
requires the presence of many gene regulatory proteins that bind DNA, it pro- 
vides a simple way to ensure that a gene is expressed only when the cell contains 
the correct combination of these proteins. We saw earlier how the formation of 
heterodimers in solution provides a mechanism for the combinatorial control of 
gene expression. The assembly of complexes of gene regulatory proteins on 
DNA provides a second important mechanism for combinatorial control, one 
that offers far richer opportunities. 



Figure 7-51 Eucaryotic gene regulatory 
proteins often assemble into complexes 
on DNA. Seven gene regulatory proteins 
are shown in (A). The nature and function 
of the complex they form depends on the 
specific DNA sequence that seeds their 
assembly. In (B), some assembled 
complexes activate gene transcription, 
while another represses transcription. 
Note that both the red and the green 
proteins are shared by both activating and 
repressing complexes. Proteins that do not 
themselves bind DNA but assemble on 
other DNA-bound gene regulatory 
proteins are often termed coactivators or 
co-repressors. However, these terms are 
somewhat confusing because they 
encompass an enormous variety of 
proteins including histone readers and 
writers, chromatin remodeling complexes, 
and many other classes of proteins. Some 
have no intrinsic activity themselves but 
simply serve as a "scaffolding" to attract 
those that do. 



Complex Genetic Switches That Regulate Drosophilo 
Development Are Built Up from Smaller Modules 

Given that gene regulatory proteins can be positioned at multiple sites along 
long stretches of DNA, that these proteins can assemble into complexes at each 
site, and that the complexes influence the chromatin structure as well as the 
recruitment and assembly of the general transcription machinery at the pro- 
moter, there would seem to be almost limitless possibilities for the elaboration 
of control devices to regulate eucaryotic gene transcription. 

A particularly striking example of a complex, multicomponent genetic 
switch is that controlling the transcription of the Drosophila Even-skipped [Eve) 
gene, whose expression plays an important part in the development of the 
Drosophila embryo. If this gene is inactivated by mutation, many parts of the 
embryo fail to form, and the embryo dies early in development. As discussed in 
Chapter 22, at the stage of development when Eve begins to be expressed, the 
embryo is a single giant cell containing multiple nuclei in a common cytoplasm. 
This cytoplasm is not uniform, however: it contains a mixture of gene regulatory 
proteins that are distributed unevenly along the length of the embryo, thus pro- 
viding positional information that distinguishes one part of the embryo from 
another (Figure 7-53) . (The way these differences are initially set up is discussed 
in Chapter 22.) Although the nuclei are initially identical, they rapidly begin to 
express different genes because they are exposed to different gene regulatory 
proteins. The nuclei near the anterior end of the developing embryo, for exam- 
ple, are exposed to a set of gene regulatory proteins that is distinct from the set 
that influences nuclei at the posterior end of the embryo. 

The regulatory DNA sequences controlling the Eve gene are designed to read 
the concentrations of gene regulatory proteins at each position along the length 
of the embryo and to interpret this information in such a way that the Eve gene 
is expressed in seven stripes, each initially five to six nuclei wide and positioned 
precisely along the anterior-posterior axis of the embryo (Figure 7-54). How is 
this remarkable feat of information processing carried out? Although not all of 
the molecular details are understood, several general principles have emerged 
from studies of Eve and other Drosophila genes that are similarly regulated. 
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Figure 7-52 Schematic depiction of a 
committee of gene regulatory proteins 
bound to an enhancer. The protein 
shown in yellow is called an architectural 
protein since its main role is to bend the 
DNA to allow the cooperative assembly 
of the other components. The structure 
depicted here is based on that found in 
the control region of the gene that codes 
for a subunit of the T cell receptor 
(discussed in Chapter 25), and it activates 
transcription at a nearby promoter. Only 
certain cells of the developing immune 
system, which eventually give rise to 
mature T cells, have the complete set of 
proteins needed to form this structure. 
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Figure 7-53 The nonuniform distribution of four gene regulatory 
proteins in an early Drosophila embryo. At this stage the embryo is a 
syncytium, with multiple nuclei in a common cytoplasm. Although the 
detail is not shown in these drawings, all of these proteins are 
concentrated in the nuclei 



The regulatory region of the Eve gene is very large (approximately 20,000 
nucleotide pairs). It is formed from a series of relatively simple regulatory mod- 
ules, each of which contains multiple regulatory sequences and is responsible 
for specifying a particular stripe of Eve expression along the embryo. This mod- 
ular organization of the Eve gene control region is revealed by experiments in 
which a particular regulatory module (say that specifying stripe 2) is removed 
from its normal setting upstream of the Eve gene, placed in front of a reporter 
gene (see Figure 7-45), and reintroduced into the Drosophila genome. When 
developing embryos derived from flies carrying this genetic construct are exam- 
ined, the reporter gene is found to be expressed in precisely the position of stripe 
2 (Figure 7-55). Similar experiments reveal the existence of other regulatory 
modules, each of which specifies either one of the other six stripes or some other 
part of the Eve expression pattern normally displayed at later stages of develop- 
ment (see Figure 22-39). 



The Drosophila Eve Gene Is Regulated by Combinatorial Controls 

A detailed study of the stripe 2 regulatory module has provided insights into how 
it reads and interprets positional information. It contains recognition sequences 
for two gene regulatory proteins (Bicoid and Hunchback) that activate Eve tran- 
scription and two (Kruppel and Giant) that repress it (Figure 7-56). (The gene 
regulatory proteins of Drosophila often have colorful names reflecting the phe- 
notype that results if the gene encoding the protein is inactivated by mutation.) 
The relative concentrations of these four proteins determine whether the pro- 
tein complexes that form at the stripe 2 module activate transcription of the Eve 
gene. Figure 7-57 shows the distributions of the four gene regulatory proteins 
across the region of a Drosophila embryo where stripe 2 forms. It is thought that 
either of the two repressor proteins, when bound to the DNA, will turn off the 




Figure 7-54 The seven stripes of the 
protein encoded by the Even-skipped 
(Eve) gene in a developing Drosophila 
embryo. Two and one-half hours after 
fertilization, the egg was fixed and 
stained with antibodies that recognize 
the Eve protein (green) and antibodies 
that recognize the Giant protein (red). 
Where Eve and Giant proteins are both 
present, the staining appears yellow. At 
this stage in development, the egg 
contains approximately 4000 nuclei. The 
Eve and Giant proteins are both located 
in the nuclei, and the Eve stripes are 
about four nuclei wide. The staining 
pattern of the Giant protein is also shown 
in Figure m7-52/7-53. (Courtesy of 
Michael Levine.) 



Figure 7-55 Experiment demonstrating 
the modular construction of the 
Eve gene regulatory region. 

(A) A 480-nucleotide-pair piece of the Eve 
regulatory region was removed and 
inserted upstream of a test promoter that 
directs the synthesis of the enzyme 
(3-galactosidase (the product of the £ coli 
LacZ gene). (B) When this artificial 
construct was reintroduced into the 
genome of Drosophila embryos, the 
embryos expressed p-galactosidase 
(detectable by histochemical staining) 
precisely in the position of the second of 
the seven Eve stripes (C). (B and 
C, courtesy of Stephen Small and 
Michael Levine.) 



stnpe 3 
module 


stripe 2 
module 


stripe 7 
module 




■■ 


^H 




J 


TATA Eve gene 


I 




* 




stripe 2 

module 




TATA LacZ gene 



<Ai 




HOW GENETIC SWITCHES WORK 



449 






stripe 2 module: 480 nucleotide pairs 




Kruppel and its 
binding site 



Giant and its 
binding site 



Bicoid and its 

binding site 



Hunchback and 
its binding site 



stripe 2 module, whereas both Bicoid and Hunchback must bind for its maximal 
activation. This simple regulatory unit thereby combines these four positional 
signals so as to turn on the stripe 2 module (and therefore the expression of the 
Eve gene) only in those nuclei that are located where the levels of both Bicoid 
and Hunchback are high and both Kruppel and Giant are absent. This combina- 
tion of activators and repressors occurs in only one region of the early embryo; 
everywhere else, therefore, the stripe 2 module is silent. 

We have thus far discussed two mechanisms of combinatorial control of 
gene expression — heterodimerization of gene regulatory proteins in solution 
(see Figure 7-20) and the assembly of combinations of gene regulatory proteins 
into small complexes on DNA (see Figure 7-51). It is likely that both mechanisms 
participate in the complex regulation of Eve expression. In addition, the regula- 
tion of stripe 2 just described illustrates a third type of combinatorial control. 



Figure 7-56 Close-up view of the Eve 
stripe 2 unit. The segment of the Eve 
gene control region identified in the 
previous figure contains regulatory 
sequences, each of which binds one or 
another of four gene regulatory proteins. 
It is known from genetic experiments 
that these four regulatory proteins are 
responsible for the proper expression of 
Eve in stripe 2. Flies that are deficient in 
the two gene activators Bicoid and 
Hunchback, for example, fail to express 
efficiently Eve in stripe 2. In flies deficient 
in either of the two gene repressors, 
Giant and Kruppel, stripe 2 expands and 
covers an abnormally broad region of the 
embryo. The DNA binding sites for these 
gene regulatory proteins were 
determined by cloning the genes 
encoding the proteins, overexpressing 
the proteins in E, coli, purifying them, and 
performing DNA-footprinting 
experiments (see Figure 7-29). The top 
diagram indicates that, in some cases, the 
binding sites for the gene regulatory 
proteins overlap and the proteins can 
compete for binding to the DNA. For 
example, binding of Kruppel and binding 
of Bicoid to the site at the far right are 
thought to be mutually exclusive. 
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Figure 7-57 Distribution of the gene regulatory proteins responsible for ensuring that Eve is 
expressed in stripe 2. The distributions of these proteins were visualized by staining a 
developing Drosophila embryo with antibodies directed against each of the four proteins (see 
Figures 7-53 and 7-54). The expression of Eve\n stripe 2 occurs only at the position where the 
two activators (Bicoid and Hunchback) are present and the two repressors (Giant and Kruppel) 
are absent. In fly embryos that lack Kruppel, for example, stripe 2 expands posteriorly. Likewise, 
stripe 2 expands posteriorly if the DNA-binding sites for Kruppel in the stripe 2 module (see 
Figure 7-56) are inactivated by mutation. 

The Eve gene itself encodes a gene regulatory protein, which, after its pattern of expression is 
set up in seven stripes, regulates the expression of other Drosophila genes. As development 
proceeds, the embryo is thus subdivided into finer and finer regions that eventually give rise to 
the different body parts of the adult fly, as discussed in Chapter 22. 

This example from Drosophila embryos is unusual in that the nuclei are exposed directly to 
positional cues in the form of concentrations of gene regulatory proteins. In embryos of most 
other organisms, individual nuclei are in separate cells, and extracellular positional information 
must either pass across the plasma membrane or, more usually, generate signals in the cytosol 
in order to influence the genome. 
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Figure 7-58 The integration of multiple 
inputs at a promoter. Multiple sets of 
gene regulatory proteins can work 
together to influence transcription 
initiation at a promoter, as they do in the 
Eve stripe 2 module illustrated in Figure 
7-56. It is not yet understood in detail 
how the cell achieves integration of 
multiple inputs, but it is likely that the 
final transcriptional activity of the gene 
results from a competition between 
activators and repressors that act by the 
mechanisms summarized in Figures 7-46 
and 7-50. 



Because the individual regulatory sequences in the Eve stripe 2 module are 
strung out along the DNA, many sets of gene regulatory proteins can be bound 
simultaneously at separate sites and influence the promoter of a gene. The pro- 
moter integrates the transcriptional cues provided by all of the bound proteins 
(Figure 7-58). 

The regulation of Eve expression is an impressive example of combinatorial 
control. Seven combinations of gene regulatory proteins— one combination for 
each stripe— activate Eve expression, while many other combinations (all those 
found in the interstripe regions of the embryo) keep the stripe elements silent. 
The other stripe regulatory modules are thought to be constructed similarly to 
those described for stripe 2, being designed to read positional information pro- 
vided by other combinations of gene regulatory proteins. The entire gene con- 
trol region, strung out over 20,000 nucleotide pairs of DNA, binds more than 20 
different regulatory proteins. A large and complex control region is thereby built 
from a series of smaller modules, each of which consists of a unique arrange- 
ment of short DNA sequences recognized by specific gene regulatory proteins. 



Complex Mammalian Gene Control Regions Are Also Constructed 
from Simple Regulatory Modules 

Perhaps 8% of the coding capacity of a mammalian genome is devoted to the 
synthesis of proteins that serve as regulators of gene transcription. This large 
number of genes reflects the exceedingly complex network of controls govern- 
ing expression of mammalian genes. Each gene is regulated by a set of gene reg- 
ulatory proteins; each of ih.ose proteins is the product of a gene that is in turn 
regulated by a whole set of other proteins, and so on. Moreover, the regulatory 
protein molecules are themselves influenced by signals from outside the cell, 
which can make them active or inactive in a whole variety of ways (Figure 7-59) . 
Thus, we can view the pattern of gene expression in a cell as the result of a com- 
plicated molecular computation that the intracellular gene control network per- 
forms in response to information from the cell's surroundings. We shall discuss 
these issues further in Chapters 15 and 22, which deal with cell signaling and 
development, but the complexity is remarkable even at the level of an individual 
genetic switch regulating the activity of a single gene. It is not unusual, for exam- 
ple, to find a mammalian gene with a control region that is 100,000 nucleotide 
pairs in length, in which many modules, each containing a number of regulatory 
sequences that bind gene regulatory proteins, are interspersed with long 
stretches of other noncoding DNA. 

One of the best-understood examples of a complex mammalian regulatory 
region is found in the human p-globin gene, which is expressed exclusively in red 
blood cells. A complex array of gene regulatory proteins controls the expression 
of the gene, some acting as activators and others as repressors (Figure 7-60). The 
concentrations (or activities) of many of these gene regulatory proteins change 
during development, and only a particular combination of all the proteins trig- 
gers transcription of the gene. The human P-globin gene is part of a cluster of 
globin genes (Figure 7-61A) that are all transcribed exclusively in erythroid cells, 
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that is, cells of the red blood cell lineage, but at different stages of mammalian 
development (see Figure 7-61B). The E-globin gene is expressed in the early 
embryo, y in the later embryo and fetus, and 5 and p primarily in the adult. The 
gene products differ slightly in their oxygen-binding properties, suiting them for 
the different oxygenation conditions in the embryo, fetus, and adult. Each of the 
globin genes has its own set of regulatory proteins that are necessary to turn the 
gene on at the appropriate time. 

The globin genes are unusual in that, at the appropriate time and place, they 
are transcribed at extremely high rates: indeed, red blood cells are little more 
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Figure 7-59 Some ways in which the 
activity of gene regulatory proteins is 
regulated in eucaryotic cells. (A) The 

protein is synthesized only when needed 
and is rapidly degraded by proteolysis so 
that it does not accumulate. 

(B) Activation by ligand binding. 

(C) Activation by covalent modification. 
Phosphorylation is shown here, but many 
other modifications are possible (see 
Table 3-3, p. 186). (D) Formation of a 
complex between a DNA-binding protein 
and a separate protein with a 
transcription-activating domain. 

(E) Unmasking of an activation domain 
by the phosphorylation of an inhibitor 
protein. (F) Stimulation of nuclear entry 
by removal of an inhibitory protein that 
otherwise keeps the regulatory protein 
from entering the nucleus. (G) Release of 
a gene regulatory protein from a 
membrane bilayer by regulated 
proteolysis. 
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Figure 7-60 Model for the control of the human P-globin gene. The diagram shows some of the gene regulatory proteins 
that control expression of the gene during red blood cell development (see Figure 7-61 ). Some of the gene regulatory 
proteins shown, such as CP1 , are found in many types of cells, while others, such as GATA1 , are present in only a few types 
of cells—including red blood cells— and therefore are thought to contribute to the cell-type specificity of p-globin gene 
expression. As indicated by the double-headed arrows, several of the binding sites for GATA1 overlap those of other gene 
regulatory proteins; it is thought that by binding to these sites, GATA1 excludes binding of other proteins. Once bound to 
DNA, the gene regulatory proteins recruit chromatin remodeling complexes, histone modifying enzymes, the general 
transcription factors, Mediator, and RNA polymerase to the promoter. (Adapted from B. Emerson, in Gene Expression: 
General and Cell-Type Specific [M. Karin, ed.] ( pp. 1 16-161. Boston: Birkhauser, 1993.) 
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than bags of hemoglobin that was synthesized by precursor cells. To achieve this 
extraordinarily high level of transcription, the globin genes, in addition to their 
individual regulatory sequences, share a control region called the locus control 
region (LCR), which lies far upstream from the gene cluster and is needed for the 
proper expression of each gene in the cluster (see Figure 7-61A). The importance 
of the LCR can be seen in patients with a certain type of thalassemia, a severe 
inherited form of anemia. In these patients, the (3-globin locus has suffered a 
deletion that removes all or part of the LCR. Although the (3-globin and its nearby 
regulatory region are intact, the gene remains transcriptionally silent, even in 
erythroid cells. 

The way in which the LCR functions is not understood in detail, but it is 
known that the gene regulatory proteins that bind the LCR interact, through 
DNA looping, with proteins bound to the control regions of the globin genes 
they regulate. In this way, the proteins bound at the LCR help attract chromatin 
remodeling complexes, histone-modifying enzymes, and components of the 
transcription machinery that act in conjunction with the specific regulatory 
regions of each individual globin gene. In addition, the LCR includes a barrier 
sequence (see Figure 4-47) that prevents the spread of neighboring heterochro- 
matin into the P-globin locus, as discussed in Chapter 4. This dual feature dis- 
tinguishes the globin LCR from many other types of regulatory sequences in the 
human genome; however, the globin genes are not alone in having an LCR, as 
LCRs are also present upstream of other highly transcribed, cell-type-specific 
genes. We should probably think of LCRs, not as unique DNA elements with spe- 
cialized properties, but rather as especially powerful combinations of more fun- 
damental types of regulatory sequences. 



Figure 7-61 The cluster of p-like globin 
genes in humans. (A) The large 
chromosomal region shown spans 
1 00,000 nucleotide pairs and contains the 
five globin genes and a locus control 
region (LCR). (B) Changes in the 
expression of the p-like globin genes at 
various stages of human development. 
Each of the globin chains encoded by 
these genes combines with an a-globin 
chain to form the hemoglobin in red 
blood cells (see Figure 4-86). (A, after 
F. Grosveld, G.B. van Assendelft, 
D.R. Greaves and G. Kollias, Cell 
51:975-985, 1987. With permission 
from Elsevier.) 



Insulators Are DNA Sequences That Prevent Eucaryotic Gene 
Regulatory Proteins from Influencing Distant Genes 

All genes have control regions, which dictate at which times, under what condi- 
tions, and in what tissues the gene will be expressed. We have also seen that 
eucaryotic gene regulatory proteins can act across very long stretches of DNA. 
How, then, are control regions of different genes kept from interfering with one 
another? In other words, what keeps a gene regulatory protein bound on the 
control region of one gene from inappropriately influencing the transcription of 
adjacent genes? 

To avoid such cross-talk, several types of DNA elements function to com- 
partmentalize the genome into discrete regulatory domains. In Chapter 4 we 
discussed barrier sequences that prevent the spread of heterochromatin into 
genes that need to be expressed. A second type of DNA element, called an insu- 
lator, prevents enhancers from running amok and activating inappropriate 
genes (Figure 7-62). An insulator can apparently block the communication 
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Figure 7-62 Schematic diagram 
summarizing the properties of 
insulators and barrier sequences. 

Insulators directionally block the action 
of enhancers (left-hand side), and barrier 
sequences prevent the spread of 
heterochromatin (right-hand side). Thus 
gene B is properly regulated and gene B's 
enhancer is prevented from influencing 
the transcription of gene A. How barrier 
sequences are likely to function is 
depicted in Figure 4-47. It is not yet 
understood how insulators exert their 
effects; one possibility is that they serve 
as "decoys," tying up the transcriptional 
machinery and preventing it from 
interacting with an authentic enhancer. 
Another is that they anchor DNA to the 
nuclear envelope, thereby interfering 
with DNA looping between an enhancer 
and an inappropriate promoter. 
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between an enhancer and a promoter, but, to do so, it must be located between 
the two. Although proteins that bind to insulators have been identified, how they 
directionally neutralize enhancers is still a mystery. 

Even though their mechanisms are not understood in detail, the distribution 
of insulators and barrier sequences in a genome is thought to divide it into inde- 
pendent domains of gene regulation and chromatin structure (Figure 7-63). 
Aspects of this organization can be visualized by staining whole chromosomes 
for the specialized proteins that bind these DNA elements. 

Although chromosomes are organized into orderly domains that discourage 
enhancers from acting indiscriminately, there are special circumstances where an 
enhancer located one chromosome has been found to activate a gene located on 
a second chromosome. A remarkable example occurs in the regulation of the 
mammalian olfactory receptors. These are the proteins expressed by sensory neu- 
rons that allow mammals to discriminate accurately among many thousands of 
distinct smells (see p. 917). Humans, for example, have 350 olfactory receptor 
genes, and they are carefully regulated so that only one of these genes is expressed 
in each sensory neuron. The olfactory receptor genes are dispersed among many 
different chromosomes, but there is only a single enhancer for all of them. Once 
this enhancer activates a receptor gene by associating with its regulatory region, it 
remains stably associated thereby precluding activation of any of the other recep- 
tor genes. Although there is much we do not understand about this mechanism, it 
does indicate the extreme versatility of transcriptional regulation strategies. 



Figure 7-63 Localization ofaDrosophila 
insulator-binding protein on polytene 
chromosomes. A polytene chromosome 
(discussed in Chapter 4) was stained with 
propidium iodide (red) to show its 
banding patterns— with bands appearing 
bright red and interbands as dark gaps in 
the pattern (top). The positions on this 
polytene chromosome that are bound by 
a particular insulator protein (called 
BEAF) are stained bright green using 
antibodies directed against the protein 
(bottom). BEAF is preferentially localized 
to interband regions, reflecting its role in 
organizing chromosomes into structural, 
as well as functional, domains. For 
convenience, these two micrographs of 
the same polytene chromosome are 
arranged as mirror images. (Courtesy of 
UN Laemmli, from K. Zhao et al., Cell 
81:879-889, 1995. With permission from 
Elsevier.) 



Gene Switches Rapidly Evolve 

We have seen that the control regions of eucaryotic genes are often spread out 
over long stretches of DNA, whereas those of procaryotic genes are typically 
clustered around the start point of transcription. It seems likely that the close- 
packed arrangement of bacterial genetic switches developed from more 
extended forms of switches in response to the evolutionary pressure on bacteria 
to maintain a small genome size. This compression comes at a price, however, as 
it restricts the complexity and adaptability of the control device. In contrast, the 
extended form of eucaryotic control regions— with discrete regulatory modules 
separated by long stretches of spacer DNA— facilitates the reshuffling of regula- 
tory modules during evolution, both to create new regulatory circuits and to 
modify old ones. As we saw in Chapters 1 and 4, and we shall see again in Chap- 
ter 22, changes in gene regulation— rather than the acquisition of new genes- 
underlie much of the wide variety of life on Earth. Unraveling the history of how 
modern gene control regions have evolved presents a fascinating challenge to 
biologists, with many clues available in present-day genomes. 



Summary 

Gene regulatory proteins switch the transcription of individual genes on and off in 
cells. In procaryotes these proteins usually bind to specific DNA sequences close to the 
RNA polymerase start site and, depending on the nature of the regulatory protein and 
the precise location of its binding site relative to the start site, either activate or repress 
transcription of the gene. The flexibility of the DNA helix, however, also allows proteins 
bound at distant sites to affect the RNA polymerase at the promoter by the looping out 
of the intervening DNA. The regulation of higher eucaryotic genes is much more com- 
plex, commensurate with a larger genome size and the large variety of cell types that 
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are formed. A single eucaryotic gene is typically controlled by many gene regulatory 
proteins bound to sequences that can be thousands of nucleotide pairs from the pro- 
moter that directs transcription of the gene. Eucaryotic activators and repressors act by 
a wide variety of mechanisms—generally altering chromatin structure and controlling 
the assembly of the general transcription factors, Mediator, and RNA polymerase at the 
promoter. The time and place that each gene is transcribed, as well as its rates of tran- 
scription under different conditions, are determined by the spectrum of gene regula- 
tory proteins that bind to the regulatory region of the gene. 



THE MOLECULAR GENETIC MECHANISMS THAT 
CREATE SPECIALIZED CELL TYPES 

Although all cells must be able to switch genes on and off in response to changes 
in their environments, the cells of multicellular organisms have evolved this 
capacity to an extreme degree and in highly specialized ways to form an orga- 
nized array of differentiated cell types. In particular, once a cell in a multicellu- 
lar organism becomes committed to differentiate into a specific cell type, the 
cell maintains this choice through many subsequent cell generations, which 
means that it remembers the changes in gene expression involved in the choice. 
This phenomenon of cell memory is a prerequisite for the creation of organized 
tissues and for the maintenance of stably differentiated cell types. In contrast, 
other changes in gene expression in eucaryotes, as well as most in bacteria, are 
only transient. The tryptophan repressor, for example, switches off the trypto- 
phan genes in bacteria only in the presence of tryptophan; as soon as trypto- 
phan is removed from the medium, the genes are switched back on, and the 
descendants of the cell will have no memory that their ancestors had been 
exposed to tryptophan. Even in bacteria, however, a few types of changes in gene 
expression can be inherited stably. 

In this section we shall examine not only cell memory mechanisms, but also 
how gene regulatory devices can be combined to create logic circuits" through 
which cells integrate signals, keep time, remember events in their past, and 
adjust the levels of gene expression over entire chromosomes. We begin by con- 
sidering some of the best-understood genetic mechanisms of cell differentia- 
tion, which operate in bacterial and yeast cells. 



DNA Rearrangements Mediate Phase Variation in Bacteria 

We have seen that cell differentiation in higher eucaryotes usually occurs with- 
out detectable changes in DNA sequence. In some procaryotes, in contrast, a 
stably inherited pattern of gene regulation is achieved by DNA rearrangements 
that activate or inactivate specific genes. Since a change in DNA sequence will 
be copied faithfully during all subsequent DNA replication cycles, an altered 
state of gene activity will be inherited by all the progeny of the cell in which the 
rearrangement occurred. Some of these DNA rearrangements are, however, 
reversible so that occasional individuals can switch back to the original DNA 
configuration. The result is an alternating pattern of gene activity that can be 
detected by observations over long time periods and many generations. 

A well-studied example of this differentiation mechanism occurs in 
Salmonella bacteria and is known as phase variation. Although this mode of dif- 
ferentiation has no known counterpart in higher eucaryotes, it can have consid- 
erable impact on animals because disease-causing bacteria use it to evade 
detection by the immune system. The switch in Salmonella gene expression is 
brought about by the occasional inversion of a specific IQOG-nucleotide-pair 
piece of DNA. This change alters the expression of the cell-surface protein flag- 
ellin, for which the bacterium has two different genes (Figure 7-64). A site-spe- 
cific recombination enzyme catalyzes the inversion and thereby changes the ori- 
entation of a promoter that is bcatecl within the inverted DNA segment. With 
the promoter in one orientation, the bacteria svnthesize one tvne of flaeelHti: 
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Figure 7-64 Switching gene expression 
by DNA inversion in bacteria. 

Alternating transcription of two flagellin 
genes in a Salmonella bacterium is 
caused by a simple site-specific 
recombination event that inverts a small 
DNA segment containing a promoter. 
(A) In one orientation, the promoter 
activates transcription of the H2 flagellin 
gene as well as that of a repressor protein 
that blocks the expression of the 
HI flagellin gene. <B) When the promoter 
is inverted, it no longer turns on H2 or 
the repressor, and the HI gene, which is 
thereby released from repression, is 
expressed instead. The recombination 
mechanism is activated only rarely (about 
once in every 10 5 cell divisions). 
Therefore, the production of one or other 
flagellin tends to be faithfully inherited in 
each clone of cells. 



with the promoter in the other orientation, they synthesize the other type. 
Because inversions occur only rarely, entire clones of bacteria will have one type 
of flagellin or the other. 

Phase variation almost certainly evolved because it protects the bacterial 
population against the immune response of its vertebrate host. If the host makes 
antibodies against one type of flagellin, a few bacteria whose flagellin has been 
altered by gene inversion will still be able to survive and multiply. 

Bacteria isolated from the wild very often exhibit phase variation for one or 
more phenotypic traits. Standard laboratory strains of bacteria lose these "insta- 
bilities" over time, and underlying mechanisms have been studied in only a few 
cases. Not all involve DNA inversion. A bacterium that causes a common sexu- 
ally transmitted human disease (Neisseria gonorrhoeae), for example, avoids 
immune attack by means of a heritable change in its surface properties that 
arises from gene conversion (discussed in Chapter 5) rather than by inversion. 
This mechanism transfers DNA sequences from a library of silent "gene cas- 
settes" to a site in the genome where the genes are expressed; it has the advan- 
tage of creating many variants of the major bacterial surface protein. 



A Set of Gene Regulatory Proteins Determines Cell Type in a 
Budding Yeast 

Because they are so easy to grow and to manipulate genetically yeasts have 
served as model organisms for studying the mechanisms of gene control in 
eucaryotic cells. The common baker's yeast, Saccharomyces cerevisiae, has 
attracted special interest because of its ability to differentiate into three distinct 
cell types. S. cerevisiae is a single-celled eucaryote that exists in either a haploid 
or a diploid state. Diploid cells form by a process known as mating, in which two 
haploid cells fuse. In order for two haploid cells to mate, they must differ in mat- 
ing type (sex). In S. cerevisiae there are two mating types, a and a, which are spe- 
cialized for mating with each other. Each produces a specific diffusible signaling 
molecule (mating factor) and a specific cell-surface receptor protein. These 
jointly enable a cell to recognize and be recognized by its opposite cell type, with 
which it then fuses. The resulting diploid cells, called a/ a, are distinct from 
either parent: they are unable to mate but can form spores (sporulate) when 
they run out of food, giving rise to haploid cells by the process of meiosis (dis- 
cussed in Chapter 21). 

The mechanisms by which these three cell types are established and main- 
tained illustrate several of the strategies we have discussed for changing the pat- 
tern of gene expression. The mating type of the haploid cell is determined by a 
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single locus, the mating-type (Mat) locus, which in an a- type cell encodes a sin- 
gle gene regulatory protein, Matal, and in an a cell encodes two gene regulatory 
proteins, Matal and Mata2. The Matal protein has no effect in the a-type hap- 
loid cell that produces it, but becomes important later in the diploid cell that 
results from mating. In contrast, the Mata2 protein acts in the a cell as a tran- 
scriptional repressor that turns off the a-specific genes, while the Matal protein 
acts as a transcriptional activator that turns on the a-specific genes. Once cells 
of the two mating types have fused, the combination of the Matal and Mata2 
regulatory proteins generates a completely new pattern of gene expression, 
unlike that of either parent cell. Figure 7-65 illustrates the mechanism by which 
the mating-type-specific genes are expressed in different patterns in the three 
cell types. This was among the first examples of combinatorial gene control to be 
identified, and it remains one of the best understood at the molecular level. 

Although in most laboratory strains of S. cerevisiae, the a and a cell types are 
stably maintained through many cell divisions, some strains isolated from the 
wild can switch repeatedly between the a and a cell types by a mechanism of 
gene rearrangement whose effects are reminiscent of the DNA rearrangements in 
N. gonorrhoeae, although the exact mechanism seems to be peculiar to yeast. On 
either side of the Mat locus in the yeast chromosome, there is a silent locus 
encoding the mating-type gene regulatory proteins: the silent locus on one side 
encodes Matal and Mata2; the silent locus on the other side encodes Matal. In 
approximately every other cell division, the active gene in the Mat locus is 
excised and replaced by a newly synthesized copy of the silent locus determining 
the opposite mating type. Because the change removes one gene from the active 
"slot" and replaces it by another, this mechanism is called the cassette mecha- 
nism. The change is reversible because, although the original gene at the Mat 
locus is discarded, a silent copy remains in the genome. New DNA copies made 
from the silent genes function as disposable cassettes that will be inserted in 
alternation into the Ma? locus, which serves as the "playing head" (Figure 7-66). 

The silent cassettes are packaged into a specialized form of chromatin and 
maintained in a transcriptionally inactive form. The study of these cassettes- 
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Figure 7-65 Control of cell type in 
yeasts. Three gene regulatory proteins 
(Matal, Mata2, and Matal) produced by 
the Mat locus determine yeast cell type. 
Different sets of genes are transcribed in 
haploid cells of type a, in haploid cells of 
type a, and in diploid cells (type a/a). The 
haploid cells express a set of haploid- 
specific genes (hSG) and either a set of 
a-specific genes (aSG) or a set of a- 
specific genes (aSG). The diploid cells 
express none of these genes. The Mat 
regulatory proteins control many target 
genes in each type of cell by binding, in 
various combinations, to specific 
regulatory sequences upstream of these 
genes. Note that the Matal protein is a 
gene activator protein, whereas the 
Mata2 protein is a gene repressor 
protein. Both work in combination with a 
gene regulatory protein called Mcm1 that 
is present in all three cell types. In the 
diploid cell type, Mata2 and Matal form 
a heterodimer (shown in detail in Figure 
7-21 ) that turns off a set of genes 
(including the gene encoding the Matal 
activator protein) different from that 
turned off by the Mata2 and Mcml 
proteins. This relatively simple system of 
gene regulatory proteins is an example of 
combinatorial control of qene expression. 
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which has been ongoing for nearly 40 years— has provided many of the key 
insights into the role of chromatin structure in gene regulation. 



Figure 7-66 Cassette model of yeast 
mating-type switching. Cassette switching 
occurs by a gene-conversion process that 
involves a specialized enzyme (the 
HO endonuclease) that makes a double- 
stranded cut at a specific DNA sequence in 
the Mot locus. The DNA near the cut is then 
excised and replaced by a copy of the 
silent cassette of opposite mating type. 
The mechanism of this specialized form of 
gene conversion is similar to the repair of 
double-stranded breaks discussed in 
Chapter 5 (pp. 308-309). 



Two Proteins That Repress Each Other's Synthesis Determine the 
Heritable State of Bacteriophage Lambda 

As we saw at the beginning of the present chapter, the nucleus of a single differ- 
entiated cell contains all the genetic information needed to construct a whole 
vertebrate or plant. This observation eliminates the possibility that an irre- 
versible change in DNA sequence is a major mechanism of cell differentiation in 
these higher eucaryotes although such changes do occur in lymphocyte differ- 
entiation (discussed in Chapter 25). Reversible DNA sequence changes, resem- 
bling those just described for Salmonella and yeasts, in principle could still be 
responsible for some of the inherited changes in gene expression observed in 
higher organisms, but there is currently no evidence that such mechanisms are 
widely used. 

Other mechanisms that we have already touched upon in this chapter, how- 
ever, are also capable of producing patterns of gene regulation that can be inher- 
ited by subsequent cell generations. One of the simplest examples is found in 
the bacterial virus (bacteriophage) lambda where a switch causes the virus to 
flip-flop between two stable self-maintaining states. This type of switch can be 
viewed as a prototype for similar, but more complex, switches that operate in the 
development of higher eucaryotes. 

We mentioned earlier that bacteriophage lambda can in favorable condi- 
tions become integrated into the E. coli cell DNA, to be replicated automatically 
each time the bacterium divides. Alternatively, the virus can multiply in the 
cytoplasm, killing its host (see Figure 5-78). Proteins encoded by the bacterio- 
phage genome mediate the switch between these two states. The genome con- 
tains a total of about 50 genes, which are transcribed in very different patterns 
in the two states. A virus destined to integrate, for example, must produce the 
lambda integrase protein, which is needed to insert the lambda DNA into the 
bacterial chromosome, but must repress the production of the viral proteins 
responsible for virus multiplication. Once one transcriptional pattern or the 
other has been established, it is stably maintained. 

At the heart of this complex gene regulatory switching mechanism are two 
gene regulatory proteins synthesized by the virus: the lambda repressor pro- 
tein (cl protein), which we have already encountered, and the Cro protein. 
These proteins repress each other's synthesis, an arrangement that gives rise to 
just two stable states (Figure 7-67). In state 1 (the prophage state) the lambda 
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stable state 1: the prophage state 
lambda repressor protein is made 



stable state 2: the lytic state 
lambda Cro protein is made 





repressor occupies the operator, blocking the synthesis of Cro and also activat- 
ing its own synthesis. In state 2 (the lytic stale) the Cro protein occupies a differ- 
ent site in the operator, blocking the synthesis of repressor but allowing its own 
synthesis. In the prophage state most of the DNA of the stably integrated bacte- 
riophage is not transcribed; in the lytic state, this DNA is extensively transcribed, 
replicated, packaged into new bacteriophage, and released by host cell lysis. 

When the host bacteria are growing well, an infecting virus tends to adopt 
state 1 , allowing the DNA of the virus to multiply along with the host chromo- 
some. When the host cell is damaged, an integrated virus converts from state 1 
to state 2 in order to multiply in the cell cytoplasm and make a quick exit. This 
conversion is triggered by the host response to DNA damage, which inactivates 
the repressor protein. In the absence of such interference, however, the lambda 
repressor both turns off production of the Cro protein and turns on its own syn- 
thesis, and this positive feedback loop helps to maintain the prophage state. 



Figure 7-67 A simplified version of the 
regulatory system that determines the 
mode of growth of bacteriophage 
lambda in the E. coli host cell. In stable 
state 1 (the prophage state) the 
bacteriophage synthesizes a repressor 
protein, which activates its own synthesis 
and turns off the synthesis of several 
other bacteriophage proteins, including 
the Cro protein. In state 2 (the lytic state) 
the bacteriophage synthesizes the Cro 
protein, which turns off the synthesis of 
the repressor protein, so that many 
bacteriophage proteins are made and the 
viral DNA replicates freely in the E. coli 
cell, eventually producing many new 
bacteriophage particles and killing the 
cell. This example shows how two gene 
regulatory proteins can be combined in a 
circuit to produce two heritable states. 
Both the lambda repressor and the Cro 
protein recognize the operator through a 
helix-turn-helix motif (see Figure 7-11). 



Simple Gene Regulatory Circuits Can Be Used to Make Memory 
Devices 

Positive feedback loops provide a simple general strategy for cell memory— that 
is, for the establishment and maintenance of heritable patterns of gene tran- 
scription. Figure 7-68 shows the basic principle, stripped to its barest essentials. 
Eucaryotic cells use many variations of this simple strategy Several gene regula- 
tory proteins that are involved in establishing the Dmsaphifa body plan (dis- 
cussed in Chapter 22), for example, stimulate their own transcription, thereby 
creating a positive feedback loop thai promotes tbeir continued synthesis; at the 
same time many of these proteins repress the transcription of genes encoding 
other important gene regulatory proteins. In this way a few gene regulatory pro- 
teins that reciprocally affect one another's synthesis and activities can specify a 
sophisticated pattern of inherited behavior. 
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Figure 7-68 Schematic diagram 
showing how a positive feedback loop 
can create cell memory. Protein A is a 
gene regulatory protein that activates its 
own transcription. All of the descendants 
of the original cell will therefore 
"remember" that the progenitor cell had 
experienced a transient signal that 
initiated the production of the orotein. 
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Figure 7-69 Common types of network motifs in transcriptional circuits. 

A and B represent gene regulatory proteins, arrows indicate positive 
transcriptional control, and lines with bars depict negative transcriptional 
control. More detailed descriptions of positive feedback loops and flip-flop 
devices are given in Figures 7-70 and 7-71, respectively. In feed-forward 
loops, A and B represent regulatory proteins that both activate the 
transcription of a target gene, Z. 

Transcription Circuits Allow the Cell to Carry Out Logic 
Operations 

Simple gene regulatory switches can be combined to create all sorts of control 
devices, just as simple electronic switching elements in a computer can be linked 
to perform many types of operations. The analysis of gene regulatory circuits has 
revealed that certain simple types of arrangements are found over and over again 
in cells from widely different species. For example, positive and negative feed- 
back loops are especially common in all cells (Figure 7-69). As we have seen, the 
former provides a simple memory device; the latter is often used to keep the 
expression of a gene close to a standard level regardless of variations in bio- 
chemical conditions inside a cell. Suppose, for example, that a transcriptional 
repressor protein binds to the regulatory region of its own gene and exerts a 
strong negative feedback, such that transcription occurs at a very low rate when 
the concentration of repressor protein is above some critical value (determined 
by its affinity for its DNA binding site) , and at a very high rate when it is below the 
critical value. The concentration of the protein will then be held close to the crit- 
ical value, since any circumstance that causes a fall below that value will lead to 
a steep increase in synthesis, and any rise above that value will cause synthesis 
to switch off. Such adjustments will, however, take time, so that an abrupt 
change of conditions will cause a disturbance of gene expression that is strong 
but transient. As we discuss in Chapter 15, the negative feedback system can thus 
function as a detector of sudden change. Alternatively, if there is a delay in the 
feedback loop, the result may be spontaneous oscillations in the expression of 
the gene (see Figure 15-28). The quantitative details of the negative feedback 
loop determine which of these possible behaviors will occur. 

With two or more genes, the possible range of control circuits and circuit 
behaviors rapidly becomes more complex. Bacteriophage lambda, as we have 
seen, exemplifies a common type of two-gene circuit that can flip-flop between 
expression of one gene and expression of the other. Another common circuit 
arrangement is called afeed-forwardloop (see Figure 7-69); among other things, 
this can serve as a filter, responding to input signals that are prolonged but dis- 
regarding those that are brief (Figure 7-70) . A cell can use these various network 
motifs as miniature logic devices to process information in surprisingly sophis- 
ticated ways. 

The simple types of devices just illustrated are combined in a typical eucary- 
otic cell to create exceedingly complex circuits (Figure 7-71). Each cell in a 
developing multicellular organism is equipped with this control machinery, and 
must, in effect, use the intricate system of interlocking transcriptional switches 
to compute how it should behave at each time point in response to the many dif- 
ferent past and present inputs received. We are only beginning to understand 



INPUT 



ri 
li 



Z (OUTPUT) 
(A) 



INPUT 

time - 



INPUT 

_t~l 



OUTPUT 



time ■ 



OUTPUT 








positive 

feedback 

loop 



negative 

feedback 

loop 



flip-flop 
device 




feed-forward 
loop 



(B) 



time ■ 



(C) 



time ■ 



Figure 7-70 How a feed-forward loop can 
measure the duration of a signal. (A) In 

this theoretical example the gene activator 
proteins A and B are both required for 
transcription of Z, and A becomes active 
only when an input signal is present. (B) If 
the input signal to A is brief, A does not 
stay active long enough for B to 
accumulate, and the Z gene is not 
transcribed. (C) If the signal to A persists, B 
accumulates, A remains active, and Z is 
transcribed. This arrangement allows the 
cell to ignore rapid fluctuations of the input 
signal and respond only to persistent levels. 
This strategy could be used, for example, to 
distinguish between random noise and a 
true signal. 

The behavior shown here was computed 
for one particular set of parameter values 
describing the quantitative properties of A, 
B, and Z and their syntheses. With different 
values of these parameters, feed-forward 
loops can in principle perform other types 
of "calculations." Many feed-forward loops 
have been discovered in cells, and 
theoretical analysis helps researchers to 
appreciate and subsequently test the 
different ways in which they may function. 
(Adapted from S.S. Shen-Orr, R. Milo, 

S. Mangan and U. Alon, Nat. Genet. 
31:64-68, 2002. With permission from 
Macmillan Publishers Ltd.) 
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how to study such complex intracellular control networks. Indeed, without 
quantitative information far more precise and complete than we yet have, it is 
impossible to predict the behavior of a system such as that shown in Figure 7-71: 
the circuit diagram by itself is not enough. 



Synthetic Biology Creates New Devices from Existing Biological 
Parts 

Our discussion has focused on naturally occurring transcriptional circuits, but it 
is also instructive to design and construct artificial circuits in the laboratory and 
introduce them into cells to examine their behavior. Figure 7-72 shows, for 
example, how an engineered bacterial cell can switch between three states in a 
prescribed order, thus functioning as an oscillator or simple clock. The con- 
struction of such new devices from existing parts is often termed synthetic biol- 
ogy. Scientists use this approach to test whether they truly understand the prop- 
erties of the component parts; if so, they should be able to combine these pans 
in novel ways and accurately predict the characteristics of the new device. The 
fact that these predictions usually fail illustrates how far we are from truly 
understanding the detailed workings of the cell. There are many large gaps in 
our knowledge that will require skillful application of the quantitative 
approaches of the physical sciences to complex biological systems. 



Figure 7-71 The exceedingly complex 
gene circuit that specifies a portion of 
the developing sea urchin embryo. Each 
colored small box represents a different 
gene. Those in yellow code for gene 
regulatory proteins and those in green 
and blue code for proteins that give cells 
of the mesoderm and endoderm, 
respectively, their specialized 
characteristics. Genes depicted in gray 
are largely active in the mother and 
provide the egg with cues needed for 
proper development. Arrows depict 
instances in which a gene regulatory 
protein activates the transcription of 
another gene. Lines ending in bars 
indicate examples of gene repression. 



Circadian Clocks Are Based on Feedback Loops in Gene 
Regulation 

Life on Earth evolved in the presence of a daily cycle of day and night, and many 
present-day organisms (ranging from archaea to plants to humans) have come 
to possess an internal rhythm that dictates different behaviors at different times 
of day. These behaviors range from the cyclical change in metabolic enzyme 
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activities of a fungus to the elaborate sleep-wake cycles of humans. The internal 
oscillators that control such diurnal rhythms are called circadian clocks. 

By carrying its own circadian clock, an organism can anticipate the regular 
daily changes in its environment and take appropriate action in advance. Of 
course, the internal clock cannot be perfectly accurate, and so it must be capa- 
ble of being reset by external cues such as the light of day. Thus, circadian clocks 
keep running even when the environmental cues (changes in light and dark) are 
removed, but the period of this free-running rhythm is generally a little less or a 
little more than 24 hours. External signals indicating the time of day cause small 
adjustments in the running of the clock, so as to keep the organism in synchrony 
with its environment. Following more drastic shifts, circadian cycles become 
gradually reset (entrained) by the new cycle of light and dark, as anyone who has 
experienced jet lag can attest. 

We might expect that the circadian clock in a creature such as a human would 
itself be a complex multicellular device, with different groups of cells responsible 
for different parts of the oscillation mechanism. Remarkably, however, it turns 
out that in almost all organisms, including humans, the timekeepers are individ- 
ual cells. Thus, a clock that operates in each member of a specialized group of 
brain cells (the SCN cells in the suprachiasmatic nucleus of the hypothalamus) 
controls our diurnal cycles of sleeping and waking, body temperature, and hor- 
mone release. Even if these cells are removed from the brain and dispersed in a 
culture dish, they will continue to oscillate individually, showing a cyclic pattern 
of gene expression with a period of approximately 24 hours. In the intact body, 
the SCN cells receive neural cues from the retina, entraining them to the daily 
cycle of light and dark, and they send information about the time of day to 
another brain area, the pineal gland, which relays the time signal to the rest of the 
body by releasing the hormone melatonin in time with the clock. 

Although the SCN cells have a central role as timekeepers in mammals, they 
are not the only cells in the mammalian body that have an internal circadian 
rhythm or an ability to reset it in response to light. Similarly in Drosophila, many 
different types of cells, including those of the thorax, abdomen, antenna, leg, 
wing, and testis all continue a circadian cycle when they have been dissected 
away from the rest of the fly. The clocks in these isolated tissues, like those in the 
SCN cells, can be reset by externally imposed light and dark cycles. 

The working of circadian clocks, therefore, is a fundamental problem in cell 
biology. Although we do not yet understand all the details, studies in a wide vari- 
ety of organisms have revealed many of the basic principles and molecular com- 
ponents. For animals, much of what we know has come from searches in 
Drosophila for mutations that make the fly's circadian clock run fast, or slow, or 
not at all; and this work has led to the discovery that many of the same compo- 
nents are involved in the circadian clock of mammals. 



Figure 7-72 A simple gene oscillator 
or "clock" designed in the laboratory. 

(A) Recombinant DNA techniques were 
used to make three artificial genes, 
each coding for a different bacterial 
repressor protein, and each controlled 
by the product of another gene in the 
set, so as to create a regulatory circuit 
as shown. These repressors (denoted A, 
B, and C in the figure) are the Lac 
repressor (see Figure 7-39), theTet 
repressor, which regulates genes in 
response to tetracycline, and the 
Lambda repressor (see Figure 7-67). 
When introduced into a bacterial cell, 
the three genes form a delayed 
negative feedback circuit: the product 
of gene A, for example, acts via genes 
B and C to indirectly inhibit its own 
expression. The delayed negative 
feedback gives rise to oscillations. 

(B) Computer prediction of the 
oscillatory behavior. The cell cycles 
repetitively through a series of states, 
expressing A, then B, then C, then A 
again, and so on, as each gene product 
in turn escapes from inhibition by the 
previous one and represses the next. 

(C) Actual oscillations observed in a 
cell containing the three artificial 
genes in (A), demonstrated with a 
fluorescent reporter of the expression 
of one of these genes. The increasing 
amplitude of the fluorescence signal 
reflects the growth of the bacterial cell. 
(Adapted from M.B. Elowitz and 

S. Leibler, Nature 403:335-338, 2000. 
With permission from Macmillan 
Publishers Ltd.) 
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The mechanism of the clock in Drosophila is briefly outlined in Figure 7-73. 
At the heart of the oscillator is a transcriptional feedback loop that has a time 
delay built into it: accumulation of certain key gene products switches off the 
transcription of their genes, but with a delay, so that— crudely speaking— the 
cell oscillates between a state in which the products are present and transcrip- 
tion is switched off, and one in which the products are absent and transcription 
is switched on. 

Despite the relative simplicity of the basic principle behind circadian clocks, 
the details are complex. One reason for this complexity is that clocks must be 
buffered against changes in temperature, which typically speed up or slow down 
macromolecular association. They must also run accurately but be capable of 
being reset. Although it is not yet understood how biological clocks run at a con- 
stant speed despite changes in temperature, the mechanism for resetting the 
Drosophila clock is the light-induced destruction of one of the key gene regula- 
tory proteins (see Figure 7-73). 



Figure 7-73 Simplified outline of the 
mechanism of the circadian clock in 
Drosophila cells. A central feature of the 
clock is the periodic accumulation and 
decay of two gene regulatory proteins, 
Tim (short for timeless, based on the 
phenotype of a gene mutation) and Per 
(short for period). The mRNAs encoding 
these proteins are translated in the 
cytosol, and, when each protein has 
accumulated to critical levels, they form a 
heterodimer. After a time delay, the 
heterodimer dissociates and Tim and Per 
are transported into the nucleus, where 
they regulate a number of gene products 
that mediate effects of the clock. Once in 
the nucleus, Per also represses the Tim 
and Per genes, creating a feedback 
system that causes the levels of Tim and 
Per to fall. In addition to this 
transcriptional feedback, the clock 
depends on a set of other proteins. For 
example, the controlled degradation of 
Per indicated in the diagram imposes 
delays in the periodic accumulation of 
Tim and Per, which are crucial to the 
functioning of the clock. Steps at which 
specific delays are imposed are shown 
in red. 

Entrainment (or resetting) of the clock 
occurs in response to new light-dark 
cycles. Although most Drosophila cells do 
not have true photoreceptors, light is 
sensed by intracellular flavoproteins, also 
called cryptochromes. In the presence of 
light, these proteins associate with the 
Tim protein and cause its degradation, 
thereby resetting the clock. (Adapted 
from J.C Dunlap, Science 31 1:184-186, 
2006. With permission from AAAS.) 



A Single Gene Regulatory Protein Can Coordinate the Expression 
of a Set of Genes 

Cells need to be able to switch genes on and off individually but they also need 
to coordinate the expression of large groups of different genes. For example, 
when a quiescent eucaryotic cell receives a signal to divide, many hitherto unex- 
pressed genes are turned on together to set in motion the events that lead even- 
tually to cell division (discussed in Chapter 1 7) . One way bacteria coordinate the 
expression of a set of genes is to cluster them together in an operon under con- 
trol of a single promoter (see Figure 7-34). In eucaryotes, however, each gene is 
transcribed from a separate promoter. 

How, then, do eucaryotes coordinate gene expression? This is an especially 
important question because, as we have seen, most eucaryotic gene regulatory 
proteins act as part of a regulatory protein committee, all of whose members are 
necessary to express rhe gene in ihe right cell at the right time, in response to 
the proper signals, and to the proper level How, then, can a eucaryotic cell 
rapidly and decisively switch whole groups of genes on or off? 

The answer is that even though control of gene expression is combinatorial, 
the effect of a single gene regulatory protein can still be decisive in switching any 
particular gene on or off, simply by completing the combination needed to max 
imally activate or repress that gene. This situation is analogous to dialing in the 
final number of a combination lock: the lock will spring open with only this sim- 
ple addition if ah of the other numbers have been previously entered. Moreover, 
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the same number can complete the combination for many different locks. Anal- 
ogously, the addition of a particular protein can turn on many different genes. 

An example in humans is the control of gene expression by the glucocorti- 
coid receptor protein. To bind to regulatory sites in DNA, this gene regulatory 
protein must first form a complex with a molecule of a glucocorticoid steroid 
hormone, such as Cortisol (see Figure 15-13). The body releases this hormone 
during times of starvation and intense physical activity, and among its other 
activities, it stimulates liver cells to increase the production of glucose from 
amino acids and other small molecules. To respond in this way, liver cells 
increase the expression of many different genes that code for metabolic 
enzymes and other products. Although these genes all have different and com- 
plex control regions, their maximal expression depends on the binding of the 
hormone-glucocorticoid receptor complex to a regulatory site in the DNA of 
each gene. When the body has recovered and the hormone is no longer present, 
the expression of each of these genes drops to its normal level in the liver. In this 
way a single gene regulatory protein can control the expression of many differ- 
ent genes (Figure 7-74). 

The effects of the glucocorticoid receptor are not confined to cells of the 
liver. In other cell types, activation of this gene regulatory protein by hormone 
also causes changes in the expression levels of many genes; the genes affected, 
however, are often different from those affected in liver cells. As we have seen, 
each cell type has an individualized set of gene regulatory proteins, and because 
of combinatorial control, these critically influence the action of the glucocorti- 
coid receptor. Because the receptor is able to assemble with many different sets 
of cell- type-specific gene regulatory proteins, switching it on with hormone pro- 
duces a different spectrum of effects in each cell type. 



Expression of a Critical Gene Regulatory Protein Can Trigger the 
Expression of a Whole Battery of Downstream Genes 

The ability to switch many genes on or off coordinately is important not only in 
the day-to-day regulation of cell function. It is also the means by which eucary- 
otic cells differentiate into specialized cell types during embryonic develop- 
ment. The development of muscle cells provides a striking example. 
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Figure 7-74 A single gene regulatory 
protein can coordinate the expression 
of several different genes. The action of 
the glucocorticoid receptor is illustrated 
schematically. On the left is a series of 
genes, each of which has various gene 
activator proteins bound to its regulatory 
region. However, these bound proteins 
are not sufficient on their own to fully 
activate transcription. On the right is 
shown the effect of adding an additional 
gene regulatory protein — the 
glucocorticoid receptor in a complex 
with glucocorticoid hormone — that can 
bind to the regulatory region of each 
gene. The glucocorticoid receptor 
completes the combination of gene 
regulatory proteins required for maximal 
initiation of transcription, and the genes 
are now switched on as a set. In the 
absence of the hormone, the 
glucocorticoid receptor is unavailable to 
bind to DNA. 

In addition to activating gene 
expression, the hormone-bound form of 
the glucocorticoid receptor represses 
transcription of certain genes, depending 
on the gene regulatory proteins already 
present on their control regions. The 
effect of the glucocorticoid receptor on 
any given gene therefore depends upon 
the type of cell, the gene regulatory 
proteins contained within it, and the 
regulatory region of the gene. The 
structure of the DNA-binding portion of 
the glucocorticoid receptor is shown in 
Figure 7-16. 
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As described in Chapter 16, a mammalian skeletal muscle cell is a highly dis- 
tinctive giant cell, formed by the fusion of many muscle precursor cells called 
myoblasts, and therefore containing many nuclei. The mature muscle cell syn- 
thesizes a large number of characteristic proteins, including specific types of 
actin, myosin, tropomyosin, and troponin (all part of the contractile apparatus), 
creatine phosphokinase (for the specialized metabolism of muscle cells), and 
acetylcholine receptors (to make the membrane sensitive to nerve stimulation). 
In proliferating myoblasts, these muscle-specific proteins and their mRNAs are 
absent or are present in very low concentrations. As myoblasts begin to fuse with 
one another, the corresponding genes are all switched on coordinately as part of 
a general transformation of the pattern of gene expression. 

This entire program of muscle differentiation can be triggered to cultured 
skin fibroblasts and certain other cell types by introducing any one of a family of 
helix-loop-helix proteins—the so-called myogenic proteins (MyoD, MyfS, 
MyoG, and Mrf4j— that are normally expressed only in muscle cells (Figure 
7-75A). Binding sites for these regulatory proteins are present in the regulatory 
DNA sequences adjacent to many muscle-specific genes, and the myogenic pro- 
teins Ihereby directly activate the transcription of these genes. In addition, the 
myogenic proteins stimulate their own transcription as well as that of various 
other gene regulatory proteins involved in muscle development, creating an 
elaborate series of positive feedback and feed-forward loops that amplify and 
maintain the muscle developmental program, even after the initiating signal has 
disappeared (Figure 7-75B; see also Chapter 22). 

It is probable that those cell rypes that are converted to muscle cells by the 
addition of myogenic proteins have already accumulated a number of gene reg- 
ulaiory proteins thai can cooperate with the myogenic proteins to switch on 
muscle-specific genes. Other cell types fail to be converted to muscle by myo- 
genin or its relatives; these cells presumably have not accumulated the other 
gene regulatory proteins required. 

The conversion of one cell type (fibroblast) to another (skeletal muscle) by a 
single gene regulatory protein reemphasizes one of the most important princi- 
ples discussed in this chapter: differences in gene expression can produce dra- 
matic differences between cell types— in size, shape, chemistry, and function. 

Combinatorial Gene Control Creates Many Different Cell Types in 
Eucaryotes 

We have already discussed how multiple gene regulatory proteins can act in 
combination to regulate the expression of an individual gen a But, as die exam- 
ple of the myogenic proteins shows, combinatorial gene control means more 
than this: not only does each gene respond to many gene regulatory proteins 
that control it. bui each regulatory protein contributes to the control of many 
genes. Moreover, although some gene regulatory proteins are specific to a single 
cell type, most are switched on in a variety of cell types, at several sites in the 
body and at several times in development, this point is illustrated schematically 
in Figure 7-76, which shows how combinatorial gene control makes it possible 
to generate a great deal of biological complexity even with relatively few gene 
regulatory proteins. 

With combinatorial control, a given gene regulatory protein does not neces- 
sarily have a single, simply definable function as commander of a particular bat- 
tery of genes or specifier of a particular cell type. Rather, gene regulatory pro- 
teins can be likened to the words of a language: they are used with different 
meanings in a variety of contexts and rarely alone; it is the well-chosen combi- 
nation that conveys the information that specifies a gene regulatory event. 

One requirement of combinatorial control is that many gene regulatory pro- 
teins must be able to work together to influence the final rate of transcription. 
Experiments demonstrate that even unrelated gene regulatory proteins from 
widely different eucaryonc species can cooperate when introduced into the same 
cell. This situation reflects the high degree of conservation of the transcription 
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Figure 7-75 Role of the myogenic 
regulatory proteins in muscle 
development. (A) The effect of expressing 
the MyoD protein in fibroblasts. As shown 
in this immunofluorescence micrograph, 
fibroblasts from the skin of a chick embryo 
have been converted to muscle cells by the 
experimentally induced expression of the 
MyoD gene. The fibroblasts that have been 
induced to express the MyoD gene have 
fused to form elongated multinucleate 
muscle-like cells, which are stained green 
with an antibody that detects a muscle- 
specific protein. Fibroblasts that do not 
express the MyoD gene are barely visible in 
the background. (B) Simplified scheme 
showing some of the gene regulatory 
proteins involved in skeletal muscle 
development. External signals result in the 
synthesis of the four closely related 
myogenic gene regulatory proteins, MyoD, 
MyfS, MyoG, and Mrf4. These gene 
regulatory proteins activate their own as 
well as each other's synthesis in a complex 
series of feedback loops, only some of 
which are shown in the figure. These 
proteins in turn directly activate 
transcription of muscle structural genes as 
well as the Mef2 gene, which encodes an 
additional gene regulatory protein. Mef2 
acts in combination with the myogenic 
proteins in a feed-forward loop to further 
activate the transcription of muscle 
structural genes, as well as forming an 
additional positive feedback loop that 
helps to maintain transcription of the 
myogenic genes. (A, courtesy of Stephen 
Tapscott and Harold Weintraub; 
B, adapted from J.D. Molkentin and 
E.N. Olson, Proc Natl Acad. Sci. U.S.A. 
93:9366-9373, 1996. With permission from 
National Academy of Sciences.) 
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machinery. It seems that the multifunctional, combinatorial mode of action of 
gene regulatory proteins has put a tight constraint on their evolution: they must 
interlock with other gene regulatory proteins, the general transcription factors, 
Mediator, RNA polymerase, and the chromatin-modifying enzymes. 

An important consequence of combinatorial gene control is that the effect 
of adding a new gene regulatory protein to a cell will depend on the cell's past 
history, since this history will determine which gene regulatory proteins are 
already present. Thus, during development a cell can accumulate a series of 
gene regulatory proteins that need not initially alter gene expression. The addi- 
tion of the final members of the requisite combination of gene regulatory pro- 
teins completes the regulatory message, and can lead to large changes in gene 
expression. Such a scheme, as we have seen, helps to explain how the addition 
of a single regulatory protein to a fibroblast can produce the dramatic transfor- 
mation of the fibroblast into a muscle cell It also can account for the important 
difference, discussed in Chapter 22, between the process of cell determination— 
where a cell becomes committed to a particular developmental fate — and the 
process of cell differentiation, in which a committed cell expresses its specialized 
character. 



A Single Gene Regulatory Protein Can Trigger the Formation of an 
Entire Organ 

We have seen that even though combinatorial control is the norm for eucaryotic 
genes, a single gene regulatory protein, if it completes the appropriate combi- 
nation, can be decisive in switching a whole set of genes on or off, and we have 
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Figure 7-76 The importance of 
combinatorial gene control for 
development. Combinations of a few 
gene regulatory proteins can generate 
many cell types during development. In 
this simple, idealized scheme a "decision" 
to make one of a pair of different gene 
regulatory proteins (shown as numbered 
circles) is made after each cell division. 
Sensing its relative position in the 
embryo, the daughter cell toward the left 
side of the embryo is always induced to 
synthesize the even-numbered protein of 
each pair, while the daughter cell toward 
the right side of the embryo is induced to 
synthesize the odd-numbered protein. 
The production of each gene regulatory 
protein is assumed to be self- 
perpetuating once it has become 
initiated (see Figure 7-68). In this way, 
through cell memory, the final 
combinatorial specification is built up 
step by step. In this purely hypothetical 
example, five different gene regulatory 
proteins have created eight final cell 
types (G-N). 
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seen how this can convert one cell type into another. A dramatic extension of the 
principle comes from studies of eye development in Drosophifa mice, and 
humans. Here, a gene regulatory protein, called Ey {shorl for Eyeless) in flies and 
Paxfi in vertebrates, is crucial. When expressed in the proper context; Ey can trig- 
ger the formation of not just a single cell type but a whole organ [an eye), com- 
posed of different types of cells, all properly organized in three-dimensional 
space. 

The most striking evidence for the role of Ey comes from experiments in 
fruit flies in which the £y gene is artificially expressed early in development in 
groups of cells that normally will go on to form leg parts. This abnormal gene 
expression causes eyes to develop in the legs (Figure 7-77). 

The Drosophila eye is composed of thousands of cells, and the question of 
how a regulatory protein coordinates the construction of a whole organ is a cen- 
tral topic in tlcrelopmentat biology. As discussed in Chapter 22, it involves 
cell-cell interactions as well as intracellular gene regulatory proteins. Here, we 
note that Ey directly controls the expression of many other "genes by binding to 
their regulatory regions. Some of the genes controlled by Ey themselves code for 
gene regulatory proleins thai, in turn, control the expression of other genes. 
Moreover, some of these regulatory gene products act back on Ey itself to create 
a positive feedback loop that ensures the continued synthesis of the Ey protein 
as the cells divide and further differentiate (Figure 7-78). In this way, die action 
of just one regulatory protein can permanently turn on a cascade of gene regu- 
latory proteins and cell-cell Interaction mechanisms, whose actions result in an 
organized group of many different types of cells. One can begin to imagine how, 
by repeated applications of this principle, a complex organism is assembled 
piece by piece. 



Figure 7-77 Expression of the Drosophila 
Ey gene in precursor cells of the leg 
triggers the development of an eye on 
the leg. (A) Simplified diagrams showing 
the result when a fruit fly larva contains 
either the normally expressed Ey gene 
(left) or an Ey gene that is additionally 
expressed artificially in cells that normally 
give rise to leg tissue (right). 
(B) Photograph of an abnormal leg that 
contains a misplaced eye (see also Figure 
22-2). (B, courtesy of Walter Gehring.) 



Figure 7-78 Gene regulatory proteins that specify eye development in 
Drosophila. Toy (Twin of eyeless) and Ey (Eyeless) encode similar gene 
regulatory proteins, Toy and Ey, either of which, when ectopically 
expressed, can trigger eye development. In normal eye development, 
expression of Ey requires the Toy gene. Once its transcription is activated by 
Toy, Ey activates the transcription of So (Sine oculis) and Eya (Eyes absent), 
which act together to switch on the Dae (Dachshund) gene. As indicated by 
the green arrows, some of the gene regulatory proteins form a series of 
interlocking positive feedback loops that reinforce the initial commitment 
to eye development. The Ey protein is known to bind directly to numerous 
target genes for eye development, including those encoding lens 
crystallins, rhodopsins, and other photoreceptor proteins. (Adapted from 
T Czerny et al., Mol. Ceil 3:297-307, 1 999. With permission from Elsevier.) 
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The Pattern of DNA Methylation Can Be Inherited When 
Vertebrate Cells Divide 

Thus far, we have emphasized the regulation of gene transcription by proteins 
that associate with DNA. However, DNA itself can be covalently modified, and in 
the following sections we shall see that this, too, provides opportunities for the 
regulation of gene expression. In vertebrate cells, the methylation of cytosine 
provides a powerful mechanism through which gene expression patterns are 
passed on to progeny cells. The methylated form of cytosine, 5-methylcytosine 
(5-methyl C), has the same relation to cytosine that thymine has to uracil, and 
the modification likewise has no effect on base-pairing (Figure 7-79). DNA 
methylation in vertebrate DNA is restricted to cytosine (C) nucleotides in the 
sequence CG, which is base-paired to exactly the same sequence (in opposite 
orientation) on the other strand of the DNA helix. Consequently, a simple mech- 
anism permits the existing pattern of DNA methylation to be inherited directly 
by the daughter DNA strands. An enzyme called maintenance methyltransferase 
acts preferentially on those CG sequences that are base-paired with a CG 
sequence that is already methylated. As a result, the pattern of DNA methylation 
on the parental DNA strand serves as a template for the methylation of the 
daughter DNA strand, causing this pattern to be inherited directly following 
DNA replication (Figure 7-80). 

The stable inheritance of DNA methylation patterns can be explained by 
maintenance DNA methyltransferases. DNA methylation patterns, however, are 
dynamic during vertebrate development. Shortly after fertilization there is a 
genome-wide wave of demethylation, when the vast majority of methyl groups 
are lost from the DNA. This demethylation may occur either by suppression of 
maintenance DNA methyltransferase activity, resulting in the passive loss of 
methyl groups during each round of DNA replication, or by a specific demethyl- 
ating enzyme. Later in development, new methylation patterns are established 
by several de novo DNA methyltransferases that are directed to DNA by 
sequence-specific DNA-binding proteins where they modify adjacent unmethy- 
lated CG nucleotides. Once the new patterns of methylation are established, 
they can be propagated through rounds of DNA replication by the maintenance 
methyl transferases. 

DNA methylation has several uses in the vertebrate cell. Perhaps its most 
important role is to work in conjunction with other gene expression control 
mechanisms to establish a particularly efficient form of gene repression that can 
be faithfully passed on to progeny cells (Figure 7-81). This combination of 
mechanisms ensures that unneeded eucaryotic genes can be repressed to very 
high degrees. For example, the rate at which a vertebrate gene is transcribed can 
vary 10 6 -fold between one tissue and another. The unexpressed vertebrate genes 
are much less "leaky" in terms of transcription than bacterial genes, in which the 
largest known differences in transcription rates between expressed and unex- 
pressed gene states are about 1000-fold. 
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Figure 7-79 Formation of 5-methyl- 
cytosine occurs by methylation of a 
cytosine base in the DNA double helix. 

In vertebrates this event is confined to 
selected cytosine (C) nucleotides located 
in the sequence CG. 



Figure 7-80 How DNA methylation 
patterns are faithfully inherited. In 

vertebrate DNAs a large fraction of the 
cytosine nucleotides in the sequence CG 
are methylated (see Figure 7-79). 
Because of the existence of a methyl- 
directed methylating enzyme (the 
maintenance methyltransferase), once a 
pattern of DNA methylation is 
established, that pattern of methylation is 
inherited in the progeny DNA, as shown. 
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How DNA methylation helps to repress gene expression is not understood in 
detail, but two general mechanisms have emerged. DNA methylation of the pro- 
moter region of a gene or of its regulatory sequences can interfere directly with 
the binding of proteins required for transcription initiation. In addition, the cell 
has a repertoire of proteins that specifically bind to methylated DNA (see Figure 
7-81), thereby blocking access of other proteins. One reflection of the impor- 
tance of DNA methylation to humans is the widespread involvement of errors in 
this mechanism in cancer progression (see Chapter 20). 

We shall return to the topic of gene silencing by DNA methylation later in 
this chapter, when we discuss X-chromosome inactivation and other examples 
of large-scale gene silencing. First, however, we describe some of the other ways 
in which DNA methylation affects our genomes. 



Genomic Imprinting Is Based on DNA Methylation 

Mammalian cells are diploid, containing one set of genes inherited from the 
father and one set from the mother. The expression of a small minority of genes 
depends on whether they have been inherited from the mother or the father: 
while the paternally inherited gene copy is active, the maternally inherited gene 
copy is silent, or vice-versa. This phenomenon is called genomic imprinting. 
The gene for insulin-like growth factor-2 (Igf2) is a well-studied example of an 

histone modifying 
enzyme ("writer") 




gene regulatory protein complex 
that represses gene expression 
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Figure 7-81 Multiple mechanisms 
contribute to stable gene repression. In 

this schematic example, histone reader 
and writer proteins, under the direction 
of gene regulatory proteins, establish a 
repressive form of chromatin. A de novo 
DNA methylase is attracted by the 
histone reader and methylates nearby 
cytosines in DNA, which are, in turn, 
bound by DNA methyl-binding proteins. 
During DNA replication, some of the 
modified (blue dot) histones will be 
inherited by one daughter chromosome, 
some by the other, and in each daughter 
they can induce reconstruction of the 
same pattern of chromatin modifications 
(see Figure 5-39). At the same time, the 
mechanism shown in Figure 7-80 will 
cause both daughter chromosomes to 
inherit the same methylation pattern. The 
two inheritance mechanisms will be 
mutually reinforcing, if DNA methylation 
stimulates the activity of the histone 
writer. This scheme can account for the 
inheritance by daughter cells of both the 
histone and the DNA modifications. It can 
also explain the tendency of some 
chromatin modifications to spread along 
a chromosome (see Figure 4-45). 
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imprinted gene. Igf2 is required for prenatal growth, and mice that do not 
express Igf2 at all are born half the size of normal mice. However, only the pater- 
nal copy of Igf2 is transcribed, and only this gene copy matters for the pheno- 
type. As a result, mice with a mutated paternally derived Igf2 gene are stunted, 
while mice with a mutated maternally derived Igf2 gene are normal. 

In the early embryo, genes subject to imprinting are marked by methylation 
according to whether they were derived from a sperm or an egg chromosome. In 
this way, DNA methylation is used as a mark to distinguish two copies of a gene 
that may be otherwise identical (Figure 7-82). Because imprinted genes are 
somehow protected from the wave of demethylation that takes place shortly after 
fertilization (see p. 467), this mark enables somatic cells to "remember" the 
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Figure 7-82 Imprinting in the mouse. The top portion of the figure shows a pair of homologous chromosomes in the somatic cells of two 
adult mice, one male and one female. In this example, both mice have inherited the top homolog from their father and the bottom homolog 
from their mother, and the paternal copy of a gene subject to imprinting (indicated in orange) is methylated, preventing its expression. The 
maternally derived copy of the same gene (yellow) is expressed. The remainder of the figure shows the outcome of a cross between these two 
mice. During germ cell formation, but before meiosis, the imprints are erased and then, much later in germ cell development, they are 
reimposed in a sex-specific pattern (middle portion of figure). In eggs produced from the female, neither allele of the A gene is methylated. In 
sperm from the male, both alleles of gene A are methylated. Shown at the bottom of the figure are two of the possible imprinting patterns 
inherited by the progeny mice; the mouse on the left has the same imprinting pattern as each of the parents, whereas the mouse on the right 
has the opposite pattern. If the two alleles of A gene are distinct, these different imprinting patterns can cause phenotypic differences in the 
progeny mice, even though they carry exactly the same DNA sequences of the two A gene alleles. Imprinting provides an important exception 
to classical genetic behavior, and several hundred mouse genes are thought to be affected in this way. However, the majority of mouse genes 
are not imprinted, and therefore the rules of Mendelian inheritance apply to most of the mouse genome. 
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parental origin of each of the two copies of the gene and to regulate their expres- 
sion accordingly. In most cases, the methyl imprint silences nearby gene expres- 
sion. In some cases, however, the methyl imprint can activate expression of a 
gene. In the case of Igf2, for example, methylation of an insulator element (see 
Figure 7-62) on the paternally derived chromosome blocks its function and 
allows a distant enhancer to activate transcription of the Igf2 gene. On the 
maternally derived chromosome, the insulator is not methylated and the Igf2 
gene is therefore not transcribed (Figure 7-83). 

Why imprinting should exist at all is a mystery. In vertebrates, it is restricted 
to placental mammals, and many of the imprinted genes are involved in fetal 
development. One idea is that imprinting reflects a middle ground in the evo- 
lutionary struggle between males to produce larger offspring and females to 
limit offspring size. Whatever its purpose might be, imprinting provides 
startling evidence that features of DNA other than its sequence of nucleotides 
can be inherited. 



Figure 7-83 Mechanism of imprinting of 
the mouse Igf2 gene. On chromosomes 
inherited from the female, a protein 
called CTCF binds to an insulator (see 
Figure 7-62), blocking communication 
between the enhancer (green) and the 
Igf2 gene (orange). IGF2 is therefore not 
expressed from the maternally inherited 
chromosome. Because of imprinting, the 
insulator on the male-derived 
chromosome is methylated; this 
inactivates the insulator, by blocking the 
binding of the CTCF protein, and allows 
the enhancer to activate transcription of 
the Igf2 gene. In other examples of 
imprinting, methylation blocks gene 
expression by interfering with the 
binding of proteins required for a gene's 
transcription. 



CG-Rich Islands Are Associated with Many Genes in Mammals 

Because of the way in which DNA repair enzymes work, methylated C 
nucleotides in the genome tend to be eliminated in the course of evolution. 
Accidental deamination of an unmethylated C gives rise to U (see Figure 5-45), 
which is not normally present in DNA and thus is recognized easily by the DNA 
repair enzyme uracil DNA glycosylase, excised, and then replaced with a C (as 
discussed in Chapter 5). But accidental deamination of a 5-methyl C cannot be 
repaired in this way, for the deamination product is a T and so is indistinguish- 
able from the other, nonmutant T nucleotides in the DNA. Although a special 
repair system exists to remove these mutant T nucleotides, many of the deami- 
nations escape detection, so that those C nucleotides in the genome that are 
methylated tend to mutate to T over evolutionary time. 

During the course of evolution, more than three out of every four CGs have 
been lost in this way, leaving vertebrates with a remarkable deficiency of this 
dinucleotide. The CG sequences that remain are very unevenly distributed in the 
genome; they are present at 10-20 times their average density in selected 
regions, called CG islands, which are 1000-2000 nucleotide pairs long. These 
islands, with some important exceptions, seem to remain unmethylated in all 
cell types. They often surround the promoters of the so-called housekeeping 
genes—those genes that code for the many proteins that are essential for cell via- 
bility and are therefore expressed in most cells (Figure 7-84). 

The distribution of CG islands (also called CpG islands to distinguish the CG 
dinucleotides from the CG base pair) can be explained if we assume that CG 
methylation was adopted in vertebrates primarily as a way of maintaining DNA 
in a transcriptionally inactive state (see Figure 7-81). In vertebrates, new 
methyl-C to T mutations can be transmitted to the next generation only if they 
occur in the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
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Figure 7-84 The CG islands surrounding 
the promoter in three mammalian 
housekeeping genes. The yellow boxes 
show the extent of each island. As for 
most genes in mammals (see Figure 
6-25), the exons (dark red) are very short 
relative to the introns (light red). (Adapted 
from A.R Bird, Trends Genet 3:342-347, 
1987. With permission from Elsevier.) 
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periods of evolutionary time, the methylated CG sequences in these inactive 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired. However, promoters of genes that remain active 
in the germ cell lineages (including most housekeeping genes) are kept 
unmethylated, and therefore spontaneous deaminations of Cs that occur within 
them can be accurately repaired. Such regions are preserved in modern-day ver- 
tebrate cells as CG islands (Figure 7-85). In addition, any mutation of a CG 
sequence in the genome that destroyed the function or regulation of a gene in 
the adult would be selected against, and some CG islands are presumably the 
result of a higher than normal density of critical CG sequences for these genes. 
The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands thereby provides a convenient way of identi- 
fying genes in the DNA sequences of vertebrate genomes. 



Epigenetic Mechanisms Ensure That Stable Patterns of Gene 
Expression Can Be Transmitted to Daughter Cells 

As we have seen, once a cell in an organism differentiates into a particular cell 
type, it generally remains specialized in that way; if it divides, its daughters 
inherit the same specialized character. For example, liver cells, pigment cells, 
and endothelial cells (discussed in Chapter 23) divide many times in the life of 
an individual, each of them faithfully producing daughter cells of the same type. 
Such differentiated cells must remember their specific pattern of gene expres- 
sion and pass it on to their progeny through all subsequent cell divisions. 

We have already described several ways of enabling daughter cells to 
"remember" what kind of cells they are supposed to be. One of the simplest is 
through a positive feedback loop in which a key gene regulatory protein acti- 
vates, either directly or indirectly, the transcription of its own gene (see Figures 
7-m and 7-69), interlocking positive feedback loops provide even more stability 
by buffering the circuit against fluctuations in the level of any one gene regula- 
tory protein (Figures 7-75B and 7-78). We also saw above that DNA methylation 
can serve as a means for propagating gene expression patterns to descendants 
(see Figure 7-80). 

Figure 7-85 A mechanism to explain both the marked overall deficiency 
of CG sequences and their clustering into CG islands in vertebrate 
genomes. A black line marks the location of a CG dinucleotide in the DNA 
sequence, while a red "lollipop" indicates the presence of a methyl group on 
the CG dinucleotide. CG sequences that lie in regulatory sequences of 
genes that are transcribed in germ cells are unmethylated and therefore 
tend to be retained in evolution. Methylated CG sequences, on the other 
hand, tend to be lost through deamination of 5-methyl C to X unless the 
CG sequence is critical for survival. 
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Positive feedback loops and DNA methylation are common to both bacteria 
and eucaryotes; but eucaryotes also have available to them another means of 
maintaining a differentiated state through many cell generations. As we saw in 
Chapter 4, chromatin structure itself can be faithfully propagated from parent to 
daughter cell. There are several mechanisms to bring this about, but the sim- 
plest is based on the covalent modifications of histones. As we have seen, these 
modifications form a "histone code," with different patterns of modification 
serving as binding sites for different reader proteins. If these proteins, in turn, 
serve as (or attract) writer enzymes that replicate the very modification patterns 
that attracted them in the first place, then the distribution of active and silent 
regions of chromatin can be faithfully propagated (see Figure 5-39). In a sense, 
self-sustaining modification of histones is a form of positive feedback loop that 
is tied to the DNA but does not require the participation of the underlying DNA 
sequences. 

The ability of a daughter cell to retain a memory of the gene expression pat- 
terns thai were present in the parent cell is an example of epigenetic inheri- 
tance. This term lias subtly different meanings in different branches of biology, 
but we will use it in its broadest sense to cover any heritable difference in the 
phenotype of a cell or an organism that does not result from changes in the 
nucleotide sequence of DNA (see Figure 4-35). We have just discussed three of 
the most important mechanisms underlying epigenetic changes, but others also 
exist [Figure 7-86). Cells often combine these mechanisms to ensure that pat- 
terns of gene expression are maintained and inherited accurately and reliably- 
over a period of up to a hundred years or moie, in our own case. 

For more than half a century biologists have been preoccupied with DNA as 
the carrier of heritable information— and rightly so. However, it has become 
clear that human chromosomes also carry a great deal of information that is epi- 
genetic, and not contained in the sequence of the DNA itself. Imprinting is one 
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Figure 7-86 Four distinct mechanisms 
that can produce an epigenetic form of 
inheritance in an organism. (For the 
inheritance of histone modifications, see 
Figure 4-52; for the inheritance of protein 
conformations, see Figure 6-95.) 
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example. Another is seen in the phenomenon of mono-allelic expression, in 
which only one of the two copies of certain human genes is expressed. For many 
such genes, the decision of which allele to express and which to silence is ran- 
dom, but once made, it is passed on to progeny cells. Below, we will see an 
extreme example of this phenomenon inX-chromosome inactivation. 

The net effect of random and environmentally triggered epigenetic changes 
in humans can be seen by comparing identical twins: their genomes have the 
same sequence of nucleotides, but when their histone modification and DNA 
methylation patterns are compared, many differences are observed. Because 
these differences are roughly correlated not only with age but also with the time 
that the twins have spent apart from each other, it is believed that some of these 
changes are the result of environmental factors (Figure 7-87) . Although studies 
of the epigenome are in early stages, the idea that environmental events can be 
permanently registered by our cells is a fascinating one that presents an impor- 
tant challenge to the next generation of biological scientists. 








Figure 7-87 Identical twins raised apart 
from one another. (Courtesy of Nancy L. 



Segal.) 



Chromosome-Wide Alterations in Chromatin Structure Can Be 
Inherited 

We have seen that chromatin states and DNA methylation can be heritable, serv- 
ing to establish and preserve patterns of gene expression for many cell genera- 
tions. Perhaps the most striking example of this effect occurs in mammals, in 
which an alteration in the chromatin structure of an entire chromosome can 
modulate the levels of expression of all genes on that chromosome. 

Males and females differ in their sex chromosomes. Females have two X 
chromosomes, whereas males have one X and one Y chromosome. As a result, 
female cells contain twice as many copies of X-chromosome genes as do male 
cells. In mammals, the X and Y sex chromosomes differ radically in gene con- 
tent: the X chromosome is large and contains more than a thousand genes, 
whereas the Y chromosome is small and contains less than 100 genes. Mam- 
mals have evolved a dosage compensation mechanism to equalize the dosage of 
X-chromosome gene products between males and females. Mutations that 
interfere with dosage compensation are lethal: the correct ratio of X chromo- 
some to autosome (non-sex chromosome) gene products is critical for survival. 

Mammals achieve dosage compensation by the transcriptional inactivation 
of one of the two X chromosomes in female somatic cells, a process known as 
X-inactivation. Early in the development of a female embryo, when it consists 
of a few thousand cells, one of the two X chromosomes in each cell becomes 
highly condensed into a type of heterochromatin. The condensed X chromo- 
some can be easily seen under the light microscope in interphase cells; it was 
originally called a Ban body and is located near the nuclear membrane (Figure 
7-88). As a result of X-inactivation, two X chromosomes can coexist within the 
same nucleus, exposed to the same diffusible gene regulatory proteins, yet dif- 
fer entirely in their expression. 

The initial choice of which X chromosome to inactivate, the maternally 
inherited one (X m ) or the paternally inherited one (X p ), is random. Once either 
X p or X m has been inactivated, it remains silent throughout all subsequent cell 
divisions of that cell and its progeny, indicating that the inactive state is faith- 
fully maintained through many cycles of DNA replication and mitosis. Because 
X-inactivation is random and takes place after several thousand cells have 
already formed in the embryo, every female is a mosaic of clonal groups of cells 



Figure 7-88 X-chromosome inactivation in female cells. (A) Only the 
inactive X chromosome is coated with XIST RNA, visualized here by in situ 
hybridization to fluorescently labeled RNAs of complementary nucleotide 
sequence. The panel shows the nuclei of two adjacent cells. (B)The same 
sample, stained with antibodies against a component of the Polycomb group 
complex, which coats the X chromosome and helps to silence expression of 
its genes. (From B. Panning, Methods Enzymoi. 376:41 9-428, 2004. With 
permission from Academic Press.) 
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in which either X p or X m is silenced (Figure 7-89). These clonal groups are dis- 
tributed in small clusters in the adult animal because sister cells tend to remain 
close together during later stages of development. For example, X-chromosome 
inactivation causes the red and black "tortoise-shell" coat coloration of some 
female cats. In these cats, one X chromosome carries a gene that produces red 
hair color, and the other X chromosome carries an allele of the same gene that 
results in black hair color; it is the random X- inactivation that produces patches 
of cells of two distinctive colors. In contrast to the females, male cats of this 
genetic stock are either solid red or solid black, depending on which X chromo- 
some they inherit from their mothers. 

Although X-chromosome inactivation is maintained over thousands of cell 
divisions, it is not always permanent. In particular, it is reversed during germ- 
cell formation, so that all haploid oocytes contain an active X chromosome and 
can express X-linked gene products. 

How is an entire chromosome transcriptionally inactivated? X-chromosome 
inactivation is initiated and spreads from a single site in the middle of the X 
chromosome, the X-inactivation center (XIC). Encoded within the XIC is an 
unusual RNA molecule, XISTRNA, which is expressed solely from the inactive X 
chromosome and whose expression is necessary for X-inactivation. The XIST 
RNA is not translated into protein and remains in the nucleus, where it eventu- 
ally coats the entire inactive X chromosome. The spread of XIST RNA from the 
XIC over the entire chromosome correlates with the spread of gene silencing, 
indicating that XIST RNA drives the formation and spread of heterochromatin 
(Figure 7-90). Curiously, about 10% of the genes on the X-chromosome escape 
this silencing and remain active. 

In addition to containing XIST RNA, the X-chromosome heterochromatin is 
characterized by a specific variant of histone 2A, by hypoacetylation of histones 
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Figure 7-89 X-inactivation. The clonal 
inheritance of a condensed inactive 
X chromosome that occurs in female 
mammals. 
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Figure 7-90 Mammalian X-chromosome 
inactivation. X-chromosome inactivation 
begins with the synthesis of XIST 
(X-inactivation specific transcript) RNA 
from the XIC (X-inactivation center) locus. 
The association of XIST RNA with one of a 
female's two X chromosomes is 
correlated with the condensation of that 
chromosome. Early in embryogenesis, 
both XIST association and chromosome 
condensation gradually move from the 
XIC locus outward to the chromosome 
ends. The details of how this occurs 
remain to be deciphered. 
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H3 and H4, by ubiquitylation of histone 2A, by methylation of a specific position 
on histone H3, and by specific patterns of methylation on the underlying DNA 
(for a suggestion of how these features may be causally linked, see Figure 7-81). 
The combination of such modifications presumably makes most of the inactive 
X chromosome unusually resistant to transcription. Because these modifications 
are, at least in principle, self-propagating, it is easy to see how, once formed, an 
inactive X chromosome can be stably maintained through many cell divisions. 

Many features of mammalian X-chromosome inactivation remain to be dis- 
covered. How is the initial decision made as to which X chromosome to inacti- 
vate? What mechanism prevents the other X chromosome from also being inac- 
tivated? How does XIST RNA coordinate the formation of heterochromatin? How 
do some genes on the X chromosome escape inactivation? We are just beginning 
to understand this mechanism of gene regulation that is crucial for the survival 
of our own species. 

X-chromosome inactivation in females is only one of the ways in which sex- 
ually reproducing organisms achieve dosage compensation. In Drosophila, 
most of the genes on the single X chromosome present in male cells are tran- 
scribed at approximately twofold higher levels than their counterparts in female 
cells. This male-specific "up-regulation" of transcription results from an alter- 
ation in chromatin structure over the entire male X chromosome. A dosage- 
compensation complex, containing several histone-modifying enzymes as well 
as two noncoding RNAs transcribed from the X chromosome, assembles at hun- 
dreds of positions along the X chromosome and produces patterns of histone 
modification that are thought to upregulate transcription—through effects on 
either initiation or elongation — at most genes on the male X chromosome. 

The nematode worm uses a third strategy for dosage compensation. Here, 
the two sexes are male (with one X chromosome) and hermaphrodite (with two 
X chromosomes), and dosage compensation occurs by an approximately two- 
fold "down-regulation" of transcription from each of the two X chromosomes in 
cells of the hermaphrodite. This is brought about through chromosome-wide 
structural changes in the X chromosomes of hermaphrodites (Figure 7-91). A 
dosage-compensation complex, which is completely different from that of 
Drosophila and resembles instead the condensin complex that compacts chro- 
mosomes during mitosis and meiosis (see Figure 17-27), assembles along each 
X chromosome of hermaphrodites and, by an unknown mechanism, superim- 
poses a global twofold repression on the normal expression level of each gene. 

Although the strategy and components used to cause dosage compensation 
differ between mammals, flies, and worms, they all involve structural alterations 
over the entire X chromosome. It seems likely that features of chromosome 
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structure that are quite general were independently adapted and harnessed dur- 
ing evolution to overcome a highly specific problem in gene regulation encoun- 
tered by sexually reproducing animals. 



The Control of Gene Expression is Intrinsically Noisy 

So far in this chapter we have discussed gene expression as though it were a 
strictly deterministic process, so that, if only one knew the concentrations of all 
the relevant regulatory proteins and other control molecules, the level of gene 
expression would be precisely predictable. In reality, there is a large amount of 
random variation in the behavior of cells. In part, this is because there are ran- 
dom fluctuations in the environment, and these disturb the concentrations of 
regulatory molecules inside the cell in unpredictable ways. Another possible 
cause, in some cases, maybe chaotic behavior of the intracellular control system: 
mathematical analysis shows that even quite simple control systems may be 
acutely sensitive to the control parameters, in such a way that, for example, a tiny 
difference of initial conditions may lead to a radically different long-term out- 
come. But in addition to these causes of unpredictability, there is a further, more 
fundamental reason why all cell behavior is inescapably random to some degree. 

Cells are chemical systems consisting of relatively small numbers of 
molecules, and chemical reactions at the level of individual molecules occur in 
an essentially random, or stochastic, manner. A given molecule has a certain 
probability per unit time of undergoing a chemical reaction, but whether it will 
actually do so at any given moment is unpredictable, depending on random 
thermal collisions and the probabilistic rules of quantum mechanics. The 
smaller the number of molecules governing a process inside the cell, the more 
severely it will be affected by the randomness of chemical events at the single- 
molecule level. Thus there is some degree of randomness in every aspect of cell 
behavior, but certain processes are liable to be random in the extreme. 

The control of transcription, in particular, depends on the precise chemical 
condition of the gene. Consider a simple idealized case, in which a gene is tran- 
scribed so long as it has a transcriptional activator protein bound to its regula- 
tory region, and transcriptionally silent when this protein is not bound. The 
association/dissociation reaction between the regulatory DNA and the protein 
is stochastic: if the bound state has a half- life h /2 of an hour, the gene may remain 
activated sometimes for 30 minutes or less, sometimes for a couple of hours or 
more at a stretch, before the activator protein dissociates. In this way, transcrip- 
tion will flicker on and off in an essentially random way. The average rate of flick- 
ering, and the ratio of the average time spent in the "on" state to the average time 
spent in the "off" state, will be determined by the k ff and k on values for the bind- 
ing reaction and by the concentration of the activator protein in the cell. The 
quantity of gene transcripts accumulated in the cell will fluctuate accordingly; if 
the lifetime of the transcripts is long compared with ti/ 2 , the fluctuations will be 
smoothed out; if it is short, they will be severe. 

One way to demonstrate such random fluctuations in the expression of indi- 
vidual gene copies is to genetically engineer cells in which one copy of a gene 
control region is linked to a sequence coding for a green fluorescent reporter 
protein, while another copy is linked similarly to a sequence coding for a red flu- 
orescent reporter. Although both these gene constructs are in the same cell and 
experiencing the same environment, they fluctuate independently in their level 
of expression. As a result, in a population of cells that all carry the same pair of 
constructs, some cells appear green, others red, and still others a mixture of the 
two colors, and thus in varying shades of yellow (see Figure 8-75). More gener- 
ally, cell fate decisions are often made in a stochastic manner, presumably as a 
result of such random fluctuations; we shall encounter an example in Chapter 
23, where we discuss the genesis of the different types of white blood cells. 

In some types of cells, and for some aspects of cell behavior, randomness 
in the control of gene transcription, such as we have just described, seems to 
be the major source of random variability; in other cell types, other sources of 
random variation predominate. Where noise in a control system would be 
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Figure 7-91 Localization of dosage 
compensation proteins to the 
X chromosomes of G elegans 
hermaphrodite (XX) nuclei. This image 
shows many nuclei from a developing 
embryo. Total DNA is stained blue with 
the DNA-intercalating dye DAPI, and the 
Sdc2 protein is stained red using anti- 
Sdc2 antibodies coupled to a fluorescent 
dye. This experiment shows that the Sdc2 
protein associates with only a limited set 
of chromosomes, identified by other 
experiments to be the two 
X chromosomes. Sdc2 is bound along the 
entire length of the X chromosome and 
recruits the dosage-compensation 
complex. (From H.E. Dawes et al., Science 
284:1800-1804, 1999. With permission 
from AAAS.) 
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harmful, special control mechanisms have evolved to minimize its effects; the 
feed- forward loop discussed earlier is an example of such a device, serving to fil- 
ter out the effects of rapid fluctuations in a control signal. But in all cells, some 
degree of randomness is inevitable. It is a fundamental feature of cell behavior. 



Summary 

The many types of cells in animals and plants are created largely through mechanisms 
that cause different sets of genes to be transcribed in different cells. Since specialized 
animal cells can maintain their unique character through many cell division cycles 
and even when grown in culture, the gene regulatory mechanisms involved in creating 
them must be stable once established and heritable when the cell divides. These fea- 
tures reflect the cell's memory of its developmental history. Bacteria and yeasts also 
exhibit cell memory and provide unusually accessible model systems in which to study 
gene regulatory mechanisms. 

Direct or indirect positive feedback loops, which enable gene regulatory proteins to 
perpetuate their own synthesis, provide the simplest mechanism for cell memory. 
Transcription circuits also provide the cell with the means to carry out logic operations 
and measure time. Simple transcription circuits combined into large regulatory net- 
works drive highly sophisticated programs of embryonic development. 

In eucaryotes } the transcription of any particular gene is generally controlled by a 
combination of gene regulatory proteins. It is thought that each type of cell in a higher 
eucaryotic organism contains a specific set of gene regulatory proteins that ensures the 
expression of only those genes appropriate to that type of cell. A given gene regulatory 
protein may be active in a variety of circumstances and is typically involved in the reg- 
ulation of many different genes. 

Unlike bacteria, eucaryotic cells use inherited states of chromatin condensation as 
an additional mechanism to regulate gene expression and to create cell memory. An 
especially dramatic case is the inactivation of an entire X chromosome in female 
mammals. DNA methylation can also silence genes in a heritable manner in eucary- 
otes. In addition, it underlies the phenomenon of genomic imprinting in mammals, in 
which the expression of a gene depends on whether it was inherited from the mother 
or the father. 

POST-TRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression can be con- 
trolled. Indeed, one can find examples of each type of regulation, and many 
genes are regulated by multiple mechanisms. As we have seen, controls on the 
initiation of gene transcription are a critical form of regulation for all genes. But 
other controls can act later in the pathway from DNA to protein to modulate the 
amount of gene product that is made— and in some cases, to determine the 
exact amino acid sequence of the protein product. These post-transcriptional 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are crucial for the regulation of many genes. 

In the following sections, we consider the varieties of post-transcriptional reg- 
ulation in temporal order, according to the sequence of events that an RNA 
molecule might experience after its transcription has begun (Figure 7-92). 



Transcription Attenuation Causes the Premature Termination of 
Some RNA Molecules 

It has long been known that the expression of certain genes in bacteria is inhib- 
ited by premature termination of transcription, a phenomenon called tran- 
scription attenuation. In some of these cases the nascent RNA chain adopts a 
structure that causes it to interact with the RNA polymerase in such a way as to 
abort its transcription. When the gene product is required, regulatory proteins 
bind to the nascent RNA chain and interfere with attenuation, allowing the tran- 
scription of a complete RNA molecule. 



START RNA 
TRANSCRIPTION 



POSSIBLE 
ATTENUATION 

CAPPING 

SPLICING 
AND 3'-END 
CLEAVAGE 




RNA 

transcript 

aborte 

nonfunctional 

mRNA 

sequences 


POSSIBLE 
RNA EDITING 






NUCLEAR 
EXPORT 


* 


retention and 

degradation 
in nucleus 


*• 




SPATIAL 
LOCALIZATION 


<. 




IN CYTOPLASM 




> > 


START 
TRANSLATION 


h 


translation 
blocked 


POSSIBLE 

TRANSLATIONAL 

RECODING 


r 

1 




POSSIBLE 

RNA 

STABILIZATION 


h 


RNA degraded 



CONTINUED 
PROTEIN SYNTHESIS 

Figure 7-92 Post-transcriptional 
controls on gene expression. The final 
synthesis rate of a protein can, in 
principle, be controlled at any of the 
steps shown. RNA splicing, RNA editing, 
and translation recoding (described in 
Chapter 6) can also alter the sequence of 
amino acids in a protein, making it 
possible for the cell to produce more 
than one protein variant from the same 
gene. Only a few of the steps depicted 
here are likely to be critical for the 
regulation of any one particular protein. 
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Transcription attenuation also operates in eucaryotes. A well- studied exam- 
ple occurs during the life cycle of Hiy the human immunodeficiency virus, 
causative agent of acquired immune deficiency syndrome, or AIDS. Once it has 
been integrated into the host genome, the viral DNA is transcribed by the cell's 
RNA polymerase II (see Figure 5-71). However, the host polymerase usually ter- 
minates transcription after synthesizing transcripts of several hundred 
nucleotides and therefore does not efficiently transcribe the entire viral genome. 
When conditions for viral growth are optimal, a virus-encoded protein called Tat, 
which binds to a specific stem-loop structure in the nascent RNA that contains a 
"bulged base," prevents this premature termination. Once bound to this specific 
RNA structure (called Tar), Tat assembles several cell proteins that allow the RNA 
polymerase to continue transcribing. The normal role of at least some of these 
proteins is to prevent pausing and premature termination by RNA polymerase 
when it transcribes normal cell genes. Eucaryotic genes often contain long 
introns; to transcribe a gene efficiently, RNA polymerase II cannot afford to linger 
at nucleotide sequences that happen to promote pausing. Thus, a normal cell 
mechanism has apparently been adapted by HIV to permit efficient transcription 
of its genome to be controlled by a single viral protein. 



Riboswitches Might Represent Ancient Forms of Gene Control 

In Chapter 6, we discussed the idea that, before modern cells arose on Earth, RNA 
both stored hereditary information and catalyzed chemical reactions. The recent 
discovery of riboswitches shows that RNA can also form control devices that reg- 
ulate gene expression. Riboswitches are short sequences of RNA that change 
their conformation on binding small molecules, such as metabolites. Each 
riboswitch recognizes a specific small molecule and the resulting conformational 
change is used to regulate gene expression. Riboswitches are often located near 
the 5' end of mRNAs, and they fold while the mRNA is being synthesized block- 
ing or permitting progress of the RNA polymerase according to whether the reg- 
ulatory small molecule is bound (Figure 7-93). 

Riboswitches are particularly common in bacteria, in which they sense key 
small metabolites in the cell and adjust gene expression accordingly. Perhaps 
their most remarkable feature is the high specificity and affinity with which each 
recognizes only the appropriate small molecule; in many cases, every chemical 
feature of the small molecule is read by the RNA (Figure 7-93C). Moreover, the 
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Figure 7-93 A riboswitch that responds to guanine. (A) In this example from bacteria, the 
riboswitch controls expression of the purine biosynthetic genes, When guanine levels In cells 
are low, an elongating RNA polymerase transcribes the purine biosynthetk genes, and the 
enzymes needed for guanine synthesis are therefore expressed. (B) When guanine is abundant, 
it binds the riboswitch, causing it to undergo a conformational change that forces the RNA 
polymerase to terminate transcription (see Figure 6-11). (C) Guanine (red) bound to the 
riboswitch. Only those nucleotides that form the guanme-binding pocket are. shown. Many 
other riboswitches exist, including those that recognize S-adenosyl methione, coenzyme B^, 
flavin mononucleotide, adenine, lysine, and glycine. (Adapted from JV1. Mandal and R.R. Breaker, 
Nat. Rev. Mol CeltBiat. 5 451-63 r 2004, with permission from Matmillan Publishers Ltd., and 
CK, Vanderpool and S. Gottesman, Moi Microbiol. 54:1076-1089, 2004 r with permission from 
Blackwell Publishing.) 
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binding affinities observed are as tight as those typically observed between 
small molecules and proteins. 

Riboswitches are perhaps the most economical examples of gene control 
devices, inasmuch as they bypass the need for regulatory proteins altogether. In 
the example shown in Figure 7-93, the riboswitch controls transcription elonga- 
tion, but they also regulate other steps in gene expression, as we shall see later 
in this chapter. Clearly, highly sophisticated gene control devices can be made 
from short sequences of RNA. 



Alternative RNA Splicing Can Produce Different Forms of a 
Protein from the Same Gene 

As discussed in Chapter 6, RNA splicing shortens the transcripts of many 
eucaryotic genes by removing the intron sequences from the mRNA precursor. 
We also saw that a cell can splice an RNA transcript differently and thereby make 
different polypeptide chains from the same gene — a process called alternative 
RNA splicing (see Figure 6-27 and Figure 7-94). A substantial proportion of ani- 
mal genes (estimated at 40% in flies and 75% in humans) produce multiple pro- 
teins in this way. 

When different splicing possibilities exist at several positions in the tran- 
script, a single gene can produce dozens of different proteins. In one extreme 
case, a Drosophila gene may produce as many as 38,000 different proteins from 
a single gene through alternative splicing (Figure 7-95), although only a fraction 
of these forms have thus far been experimentally observed. Considering that the 
Drosophila genome has approximately 14,000 identified genes, it is clear that the 
protein complexity of an organism can greatly exceed the number of its genes. 
This example also illustrates the perils in equating gene number with an organ- 
ism's complexity. For example, alternative splicing is relatively rare in single- 
celled budding yeasts but very common in flies. Budding yeast has -6200 genes, 
only about 300 of which are subject to splicing, and nearly all of these have only 
a single intron. To say that flies have only 2-3 times as many genes as yeasts 
greatly underestimates the difference in complexity of these two genomes. 
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Figure 7-94 Four patterns of alternative 
RNA splicing. In each case a single type 
of RNA transcript is spliced in two 
alternative ways to produce two distinct 
mRNAs (1 and 2). The dark blue boxes 
mark exon sequences that are retained in 
both mRNAs. The light blue boxes mark 
possible exon sequences that are 
included in only one of the mRNAs. The 
boxes are joined by red lines to indicate 
where intron sequences (yellow) are 
removed. {Adapted with permission from 

A. Andreadis, M.E. Gallego and 

B. Nadal-Ginard, Annu. Rev. Cell Biol. 
3:207-242, 1987. With permission from 
Annual Reviews.) 
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Figure 7-95 Alternative splicing of RNA transcripts of the Drosophila Dscam gene. DSCAM proteins are axon guidance 
receptors that help to direct growth cones to their appropriate targets in the developing nervous system. The final mRNA 
contains 24 exons, four of which (denoted A, B, C, and D) are present in the Dscam gene as arrays of alternative exons. Each 
RNA contains 1 of 1 2 alternatives for exon A (red), 1 of 48 alternatives for exon B (green), 1 of 33 alternatives for exon C 
(blue), and 1 of 2 alternatives for exon D (yellow). If all possible splicing combinations are used, 38,01 6 different proteins 
could in principle be produced from the Dscam gene. This figure shows only one of the many possible splicing patterns 
(indicated by the red line and by the mature mRNA below it). Each variant Dscam protein would fold into roughly the same 
structure [predominantly a series of extracellular immunoglobulin-like domains linked to a membrane-spanning region 
(see Figure 25-74)], but the amino acid sequence of the domains would vary according to the splicing pattern. It is 
suspected that this receptor diversity contributes to the formation of complex neural circuits, but the precise properties 
and functions of the many Dscam variants are not yet understood. (Adapted from D.L. Black, Celt 103:367-370, 2000. With 
permission from Elsevier.) 
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Figure 7-96 Negative and positive 
control of alternative RNA splicing. 

(A) In negative control, a repressor 
protein binds to the pre-mRNA transcript 
and blocks access of the splicing 
machinery to a splice junction. This often 
results in the use of a cryptic splice site, 
thereby producing an altered pattern of 
splicing (not shown). (B) In positive 
control, the splicing machinery is unable 
to remove a particular intron sequence 
efficiently without assistance from an 
activator protein. Because the nucleotide 
sequences that bind these activators can 
be located many nucleotide pairs from 
the splice junctions they control, they are 
often called splicing enhancers. 



In some cases alternative RNA splicing occurs because there is an intron 
sequence ambiguity: the standard spliceosome mechanism for removing intron 
sequences (discussed in Chapter 6) is unable to distinguish cleanly between two 
or more alternative pairings of 5' and 3' splice sites, so that different choices are 
made by chance on different transcripts. Where such constitutive alternative 
splicing occurs, several versions of the protein encoded by the gene are made in 
all cells in which the gene is expressed. 

In many cases, however, alternative RNA splicing is regulated rather than 
constitutive. In the simplest examples, regulated splicing is used to switch from 
the production of a nonfunctional protein to the production of a functional one. 
The transposase that catalyzes the transposition of the Dwsophila P element, for 
example, is produced in a functional form in germ cells and a nonfunctional 
form in somatic cells of the fly allowing the P element to spread throughout the 
genome of the fly without causing damage in somatic cells (see Figure 5-69). The 
difference in transposon activity has been traced to the presence of an intron 
sequence in the transposase RNA that is removed only in germ cells. 

In addition to switching from the production of a functional protein to the 
production of a nonfunctional one, the regulation of RNA splicing can generate 
different versions of a protein in different cell types, according to the needs of 
the cell. Tropomyosin, for example, is produced in specialized forms in different 
types of cells (see Figure 6-27). Cell-rype-specific forms of many other proteins 
are produced in the same way 

RNA splicing can be regulated either negatively, by a regulatory molecule 
that prevents the splicing machinery from gaining access to a particular splice 
site on the RNA, or positively, by a regulatory molecule that helps direct the 
splicing machinery to an otherwise overlooked splice site (Figure 7-96). 

Because of the plasticity of RNA splicing, the blocking of a "strong" splicing 
site will often expose a "weak" site and result in a different pattern of splicing. 
Likewise, activating a suboptimal splice site can result in alternative splicing by 
suppressing a competing splice site. Thus the splicing of a pre-mRNA molecule 
can be thought of as a delicate balance between competing splice sites— a bal- 
ance that can easily be tipped by regulatory proteins. 



The Definition of a Gene Has Had to Be Modified Since the 
Discovery of Alternative RNA Splicing 

The discovery that eucaryotic genes usually contain introns and that their cod- 
ing sequences can be assembled in more than one way raised new questions 
about the definition of a gene. A gene was first clearly defined in molecular 
terms in the early 1940s from work on the biochemical genetics of the fungus 
Neurospora. Until then, a gene had been defined operationally as a region of 
the genome that segregates as a single unit during meiosis and gives rise to a 
definable phenotypic trait, such as a red or a white eye in Dwsophila or a round 
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or wrinkled seed in peas. The work on Neurospora showed that most genes cor- 
respond to a region of the genome that directs the synthesis of a single enzyme. 
This led to the hypothesis that one gene encodes one polypeptide chain. The 
hypothesis proved fruitful for subsequent research; as more was learned about 
the mechanism of gene expression in the 1960s, a gene became identified as that 
stretch of DNA that was transcribed into the RNA coding for a single polypeptide 
chain (or a single structural RNA such as a tRNA or an rRNA molecule). The dis- 
covery of split genes and introns in the late 1970s could be readily accommo- 
dated by the original definition of a gene, provided that a single polypeptide 
chain was specified by the RNA transcribed from any one DNA sequence. But it 
is now clear that many DNA sequences in higher eucaryotic cells can produce a 
set of distinct (but related) proteins by means of alternative RNA splicing. How, 
then, is a gene to be defined? 

In those relatively rare cases in which a single transcription unit produces 
two very different eucaryotic proteins, the two proteins are considered to be 
produced by distinct genes that overlap on the chromosome. It seems unneces- 
sarily complex, however, to consider most of the protein variants produced by 
alternative RNA splicing as being derived from overlapping genes. A more sensi- 
ble alternative is to modify the original definition to count as a gene any DNA 
sequence that is transcribed as a single unit and encodes one set of closely 
related polypeptide chains (protein isoforms). This definition of a gene also 
accommodates those DNA sequences that encode protein variants produced by 
post-transcriptional processes other than RNA splicing, such as translational 
frameshifting (see Figure 6-78), regulated poly-A addition, and RNA editing (to 
be discussed below). 
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Sex Determination in Drosophila Depends on a Regulated Series 
of RNA Splicing Events 

We now turn to one of the best-understood examples of regulated RNA splicing. 
In Drosophila the primary signal for determining whether the fly develops as a 
male or female is the ratio of the number of X chromosomes (X) to the number 
of autosomal sets (A). Individuals with an X/A ratio of 1 (normally two X chro- 
mosomes and two sets of autosomes) develop as females, whereas those with an 
X/A a ratio of 0.5 (normally one X chromosome and two sets of autosomes) 
develop as males. This ratio is assessed early in development and is remembered 
thereafter by each cell. Three crucial gene products transmit information about 
this ratio to the many other genes that specify male and female characteristics 
(Figure 7-97). As explained in Figure 7-98, sex determination in Drosophila 
depends on a cascade of regulated RNA splicing events that involves these three 
gene products. 

Although Drosophila sex determination provides one of the best-under- 
stood examples of a regulatory cascade based on RNA splicing, it is not clear why 
the fly should use this strategy. Other organisms (the nematode, for example) 
use an entirely different scheme for sex determination — one based on tran- 
scriptional and translational controls. Moreover, the Drosophila male-determi- 
nation pathway requires that a number of nonfunctional RNA molecules be con- 
tinually produced, which seems unnecessarily wasteful. One speculation is that 
this RNA-splicing cascade, like the ribsoswitches discussed above, represents an 
ancient control strategy, left over from the early stage of evolution in which RNA 



Figure 7-97 Sex determination in Drosophila. The gene products shown act 
in a sequential cascade to determine the sex of the fly according to the 
X-chromosome/autosome set ratio (X/A). The genes are called Sex-lethal 
(Sxl), Transformer (Tra), and Doublesex (Dsx) because of the phenotypes that 
result when the gene is inactivated by mutation. The function of these gene 
products is to transmit the information about the X/A ratio to the many 
other genes that create the sex-related phenotypes. These other genes 
function as two alternative sets: those that specify female features and those 
that specify male features (see Figure 7-98). 
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was the predominant biological molecule, and controls of gene expression 
would have had to be based almost entirely on RNA-RNA interactions. 



A Change in the Site of RNA Transcript Cleavage and Poly-A 
Addition Can Change the C-terminus of a Protein 

We saw in Chapter 6 that the 3' end of a eucaryotic mRNA molecule is not 
formed by the termination of RNA synthesis by the RNA polymerase. Instead, it 
results from an RNA cleavage reaction that is catalyzed by additional factors 
while the transcript is elongating (see Figure 6-37). A cell can control the site of 
this cleavage so as to change the C-terminus of the resultant protein. 

A well-studied example is the switch from the synthesis of membrane- 
bound to secreted antibody molecules that occurs during the development of B 
lymphocytes (see Figure 25-17). Early in the life history of a B lymphocyte, the 
antibody it produces is anchored in the plasma membrane, where it serves as a 
receptor for antigen. Antigen stimulation causes B lymphocytes to multiply and 
to begin secreting their antibody. The secreted form of the antibody is identical 
to the membrane-bound form except at the extreme C-terminus. In this part of 
the protein, the membrane- bound form has a long string of hydrophobic amino 
acids that traverses the lipid bilayer of the membrane, whereas the secreted 
form has a much shorter string of hydrophilic amino acids. The switch from 
membrane-bound to secreted antibody therefore requires a different nucleotide 
sequence at the 3' end of the mRNA; this difference is generated through a 
change in the length of the primary RNA transcript, caused by a change in the 
site of RNA cleavage, as shown in Figure 7-99. This change is caused by an 
increase in the concentration of a subunit of CstF, the protein that binds to the 
G/U-rich sequences of RNA cleavage and poly-A addition sites and promotes 



Figure 7-98 The cascade of changes in 
gene expression that determines the 
sex of a fly through alternative RNA 
splicing. An X-chromosome/autosome 
set ratio of 0.5 results in male 
development. Male is the "default" 
pathway in which the Sxl and Tra genes 
are both transcribed, but the RNAs are 
spliced constitutively to produce only 
nonfunctional RNA molecules, and the 
Dsx transcript is spliced to produce a 
protein that turns off the genes that 
specify female characteristics. An X/A 
ratio of 1 triggers the female 
differentiation pathway in the embryo by 
transiently activating a promoter within 
the Sxl gene that causes the synthesis of 
a special class of Sxl transcripts that are 
constitutively spliced to give functional 
Sxl protein. Sxl is a splicing regulatory 
protein with two sites of action: (1) it 
binds to the constitutively produced Sxl 
RNA transcript causing a female-specific 
splice that continues the production of a 
functional Sxl protein, and (2) it binds to 
the constitutively produced Tra RNA and 
causes an alternative splice of this 
transcript which now produces an active 
Tra regulatory protein. The Tra protein 
acts with the constitutively produced 
Tra2 protein to produce the female- 
specific spliced form of the Dsx transcript; 
this encodes the female form of the Dsx 
protein, which turns off the genes that 
specify male features. 

The components in this pathway were 
all initially identified through the study of 
Drosophila mutants that are altered in 
their sexual development. The Dsx gene, 
for example, derives its name (Doublesex) 
from the observation that a fly lacking 
this gene product expresses both male- 
specific and female-specific features. 
Note that this pathway includes both 
negative and positive control of splicing 
(see Figure 7-96). Sxl binds to the 
pyrimidine-rich stretch of nucleotides 
that is part of the standard splicing 
consensus sequence and blocks access 
by the normal splicing factor, U2AF (see 
Figure 6-29). Tra binds to specific RNA 
sequences in an exon and withTra2 
activates a normally suboptimal splicing 
signal by the binding of U2AF. 
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RNA cleavage (see Figures 6-37 and 6-38). The first cleavage-poly-A addition site 
that an RNA polymerase transcribing the antibody gene encounters is subopti- 
mal and is usually skipped in unstimulated B lymphocytes, leading to produc- 
tion of the longer RNA transcript. When activated to produce antibodies, the B 
lymphocyte increases its CstF concentration; as a result, cleavage now occurs at 
the suboptimal site, and the shorter transcript is produced. In this way, a change 
in concentration of a general RNA processing factor can have a dramatic effect 
on the expression of a particular gene. 



RNA Editing Can Change the Meaning of the RNA Message 

The molecular mechanisms used by cells are a continual source of surprises. An 
example is the process of RNA editing, which alters the nucleotide sequences of 
RNA transcripts once they are synthesized and thereby changes the coded mes- 
sage they carry. The most dramatic form of RNA editing was discovered in RNA 
transcripts that code for proteins in the mitochondria of trypanosomes. Here, 
one or more U nucleotides are inserted (or, less frequently, removed) from 
selected regions of a transcript, altering both the original reading frame and the 
sequence and thereby changing the meaning of the message. For some genes 
the editing is so extensive that over half of the nucleotides in the mature mRNA 
are U nucleotides that were inserted during the editing process. A set of 40- to 
80-nucleotide-long RNA molecules that are transcribed separately contains the 
information that specifies exactly how the initial RNA transcript is to be altered. 
These so-called guide RNAs have a 5' end that is complementary in sequence to 
one end of the region of the transcript to be edited, followed by a sequence that 
specifies the set of nucleotides to be inserted into the transcript (Figure 7-100). 
The editing mechanism is remarkably complex: at each edited position, the RNA 
is broken, U nucleotides are added to the broken 3' end, and the RNA is ligated. 



Figure 7-99 Regulation of the site of 
RNA cleavage and poly-A addition 
determines whether an antibody 
molecule is secreted or remains 
membrane-bound. In unstimulated 
B lymphocytes (left), a long RNA 
transcript is produced, and the intron 
sequence near its 3' end is removed by 
RNA splicing to give rise to an mRNA 
molecule that codes for a membrane- 
bound antibody molecule. In contrast 
after antigen stimulation (right), the 
primary RNA transcript is cleaved 
upstream from the splice site in front of 
the last exon sequence. As a result, some 
of the intron sequence that is removed 
from the long transcript remains as a 
coding sequence in the short transcript. 
These are the nucleotide sequences that 
encode the hydrophilic C-terminal 
portion of the secreted antibody 
molecule. 
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Figure 7-100 RNA editing in the 
mitochondria of trypanosomes. Editing 
generally starts near the 3' end and 
progresses toward the 5' end of the RNA 
transcript, as shown, because the "anchor 
sequence" at the 5' end of most guide 
RNAs can pair only with edited 
sequences. The U nucleotides are added 
by a specialized enzyme called a uridylyl 
transferase. 
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RNA editing of a more refined type occurs in mammals. Here, two principal 
types of editing occur, the deamination of adenine to produce inosine (A-to-I 
editing) and the deamination of cytosine to produce uracil (C-to-U editing; see 
Figure 5-50). Because these chemical modifications alter the pairing properties 
of the bases (I pairs with C, and U pairs with A), they can have profound effects 
on the meaning of the RNA. If the edit occurs in a coding region, it can change 
the amino acid sequence of the protein or produce a truncated protein. Edits 
that occur outside coding sequences can affect the pattern of pre-mRNA splic- 
ing, the transport of mRNA from the nucleus to the cytosol, or the efficiency with 
which the RNA is translated. 

The process of A-to-I editing is particularly prevalent in humans, where it is 
estimated to affect over 1000 genes. Enzymes called ADARs (adenosine deami- 
nases acting on RNA) perform this type of editing; these enzymes recognize a 
double-stranded RNA structure that is formed through base pairing between the 
site to be edited and a complementary sequence located elsewhere on the same 
RNA molecule, typically in a 3' intron (Figure 7-101). These complementary 
sequences specify whether the mRNA is to be edited, and if so, exactly where the 
edit should be made. An especially important example of A-to-I editing takes 
place in the mRNA that codes for a transmitter-gated ion channel in the brain. A 
single edit changes a glutamine to an arginine; the affected amino acid lies on 
the inner wall of the channel, and the editing change alters the Ca 2+ permeabil- 
ity of the channel. The importance of this edit in mice has been demonstrated 
by deleting the relevant ADAR gene. The mutant mice are prone to epileptic 
seizures and die during or shortly after weaning. If the gene for the gated ion 
channel is mutated to produce the edited form of the protein directly, mice lack- 
ing the ADAR develop normally, showing that editing of the ion channel RNA is 
normally crucial for proper brain development. 

C-to-U editing, which is carried out by a different set of enzymes, is also cru- 
cial in mammals. For example, in certain cells of the gut, the mRNA for 
apolipoprotein B undergoes a C-to-U edit that creates a premature stop codon 
and therefore produces a shorter form of the protein. In cells of the liver, the 
editing enzyme is not expressed, and the full-length apolipoprotein B is pro- 
duced. The two protein isoforms have different properties, and each plays a spe- 
cialized role in lipid metabolism that is specific to the organ that produces it. 

Why RNA editing exists at all is a mystery. One idea is that it arose in evolu- 
tion to correct "mistakes" in the genome. Another is that it arose as a somewhat 
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Figure 7-101 Mechanism of A-to-I RNA 
editing in mammals. RNA sequences 
carried on the same RNA molecule signal 
the position of an edit. Typically, a 
sequence complementary to the position 
of the edit is present in an intron, and the 
resulting double-stranded RNA structure 
attracts the A-to-I editing enzyme ADAR. 
This type of editing takes place in the 
nucleus, before the pre-mRNA has been 
fully processed. Mice and humans have 
three ADAR enzymes: ADR1 is required in 
the liver for proper red blood cell 
development, ADR2 is required for 
proper brain development (as described 
in the text), and the role of ADR3 is 
uncertain. 
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slapdash way for the cell to produce subtly different proteins from the same 
gene. A third possibility is that RNA editing originally evolved as a defense mech- 
anism against retroviruses and retrotransposons and was later adapted by the 
cell to change the meanings of certain mRNAs. Indeed, RNA editing still plays 
important roles in cell defense. Some retroviruses, including HIV (see Figure 
5-71), are extensively edited after they infect cells. This hyperediting creates 
many harmful mutations in the viral RNA genome and also causes viral mRNAs 
to be retained in the nucleus, where they are eventually degraded. Although 
some modern retroviruses protect themselves against this defense mechanism, 
it presumably helps to hold many viruses in check. 

Primates have much higher levels of A-to-I editing than do other mammals, 
and most of this takes place on RNAs that are transcribed from the highly abun- 
dant Alu elements. It has been proposed that A-to-I editing has stopped these 
mobile elements from completely overtaking our genomes by inactivating the 
RNA transcripts they require to proliferate (see Figure 5-74). If this idea is cor- 
rect, RNA editing may have had a profound impact on the shaping of the mod- 
ern human genome. 



485 



RNA Transport from the Nucleus Can Be Regulated 

It has been estimated that in mammals only about one- twentieth of the total 
mass of RNA synthesized ever leaves the nucleus. We saw in Chapter 6 that most 
mammalian RNA molecules undergo extensive processing and that the "left- 
over" RNA fragments (excised introns and RNA sequences 3' to the cleavage/ 
poly-A site) are degraded in the nucleus. Incompletely processed and otherwise 
damaged RNAs are also eventually degraded as part of the quality control sys- 
tem of RNA production. 

As described in Chapter 6, the export of RNA molecules from the nucleus is 
delayed until processing has been completed. However, mechanisms that delib- 
erately override this control point can be used to regulate gene expression. This 
strategy forms the basis for one of the best-understood examples of regulated 
nuclear transport of mRNA, which occurs in the human AIDS virus, HIV. 

As we saw in Chapter 5, HR/; once inside the cell, directs the formation of a 
double-stranded DNA copy of its genome, which is then inserted into the genome 
of the host (see Figure 5-71). Once inserted, the viral DNA is transcribed as one 
long RNA molecule by the host cell's RNA polymerase II. This transcript is then 
spliced in many different ways to produce over 30 different species of mRNA, 
which in turn are translated into a variety of different proteins (Figure 7-102). In 
order to make progeny virus, entire, unspliced viral transcripts must be exported 
from the nucleus to the cytosol, where they are packaged into viral capsids and 
serve as the viral genome (see Figure 5-71). This large transcript, as well as alter- 
natively spliced HIV mRNAs that the virus needs to move to the cytoplasm for pro- 
tein synthesis, still carries complete introns. The host cell's normal block to the 
nuclear export of unspliced RNAs therefore presents a special problem for HIV 

The block is overcome in an ingenious way. The virus encodes a protein 
(called Rev) that binds to a specific RNA sequence (called the Rev responsive ele- 
ment, RRE) located within a viral intron. The Rev protein interacts with a nuclear 
export receptor (exportin 1), which directs the movement of viral RNAs through 
nuclear pores into the cytosol despite the presence of intron sequences. We dis- 
cuss in detail the way in which export receptors function in Chapter 12. 
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Figure 7-102 The compact genome of 
HIV, the human AIDS virus. The positions 
of the nine HIV genes are shown in green. 
The red double line indicates a DNA copy 
of the viral genome that has become 
integrated into the host DNA (gray). Note 
that the coding regions of many genes 
overlap, and that those of Tat and Rev are 
split by introns. The blue line at the 
bottom of the figure represents the pre- 
mRNA transcript of the viral DNA and 
shows the locations of all the possible 
splice sites (arrows). There are many 
alternative ways of splicing the viral 
transcript; for example the Env mRNAs 
retain the intron that has been spliced 
out of the Tat and Rev mRNAs. The Rev 
response element (RRE) is indicated by a 
blue ball and stick. It is a 234-nucleotide 
long stretch of RNA that folds into a 
defined structure; Rev recognizes a 
particular hairpin within this larger 
structure. 

The Gag gene codes for a protein that is 
cleaved into several smaller proteins that 
form the viral capsid. The Pol gene codes 
for a protein that is cleaved to produce 
reverse transcriptase (which transcribes 
RNA into DNA), as well as the integrase 
involved in integrating the viral genome 
(as double-stranded DNA) into the host 
genome. Pol is produced by ribosomal 
frameshifting of translation that begins at 
Gag (see Figure 6-78). The Env gene 
codes for the envelope proteins (see 
Figure 5-71 ). Tat, Rev, Vif, Vpr, Vpu, and 
Nef are small proteins with a variety of 
functions. For example, Rev regulates 
nuclear export (see Figure 7-1 03) and Tat 
regulates the elongation of transcription 
across the integrated viral genome 
(see p. 478). 
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Figure 7-103 Regulation of nuclear 
export by the HIV Rev protein. Early in 
HIV infection (A), only the fully spliced 
RNAs (which contain the coding 
sequences for Rev, Tat, and Nef) are 
Rev and other early exported from the nucleus and 

viral proteins translated. Once sufficient Rev protein 

synthesized has accumulated and been transported 

into the nucleus (B), unspliced viral RNAs 
can be exported from the nucleus. Many 
of these RNAs are translated into protein, 
and the full-length transcripts are 
packaged into new viral particles. 
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The regulation of nuclear export by Rev has several important consequences 
for HIV growth and pathogenesis. In addition to ensuring the nuclear export of 
specific unspliced RNAs, it divides the viral infection into an early phase (in 
which Rev is translated from a fully spliced RNA and all of intron-containing 
viral RNAs are retained in the nucleus and degraded) and a late phase (in which 
unspliced RNAs are exported due to Rev function). This timing helps the virus 
replicate by providing the gene products in roughly the order in which they are 
needed (Figure 7-103) . Regulation by Rev may also help the HIV virus to achieve 
latency, a condition in which the HIV genome has become integrated into the 
host cell genome but the production of viral proteins has temporarily ceased. If, 
after its initial entry into a host cell, conditions became unfavorable for viral 
transcription and replication, Rev is made at levels too low to promote export of 
unspliced RNA. This situation stalls the viral growth cycle until conditions 
improve, Rev levels increase, and the virus enters the replication cycle. 



Some mRNAs Are Localized to Specific Regions of the Cytoplasm 

Once a newly made eucaryotic mRNA molecule has passed through a nuclear 
pore and entered the cytosol, it is typically met by ribosomes, which translate it 
into a polypeptide chain (see Figure 6-40). Once the first round of translation 
"passes" the nonsense-mediated decay test (see Figure 6-80), the mRNA is usu- 
ally translated in earnest. If the mRNA encodes a protein that is destined to be 
secreted or expressed on the cell surface, a signal sequence at the protein's 
amino terminus will direct it to the endoplasmic reticulum (ER); components of 
the cell's protein- sorting apparatus recognize the signal sequence as soon as it 
emerges from the ribosome and direct the entire complex of ribosome, mRNA, 
and nascent protein to the membrane of the ER, where the remainder of the 
polypeptide chain is synthesized, as discussed in Chapter 12. In other cases free 
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ribosomes in the cytosol synthesize the entire protein, and signals in the com- 
pleted polypeptide chain may then direct the protein to other sites in the cell. 

Some mRNAs are themselves directed to specific intracellular locations 
before their efficient translation begins, allowing the cell to position its mRNAs 
close to the sites where the encoded protein is needed. This strategy provides the 
cell with many advantages. For example, it allows the establishment of asymme- 
tries in the cytosol of the cell, a key step in many stages of development. Local- 
ized mRNA, coupled with translation control, also allows the cell to regulate 
gene expression independently in its different parts. This feature is particularly 
important in large, highly polarized cells such as neurons, where growth cones 
must respond to signals without waiting to involve the distant nucleus. RNA 
localization has been observed in many organisms, including unicellular fungi, 
plants, and animals, and it is likely to be a common mechanism that cells use to 
concentrate high-level production of proteins at specific sites. 

Several distinct mechanisms for mRNA localization have been discovered 
(Figure 7-104), but all of them require specific signals in the mRNA itself. These 
signals are usually concentrated in the 3' untranslated region (UTR), the region 
of RNA that extends from the stop codon that terminates protein synthesis to 
the start of the poly-A tail (Figure 7-105). This mRNA localization is usually cou- 
pled with translational controls to ensure that the mRNA remains quiescent 
until it has been moved into place. 

The Drosophila egg exhibits an especially striking example of mRNA local- 
ization. The mRNA encoding the bicoid gene regulatory protein is localized by 
attachment to the cytoskeleton at the anterior tip of the developing egg. When 
fertilization triggers the translation of this mRNA, it generates a gradient of the 
bicoid protein that plays a crucial part in directing the development of the ante- 
rior part of the embryo (shown in Figure 7-53 and discussed in more detail in 
Chapter 22). Many mRNAs in somatic cells are also localized in a similar way. 
The mRNA that encodes actin, for example, is localized to the actin-filament- 
rich cell cortex in mammalian fibroblasts by means of a 3' UTR signal. 

We saw in Chapter 6 that mRNA molecules exit from the nucleus bearing 
numerous markings in the form of RNA modifications (the 5' cap and the 3' 
poly-A tail) and bound proteins (exon-junction complexes, for example) that 
signify the successful completion of the different pre-mRNA processing steps. 
As just described, the 3' UTR of an mRNA can be thought of as a "zip code" that 
directs mRNAs to different places in the cell. Below we will also see that mRNAs 
carry information specifying their average lifetime in the cytosol and the effi- 
ciency with which they are translated into protein. In a broad sense, the 



Figure 7-104 Three mechanisms for the 
localization of mRNAs. The mRNA to be 
localized leaves the nucleus through 
nuclear pores (top). Some localized mRNAs 
(left diagram) travel to their destination by 
associating with cytoskeletal motors. As 
described in Chapter 16, these motors use 
the energy of ATP hydrolysis to move 
unidirectionally along filaments in the 
cytoskeleton (red). At their destination, 
anchor proteins (black) hold the mRNAs in 
place. Other mRNAs randomly diffuse 
through the cytosol and are simply 
trapped and therefore concentrated at 
their sites of localization (center diagram). 
Some of these mRNAs (right diagram) are 
degraded in the cytosol unless they have 
bound, through random diffusion, a 
localized protein complex that anchors 
and protects the mRNA from degradation 
(black). Each of these mechanisms requires 
specific signals on the mRNA, which are 
typically located in the 3' UTR (see Figure 
7-1 05). In many cases of mRNA 
localization, additional mechanisms block 
the translation of the mRNA until it is 
properly localized. (Adapted from 
H.D. Lipshitz and C.A. Smibert, Curr. Opin. 
Genet. Dev. 1 0:476-488, 2000. With 
permission from Elsevier.) 
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untranslated regions of eucaryotic mRNAs resemble the transcriptional control 
regions of genes: their nucleotide sequences contain information specifying the 
way the RNA is to be used, and proteins interpret this information by binding 
specifically to these sequences. Thus, over and above the specification of the 
amino acid sequences of proteins, mRNA molecules are rich with many addi- 
tional types of information. 

The 5' and 3' Untranslated Regions of mRNAs Control Their 
Translation 

Once an mRNA has been synthesized, one of the most common ways of regulat- 
ing the levels of its protein product is to control the step that initiates transla- 
tion. Even though the details of translation initiation differ between eucaryotes 
and bacteria (as we saw in Chapter 6), they each use some of the same basic reg- 
ulatory strategies. 

In bacterial mRNAs, a conserved stretch of six nucleotides, the Shine-Dal- 
garno sequence, is always found a few nucleotides upstream of the initiating 
AUG codon. This sequence forms base pairs with the 16S RNA in the small ribo- 
somal subunit, correctly positioning the initiating AUG codon in the ribosome. 
Because this interaction strongly contributes to the efficiency of initiation, it 
provides the bacterial cell with a simple way to regulate protein synthesis 
through translational control mechanisms. These mechanisms, carried out by 
proteins or by RNA molecules, generally involve either exposing or blocking the 
Shine-Dalgarno sequence (Figure 7-106). 

Eucaryotic mRNAs do not contain a Shine-Dalgarno sequence. Instead, as 
discussed in Chapter 6, the selection of an AUG codon as a translation start site 
is largely determined by its proximity to the cap at the 5' end of the mRNA 
molecule, which is the site at which the small ribosomal subunit binds to the 
mRNA and begins scanning for an initiating AUG codon. Despite the differences 
in translation initiation, eucaryotes use similar strategies to regulate translation. 
For example, translational repressors bind to the 5' end of the mRNA and 
thereby inhibit translation initiation. Other repressors recognize nucleotide 
sequences in the 3' UTR of specific mRNAs and decrease translation initiation by 
interfering with the communication between the 5' cap and 3' poly-A tail, a step 
required for efficient translation (see Figure 6-72). A particularly important type 
of translational control in eucaryotes relies on small RNAs (termed microRNAs 
or miRNAs) that bind to mRNAs and reduce protein output. The miRNAs are 
synthesized and processed in a specialized way, and we shall return to them 
later in the chapter. 



The Phosphorylation of an Initiation Factor Regulates Protein 
Synthesis Globally 

Eucaryotic cells decrease their overall rate of protein synthesis in response to a 
variety of stressful situations, including deprivation of growth factors or nutri- 
ents, infection by viruses, and sudden increases in temperature. Much of this 
decrease is caused by the phosphorylation of the translation initiation factor 
eIF2 by specific protein kinases that respond to the changes in conditions. 

The normal function of eIF2 was outlined in Chapter 6. It forms a complex 
with GTP and mediates the binding of the methionyl initiator tRNA to the small 
ribosomal subunit, which then binds to the 5' end of the mRNA and begins scan- 
ning along the mRNA. When an AUG codon is recognized, the eIF2 protein 
hydrolyzes the bound GTP to GDP, causing a conformational change in the pro- 
tein and releasing it from the small ribosomal subunit. The large ribosomal sub- 
unit then joins the small one to form a complete ribosome that begins protein 
synthesis (see Figure 6-71). 

Because eIF2 binds very tightly to GDR a guanine nucleotide exchange fac- 
tor (see Figure 3-73), designated eIF2B, is required to cause GDP release so that 
a new GTP molecule can bind and eIF2 can be reused (Figure 7-107A). The 
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Figure 7-105 An experiment 
demonstrating the importance of the 
3' UTR in localizing mRNAs to specific 
regions of the cytoplasm. For this 
experiment, two different fluorescently 
labeled RNAs were prepared by 
transcribing DNA in vitro in the presence 
of fluorescently labeled derivatives of 
UTR One RNA (labeled with a red 
fluorochrome) contains the coding region 
for the Drosophila hairy protein and 
includes the adjacent 3' UTR (see Figure 
6-22). The other RNA (labeled green) 
contains the hairy coding region with the 
3' UTR deleted. The two RNAs were mixed 
and injected into a Drosophila embryo at 
a stage of development when multiple 
nuclei reside in a common cytoplasm 
(see Figure 7-53). When the fluorescent 
RNAs were visualized 1 minutes later, 
the full-length hairy RNA (red) was 
localized to the apical side of nuclei (blue) 
but the transcript missing the 3' UTR 
(green) failed to localize. Hairy is one of 
many gene regulatory proteins that 
specifies positional information in the 
developing Drosophila embryo 
(discussed in Chapter 22). The localization 
of its mRNA (shown in this experiment to 
depend on its 3' UTR) is thought to be 
critical for proper fly development. 
(Courtesy of Simon Bullock and 
David Ish-Horowicz.) 
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reuse of eIF2 is inhibited when it is phosphorylated — the phosphorylated eIF2 
binds to eIF2B unusually tightly, inactivating eIF2B. There is more eIF2 than 
eIF2B in cells, and even a fraction of phosphorylated eIF2 can trap nearly all of 
the eIF2B. This prevents the reuse of the nonphosphorylated eIF2 and greatly 
slows protein synthesis (Figure 7-1 07B). 

Regulation of the level of active eIF2 is especially important in mammalian 
cells; eIF2 is part of the mechanism that allows cells to enter a nonproliferating, 
resting state (called Go) — in which the rate of total protein synthesis is reduced 
to about one-fifth the rate in proliferating cells (discussed in Chapter 17). 



Initiation at AUG Codons Upstream of the Translation Start Can 
Regulate Eucaryotic Translation Initiation 

We saw in Chapter 6 that eucaryotic translation typically begins at the first AUG 
downstream of the 5' end of the mRNA, which is the first AUG encountered by a 
scanning small ribosomal subunit. But the nucleotides immediately surrounding 
the AUG also influence the efficiency of translation initiation. If the recognition 
site is poor enough, scanning ribosomal subunits will sometimes ignore the first 
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Figure 7-106 Mechanisms of translational control. Although these examples are from bacteria, many of the same 
principles operate in eucaryotes. (A) Sequence-specific RNA binding proteins repress translation of specific mRNAs by 
blocking access of the ribosome to the Shine-Dalgarno sequence {orange). For example, some ribosomal proteins repress 
translation of their own RNA. This mechanism comes into play only when the ribosomal proteins are produced in excess 
over ribosomal RNA and are therefore not incorporated into ribosomes, and it allows the cell to maintain correctly 
balanced quantities of the various components needed to form ribosomes. In these cases, the regulatory RNA sequence 
present on the mRNA often matches the RNA sequence that the protein recognizes during ribosome assembly. (B) An RNA 
"thermosensor" permits efficient translation initiation only at elevated temperatures in which the stem-loop structure has 
been melted. An example occurs in the human pathogen Listeria monocytogenes, in which the translation of its virulence 
genes increases at 37°C, the temperature of the host. (C) Binding of a small molecule to a riboswitch causes a structural 
rearrangement of the RNA, sequestering the Shine-Dalgarno sequence (orange) and blocking translation initiation. In 
many bacteria, S-adenosyl methionine acts in this manner to block production of the enzymes that synthesize it. (D) An 
"antisense" RNA produced elsewhere from the genome base-pairs with a specific mRNA, and blocks its translation. Many 
bacteria regulate expression of iron-storage proteins in this way. When iron is abundant, an antisense transcript is down- 
regulated, thereby allowing efficient translation of genes encoding the storage proteins. Antisense RNAs are used 
extensively by eucaryotic cells to regulate gene expression. The mechanism is somewhat different from that shown here 
and is discussed in detail later in this chapter. 
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AUG codon in the mRNA and skip to the second or third AUG codon instead. 
This phenomenon, known as "leaky scanning," is a strategy frequently used to 
produce two or more closely related proteins, differing only in their amino ter- 
mini, from the same mRNA. Very importantly it allows some genes to produce 
the same protein with and without a signal sequence attached at its amino ter- 
minus so that the protein is directed to two different locations in the cell (for 
example, to both mitochondria and the cytosol). In some cases, the cell can reg- 
ulate the relative abundance of the protein isoforms produced by leaky scan- 
ning; for example, a cell-type-specific increase in the abundance of the initiation 
factor eIF4F favors the use of the AUG closest to the 5' end of the mRNA. 

Another type of control found in eucaryotes uses one or more short open 
reading frames (nucleotide sequences free from stop codons) that lie between 
the 5' end of the mRNA and the beginning of the gene. Open reading frames 
(ORFs) will be discussed more fully in Chapter 8: for present purposes an ORF 
can be considered a stretch of DNA that begins with a start codon (ATG) and 
ends with a stop codon, with no stop codons in between, and thus could in prin- 
ciple encode a polypeptide. Often, the amino acid sequences coded by these 
upstream open reading frames (uORFs) are not important; rather the uORFs 
serve a purely regulatory function. An uORF present on an mRNA molecule will 
generally decrease translation of the downstream gene by trapping a scanning 
ribosome initiation complex and causing the ribosome to translate the uORF 
and dissociate from the mRNA before it reaches the protein-coding sequences. 

When the activity of a general translation factor (such as the eIF2 discussed 
above) is reduced, one might expect that the translation of all mRNAs would be 
reduced equally Contrary to this expectation, however, the phosphorylation of 
eIF2 can have selective effects, even enhancing the translation of specific mRNAs 
that contain uORFs. This can enable yeast cells, for example, to adapt to starvation 
for specific nutrients by shutting down the synthesis of all proteins except those 
that are required for synthesis of the nutrients that are missing. The details of this 
mechanism have been worked out for a specific yeast mRNA that encodes a pro- 
tein called Gcn4, a gene regulatory protein that is required for the activation of 
many genes that encode proteins that are important for amino acid synthesis. 

The Gcn4 mRNA contains four short uORFs, and these are responsible for 
selectively increasing the translation of Gcn4 in response to the eIF2 phospho- 
rylation provoked by amino acid starvation. The mechanism by which Gcn4 
translation is increased is complex. In outline, the small subunit of the ribosome 
moves along the mRNA, encountering each of the uORFs but directing transla- 
tion of only a subset of them; if the last uORF is translated, as is the case in nor- 
mal unstarved cells, the ribosomes dissociate at the end of the uORF, and trans- 
lation of Gcn4 is inefficient. The global decrease in eIF2 activity brought about 
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by nutrient starvation (see Figure 7-108) makes it more likely that a scanning 
small ribosomal subunit will move through the fourth uORF before it acquires a 
molecule of eIF2 (see Figure 6-72). Such a ribosomal subunit is free to initiate 
translation on the actual Gcn4 sequences, and the increased amount of this gene 
regulatory protein that results leads to the production of a set of proteins that 
increase amino acid synthesis inside the cell. 



Internal Ribosome Entry Sites Provide Opportunities for 
Translation Control 

Although approximately 90% of eucaryotic mRNAs are translated beginning 
with the first AUG downstream from the 5' cap, certain AUGs, as we saw in the 
previous section, can be skipped over during the scanning process. In this sec- 
tion, we discuss yet another way that cells can initiate translation at positions 
distant from the 5' end of the mRNA, using a specialized type of RNA sequence 
called an internal ribosome entry site (IRES). An IRES can occur in many dif- 
ferent places in an mRNA and, in some unusual cases, two distinct protein-cod- 
ing sequences are carried in tandem on the same eucaryotic mRNA; translation 
of the first occurs by the usual scanning mechanism, and translation of the sec- 
ond occurs through an IRES. IRESs are typically several hundred nucleotides in 
length and fold into specific structures that bind many, but not all, of the same 
proteins that are used to initiate normal 5' cap-dependent translation (Figure 
7-108). In fact, different IRESs require different subsets of initiation factors. 
However, all of them bypass the need for a 5' cap structure and the translation 
initiation factor that recognizes it, eIF4E. 

Some viruses use IRESs as part of a strategy to get their own mRNA molecules 
translated while blocking normal 5'-cap-dependent translation of host mRNAs. 
On infection, these viruses produce a protease (encoded in the viral genome) 
that cleaves the host cell translation factor eIF4G, rendering it unable to bind to 
eIF4E, the cap-binding complex. This shuts down most of the host cell's transla- 
tion and effectively diverts the translation machinery to the IRES sequences, 
which are present on many viral mRNAs. The truncated eIF4G remains compe- 
tent to initiate translation at these internal sites and may even stimulate the 
translation of certain IRES-containing viral mRNAs. A selective activation of 
IRES-mediated translation can also occur on host cell mRNAs. For example, 
when mammalian cells enter the programmed cell death pathway (discussed in 
Chapter 18), eIF4G is cleaved, and a general decrease in translation ensues. How- 
ever, some proteins critical for the control of cell death seem to be translated 
from IRES-containing mRNAs, allowing their continued synthesis. In this way, 
the IRES mechanism allows translation of selected mRNAs at a high rate despite 
a general decrease in the cell's overall capacity to initiate protein synthesis. 
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Figure 7-108 Two mechanisms of 
translation initiation. (A) The normal, cap- 
dependent mechanism requires a set of 
initiation factors whose assembly on the 
mRNA is stimulated by the presence of a 
5' cap and a poly-A tail {see also Figure 
6-72). (B)The IRES-dependent mechanism 
seen mainly in viruses, requires only a 
subset of the normal translation initiating 
factors, and these assemble directly on 
the folded IRES. (Adapted from A. Sachs, 
Cell 101:243-245, 2000. With permission 
from Elsevier.) 
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Changes in mRNA Stability Can Regulate Gene Expression 

Most mRNAs in a bacterial cell are very unstable, having half- lives of less than a 
couple of minutes. Exonucleases, which degrade in the 3'-to-5' direction, are 
usually responsible for the rapid destruction of these mRNAs. Because its 
mRNAs are both rapidly synthesized and rapidly degraded, a bacterium can 
adapt quickly to environmental changes. 

As a general rule, the mRNAs in eucaryotic cells are more stable. Some, such 
as that encoding (3-globin, have half-lives of more than 10 hours, but most have 
considerably shorter half-lives, typically less than 30 minutes. The mRNAs that 
code for proteins such as growth factors and gene regulatory proteins, whose 
production rates need to change rapidly in cells, have especially short half- lives. 

Two general mechanisms exist for destroying eucaryotic mRNAs. Both begin 
with the gradual shortening of the poly-A tail by an exonuclease, a process that 
starts as soon as the mRNA reaches the cytoplasm. In a broad sense, this poly-A 
shortening acts as a timer that counts down the lifetime of each mRNA. Once the 
poly-A tail is reduced to a critical length (about 25 nucleotides in humans), the 
two pathways diverge. In one, the 5' cap is removed (a process called decapping) 
and the "exposed" mRNA is rapidly degraded from its 5' end. In the other, the 
mRNA continues to be degraded from the 3' end, through the poly-A tail into the 
coding sequences (Figure 7-109). Most eucaryotic mRNAs are degraded by both 
mechanisms. 

Nearly all mRNAs are subject to these two types of decay, and the specific 
sequences of each mRNA determine how fast each step occurs and therefore 
how long each mRNA will persist in the cell and be able to produce protein. The 
3' UTR sequences are especially important in controlling mRNA lifetimes, and 
they often carry binding sites for specific proteins that increase or decrease the 
rate of poly-A shortening, decapping, or 3'-to-5' degradation. The half-life of an 
mRNA is also affected by how efficiently it is translated. Poly-A shortening and 
decapping compete directly with the machinery that translates the mRNA; 
therefore any factors that affect the translation efficiency of an mRNA will tend 
to have the opposite effect on its degradation (Figure 7-110). 

Although poly-A shortening controls the half- life of most eucaryotic mRNAs, 
some can be degraded by a specialized mechanism that bypasses this step alto- 
gether. In these cases, specific nucleases cleave the mRNA internally, effectively 
decapping one end and removing the poly-A tail from the other so that both 
halves are rapidly degraded. The mRNAs that are destroyed in this way carry spe- 
cific nucleotide sequences, often in the 3' UTRs, that serve as recognition 
sequences for these endonucleases. This strategy makes it especially simple to 
tightly regulate the stability of these mRNAs by blocking the endonuclease site 
in response to extracellular signals. For example, the addition of iron to cells 
decreases the stability of the mRNA that encodes the receptor protein that binds 
the iron-transporting protein transferrin, causing less of this receptor to be 
made. This effect is mediated by the iron-sensitive RNA-binding protein aconi- 
tase (which also controls ferritin mRNA translation). Aconitase can bind to the 
3' UTR of the transferrin receptor mRNA and increase receptor production by 
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Figure 7-1 09 Two mechanisms of 
eucaryotic mRNA decay. A critical 
threshold of poly-A tail length induces 
3'-to-5' degradation, which may be 
triggered by the loss of the poly-A binding 
proteins (see Figure 6-40). As shown in 
Figure 7-1 1 0, the deadenylase associates 
with both the 3' poly-A tail and the 5' cap, 
and this arrangement may signal decapping 
after poly-A shortening. Although 5' to 3' 
and 3' to 5' degradation are shown here on 
separate RNA molecules, these two 
processes can occur together on the same 
molecule. (Adapted from C.A. Beelman and 
R. Parker, Cell 81 :1 79-1 83, 1 995. With 
permission from Elsevier.) 
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Figure 7-1 10 The competition between 
mRNA translation and mRNA decay. The 

same two features of an mRNA molecule, 
its 5' cap and the 3' poly-A tail, are used 
in both translation initiation and 
deadenylation-dependent mRNA decay 
(see Figure 7-1 09). The deadenylase that 
shortens the poly-A tail in the 3'-to-5' 
direction associates with the 5' cap. As 
described in Chapter 6 (see Figure 6-72), 
the translation initiation machinery also 
associates with both the 5' cap and the 
poly-A tail. (Adapted from M. Gao et al., 
Mol. Cell 5:479-488, 2000. With 
permission from Elsevier.) 
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blocking endonucleolytic cleavage of the mRNA. On the addition of iron, aconi- 
tase is released from the mRNA, exposing the cleavage site and thereby decreas- 
ing the stability of the mRNA (Figure 7-111). 



Cytoplasmic Poly-A Addition Can Regulate Translation 

The initial polyadenylation of an RNA molecule (discussed in Chapter 6) occurs 
in the nucleus, apparently automatically for nearly all eucaryotic mRNA precur- 
sors. As we have just seen, the poly-A tails on most mRNAs gradually shorten in 
the cytosol, and the RNAs are eventually degraded. In some cases, however, the 
poly-A tails of specific mRNAs are lengthened in the cytosol, and this mecha- 
nism provides an additional form of translation regulation. 

Maturing oocytes and eggs provide the most striking example. Many of the 
normal mRNA degradation pathways seem to be disabled in these giant cells, so 
that the cells can build up large stores of mRNAs in preparation for fertilization. 
Many mRNAs are stored in the cytoplasm with only 10 to 30 As at their 3' end, 
and in this form they are not translated. At specific times during oocyte matura- 
tion and just after fertilization, when the cell requires the proteins encoded by 
these mRNAs, poly-A is added to selected mRNAs by a cytosolic poly-A poly- 
merase, greatly stimulating their translation. 



Figure 7-1 1 1 Two post-translational 
controls mediated by iron. 

(A) During iron starvation, the 
binding of aconitase to the 5' UTR of 
the ferritin mRNA blocks translation 
initiation; its binding to the 3' UTR of 
the transferrin receptor mRNA blocks 
an endonuclease cleavage site and 
thereby stabilizes the mRNA. (B) In 
response to an increase in iron 
concentration in the cytosol, a cell 
increases its synthesis of ferritin in 
order to bind the extra iron and 
decreases its synthesis of transferrin 
receptors in order to import less iron 
across the plasma membrane. Both 
responses are mediated by the same 
iron-responsive regulatory protein, 
aconitase, which recognizes common 
features in a stem-and-loop structure 
in the mRNAs encoding ferritin and 
transferrin receptor. Aconitase 
dissociates from the mRNA when it 
binds iron. But because the 
transferrin receptor and ferritin are 
regulated by different types of 
mechanisms, their levels respond 
oppositely to iron concentrations 
even though they are regulated by 
the same iron-responsive regulatory 
protein. (Adapted from M.W. Hentze 
et al., Science 238:1 570-1 573, 1 987 
and J.L. Casey et al., Science 
240:924-928, 1988. With permission 
from AAAS.) 



Small Noncoding RNA Transcripts Regulate Many Animal and 
Plant Genes 

In the previous chapter, we introduced the central dogma, according to which 
the flow of genetic information proceeds from DNA through RNA to protein 
(Figure 6-2). But we have seen that RNA molecules perform many critical tasks 
in the cell besides serving as intermediate carriers of genetic information. A 
series of recent, striking discoveries has revealed that noncoding RNAs are far 
more prevalent than previously imagined and play previously unanticipated, 
but widespread, roles in regulating gene expression. 

Of special importance to animals and plants is a type of short noncoding RNA 
called microRNA (miRNA). Humans, for example, express more than 400 different 
miRNAs, and these appear to regulate at least one-third of all human protein-cod- 
ing genes. Once made, miRNAs base-pair with specific mRNAs and regulate their 
stability and their translation. The miRNA precursors are synthesized by RNA 
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polymerase II and are capped and polyadenylated. They then undergo a special 
type of processing, after which the miRNA is assembled with a set of proteins to 
form an RNA-induced silencing complex or RISC Once formed, the RISC seeks 
out its target mRNAs by searching for complementary nucleotide sequences 
(Figure 7-112). This search is greatly facilitated by the Argonaute protein, a 
component of RISC, which displays the 5' region of the miRNA so that it is opti- 
mally positioned for base-pairing to another RNA molecule (Figure 7-113). In 
animals, the extent of base-pairing is typically seven nucleotide pairs, and it 
usually takes place in the 3' UTR of the target mRNA. 

Once an mRNA has been bound by an miRNA, several outcomes are possi- 
ble. If the base-pairing is extensive, the mRNA is cleaved by the Argonaute pro- 
tein, effectively removing its poly-A tail and exposing it to exonucleases (see Fig- 
ure 7-109). Following cleavage of the mRNA, RISC (with its associated miRNA) is 
released, and it can seek out additional mRNAs. Thus, a single miRNA can act 
catalytically to destroy many complementary mRNAs. The miRNAs can be 
thought of as guide sequences that bring destructive nucleases into contact with 
specific mRNAs. 

If the base-pairing between the miRNA and the mRNA is less extensive, Arg- 
onaute does not slice the mRNA; rather, translation of the mRNA is repressed 
and the mRNA is destabilized. This effect is associated with shortening of the 
poly-A tail and the movement of the mRNA to cytosolic structures called pro- 
cessing bodies (P-bodies). Here the mRNAs are sequestered from ribosomes and 
eventually decapped and degraded. P-bodies are dynamic structures composed 
of large assemblies of mRNAs and RNA-degrading enzymes, and they are 
believed to be the sites in the cell where the final destruction of most mRNAs, 
even those not controlled by miRNAs, takes place (Figure 7-114). 

Several features make miRNAs especially useful regulators of gene expres- 
sion. First, a single miRNA can regulate a whole set of different mRNAs so long as 
the mRNAs carry a common sequence in their UTRs. This situation is common in 
humans, where some miRNAs control hundreds of different mRNAs. Second, reg- 
ulation by miRNAs can be combinatorial. When the base-pairing between the 
miRNA and mRNA fails to trigger cleavage, additional miRNAs binding to the 



Figure 7-112 miRNA processing and 
mechanism of action. The precursor 
miRNA, through complementarity 
between one part of its sequence and 
another, forms a double-stranded 
structure. This is cropped while still in the 
nucleus, and then exported to the cytosol, 
where it is further cleaved by the Dicer 
enzyme to form the miRNA proper. 
Argonaute, in conjunction with other 
components of RISC, initially associates 
with both strands of the miRNA and 
cleaves and discards one of them. The 
other strand guides RISC to specific 
mRNAs through base pairing. If the 
RNA:RNA match is extensive, as is 
commonly seen in plants, Argonaute 
cleaves the target mRNA, causing its rapid 
degradation. In animals, the miRNA-mRNA 
match often does not extend beyond a 
short 7-nucleotide "seed" region near the 
5' end of the miRNA. This less extensive 
base pairing leads to inhibition of 
translation, mRNA destabilization, and 
transfer of the mRNA to P-bodies, where it 
is eventually degraded. 




Figure 7-1 1 3 Structure of Argonaute 
protein bound to a perfectly base- 
paired miRNA and mRNA. (Adapted from 
N.H. Tolia and L. Joshua-Tor, Nat. Chem, 
Biol 3:36-43, 2007. With permission from 
Macmillan Publishers Ltd.) 
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same mRNA lead to further reductions in its translation. As discussed earlier in 
this chapter for gene regulatory proteins, combinatorial control greatly expands 
the possibilities available to the cell by linking gene expression to a combination 
of different regulators rather than a single regulator. Third, an miRNA occupies 
relatively little space in the genome when compared with a protein. Indeed, their 
small size is one reason that miRNAs were discovered only recently. Although we 
are only beginning to understand the full impact of miRNAs, it is clear that they 
represent a very important part of the cell's equipment for regulating the expres- 
sion of its genes. 
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Figure 7-1 14 Visualization of P-bodies. 

Human cells were stained with antibodies 
to a component of the mRNA decapping 
enzyme Dcpla (left panels) and to the 
Argonaute protein (middle panels). The 
merged image (right panels) shows that 
the two proteins co-localize to foci in the 
cytoplasm called P-bodies. (Adapted from 
J. Liu et al., Nat. Cell Biol. 7:643-644, 2005. 
With permission from Macmillan 
Publishers Ltd.) 



RNA Interference Is a Cell Defense Mechanism 

Many of the proteins that participate in the miRNA regulatory mechanisms just 
described also serve a second function as a defense mechanism: they orches- 
trate degradation of foreign RNA molecules, specifically those that occur in dou- 
ble-stranded form. Termed RNA interference (RNAi), this mechanism is found 
in a wide variety of organisms, including single-celled fungi, plants, and worms, 
suggesting that it is evolutionary ancient. Many transposable elements and 
viruses produce double-stranded RNA, at least transiently, in their life cycles, 
and RNAi helps to keep these potentially dangerous invaders in check. As we 
shall see, RNAi has also provided scientists with a powerful experimental tech- 
nique to turn off the expression of individual genes. 

The presence of double-stranded RNA in the cell triggers RNAi by attracting 
a protein complex containing Dicer, the same nuclease that processes miRNAs 
(see Figure 7-112). This protein complex cleaves the double-stranded RNA into 
small (approximately 23- nucleotide-pair) fragments called small interfering 
RNAs (siRNA). These double-stranded siRNAs are then bound by Argonaute and 
other components of the RISC, as we saw above for miRNAs, and one strand of 
the duplex RNA is cleaved by Argonaute and discarded. The single -stranded 
siRNA molecule that remains directs RISC back to complementary RNA 
molecules produced by the virus or transposable element; because the match is 
exact, Argonaute cleaves these molecules, leading to their rapid destruction 
(Figure 7-115). 

Each time RISC cleaves a new RNA molecule, it is released; thus as we saw 
for miRNAs, a single RNA molecule can act catalytically to destroy many com- 
plementary RNAs. Some organisms employ an additional mechanism that 
amplifies the RNAi response even further. In these organisms, RNA-dependent 
RNA polymerases can convert the products of siRNA-mediated cleavage into 
more double-stranded RNA. This amplification ensures that, once initiated, 
RNA interference can continue even after all the initiating double- stranded RNA 
has been degraded or diluted out. For example, it permits progeny cells to con- 
tinue carrying out RNA interference that was provoked in the parent cells. 

In some organisms, the RNA interference activity can be spread by the 
transfer of RNA fragments from cell to cell. This is particularly important in 
plants (whose cells are linked by fine connecting channels, as discussed in 
Chapter 19), because it allows an entire plant to become resistant to an RNA 
virus after only a few of its cells have been infected. In a broad sense, the RNAi 
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response resembles certain aspects of animal immune systems; in both, an 
invading organism elicits a customized response, and — through amplification of 
the "attack" molecules — the host becomes systemically protected. 



Figure 7-1 1 5 siRNA-mediated 
heterochromatin formation. In many 
organisms, double-stranded RNA can 
trigger both the destruction of 
complementary mRNAs (left) and 
transcriptional silencing (right). The 
change in chromatin structure induced 
by the bound RITS (RNA-induced 
transcriptional silencing) complex 
resembles that in Figure 7-81 . 



RNA Interference Can Direct Heterochromatin Formation 

The RNA interference pathway just described does not necessarily stop with the 
destruction of target RNA molecules. In some cases, the RNA interference 
machinery can selectively shut off synthesis of the target RNAs. For this remark- 
able mechanism to occur, the short siRNAs produced by the Dicer protein are 
assembled with a group of proteins (including Argonaute) to form the RITS 
(RNA-induced transcriptional silencing) complex. Using single-stranded siRNA 
as a guide sequence, this complex binds complementary RNA transcripts as they 
emerge from a transcribing RNA polymerase II (see Figure 7-115). Positioned on 
the genome in this manner, the RITS complex attracts proteins that covalently 
modify nearby histones and eventually directs the formation and spread of het- 
erochromatin to prevent further transcription initiation. In some cases, the RITS 
complex also induces the methylation of DNA, which, as we have seen, can 
repress gene expression even further. Because heterochromatin and DNA 
methylation can be self-propagating, an initial RNA interference signal can con- 
tinue to silence gene expression long after all the siRNA molecules have dissi- 
pated. 

RNAi-directed heterochromatin formation is an important cell defense 
mechanism that limits the accumulation of transposable elements in the 
genome by maintaining them in a transcriptionally silent form. However, this 
same mechanism is also used in many normal processes in the cell. For exam- 
ple, in many organisms, the RNA interference machinery maintains the hete- 
rochromatin formed around centromeres. Centromeric DNA sequences are 
transcribed in both directions, producing complementary RNA transcripts that 
can base-pair to form double-stranded RNA. This double-stranded RNA triggers 
the RNA interference pathway and stimulates heterochromatin formation at 
centromeres. This heterochromatin, in turn, is necessary for the centromeres to 
segregate chromosomes accurately during mitosis (see Figure 4-50). 
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RNA Interference Has Become a Powerful Experimental Tool 

Although it probably arose initially as a defense mechanism, RNA interference 
has become thoroughly integrated into many aspects of normal cell biology, 
ranging from the control of gene expression to the structure of chromosomes. It 
has also been developed by scientists into a powerful experimental tool that 
allows almost any gene to be inactivated by evoking an RNAi response to it. This 
technique, carried out in cultured cells, and in some cases, whole animals and 
plants, has revolutionized genetic approaches in cell and molecular biology. We 
shall discuss it in more detail in the following chapter (see pp. 571-572). RNAi 
also has great potential in treating human disease. Since many human disorders 
result from the misexpression of genes, the ability to turn these genes off by 
experimentally introducing complementary siRNA molecules holds great medi- 
cal promise. Remarkably, the mechanism of RNA interference was discovered 
only recently, and we are still being surprised by its mechanistic details and by 
its broad biological implications. 



Summary 

Many steps in the pathway from RNA to protein are regulated by cells in order to con- 
trol gene expression. Most genes are regulated at multiple levels, in addition to being 
controlled at the initiation stage of transcription. The regulatory mechanisms include 
(1) attenuation of the RNA transcript by its premature termination, (2) alternative RNA 
splice-site selection, (3) control of '3> -end formation by cleavage and poly-A addition, (4) 
RNA editing, (5) control of transport from the nucleus to the cytosol, (6) localization of 
mRNAs to particular parts of the cell, (7) control of translation initiation, and (8) regu- 
lated mRNA degradation. Most of these control processes require the recognition of spe- 
cific sequences or structures in the RNA molecule being regulated, a task performed by 
either regulatory proteins or regulatory RNA molecules. A particularly widespread form 
of post-transcriptional control is RNA interference, where guide RNAs base-pair with 
mRNAs. RNA interference can cause mRNAs to be either destroyed or translationally 
repressed. It can also cause specific genes to be packaged into heterochromatin. 
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PROBLEMS 



Which statements are true? Explain why or why not. 

7-1 In terms of its biochemical function, the helix-loop- 
helix motif is more closely related to the leucine zipper motif 
than it is to the helix-turn-helix motif. 

7-2 Reversible genetic rearrangements are a common 
way of regulating gene expression in procaryotes and mam- 
malian cells. 

7-3 CG islands are thought to have arisen during evolution 
because they were associated with portions of the genome 
that remained active, hence unmethylated, in the germline. 

Discuss the following problems. 

7-4 A small portion of a two-dimensional display of pro- 
teins from human brain is shown in Figure Q7-1. These pro- 
teins were separated on the basis of size in one dimension 
and electrical charge (isoelectric point) in the other. Not all 
protein spots on such displays are products of different 
genes; some represent modified forms of a protein that 
migrate to different positions. Pick out a couple of sets of 
spots that could represent proteins that differ by the num- 
ber of phosphates they carry. Explain the basis for your 
selection. 




acidic basic 

Figure Q7-1 Two-dimensional separation of proteins from the 
human brain (Problem 7-4). The proteins were displayed using two- 
dimensional gel electrophoresis. Only a small portion of the protein 
spectrum is shown. (Courtesy of Tim Myers and Leigh Anderson, 
Large Scale Biology Corporation.) 



7-5 DNA microarray analysis of the patterns of mRNA 
abundance in different human cell types shows that the level 
of expression of almost every active gene is different. The pat- 
terns of mRNA abundance are so characteristic of cell type 
that they can be used to determine the tissue of origin of can- 
cer cells, even though the cells may have metastasized to dif- 
ferent parts of the body By definition, however, cancer cells 
are different from their noncancerous precursor cells. How do 
you suppose then that patterns of mRNA expression might be 
used to determine the tissue source of a human cancer? 
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7-6 The nucleus of a eucaryotic cell is much larger than a 
bacterium, and it contains much more DNA. As a conse- 
quence, a DNA-binding protein in a eucaryotic cell must be 
able to select its specific binding site from among many 
more unrelated sequences than does a DNA-binding pro- 
tein in a bacterium. Does this present any special problems 
for eucaryotic gene regulation? 

Consider the following situation. Assume that the eucary- 
otic nucleus and the bacterial cell each have a single copy of 
the same DNA-binding site. In addition, assume that the 
nucleus is 500 times the volume of the bacterium, and has 
500 times as much DNA. If concentration of the gene regu- 
latory protein that binds the site were the same in the 
nucleus and in the bacterium, would the regulatory protein 
find its binding site equally as well in the eucaryotic nucleus 
as it does in the bacterium? Explain your answer. 

7-7 DNA-binding proteins often find their specific sites 
much faster than would be anticipated by simple three- 
dimensional diffusion. The Lac repressor, for example, asso- 
ciates with the Lac operator — its DNA-binding site — more 
than 100 times faster than expected from this model. 
Clearly the repressor must find the operator by mechanisms 
that reduce the dimensionality or volume of the search in 
order to hasten target acquisition. 

Several techniques have been used to investigate this 
problem. One of the most elegant used strongly fluorescent 
RNA polymerase molecules that could be followed individu- 
ally An array of DNA molecules was aligned in parallel and 
anchored to a glass slide. Fluorescent RNA polymerase 
molecules were then allowed to flow across them at an 
oblique angle (Figure Q7-2A). Traces of individual RNA 
polymerases showed that about half flowed in the same 
direction as the bulk and about half deviated from the bulk 
flow in a characteristic manner (Figure Q7-2B). If the RNA 
polymerase molecules were first incubated with short DNA 
fragments containing a strong promoter, all the traces fol- 
lowed the bulk flow. 

A. Offer an explanation for why some RNA polymerase 
molecules deviated from the bulk flow as shown in Figure 
Q7-2B. Why did incubation with short DNA fragments con- 
taining a strong promoter eliminate traces that deviated 
from the bulk flow? 

B. Do these results suggest an explanation for how site- 
specific DNA-binding molecules manage to find their sites 
faster than expected by diffusion? 



(A) EXPERIMENTAL SET-UP 

bulk flow of 

RNA polymerase 

molecules 



(B) SINGLE RNA POLYMERASES 





aligned DNA molecules 
Figure Q7-2 Interactions of individual RNA polymerase molecules 
with DNA (Problem 7-7). (A) Experimental set-up. DNA molecules are 
aligned and anchored to a glass slide, and highly fluorescent RNA 
polymerase molecules are allowed to flow across them. (B) Traces of 
two individual RNA polymerase molecules. The one on the left has 
traveled with the bulk flow, and the one on the right has deviated 
from it. The scale bar is 10 um. (B, reprinted from H. Kabata et al., 
Science 262:1561-1563, 1993. With permission from AAAS. 
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liver Figure Q7-3 Location of the 

sequence differences in 
cDNA clones from ApoB 
RNA isolated from liver and 
intestine (Problem 7-9). The 
encoded amino acid 
sequences, in the one-letter 
code, are shown aligned 
with the cDNA sequences. 
C. Based on your explanation, would you expect a site-spe- 
cific DNA-binding molecule to find its target site faster in a 
population of short DNA molecules or in a population of long 
DNA molecules? Assume that the concentration of target sites 
is identical and that there is one target site per DNA molecule. 

7-8 Most people who are completely blind have circa- 
dian rhythms that are 'free-running;' that is, their rhythms 
are not synchronized to environmental time cues and they 
oscillate on a cycle of about 24.5 hours. Why do you suppose 
the circadian clocks of blind people are not entrained to the 
same 24-hour clock as the majority of the population? Can 
you guess what symptoms might be associated with a free- 
running circadian clock? Do you suppose that blind people 
have trouble sleeping? 

7-9 In humans, two closely related forms of apolipopro- 
tein B (ApoB) are found in blood as constituents of the plasma 
lipoproteins. ApoB48 (molecular mass, 48 kilodaltons) is syn- 
thesized by the intestine and is a key component of chylomi- 
crons, the large lipoprotein particles responsible for delivery 
of dietary triglycerides to adipose tissue for storage. ApoB 100 
(molecular mass, 100,000 kilodaltons) is synthesized in the 
liver for formation of the much smaller, very low-density 
lipoprotein particles used in the distribution of triglycerides 
to meet energy needs. A classic set of studies defined the sur- 
prising relationship between these two proteins. 

Sequences of cloned cDNA copies of the mRNAs from 
these two tissues revealed a single difference: cDNAs from 
intestinal cells had a T, as part of a stop codon, at a point 
where the cDNAs from liver cells had a C, as part of a glu- 
tamine codon (Figure Q7-3). To verify the differences in the 
mRNAs and to search for corresponding differences in the 
genome, RNA and DNA were isolated from intestinal and 
liver cells and then subjected to PCR amplification, using 
oligonucleotides that flanked the region of interest. The 
amplified DNA segments from the four samples were tested 
for the presence of the T or C by hybridization to oligonu- 
cleotides containing either the liver cDNA sequence (oligo- 
Q) or the intestinal cDNA sequence (oligo-STOP). The 
results are shown in Table Q7-1. 

Are the two forms of ApoB produced by transcriptional 
control from two different genes, by a processing control of 
the RNA transcript from a single gene, or by differential 
cleavage of the protein product from a single gene? Explain 
your reasoning. 

Table Q7-1 Hybridization of specific oligonucleotides to 
the amplified segments from liver and intestine RNA and 

DNA (Problem 7-9). 





RNA 
LIVER INTESTINE 


DNA 
LIVER INTESTINE 


Oligo-Q 
Oligo-STOP 


+ 

+ 


+ + 



Hybridization is indicated by +; its absence by - 
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Progress in science is often driven by advances in technology. The entire field of 
cell biology, for example, came into being when optical craftsmen learned to 
grind small lenses of sufficiently high quality to observe cells and their sub- 
structures. Innovations in lens grinding, rather than any conceptual or philo- 
sophical advance, allowed Hooke and van Leeuwenhoek to discover a previously 
unseen cellular world, where tiny creatures tumble and twirl in a small droplet 
of water (Figure 8-1). 

The 21st century promises to be a particularly exciting time for biology. New 
methods for analyzing proteins, DNA, and RNA are fueling an information 
explosion and allowing scientists to study cells and their macromolecules in pre- 
viously unimagined ways. We now have access to the sequences of many billions 
of nucleotides, providing the complete molecular blueprints for hundreds of 
organisms — from microbes and mustard weeds to worms, flies, mice, dogs, 
chimpanzees, and humans. And powerful new techniques are helping us to 
decipher that information, allowing us not only to compile huge, detailed cata- 
logs of genes and proteins but also to begin to unravel how these components 
work together to form functional cells and organisms. The long-range goal is 
nothing short of obtaining a complete understanding of what takes place inside 
a cell as it responds to its environment and interacts with its neighbors. We want 
to know which genes are switched on, which mRNA transcripts are present, and 
which proteins are active — where they are located, with what other proteins and 
other molecules they associate, and to which pathways or networks they belong. 
We also want to understand how the cell successfully manages this staggering 
number of variables and how it chooses among an almost unlimited number of 
possibilities in performing its diverse biological tasks. Such information will per- 
mit us to begin to build a framework for delineating, and eventually predicting, 
how genes and proteins operate to lay the foundations for life. 

In this chapter, we present some of the principal methods used to study the 
molecular components of cells, particularly proteins, DNA, and RNA. We con- 
sider how to separate cells of different types from tissues, how to grow cells out- 
side the body, and how to disrupt cells and isolate their organelles and con- 
stituent macromolecules in pure form. We also present the latest techniques 
used to determine protein structure, function, and interactions, and we discuss 
the breakthroughs in DNA technology that continue to revolutionize our under- 
standing of cell function. 

The techniques and methods described in this chapter have made possible 
the discoveries that are presented throughout this book, and they are currently 
being used by tens of thousands of scientists each day. 



ISOLATING CELLS AND GROWING THEM IN 
CULTURE 

Although the organelles and large molecules in a cell can be visualized with 
microscopes, understanding how these components function requires a 
detailed biochemical analysis. Most biochemical procedures require that large 
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Figure 8-1 Microscopic life. A sample of 
"diverse animalcules" seen by van 
Leeuwenhoek using his simple 
microscope. (A) Bacteria seen in material 
he excavated from between his teeth. 
Those in fig. B he described as "swimming 
first forward and then backwards" (1 692). 
(B) The eucaryotic green alga Vofvox 
(1 700). (Courtesy of the John Innes 
Foundation.) 
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numbers of cells be physically disrupted to gain access to their components. If 
the sample is a piece of tissue, composed of different types of cells, heteroge- 
neous cell populations will be mixed together. To obtain as much information as 
possible about the cells in a tissue, biologists have developed ways of dissociat- 
ing cells from tissues and separating them according to type. These manipula- 
tions result in a relatively homogeneous population of cells that can then be 
analyzed— either directly or after their number has been greatly increased by 
allowing the cells to proliferate in culture. 



Cells Can Be Isolated from Intact Tissues 

Intact tissues provide the most realistic source of material, as they represent the 
actual cells found within the body. The first step in isolating individual cells is to 
disrupt the extracellular matrix and cell-cell junctions that hold the cells 
together. For this purpose, a tissue sample is typically treated with proteolytic 
enzymes (such as trypsin and collagenase) to digest proteins in the extracellular 
matrix and with agents (such as ethylenediaminetetraacetic acid, or EDTA) that 
bind, or chelate, the Ca 2+ on which cell-cell adhesion depends. The tissue can 
then be teased apart into single cells by gentle agitation. 

For some biochemical preparations, the protein of interest can be obtained 
in sufficient quantity without having to separate the tissue or organ into cell 
types. Examples include the preparation of histones from calf thymus, actin 
from rabbit muscle, or tubulin from cow brain. In other cases, obtaining the 
desired protein requires enrichment for a specific cell type of interest. Several 
approaches are used to separate the different cell types from a mixed cell sus- 
pension. The most general cell-separation technique uses an antibody coupled 
to a fluorescent dye to label specific cells. An antibody is chosen that specifi- 
cally binds to the surface of only one cell type in the tissue. The labeled cells can 
then be separated from the unlabeled ones in an electronic fluorescence-acti- 
vated cell sorter. In this remarkable machine, individual cells traveling single 
file in a fine stream pass through a laser beam, and the fluorescence of each cell 
is rapidly measured. A vibrating nozzle generates tiny droplets, most contain- 
ing either one cell or no cells. The droplets containing a single cell are auto- 
matically given a positive or a negative charge at the moment of formation, 
depending on whether the cell they contain is fluorescent; they are then 
deflected by a strong electric field into an appropriate container. Occasional 
clumps of cells, detected by their increased light scattering, are left uncharged 
and are discarded into a waste container. Such machines can accurately select 
1 fluorescent cell from a pool of 1000 unlabeled cells and sort several thousand 
cells each second (Figure 8-2). 

Selected cells can also be obtained by carefully dissecting them from thin 
tissue slices that have been prepared for microscopic examination (discussed in 
Chapter 9). In one approach, a tissue section is coated with a thin plastic film 
and a region containing the cells of interest is irradiated with a focused pulse 
from an infrared laser. This light pulse melts a small circle of the film, binding 
the cells underneath. These captured cells are then removed for further analysis. 
The technique, called laser capture microdissection, can be used to separate and 
analyze cells from different areas of a tumor, allowing their properties or molec- 
ular composition to be compared with neighboring normal cells. A related 
method uses a laser beam to directly cut out a group of cells and catapult them 
into an appropriate container for future analysis (Figure 8-3). 

A uniform population of cells obtained by any of these or other separation 
methods can be used directly for biochemical analysis. After breaking open the 
cells by mechanical disruption, detergents, and other methods, cytoplasm or 
individual organelles can be extracted and then specific molecules purified. 



Cells Can Be Grown in Culture 

Although molecules can be extracted from whole tissues, this is often not the 
most convenient source of material, requiring, for example, early-morning trips 
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to a slaughterhouse. The problem is not only a question of convenience. The 
livestock commonly used as organ sources are not amenable to genetic manip- 
ulation. Moreover, the complexity of intact tissues and organs is an inherent dis- 
advantage when trying to purify particular molecules. Cells grown in culture 
provide a more homogeneous population of cells from which to extract material, 
and they are also much more convenient to work with in the laboratory. Given 
appropriate surroundings, most plant and animal cells can live, multiply, and 
even express differentiated properties in a tissue -culture dish. The cells can be 
watched continuously under the microscope or analyzed biochemically, and the 
effects of adding or removing specific molecules, such as hormones or growth 
factors, can be systematically explored. In addition, by mixing two cell types, the 
interactions between one cell type and another can be studied. 

Experiments performed on cultured cells are sometimes said to be carried 
out in vitro (literally "in glass") to contrast them with experiments using intact 
organisms, which are said to be carried out in vivo (literally "in the living organ- 
ism"). These terms can be confusing, however, because they are often used in a 
very different sense by biochemists. In the biochemistry lab, in vitro refers to 
reactions carried out in a test tube in the absence of living cells, whereas in vivo 



Figure 8-2 A fluorescence-activated cell 
sorter. A cell passing through the laser 
beam is monitored for fluorescence. 
Droplets containing single cells are given 
a negative or positive charge, depending 
on whether the cell is fluorescent or not. 
The droplets are then deflected by an 
electric field into collection tubes 
according to their charge. Note that the 
cell concentration must be adjusted so * 
that most droplets contain no cells and 
flow to a waste container together with 
any cell clumps. 
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Figure 8-3 Microdissection techniques 
to select cells from tissue slices. This 
method uses a laser beam to excise a 
region of interest and eject it into a 
container, and it permits the isolation of 
even a single cell from a tissue sample. 
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Figure 8-4 Light micrographs of cells in culture. (A) Mouse fibroblasts. 
(B) Chick myoblasts fusing to form multinucleate muscle cells. (C) Purified rat 
retinal ganglion nerve cells. (D) Tobacco cells in liquid culture. (A, courtesy of 
Daniel Zicha; B, courtesy of Rosalind Zalin; C, from A. Meyer-Franke et al., 
Neuron 15:805-819, 1995. With permission from Elsevier; D, courtesy of 
Gethin Roberts.) 

refers to any reaction taking place inside a living cell, even if that cell is growing 
in culture. 

Tissue culture began in 1907 with an experiment designed to settle a con- 
troversy in neurobiology. The hypothesis under examination was known as the 
neuronal doctrine, which states that each nerve fiber is the outgrowth of a sin- 
gle nerve cell and not the product of the fusion of many cells. To test this con- 
tention, small pieces of spinal cord were placed on clotted tissue fluid in a warm, 
moist chamber and observed at regular intervals under the microscope. After a 
day or so, individual nerve cells could be seen extending long, thin filaments 
(axons) into the clot. Thus, the neuronal doctrine received strong support, and 
the foundation was laid for the cell-culture revolution. 

These original experiments on nerve fibers used cultures of small tissue 
fragments called explants. Today, cultures are more commonly made from sus- 
pensions of cells dissociated from tissues using the methods described earlier. 
Unlike bacteria, most tissue cells are not adapted to living suspended in fluid 
and require a solid surface on which to grow and divide. For cell cultures, this 
support is usually provided by the surface of a plastic tissue-culture dish. Cells 
vary in their requirements, however, and many do not proliferate or differentiate 
unless the culture dish is coated with materials that cells like to adhere to, such 
as polylysine or extracellular matrix components. 

Cultures prepared directly from the tissues of an organism are called pri- 
mary cultures. These can be made with or without an initial fractionation step to 
separate different cell types. In most cases, cells in primary cultures can be 
removed from the culture dish and recultured repeatedly in so-called secondary 
cultures; in this way, they can be repeatedly subcultured (passaged) for weeks or 
months. Such cells often display many of the differentiated properties appropri- 
ate to their origin (Figure 8-4): fibroblasts continue to secrete collagen; cells 
derived from embryonic skeletal muscle fuse to form muscle fibers that contract 
spontaneously in the culture dish; nerve cells extend axons that are electrically 
excitable and make synapses with other nerve cells; and epithelial cells form 
extensive sheets with many of the properties of an intact epithelium. Because 
these properties are maintained in culture, they are accessible to study in ways 
that are often not possible in intact tissues. 

Cell culture is not limited to animal cells. When a piece of plant tissue is cul- 
tured in a sterile medium containing nutrients and appropriate growth regula- 
tors, many of the cells are stimulated to proliferate indefinitely in a disorganized 
manner, producing a mass of relatively undifferentiated cells called a callus. If 
the nutrients and growth regulators are carefully manipulated, one can induce 
the formation of a shoot and then root apical meristems within the callus, and, 




50|am 



ISOLATING CELLS AND GROWING THEM IN CULTURE 505 

in many species, regenerate a whole new plant. Similar to animal cells, callus 
cultures can be mechanically dissociated into single cells, which will grow and 
divide as a suspension culture (see Figure 8-4D). 

Eucaryotic Cell Lines Are a Widely Used Source of Homogeneous 
Cells 

The cell cultures obtained by disrupting tissues tend to suffer from a problem — 
eventually the cells die. Most vertebrate cells stop dividing after a finite number 
of cell divisions in culture, a process called replicative cell senescence (discussed 
in Chapter 17). Normal human fibroblasts, for example, typically divide only 
25-40 times in culture before they stop. In these cells, the limited proliferation 
capacity reflects a progressive shortening and uncapping of the cell's telomeres, 
the repetitive DNA sequences and associated proteins that cap the ends of each 
chomosome (discussed in Chapter 5). Human somatic cells in the body have 
turned off production of the enzyme, called telomerase, that normally maintains 
the telomeres, which is why their telomeres shorten with each cell division. 
Human fibroblasts can often be coaxed to proliferate indefinitely by providing 
them with the gene that encodes the catalytic subunit of telomerase; in this case, 
they can be propagated as an "immortalized" cell line. 

Some human cells, however, cannot be immortalized by this trick. Although 
their telomeres remain long, they still stop dividing after a limited number of 
divisions because the culture conditions eventually activate cell-cycle check- 
point mechanisms (discussed in Chapter 17) that arrest the cell cycle — a process 
sometimes called "culture shock." In order to immortalize these cells, one has to 
do more than introduce telomerase. One must also inactivate the checkpoint 
mechanisms. This can be done by introducing certain cancer-promoting onco- 
genes, such as those derived from tumor viruses (discussed in Chapter 20). 
Unlike human cells, most rodent cells do not turn off production of telomerase 
and therefore their telomeres do not shorten with each cell division. Therefore, 
if culture shock can be avoided, some rodent cell types will divide indefinitely in 
culture. In addition, rodent cells often undergo genetic changes in culture that 
inactivate their checkpoint mechanisms, thereby spontaneously producing 
immortalized cell lines. 

Cell lines can often be most easily generated from cancer cells, but these cul- 
tures differ from those prepared from normal cells in several ways, and are 
referred to as transformed cell lines. Transformed cell lines often grow without 
attaching to a surface, for example, and they can proliferate to a much higher 
density in a culture dish. Similar properties can be induced experimentally in 
normal cells by transforming them with a tumor-inducing virus or chemical. 
The resulting transformed cell lines can usually cause tumors if injected into a 
susceptible animal (although it is usually only a small subpopulation, called 
cancer stem cells, that can do so — discussed in Chapter 20). 

Both transformed and nontransformed cell lines are extremely useful in cell 
research as sources of very large numbers of cells of a uniform type, especially 
since they can be stored in liquid nitrogen at -196°C for an indefinite period and 
retain their viability when thawed. It is important to keep in mind, however, that 
the cells in both types of cell lines nearly always differ in important ways from 
their normal progenitors in the tissues from which they were derived. 

Some widely used cell lines are listed in Table 8-1. Different lines have differ- 
ent advantages; for example, the PtK epithelial cell lines derived from the rat kan- 
garoo, unlike many other cell lines which round up during mitosis, remain flat 
during mitosis, allowing the mitotic apparatus to be readily observed in action. 

Embryonic Stem Cells Could Revolutionize Medicine 

Among the most promising cell lines to be developed — from a medical point of 
view— are embryonic stem (ES) cells. These remarkable cells, first harvested 
from the inner cell mass of the early mouse embryo, can proliferate indefinitely 
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Table 8-1 Some Commonly Used Cell Lines 

- LINE* CELL TYPE AND ORIGIN 



3T3 

BHK21 

MDCK 

HeLa 

PtK1 

L6 

PCI 2 

SP2 

COS 

293 

CHO 

DT40 

R1 

E14.1 

H1,H9 

S2 

BY2 



fibroblast (mouse) 

fibroblast (Syrian hamster) 

epithelial cell (dog) 

epithelial cell (human) 

epithelial cell (rat kangaroo) 

myoblast (rat) 

chromaffin cell (rat) 

plasma cell (mouse) 

kidney (monkey) 

kidney (human); transformed with adenovirus 

ovary (Chinese hamster) 

lymphoma cell for efficient targeted recombination (chick) 

embryonic stem cell (mouse) 

embryonic stem cell (mouse) 

embryonic stem cell (human) 

macro phage- 1 ike cell (Drosophila) 

undifferentiated meristematic cell (tobacco) 



*Many of these cell lines were derived from tumors All of them are capable of indefinite 
replication in culture and express at least some of the special characteristics of their cell's of origin. 

in culture and yet retain an unrestricted developmental potential If the cells 
from the culture dish are put back into an early embryonic environment, they 
can give rise to all the cell types in the body, including germ cells (Figure 8-5). 
Their descendants in the embryo are able to integrate perfectly into whatever 
site they come to occupy, adopting the character and behavior that normal cells 
would show at that site. 

Cells with properties similar to those of mouse ES cells can now be derived 
from early human embryos, creating a potentially inexhaustible supply of cells 
that might be used to replace and repair damaged mature human tissue. Exper- 
iments in mice suggest that it may be possible, in the future, to use ES cells to 
produce specialized cells for therapy— to replace the skeletal muscle fibers that 
degenerate in victims of muscular dystrophy, the nerve cells that die in patients 
with Parkinson's disease, the insulin- secreting cells that are destroyed in type I 
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Figure 8-5 Embryonic stem (ES) cells 
derived from an embryo. These cultured 
cells can give rise to all of the cell types 
of the body. ES cells are harvested from 
the inner cell mass of an early embryo 
and can be maintained indefinitely as 
stem cells (discussed in Chapter 23) in 
culture. If they are put back into an 
embryo, they will integrate perfectly and 
differentiate to suit whatever 
environment they find themselves. The 
cells can also be kept in culture as an 
immortal cell line; they can then be 
supplied with different hormones or 
growth factors to encourage them to 
differentiate into specific cell types. 
(Based on E. Fuchs and J.A. Segre, Cell 
1 00:1 43-1 55, 2000. With permission 
from Elsevier.) 
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diabetics, and the cardiac muscle cells that die during a heart attack. Perhaps one 
day it may even become possible to grow entire organs from ES cells by a recapit- 
ulation of embryonic development. It is important not to transplant ES cells by 
themselves into adults, as they can produce tumors called teratomas. 

There is another major problem associated with the use of ES-cell-derived 
cells for tissue repair. If the transplanted cells differ genetically from the cells of 
the patient into whom they are grafted, the patient's immune system will reject 
and destroy those cells. This problem can be avoided, of course, if the cells used 
for repair are derived from the patient's own body. As discussed in Chapter 23, 
many adult tissues contain stem cells dedicated to continuous production of 
just one or a few specialized cell types, and a great deal of stem-cell research 
aims to manipulate the behavior of these adult stem cells for use in tissue repair. 

ES cell technology, however, in theory at least, also offers another way 
around the problem of immune rejection, using a strategy known as "therapeu- 
tic cloning," as we explain next. 



Somatic Cell Nuclear Transplantation May Provide a Way to 
Generate Personalized Stem Cells 



The term "cloning" has been used in confusing ways as a shorthand term for sev- 
eral quite distinct types of procedures. It is important to understand the distinc- 
tions, particularly in the context of public debates about the ethics of stem cell 
research. 

As biologists define the term, a clone is simply a set of individuals that are 
genetically identical because they have descended from a single ancestor. The 
simplest type of cloning is the cloning of cells. Thus, one can take a single epi- 
dermal stem cell from the skin and let it grow and divide in culture to obtain a 
large clone of genetically identical epidermal cells, which can, for example, be 
used to help reconstruct the skin of a badly burned patient. This kind of cloning 
is no more than an extension by artificial means of the processes of cell prolifer- 
ation and repair that occur in a normal human body. 

The cloning of entire multicellular animals, called reproductive cloning, is a 
very different enterprise, involving a far more radical departure from the ordi- 
nary course of nature. Normally, each individual animal has both a mother and 
a father, and is not genetically identical to either of them. In reproductive 
cloning, the need for two parents and sexual union is bypassed. For mammals, 
this difficult feat has been achieved in sheep and some other domestic animals 
by somatic cell nuclear transplantation. The procedure begins with an unfertil- 
ized egg cell. The nucleus of this haploid cell is sucked out and replaced by a 
nucleus from a regular diploid somatic cell. The diploid donor cell is typically 
taken from a tissue of an adult individual. The hybrid cell, consisting of a diploid 
donor nucleus in a host egg cytoplasm, is allowed to develop for a short while 
in culture. In a small proportion of cases, this procedure can give rise to an early 
embryo, which is then put into the uterus of a foster mother (Figure 8-6). If the 
experimenter is lucky, development continues like that of a normal embryo, 
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Figure 8-6 Reproductive and 
therapeutic cloning. Cells from adult 
tissue can be used for reproductive 
cloning or for generating personalized 
ES cells (so-called therapeutic cloning). 
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giving rise, eventually, to a whole new animal. An individual produced in this 
way, by reproductive cloning, should be genetically identical to the adult indi- 
vidual that donated the diploid cell (except for the small amount of genetic 
information in mitochondria, which is inherited solely from the egg cytoplasm). 

Therapeutic cloning, which is very different from reproductive cloning, 
employs the technique of somatic cell nuclear transplantation to produce per- 
sonalized ES cells (see Figure 8-6). In this case, the very early embryo generated 
by nuclear transplantation, consisting of about 200 cells, is not transferred to the 
uterus of a foster mother. Instead, it is used as a source from which ES cells are 
derived in culture, with the aim of generating various cell types that can be used 
for tissue repair. Cells obtained by this route are genetically nearly identical to 
the donor of the original nucleus, so they can be grafted back into the donor, 
without fear of immunological rejection. 

Somatic cell nuclear transfer has an additional potential benefit— for study- 
ing inherited human diseases. ES cells that have received a somatic nucleus from 
an individual with an inherited disorder can be used to directly study the way in 
which the disease develops as the ES cells are induced to differentiate into dis- 
tinct cell types. "Disease-specific" ES cells and their differentiated progeny can 
also be used to study the progression of such diseases and to test and develop 
new drugs to treat the disorders. These strategies are still in their infancy and 
some countries outlaw certain aspects of the research. It remains to be seen 
whether human ES cells can be produced by nuclear transfer and whether 
human ES cells will fulfill the great hopes that medical scientists have for them. 



Hybridoma Cell Lines Are Factories That Produce Monoclonal 
Antibodies 

As we see throughout this book, antibodies are particularly useful tools for cell 
biology. Their great specificity allows precise visualization of selected proteins 
among the many thousands that each cell typically produces. Antibodies are 
often produced by inoculating animals with the protein of interest and subse- 
quently isolating the antibodies specific to that protein from the serum of the 
animal. However, only limited quantities of antibodies can be obtained from a 
single inoculated animal, and the antibodies produced will be a heterogeneous 
mixture of antibodies that recognize a variety of different antigenic sites on a 
macromolecule that differs from animal to animal. Moreover, antibodies spe- 
cific for the antigen will constitute only a fraction of the antibodies found in the 
serum. An alternative technology, which allows the production of an infinite 
quantity of identical antibodies and greatly increases the specificity and conve- 
nience of antibody-based methods, is the production of monoclonal antibodies 
by hybridoma cell lines. 

This technology, developed in 1975, has revolutionized the production of 
antibodies for use as tools in cell biology, as well as for the diagnosis and treat- 
ment of certain diseases, including rheumatoid arthritis and cancer. The proce- 
dure requires hybrid cell technology (Figure 8-7), and it involves propagating a 
clone of cells from a single antibody-secreting B lymphocyte to obtain a homo- 
geneous preparation of antibodies in large quantities. B lymphocytes normally 
have a limited life-span in culture, but individual antibody-producing B lym- 
phocytes from an immunized mouse or rat, when fused with cells derived from 
a transformed B lymphocyte cell line, can give rise to hybrids that have both the 
ability to make a particular antibody and the ability to multiply indefinitely in 
culture. These hybridomas are propagated as individual clones, each of which 
provides a permanent and stable source of a single type of monoclonal anti- 
body (Figure 8-8). Each type of monoclonal antibody recognizes a single type of 
antigenic site— for example, a particular cluster of five or six amino acid side 
chains on the surface of a protein. Their uniform specificity makes monoclonal 
antibodies much more useful than conventional antisera for most purposes. 

An important advantage of the hybridoma technique is that monoclonal 
antibodies can be made against molecules that constitute only a minor compo- 
nent of a complex mixture. In an ordinary antiserum made against such a mix- 
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ture, the proportion of antibody molecules that recognize the minor compo- 
nent would be too small to be useful. But if the B lymphocytes that produce the 
various components of this antiserum are made into hybridomas, it becomes 
possible to screen individual hybridoma clones from the large mixture to select 
one that produces the desired type of monoclonal antibody and to propagate 
the selected hybridoma indefinitely so as to produce that antibody in unlimited 
quantities. In principle, therefore, a monoclonal antibody can be made against 
any protein in a biological sample. Once an antibody has been made, it can be 
used to localize the protein in cells and tissues, to follow its movement, and to 
purify the protein to study its structure and function. 



Figure 8-7 The production of hybrid 
cells. It is possible to fuse one cell with 
another to form a heterocaryon, a 
combined cell with two separate nuclei. 
Typically, a suspension of cells is treated 
with certain inactivated viruses or with 
polyethylene glycol, each of which alters 
the plasma membranes of cells in a way 
that induces them to fuse. Eventually, a 
heterocaryon proceeds to mitosis and 
produces a hybrid cell in which the two 
separate nuclear envelopes have been 
disassembled, allowing all the 
chromosomes to be brought together in 
a single large nucleus. Such hybrid cells 
can give rise to immortal hybrid cell lines. 
If one of the parent cells was from a 
tumor cell line, the hybrid cell is called a 
hybridoma. 



mouse immunized 
with antigen X 




cell making 
anti-X antibody 

B lymphocytes (die after 
a few days in culture) 



mutant cell line derived from 
a tumor of B lymphocytes 




(cells grow indefinitely in normal 
medium but die in selective medium) 



FUSION 

I 

resulting hybridoma cells cultured in multiple wells 



AV.V. 1 

I 

2 and pre 

I I 






only hybridoma cells survive and proliferate in the selective medium 

secreted 
, anti-X antibody 



J&_ Hi A A 



>v 



supernatant tested for anti-X 
antibody, and cells from positive 
well cultured at ~1 cell per well 



mmiiiiii 



q.gi n.n 



cells allowed to multiply, and 
individual supernatants tested 
for anti-X antibodies 

positive clones provide 
a continuing source of 
anti-X antibody 






Figure 8-8 Preparation of hybridomas 
that secrete monoclonal antibodies 
against a particular antigen. Here, the 
antigen of interest is designated as 
"antigen X." The selective growth 
medium used after the cell fusion step 
contains an inhibitor (aminopterin) that 
blocks the normal biosynthetic pathways 
by which nucleotides are made. The cells 
must therefore use a bypass pathway to 
synthesize their nucleic acids. This 
pathway is defective in the mutant cell 
line derived from the B cell tumor, but it 
is intact in the normal cells obtained from 
the immunized mouse. Because neither 
cell type used for the initial fusion can 
survive and proliferate on its own, only 
the hybridoma cells do so. 
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Summary 

Tissues can be dissociated into their component cells, from which individual cell types 
can be purified and used for biochemical analysis or for the establishment of cell cul- 
tures. Many animal and plant cells survive and proliferate in a culture dish if they are 
provided with a suitable culture medium containing nutrients and appropriate signal 
molecules. Although many animal cells stop dividing after a finite number of cell divi- 
sions, cells that have been immortalized through spontaneous mutations or genetic 
manipulation can be maintained indefinitely as cell lines. Embryonic stem cells can 
proliferate indefinitely in a culture dish, while retaining the ability to differentiate into 
all the different cell types of the body. They therefore hold great medical promise. 
Hybridoma cells are widely employed to produce unlimited quantities of uniform 
monoclonal antibodies, which are used to detect and purify cell proteins, as well as to 
diagnose and treat diseases. 



PURIFYING PROTEINS 

The challenge of isolating a single type of protein from the thousands of other 
proteins present in a cell is a formidable one, but must be overcome in order to 
study protein function in vitro. As we shall see later in this chapter, recombinant 
DNA technology can enormously simplify this task by "tricking" cells into pro- 
ducing large quantities of a given protein, thereby making its purification much 
easier. Whether the source of the protein is an engineered cell or a natural tissue, 
a purification procedure usually starts with subcellular fractionation to reduce 
the complexity of the material, and is then followed by purification steps of 
increasing specificity. 



Cells Can Be Separated into Their Component Fractions 

In order to purify a protein, it must first be extracted from inside the cell. Cells 
can be broken up in various ways: they can be subjected to osmotic shock or 
ultrasonic vibration, forced through a small orifice, or ground up in a blender. 
These procedures break many of the membranes of the cell (including the 
plasma membrane and endoplasmic reticulum) into fragments that immedi- 
ately reseal to form small closed vesicles. If carefully carried out, however, the 
disruption procedures leave organelles such as nuclei, mitochondria, the Golgi 
apparatus, lysosomes, and peroxisomes largely intact. The suspension of cells is 
thereby reduced to a thick slurry (called a homogenate or extract) that contains 
a variety of membrane-enclosed organelles, each with a distinctive size, charge, 
and density. Provided that the homogenization medium has been carefully cho- 
sen (by trial and error for each organelle), the various components — including 
the vesicles derived from the endoplasmic reticulum, called microsomes — 
retain most of their original biochemical properties. 

The different components of the homogenate must then be separated. Such 
cell fractionations became possible only after the commercial development in 
the early 1940s of an instrument known as the preparative ultracentrifuge, which 
rotates extracts of broken cells at high speeds (Figure 8-9). This treatment sep- 
arates cell components by size and density: in general, the largest units experi- 
ence the largest centrifugal force and move the most rapidly. At relatively low 
speed, large components such as nuclei sediment to form a pellet at the bottom 
of the centrifuge tube; at slightly higher speed, a pellet of mitochondria is 
deposited; and at even higher speeds and with longer periods of centrifugation, 
first the small closed vesicles and then the ribosomes can be collected (Figure 
8-10). All of these fractions are impure, but many of the contaminants can be 
removed by resuspending the pellet and repeating the centrifugation procedure 
several times. 

Centrifugation is the first step in most fractionations, but it separates only 
components that differ greatly in size. A finer degree of separation can be achieved 
by layering the homogenate in a thin band on top of a dilute salt solution that fills 
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Figure 8-9 The preparative 
ultracentrifuge. The sample is contained 
in tubes that are inserted into a ring of 
cylindrical holes in a metal rotor. Rapid 
rotation of the rotor generates enormous 
centrifugal forces, which cause particles 
in the sample to sediment. The vacuum 
reduces friction, preventing heating of 
the rotor and allowing the refrigeration 
system to maintain the sample at 4°C. 
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Figure 8-10 Cell fractionation by centrifugation. Repeated centrifugation 

at progressively higher speeds will fractionate homogenates of cells into 

their components. In general, the smaller the subcellular component, the 

greater is the centrifugal force required to sediment it. Typical values for 

the various centrifugation steps referred to in the figure are: 

low speed: 1000 times gravity for 10 minutes 

medium speed: 20,000 times gravity for 20 minutes 

high speed: 80,000 times gravity for 1 hour 

very high speed: 1 50,000 times gravity for 3 hours 



a centrifuge tube. When centrifuged, the various components in the mixture 
move as a series of distinct bands through the salt solution, each at a different 
rate, in a process called velocity sedimentation (Figure 8-1 1A) . For the procedure 
to work effectively, the bands must be protected from convective mixing, which 
would normally occur whenever a denser solution (for example, one containing 
organelles) finds itself on top of a lighter one (the salt solution) . This is achieved 
by augmenting the solution in the tube with a shallow gradient of sucrose pre- 
pared by a special mixing device. The resulting density gradient — with the dense 
end at the bottom of the tube — keeps each region of the salt solution denser 
than any solution above it, and it thereby prevents convective mixing from dis- 
torting the separation. 

When sedimented through such dilute sucrose gradients, different cell 
components separate into distinct bands that can be collected individually. The 
relative rate at which each component sediments depends primarily on its size 
and shape — normally being described in terms of its sedimentation coefficient, 
or S value. Present-day ultracentrifuges rotate at speeds of up to 80,000 rpm 
and produce forces as high as 500,000 times gravity. These enormous forces 
drive even small macromolecules, such as tRNA molecules and simple 
enzymes, to sediment at an appreciable rate and allow them to be separated 
from one another by size. 

The ultracentrifuge is also used to separate cell components on the basis of 
their buoyant density, independently of their size and shape. In this case the 
sample is sedimented through a steep density gradient that contains a very high 
concentration of sucrose or cesium chloride. Each cell component begins to 
move down the gradient as in Figure 8-1 1A, but it eventually reaches a position 
where the density of the solution is equal to its own density. At this point the 
component floats and can move no farther. A series of distinct bands is thereby 
produced in the centrifuge tube, with the bands closest to the bottom of the tube 
containing the components of highest buoyant density (Figure 8-1 IB). This 
method, called equilibrium sedimentation, is so sensitive that it can separate 
macromolecules that have incorporated heavy isotopes, such as 13 C or 15 N, from 
the same macromolecules that contain the lighter, common isotopes ( 12 C or 
14 N). In fact, the cesium- chloride method was developed in 1957 to separate the 
labeled from the unlabeled DNA produced after exposure of a growing popula- 
tion of bacteria to nucleotide precursors containing 15 N; this classic experiment 
provided direct evidence for the semiconservative replication of DNA (see Fig- 
ure 5-5) . 
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Cell Extracts Provide Accessible Systems to Study Cell Functions 

Studies of organelles and other large subcellular components isolated in the 
ultracentrifuge have contributed enormously to our understanding of the func- 
tions of different cell components. Experiments on mitochondria and chloro- 
plasts purified by centrifugation, for example, demonstrated the central func- 
tion of these organelles in converting energy into forms that the cell can use. 
Similarly resealed vesicles formed from fragments of rough and smooth endo- 
plasmic reticulum (microsomes) have been separated from each other and ana- 
lyzed as functional models of these compartments of the intact cell 

Similarly highly concentrated cell extracts, especially extracts of Xenopus 
laevis (African clawed frog) oocytes, have played a critical role in the study of 
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Figure 8-11 Comparison of velocity 
sedimentation and equilibrium 
sedimentation. (A) In velocity 
sedimentation, subcellular components 
sediment at different speeds according to 
their size and shape when layered over a 
dilute solution containing sucrose. To 
stabilize the sedimenting bands against 
convective mixing caused by small 
differences in temperature or solute 
concentration, the tube contains a 
continuous shallow gradient of sucrose, 
which increases in concentration toward 
the bottom of the tube (typically from 5% 
to 20% sucrose). After centrifugation, the 
different components can be collected 
individually, most simply by puncturing 
the plastic centrifuge tube and collecting 
drops from the bottom, as illustrated 
here. (B) In equilibrium sedimentation, 
subcellular components move up or 
down when centrifuged in a gradient 
until they reach a position where their 
density matches their surroundings. 
Although a sucrose gradient is shown 
here, denser gradients, which are 
especially useful for protein and nucleic 
acid separation, can be formed from 
cesium chloride. The final bands, at 
equilibrium, can be collected as in (A). 
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such complex and highly organized processes as the cell-division cycle, the sep- 
aration of chromosomes on the mitotic spindle, and the vesicular-transport 
steps involved in the movement of proteins from the endoplasmic reticulum 
through the Golgi apparatus to the plasma membrane. 

Cell extracts also provide, in principle, the starting material for the complete 
separation of all of the individual macromolecular components of the cell. We 
now consider how this separation is achieved, focusing on proteins. 



Proteins Can Be Separated by Chromatography 

Proteins are most often fractionated by column chromatography, in which a 
mixture of proteins in solution is passed through a column containing a porous 
solid matrix. The different proteins are retarded to different extents by their 
interaction with the matrix, and they can be collected separately as they flow out 
of the bottom of the column (Figure 8-12). Depending on the choice of matrix, 
proteins can be separated according to their charge (ion-exchange chromatog- 
raphy), their hydrophobicity (hydrophobic chromatography), their size (gel-fil- 
tration chromatography), or their ability to bind to particular small molecules or 
to other macromolecules (affinity chromatography). 

Many types of matrices are commercially available (Figure 8-13). Ion- 
exchange columns are packed with small beads that carry either a positive or a 
negative charge, so that proteins are fractionated according to the arrangement 
of charges on their surface. Hydrophobic columns are packed with beads from 
which hydrophobic side chains protrude, selectively retarding proteins with 
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exposed hydrophobic regions. Gel-filtration columns, which separate proteins 
according to their size, are packed with tiny porous beads: molecules that are 
small enough to enter the pores linger inside successive beads as they pass down 
the column, while larger molecules remain in the solution flowing between the 
beads and therefore move more rapidly, emerging from the column first. Besides 
providing a means of separating molecules, gel-filtration chromatography is a 
convenient way to determine their size. 

Inhomogeneities in the matrices (such as cellulose), which cause an uneven 
flow of solvent through the column, limit the resolution of conventional column 
chromatography. Special chromatography resins (usually silica-based) com- 
posed of tiny spheres (3-10 jam in diameter) can be packed with a special appa- 
ratus to form a uniform column bed. Such high-performance liquid chro- 
matography (HPLC) columns attain a high degree of resolution. In HPLC, the 
solutes equilibrate very rapidly with the interior of the tiny spheres, and so 
solutes with different affinities for the matrix are efficiently separated from one 
another even at very fast flow rates. HPLC is therefore the method of choice for 
separating many proteins and small molecules. 



Affinity Chromatography Exploits Specific Binding Sites on 
Proteins 

If one starts with a complex mixture of proteins, the types of column chro- 
matography just discussed do not produce very highly purified fractions: a sin- 
gle passage through the column generally increases the proportion of a given 
protein in the mixture no more than twentyfold. Because most individual pro- 
teins represent less than 1/1000 of the total cell protein, it is usually necessary to 
use several different types of columns in succession to attain sufficient purity 
(Figure 8-14). A more efficient procedure, known as affinity chromatography, 
takes advantage of the biologically important binding interactions that occur on 
protein surfaces. If a substrate molecule is covalently coupled to an inert matrix 
such as a polysaccharide bead, the enzyme that operates on that substrate will 
often be specifically retained by the matrix and can then be eluted (washed out) 
in nearly pure form. Likewise, short DNA oligonucleotides of a specifically 
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Figure 8-1 2 The separation of 
molecules by column chromatography. 

The sample, a solution containing a 
mixture of different molecules, is applied 
to the top of a cylindrical glass or plastic 
column filled with a permeable solid 
matrix, such as cellulose. A large amount 
of solvent is then pumped slowly 
through the column and collected in 
separate tubes as it emerges from the 
bottom. Because various components of 
the sample travel at different rates 
through the column, they are 
fractionated into different tubes. 
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(C) AFFINITY CHROMATOGRAPHY 



Figure 8-13 Three types of matrices used for chromatography. (A) In ion-exchange chromatography, the insoluble matrix 
carries ionic charges that retard the movement of molecules of opposite charge. Matrices used for separating proteins 
include diethylaminoethylcellulose (DEAE-cellulose), which is positively charged, and carboxymethylcellulose (CM-cellulose) 
and phosphocellulose, which are negatively charged. Analogous matrices based on agarose or other polymers are also 
frequently used. The strength of the association between the dissolved molecules and the ion-exchange matrix depends on 
both the ionic strength and the pH of the solution that is passing down the column, which may therefore be varied 
systematically (as in Figure 8-14) to achieve an effective separation. (B) In gel-filtration chromatography, the matrix is inert 
but porous. Molecules that are small enough to penetrate into the matrix are thereby delayed and travel more slowly 
through the column than larger molecules that cannot penetrate. Beads of cross-linked polysaccharide (dextran, agarose, or 
acrylamide) are available commercially in a wide range of pore sizes, making them suitable for the fractionation of 
molecules of various molecular weights, from less than 500 daltons to more than 5 x 1 6 daltons. (C) Affinity 
chromatography uses an insoluble matrix that is covalently linked to a specific ligand, such as an antibody molecule or an 
enzyme substrate, that will bind a specific protein. Enzyme molecules that bind to immobilized substrates on such columns 
can be eluted with a concentrated solution of the free form of the substrate molecule, while molecules that bind to 
immobilized antibodies can be eluted by dissociating the antibody-antigen complex with concentrated salt solutions or 
solutions of high or low pH. High degrees of purification can be achieved in a single pass through an affinity column. 



designed sequence can be immobilized in this way and used to purify DNA- 
binding proteins that normally recognize this sequence of nucleotides in chro- 
mosomes (see Figure 7-28). Alternatively specific antibodies can be coupled to 
a matrix to purify protein molecules recognized by the antibodies. Because of 
the great specificity of all such affinity columns, 1000- to 10,000-fold purifica- 
tions can sometimes be achieved in a single pass. 



Genetically-Engineered Tags Provide an Easy Way to Purify 
Proteins 

Using the recombinant DNA methods discussed in subsequent sections, any 
gene can be modified to produce its protein with a special recognition tag 
attached to it, so as to make subsequent purification of the protein by affinity 
chromatography simple and rapid. Often the recognition tag is itself an anti- 
genic determinant, or epitope, which can be recognized by a highly specific 
antibody The antibody can then be used both to localize the protein in cells 
and to purify it (Figure 8-15). Other types of tags are specifically designed for 
protein purification. For example, the amino acid histidine binds to certain 
metal ions, including nickel and copper. If genetic engineering techniques are 
used to attach a short string of histidines to one end of a protein, the slightly 
modified protein can be retained selectively on an affinity column containing 
immobilized nickel ions. Metal affinity chromatography can thereby be used to 
purify the modified protein from a complex molecular mixture. 

In other cases, an entire protein is used as the recognition tag. When cells are 
engineered to synthesize the small enzyme glutathione S-transferase (GST) 
attached to a protein of interest, the resulting fusion protein can be purified from 
the other contents of the cell with an affinity column containing glutathione, a 
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substrate molecule that binds specifically and tightly to GST. If the purification 
is carried out under conditions that do not disrupt protein-protein interactions, 
the fusion protein can be isolated in association with the proteins it interacts 
with inside the cell (Figure 8-16). 

As a further refinement of purification methods using recognition tags, an 
amino acid sequence that forms a cleavage site for a highly specific proteolytic 
enzyme can be engineered between the protein of choice and the recognition 
tag. Because the amino acid sequences at the cleavage site are very rarely found 
by chance in proteins, the tag can later be cleaved off without destroying the 
purified protein. 

This type of specific cleavage is used in an especially powerful purification 
methodology known as tandem affinity purification tagging (tap -tagging). Here, 
one end of a protein is engineered to contain two recognition tags that are sep- 
arated by a protease cleavage site. The tag on the very end of the construct is 
chosen to bind irreversibly to an affinity column, allowing the column to be 
washed extensively to remove all contaminating proteins. Protease cleavage 
then releases the protein, which is then further purified using the second tag. 



Figure 8-14 Protein purification by 
chromatography. Typical results 
obtained when three different 
chromatographic steps are used in 
succession to purify a protein. In this 
example, a homogenate of cells was first 
fractionated by allowing it to percolate 
through an ion-exchange resin packed 
into a column (A). The column was 
washed to remove all unbound 
contaminants, and the bound proteins 
were then eluted by passing a solution 
containing a gradually increasing 
concentration of salt onto the top of the 
column. Proteins with the lowest affinity 
for the ion-exchange resin passed 
directly through the column and were 
collected in the earliest fractions eluted 
from the bottom of the column. The 
remaining proteins were eluted in 
sequence according to their affinity for 
the resin — those proteins binding most 
tightly to the resin requiring the highest 
concentration of salt to remove them. 
The protein of interest was eluted in 
several fractions and was detected by its 
enzymatic activity. The fractions with 
activity were pooled and then applied to 
a second, gel-filtration column (B).The 
elution position of the still-impure 
protein was again determined by its 
enzymatic activity, and the active 
fractions were pooled and purified to 
homogeneity on an affinity column (C) 
that contained an immobilized substrate 
of the enzyme. (D) Affinity purification of 
cyclin-binding proteins from 5. cerevisiae, 
as analyzed by SDS polyacrylamide-gel 
electrophoresis, which is described below 
in Figure 8-18. Lane 1 is a total cell 
extract; lane 2 shows the proteins eluted 
from an affinity column containing cyclin 
B2; lane 3 shows one major protein 
eluted from a cyclin B3 affinity column. 
Proteins in lanes 2 and 3 were eluted 
from the affinity columns with salt, and 
the gel was stained with Coomassie blue. 
The scale at the left shows the molecular 
weights of marker proteins, in 
kilodaitons. (D, from D. Kellogg et al„ 
1 Cell Biol. 1 30:675-685, 1 995. With 
permisison from The Rockefeller 
University Press.) 
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Figure 8-1 5 Epitope tagging for the localization or purification of 
proteins. Using standard genetic engineering techniques, a short peptide 
tag can be added to a protein of interest. If the tag is itself an antigenic 
determinant, or epitope, it can be targeted by an appropriate commercially 
available antibody. The antibody, suitably labeled, can be used to 
determine the location of the protein in cells or to purify it by 
immunoprecipitation or affinity chromatography. In immunoprecipitation, 
antibodies directed against the epitope tag are added to a solution 
containing the tagged protein; the antibodies specifically cross-link the 
tagged protein molecules and precipitate them out of solution as 
antibody-protein complexes. 



Because this two-step strategy provides an especially high degree of protein 
purification with relatively little effort, it is used extensively in cell biology. Thus, 
for example, a set of approximately 6000 yeast strains, each with a different gene 
fused to DNA that encodes a tap-tag, has been constructed to allow any yeast 
protein to be rapidly purified. 
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Purified Cell-free Systems Are Required for the Precise Dissection 
of Molecular Functions 

It is important to study biological processes free from all of the complex side 
reactions that occur in a living cell by using purified cell-free systems. To make 
this possible, cell homogenates are fractionated with the aim of purifying each 
of the individual macromolecules that are needed to catalyze a biological pro- 
cess of interest. For example, the experiments to decipher the mechanisms of 
protein synthesis began with a cell homogenate that could translate RNA 
molecules to produce proteins. Fractionation of this homogenate, step by step, 
produced in turn the ribosomes, tRNAs, and various enzymes that together con- 
stitute the protein-synthetic machinery. Once individual pure components were 
available, each could be added or withheld separately to define its exact role in 
the overall process. 

A major goal for cell biologists is the reconstitution of every biological pro- 
cess in a purified cell-free system. Only in this way can one define all of the com- 
ponents needed for the process and control their concentrations, as required to 
work out their precise mechanism of action. Although much remains to be done, 
a great deal of what we know today about the molecular biology of the cell has 
been discovered by studies in such cell-free systems. They have been used, for 
example, to decipher the molecular details of DNA replication and DNA tran- 
scription, RNA splicing, protein translation, muscle contraction, and particle 
transport along microtubules, and many other processes that occur in cells. 



Summary 

Populations of cells can be analyzed biochemically by disrupting them and fraction- 
ating their contents, allowing functional cell- free systems to be developed. Highly 
purified cell-free systems are needed for determining the molecular details of com- 
plex cell processes, requiring extensive purification of all the proteins and other com- 
ponents involved. The proteins in soluble cell extracts can be purified by column 



Figure 8-16 Purification of protein complexes by using a GST-tagged 
fusion protein. GST fusion proteins, produced in cells with recombinant 
DNA techniques, can be captured on an affinity column containing beads 
coated with glutathione. Proteins not bound to the beads are washed 
away. The fusion protein, along with other proteins in the cell that are 
bound tightly to it, can then be eluted with glutathione. The identities of 
these additional proteins can be determined by mass spectrometry (see 
Figure 8-21). Affinity columns can also be made to contain antibodies 
against GST or another convenient small protein or epitope tag (see 
Figure 8-15). 
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chromatography; depending on the type of column matrix, biologically active proteins 
can be separated on the basis of their molecular weight, hydrophobicity, charge char- 
acteristics, or affinity for other molecules. In a typical purification, the sample is 
passed through several different columns in turn — the enriched fractions obtained 
from one column are applied to the next. Recombinant DNA techniques, to be 
described later, allow special recognition tags to be attached to proteins, thereby greatly 
simplifying their purification. 



ANALYZING PROTEINS 

Proteins perform most processes in cells: they catalyze metabolic reactions, use 
nucleotide hydrolysis to do mechanical work, and serve as the major structural 
elements of the cell. The great variety of protein structures and functions has 
stimulated the development of a multitude of techniques to study them. 



Proteins Can Be Separated by SDS Polyacrylamide-Gel 
Electrophoresis 

Proteins usually possess a net positive or negative charge, depending on the 
mixture of charged amino acids they contain. An electric field applied to a solu- 
tion containing a protein molecule causes the protein to migrate at a rate that 
depends on its net charge and on its size and shape. The most popular applica- 
tion of this property is SDS polyacrylamide-gel electrophoresis (SDS-PAGE). It 
uses a highly cross-linked gel of polyacrylamide as the inert matrix through 
which the proteins migrate. The gel is prepared by polymerization of monomers; 
the pore size of the gel can be adjusted so that it is small enough to retard the 
migration of the protein molecules of interest. The proteins themselves are not 
in a simple aqueous solution but in one that includes a powerful negatively 
charged detergent, sodium dodecyl sulfate, or SDS (Figure 8-17). Because this 
detergent binds to hydrophobic regions of the protein molecules, causing them 
to unfold into extended polypeptide chains, the individual protein molecules 
are released from their associations with other proteins or lipid molecules and 
rendered freely soluble in the detergent solution. In addition, a reducing agent 
such as P-mercaptoethanol (see Figure 8-17) is usually added to break any S-S 
linkages in the proteins, so that all of the constituent polypeptides in multisub- 
unit proteins can be analyzed separately. 

What happens when a mixture of SDS-solubilized proteins is run through a 
slab of polyacrylamide gel? Each protein molecule binds large numbers of the 
negatively charged detergent molecules, which mask the protein's intrinsic 
charge and cause it to migrate toward the positive electrode when a voltage is 
applied. Proteins of the same size tend to move through the gel with similar 
speeds because (1) their native structure is completely unfolded by the SDS, so 
that their shapes are the same, and (2) they bind the same amount of SDS and 
therefore have the same amount of negative charge. Larger proteins, with more 
charge, are subjected to larger electrical forces and also to a larger drag. In free 
solution, the two effects would cancel out, but, in the mesh of the polyacry- 
lamide gel, which acts as a molecular sieve, large proteins are retarded much 
more than small ones. As a result, a complex mixture of proteins is fractionated 
into a series of discrete protein bands arranged in order of molecular weight 
(Figure 8-18). The major proteins are readily detected by staining the proteins 
in the gel with a dye such as Coomassie blue. Even minor proteins are seen in 
gels treated with a silver or gold stain, so that as little as 10 ng of protein can be 
detected in a band. 

SDS-PAGE is widely used because it can separate all types of proteins, 
including those that are normally insoluble in water — such as the many proteins 
in membranes. And because the method separates polypeptides by size, it pro- 
vides information about the molecular weight and the subunit composition of 
proteins. Figure 8-19 presents a photograph of a gel that has been used to ana- 
lyze each of the successive stages in the purification of a protein. 
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Figure 8-1 7 The detergent sodium 
dodecyl sulfate (SDS) and the reducing 
agent p-mercaptoethanol. These two 
chemicals are used to solubilize proteins 
for SDS polyacrylamide-gel 
electrophoresis. The SDS is shown here in 
its ionized form. 
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Figure 8-18 SDS polyacrylamide-gel electrophoresis (SDS-PAGE). (A) An 

electrophoresis apparatus. (B) Individual polypeptide chains form a 
complex with negatively charged molecules of sodium dodecyl sulfate 
(SDS) and therefore migrate as a negatively charged SDS-protein complex 
through a porous gel of polyacrylamide. Because the speed of migration 
under these conditions is greater the smaller the polypeptide, this 
technique can be used to determine the approximate molecular weight of 
a polypeptide chain as well as the subunit composition of a protein. If the 
protein contains a large amount of carbohydrate, however, it will move 
anomalously on the gel and its apparent molecular weight estimated by 
SDS-PAGE will be misleading. 
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Specific Proteins Can Be Detected by Blotting with Antibodies 

A specific protein can be identified after its fractionation on a polyacrylamide 
gel hy exposing all the proteins present on the gel to a specific antibody that has 
been coupled to a radioactive isotope, to an easily detectable enzyme, or to a flu- 
orescent dye. For convenience, this procedure is normally carried out after 
transferring (by "blotting") all of the separated proteins present in the gel onto a 
sheet of nitrocellulose paper or nylon membrane. Placing the membrane over 
the gel and driving the proteins out of the gel with a strong electric field trans- 
fers die protein onto the membrane. The membrane is then soaked in a solution 
of labeled antibody to reveal the protein of interest. This method of detecting 
proteins is called Western blotting, or immunoblotting (Figure 8-20). 



Figure 8-19 Analysis of protein samples by SDS polyacrylamide-gel 
electrophoresis. The photograph shows a Coomassie-stained gel that has 
been used to detect the proteins present at successive stages in the 
purification of an enzyme. The leftmost lane (lane 1 ) contains the complex 
mixture of proteins in the starting cell extract, and each succeeding lane 
analyzes the proteins obtained after a chromatographic fractionation of the 
protein sample analyzed in the previous lane (see Figure 8-14). The same 
total amount of protein (1 jug) was loaded onto the gel at the top of each 
lane. Individual proteins normally appear as sharp, dye-stained bands; a band 
broadens, however, when it contains too much protein. (From 
T. Formosa and B.M. Alberts, J. Biol. Chem. 261 :61 07-61 1 8, 1 986.) 




molecular 

weight 

(daltons) 

100,000 



h 40.000 



15,000 



ANALYZING PROTEINS 



519 




* A 



(A) (B) 

Figure 8-20 Western blotting. All the proteins from dividing tobacco cells in culture are first separated by two-dimensional 
polyacrylamide-gel electrophoresis (described in Figure 8-23). In (A), the positions of the proteins are revealed by a 
sensitive protein stain. In (B), the separated proteins on an identical gel were then transferred to a sheet of nitrocellulose 
and exposed to an antibody that recognizes only those proteins that are phosphorylated on threonine residues during 
mitosis. The positions of the dozen or so proteins that are recognized by this antibody are revealed by an enzyme-linked 
second antibody. This technique is also known as immunoblotting (or Western blotting). (From J.A. Traas et al., Plant J. 
2:723-732, 1992. With permission from Blackwell Publishing.) 

Mass Spectrometry Provides a Highly Sensitive Method for 
Identifying Unknown Proteins 

A frequent problem in cell biology and biochemistry is the identification of a 
protein or collection of proteins that has been obtained by one of the purifica- 
tion procedures discussed in the preceding pages (see, for example, Figure 
8-16). Because the genome sequences of most common experimental organ- 
isms are now known, catalogues of all the proteins produced in those organisms 
are available. The task of identifying an unknown protein (or collection of 
unknown proteins) thus reduces to matching some of the amino acid sequences 
present in the unknown sample with known catalogued genes. This task is now 
performed almost exclusively by using mass spectrometry in conjunction with 
computer searches of databases. 

Charged particles have very precise dynamics when subjected to electrical 
and magnetic fields in a vacuum. Mass spectrometry exploits this principle to 
separate ions according to their mass-to-charge Tatio. It is an enormously sensi- 
tive technique. It requires very little material and is capable of determining the 
precise mass of intact proteins and of peptides derived from them by enzymatic 
or chemical cleavage. Masses can be obtained with great accuracy, often with an 
error of less than one part in a million. The most commonly used form of the 
technique is caDed matrix-amsred laser desorption iomzciiion-time-offiigUt 
spectrometry (MALDI-TOF). In this approach, the proteins in the sample are first 
broken into short peptides. These peptides are mixed with an organic acid and 
then dried onto a metal or ceramic slide. A laser then blasts the sample, ejecting 
the peptides from the slide in the form of an ionized gas, in which each molecule 
carries one or more positive charges. The ionized peptides are accelerated in an 
electric field and fly toward a detector. Their mass and charge determines the 
time it takes them to reach the detector: large peptides move more slowly, and 
more highly charged molecules move more quickly. By analyzing those ionized 
peptides that bear a single charge, the precise masses of peptides present in the 
original sample can be determined. MALDI-TOF can also be used to accurately 
measure the mass of intact proteins as large as 200,000 daltons. This information 
is then used to search genomic databases, in which the masses of all proteins 
and of all their predicted peptide fragments have been tabulated from the 
genomic sequences of the organism (Figure 8-21A) . An unambiguous match to 
a particular open reading frame can often be made by knowing the mass of only 
a few peptides derived from a given protein. 

MALDI-TOF provides accurate molecular weight measurements for pro- 
teins and peptides. Moreover, by employing two mass spectrometers in tandem 
(an arrangement known as MS/MS), it is possible to directly determine the 
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Figure 8-21 Use of mass spectrometry to identify proteins and to sequence peptides. An isolated protein is digested 
with trypsin and the peptide fragments are then loaded into the mass spectrometer. Two different approaches can then be 
used to identify the protein. (A) In the first method, peptide masses are measured precisely using MALDI-TOF mass 
spectrometry. Sequence databases are then searched to find the gene that encodes a protein whose calculated tryptic 
digest profile matches these values. (B) Mass spectrometry can also be used to determine directly the amino acid sequence 
of peptide fragments. In this example, tryptic peptides are first separated based on mass within a mass spectrometer. Each 
peptide is then further fragmented, primarily by cleaving its peptide bonds. This treatment generates a nested set of 
peptides, each differing in size by one amino acid. These fragments are fed into a second coupled mass spectrometer, and 
their masses are determined. The difference in masses between two closely related peptides can be used to deduce the 
"missing" amino acid. By repeated applications of this procedure, a partial amino acid sequence of the original protein can 
be determined. For simplicity, the analysis shown begins with a single species of purified protein. In reality, mass 
spectrometry is usually carried out on mixtures of proteins, such as those obtained for affinity chromatography 
experiments (see Figure 8-1 6), and can identify all the proteins present in the mixtures. As explained in the text, mass 
spectrometry can also detect post-translational modifications of proteins. 



amino acid sequences of individual peptides in a complex mixture. As described 
above, the protein sample is first broken down into smaller peptides, which are 
separated from each other by mass spectrometry. Each peptide is then further 
fragmented through collisions with high-energy gas atoms. This method of frag- 
mentation preferentially cleaves the peptide bonds, generating a ladder of frag- 
ments, each differing by a single amino acid. The second mass spectrometer 
then separates these fragments and displays their masses. The amino acid 
sequence of a peptide can then be deduced from these differences in mass (Fig- 
ure 8-21B). 

MS/MS is particularly useful for detecting and precisely mapping post- 
translational modifications of proteins, such as phosphorylations or acetyla- 
tions. Because these modifications impart a characteristic mass increase to an 
amino acid, they are easily detected by mass spectrometry. As described in 
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Chapter 3, proteomics, a general term that encompasses many different experi- 
mental techniques, is the characterization of all proteins in the cell, including all 
protein-protein interactions and all post-translational modifications. In combi- 
nation with the rapid purification techniques discussed in the last section, mass 
spectrometry has emerged as the most powerful method for mapping both the 
post-translational modifications of a given protein and the proteins that remain 
associated with it during purification. 



Two-Dimensional Separation Methods are Especially Powerful 

Because different proteins can have similar sizes, shapes, masses, and overall 
charges, most separation techniques such as SDS polyacrylamide-gel elec- 
trophoresis or ion-exchange chromatography cannot typically display all the 
proteins in a cell or even in an organelle. In contrast, two-dimensional gel elec- 
trophoresis, which combines two different separation procedures, can resolve 
up to 2000 proteins— the total number of different proteins in a simple bac- 
terium — in the form of a two-dimensional protein map. 

In the first step, the proteins are separated by their intrinsic charges. The 
sample is dissolved in a small volume of a solution containing a nonionic 
(uncharged) detergent, together with [3-mercaptoethanol and the denaturing 
reagent urea. This solution solubilizes, denatures, and dissociates all the 
polypeptide chains but leaves their intrinsic charge unchanged. The polypep- 
tide chains are then separated in a pH gradient by a procedure called isoelectric 
focusing, which takes advantage of the variation in the net charge on a protein 
molecule with the pH of its surrounding solution. Every protein has a character- 
istic isoelectric point, the pH at which the protein has no net charge and there- 
fore does not migrate in an electric field. In isoelectric focusing, proteins are 
separated electrophoretically in a narrow tube of polyacrylamide gel in which a 
gradient of pH is established by a mixture of special buffers. Each protein moves 
to a position in the gradient that corresponds to its isoelectric point and remains 
there (Figure 8-22). This is the first dimension of two-dimensional polyacry- 
lamide-gel electrophoresis. 

In the second step, the narrow gel containing the separated proteins is again 
subjected to electrophoresis but in a direction that is at a right angle to the direc- 
tion used in the first step. This time SDS is added, and the proteins separate 
according to their size, as in one-dimensional SDS-PAGE: the original narrow gel 
is soaked in SDS and then placed on one edge of an SDS polyacrylamide-gel slab, 
through which each polypeptide chain migrates to form a discrete spot. This is 
the second dimension of two-dimensional polyacrylamide-gel electrophoresis. 
The only proteins left unresolved are those that have both identical sizes and 
identical isoelectric points, a relatively rare situation. Even trace amounts of each 
polypeptide chain can be detected on the gel by various staining procedures— or 
by autoradiography if the protein sample was initially labeled with a radioisotope 
(Figure 8-23). The technique has such great resolving power that it can distin- 
guish between two proteins that differ in only a single charged amino acid. 
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Figure 8-22 Separation of protein 
molecules by isoelectric focusing. At low 
pH (high H + concentration), the 
carboxylic acid groups of proteins tend to 
be uncharged (-COOH) and their 
nitrogen-containing basic groups fully 
charged (for example, -NH 3 + ), giving 
most proteins a net positive charge. At 
high pH, the carboxylic acid groups are 
negatively charged {-COO") and the basic 
groups tend to be uncharged (for 
example, -NH 2 ), giving most proteins a 
net negative charge. At its isoelectric phi, 
a protein has no net charge since the 
positive and negative charges balance. 
Thus, when a tube containing a fixed pH 
gradient is subjected to a strong electric 
field in the appropriate direction, each 
protein species present migrates until it 
forms a sharp band at its isoelectric pH, 
as shown. 
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Figure 8-23 Two-dimensional 
polyacrylamide-gel electrophoresis. All 

the proteins in an £ coli bacterial cell are 
separated in this gel, in which each spot 
corresponds to a different polypeptide 
chain. The proteins were first separated 
on the basis of their isoelectric points by 
isoelectric focusing from left to right. 
They were then further fractionated 
according to their molecular weights by 
electrophoresis from top to bottom in the 
presence of SDS. Note that different 
proteins are present in very different 
amounts. The bacteria were fed with a 
mixture of radioisotope-labeled amino 
acids so that all of their proteins were 
radioactive and could be detected by 
autoradiography (see pp. 602-603). 
(Courtesy of Patrick O'Farrell.) 



A different, even more powerful, "two-dimensional" technique is now avail- 
able when the aim is to determine all of the proteins present in an organelle or 
another complex mixture of proteins. Because the technique relies on mass 
spectroscopy, it requires that the proteins be from an organism with a com- 
pletely sequenced genome. First, the mixture of proteins present is digested 
with trypsin to produce short peptides. Next, these peptides are separated by a 
series of automated liquid chromatography steps. As the second dimension, 
each separated peptide is fed directly into a tandem mass spectrometer 
(MS/MS) that allows its amino acid sequence, as well as any post-translational 
modifications, to be determined. This arrangement, in which a tandem mass 
spectrometer (MS/MS) is attached to the output of an automated liquid chro- 
matography (LC) system, is referred to as LC-MS/MS. It is now becoming rou- 
tine to subject an entire organelle preparation to LC-MS/MS analysis and to 
identify hundreds of proteins and their modifications. Of course, no organelle 
isolation procedure is perfect, and some of the proteins identified will be con- 
taminating proteins. These can often be excluded by analyzing neighboring 
fractions from the organelle purification and "subtracting" them out from the 
peak organelle fractions. 



Hydrodynamic Measurements Reveal the Size and Shape of a 
Protein Complex 

Most proteins in a cell act as part of larger complexes, and knowledge of the size 
and shape of these complexes often leads to insights regarding their function. 
This information can be obtained in several important ways. Sometimes, a com- 
plex can be directly visualized using electron microscopy, as described in Chap- 
ter 9. A complementary approach relies on the hydrodynamic properties of a 
complex, that is, its behavior as it moves through a liquid medium. Usually, two 
separate measurements are made. One measure is the velocity of a complex as it 
moves under the influence of a centrifugal field produced by an ultracentrifuge 
(see Figure 8-1 1A). The sedimentation constant (or S-value) obtained depends 
on both the size and the shape of the complex and does not, by itself, convey 
especially useful information. However, once a second hydrodynamic measure- 
ment is performed— by charting the migration of a complex through a gel-filtra- 
tion chromatography column (see Figure 8-13B}— both the approximate shape 
of a complex and its molecular weight can be calculated. 

Molecular weight can also be determined more directly by using an analytical 
ultracentrifuge, a complex device that allows protein absorbance measurements 
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to be made on a sample while it is subjected to centrifugal forces. In this 
approach, the sample is centrifuged until it reaches equilibrium, where the cen- 
trifugal force on a protein complex exactly balances its tendency to diffuse away. 
Because this balancing point is dependent on a complex's molecular weight but 
not on its particular shape, the molecular weight can be directly calculated, as 
needed to determine the stoichiometry of each protein in a protein complex. 

Sets of Interacting Proteins Can Be Identified by Biochemical 
Methods 

Because most proteins in the cell function as part of complexes with other pro- 
teins, an important way to begin to characterize the biological role of an unknown 
protein is to identify all of the other proteins to which it specifically binds. 

One method for identifying proteins that bind to one another tightly is co- 
immunoprecipitation. In this case, an antibody recognizes a specific target pro- 
tein; reagents that bind to the antibody and are coupled to a solid matrix then 
drag the complex out of solution to the bottom of a test tube. If the original tar- 
get protein is associated tightly enough with another protein when it is captured 
by the antibody, the partner precipitates as well. This method is useful for iden- 
tifying proteins that are part of a complex inside cells, including those that inter- 
act only transiently — for example, when extracellular signal molecules stimulate 
cells (discussed in Chapter 15). Another method frequently used to identify a 
protein's binding partners is protein affinity chromatography (see Figure 8-13C). 
To employ this technique to capture interacting proteins, a target protein is 
attached to polymer beads that are packed into a column. When the proteins in 
a cell extract are washed through this column, those proteins that interact with 
the target protein are retained by the affinity matrix. These proteins can then be 
eluted and their identity determined by mass spectrometry 

In addition to capturing protein complexes on columns or in test tubes, 
researchers are developing high-density protein arrays to investigate protein 
interactions. These arrays, which contain thousands of different proteins or 
antibodies spotted onto glass slides or immobilized in tiny wells, allow one to 
examine the biochemical activities and binding profiles of a large number of 
proteins at once. For example, if one incubates a fluorescently labeled protein 
with arrays containing thousands of immobilized proteins, the spots that 
remain fluorescent after extensive washing each contain a protein to which the 
labeled protein specifically binds. 

Protein-Protein Interactions Can Also Be Identified by a 
Two-Hybrid Technique in Yeast 

Thus far, we have emphasized biochemical approaches to the study of pro- 
tein-protein interactions. However, a particularly powerful strategy, called the 
two-hybrid system, relies on exploiting the cell's own mechanisms to reveal pro- 
tein-protein interactions. 

The technique takes advantage of the modular nature of gene activator pro- 
teins (see Figure 7-45). These proteins both bind to specific DNA sequences 
and activate gene transcription, and these activities are often performed by two 
separate protein domains. Using recombinant DNA techniques, two such pro- 
tein domains are used to create separate "bait" and "prey" fusion proteins. To 
create the "bait" fusion protein, the DNA sequence that codes for a target pro- 
tein is fused with DNA that encodes the DNA-binding domain of a gene activa- 
tor protein. When this construct is introduced into yeast, the cells produce the 
fusion protein, with the target protein attached to this DNA-binding domain 
(Figure 8-24). This fusion protein binds to the regulatory region of a reporter 
gene, where it serves as "bait" to fish for proteins that interact with the target 
protein. To search for potential binding partners (potential prey for the bait), 
the candidate proteins also have to be constructed as fusion proteins: DNA 
encoding the activation domain of a gene activator protein is fused to a large 
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Figure 8-24 The yeast two-hybrid 
system for detecting protein-protein 

interactions. The target protein is fused 
to a DNA-binding domain that directs the 
fusion protein to the regulatory region of 
a reporter gene as "bait." When this target 
protein binds to another specially 
designed protein in the cell nucleus 
("prey"), their interaction brings together 
two halves of a transcriptional activator, 
which then switches on the expression of 
the reporter gene. 



number of different genes. Members of this collection of genes— encoding 
potential "prey"— are introduced individually into yeast cells containing the 
bait. If the yeast cell receives a DNA clone that expresses a prey partner for the 
bait protein, the two halves of a transcriptional activator are united, switching 
on the reporter gene (see Figure 8-24). 

This ingenious technique sounds complex, but the two-hybrid system is rel- 
atively simple to use in the laboratory. Although the protein-protein interactions 
occur in the yeast cell nucleus, proteins from every part of the cell and from any 
organism can be studied in this way. The two-hybrid system has been scaled up 
to map the interactions that occur among all of the proteins an organism pro- 
duces. In this case, a set of bait and prey fusions is produced for every cell pro- 
tein, and every bait/prey combination can be monitored. In this way protein 
interaction maps have been generated for most of the proteins in yeast, C. ele- 
gans, and Drosophila. 



Combining Data Derived from Different Techniques Produces 
Reliable Protein-Interaction Maps 

As previously discussed in Chapter 3, extensive protein-interaction maps can be 
very useful for identifying the functions of proteins (see Figure 3-82) . For this rea- 
son, both the two-hybrid method and the biochemical technique discussed ear- 
lier known as tap-tagging (see pp. 515-516) have been automated to determine 
the interactions between thousands of proteins. Unfortunately different results 
are found in different experiments, and many of the interactions detected in one 
laboratory are not detected in another. Therefore, the most useful protein-inter- 
action maps are those that combine data from many experiments, requiring that 
each interaction in the map be confirmed by more than one technique. 



Optical Methods Can Monitor Protein Interactions in Real Time 

Once two proteins— or a protein and a small molecule— are known to associate, 
it becomes important to characterize their interaction in more detail. Proteins 
can associate with each other more or less permanently (like the subunits of 
RNA polymerase or the proteosome), or engage in transient encounters that 
may last only a few milliseconds (like a protein kinase and its substrate). 

To understand how a protein functions inside a cell, we need to determine 
how tightly it binds to other proteins, how rapidly it dissociates from them, and 
how covalent modifications, small molecules, or other proteins influence these 
interactions. Such studies of protein dynamics often employ optical methods. 
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Certain amino acids (for example, tryptophan) exhibit weak fluorescence 
that can be detected with sensitive fluorimeters. In many cases, the fluorescence 
intensity, or the emission spectrum of fluorescent amino acids located in a pro- 
tein-protein interface, will change when the proteins associate. When this 
change can be detected by fluorimetry, it provides a sensitive and quantitative 
measure of protein binding. 

A particularly useful method for monitoring the dynamics of a protein's 
binding to other molecules is called surface plasmon resonance (SPR). The SPR 
method has been used to characterize a wide variety of molecular interactions, 
including antibody-antigen binding, ligand-receptor coupling, and the binding 
of proteins to DNA, carbohydrates, small molecules, and other proteins. 

SPR detects binding interactions by monitoring the reflection of a beam of 
light off the interface between an aqueous solution of potential binding 
molecules and a biosensor surface carrying an immobilized bait protein. The 
bait protein is attached to a very thin layer of metal that coats one side of a glass 
prism (Figure 8-25). A light beam is passed through the prism; at a certain angle, 
called the resonance angle, some of the energy from the light interacts with the 
cloud of electrons in the metal film, generating a plasmon — an oscillation of the 
electrons at right angles to the plane of the film, bouncing up and down between 
its upper and lower surfaces like a weight on a spring. The plasmon, in turn, gen- 
erates an electrical field that extends a short distance — about the wavelength of 
the light — above and below the metal surface. Any change in the composition of 
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(B) The binding of prey molecules to bait molecules increases the refractive index of the surface layer. 
This alters the resonance angle for plasmon induction, which can be measured by a detector. 
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Figure 8-25 Surface plasmon resonance. 

(A) SPR can detect binding interactions by 
monitoring the reflection of a beam of 
light off the interface between an aqueous 
solution of potential binding molecules 
(green) and a biosensor surface coated 
with an immobilized bait protein (red). 

(B) A solution of prey proteins is allowed to 
flow past the immobilized bait protein. 
Binding of prey molecules to the bait 
protein produces a measurable change in 
the resonance angle, as does their 
dissociation when a buffer solution washes 
them off. These changes, monitored in real 
time, reflect the association and 
dissociation of the molecular complexes. 
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the environment within the range of the electrical field will cause a measurable 
change in the resonance angle. 

To measure binding, a solution containing proteins (or other molecules) 
that might interact with the immobilized bait protein is allowed to flow past the 
biosensor surface. Proteins binding to the bait change the composition of the 
molecular complexes on the metal surface, causing a change in the resonance 
angle (see Figure 8-25). The changes in the resonance angle are monitored in 
real time and reflect the kinetics of the association— or dissociation— of 
molecules with the bait protein. The association rate (k on ) is measured as the 
molecules interact, and the dissociation rate (A^ ff ) is determined as buffer 
washes the bound molecules from the sensor surface. A binding constant {K) is 
calculated by dividing k n by k on . In addition to determining the kinetics, SPR 
can be used to determine the number of molecules that are bound in each com- 
plex: the magnitude of the SPR signal change is proportional to the mass of the 
immobilized complex. 

The SPR method is particularly useful because it requires only small amounts 
of the protein, the protein does not have to be labeled in any way, and the inter- 
actions of the protein with other molecules can be monitored in real time. 

A third optical method for probing protein interactions uses green fluorescent 
protein (discussed in detail below) and its derivatives of different colors. In this 
application, two proteins of interest are each labeled with a different fluo- 
rochrome, such that the emission spectrum of one fluorochrome overlaps the 
absorption spectrum of the second fluorochrome. If the two proteins— and their 
attached fluorochromes— come very close to each other (within about 1-10 nm), 
the energy of the absorbed light is transferred from one fluorochrome to the 
other. The energy transfer, called fluorescence resonance energy transfer 
(FRET), is determined by illuminating the first fluorochrome and measuring 
emission from the second (Figure 8-26). This technique is especially powerful 
because, when combined with fluorescence microscopy, it can be used to char- 
acterize protein-protein interactions at specific locations inside living cells. 



Some Techniques Can Monitor Single Molecules 

The biochemical methods described so far in this chapter are used to study large 
populations of molecules, a limitation that reflects the small size of typical bio- 
logical molecules relative to the sensitivity of the methods to detect them. How- 
ever, the recent development of highly sensitive and precise measurement 
methods has created a new branch of biophysics— the study of single molecules. 
Single-molecule studies are particularly important in cell biology because many 
processes rely on the activities of only a few critical molecules in the cell 
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Figure 8-26 Fluorescence resonance 
energy transfer (FRET). To determine 
whether (and when) two proteins 
interact inside a cell, the proteins are first 
produced as fusion proteins attached to 
different color variants of green 
fluorescent protein (GFP). (A) In this 
example, protein X is coupled to a blue 
fluorescent protein, which is excited by 
violet light (370-440 nm) and emits blue 
light (440-480 nm); protein Y is coupled 
to a green fluorescent protein, which is 
excited by blue light and emits green 
light (510 nm). (B) If protein X and Y do 
not interact, illuminating the sample with 
violet light yields fluorescence from the 
blue fluorescent protein only. (C) When 
protein X and protein Y interact, FRET can 
now occur. Illuminating the sample with 
violet light excites the blue fluorescent 
protein, whose emission in turn excites 
the green fluorescent protein, resulting in 
an emission of green light. The 
fluorochromes must be quite close 
together— within about 1-10 nm of one 
another— for FRET to occur. Because not 
every molecule of protein X and protein Y 
is bound at all times, some blue light may 
still be detected. But as the two proteins 
begin to interact emission from the 
donor GFP falls as the emission from the 
acceptor GFP rises. 
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The first example of a technique for studying the function of single protein 
molecules was the use of a patch electrode to measure current flow through sin- 
gle ion channels (see Figure 11-33). Another approach is to attach the protein to 
a larger structure, such as a polystyrene bead, which can then be observed by 
conventional microscopy This strategy has been particularly useful in measuring 
the movements of motor proteins. For example, molecules of the motor protein 
kinesin (discussed in Chapter 16) can be attached to a bead, and by observing the 
kinesin-attached bead moving along a microtubule, the step size of the motor 
(that is, the distance moved for each ATP molecule hydrolyzed) can be measured. 
As we will see in Chapter 9, optical microscopes have a limited resolution due to 
the diffraction of light, but computational and optical methods can be used to 
determine the position of a bead to a much finer precision than the resolution 
limit of the microscope. Using such techniques, extremely small movements — on 
the order of nanometers — can easily be detected and quantified. 

Another advantage of attaching molecules to large beads is that these beads 
can serve as "handles" by which the molecules can be manipulated. This allows 
forces to be applied to the molecules, and their response observed. For example, 
the speed or step size of a motor can be measured as a function of the force it is 
pulling against. As discussed in the next chapter, a focused laser beam can be 
used as "optical tweezers" to generate a mechanical force on a bead, allowing 
motor proteins to be studied under an applied force (see Figure 9-35). <CGCG> 
Beads can also be manipulated using magnetic fields, a technology known as 
"magnetic tweezers." If multiple beads are present in a magnetic field, they will 
all experience the same force, potentially allowing large numbers of beads to be 
manipulated in parallel in a single experiment. 

While beads can be used as markers to track protein movements, it is clearly 
preferable to be able to visualize the proteins themselves. In the next chapter, we 
shall see that recent refinements in microscopy have now made this possible. 



Protein Function Can Be Selectively Disrupted With Small 
Molecules 

Chemical inhibitors have contributed to the development of cell biology. For 
example, the microtubule inhibitor colchicine is routinely used to test whether 
microtubules are required for a given biological process; it also led to the first 
purification of tubulin several decades ago. In the past, these small molecules 
were usually natural products; that is, they were synthesized by living creatures. 
Although, as a whole, natural products have been extraordinarily useful in science 
and medicine (see, for example, Table 6-4, p. 385), they acted on a limited number 
of biological processes. However, the recent development of methods to synthe- 
size hundreds of thousands of small molecules and to carry out large-scale auto- 
mated screens holds the promise of identifying chemical inhibitors for virtually 
any biological process. In such approaches, large collections of small chemical 
compounds are simultaneously tested, either on living cells or in cell-free assays. 
Once an inhibitor is identified, it can be used as a probe to identify, through affin- 
ity chromatography (see Figure 8-13C) or other means, the protein to which the 
inhibitor binds. This general strategy, often called chemical biology, has success- 
fully identified inhibitors of many proteins that carry out key processes in cell biol- 
ogy. The kinesin protein that functions in mitosis, for example, was identified by 
this method (Figure 8-27). Chemical inhibitors give the cell biologist great control 
over the timing of inhibition, as drugs can be rapidly added to or removed from 
cells, allowing protein function to be switched on or off quickly. 



Protein Structure Can Be Determined Using X-Ray Diffraction 

The main technique that has been used to discover the three-dimensional struc- 
ture of molecules, including proteins, at atomic resolution is x-ray crystallogra- 
phy. X-rays, like light, are a form of electromagnetic radiation, but they have a 
much shorter wavelength, typically around 0.1 nm (the diameter of a hydrogen 
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Figure 8-27 Small-molecule inhibitors 
for manipulating living cells. 

(A) Chemical structure of monastrol, a 
kinesin inhibitor identified in a large- 
scale screen for small molecules that 
disrupt mitosis. (B) Normal mitotic 
spindle seen in an untreated cell. The 
microtubules are stained green and 
chromosomes blue. (C) Monopolar 
spindle that forms in cells treated with 
monastrol. (B and C, from T.U. Mayer et 
al., Science 286:971-974, 1 999. With 
permission from AAAS.) 
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Figure 8-28 X-ray crystallography. (A) A narrow parallel beam of x-rays is 
directed at a well-ordered crystal (B). Shown here is a protein crystal of ribulose 
bisphosphate carboxylase, an enzyme with a central role in C0 2 fixation during 
photosynthesis. The atoms in the crystal scatter some of the beam, and the 
scattered waves reinforce one another at certain points and appear as a pattern 
of diffraction spots (C). This diffraction pattern, together with the amino acid 
sequence of the protein, can be used to produce an atomic model (D). The 
complete atomic model is hard to interpret, but this simplified version, derived 
from the x-ray diffraction data, shows the protein's structural features clearly (a 
helices, green; (3 strands, red). The components pictured in A to D are not shown 
to scale. (B, courtesy of C. Branden; C, courtesy of J. Hajdu and I. Andersson; 
D, adapted from original provided by B. Furugren.) 



atom). If a narrow parallel beam of x-rays is directed at a sample of a pure pro- 
tein, most of the x-rays pass straight through it. A small fraction, however, are 
scattered by the atoms in the sample. If the sample is a well-ordered crystal, the 
scattered waves reinforce one another at certain points and appear as diffraction 
spots when recorded by a suitable detector (Figure 8-28). 

The position and intensity of each spot in the x-ray diffraction pattern con- 
tain information about the locations of the atoms in the crystal that gave rise to 
it. Deducing the three-dimensional structure of a large molecule from the 
diffraction pattern of its crystal is a complex task and was not achieved for a pro- 
tein molecule until 1960. But in recent years x-ray diffraction analysis has 
become increasingly automated, and now the slowest step is likely to be the gen- 
eration of suitable protein crystals. This step requires large amounts of very pure 
protein and often involves years of trial and error to discover the proper crystal- 
lization conditions; the pace has greatly accelerated with the use of recombinant 
DNA techniques to produce pure proteins and robotic techniques to test large 
numbers of crystallization conditions. 

Analysis of the resulting diffraction pattern produces a complex three- 
dimensional electron-density map. Interpreting this map— translating its con- 
tours into a three-dimensional structure— is a complicated procedure that 
requires knowledge of the amino acid sequence of the protein. Largely by trial 
and error, the sequence and the electron-density map are correlated by com- 
puter to give the best possible fit. The reliability of the final atomic model 
depends on the resolution of the original crystallographic data: 0.5 nm resolu- 
tion might produce a low- resolution map of the polypeptide backbone, whereas 
a resolution of 0.15 nm allows all of the non-hydrogen atoms in the molecule to 
be reliably positioned. 

A complete atomic model is often too complex to appreciate directly, but 
simplified versions that show a protein's essential structural features can be 
readily derived from it (see Panel 3-2, pp. 132-133). The three-dimensional 
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structures of about 20,000 different proteins have now been determined by x-ray 
crystallography or by NMR spectroscopy (see below) — enough to begin to see 
families of common structures emerging. These structures or protein folds often 
seem to be more conserved in evolution than are the amino acid sequences that 
form them (see Figure 3-13). 

X-ray crystallographic techniques can also be applied to the study of 
macromolecular complexes. In a recent triumph, the method was used to 
determine the structure of the ribosome, a large and complex machine made 
of several RNAs and more than 50 proteins (see Figure 6-64). The determina- 
tion required the use of a synchrotron, a radiation source that generates x-rays 
with the intensity needed to analyze the crystals of such large macromolecu- 
lar complexes. <GGCC> 



NMR Can Be Used to Determine Protein Structure in Solution 

Nuclear magnetic resonance (NMR) spectroscopy has been widely used for many 
years to analyze the structure of small molecules. This technique is now also 
increasingly applied to the study of small proteins or protein domains. Unlike x- 
ray crystallography NMR does not depend on having a crystalline sample. It sim- 
ply requires a small volume of concentrated protein solution that is placed in a 
strong magnetic field; indeed, it is the main technique that yields detailed evi- 
dence about the three-dimensional structure of molecules in solution. 

Certain atomic nuclei, particularly hydrogen nuclei, have a magnetic 
moment or spin: that is, they have an intrinsic magnetization, like a bar magnet. 
The spin aligns along the strong magnetic field, but it can be changed to a mis- 
aligned, excited state in response to applied radiofrequency (RF) pulses of elec- 
tromagnetic radiation. When the excited hydrogen nuclei return to their aligned 
state, they emit RF radiation, which can be measured and displayed as a spec- 
trum. The nature of the emitted radiation depends on the environment of each 
hydrogen nucleus, and if one nucleus is excited, it influences the absorption and 
emission of radiation by other nuclei that lie close to it. It is consequently possi- 
ble, by an ingenious elaboration of the basic NMR technique known as two- 
dimensional NMR, to distinguish the signals from hydrogen nuclei in different 
amino acid residues, and to identify and measure the small shifts in these signals 
that occur when these hydrogen nuclei lie close enough together to interact. 
Because the size of such a shift reveals the distance between the interacting pair 
of hydrogen atoms, NMR can provide information about the distances between 
the parts of the protein molecule. By combining this information with a knowl- 
edge of the amino acid sequence, it is possible in principle to compute the three- 
dimensional structure of the protein (Figure 8-29). 
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Figure 8-29 NMR spectroscopy. (A) An 

example of the data from an NMR 
machine. This two-dimensional NMR 
spectrum is derived from the C-terminal 
domain of the enzyme cellulase.The 
spots represent interactions between 
hydrogen atoms that are near neighbors 
in the protein and hence reflect the 
distance that separates them. Complex 
computing methods, in conjunction with 
the known amino acid sequence, enable 
possible compatible structures to be 
derived. (B)Ten structures of the enzyme, 
which all satisfy the distance constraints 
equally well, are shown superimposed on 
one another, giving a good indication of 
the probable three-dimensional 
structure. (Courtesy of P. Kraulis.) 
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For technical reasons the structure of small proteins of about 20,000 daltons 
or less can be most readily determined by NMR spectroscopy Resolution 
decreases as the size of a macromolecule increases. But recent technical 
advances have now pushed the limit to about 100,000 daltons, thereby making 
the majority of proteins accessible for structural analysis by NMR. 

Because NMR studies are performed in solution, this method also offers a 
convenient means of monitoring changes in protein structure, for example dur- 
ing protein folding or when the protein binds to another molecule. NMR is also 
used widely to investigate molecules other than proteins and is valuable, for 
example, as a method to determine the three-dimensional structures of RNA 
molecules and the complex carbohydrate side chains of glycoproteins. 

Some landmarks in the development of x-ray crystallography and NMR are 
listed in Table 8-2. 

Protein Sequence and Structure Provide Clues About Protein 
Function 

Having discussed methods for purifying and analyzing proteins, we now turn to 
a common situation in cell and molecular biology: an investigator has identified 
a gene important for a biological process but has no direct knowledge of the bio- 
chemical properties of its protein product. 

Thanks to the proliferation of protein and nucleic acid sequences that are 
catalogued in genome databases, the function of a gene— and its encoded pro- 
tein—can often be predicted by simply comparing its sequence with those of 
previously characterized genes (see Figure 3-14). Because amino acid sequence 

Table 8-2 Landmarks in the Development of X-ray Crystallography and NMR and Their Application to Biological Molecules 
1 864 Hoppe-Seyler crystallizes, and names, the protein hemoglobin. 

1 895 Rontgen observes that a new form of penetrating radiation, which he names x-rays, is produced when cathode rays 

(electrons) hit a metal target. 

1 91 2 Von Laue obtains the first x-ray diffraction patterns by passing x-rays through a crystal of zinc sulfide. 

W.L Bragg proposes a simple relationship between an x-ray diffraction pattern and the arrangement of atoms in a 
crystal that produce the pattern. 

1 926 Summer obtains crystals of the enzyme urease from extracts of jack beans and demonstrates that proteins possess 

catalytic activity. 

1 93 1 Pauling publishes his first essays on The Nature of the Chemical Bond/ detailing the rules of covalent bonding. 

1 934 Bernal and Crowfoot present the first detailed x-ray diffraction patterns of a protein obtained from crystals of the 
enzyme pepsin. 

1935 Patterson develops an analytical method for determining interatomic spacings from x-ray data. 
1 941 Astbury obtains the first x-ray diffraction pattern of DNA. 

1946 Block and Purcell describe NMR. 

1951 Pauling and Corey propose the structure of a helical conformation of a chain of L-amino acids— the a helix— and the 

structure of the p sheet, both of which were later found in many proteins. 

1 953 Watson and Crick propose the double-helix model of DNA, based on x-ray diffraction patterns obtained by Franklin 
and Wilkins. 

1954 Perutz and colleagues develop heavy-atom methods to solve the phase problem in protein crystallography. 

1 960 Kendrew describes the first detailed structure of a protein (sperm whale myoglobin) to a resolution of 0.2 nm, and 

Perutz presents a lower-resolution structure of the larger protein hemoglobin. 
1 966 Phillips describes the structure of lysozyme, the first enzyme to have its structure analyzed in detail. 

1971 Jeener proposes the use of two-dimensional NMR, and Wuthrich and colleagues first use the method to solve a protein 

structure in the early 1 980s. 

1 976 Kim and Rich and Klug and colleagues describe the detailed three-dimensional structure of tRNA determined by x-ray 

diffraction. 

1 977-1 978 Holmes and Klug determine the structure of tobacco mosaic virus (TMV), and Harrison and Rossman determine the 
structure of two small spherical viruses. 

1 985 Michel, Deisenhofer and colleagues determine the first structure of a transmembrane protein (a bacterial reaction 

center) by x-ray crystallography. Henderson and colleagues obtain the structure of bacteriorhodopsin, a 
transmembrane protein, by high-resolution electron-microscopy methods between 1975 and 1990. 



ANALYZING PROTEINS 



531 



Score = 399 bits (1025), Expect = e-111 ^^^^^_— 

Identities = 198/290 (68%), Positives = 241/290 (82%), Gaps ■ 1/290 

Query: 57 MENFQKVEKIGEGTYGWYKARNKLTGEWALKKIRLDTETEGVPSTAIREISLLKELNH 116 

ME ++KVEKIGEGTYGWYKA +K T E +ALKKIRL+ E EGVPSTAIREISLL*E+NH 
SbjCtt 1 MEQYEKVEKIGEGTYGWYKALDKATNETIALKKIRLEQEDEGVPSTAIREISLLKEMNH 60 

Queryt 117 PNIVKLLDVIHTEMKLYLVFEFLHQDLKKFMDASALO?GIPLPLIKSYLFQLLQGLAFCHS 176 

NIV+L DV+H+E ++YLVFE+L DLKKFMD+ LIKSYL+Q+L G+A+CHS 

Sbjct: 61' GNIVRLHDVVHSEimi-YLVFEYU^DLKKFMDS^i^FiUCI«FI!LIKSYLYQILSGVAYCHS 120 

V 
Query i 177 HRVLHRDLKPQNLLINTB^GAIKLADFGLARAFGVPVRTYTHEWTLWYRAPEILLGOK¥ 235 

HRVLHRDLKPQNLLI+ A+KLADFGLARAFG+PVRT+THEWTLWYRAPEILLG + 

Sbjct: 121 HRVXHRDLKPQNLLIDttl^lALKLADFGLARAFGIPVRTFTHEVVTLWYRAPEILLGilRO 180 

Query: 236 YSTftVDIWSLGCIFAEMV^mRILLFPGDSEIDQLFRIFRTljGTPDEVVWPGVTaiMPDYKfS 295 

YST VD+WS+GCIFAEMV ++ LFPGDSEID+LF+IFR LGTP+E WPGV+ +PD+K + 
Sbjct: 181 YSTFVDVWSVGCIFAEMVHQKtLFPGDSEIDELFKIFRILGTPNE^WPGVSCLPDFK^A 240 

Query: 296 FPKW»JlQD»SK\^PtLD^)G|tgLLSQMLHYDPNKRISAKAALAH^FFQDV 345 

FP+W QD + WP LD G LLS+ML Y+P+KRI+A+ AL H +F+D+ 
Sbjct: 241 FPRW£^3D£AT\A^P^D^G|^LSKJC^YEPSKRITAR0ALEHEYFKDL 290 

determines protein structure, and structure dictates biochemical function, pro- 
teins that share a similar amino acid sequence usually have the same structure 
and usually perform similar biochemical functions, even when they are found in 
distantly related organisms. In modern cell biology, the study of a newly discov- 
ered protein usually begins with a search for previously characterized proteins 
that are similar in their amino acid sequences. 

Searching a collection of known sequences for homologous genes or pro- 
teins is typically done over the World Wide Web, and it simply involves selecting 
a database and entering the desired sequence. A sequence alignment program — 
the most popular are BLAST and FASTA — scans the database for similar 
sequences by sliding the submitted sequence along the archived sequences 
until a cluster of residues falls into full or partial alignment (Figure 8-30). The 
results of even a complex search — which can be performed on either a 
nucleotide or an amino acid sequence — are returned within minutes. Such com- 
parisons can predict the functions of individual proteins, families of proteins, or 
even most of the protein complement of a newly sequenced organism. 

As was explained in Chapter 3, many proteins that adopt the same confor- 
mation and have related functions are too distantly related to be identified as 
clearly homologous from a comparison of their amino acid sequences alone (see 
Figure 3-13). Thus, an ability to reliably predict the three dimensional structure 
of a protein from its amino acid sequence would improve our ability to infer pro- 
tein function from the sequence information in genomic databases. In recent 
years, major progress has been made in predicting the precise structure of a pro- 
tein. These predictions are based, in part, on our knowledge of tens of thousands 
of protein structures that have already been determined by x-ray crystallography 
and NMR spectroscopy and, in part, on computations using our knowledge of 
the physical forces acting on the atoms. However, it remains a substantial and 
important challenge to predict the structures of proteins that are large or have 
multiple domains, or to predict structures at the very high levels of resolution 
needed to assist in computer-based drug discovery. 

While finding homologous sequences and structures for a new protein will 
provide many clues about its function, it is usually necessary to test these 
insights through direct experimentation. However, the clues generated from 
sequence comparisons typically point the investigator in the correct experi- 
mental direction, and their use has therefore become one of the most important 
strategies in modern cell biology. 



Figure 8-30 Results of a BLAST search. 

Sequence databases can be searched to 
find similar amino acid or nucleic acid 
sequences. Here, a search for proteins 
similar to the human cell-cycle regulatory 
protein Cdc2 (Query) locates maize Cdc2 
(Sbjct), which is 68% identical (and 82% 
similar) to human Cdc2 in its amino acid 
sequence. The alignment begins at residue 
57 of the Query protein, suggesting that 
the human protein has an N-terminal 
region that is absent from the maize 
protein. The green blocks indicate 
differences in sequence, and the yellow bar 
summarizes the similarities: when the two 
amino acid sequences are identical, the 
residue is shown; conservative amino acid 
substitutions are indicated by a plus sign 
(+). Only one small gap has been 
introduced— indicated by the red arrow at 
position 1 94 in the Query sequence — to 
align the two sequences maximally. The 
alignment score (Score), which is expressed 
in two different types of units, takes into 
account penalties for substitutions and 
gaps; the higher the alignment score, the 
better the match. The significance of the 
alignment is reflected in the Expectation (E) 
value, which specifies how often a match 
this good would be expected to occur by 
chance. The lower the E value, the more 
significant the match; the extremely low 
value here (e~ 111 ) indicates certain 
significance. E values much higher than 0.1 
are unlikely to reflect true relatedness. For 
example, an E value of 0.1 means there is a 
1 in 10 likelihood that such a match would 
arise solely by chance. 



Summary 

Most proteins function in concert with other proteins, and many methods exist for 
identifying and studying protein-protein interactions. Small- molecule inhibitors 
allow the functions of proteins they act upon to be studied in living cells. Because pro- 
teins with similar structures often have similar functions, the biochemical activity of a 
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protein can often be predicted by searching databases for previously characterized pro- 
teins that are similar in their amino acid sequences. 



ANALYZING AND MANIPULATING DNA 

Until the early 1970s, DNA was the most difficult biological molecule for the bio- 
chemist to analyze. Enormously long and chemically monotonous, the string of 
nucleotides that forms the genetic material of an organism could be examined 
only indirectly, by protein or RNA sequencing or by genetic analysis. Today, the 
situation has changed entirely. From being the most difficult macromolecule of 
the cell to analyze, DNA has become the easiest. It is now possible to isolate a 
specific region of almost any genome, to produce a virtually unlimited number 
of copies of it, and to determine the sequence of its nucleotides in a few hours. 
At the height of the Human Genome Project, large facilities with automated 
machines were generating DNA sequences at the rate of 1000 nucleotides per 
second, around the clock. By related techniques, an isolated gene can be altered 
(engineered) at will and transferred back into the germ line of an animal or 
plant, so as to become a functional and heritable part of the organism's genome. 
These technical breakthroughs in genetic engineering — the ability to 
manipulate DNA with precision in a test tube or an organism — have had a dra- 
matic impact on all aspects of cell biology by facilitating the study of cells and 
their macromolecules in previously unimagined ways. Recombinant DNA tech- 
nology comprises a mixture of techniques, some newly developed and some 
borrowed from other fields such as microbial genetics (Table 8-3). Central to the 
technology are the following key techniques: 

1. Cleavage of DNA at specific sites by restriction nucleases, which greatly 
facilitates the isolation and manipulation of individual genes. 

2. DNA ligation, which makes it possible to design and construct DNA 
molecules that are not found in nature. 

3. DNA cloning through the use of either cloning vectors or the polymerase 
chain reaction, in which a portion of DNA is repeatedly copied to generate 
many billions of identical molecules. 

4. Nucleic acid hybridization, which makes it possible to find a specific 
sequence of DNA or RNA with great accuracy and sensitivity on the basis of 
its ability to selectively bind a complementary nucleic acid sequence. 

5. Rapid determination of the sequence of nucleotides of any DNA (even 
entire genomes), making it possible to identify genes and to deduce the 
amino acid sequence of the proteins they encode. 

6. Simultaneous monitoring of the level of mRNA produced by every gene in 
a cell, using nucleic acid microarrays, in which tens of thousands of 
hybridization reactions take place simultaneously. 

In this section, we describe each of these basic techniques, which together 
have revolutionized the study of cell biology. 



Restriction Nucleases Cut Large DNA Molecules into Fragments 

Unlike a protein, a gene does not exist as a discrete entity in cells, but rather as 
a small region of a much longer DNA molecule. Although the DNA molecules in 
a cell can be randomly broken into small pieces by mechanical force, a fragment 
containing a single gene in a mammalian genome would still be only one among 
a hundred thousand or more DNA fragments, indistinguishable in their average 
size. How could such a gene be purified? Because all DNA molecules consist of 
an approximately equal mixture of the same four nucleotides, they cannot be 
readily separated, as proteins can, on the basis of their different charges and 
binding properties. 

The solution to all of these problems began to emerge with the discovery of 
restriction nucleases. These enzymes, which can be purified from bacteria, cut 
the DNA double helix at specific sites defined by the local nucleotide sequence, 
thereby cleaving a long double-stranded DNA molecule into fragments of 
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Table 8-3 Some Major Steps in the Development of Recombinant DNA and Transgenic Technology 

1869 Miescher first isolates DNA from white blood cells harvested from pus-soaked bandages obtained from a nearby 

hospital. 
1 944 Avery provides evidence that DNA, rather than protein, carries the genetic information during bacterial transformation. 

1 953 Watson and Crick propose the double-helix model for DNA structure based on x-ray results of Franklin and Wilkins. 

1955 Kornberg discovers DNA polymerase, the enzyme now used to produce labeled DNA probes. 

1 961 Marmur and Doty discover DNA renaturation, establishing the specificity and feasibility of nucleic acid hydridization 
reactions. 

1 962 Arber provides the first evidence for the existence of DNA restriction nucleases, leading to their purification and use in 
DNA sequence characterization by Nathans and H. Smith. 

1966 Nirenberg, Ochoa, and Khorana elucidate the genetic code. 

1 967 Gellert discovers DNA ligase, the enzyme used to join DNA fragments together. 

1 972-1 973 DNA cloning techniques are developed by the laboratories of Boyer, Cohen, Berg, and their colleagues at Stanford 

University and the University of California at San Francisco. 
1 975 Southern develops gel-transfer hybridization for the detection of specific DNA sequences. 

1 975-1 977 Sanger and Barrell and Maxam and Gilbert develop rapid DNA-sequencing methods. 

1 981 -1 982 Palmiter and Brinster produce transgenic mice; Spradling and Rubin produce transgenic fruit flies. 

1982 GenBank, NIH's public genetic sequence database, is established at Los Alamos National Laboratory. 
1 985 Mullis and co-workers invent the polymerase chain reaction (PCR). 

1 987 Capecchi and Smithies introduce methods for performing targeted gene replacement in mouse embryonic stem cells. 

1989 Fields and Song develop the yeast two-hybrid system for identifying and studying protein interactions. 

1 989 Olson and colleagues describe sequence-tagged sites, unique stretches of DNA that are used to make physical maps of 
human chromosomes. 

1990 Lipman and colleagues release BLAST, an algorithm used to search for homology between DNA and protein sequences. 

1 990 Simon and colleagues study how to efficiently use bacterial artificial chromosomes, BACs, to carry large pieces of cloned 
human DNA for sequencing. 

1 991 Hood and Hunkapillar introduce new automated DNA sequence technology. 

1 995 Venter and colleagues sequence the first complete genome, that of the bacterium Haemophilus influenzae. 

1 996 Goffeau and an international consortium of researchers announce the completion of the first genome sequence of a 
eucaryote, the yeast Saccharomyces cerevisiae. 

1 996-1 997 Lockhart and colleagues and Brown and DeRisi produce DNA microarrays, which allow the simultaneous monitoring 

of thousands of genes. 
1 998 Sulston and Waterston and colleagues produce the first complete sequence of a multicellular organism, the nematode 

worm Caenorhabditis elegans. 
2001 Consortia of researchers announce the completion of the draft human genome sequence. 

2004 Publication of the "finished" human genome sequence. 

strictly defined sizes. Different restriction nucleases have different sequence 
specificities, and it is relatively simple to find an enzyme that can create a DNA 
fragment that includes a particular gene. The size of the DNA fragment can then 
be used as a basis for partial purification of the gene from a mixture. 

Different species of bacteria make different restriction nucleases, which 
protect them from viruses by degrading incoming viral DNA. Each bacterial 
nuclease recognizes a specific sequence of four to eight nucleotides in DNA. 
These sequences, where they occur in the genome of the bacterium itself, are 
protected from cleavage by methylation at an A or a C nucleotide; the sequences 
in foreign DNA are generally not methylated and so are cleaved by the restriction 
nucleases. Large numbers of restriction nucleases have been purified from vari- 
ous species of bacteria; several hundred, most of which recognize different 
nucleotide sequences, are now available commercially. 

Some restriction nucleases produce staggered cuts, which leave short sin- 
gle-stranded tails at the two ends of each fragment (Figure 8-31). Ends of this 
type are known as cohesive ends, as each tail can form complementary base pairs 
with the tail at any other end produced by the same enzyme (Figure 8-32). The 
cohesive ends generated by restriction enzymes allow any two DNA fragments 
to be easily joined together, as long as the fragments were generated with the 
same restriction nuclease (or with another nuclease that produces the same 
cohesive ends). DNA molecules produced by splicing together two or more DNA 
fragments are called recombinant DNA molecules. 
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Figure 8-31 The DNA nucleotide sequences recognized by four widely 
used restriction nucleases. As in the examples shown, such sequences are 
often six base pairs long and "palindromic" (that is, the nucleotide 
sequence is the same if the helix is turned by 1 80 degrees around the 
center of the short region of helix that is recognized). The enzymes cut the 
two strands of DNA at or near the recognition sequence. For the genes 
encoding some enzymes, such as Hpal, the cleavage leaves blunt ends; for 
others, such as EcoRI, Hindlll, and Pstl, the cleavage is staggered and 
creates cohesive ends. Restriction nucleases are obtained from various 
species of bacteria: Hpal is from Haemophilus parainfluenzae, EcoRI is from 
Escherichia coli, Hindlll is from Haemophilus influenzae, and Pstl is from 
Providencia stuartii. 

Gel Electrophoresis Separates DNA Molecules of Different Sizes 

The same types of gel electrophoresis methods that have proved so useful in the 
analysis of proteins can determine the length and purity of DNA molecules. The 
procedure is actually simpler than for proteins: because each nucleotide in a 
nucleic acid molecule already carries a single negative charge (on the phosphate 
group), there is no need to add the negatively charged detergent SDS that is 
required to make protein molecules move uniformly toward the positive elec- 
trode. For DNA fragments less than 500 nucleotides long, specially designed 
polyacrylamide gels allow the separation of molecules that differ in length by as 
little as a single nucleotide (Figure 8-33A). The pores in polyacrylamide gels, 
however, are too small to permit very large DNA molecules to pass; to separate 
these by size, the much more porous gels formed by dilute solutions of agarose 
(a polysaccharide isolated from seaweed) are used (Figure 8-33B). These DNA 
separation methods are widely used for both analytical and preparative pur- 
poses. 

A variation of agarose-gel electrophoresis, called pulsed-field gel elec- 
trophoresis, makes it possible to separate even extremely long DNA molecules. 
Ordinary gel electrophoresis fails to separate such molecules because the steady 
electric field stretches them out so that they travel end-first through the gel in 
snakelike configurations at a rate that is independent of their length. In pulsed- 
field gel electrophoresis, by contrast, the direction of the electric field changes 
periodically, which forces the molecules to reorient before continuing to move 
snakelike through the gel. This reorientation takes much more time for larger 
molecules, so that longer molecules move more slowly than shorter ones. As a 
consequence, even entire bacterial or yeast chromosomes separate into discrete 
bands in pulsed-field gels and so can be sorted and identified on the basis of 
their size (Figure 8-33C). Although a typical mammalian chromosome of 10 8 
base pairs is too large to be sorted even in this way, large segments of these chro- 
mosomes are readily separated and identified if the chromosomal DNA is first 
cut with a restriction nuclease selected to recognize sequences that occur only 
rarely (once every 10,000 or more nucleotide pairs). 

The DNA bands on agarose or polyacrylamide gels are invisible unless the 
DNA is labeled or stained in some way. One sensitive method of staining DNA is 
to expose it to the dye ethidium bromide, which fluoresces under ultraviolet light 
when it is bound to DNA (see Figure 8-33B,C). An even more sensitive detection 
method incorporates a radioisotope into the DNA molecules before elec- 
trophoresis; 32 P is often used as it can be incorporated into DNA phosphates and 
emits an energetic (3 particle that is easily detected by autoradiography, as in Fig- 
ure 8-33. (For a discussion of radioisotopes, see p. 601). 

Figure 8-32 The use of restriction nucleases to produce DNA fragments 
that can be easily joined together. Fragments with the same cohesive 
ends can readily join by complementary base-pairing between their 
cohesive ends, as illustrated. The two DNA fragments that join in this 
example were both produced by the EcoRI restriction nuclease, whereas 
the three other fragments were produced by different restriction nucleases 
that generated different cohesive ends (see Figure 8-31 ). Blunt-ended 
fragments, like those generated by Hpal (see Figure 8-31), can be spliced 
together with more difficulty. 
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Figure 8-33 Gel electrophoresis techniques for separating DNA 
molecules by size. In the three examples shown, electrophoresis is from 
top to bottom, so that the largest — and thus slowest-moving — DNA 
molecules are near the top of the gel. (A) A polyacrylamide gel with small 
pores was used to fractionate single-stranded DNA. In the size range 10 to 
500 nucleotides, DNA molecules that differ in size by only a single 
nucleotide can be separated from each other. In the example, the four lanes 
represent sets of DNA molecules synthesized in the course of a DNA- 
sequencing procedure. The DNA to be sequenced has been artificially 
replicated from a fixed start site up to a variable stopping point, producing 
a set of partial replicas of differing lengths. (Figure 8-50 explains how such 
sets of partial replicas are synthesized.) Lane 1 shows all the partial replicas 
that terminate in a G, lane 2 all those that terminate in an A, lane 3 all those 
that terminate in a T, and lane 4 all those that terminate in a C. Since the 
DNA molecules used in these reactions were radiolabeled, their positions 
can be determined by autoradiography, as shown. 

(B) An agarose gel with medium-sized pores was used to separate double- 
stranded DNA molecules. This method is most useful in the size range 300 
to 10,000 nucleotide pairs. These DNA molecules are fragments produced 
by cleaving the genome of a bacterial virus with a restriction nuclease, and 
they have been detected by their fluorescence when stained with the dye 
ethidium bromide. (C) The technique of pulsed-field agarose gel 
electrophoresis was used to separate 16 different yeast (Saccharomyces 
cerevisiae) chromosomes, which range in size from 220,000 to 2.5 million 
nucleotide pairs. The DNA was stained as in (B). DNA molecules as large as 
10 7 nucleotide pairs can be separated in this way. (A, courtesy of Leander 
Lauffer and Peter Walter; B, courtesy of Ken Kreuzer; C, from D. Vollrath and 
R.W. Davis, Nucleic Acids Res. 1 5:7865-7876, 1 987. With permission from 
Oxford University Press.) 
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Purified DNA Molecules Can Be Specifically Labeled with 
Radioisotopes or Chemical Markers in vitro 

Two procedures are widely used to label isolated DNA molecules. In the first 
method, a DNA polymerase copies the DNA in the presence of nucleotides that 
are either radioactive (usually labeled with 32 P) or chemically tagged (Figure 
8-34A). In this way "DNA probes" containing many labeled nucleotides can be 
produced for nucleic acid hybridization reactions (discussed below). The sec- 
ond procedure uses the bacteriophage enzyme polynucleotide kinase to trans- 
fer a single 32 P-labeled phosphate from ATP to the 5' end of each DNA chain 
(Figure 8-34B). Because only one 32 P atom is incorporated by the kinase into 
each DNA strand, the DNA molecules labeled in this way are often not radioac- 
tive enough to be used as DNA probes; because they are labeled at only one end, 
however, they have been invaluable for other applications, including DNA foot- 
printing, as discussed in Chapter 7. 

Radioactive labeling methods are being replaced by labeling with molecules 
that can be detected chemically or through fluorescence. To produce such non- 
radioactive DNA molecules, specially modified nucleotide precursors are used 
(Figure 8-34C). A DNA molecule made in this way is allowed to bind to its com- 
plementary DNA sequence by hybridization, as discussed in the next section, 
and is then detected with an antibody (or other ligand) that specifically recog- 
nizes its modified side chain (Figure 8-35). 
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Nucleic Acid Hybridization Reactions Provide a Sensitive Way to 
Detect Specific Nucleotide Sequences 

When an aqueous solution of DNA is heated at 100°C or exposed to a very high 
pH (pH > 13), the complementary base pairs that normally hold the two strands 
of the double helix together are disrupted and the double helix rapidly dissoci- 
ates into two single strands. This process, called DNA denaturation, was for 
many years thought to be irreversible. In 1961, however, it was discovered that 
complementary single strands of DNA readily re-form double helices by a pro- 
cess called hybridization (also called DNA renaturation) if they are kept for a 
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prolonged period at 65°C. Similar hybridization reactions can occur between 
any two single-stranded nucleic acid chains (DNA/DNA, RNA/RNA, or 
RNA/DNA), provided that they have complementary nucleotide sequences. 
These specific hybridization reactions are widely used to detect and character- 
ize specific nucleotide sequences in both RNA and DNA molecules. 

Single-stranded DNA molecules used to detect complementary sequences 
are known as probes; these molecules, which carry radioactive or chemical 
markers to facilitate their detection, can range from fifteen to thousands of 
nucleotides long. Hybridization reactions using DNA probes are so sensitive and 
selective that they can detect complementary sequences present at a concen- 
tration as low as one molecule per cell. It is thus possible to determine how 
many copies of any DNA sequence are present in a particular DNA sample. The 
same technique can be used to search for related but nonidentical genes. To find 
a gene of interest in an organism whose genome has not yet been sequenced, for 
example, a portion of a known gene can be used as a probe (Figure 8-36). 



Figure 8-34 Methods for labeling DNA 
molecules in vitro. (A) A purified DNA 
polymerase enzyme labels all the 
nucleotides in a DNA molecule and can 
thereby produce highly radioactive DNA 
probes. (B) Polynucleotide kinase labels 
only the 5' ends of DNA strands; therefore, 
when labeling is followed by restriction 
nuclease cleavage, as shown, DNA 
molecules containing a single 5'-end- 
labeled strand can be readily obtained. 
(C) The method in (A) is also used to 
produce nonradioactive DNA molecules 
that carry a specific chemical marker that 
can be detected with an appropriate 
antibody. The modified nucleotide shown 
can be incorporated into DNA by DNA 
polymerase, allowing the DNA molecule to 
serve as a probe that can be readily 
detected. The base on the nucleoside 
triphosphate shown is an analog of 
thymine, in which the methyl group on 
T has been replaced by a spacer arm linked 
to the plant steroid digoxigenin. An anti- 
digoxygenin antibody coupled to a visible 
marker such as a fluorescent dye is used to 
visualize the probe. Other chemical labels 
such as biotin can be attached to 
nucleotides and used in essentially the 
same way. 
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Figure 8-35 In situ hybridization to locate specific genes on 
chromosomes. Here, six different DNA probes have been used to mark the 
locations of their respective nucleotide sequences on human chromosome 
5 at metaphase.The probes have been chemically labeled and detected 
with fluorescent antibodies. Both copies of chromosome 5 are shown, 
aligned side by side. Each probe produces two dots on each chromosome, 
since a metaphase chromosome has replicated its DNA and therefore 
contains two identical DNA helices. (Courtesy of David C. Ward.) 



Alternatively, DNA probes can be used in hybridization reactions with RNA 
rather than DNA to find out whether a cell is expressing a given gene. In this case 
a DNA probe that contains part of the gene's sequence is hybridized with RNA 
purified from the cell in question to see whether the RNA includes nucleotide 
sequences matching the probe DNA and, if so, in what quantities. In somewhat 
more elaborate procedures, the DNA probe is treated with specific nucleases 
after the hybridization is complete, to determine the exact regions of the DNA 
probe that have paired with the RNA molecules. One can thereby determine the 
start and stop sites for RNA transcription, as well as the precise boundaries of 
the intron and exon sequences in a gene (Figure 8-37). 

Today, the positions of intron/ exon boundaries are usually determined by 
sequencing the complementary DNA (cDNA) sequences that represent the 
mRNAs expressed in a cell and comparing them with the nucleotide sequence of 
the genome. We describe later how cDNAs are prepared from mRNAs. 

The hybridization of DNA probes to RNAs allows one to determine whether 
or not a particular gene is being transcribed; moreover, when the expression of 
a gene changes, one can determine whether the change is due to transcriptional 
or post-transcriptional controls (see Figure 7-92). These tests of gene expression 
were initially performed with one DNA probe at a time. DNA microarrays now 
allow the simultaneous monitoring of hundreds or thousands of genes at a time, 
as we discuss later. Hybridization methods are in such wide use in cell biology 
today that it is difficult to imagine how we could study gene structure and 
expression without them. 
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Figure 8-36 Stringent versus 
nonstringent hybridization conditions. 

To use a DNA probe to find an identical 
match, stringent hybridization conditions 
are used; the reaction temperature is 
kept just a few degrees below that at 
which a perfect DNA helix denatures in 
the solvent used (its melting temperature), 
so that all imperfect helices formed are 
unstable. When a DNA probe is being 
used to find DNAs with related, as well as 
identical, sequences, less stringent 
conditions are used; hybridization is 
performed at a lower temperature, which 
allows even imperfectly paired double 
helices to form. Only the lower- 
temperature hybridization conditions can 
be used to search for genes that are 
nonidentical but related to gene A (C and 
E in this example). 
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Figure 8-37 The use of nucleic acid 
hybridization to determine the region 
of a cloned DNA fragment that is 
present in an mRNA molecule. The 

method shown requires a nuclease that 
cuts the DNA chain only where it is not 
base-paired to a complementary RNA 
chain. The positions of the introns in 
eucaryotic genes are mapped by the 
method shown. For this type of analysis, 
the DNA is electrophoresed through a 
denaturing agarose gel, which causes it 
to migrate as single-stranded molecules. 
The location of each end of an RNA 
molecule can be determined using 
similar methods. 
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Northern and Southern Blotting Facilitate Hybridization with 
Electrophoretically Separated Nucleic Acid Molecules 

In a complex mixture of nucleic acids, DNA probes are often used to detect only 
those molecules with sequences that are complementary to all or part of the 
probe. Gel electrophoresis can be used to fractionate the many different RNA or 
DNA molecules in a crude mixture according to their size before the hybridiza- 
tion reaction is performed; if the probe binds to molecules of only one or a few 
sizes, one can be certain that the hybridization was indeed specific. Moreover, 
the size information obtained can be invaluable in itself. An example illustrates 
this point. 

Suppose that one wishes to determine the nature of the defect in a mutant 
mouse that produces abnormally low amounts of albumin, a protein that liver 
cells normally secrete into the blood in large amounts. First, one collects identi- 
cal samples of liver tissue from mutant and normal mice (the latter serving as 
controls) and disrupts the cells in a strong detergent to inactivate nucleases that 
might otherwise degrade the nucleic acids. Next, one separates the RNA and 
DNA from all of the other cell components: the proteins present are completely 
denatured and removed by repeated extractions with phenol — a potent organic 
solvent that is partly miscible with water; the nucleic acids, which remain in the 
aqueous phase, are then precipitated with alcohol to separate them from the 
small molecules of the cell. Then, one separates the DNA from the RNA by their 
different solubilities in alcohols and degrades any contaminating nucleic acid of 
the unwanted type by treatment with a highly specific enzyme— either an RNase 
or a DNase. The mRNAs are typically separated from bulk RNA by retention on a 
chromatography column that specifically binds the poly-A tails of mRNAs. 

To analyze the albumin-encoding mRNAs, a technique called Northern 
blotting is used. First, the intact mRNA molecules purified from mutant and 
control liver cells are fractionated on the basis of their sizes into a series of bands 
by gel electrophoresis. Then, to make the RNA molecules accessible to DNA 
probes, a replica of the pattern of RNA bands on the gel is made by transferring 
("blotting") the fractionated RNA molecules onto a sheet of nitrocellulose or 
nylon paper. The paper is then incubated in a solution containing a labeled DNA 
probe, the sequence of which corresponds to part of the template strand that 
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Figure 8-38 Detection of specific RNA or DNA molecules by gel-transfer 
hybridization. In this example, the DNA probe is detected by its 
radioactivity. DNA probes detected by chemical or fluorescence methods 
are also widely used (see Figure 8-34). (A) A mixture of either single- 
stranded RNA molecules (Northern blotting) or the double-stranded DNA 
fragments created by restriction nuclease treatment (Southern blotting) is 
separated according to length by electrophoresis. (B) A sheet of 
nitrocellulose or nylon paper is laid over the gel, and the separated RNA or 
DNA fragments are transferred to the sheet by blotting. (C) The 
nitrocellulose sheet is carefully peeled off the gel. (D) The sheet containing 
the bound nucleic acids is placed in a sealed plastic bag together with a 
buffered salt solution containing a radioactively labeled DNA probe. The 
sheet is exposed to a labeled DNA probe for a prolonged period under 
conditions favoring hybridization. (E) The sheet is removed from the bag and 
washed thoroughly, so that only probe molecules that have hybridized to 
the RNA or DNA immobilized on the paper remain attached. After 
autoradiography, the DNA that has hybridized to the labeled probe shows 
up as bands on the autoradiograph. 

For Southern blotting, the strands of the double-stranded DNA molecules 
on the paper must be separated before the hybridization process; this is 
done by exposing the DNA to alkaline denaturing conditions after the gel 
has been run (not shown). 
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produces albumin mRNA. The RNA molecules that hybridize to the labeled DNA 
probe on the paper (because they are complementary to part of the normal 
albumin gene sequence) are then located by detecting the bound probe by 
autoradiography or by chemical means (Figure 8-38). The sizes of the 
hybridized RNA molecules can be determined by reference to RNA standards of 
known sizes that are electrophoresed side by side with the experimental sample. 
In this way one might discover that liver cells from the mutant mice make albu- 
min mRNA in normal amounts and of normal size; alternatively you might find 
that they make it in normal size but in greatly reduced amounts. Another possi- 
bility is that the mutant albumin mRNA molecules are abnormally short; in this 
case the gel blot could be retested with a series of shorter DNA probes, each cor- 
responding to small portions of the gene, to reveal which part of the normal RNA 
is missing. 

The original gel-transfer hybridization method, called Southern blotting, 
analyzes DNA rather than RNA. (It was named after its inventor, and the North- 
ern and Western blotting techniques were named with reference to it.) Here, iso- 
lated DNA is first cut into readily separable fragments with restriction nucleases. 
The double-stranded fragments are then separated on the basis of size by gel 
electrophoresis, and those complementary to a DNA probe are identified by 
blotting and hybridization, as just described for RNA (see Figure 8-38). To char- 
acterize the structure of the albumin gene in the mutant mice, an albumin-spe- 
cific DNA probe would be used to construct a detailed restriction map of the 
genome in the region of the albumin gene (such a map consists of the pattern of 
DNA fragments produced by various restriction nucleases) . From this map one 
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could determine if the albumin gene has been rearranged in the defective ani- 
mals — for example, by the deletion or the insertion of a short DNA sequence; 
most single-base changes, however, could not be detected in this way. 



Genes Can Be Cloned Using DNA Libraries 

Any DNA fragment can be cloned. In molecular biology, the term DNA cloning 
is used in two senses. In one sense, it literally refers to the act of making many 
identical copies of a DNA molecule — the amplification of a particular DNA 
sequence. However, the term also describes the isolation of a particular stretch 
of DNA (often a particular gene) from the rest of a cell's DNA, because this isola- 
tion is greatly facilitated by making many identical copies of the DNA of interest. 
As discussed earlier in this chapter, cloning, particularly when used in the con- 
text of developmental biology, can also refer to the generation of many geneti- 
cally identical cells starting from a single cell or even to the generation of genet- 
ically identical organisms. In all cases, cloning refers to the act of making many 
genetically identical copies; in this section, we will use the term cloning (or DNA 
cloning or gene cloning) to refer to methods designed to generate many identi- 
cal copies of a segment of nucleic acid. 

DNA cloning in its most general sense can be accomplished in several ways. 
The simplest involves inserting a particular fragment of DNA into the purified 
DNA genome of a self-replicating genetic element — generally a virus or a plas- 
mid. A DNA fragment containing a human gene, for example, can be joined in a 
test tube to the chromosome of a bacterial virus, and the new recombinant DNA 
molecule can then be introduced into a bacterial cell, where the inserted DNA 
fragment will be replicated along with the DNA of the virus. Starting with only 
one such recombinant DNA molecule that infects a single cell, the normal repli- 
cation mechanisms of the virus can produce more than 10 12 identical virus DNA 
molecules in less than a day, thereby amplifying the amount of the inserted 
human DNA fragment by the same factor. A virus or plasmid used in this way is 
known as a cloning vector, and the DNA propagated by insertion into it is said to 
have been cloned. 

To isolate a specific gene, one often begins by constructing a DNA library— 
a comprehensive collection of cloned DNA fragments from a cell, tissue, or 
organism. This library includes (one hopes) at least one fragment that contains 
the gene of interest. Libraries can be constructed with either a virus or a plasmid 
vector and are generally housed in a population of bacterial cells. The principles 
underlying the methods used for cloning genes are the same for either type of 
cloning vector, although the details may differ. Today, most cloning is performed 
with plasmid vectors. 

The plasmid vectors most widely used for gene cloning are small circular 
molecules of double-stranded DNA derived from larger plasmids that occur nat- 
urally in bacterial cells. They generally account for only a minor fraction of the 
total host bacterial cell DNA, but they can easily be separated owing to their 
small size from chromosomal DNA molecules, which are large and precipitate as 
a pellet upon centrifugation. For use as cloning vectors, the purified plasmid 
DNA circles are first cut with a restriction nuclease to create linear DNA 
molecules. The genomic DNA to be used in constructing the library is cut with 
the same restriction nuclease, and the resulting restriction fragments (including 
those containing the gene to be cloned) are then added to the cut plasmids and 
annealed via their cohesive ends to form recombinant DNA circles. These 
recombinant molecules containing foreign DNA inserts are then covalently 
sealed with the enzyme DNA ligase (Figure 8-39). 

In the next step in preparing the library, the recombinant DNA circles are 
introduced into bacterial cells that have been made transiently permeable to 
DNA. These bacterial cells are now said to be transfected with the plasmids. As 
the cells grow and divide, doubling in number every 30 minutes, the recombi- 
nant plasmids also replicate to produce an enormous number of copies of DNA 
circles containing the foreign DNA (Figure 8-40). Many bacterial plasmids carry 
genes for antibiotic resistance (discussed in Chapter 24), a property that can be 
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Figure 8-39 The insertion of a DNA 
fragment into a bacterial plasmid with 
the enzyme DNA ligase. The plasmid is 
cut open with a restriction nuclease (in 
this case one that produces cohesive 
ends) and is mixed with the DNA 
fragment to be cloned (which has been 
prepared with the same restriction 
nuclease). DNA ligase and ATP are added. 
The cohesive ends base-pair, and DNA 
ligase seals the nicks in the DNA 
backbone, producing a complete 
recombinant DNA molecule. 
(Micrographs courtesy of Huntington 
Potter and David Dressier.) 
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exploited to select those cells that have been successfully transfected; if the bac- 
teria are grown in the presence of the antibiotic, only cells containing plasmids 
will survive. Each original bacterial cell that was initially transfected contains, in 
general, a different foreign DNA insert; this insert is inherited by all of the 
progeny cells of that bacterium, which together form a small colony in a culture 
dish. 

For many years, plasmids were used to clone fragments of DNA of 1000 to 
30,000 nucleotide pairs. Larger DNA fragments are more difficult to handle and 
were harder to clone. Then researchers began to use yeast artificial chromosomes 
[YACs), which could accommodate very large pieces of DNA (Figure 8-41). 
Today, new plasmid vectors based on the naturally occurring F plasmid of E. coli 
are used to clone DNA fragments of 300,000 to 1 million nucleotide pairs. Unlike 
smaller bacterial plasmids, the F plasmid — and its derivative, the bacterial arti- 
ficial chromosome (BAC) — is present in only one or two copies per E. coli cell. 
The fact that BACs are kept in such low numbers in bacterial cells may con- 
tribute to their ability to maintain large cloned DNA sequences stably: with only 
a few BACs present, it is less likely that the cloned DNA fragments will become 
scrambled by recombination with sequences carried on other copies of the plas- 
mid. Because of their stability, ability to accept large DNA inserts, and ease of 
handling, BACs are now the preferred vector for building DNA libraries of com- 
plex organisms — including those representing the human and mouse genomes. 



Two Types of DNA Libraries Serve Different Purposes 

Cleaving the entire genome of a cell with a specific restriction nuclease and 
cloning each fragment as just described produces a very large number of DNA 
fragments — on the order of a million for a mammalian genome. The fragments 
are distributed among millions of different colonies of transfected bacterial cells. 
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Figure 8-40 The amplification of the 
DNA fragments inserted into a plasmid. 

To produce large amounts of the DNA of 
interest, the recombinant plasmid DNA in 
Figure 8-39 is introduced into a 
bacterium by transfection, where it will 
replicate many millions of times as the 
bacterium multiplies. 
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When working with BACs rather than typical plasmids, larger fragments can be 
inserted, and so fewer transfected bacterial cells are required to cover the 
genome. In either case, each of the colonies is composed of a clone of cells 
derived from a single ancestor cell, and therefore harbors many copies of a par- 
ticular stretch of the fragmented genome (Figure 8-42). Such a plasmid is said to 
contain a genomic DNA clone, and the entire collection of plasmids is called a 
genomic DNA library. But because the genomic DNA is cut into fragments at 
random, only some fragments contain genes. Many of the genomic DNA clones 
obtained from the DNA of a higher eucaryotic cell contain only noncoding DNA, 
which, as we discussed in Chapter 4, makes up most of the DNA in such genomes. 

An alternative strategy is to begin the cloning process by selecting only those 
DNA sequences that are transcribed into mRNA and thus are presumed to cor- 
respond to protein-encoding genes. This is done by extracting the mRNA from 
cells and then making a DNA copy of each mRNA molecule present — a so-called 
complementary DNA, or cDNA. The copying reaction is catalyzed by the reverse 
transcriptase enzyme of retroviruses, which synthesizes a complementary DNA 
chain on an RNA template. The single-stranded cDNA molecules synthesized by 
the reverse transcriptase are converted into double-stranded cDNA molecules 
by DNA polymerase, and these molecules are inserted into a plasmid or virus 
vector and cloned (Figure 8-43). Each clone obtained in this way is called a 
cDNA clone, and the entire collection of clones derived from one mRNA prepa- 
ration constitutes a cDNA library. 

Figure 8-44 illustrates some important differences between genomic DNA 
clones and cDNA clones. Genomic clones represent a random sample of all of 
the DNA sequences in an organism and, with very rare exceptions, are the same 
regardless of the cell type used to prepare them. By contrast, cDNA clones con- 
tain only those regions of the genome that have been transcribed into mRNA. 
Because the cells of different tissues produce distinct sets of mRNA molecules, a 
distinct cDNA library is obtained for each type of cell used to prepare the library. 

Figure 8-42 Construction of a human genomic DNA library. A genomic 
library is usually stored as a set of bacteria, each bacterium carrying a 
different fragment of human DNA. For simplicity, cloning of just a few 
representative fragments (colored) is shown. In reality, all of the gray DNA 
fragments would also be cloned. 



Figure 8-41 The making of a yeast 
artificial chromosome (YAC). A YAC 
vector allows the cloning of very large 
DNA molecules. TEL, CEN, and ORI are the 
telomere, centromere, and origin of 
replication sequences, respectively, for 
the yeast Saccharomyces cerevisiae; all of 
these are required to propagate the YAC. 
BamHI and EcoRI are sites where the 
corresponding restriction nucleases cut 
the DNA double helix. The sequences 
denoted A and B encode enzymes that 
serve as selectable markers to allow the 
easy isolation of yeast cells that have 
taken up the artificial chromosome. 
Because bacteria divide more rapidly 
than yeasts, most large-scale cloning 
projects now use E. coli as the means for 
amplifying DNA. (Adapted from 
D.T. Burke, G.F. Carle and M.V. Olson, 
Science 236:806-81 2, 1 987. With 
permission from AAAS.) 
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Figure 8-43 The synthesis of cDNA. 

Total mRNA is extracted from a particular 
tissue, and the enzyme reverse 
transcriptase produces DNA copies 
(cDNA) of the mRNA molecules (see 
p. 320). For simplicity, the copying of just 
one of these mRNAs into cDNA is 
illustrated. A short oligonucleotide 
complementary to the poly-A tail at the 
3' end of the mRNA (discussed in Chapter 
6) is first hybridized to the RNA to act as a 
primer for the reverse transcriptase, 
which then copies the RNA into a 
complementary DNA chain, thereby 
forming a DNA/RNA hybrid helix. Treating 
the DNA/RNA hybrid with RNase H (see 
Figure 5 1 2) creates nicks and gaps in the 
RNA strand. The enzyme DNA polymerase 
then copies the remaining single- 
stranded cDNA into double-stranded 
cDNA. The fragment of the original mRNA 
is the primer for this synthesis reaction, 
as shown. Because the DNA polymerase 
used to synthesize the second DNA 
strand can synthesize through the bound 
RNA molecules, the RNA fragment that is 
base-paired to the 3' end of the first DNA 
strand usually acts as the primer for the 
final product of the second strand 
synthesis. This RNA is eventually 
degraded during subsequent cloning 
steps. As a result, the nucleotide 
sequences at the extreme 5' ends of the 
original mRNA molecules are often 
absent from cDNA libraries. 
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Figure 8-44 The differences between 
cDNA clones and genomic DNA clones 
derived from the same region of DNA. In 

this example, gene A is infrequently 
transcribed, whereas gene B is frequently 
transcribed, and both genes contain 
introns (green). In the genomic DNA 
library, both the introns and the 
nontranscribed DNA (pink) are included 
in the clones, and most clones contain, at 
most, only part of the coding sequence 
of a gene (red). In the cDNA clones, the 
intron sequences (yellow) have been 
removed by RNA splicing during the 
formation of the mRNA (blue), and a 
continuous coding sequence is therefore 
present in each clone. Because gene B is 
transcribed more frequently than gene A 
in the cells from which the cDNA library 
was made, it is represented much more 
frequently than A in the cDNA library. In 
contrast, A and B are in principle 
represented equally in the genomic 
DNA library. 
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cDNA Clones Contain Uninterrupted Coding Sequences 

There are several advantages in using a cDNA library for gene cloning. First, spe- 
cialized cells produce large quantities of some proteins. In this case, the mRNA 
encoding the protein is likely to be produced in such large quantities that a 
cDNA library prepared from the cells is highly enriched for the cDNA molecules 
encoding the protein, greatly reducing the problem of identifying the desired 
clone in the library (see Figure 8-44). Hemoglobin, for example, is made in large 
amounts by developing erythrocytes (red blood cells); for this reason the globin 
genes were among the first to be cloned. 

By far the most important advantage of cDNA clones is that they contain the 
uninterrupted coding sequence of a gene. As we have seen, eucaryotic genes 
usually consist of short coding sequences of DNA (exons) separated by much 
longer noncoding sequences (introns); the production of mRNA entails the 
removal of the noncoding sequences from the initial RNA transcript and the 
splicing together of the coding sequences. Neither bacterial nor yeast cells will 
make these modifications to the RNA produced from a gene of a higher eucary- 
otic cell. Thus, when the aim of the cloning is either to deduce the amino acid 
sequence of the protein from the DNA sequence or to produce the protein in 
bulk by expressing the cloned gene in a bacterial or yeast cell, it is much prefer- 
able to start with cDNA. cDNA libraries have an additional use: as described in 
Chapter 7, many mRNAs from humans and other complex organisms are alter- 
natively spliced, and a cDNA library often represents many, if not all, of the alter- 
natively spliced mRNAs produced from a given cell line or tissue. 

Genomic and cDNA libraries are inexhaustible resources, which are widely 
shared among investigators. Today, many such libraries are also available from 
commercial sources. 



Genes Can Be Selectively Amplified by PCR 

Now that so many genome sequences are available, genes can be cloned directly 
without the need to first construct DNA libraries. A technique called the poly- 
merase chain reaction (PCR) makes this rapid cloning possible. Starting with an 
entire genome, PCR allows the DNA from a selected region to be amplified sev- 
eral billionfold, effectively "purifying" this DNA away from the remainder of the 
genome. 

To begin, a pair of DNA oligonucleotides, chosen to flank the desired 
nucleotide sequence of the gene, are synthesized by chemical methods. These 
oligonucleotides are then used to prime DNA synthesis on single strands gener- 
ated by heating the DNA from the entire genome. The newly synthesized DNA is 
produced in a reaction catalyzed in vitro by a purified DNA polymerase, and the 
primers remain at the 5' ends of the final DNA fragments that are made (Figure 
8-45A). 

Nothing special is produced in the first cycle of DNA synthesis; the power of 
the PCR method is revealed only after repeated rounds of DNA synthesis. Every 
cycle doubles the amount of DNA synthesized in the previous cycle. Because 
each cycle requires a brief heat treatment to separate the two strands of the tem- 
plate DNA double helix, the technique requires the use of a special DNA poly- 
merase, isolated from a thermophilic bacterium, that is stable at much higher 
temperatures than normal so that it is not denatured by the repeated heat treat- 
ments. With each round of DNA synthesis, the newly generated fragments serve 
as templates in their turn, and within a few cycles the predominant product is a 
single species of DNA fragment whose length corresponds to the distance 
between the two original primers (see Figure 8-45B). 

In practice, effective DNA amplification requires 20-30 reaction cycles, with 
the products of each cycle serving as the DNA templates for the next— hence the 
term polymerase "chain reaction." A single cycle requires only about 5 minutes, 
and the entire procedure can be easily automated. PCR thereby makes possible 
the "cell-free molecular cloning" of a DNA fragment in a few hours, compared 
with the several days required for standard cloning procedures. This technique 
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is now used routinely to clone DNA from genes of interest directly — starting 
either from genomic DNA or from mRNA isolated from cells (Figure 8-46). 

The PCR method is extremely sensitive; it can detect a single DNA molecule 
in a sample. Trace amounts of RNA can be analyzed in the same way by first tran- 
scribing them into DNA with reverse transcriptase. The PCR cloning technique 
has largely replaced Southern blotting for the diagnosis of genetic diseases and 
for the detection of low levels of viral infection. It also has great promise in foren- 
sic medicine as a means of analyzing minute traces of blood or other tissues — 
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Figure 8-45 Amplification of DNA by the PCR technique. <TACG> Knowledge of the DNA sequence to be amplified is 
used to design two synthetic, primer DNA oligonucleotides. One primer is complementary to the sequence on one strand 
of the DNA double helix, and one is complementary to the sequence on the other strand, but at the opposite end of the 
region to be amplified. These oligonucleotides serve as primers for in vitro DNA synthesis, which is performed by a DNA 
polymerase, and they determine the segment of the DNA to be amplified. (A) PCR starts with a double-stranded DNA, and 
each cycle of the reaction begins with a brief heat treatment to separate the two strands (step 1 ). After strand separation, 
cooling of the DNA in the presence of a large excess of the two primer DNA oligonucleotides allows these primers to 
hybridize to complementary sequences in the two DNA strands (step 2). This mixture is then incubated with DNA 
polymerase and the four deoxyribonucleoside triphosphates to synthesize DNA, starting from the two primers (step 3). The 
entire cycle is then begun again by a heat treatment to separate the newly synthesized DNA strands. (B) As the procedure 
is performed over and over again, the newly synthesized fragments serve as templates in their turn, and within a few cycles 
the predominant DNA is identical to the sequence bracketed by and including the two primers in the original template. Of 
the DNA put into the original reaction, only the sequence bracketed by the two primers is amplified because there are no 
primers attached anywhere else. In the example illustrated in (B), three cycles of reaction produce 1 6 DNA chains, 8 of 
which (boxed in yellow) are the same length as and correspond exactly to one or the other strand of the original bracketed 
sequence shown at the far left; the other strands contain extra DNA downstream of the original sequence, which is 
replicated in the first few cycles. After four more cycles, 240 of the 256 DNA chains correspond exactly to the original 
bracketed sequence, and after several more cycles, essentially all of the DNA strands have this unique length. 
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Figure 8-46 Use of PCR to obtain a 
genomic or cDNA clone. (A) To obtain a 
genomic clone using PCR, chromosomal 
DNA is first purified from cells. PCR 
primers that flank the stretch of DNA to 
be cloned are added, and many cycles of 
the reaction are completed (see Figure 
8-45). Since only the DNA between {and 
including) the primers is amplified, PCR 
provides a way to obtain a short stretch 
of chromosomal DNA selectively in a 
virtually pure form. (B) To use PCR to 
obtain a cDNA clone of a gene, mRNA is 
first purified from cells. The first primer is 
then added to the population of mRNAs, 
and reverse transcriptase is used to make 
a complementary DNA strand. The 
second primer is then added, and the 
single-stranded cDNA molecule is 
amplified through many cycles of PCR, as 
shown in Figure 8-45. For both types of 
cloning, the nucleotide sequence of at 
least part of the region to be cloned must 
be known beforehand. 



even as little as a single cell — and identifying the person from whom the sample 
came by his or her genetic "fingerprint" (Figure 8-47). 



Cells Can Be Used As Factories to Produce Specific Proteins 

The vast majority of the thousands of different proteins in a cell, including many 
with crucially important functions, are present in very small amounts. In the 
past, for most of them, it has been extremely difficult, if not impossible, to obtain 
more than a few micrograms of pure material. One of the most important con- 
tributions of DNA cloning and genetic engineering to cell biology is that they 
have made it possible to produce any of the cell's proteins in nearly unlimited 
amounts. 

Large amounts of a desired protein are produced in living cells by using 
expression vectors (Figure 8-48). These are generally plasmids that have been 
designed to produce a large amount of a stable mRNA that can be efficiently 
translated into protein in the transfected bacterial, yeast, insect, or mammalian 
cell. To prevent the high level of the foreign protein from interfering with the 
transfected cell's growth, the expression vector is often designed to delay the 
synthesis of the foreign mRNA and protein until shortly before the cells are har- 
vested and lysed (Figure 8-49). 

Because the desired protein made from an expression vector is produced 
inside a cell, it must be purified away from the host-cell proteins by chromatog- 
raphy after cell lysis; but because it is such a plentiful species in the cell lysate 
(often 1-10% of the total cell protein), the purification is usually easy to accom- 
plish in only a few steps. As we saw above, manv expression vectors have been 
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Figure 8-47 How PCR is used in forensic science. (A) The DNA sequences that create the variability used in this analysis 
contain runs of short, repeated sequences, such as CACACA . . ., which are found in various positions (loci) in the human 
genome. The number of repeats in each run can be highly variable in the population, ranging from 4 to 40 in different 
individuals. A run of repeated nucleotides of this type is commonly referred to as a hypervoriobie microsotellite sequence— 
also known as a VNTR (variable number of tandem repeat) sequence. Because of the variability in these sequences at each 
locus, individuals usually inherit a different variant from their mother and from their father; two unrelated individuals 
therefore do not usually contain the same pair of sequences. A PCR analysis using primers that bracket the locus produces 
a pair of bands of amplified DNA from each individual, one band representing the maternal variant and the other 
representing the paternal variant. The length of the amplified DNA, and thus the position of the band it produces after 
electrophoresis, depends on the exact number of repeats at the locus. (B) In the schematic example shown here, the same 
three VNTR loci are analyzed (requiring three different pairs of specially selected oligonucleotide primers) from three 
suspects (individuals A, B, and C), producing six DNA bands for each person after polyacrylamide-gel electrophoresis. 
Although some individuals have several bands in common, the overall pattern is quite distinctive for each. The band 
pattern can therefore serve as a "fingerprint" to identify an individual nearly uniquely. The fourth lane (F) contains the 
products of the same reactions carried out on a forensic sample. The starting material for such a PCR can be a single hair or 
a tiny sample of blood that was left at the crime scene. When examining the variability at 5-10 different VNTR loci, the odds 
that two random individuals would share the same genetic pattern by chance can be approximately 1 in 10 billion. In the 
case shown here, individuals A and C can be eliminated from further enquiries, whereas individual B remains a clear 
suspect for committing the crime. A similar approach is now routinely used for paternity testing. 
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Figure 8-48 Production of large amounts of a protein from a protein- 
coding DNA sequence cloned into an expression vector and introduced 
into cells. A plasmid vector has been engineered to contain a highly active 
promoter, which causes unusually large amounts of mRNA to be produced 
from an adjacent protein-coding gene inserted into the plasmid vector. 
Depending on the characteristics of the cloning vector, the plasmid is 
introduced into bacterial, yeast, insect, or mammalian cells, where the 
inserted gene is efficiently transcribed and translated into protein. 



designed to add a molecular tag — a cluster of histidine residues or a small 
marker protein — to the expressed protein to allow easy purification by affinity 
chromatography (see Figure 8-16). A variety of expression vectors are available, 
each engineered to function in the type of cell in which the protein is to be 
made. In this way, cells can be induced to make vast quantities of medically use- 
ful proteins — such as human insulin and growth hormone, interferon, and viral 
antigens for vaccines. More generally, these methods make it possible to pro- 
duce every protein — even those that may be present in only a few copies per 
cell — in large enough amounts to be used in the kinds of detailed structural and 
functional studies that we discussed earlier. 

DNA technology also can produce large amounts of any RNA molecule 
whose gene has been isolated. Studies of RNA splicing, protein synthesis, and 
RNA-based enzymes, for example, are greatly facilitated by the availability of 
pure RNA molecules. Most RNAs are present in only tiny quantities in cells, and 
they are very difficult to purify away from other cell components — especially 
from the many thousands of other RNAs present in the cell. But any RNA of 
interest can be synthesized efficiently in vitro by transcription of its DNA 
sequence (produced by one of the methods just described) with a highly effi- 
cient viral RNA polymerase. The single species of RNA produced is then easily 
purified away from the DNA template and the RNA polymerase. 
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Proteins and Nucleic Acids Can Be Synthesized Directly by 
Chemical Reactions 

Chemical reactions have been devised to synthesize directly specific sequences 
of amino acids or nucleic acids. These methodologies provide direct sources of 
biological molecules and do not rely on any cells or enzymes. Chemical synthe- 
sis is the method of choice for obtaining nucleic acids in the range of 100 
nucleotides or fewer, which are particularly useful in the PCR-based approaches 
discussed above. Chemical synthesis is also routinely used to produce specific 
peptides that, when chemically coupled to other proteins, are used to generate 
antibodies against the peptide. 



DNA Can Be Rapidly Sequenced 

Methods that allow the nucleotide sequence of any DNA fragment to be deter- 
mined simply and quickly have made it possible to determine the DNA 
sequences of tens of thousands of genes, and many complete genomes (see 
Table 1—1, p. 18). The volume of DNA sequence information is now so large 
(many tens of billions of nucleotides) that powerful computers must be used to 
store and analyze it. 

Figure 8-49 Production of large amounts of a protein by using a plasmid 
expression vector. In this example, bacterial cells have been transfected 
with the coding sequence for an enzyme, DNA helicase; transcription from 
this coding sequence is under the control of a viral promoter that becomes 
active only at temperatures of 37°C or higher. The total cell protein has 
been analyzed by SDS polyacrylamide-gel electrophoresis, either from 
bacteria grown at 25°C (no helicase protein made) or after a shift of the 
same bacteria to 42°C for up to 2 hours (helicase protein has become the 
most abundant protein species in the lysate). (Courtesy of Jack Barry.) 
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Large-volume DNA sequencing was made possible through the develop- 
ment in the mid-1970s of the dideoxy method for sequencing DNA, which is 
based on in vitro DNA synthesis performed in the presence of chain-terminat- 
ing dideoxyribonucleoside triphosphates (Figure 8-50). 
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Figure 8-50 The enzymatic — or dideoxy — 
method of sequencing DNA. (A) This method 
relies on the use of dideoxyribonucleoside 
triphosphates, derivatives of the normal 
deoxyribonucleoside triphosphates that lack 
the 3' hydroxyl group. (B) Purified DNA is 
synthesized in vitro in a mixture that contains 
single-stranded molecules of the DNA to be 
sequenced (gray), the enzyme DNA 
polymerase, a short primer DNA (orange) to 
enable the polymerase to start DNA synthesis, 
and the four deoxyribonucleoside 
triphosphates (dATR dCTP, dGTP, dTTP: blue A, 
C, G, and T). If a dideoxyribonucleotide analog 
(red) of one of these nucleotides is also 
present in the nucleotide mixture, it can 
become incorporated into a growing DNA 
chain. Because this chain now lacks a 3' OH 
group, the addition of the next nucleotide is 
blocked, and the DNA chain terminates at that 
point. In the example illustrated, a small 
amount of dideoxyATP (ddATP, symbolized 
here as a red A) has been included in the 
nucleotide mixture. It competes with an 
excess of the normal deoxyATP (dATP, blue A), 
so that ddATP is occasionally incorporated, at 
random, into a growing DNA strand. This 
reaction mixture will eventually produce a set 
of DNAs of different lengths complementary 
to the template DNA that is being sequenced 
and terminating at each of the different As. 
The exact lengths of the DNA synthesis 
products can then be used to determine the 
position of each A in the growing chain. (C) To 
determine the complete sequence of a DNA 
fragment the double-stranded DNA is first 
separated into its single strands and one of 
the strands is used as the template for 
sequencing. Four different chain-terminating 
dideoxyribonucleoside triphosphates (ddATP, 
ddCTP, ddGTP, ddTTP, again shown in red) are 
used in four separate DNA synthesis reactions 
on copies of the same single-stranded DNA 
template (gray). Each reaction produces a set 
of DNA copies that terminate at different 
points in the sequence. The products of these 
four reactions are separated by 
electrophoresis in four parallel lanes of a 
polyacrylamide gel (labeled here A,T, C, and 
G).The newly synthesized fragments are 
detected by a label (either radioactive or 
fluorescent) that has been incorporated either 
into the primer or into one of the 
deoxyribonucleoside triphosphates used to 
extend the DNA chain. In each lane, the bands 
represent fragments that have terminated at a 
given nucleotide (e.g., A in the leftmost lane) 
but at different positions in the DNA. By 
reading off the bands in order, starting at the 
bottom of the gel and working across all 
lanes, the DNA sequence of the newly 
synthesized strand can be determined. The 
sequence is given in the green arrow to the 
right of the gel. This sequence is 
complementary to the template strand (gray) 
from the original double-stranded DNA 
molecule, and identical to a portion of the 
green 5'-to-3' strand. 
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Although the same basic method is still used today, many improvements 
have been made. DNA sequencing is now completely automated: robotic 
devices mix the reagents and then load, run, and read the order of the nucleotide 
bases from the gel. Chain-terminating nucleotides that are each labeled with a 
different colored fluorescent dye facilitate these tasks; in this case, all four syn- 
thesis reactions can be performed in the same tube, and the products can be 
separated in a single lane of a gel. A detector positioned near the bottom of the 
gel reads and records the color of the fluorescent label on each band as it passes 
through a laser beam (Figure 8-51). A computer then reads and stores this 
nucleotide sequence. Some modern systems dispense with the traditional gel 
entirely, separating nucleic acids by capillary electrophoresis, a method that 
facilitates rapid automation. 



Figure 8-51 Automated DNA 
sequencing. Shown at the bottom is a 
tiny part of the raw data from an 
automated DNA-sequencing run as it 
appears on the computer screen. Each 
prominant colored peak represents a 
nucleotide in the DNA sequence — a clear 
stretch of nucleotide sequence can be 
read here between positions 1 73 and 1 94 
from the start of the sequence. The small 
peaks along the baseline represent 
background "noise" and, as long as they 
are much lower than the "signal" peaks, 
they are ignored. This particular example 
is taken from the international project 
that determined the complete nucleotide 
sequence of the genome of the plant 
Arabidopsis. (Courtesy of George 
Murphy.) 



Nucleotide Sequences Are Used to Predict the Amino Acid 
Sequences of Proteins 

Now that DNA sequencing is so rapid and reliable, it has become the preferred 
method for determining, indirectly, the amino acid sequences of most proteins. 
Given a nucleotide sequence that encodes a protein, the procedure is quite 
straightforward. Although in principle there are six different reading frames in 
which a DNA sequence can be translated into protein (three on each strand), the 
correct one is generally recognizable as the only one lacking frequent stop 
codons (Figure 8-52). As we saw when we discussed the genetic code in Chap- 
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Figure 8-52 Finding the regions in a DNA 
sequence that encode a protein. (A) Any 

region of the DNA sequence can, in 
principle, code for six different amino acid 
sequences, because any one of three 
different reading frames can be used to 
interpret the nucleotide sequence on each 
strand. Note that a nucleotide sequence is 
always read in the 5'-to-3' direction and 
encodes a polypeptide from the 
N-terminus to the C-terminus. For a 
random nucleotide sequence read in a 
particular frame, a stop signal for protein 
synthesis is encountered, on average, 
about once every 20 amino acids. In this 
sample sequence of 48 base pairs, each 
such signal (stopcodon) is colored blue, 
and only reading frame 2 lacks a stop 
signal. (B) Search of a 1 700 base-pair DNA 
sequence for a possible protein-encoding 
sequence. The information is displayed as 
in (A), with each stop signal for protein 
synthesis denoted by a blue line. In 
addition, all of the regions between 
possible start and stop signals for protein 
synthesis (see p. 381) are displayed as red 
bars. Only reading frame 1 actually 
encodes a protein, which is 475 amino 
acid residues long. 
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ter 6, a random sequence of nucleotides, read in frame, will encode a stop signal 
for protein synthesis about once every 20 amino acids. Nucleotide sequences 
that encode a stretch of amino acids much longer than this are candidates for 
presumptive exons, and they can be translated (by computer) into amino acid 
sequences and checked against databases for similarities to known proteins 
from other organisms. If necessary, a limited amount of amino acid sequence 
can then be determined from the purified protein to confirm the sequence pre- 
dicted from the DNA. 

The problem comes, however, in determining which nucleotide 
sequences — within a whole genome — represent genes that encode proteins. 
Identifying genes is easiest when the DNA sequence is from a bacterial or 
archaeal chromosome, which lacks introns, or from a cDNA clone. The location 
of genes in these nucleotide sequences can be predicted by examining the DNA 
for certain distinctive features (discussed in Chapter 6). Briefly, these genes that 
encode proteins are identified by searching the nucleotide sequence for open 
reading frames (ORFs) that begin with an initiation codon, usually ATG, and end 
with a termination codon, TAA, TAG, or TGA. To minimize errors, computers 
used to search for ORFs are often directed to count as genes only those 
sequences that are longer than, say, 100 codons in length. 

For more complex genomes, such as those of animals and plants, the pres- 
ence of large introns embedded within the coding portion of genes complicates 
the process. In many multicellular organisms, including humans, the average 
exon is only 150 nucleotides long. Thus one must also search for other features 
that signal the presence of a gene, for example, sequences that signal an 
intron/exon boundary or distinctive upstream regulatory regions. Recent efforts 
to solve the exon prediction problem have turned to artificial intelligence algo- 
rithms, in which the computer learns, based on known examples, what sets of 
features are most indicative of an exon boundary. 

A second major approach to identifying the coding regions in chromosomes 
is through the characterization of the nucleotide sequences of the detectable 
mRNAs (using the corresponding cDNAs). The mRNAs (and the cDNAs produced 
from them) lack introns, regulatory DNA sequences, and the nonessential 
"spacer" DNA that lies between genes. It is therefore useful to sequence large 
numbers of cDNAs to produce a very large database of the coding sequences of an 
organism. These sequences are then readily used to distinguish the exons from the 
introns in the long chromosomal DNA sequences that correspond to genes. 

The Genomes of Many Organisms Have Been Fully Sequenced 

Owing in large part to the automation of DNA sequencing, the genomes of many 
organisms have been fully sequenced; these include plant chloroplasts and ani- 
mal mitochondria, large numbers of bacteria, and archaea, and many of the 
model organisms that are studied routinely in the laboratory, including many 
yeasts, a nematode worm, the fruit fly Drosophila, the model plant Arabidopsis, 
the mouse, dog, chimpanzee, and, last but not least, humans. Researchers have 
also deduced the complete DNA sequences for a wide variety of human 
pathogens. These include the bacteria that cause cholera, tuberculosis, syphilis, 
gonorrhea, Lyme disease, and stomach ulcers, as well as hundreds of viruses — 
including smallpox virus and Epstein-Barr virus (which causes infectious 
mononucleosis). Examination of the genomes of these pathogens provides clues 
about what makes them virulent and will also point the way to new and more 
effective treatments. 

Haemophilus influenzae (a bacterium that can cause ear infections and 
meningitis in children) was the first organism to have its complete genome 
sequence — all 1.8 million nucleotide pairs — determined by the shotgun 
sequencing method, the most common strategy used today. In the shotgun 
method, long sequences of DNA are broken apart randomly into many shorter 
fragments. Each fragment is then sequenced and a computer is used to order 
these pieces into a whole chromosome or genome, using sequence overlap to 
guide the assembly. The shotgun method is the technique of choice for 
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sequencing small genomes. Although larger, more repetitive genome sequences 
are more challenging to assemble, the shotgun method — in combination with 
the analysis of large DNA fragments cloned in BACs — has played a key role in 
their sequencing as well. 

With new sequences appearing at a steadily accelerating pace in the scien- 
tific literature, comparison of the complete genome sequences of different 
organisms allows us to trace the evolutionary relationships among genes and 
organisms, and to discover genes and predict their functions (discussed in 
Chapters 3 and 4). Assigning functions to genes often involves comparing their 
sequences with related sequences from model organisms that have been well 
characterized in the laboratory, such as the bacterium E. coli, the yeasts S. cere- 
visiae and S. pombe, the nematode worm C. elegans, and the fruit fly Drosophila 
(discussed in Chapter 1). 

Although the organisms whose genomes have been sequenced share many 
biochemical pathways and possess many proteins that are homologous in their 
amino acid sequence or structure, the functions of a very large number of newly 
identified proteins remain unknown. Depending on the organism, some 15-40% 
of the proteins encoded by a sequenced genome do not resemble any protein 
that has been studied biochemically. This observation underscores a limitation 
of the emerging field of genomics: although comparative analysis of genomes 
reveals a great deal of information about the relationships between genes and 
organisms, it often does not provide immediate information about how these 
genes function, or what roles they have in the physiology of an organism. Com- 
parison of the full gene complement of several thermophilic bacteria, for exam- 
ple, does not reveal why these bacteria thrive at temperatures exceeding 70°C. 
And examination of the genome of the incredibly radioresistant bacterium 
Deinococcus radiodurans does not explain how this organism can survive a blast 
of radiation that can shatter glass. Further biochemical and genetic studies, like 
those described in the other sections of this chapter, are required to determine 
how genes, and the proteins they produce, function in the context of living 
organisms. 



Summary 

DNA cloning allows a copy of any specific part of a DNA or RNA sequence to be selected 
from the millions of other sequences in a cell and produced in unlimited amounts in 
pure form. DNA sequences can be amplified after cutting chromosomal DNA with a 
restriction nuclease and inserting the resulting DNA fragments into the chromosome 
of a self-replicating genetic element such as a virus or a plasmid. Plasmid vectors are 
generally used, and the resulting "genomic DNA library" is housed in millions of bac- 
terial cells, each carrying a different cloned DNA fragment Individual cells from this 
library that are allowed to proliferate produce large amounts of a single cloned DNA 
fragment. The polymerase chain reaction (PCR) allows DNA cloning to be performed 
directly with a thermostable DNA polymerase— provided that the DNA sequence of 
interest is already known. 

The procedures used to obtain DNA clones that correspond in sequence to mRNA 
molecules are the same except that a DNA copy of the mRNA sequence, called cDNA, is 
first made. Unlike genomic DNA clones, cDNA clones lack intron sequences, making 
them the clones of choice for analyzing the protein product of a gene. 

Nucleic acid hybridization reactions provide a sensitive means of detecting a gene 
or any other nucleotide sequence of interest. Under stringent hybridization conditions 
(a combination of solvent and temperature at which even a perfect double helix is 
barely stable), two strands can pair to form a "hybrid" helix only if their nucleotide 
sequences are almost perfectly complementary. The enormous specificity of this 
hybridization reaction allows any single-stranded sequence of nucleotides to be 
labeled with a radioisotope or chemical and used as a probe to find a complementary 
partner strand, even in a cell or cell extract that contains millions of different DNA and 
RNA sequences. Probes of this type are widely used to detect the nucleic acids corre- 
sponding to specific genes, both to facilitate their purification and characterization, 
and to localize them in cells, tissues, and organisms. 
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The nucleotide sequence ofDNA can be determined rapidly and simply by using 
highly automated techniques based on the dideoxy method for sequencing DN A. This 
technique has made it possible to determine the complete DNA sequences of the 
genomes of many organisms. Comparison of the genome sequences of different organ- 
isms allows us to trace the evolutionary relationships among genes and organisms, 
and it has proved valuable for discovering new genes and predicting their functions. 

Taken together, these techniques for analyzing and manipulating DNA have made 
it possible to identify, isolate, and sequence genes from any organism of interest. 
Related technologies allow scientists to produce the protein products of these genes in 
the large quantities needed for detailed analyses of their structure and function, as 
well as for medical purposes. 



STUDYING GENE EXPRESSION AND FUNCTION 

Ultimately, one wishes to determine how genes — and the proteins they 
encode — function in the intact organism. Although it may seem counterintu- 
itive, one of the most direct ways to find out what a gene does is to see what hap- 
pens to the organism when that gene is missing. Studying mutant organisms 
that have acquired changes or deletions in their nucleotide sequences is a time- 
honored practice in biology and forms the basis of the important field of genet- 
ics. Because mutations can disrupt cell processes, mutants often hold the key to 
understanding gene function. In the classical genetic approach, one begins by 
isolating mutants that have an interesting or unusual appearance: fruit flies with 
white eyes or curly wings, for example. Working backward from the 
phenotype — the appearance or behavior of the individual — one then deter- 
mines the organism's genotype, the form of the gene responsible for that char- 
acteristic (Panel 8-1). 

Today, with numerous genome sequences available, the exploration of gene 
function often begins with a DNA sequence. Here, the challenge is to translate 
sequence into function. One approach, discussed earlier in the chapter, is to 
search databases for well-characterized proteins that have similar amino acid 
sequences to the protein encoded by a new gene, and from there employ some of 
the methods described in the previous section to explore the gene's function fur- 
ther. But to determine directly a gene's function in a cell or organism, the most 
effective approach involves studying mutants that either lack the gene or express 
an altered version of it. Determining which cell processes have been disrupted or 
compromised in such mutants will usually shed light on a gene's biological role. 

In this section, we describe several approaches to determining a gene's func- 
tion, starting from a DNA sequence or an organism with an interesting pheno- 
type. We begin with the classical genetic approach, which starts with a genetic 
screen for isolating mutants of interest and then proceeds toward identification 
of the gene or genes responsible for the observed phenotype. We then describe 
the set of techniques that are collectively called reverse genetics, in which one 
begins with a gene or gene sequence and attempts to determine its function. 
This approach often involves some intelligent guesswork — searching for homol- 
ogous sequences and determining when and where a gene is expressed — as well 
as generating mutant organisms and characterizing their phenotype. 



Classical Genetics Begins by Disrupting a Cell Process by Random 
Mutagenesis 

Before the advent of gene cloning technology, most genes were identified by the 
abnormalities produced when the gene was mutated. This classical genetic 
approach — identifying the genes responsible for mutant phenotypes — is most 
easily performed in organisms that reproduce rapidly and are amenable to 
genetic manipulation, such as bacteria, yeasts, nematode worms, and fruit flies. 
Although spontaneous mutants can sometimes be found by examining 
extremely large populations — thousands or tens of thousands of individual 
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GENES AND PHENOTYPES 

Gene: a functional unit of inheritance, usually corresponding 

to the segment of DNA coding for a single protein. 
Genome: all of an organism's DNA sequences. 

locus: the site of the gene in the genome 

alleles: alternative forms of a gene 



^K ^r 



y/j 



GENOTYPE: the specific set of 
alleles forming the genome of 
an individual 

PHENOTYPE: the visible 
character of the individual 



homozygous A/A 



Wild-type: the normal, 
naturally occurring type 



heterozygous a/A 



Mutant: differing from the 
wild-type because of a genetic 
change (a mutation) 



homozygous a/a 




allele A is dominant (relative to a); allele a is recessive (relative to A) 
In the example above, the phenotype of the heterozygote is the same as that of one of the 
homozygotes; in cases where it is different from both, the two alleles are said to be co-dominant. 



CHROMOSOMES 



centromere 




a chromosome at the beginning of the cell 
cycle, in G-, phase; the single long bar 
represents one long double helix of DNA 



long 
"q " arm 



pafr nf 
autosomes 




long "q" arm 

a chromosome near the end of the cell cycle, in 
metaphase; it is duplicated and condensed, consisting of 
two identical sister chromatids (each containing one 
DNA double helix) joined at the centromere. 

A normal diploid chromosome set, as 
seen in a metaphase spread, 
prepared by bursting open a cell at 
metaphase and staining the scattered 
chromosomes. In the example shown 
schematically here, there are three 
pairs of autosomes {chromosomes 
inherited symmetrically from 
both parents, regardless of sex) and 
two sex chromosomes — an X from 
the mother and a Y from the father. 
The numbers and types of sex 
chromosomes and their role in sex 
determination are variable from one 
class of organisms to another, as is 
the number of pairs of autosomes. 
sex chromosomes 



THE HAPLOID-DIPLOID CYCLE 
OF SEXUAL REPRODUCTION 
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motherly \father 

DIPLOID 



MEfOSfS 



r 



r 



HAPLOID 




maternal 
chromosome 

paternal 
chromosome 

For simplicity, the cycle is shown for only 
one chromosome/chromosome pair. 



MEIOSIS AND GENETIC RECOMBINATION 



genotype Ab 



maternal chromosome 

A 
paternal chromosome 

/// 

a 

diploid germ cell 

AR 

genotype 22- 

ab 




MEIOSIS AND 
RECOMBINATION 





site of crossing-over 

t 




haploid gametes (eggs or sperm) 



The greater the distance 
between two loci on a single 
chromosome, the greater is the 
chance that they will be 
separated by crossing over 
occurring at a site between 
them. If two genes are thus 
reassorted in x% of gametes, 
they are said to be separated on 
a chromosome by a genetic map | 
distance of x map units (or 
x centimorgans). 



TYPES OF MUTATIONS 



m 



POINT MUTATION: maps to a single site in the genome, 
corresponding to a single nucleotide pair or a very 
small part of a single gene 



* 






DELETION: deletes a segment of a chromosome 



» 



INVERSION: inverts a segment of a chromosome 

lethal mutation: causes the developing organism to die 
prematurely. 

conditional mutation: produces its phenotypic effect only 
under certain conditions, called the restrictive conditions. 
Under other conditions— the permissive conditions— the 
effect is not seen. For a temperature-sensitive mutation, the 
restrictive condition typically is high temperature, while the 
permissive condition is low temperature, 
loss-of-function mutation: either reduces or abolishes the 
activity of the gene. These are the most common class of 
mutations. Loss-of-function mutations are usually 
recessive— the organism can usually function normally as 
long as it retains at least one normal copy of the affected 
gene. 

null mutation: a loss-of-function mutation that completely 
abolishes the activity of the gene. 



TRANSLOCATION: breaks off a segment from one 

chromosome and attaches it to another 
gain-of-function mutation: increases the activity of the gene 
or makes it active in inappropriate circumstances; these 
mutations are usually dominant. 

dominant-negative mutation: dominant-acting mutation that 
blocks gene activity, causing a loss-of-function phenotype 
even in the presence of a normal copy of the gene. This 
phenomenon occurs when the mutant gene product 
interferes with the function of the normal gene product, 
suppressor mutation: suppresses the phenotypic effect of 
another mutation, so that the double mutant seems normal. 
An intragenic suppressor mutation lies within the gene 
affected by the first mutation; an extragenic suppressor 
mutation lies in a second gene— often one whose product 
interacts directly with the product of the first. 



TWO GENES OR ONE? 

Given two mutations that produce the same phenotype, how can 
we tell whether they are mutations in the same gene? If the 
mutations are recessive (as they most often are), the answer can 
be found by a complementation test 

COMPLEMENTATION: 
MUTATIONS IN TWO DIFFERENT GENES 



homozygous mutant mother 



homo7vgous mutant father 



In the simplest type of complementation test, an individual who 
is homozygous for one mutation is mated with an individual 
who is homozygous for the other. The phenotype of the 
offspring gives the answer to the question. 

NONCOMPLEMENTATION: 
TWO INDEPENDENT MUTATIONS IN THE SAME GENE 



homozygous mutant mother 



homozygous mutant father 




hybrid offspring shows normal phenotype: 
one normal copy of each gene is present 



hybrid offspring shows mutant phenotype: 

no normal copies of the mutated gene are present 
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organisms — isolating mutant individuals is much more efficient if one generates 
mutations with chemicals or radiation that damage DNA. By treating organisms 
with such mutagens, very large numbers of mutant individuals can be created 
quickly and then screened for a particular defect of interest, as we discuss shortly. 

An alternative approach to chemical or radiation mutagenesis is called 
insertional mutagenesis. This method relies on the fact that exogenous DNA 
inserted randomly into the genome can produce mutations if the inserted frag- 
ment interrupts a gene or its regulatory sequences. The inserted DNA, whose 
sequence is known, then serves as a molecular tag that aids in the subsequent 
identification and cloning of the disrupted gene (Figure 8-53). In Drosophila, 
the use of the transposable P element to inactivate genes has revolutionized the 
study of gene function in the fly. Transposable elements (see Table 5-3, p. 318) 
have also been used to generate mutations in bacteria, yeast, mice, and the flow- 
ering plant Arabidopsis. 

Such classical genetic studies are well suited for dissecting biological pro- 
cesses in experimental organisms, but how can we study gene function in 
humans? Unlike the genetically accessible organisms we have been discussing, 
humans do not reproduce rapidly and they cannot be intentionally treated with 
mutagens. Moreover, any human with a serious defect in an essential process, 
such as DNA replication, would die long before birth. 

There are two main ways that we can study human genes. First, because 
genes and gene functions have been so highly conserved throughout evolution, 
the study of less complex model organisms reveals critical information about 
similar genes and processes in humans. The corresponding human genes can 
then be studied further in cultured human cells. Second, many mutations that 
are not lethal— tissue-specific defects in lysosomes or cell-surface receptors, for 
example— have arisen spontaneously in the human population. Analyses of the 
phenotypes of the affected individuals, together with studies of their cultured 
cells, have provided many unique insights into important human cell functions. 
Although such mutations are rare, they are very efficiently discovered because of 
a unique human property: the mutant individuals call attention to themselves 
by seeking special medical care. 




Figure 8-53 Insertional mutant of the 
snapdragon, Antirrhinum. A mutation in 
a single gene coding for a regulatory 
protein causes leafy shoots to develop in 
place of flowers. The mutation allows 
cells to adopt a character that would be 
appropriate to a different part of the 
normal plant. The mutant plant is on the 
left, the normal plant on the right. 
{Courtesy of Enrico Coen and 
Rosemary Carpenter.) 



Genetic Screens Identify Mutants with Specific Abnormalities 

Once a collection of mutants in a model organism such as yeast or fly has been 
produced, one generally must examine thousands of individuals to find the 
altered phenotype of interest. Such a search is called a genetic screen, and the 
larger the genome, the less likely it is that any particular gene will be mutated. 
Therefore, the larger the genome of an organism, the bigger the screening task 
becomes. The phenotype being screened for can be simple or complex. Simple 
phenotypes are easiest to detect: one can screen many organisms rapidly, for 
example, for mutations that make it impossible for the organism to survive in 
the absence of a particular amino acid or nutrient. 

More complex phenotypes, such as defects in learning or behavior, may 
require more elaborate screens (Figure 8-54). But even genetic screens that are 
used to dissect complex physiological systems should be as simple as possible in 
design, and, if possible, should permit the simultaneous examination of large 
numbers of mutants. As an example, one particularly elegant screen was 
designed to search for genes involved in visual processing in zebrafish. The basis 
of this screen, which monitors the fishes' response to motion, is a change in 
behavior. Wild-type fish tend to swim in the direction of a perceived motion, 
whereas mutants with defects in their visual processing systems swim in ran- 
dom directions— a behavior that is easily detected. One mutant discovered in 
this screen is called lakritz, which is missing 80% of the retinal ganglion cells that 
help to relay visual signals from the eye to the brain. As the cellular organization 
of the zebrafish retina is similar to that of all vertebrates, the study of such 
mutants should also provide insights into visual processing in humans. 

Because defects in genes that are required for fundamental cell processes— 
RNA synthesis and processing or cell- cycle control, for example— are usually 
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Figure 8-54 A behavioral phenotype 
detected in a genetic screen. (A) Wild- 
type C. elegans engage in social feeding. 
The worms migrate around until they 
encounter their neighbors and commence 
feeding on bacteria. (B) Mutant animals 
feed by themselves. (Courtesy of Cornelia 
Bargmann, Cell 94: cover, 1 998. With 
permission from Elsevier.) 



lethal, the functions of these genes are often studied in individuals with condi- 
tional mutations. The mutant individuals function normally as long as "permis- 
sive" conditions prevail, but demonstrate abnormal gene function when sub- 
jected to "nonpermissive" (restrictive) conditions. In organisms with tempera- 
ture-sensitive mutations, for example, the abnormality can be switched on and 
off experimentally simply by changing the temperature; thus, a cell containing a 
temperature-sensitive mutation in a gene essential for survival will die at a non- 
permissive temperature but proliferate normally at the permissive temperature 
(Figure 8-55). The temperature-sensitive gene in such a mutant usually con- 
tains a point mutation that causes a subtle change in its protein product. 

Many temperature-sensitive mutations were found in the bacterial genes 
that encode the proteins required for DNA replication. The mutants were iden- 
tified by screening populations of mutagen-treated bacteria for cells that stop 
making DNA when they are warmed from 30°C to 42°C. These mutants were 
later used to identify and characterize the corresponding DNA replication pro- 
teins (discussed in Chapter 5). Similarly screens for temperature-sensitive 
mutations led to the identification of many proteins involved in regulating the 
cell cycle, as well as many proteins involved in moving proteins through the 
secretory pathway in yeast (see Panel 13-1). Related screening approaches 
demonstrated the function of enzymes involved in the principal metabolic path- 
ways of bacteria and yeast (discussed in Chapter 2) and identified many of the 
gene products responsible for the orderly development of the Drosophila 
embryo (discussed in Chapter 22). 



Mutations Can Cause Loss or Gain of Protein Function 

Gene mutations are generally classed as "loss of function" or "gain of function." 
A loss of function mutation results in a gene product that either does not work 
or works too little; thus, it reveals the normal function of the gene. A gain of func- 
tion mutation results in a gene product that works too much, works at the wrong 
time or place, or works in a new way (Figure 8-56). 



mutant celts proliferate 
and form a colony at 
the permissive 
temperature 




mutagenized cells 
proliferate and form 
colonies at 23°C 



colonies replicated 

onto two identical 

plates and incubated 

at two different 

temperatures 




mutant cells fail to 
proliferate and form 
a colony at the 
nonpermissive 
temperature 



Figure 8-55 Screening for temperature- 
sensitive bacterial or yeast mutants. 

Mutagenized cells are plated out at the 
permissive temperature. They divide and 
form colonies, which are transferred to 
two identical Petri dishes by replica 
plating. One of these plates is incubated 
at the permissive temperature, the other 
at the nonpermissive temperature. Cells 
containing a temperature-sensitive 
mutation in a gene essential for 
proliferation can divide at the normal, 
permissive temperature but fail to divide 
at the elevated, nonpermissive 
temperature. 
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An important early step in the genetic analysis of any mutant cell or organ- 
ism is to determine whether the mutation causes a loss or a gain of function. A 
standard test is to determine whether the mutation is dominant or recessive. A 
dominant mutation is one that still causes the mutant phenotype, in the pres- 
ence of a single copy of the wild-type gene. A recessive mutation is one that is no 
longer able to cause the mutant phenotype in the presence of a single wild-type 
copy of the gene. Although cases have been described in which a loss-of-func- 
tion mutation is dominant or a gain-of-function mutation is recessive, in the 
vast majority of cases, recessive mutations are loss of function, and dominant 
mutations are gain of function. It is easy to determine if a mutation is dominant 
or recessive. One simply mates a mutant with a wild-type to obtain diploid cells 
or organisms. The progeny from the mating will be heterozygous for the muta- 
tion. If the mutant phenotype is no longer observed, one can conclude that the 
mutation is recessive and is very likely to be a loss-of-function mutation. 



Figure 8-56 Gene mutations that affect 
their protein product in different ways. 

In this example, the wild-type protein has 
a specific cell function denoted by the 
red rays. Mutations that eliminate this 
function, increase the function, or render 
the function sensitive to higher 
temperatures are shown. The 
temperature-sensitive conditional 
mutant protein carries an amino acid 
substitution (red) that prevents its proper 
folding at 37°C, but allows the protein to 
fold and function normally at 25°C Such 
conditional mutations are especially 
useful for studying essential genes; the 
organism can be grown under the 
permissive condition and then moved to 
the nonpermissive condition to study the 
function of the gene. 



Complementation Tests Reveal Whether Two Mutations Are in the 
Same Gene or Different Genes 

A large-scale genetic screen can turn up many different mutations that show the 
same phenotype. These defects might lie in different genes that function in the 
same process, or they might represent different mutations in the same gene. 
Alternative forms of a gene are known as alleles. The most common difference 
between alleles is a substitution of a single nucleotide pair, but different alleles 
can also bear deletions, substitutions, and duplications. How can we tell, then, 
whether two mutations that produce the same phenotype occur in the same 
gene or in different genes? If the mutations are recessive— if, for example, they 
represent a loss of function of a particular gene— a complementation test can 
be used to ascertain whether the mutations fall in the same or in different genes. 
To test complementation in a diploid organism, an individual that is homozy- 
gous for one mutation— that is, it possesses two identical alleles of the mutant 
gene in question— is mated with an individual that is homozygous for the other 
mutation. If the two mutations are in the same gene, the offspring show the 
mutant phenotype, because they still will have no normal copies of the gene in 
question (see Panel 8-1). If, in contrast, the mutations fall in different genes, the 
resulting offspring show a normal phenotype, because they retain one normal 
copy (and one mutant copy) of each gene; the mutations thereby complement 
one another and restore a normal phenotype. Complementation testing of 
mutants identified during genetic screens has revealed, for example, that 5 
genes are required for yeast to digest the sugar galactose, 20 genes are needed for 
E. coli to build a functional flagellum, 48 genes are involved in assembling bac- 
teriophage T4 viral particles, and hundreds of genes are involved in the devel- 
opment of an adult nematode worm from a fertilized egg. 



Genes Can Be Ordered in Pathways by Epistasis Analysis 

Once a set of genes involved in a particular hinlogtcal process has been identi- 
fied, the next step is often to determine in which order die genes function. Gene 
order is perhaps easiest to explain for metabolic pathways, where, for example, 
enzyme A is necessary to produce the substrate for enzyme B. In this case, we 
would say that the gene encoding enzyme A acts before (upstream of) the gene 
encoding enzyme B in the pathway. Similarly; where one protein regulates the 
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activity of another protein, we would say that the former gene acts before the lat- 
ter. Gene order can, in many cases, be determined purely by genetic analysis 
without any knowledge of the mechanism of action of the gene products 
involved. 

Suppose we have a biosynthetic process consisting of a sequence of steps, 
such that performance of a step B is conditional on completion of the preceding 
step A; and suppose gene A is required for step A, and gene B is required for step 
B. Then a null mutation (a mutation that abolishes function) in gene A will arrest 
the process at step A, regardless of whether gene B is functional or not, whereas 
a null mutation in gene B will cause arrest at step B only if gene A is still active. 
In such a case, gene A is said to be epistatic to gene B. By comparing the pheno- 
types of the different combinations of mutations, we can therefore discover the 
order in which the genes act. This type of analysis is called epistasis analysis. As 
an example, the pathway of protein secretion in yeast has been analyzed in this 
way. Different mutations in this pathway cause proteins to accumulate aber- 
rantly in the endoplasmic reticulum (ER) or in the Golgi apparatus. When a yeast 
cell is engineered to carry both a mutation that blocks protein processing in the 
ER and a mutation that blocks processing in the Golgi apparatus, proteins accu- 
mulate in the ER. This indicates that proteins must pass through the ER before 
being sent to the Golgi before secretion (Figure 8-57). Strictly speaking, an epis- 
tasis analysis can only provide information about gene order in a pathway when 
both mutations are null alleles. When the mutations retain partial function, their 
epistasis interactions can be difficult to interpret. 

Sometimes, a double mutant will show a new or more severe phenotype 
than either single mutant alone. This type of genetic interaction is called a syn- 
thetic phenotype, and if the phenotype is death of the organism, it is called syn- 
thetic lethality. In most cases, a synthetic phenotype indicates that the two genes 
act in two different parallel pathways, either of which is capable of mediating the 
same cell process. Thus, when both pathways are disrupted in the double 
mutant, the process fails altogether, and the synthetic phenotype is observed. 



Genes Identified by Mutations Can Be Cloned 

Once the mutant organisms are produced in a genetic screen, the next task is 
identifying the gene or genes responsible for the altered phenotype. [f the phe- 
notype has been produced by insertional mutagenesis, locating the disrupted 
gene is fairly simple. DNA fragments containing the insertion (a transposon or a 
retrovirus, for example) are collected and amplified by PCR, and the nucleotide 
sequence of the flanking DNA is determined. Genome databases can then be 
searched for open reading frames containing this flanking sequence. 

If a DNA-damaging chemical was used to generate the mutations, identi- 
fying the inactivated gene is often more laborious, but it can be accomplished 
by several different approaches. In one, the first step is to experimentally 
determine the gene's location in the genome. To map a newly discovered gene, 
its rough chromosomal location is first determined by assessing how far the 
gene lies from other known genes in the genome. Estimating the distance 
between genetic loci is usually done by linkage analysis, a technique that relies 
on the tendency for genes that lie near one another on a chromosome to be 
inherited together. Even closely linked genes, however, can be separated by 
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Figure 8-57 Using genetics to 
determine the order of function of 
genes. In normal cells, secretory proteins 
are loaded into vesicles, which fuse with 
the plasma membrane to secrete their 
contents into the extracellular medium. 
Two mutants, A and B, fail to secrete 
proteins. In mutant A, secretory proteins 
accumulate in the ER. In mutant B, 
secretory proteins accumulate in the 
Golgi. In the double mutant AB, proteins 
accumulate in the ER; this indicates that 
the gene defective in mutant A acts 
before the gene defective in mutant B in 
the secretory pathway. 
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recombination during meiosis. The larger the distance between two genetic loci, 
the greater the chance that they will be separated by a crossover (see Panel 8-1). 
By calculating the recombination frequency between two genes, the approxi- 
mate distance between them can be determined. If the position of one gene in 
the genome is known, that of the second gene can thereby be estimated. 

Because genes are not always located close enough to one another to allow a 
precise pinpointing of their position, linkage analyses often rely on physical 
markers along the genome for estimating the location of an unknown gene. These 
markers are generally short stretches of nucleotides, with a known sequence and 
genome location, that can exist in at least two allelic forms. The simplest markers 
are single- nucleotide polymorphisms (SNPs), short sequences that differ by one 
nucleotide pair among individuals in a population. SNPs can be detected by 
hybridization techniques. Many such physical markers, distributed all along the 
length of chromosomes, have been collected for a variety of organisms. If the dis- 
tribution of these markers is sufficiently dense, one can, through a linkage anal- 
ysis that tests for the tight co-inheritance of one or more SNPs with the mutant 
phenotype, narrow the potential location of a gene to a chromosomal region that 
may contain only a few gene sequences. These are then considered candidate 
genes, and their structure and function can be tested directly to determine which 
gene is responsible for the original mutant phenotype. 



Human Genetics Presents Special Problems and Opportunities 

Although genetic experimentation on humans is considered unethical and is 
legally banned, humans do suffer from a large variety of genetic disorders. The 
linkage analysis described above can be used to identify the genes responsible 
for these heritable conditions. Such studies require DNA samples from a large 
number of families affected by the disease. These samples are examined for the 
presence of physical markers such as SNPs that seem to be closely linked to the 
disease gene, in that they are always inherited by individuals who have the dis- 
ease and not by their unaffected relatives. The disease gene is then located as 
described above (Figure 8-58). The genes for cystic fibrosis and Huntington's 
disease, for example, were discovered in this way 
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CONCLUSION: gene causing disease is co-inherited with SNP marker from diseased mother 
in 75% of the diseased progeny. If this same correlation is observed in other families that 
have been examined, the gene causing disease is mapped to this chromosome close to the 
SNP. Note that an SNP that is either far away from the gene on the same chromosome or 
located on a different chromosome from the gene of interest will be co-inherited only 
50% of the time. 



Figure 8-58 Genetic linkage analysis 
using physical markers on DNA to find a 
human gene. In this example, the 
co-inheritance of a specific human 
phenotype (here a genetic disease) with 
an SNP marker. If individuals who inherit 
the disease nearly always inherit a 
particular SNP marker, then the gene 
causing the disease and the SNP are likely 
to be close together on the chromosome, 
as shown here. To prove that an observed 
linkage is statistically significant, 
hundreds of individuals may need to be 
examined. Note that the linkage will not 
be absolute unless the SNP marker is 
located in the gene itself. Thus, 
occasionally the SNP will be separated 
from the disease gene by crossing over 
during meiosis in the formation of the 
egg or sperm: this has happened in the 
case of the chromosome pair on the far 
right. When working with a sequenced 
genome, this procedure would be 
repeated with SNPs located on either side 
of the initial SNP, until a 100% 
co-inheritance is found. 

Note that the egg and sperm will each 
contribute only one chromosome of each 
pair from the parent to the child. 
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Human Genes Are Inherited in Haplotype Blocks, Which Can Aid 
in the Search for Mutations That Cause Disease 

With the complete human genome sequence in hand, we can now study 
human genetics in a way that was impossible only a few years ago. For exam- 
ple, we can begin to identify those DNA differences that distinguish one indi- 
vidual from another. No two humans (with the exception of identical twins) 
have the same genome. Each of us carries a set of polymorphisms — differences 
in nucleotide sequence— that make us unique. These polymorphisms can be 
used as markers for building genetic maps and performing genetic analyses to 
link particular polymorphisms with specific diseases or predispositions to dis- 
ease. 

The problem is that any two humans typically differ by about 0.1% in their 
nucleotide sequences (approximately one nucleotide difference every 1000 
nucleotides). This translates to about 3 million differences between one per- 
son and another. Theoretically, one would need to search through all 3 million 
of those polymorphisms to identify the one or two that are responsible for a 
particular heritable disease or disease predisposition. To reduce the number of 
polymorphisms we need to examine, researchers are taking advantage of the 
recent discovery that human genes tend to be inherited in blocks. 

The human species is relatively young, and it is thought that we are 
descended from a relatively small population of individuals who lived in Africa 
about 10,000 years ago. Because only a few hundred generations separate us 
from this ancestral population, large segments of human chromosomes have 
passed from parent to child unaltered by the recombination events that occur in 
meiosis. In fact, we observe that certain sets of alleles (including SNPs) are 
inherited in large blocks within chromosomes. These ancestral chromosome 
segments— sets of alleles that have been inherited in clusters with little genetic 
rearrangement across the generations— are called haplotype blocks. Like genes, 
SNPs, and other genetic markers— which exist in different alleleic forms — hap- 
lotype blocks also come in a limited number of "flavors" that are common in the 
human population, each of which represents an allele combination passed 
down from a shared ancestor long ago. 

Researchers are now constructing a human genome map based on these 
haplotype blocks— called a haplotype map (hapmap). Geneticists hope that 
the human haplotype map will make the search for disease-causing and dis- 
ease-susceptibility genes a much more manageable task. Instead of searching 
through each of the many millions of SNPs in the human population, one need 
only search through a considerably smaller set of selected SNPs to identify the 
haplotype block that appears to be inherited by individuals with the disease. 
(These searches still involve DNA samples from large numbers of people, and 
SNPs are now typically scored using robotic technologies.) If a specific haplo- 
type block is more common among people with the disease than in unaffected 
individuals, the mutation linked to that disease will likely be located in that 
same segment of DNA (Figure 8-59). Researchers can then zero in on the spe- 
cific region within the block to search for the specific gene associated with the 
disease. This approach should, in principle, allow one to analyze the genetics 
of those common diseases in which multiple genes confer susceptibility. 

A detailed examination of haplotype blocks can even tell us whether a par- 
ticular allele has been favored by natural selection. As a rule, when a new allele 
of a gene arises that does not confer a selective advantage on the individual, it 
will take a long time for that allele to become common in the population. The 
more common— and therefore older— such an allele is, the smaller should be 
the haplotype block that surrounds it, because it will have had many chances of 
being separated from its neighboring variations by the recombination events 
that occur in meiosis generation after generation. 

A new allele may quickly spread in a population, however, if it confers some 
dramatic advantage on the organism. For example, mutations or variations that 
make an organism more resistant to an infection might be selected for because 
organisms with this variation would be more likely to survive and pass the 
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mutation on to their offspring. Working with haplotype maps of individual 
genes, researchers have detected such positive selection for two human genes 
that confer resistance to malaria. The alleles that confer resistance are 
widespread in the population, but they are embedded in unusually large haplo- 
type blocks, suggesting that they rose to prominence recently in the human gene 
pool (Figure 8-60). 

In revealing the paths along which humans evolved, the human haplotype 
map provides a new window into our past; in helping us discover the genes that 
make us susceptible or resistant to disease, the map may also provide a rough 
guide to our individual futures. 



Figure 8-59 Tracing the inheritance of 
SNPs within haplotype blocks to reveal 
the location of a disease-causing gene. 

An ancestor who acquires a disease- 
causing mutation in gene 1 will pass that 
mutation along to his or her descendants. 
Part of this gene is embedded within a 
haplotype block (red shading) — a cluster of 
variations (about 30 SNPs) that have been 
passed along from the ancestor in a 
continuous chunk. In the 400 generations 
that separate the ancestor from modern 
descendants with the disease, SNPs 
located over most of the ancestral 
200,000-nucleotide-pair region shown 
have been shuffled by meiotic 
recombination in the descendant genome 
(blue). (Note that the overlap of yellow and 
red is seen as orange, and the overlap of 
yellow and blue is seen as green.) The 30 
SNPs within the haplotype block, however, 
have been inherited as a group, as no 
crossover events have yet separated them. 
To locate a gene that causes the inherited 
disease, the SNP patterns in a number of 
people who have the disease need to be 
analyzed. An individual with the disease 
will retain the ancestral pattern of SNPs 
located within the haplotype block shown, 
revealing that the disease-causing 
mutation is likely to lie within that 
haplotype block— thus in gene 1 . The 
beauty of using haplotype maps for this 
type of linkage analysis is that only a 
fraction of the total SNPs need to be 
examined: one should be able to locate 
genes after searching through only about 
10% of the 3 million useful SNPs present in 
the human genome. 
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Figure 8-60 Identification of alleles that 
have been selected for in fairly recent 
human history by the unusually large 
haplotype blocks in which they are 
embedded. The SNPs are indicated in this 
diagram by vertical bars, which are shown 
as white or black according to their DNA 
sequence. Haplotype blocks are shaded in 
red, genes in yellow, and the rest of the 
chromosome in blue. These data suggest 
that this particular allele of gene 2 arose 
relatively recently in human history. 
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Complex Traits Are Influenced by Multiple Genes 

A concert pianist might have an aunt who plays the violin. In another family, the 
parents and the children might all be fat. In a third family the grandmother 
might be an alcoholic, and her grandson might abuse drugs. To what extent are 
such characteristics— musical ability, obesity and addiction— inherited geneti- 
cally? This is a very difficult question to answer. Some traits or diseases "run in 
families" but appear in only a few relatives or with no easily discernible pattern. 

Characteristics that do not follow simple (sometimes called Mendelian) pat- 
terns of inheritance but have a genetically inherited component are termed 
complex traits. These traits are often polygenic; that is, they are influenced by 
multiple genes, each of which makes a small contribution to the phenotype in 
question. The effects of these genes are additive, which means that, together, 
they produce a continuum of varying features within the population. Individu- 
ally, the genes that contribute to a polygenic trait are distributed to offspring in 
simple patterns, but because they all influence the phenotype, the pattern of 
traits inherited by offspring is often highly complex. 

A simple example of a polygenic trait is eye color, which is determined by 
enzymes that control the distribution and production of the pigment melanin: 
the more melanin produced, the darker the eye color. Because numerous genes 
contribute to the formation of melanin, eye color in humans shows enormous 
variation, from the palest gray to a dark chocolate brown. 

Although diseases based on mutations in single genes (for example, sickle- 
cell anemia and hemophilia) were some of the earliest recognized human inher- 
ited phenotypes, only a small fraction of human traits are dictated by single 
genes. The most obvious human phenotypes — from height, weight, eye color, 
and hair color to intelligence, temperament, sociability, and humor — arise from 
the interaction of many genes. Multiple genes also almost certainly underlie a 
propensity for the most common human diseases: diabetes, heart disease, high 
blood pressure, allergies, asthma, and various mental illnesses, including major 
depression and schizophrenia. Researchers are exploring new strategies — 
including the use of the haplotype maps discussed earlier— to understand the 
complex interplay between genes that act together to determine many of our 
most "human" traits. 

Reverse Genetics Begins with a Known Gene and Determines 
Which Cell Processes Require Its Function 

As we have seen, classical genetics starts with a mutant phenotype (or, in the 
case of humans, a range of characteristics) and identifies the mutations (and 
consequently the genes) responsible for it. Recombinant DNA technology, in 
combination with genome sequencing, has made possible a different type of 
genetic approach. Instead of beginning with a mutant organism and using it to 
identify a gene and its protein, an investigator can start with a particular gene 
and proceed to make mutations in it, creating mutant cells or organisms so as to 
analyze the gene's function. Because this approach reverses the traditional 
direction of genetic discovery— proceeding from genes to mutations, rather 
than vice versa — it is commonly referred to as reverse genetics. 

Reverse genetics begins with a cloned gene, a protein with interesting 
properties that has been isolated from a cell, or simply a genome sequence. If 
the starting point is a protein, the gene encoding it is first identified and, if 
necessary, its nucleotide sequence is determined. The gene sequence can then 
be altered in vitro to create a mutant version. This engineered mutant gene, 
together with an appropriate regulatory region, is transferred into a cell where 
it can integrate into a chromosome, becoming a permanent part of the cell's 
genome. All of the descendants of the modified cell will now contain the 
mutant gene. 

If the original cell used for the gene transfer is a fertilized egg, whole multi- 
cellular organisms can be obtained that contain the mutant gene, provided that 
the mutation does not cause lethality. In some of these animals, the altered gene 
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will be incorporated into the germ cells— a germ-line mutation— allowing the 
mutant gene to be passed on to their progeny. 

Genes Can Be Engineered in Several Ways 

We have seen that mutant organisms lacking a particular gene may quickly 
reveal the function of the protein it encodes. For this reason, a gene "knock- 
out" — in which both copies of the gene in a diploid organism have been inacti- 
vated or deleted— is a particularly useful type of mutation. However, there are 
many more types of genetic alterations available to the experimenter. For exam- 
ple, by altering the regulatory region of a gene before it is reintegrated into the 
genome, one can create mutant organisms in which the gene product is 
expressed at abnormally high levels, in the wrong tissue, or at the wrong time in 
development (Figure 8-61). By placing the gene under the control of an 
inducible promoter, the gene can be switched on or off at any time, and die 
effects observed. Inducible promoters that function in only a specific tissue can 
be used to monitor the effects of shutting the gene off (or turning it on) in that 
particular tissue, Finally; dfamitmnP-negatwe mutations are often employed par- 
ticularly in those organisms in which it is simpler to add an altered gene to the 
genome than to replace the endogenous genes with it. The dominant-negative 
strategy exploits the fact that most proteins function as parts of larger protein 
complexes. The inclusion of just one nonfunctional component can often inac- 
tivate such complexes. Therefore, by designing a gene that produces large quan- 
tities of a mutant protein that is inactive but still able to assemble into the com- 
plex, it is often possible to produce a cell in which all the complexes are inacti- 
vated despite the presence of the normal protein (Figure 8-62). 

As noted in the earlier discussion of classical genetics, if a protein is required 
for the survival of the cell (or the organism), a dominant-negative mutant will be 
inviable, making it impossible to test the function of the protein. To avoid this 
problem in reverse genetics, one can couple the mutant gene to an inducible 
promoter in order to produce the faulty gene product only on command— for 
example, in response to an increase in temperature or to the presence of a spe- 
cific signal molecule. 

In studying the action of a gene and the protein it encodes, one does not 
always wish to make drastic changes— flooding cells with huge quantities of the 
protein or eliminating a gene product entirely. It is sometimes useful to make 
slight changes in a protein's structure so that one can begin to dissect which por- 
tions of a protein are important for its function. The activity of an enzyme, for 
example, can be studied by changing a single ammo acid in its active site. Spe- 
cial techniques are required to alter genes (and thus their protein products] in 
such subtle ways. The first step ts often the chemical synthesis of a short DNA 
molecule containing the desired altered portion of the gene's nucleotide 
sequence. This synthetic DNA oligonucleotide is hybridized with single- 
stranded plasmid DNA that contains the DNA sequence to be altered, using con- 
ditions that allow imperfectly matched DNA strands to pair The synthetic 
oligonucleotide will now serve as a primer for DNA synthesis by DNA poly- 
merase, thereby generating a DNA double helix that incorporates the altered 
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Figure 8-61 Ectopic misexpression of 
Wnt, a signaling protein that affects 
development of the body axis in the 
early Xenopus embryo. In this 
experiment, mRNA coding for Wnt was 
injected into the ventral vegetal 
blastomere, inducing a second body axis 
(discussed in Chapter 22). (From S. Sokol 
etal.,Ce// 67:741 -752, 1991. With 
permission from Elsevier.) 
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Figure 8-62 A dominant-negative effect 
of a protein. Here, a gene is engineered 
to produce a mutant protein that 
prevents the normal copies of the same 
protein from performing their function. 
In this simple example, the normal 
protein must form a multisubunit 
complex to be active, and the mutant 
protein blocks function by forming a 
mixed complex that is inactive. In this 
way a single copy of a mutant gene 
located anywhere in the genome can 
inactivate the normal products produced 
by other gene copies. 
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sequence into one of its two strands. After transfection, plasmids that carry the 
fully modified gene sequence are obtained. The modified DNA is then inserted 
into an expression vector so that the redesigned protein can be produced in the 
appropriate type of cells for detailed studies of its function. By changing selected 
amino acids in a protein in this way — a technique called site-directed mutage- 
nesis — one can determine exactly which parts of the polypeptide chain are 
important for such processes as protein folding, interactions with other pro- 
teins, and enzymatic catalysis (Figure 8-63). 



Engineered Genes Can Be Inserted into the Germ Line of Many 
Organisms 

Altered genes can be introduced into cells in a variety of ways. DNA can be 
microinjected into mammalian cells with a glass micropipette or introduced by 
a virus that has been engineered to carry foreign genes. In plant cells, genes are 
frequently introduced by a technique called particle bombardment: DNA sam- 
ples are painted onto tiny gold beads and then literally shot through the cell wall 
with a specially modified gun. Electroporation is the method of choice for intro- 
ducing DNA into bacteria and some other cells. In this technique, a brief electric 
shock renders the cell membrane temporarily permeable, allowing foreign DNA 
to enter the cytoplasm. 
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Figure 8-63 The use of a synthetic 
oligonucleotide to modify the protein- 
coding region of a gene by site-directed 
mutagenesis. (A) A recombinant plasmid 
containing a gene insert is separated into 
its two DNA strands. A synthetic 
oligonucleotide primer corresponding to 
part of the gene sequence but containing 
a single altered nucleotide at a 
predetermined point is added to the 
single-stranded DNA under conditions 
that permit imperfect DNA hybridization 
(see Figure 8-36). (B) The primer 
hybridizes to the DNA, forming a single 
mismatched nucleotide pair. (C)The 
recombinant plasmid is made double- 
stranded by in vitro DNA synthesis 
(starting from the primer) followed by 
sealing by DNA ligase. (D) The double- 
stranded DNA is introduced into a cell, 
where it is replicated. Replication using 
one strand of the template produces a 
normal DNA molecule, but replication 
using the other strand (the one that 
contains the primer) produces a DNA 
molecule carrying the desired mutation. 
Only half of the progeny cells will end up 
with a plasmid that contains the desired 
mutant gene. However, a progeny cell 
that contains the mutated gene can be 
identified, separated from other cells, and 
cultured to produce a pure population of 
cells, all of which carry the mutated gene. 
Only one of the many changes that can 
be engineered in this way is shown here. 
With an oligonucleotide of the 
appropriate sequence, more than one 
amino acid substitution can be made at a 
time, or one or more amino acids can be 
inserted or deleted. Although not shown 
in this figure, it is also possible to create a 
site-directed mutation by using the 
appropriate oligonucleotides and PCR 
(instead of plasmid replication) to amplify 
the mutated gene. 
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Unlike higher eucaryotes (which are multicellular and diploid), bacteria, 
yeasts, and the cellular slime mold Dictyostelium generally exist as haploid sin- 
gle cells. In these organisms, an artificially introduced DNA molecule carrying a 
mutant gene can, with a relatively high frequency replace the single copy of the 
normal gene by homologous recombination; it is therefore easy to produce cells 
in which the mutant gene has replaced the normal gene (Figure 8-64A). In this 
way, cells can be made in order to miss a particular protein or produce an altered 
form of it. The ability to perform direct gene replacements in lower eucaryotes, 
combined with the power of standard genetic analyses in these haploid organ- 
isms, explains in large part why studies in these types of cells have been so 
important for working out the details of those cell processes that are shared by 
all eucaryotes. 



Figure 8-64 Gene replacement, gene 
knockout and gene addition. 

A normal gene can be altered in several 
ways to produce a transgenic organism. 
(A) The normal gene (green) can be 
completely replaced by a mutant copy of 
the gene frec/j.This provides information 
on the activity of the mutant gene 
without interference from the normal 
gene, and thus the effects of small and 
subtle mutations can be determined. 
(B)The normal gene can be inactivated 
completely, for example, by making a 
large deletion in it. (C) A mutant gene can 
simply be added to the genome. In some 
organisms this is the easiest type of 
genetic engineering to perform. This 
approach can provide useful information 
when the introduced mutant gene 
overrides the function of the normal 
gene, as with a dominant-negative 
mutation (see Figure 8-62). 



Animals Can Be Genetically Altered 

Gene additions and replacements are also possible, but more difficult to per- 
form, in animals and plants. Animals and plants that have been genetically engi- 
neered by either gene insertion, gene deletion, or gene replacement are called 
transgenic organisms, and any foreign or modified genes that are added are 
called transgenes. We concentrate our discussion on transgenic mice, as enor- 
mous progress is being made in this area. If a DNA molecule carrying a mutated 
mouse gene is transferred into a mouse cell, it usually inserts into the chromo- 
somes at random, but about once in a thousand times, it replaces one of the two 
copies of the normal gene by homologous recombination. By exploiting these 
rare "gene targeting" events, any specific gene can be altered or inactivated in a 
mouse cell by a direct gene replacement. In the special case in which both 
copies of the gene of interest is completely inactivated or deleted, the resulting 
animal is called a "knockout" mouse. 

The technique works as follows. In the first step, a DNA Fragment containing 
a desired mutant gene (or a DNA fragment designed ro interrupt a target gene) 
is inserted into a vector and then introduced into cultured embryonic stem (ES) 
cells (see Figure 8-5), which are capable of producing cells of many different 
types. After a period of cell proliferation, the rare colonies of cells in which a 
homologous recombination event is likely to have caused a gene replacement to 
occur are isolated. The correct colonies among these are identified by PCR or by 
Southern blotting: they contain recombinant DNA sequences in which the 
inserted fragment has replaced all or part of one copy of the normal gene hi the 
second step, individual ES cells from the identified colony are taken up into a 
fine micropipette and injected into an early mouse embryo. The transfected ES 
cells collaborate with the cells of the host embryo to produce a normal -looking 
mouse? large parts of this chimeric animal, including— in favorable cases— cells 
of the germ line, often derive from the transfected ES ceils (Figure 8-65). 

The mice with the transgene in their germ line are then bred to produce both 
a male and a female animal, each heterozygous for the gene replacement (that 
is, they have one normal and one mutant copy of the gene). When these two 
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mice are mated, one-fourth of their progeny will be homozygous for the altered 
gene. Studies of these homozygotes allow the function of the altered gene — or 
the effects of eliminating the gene's activity— to be examined in the absence of 
the corresponding normal gene. 

The ability to prepare transgenic mice lacking a known normal gene has 
been a major advance, and the technique is now being used to determine the 
functions of all mouse genes (Figure 8-66). A special technique is used to pro- 
duce conditional mutants, in which a selected gene becomes disrupted in a spe- 
cific tissue at a certain time in development. The strategy takes advantage of a 
site-specific recombination system to excise — and thus disable — the target gene 
in a particular place or at a particular time. The most common of these recom- 
bination systems, called Cre/lox, is widely used to engineer gene replacements 
in mice and in plants (see Figure 5-79). In this case, the target gene in ES cells is 
replaced by a fully functional version of the gene that is flanked by a pair of the 
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Figure 8-65 Summary of the procedures 
used for making gene replacements in 

mice. In the first step (A), an altered 
version of the gene is introduced into 
cultured ES (embryonic stem) cells. Only 
a few rare ES cells will have their 
corresponding normal genes replaced by 
the altered gene through a homologous 
recombination event. Although the 
procedure is often laborious, these rare 
cells can be identified and cultured to 
produce many descendants, each of 
which carries an altered gene in place of 
one of its two normal corresponding 
genes. In the next step of the procedure 
(B), these altered ES cells are injected into 
a very early mouse embryo; the cells are 
incorporated into the growing embryo, 
and a mouse produced by such an 
embryo will contain some somatic cells 
(indicated by orange) that carry the 
altered gene. Some of these mice will 
also contain germ-line cells that contain 
the altered gene; when bred with a 
normal mouse, some of the progeny of 
these mice will contain one copy of the 
altered gene in all of their cells. If two 
such mice are bred (not shown), some of 
the progeny will contain two altered 
genes (one on each chromosome) in all 
of their cells. 

If the original gene alteration 
completely inactivates the function of 
the gene, these homozygous mice are 
known as knockout mice. When such 
mice are missing genes that function 
during development they often die with 
specific defects long before they reach 
adulthood. These lethal defects are 
carefully analyzed to help determine the 
normal function of the missing gene. 
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short DNA sequences, called lox sites, that are recognized by the Cre recombinase 
protein. The transgenic mice that result are phenotypically normal. They are 
then mated with transgenic mice that express the Cre recombinase gene under 
the control of an inducible promoter. In the specific cells or tissues in which Cre 
is switched on, it catalyzes recombination between the lox sequences — excising 
a target gene and eliminating its activity Similar recombination systems are 
used to generate conditional mutants in Drosophila (see Figure 22-49). 

Transgenic Plants Are Important for Both Cell Biology and 
Agriculture 

A damaged plant can often repair itself by a process in which mature differenti- 
ated cells "dedifferentiate," proliferate, and then redifferentiate into other cell 
types. In some circumstances, the dedifferentiated cells can even form an apical 
meristem, which can then give rise to an entire new plant, including gametes. 
This remarkable developmental plasticity of plant cells can be exploited to gen- 
erate transgenic plants from cells growing in culture. 

When a piece of plant tissue is cultured in a sterile medium containing 
nurrients and appropriate growth regulators, many of the cells are stimulated to 
proliferate indefinitely in a disorganized manner, producing a mass of relatively 
undifferentiated cells called a callus. By carefully manipulating the nutrients 
and growth regulators, one can induce the formation of shoot and then root api- 
cal meristem s wit bin die callus, and, in many species, regenerate a whole new 
plant. 

Callus cultures can also be mechanically dissociated into single cells, which 
will grow and divide as a suspension culture. In several plants— including 
tobacco, petunia, carrot, potato, and Ambidopsis—a. single cell from such a sus- 
pension culture can be grown into a small clump (a clone) from which a whole 
plant can be regenerated. Such a cell, which has the ability to give rise to all parts 
of the organism, is considered totipotent Just as mutant mice can be derived by 
the genetic manipulation of KS cells in culture, so transgenic plants can be cre- 
ated from single totipotent plant cells that have been transfectcd with DNA in 
culture (Figure 8-67). 

The ability to produce transgenic plants has greatly accelerated progress in 
many areas of plant cell biology, h has had an important role, for example, in iso- 
lating receptors for growth regulators and in analyzing the mechanisms of mor- 
phogenesis and of gene expression in plants. It has also opened up many new 
possibilities in agriculture that could benefit both farmer and consumer. It has 
made it possible, for example, to modify the lipid, starch, and protein stored in 
seeds, to impart pest and virus resistance to plants, and to create modified 
plants that tolerate extreme habitats such as salt marshes or water-stressed soil. 

Many of the major advances in understanding animal development have 
come from studies on the fruit fly Drosophila and the nematode worm C. ele- 
gans, which are amenable to classical genetic analysis, as well as to experimen- 
tal manipulation. Progress in plant developmental biology has. in die past, been 
relatively slow by comparison. Many of die plants that have proved most 
amenable to genetic analysis— such as maize and tomato— have long life cycles 
and very large genomes, making both classical and molecular genetic analysis 
very time-consuming. Increasing attention is consequently being paid to a fast- 
growing small weed, the common wall cress (Ambidopsis ihaUana), which has 



Figure 8-66 Transgenic mice engineered 
to express a mutant DNA helicase show 
premature aging. The helicase, encoded 
by the Xpd gene, is involved in both 
transcription and DNA repair. Compared 
with a wild-type mouse of the same age 
(A), a transgenic mouse that expresses a 
defective version of Xpd (B) exhibits many 
of the symptoms of premature aging, 
including osteoporosis, emaciation, early 
graying, infertility, and reduced life-span. 
The mutation in Xpd used here impairs 
the activity of the helicase and mimics a 
mutation that in humans causes 
trichothiodystrophy, a disorder 
characterized by brittle hair, skeletal 
abnormalities, and a very reduced life 
expectancy. These results indicate that an 
accumulation of DNA damage can 
contribute to the aging process in both 
humans and mice. (From J. de Boer et al., 
Science 296:1 276-1 279, 2002. With 
permission from AAAS.) 
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Figure 8-67 A procedure used to make a 
transgenic plant. (A) Outline of the 
process. A disc is cut out of a leaf and 
incubated in culture with Agrobacterium 
cells that carry a recombinant plasmid 
that contains both a selectable marker 
gene and a desired transgene. The 
wounded cells at the edge of the disc 
release substances that attract the 
Agrobacterium cells and cause them to 
inject DNA into these cells. Only those 
plant cells that take up the appropriate 
DNA and express the selectable marker 
gene survive to proliferate and form a 
callus. The manipulation of growth 
regulators and nutrients supplied to the 
callus induces it to form shoots, which 
subsequently root and grow into adult 
plants carrying the transgene. (B) The 
preparation of the recombinant plasmid 
and its transfer to plant cells. An 
Agrobacterium plasmid that normally 
carries the T-DNA sequence is modified by 
substituting a selectable marker gene 
(such as the kanamycin-resistance gene) 
and a desired transgene between the 
25-nucleotide-pair T-DNA repeats. When 
the Agrobacterium recognizes a plant cell, 
it efficiently passes a DNA strand that 
carries these sequences into the plant cell, 
using the special machinery that normally 
transfers the plasmid's T-DNA sequence. 



several major advantages as a "model plant" (see Figures 1-46 and 22-112). The 
relatively small Arabidopsis genome was the first plant genome to be completely 
sequenced, and the pace of research on this organism now rivals that of the 
model animals. 



Large Collections of Tagged Knockouts Provide a Tool for 
Examining the Function of Every Gene in an Organism 

Extensive collaborative efforts are underway to assemble comprehensive 
libraries of mutations in a variety of model organisms, including S. cerevisiae, C. 
elegans, Drosophila, Arabidopsis, and the mouse. The ultimate aim in each case is 
to produce a collection of mutant strains in which every gene in the organism has 
been systematically deleted or altered in such a way that it can be conditionally 
disrupted. Collections of this type will provide an invaluable resource for investi- 
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Figure 8-68 Making collections of mutant organisms. (A) A deletion cassette for use in yeast contains DNA sequences (red) 
homologous to each end of a target gene x, a selectable marker gene (blue), and a unique "barcode" sequence 
approximately 20 nucleotide pairs in length (green). This DNA is introduced into yeast cells, where it readily replaces the 
target gene by homologous recombination. By using a collection of such cassettes, each specific for one gene, a library of 
yeast mutants can be constructed containing a mutant for every gene. (B) A similar approach can be taken to prepare 
tagged knockout mutants in Arabidopsis and Drosophila. In this case, mutations are generated by the accidental insertion of 
a transposable element into a target gene. The total DNA from the resulting organism can be collected and quickly screened 
for disruption of a gene of interest by using PCR primers that bind to the transposable element and to the target gene. 
A PCR product is detected on the gel only if the transposable element has inserted into the target gene (see Figure 8-45). 

gating gene function on a genomic scale. In some cases, each of the individual 
mutations within the collection will express a distinct molecular tag— in the form 
of a unique DNA sequence— designed to make identification of the altered gene 
rapid and routine. 

In S. cerevisiae, the task of generating a complete set of 6000 mutants, each 
missing only one gene, is made simpler by yeast's propensity for homologous 
recombination. For each gene, a "deletion cassette" is prepared. The cassette 
consists of a special DNA molecule that contains 50 nucleotides identical in 
sequence to each end of the targeted gene, surrounding a selectable marker. In 
addition, a special "barcode" sequence tag is embedded in this DNA molecule to 
facilitate the later rapid identification of each resulting mutant strain (Figure 
8-68). A large mixture of such gene knockout mutants can then be grown under 
various selective test conditions— such as nutritional deprivation, a tempera- 
ture shift, or the presence of various drugs— and the cells that survive can be 
rapidly identified by their unique sequence tags. By assessing how well each 
mutant in the mixture fares, one can begin to assess which genes are essential, 
useful, or irrelevant for growth under the various conditions. 

The challenge in deriving information from the study of such yeast mutants 
lies in deducing a gene's activity or biological role based on a mutant phenotype. 
Some defects— an inability to live without histidine, for example— point directly 
to the function of the wild-type gene. Other connections may not be so obvious. 
What might a sudden sensitivity to cold indicate about the role of a particular 
gene in the yeast cell? Such problems are even greater in organisms that are 
more complex than yeast. The loss of function of a single gene in the mouse, for 
example, may affect many different tissue types at different stages of develop- 
ment—whereas the loss of other genes may have no obvious effect. Adequately 
characterizing mutant phenotypes in mice often requires a thorough examina- 
tion, along with extensive knowledge of mouse anatomy, histology, pathology, 
physiology, and complex behavior. 

The insights generated by examination of mutant libraries, however, will be 
great. For example, studies of an extensive collection of mutants in Mycoplasma 
genitalium— the organism with the smallest known genome— have identified 
the minimum complement of genes essential for cellular life. Analysis of the 
mutant pool suggests that growth under laboratory conditions requires about 
three-quarters of the 480 protein-coding genes in M. genitalium. Approximately 
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100 of these essential genes are of unknown function, which suggests that a sur- 
prising number of the basic molecular mechanisms that underlie life have yet to 
be discovered. 



RNA Interference Is a Simple and Rapid Way to Test Gene 
Function 

Although knocking out a gene in an organism and studying the consequences is 
perhaps the most powerful approach for understanding the functions of the 
gene, a much easier way to inactivate genes has been recently discovered. Called 
RNA interference (RNAi, for short), this method exploits a natural mechanism 
used in many plants, animals, fungi, and protozoa to protect themselves against 
certain viruses and transposable elements (see Figure 7-115). The technique 
introduces into a cell or organism a double -stranded RNA molecule whose 
nucleotide sequence matches that of part of the gene to be inactivated. After the 
RNA is processed, it hybridizes with the mRNA produced by the target gene and 
directs its degradation. The cell subsequently uses small fragments of this 
degraded RNA to produce more double-stranded RNA, which directs the con- 
tinued elimination of the target mRNA. Because these short RNA fragments can 
be passed on to progeny cells, RNAi can cause heritable changes in gene expres- 
sion. But, as we saw in Chapter 7, there is a second mechanism through which 
RNAi can stably inactivate genes. RNA fragments produced by degradation in 
the cytosol can enter the nucleus and interact with the target gene itself, direct- 
ing its packaging into a transcriptionally repressed form of chromatin. This dual 
mode of controlling gene expression makes RNAi an especially effective tool for 
shutting down genes, one at a time. 

RNAi is frequently used to inactivate genes in Drosophila and mammalian 
cell culture lines. Indeed, sets of 15,000 Drosophila RNAi molecules (one for 
every gene) allow researchers, in several months, to test the role of every fly gene 
in any process that can be monitored using cultured cells. Soon, it will be possi- 
ble to carry out the same type of analysis with the 25,000 mouse and human 
genes. RNAi has also been widely used to study gene function in the nematode, 
C. elegans. When working with worms, introducing the double-stranded RNA is 
quite simple: the RNA can be injected directly into the intestine of the animal, or 
the worm can be fed with E. coli engineered to produce the RNA (Figure 8-69). 
The RNA is distributed throughout the body of the worm, where it inhibits 
expression of the target gene in different tissue types. Because the entire genome 
of C. elegans has been sequenced, RNAi is being used to help in assigning func- 
tions to the entire complement of worm genes. 

More recently, a related technique has also been widely applied to mice. In 
this case, the RNAi is not injected or fed to the mouse; rather, recombinant DNA 
techniques are used to make transgenic animals that express the RNAi under the 
control of an inducible promoter. Often this is a specially designed RNA that can 
fold back on itself and, through base pairing, produce a double-stranded region 
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Figure 8-69 Dominant-negative mutation created by RNA interference. (A) Double- 
stranded RNA (dsRNA) can be introduced into C elegans (1) by feeding the worms 
with £ coli expressing the dsRNA or (2) by injecting dsRNA directly into the gut. 
(B) Wild-type worm embryo shortly after the egg has been fertilized. The egg and 
sperm pronuclei {red arrowheads) have migrated and come together in the posterior 
half of the embryo. (C) Worm embryo at the same stage in which a gene involved in 
cell division has been inactivated by RNAi. The two pronuclei have failed to migrate. 
(B and C, from P. Gonczy et al„ Nature 408:331-336, 2000. With permission from 
Macmillan Publishers Ltd.) 
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that is recognized by the RNAi machinery. The process inactivates only the genes 
that exactly match the RNAi sequence. Depending on the inducible promoter 
used, the RNAi can be produced only in a specified tissue or only at a particular 
time in development, allowing the functions of the target genes to be analyzed 
in elaborate detail. 

RNAi has made reverse genetics simple and efficient in many organisms, but 
it has several potential limitations compared with true genetic knockouts. For 
unknown reasons, RNAi does not efficiently inactivate all genes. Moreover, 
within whole organisms, certain tissues may be resistant to the action of RNAi 
(for example, neurons in nematodes). Another problem arises because many 
organisms contain large gene families, the members of which exhibit sequence 
similarity. RNAi therefore sometimes produces "off-target" effects, inactivating 
related genes in addition to the targeted gene. One strategy to avoid such prob- 
lems is to use multiple small RNA molecules matched to different regions of the 
same gene. Ultimately, the results of any RNAi experiment must be viewed as a 
strong clue to, but not necessarily a proof of, normal gene function. 



Reporter Genes and In Situ Hybridization Reveal When and Where 
a Gene Is Expressed 

Important insights into gene function can often be obtained by examining when 
and where a gene is expressed in the cell or in the whole organism. Determining 
the pattern and timing of gene expression can be accomplished by replacing the 
coding portion of the gene under study with a reporter gene. In most cases, the 
expression of the reporter gene is then monitored by tracking the fluorescence 
or enzymatic activity of its protein product (see Figures 9-26 and 9-27). 

As discussed in detail in Chapter 7, regulatory DNA sequences, located 
upstream or downstream of the coding region, control gene expression. These 
regulatory sequences, which determine exactly when and where the gene is 
expressed, can be easily studied by placing a reporter gene under their control 
and introducing these recombinant DNA molecules into cells (Figure 8-70). 
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Figure 8-70 Using a reporter protein to 
determine the pattern of a gene's 
expression. (A) In this example, the 
coding sequence for protein X is replaced 
by the coding sequence for reporter 
protein Y.The expression pattern for 
X and Y are the same. (B) Various 
fragments of DNA containing candidate 
regulatory sequences are added in 
combinations to produce test DNA 
molecules encoding reporter gene Y. 
These recombinant DNA molecules are 
then tested for expression after their 
transfection into a variety of different 
types of mammalian cells. The results are 
summarized in (C). 

For experiments in eucaryotic cells, two 
commonly used reporter proteins are the 
enzyme (3-galactosidase (fi-gal) (see 
Figure 7-55B) and green fluorescent 
protein or GFP (see Figure 9-26). Figure 
7-55B shows an example in which the 
fi-gal gene is used to monitor the activity 
of the Eve gene regulatory sequence in a 
Drosophila embryo. 
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It is also possible to directly observe the time and place that the mRNA prod- 
uct of a gene is expressed. Although this strategy often provides the same gen- 
eral information as the reporter gene approaches discussed above, there are 
instances where it provides additional information; for example, when the gene 
is transcribed but the mRNA is not immediately translated, or when the gene's 
final product is RNA rather than protein. This procedure, called in situ 
hybridization, relies on the principles of nucleic acid hybridization described 
earlier. Typically, tissues are gently fixed so that their RNA is retained in an 
exposed form that can hybridize with a labeled complementary DNA or RNA 
probe. In this way the patterns of differential gene expression can be observed 
in tissues, and the location of specific RNAs in cells can be determined (Figure 
8-71). In the Drosophila embryo, for example, such patterns have provided new 
insights into the mechanisms that create distinctions between cells in different 
positions during development (described in Chapter 22). 

Using similar approaches, it is also possible to visualize specific DNA 
sequences in cells. In this case, tissue, cell, or even chromosome preparations 
are briefly exposed to high pH to disrupt their nucleotide pairs, and nucleic acid 
probes are added, allowed to hybridize with the cells' DNA, and then visualized 
(see Figure 8-35). 



Figure 8-71 In situ hybridization for RNA 
localization. (A) Expression pattern of 
DeitaC mRNA in the early zebrafish 
embryo. This gene codes for a ligand in 
the Notch signaling pathway (discussed 
in Chapter 15), and the pattern shown 
here reflects its role in the development 
of somites — the future segments of the 
vertebrate trunk and tail. (B) High- 
resolution RNA in situ localization reveals 
the sites within the nucleolus of a pea 
cell where ribosomal RNA is synthesized. 
The sausage-like structures, 0.5-1 |im in 
diameter, correspond to the loops of 
chromosomal DNA that contain the 
genes encoding rRNA. Each small white 
spot represents transcription of a single 
rRNA gene. (A, courtesy of Yun-Jin Jiang; 
B, courtesy of Peter Shaw.) 



Expression of Individual Genes Can Be Measured Using 
Quantitative RT-PCR 

Although reporter genes and in situ hybridization reveal patterns of gene expres- 
sion, it is often desirable to quantitate gene expression by directly measuring 
mRNA levels in cells. Although Northern blots (see Figure 8-38) can be adapted 
to this purpose, a more accurate method is based on the principles of PCR (Fig- 
ure 8-72). This method, called quantitative RT-PCR (reverse transcription-poly- 
merase chain reaction), begins with the total population of mRNA molecules 
purified from a tissue or a cell culture. It is important that no DNA be present in 
the preparation; it must be purified away or enzymatically degraded. Two DNA 
primers that specifically match the gene of interest are added, along with reverse 
transcriptase, DNA polymerase, and the four deoxynucleoside triphosphates 
needed for DNA synthesis. The first round of synthesis is the reverse transcrip- 
tion of the mRNA into DNA using one of the primers. Next, a series of heating 
and cooling cycles allows the amplification of that DNA strand by conventional 
PCR (see Figure 8-45). The quantitative part of this method relies on a direct rela- 
tionship between the rate at which the PCR product is generated and the origi- 
nal concentration of the mRNA species of interest. By adding chemical dyes to 
the PCR reaction that fluoresce only when bound to double -stranded DNA, a 
simple fluorescence measurement can be used to track the progress of the reac- 
tion and thereby accurately deduce the starting concentration of the mRNA that 
is amplified (see Figure 8-72). Although it seems complicated, this quantitative 
RT-PCR techniaue (sometimes called real time PCR) is relatively fast and simple 




time (number of PCR cycles) ► 

Figure 8-72 RNA levels can be 
measured by quantitative RT-PCR. The 

fluorescence measured is generated by a 
dye that fluoresces only when bound to 
the double-stranded DNA products of 
the RT-PCR reaction (see Figure 8-46B). 
The red sample has a higher 
concentration of the mRNA being 
measured than does the blue sample, 
since it requires fewer PCR cycles to reach 
the same half-maximal concentration of 
double-stranded DNA. Based on this 
difference, the relative amounts of the 
mRNA in the two samples can be 
precisely determined. 
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Figure 8-73 Using DNA microarrays to monitor the expression of 
thousands of genes simultaneously. To prepare the microarray, DNA 
fragments — each corresponding to a gene — are spotted onto a slide by a 
robot. Prepared arrays are also widely available commercially. In this 
example, mRNA is collected from two different cell samples for a direct 
comparison of their relative levels of gene expression; the two samples, for 
example, could be from cells treated with a hormone and untreated cells of 
the same type. These samples are converted to cDNA and labeled, one with 
a red fluorochrome, the other with a green fluorochrome. The labeled 
samples are mixed and then allowed to hybridize to the microarray. After 
incubation, the array is washed and the fluorescence scanned. In the portion 
of a microarray shown, which represents the expression of 1 10 yeast genes, 
red spots indicate that the gene in sample 1 is expressed at a higher level 
than the corresponding gene in sample 2; green spots indicate that 
expression of the gene is higher in sample 2 than in sample 1 . Yellow spots 
reveal genes that are expressed at equal levels in both cell samples. Dark 
spots indicate little or no expression in either sample of the gene whose 
fragment is located at that position in the array. (Microarray courtesy of 
J.L. DeRisi et al., Science 278:680-686, 1997. With permission from AAAS.) 

to perform in the laboratory; it has displaced Northern blotting as the method of 
choice for quantifying mRNA levels from any given gene. 



Microarrays Monitor the Expression of Thousands of Genes at 
Once 

So far we have discussed techniques that can be used to monitor the expression 
of only a single gene (or relatively few genes) at a time. Developed in the 1990s, 
DNA microarrays have revolutionized the analysis of gene expression by moni- 
toring the RNA products of thousands of genes at once. By examining the 
expression of so many genes simultaneously, we can now begin to identify and 
study the gene expression patterns that underlie cell physiology: we can see 
which genes are switched on (or off) as cells grow, divide, differentiate, or 
respond to hormones or to toxins. 

DNA microarrays are little more than glass microscope slides studded with 
a large number of DNA fragments, each containing a nucleotide sequence that 
serves as a probe for a specific gene. The most dense arrays may contain tens of 
thousands ofdie.se fragments in an area smaller than a postage stamp, allowing 
thousands Of hybridization reactions to be performed in parallel (Figure 8-73)'. 
Some microarrays are prepared from large DNA fragments that have been gen- 
erated by PCR and then spotted onto the slides by a robot Others contain short 
oligonucleotides that are synthesized on the surface of the glass wafer with tech- 
niques similar to those that are used to etch circuits onto computer chips. In 
either case, the exact sequence— and position— of every probe on the chip is 
known. Thus, any nucleotide fragment that hybridizes to a probe on the array 
can be identified as the product of a specific gene simply by detecting the posi- 
tion at which it is hound. 

To use a DNA microarray to monitor gene expression, mRNA from the cells 
being studied is first extracted and converted to cDNA (see Figure JM3). The 
cDNA is then labeled with a fluorescent probe. The microarray is incubated with 
this labeled cDNA sample and hybridization is allowed to occur (see Figure 8-73), 
The array is then washed to remove cDNA that is not tightly bound, and the posi- 
tions in the microarray to which labeled DNA fragments have bound are identified 
by an automated scanning-laser microscope. The array positions are then 
matched to the particular gene whose sample of DNA was spotted m this location. 

Typically the fluorescent DNA from the experimental samples (labeled, for 
example, with a red fluorescent dye) are mixed with a reference sample of cDNA 
fragments labeled with a differently colored fluorescent dye (green, for example). 
Thus, if the amount of RNA expressed from a particular gene in the cells of inter- 
est is increased relative to that of the reference sample, the resulting spot is red. 
Conversely, if the gene's expression is decreased relative to the reference sample, 
the spot is green. If there is no change compared to the reference sample, the spot 
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Figure 8-74 Using cluster analysis to identify sets of genes that are coordinately regulated. Genes that belong to the 
same cluster may be involved in common pathways or processes. To perform a cluster analysis, microarray data are obtained 
from cell samples exposed to a variety of different conditions, and genes that show coordinate changes in their expression 
pattern are grouped together. In this experiment, human fibroblasts were deprived of serum for 48 hours; serum was then 
added back to the cultures at time and the cells were harvested for microarray analysis at different time points. Of the 
8600 genes analyzed on the DNA microarray, just over 300 showed threefold or greater variation in their expression patterns 
in response to serum re-introduction. Here, red indicates an increase in expression; green is a decrease in expression. On the 
basis of the results of many microarray experiments, the 8600 genes have been grouped in clusters based on similar 
patterns of expression. The results of this analysis show that genes involved in wound healing are turned on in response to 
serum, while genes involved in regulating cell cycle progression and cholesterol biosynthesis are shut down. (From 
M.B. Eisen et al., Proc. Natl Acad. Sci. USA. 95:14863-14868, 1998. With permission from National Academy of Sciences.) 

is yellow. Using such an internal reference, gene expression profiles can be tab- 
ulated with great precision. 

So far, DNA microarrays have been used to examine everything from the 
changes in gene expression that make strawberries ripen to the gene expression 
"signatures" of different types of human cancer cells (see Figure 7-3); or from 
changes that occur as cells progress through the cell cycle to those made in 
response to sudden shifts in temperature. Indeed, because microarrays allow 
the simultaneous monitoring of large numbers of genes, they can detect subtle 
changes in a cell, changes that might not be manifested in its outward appear- 
ance or behavior. 

Comprehensive studies of gene expression also provide an additional layer 
of information that is useful for predicting gene function. Earlier, we discussed 
how identifying a protein's interaction partners can yield clues about that pro- 
tein's function. A similar principle holds true for genes: information about a 
gene's function can be deduced by identifying genes that share its expression 
pattern. Using a technique called cluster analysis, one can identify sets of genes 
that are coordinately regulated. Genes that are turned on or turned off together 
under different circumstances are likely to work in concert in the cell: they may 
encode proteins that are part of the same multiprotein machine, or proteins that 
are involved in a complex coordinated activity, such as DNA replication or RNA 
splicing. Characterizing a gene whose function is unknown by grouping it with 
known genes that share its transcriptional behavior is sometimes called "guilt by 
association." Cluster analyses have been used to analyze the gene expression 
profiles that underlie many interesting biological processes, including wound 
healing in humans (Figure 8-74). 

In addition to monitoring the level of mRNA corresponding to every gene in 
a genome, DNA microarrays have many other uses. For example, they can be 
used to monitor the progression of DNA replication in a cell (see Figure 5-32) 
and, when combined with immunoprecipitation, can pinpoint every position in 
the genome occupied by a given gene regulatory protein (see Figure 7-32). 
Microarrays can also be used to quickly identify disease-causing microbes by 
hybridizing DNA from infected tissues to an array containing genomic DNA 
sequences from large collections of pathogens. 

Single-Cell Gene Expression Analysis Reveals Biological "Noise" 

The methods for monitoring mRNAs just described give average expression lev- 
els for each mRNA across a large population of cells. By using a fluorescent 
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reporter protein whose expression is under the control of a promoter of interest, 
it is also possible to accurately measure expression levels in individual cells. 
These new approaches have revealed a startling amount of variability, often 
called biological noise, between the individual cells in a homogeneous popula- 
tion of cells. These studies have also revealed the presence of distinct subpopu- 
lations of cells whose existence would be masked if only the average across a 
whole population were considered. For example, a bimodal distribution of 
expression levels would indicate that the cells can exist in two distinct states 
(Figure 8-75), with the average expression level of the population being some- 
where between them. The behavior of individual cells has important implica- 
tions for understanding biology, for example, by revealing that some cells con- 
stantly and rapidly switch back and forth between two states. 

Currently, there are two approaches for monitoring gene expression in indi- 
vidual cells. In the imaging approach, live cells are mounted on a slide and 
viewed through a fluorescence microscope. This method has the advantage that 
a given cell can be followed over time, allowing temporal changes in expression 
to be measured. The second approach, flow cytometry, works by streaming a 
dilute suspension of cells past an illuminator and measuring the fluorescence of 
individual cells as they flow past the detector (see Figure 8-2). Although it has 
the advantage that the expression levels of very large numbers of cells can be 
measured with precision, flow cytometry does not allow a given cell to be 
tracked over time; hence, it is complementary to the imaging methods. 



Summary 

Genetics and genetic engineering provide powerful tools for the study of gene function 
in both cells and organisms. In the classical genetic approach, random mutagenesis is 
coupled with screening to identify mutants that are deficient in a particular biological 
process. These mutants are then used to locate and study the genes responsible for that 
process. 

Gene function can also be ascertained by reverse genetic techniques. DNA engi- 
neering methods can be used to alter genes and to re-insert them into a cell's chromo- 
somes so that they become a permanent part of the genome. If the cell used for this gene 
transfer is a fertilized egg (for an animal) or a totipotent plant cell in culture, trans- 
genic organisms can be produced that express the mutant gene and pass it on to their 
progeny. Especially important for cell biology is the ability to alter cells and organisms 
in highly specific ways— allowing one to discern the effect on the cell or the organism 
of a designed change in a single protein or RNA molecule. 

Many of these methods are being expanded to investigate gene function on a 
genome-wide scale. The generation of mutant libraries in which every gene in an 
organism has been systematically deleted or disrupted provides invaluable tools for 
exploring the role of each gene in the elaborate molecular collaboration that gives rise 
to life. Technologies such as DNA microarrays can monitor the expression of thousands 
of genes simultaneously, providing detailed, comprehensive snapshots of the dynamic 
patterns of gene expression that underlie complex cell processes. 




Figure 8-75 Different levels of gene 
expression in individual cells within a 
population of E. coli bacteria. For this 
experiment, two different reporter 
proteins (one fluorescing green, the other 
red) controlled by a copy of the same 
promoter, have been introduced into all 
of the bacteria. When illuminated, some 
cells express only one gene copy, and so 
appear either red or green, while others 
express both gene copies, and so appear 
yellow. This experiment also reveals 
variable levels of fluorescence, indicating 
variable levels of gene expression within 
an apparently uniform population of 
cells. (From M.B. Elowitz, AJ. Levine, 
E.O. Siggia and RS. Swain, Science 
297:1 1 83-1 1 86, 2002. With permission 
from AAAS.) 



PROBLEMS 

Which statements are true? Explain why or why not. 

8- 1 Because a monoclonal antibody recognizes a specific 
antigenic site (epitope), it binds only to the specific protein 
against which it was made. 

8-2 Given the inexorable progress of technology it seems 
inevitable that the sensitivity of detection of molecules will 
ultimately be pushed beyond the yoctomole level (10~ 24 mole) . 

8-3 Surface plasmon resonance (SPR) measures associa- 
tion (£r n) and dissociation (Aoff) rates between molecules in 



real time, using small amounts of unlabeled molecules, but 
it does not give the information needed to determine the 
binding constant [K). 

8-4 If each cycle of PCR doubles the amount of DNA syn- 
thesized in the previous cycle, then 10 cycles will give a 10 3 - 
fold amplification, 20 cycles will give a 10 6 -fold amplifica- 
tion, and 30 cycles will give a 10 9 -fold amplification. 

Discuss the following problems. 

8-5 A common step in the isolation of cells from a sam- 
ple of animal tissue is to treat it with trypsin, collagenase, 
and EDTA. Why is such a treatment necessary, and what 
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does each component accomplish? And why does this treat- 
ment not kill the cells? 

8-6 Do you suppose it would be possible to raise an anti- 
body against another antibody? Explain your answer. 

8-7 Distinguish between velocity sedimentation and 
equilibrium sedimentation. For what general purpose is 
each technique used? Which do you suppose might be best 
suited for separating two proteins of different size? 

8-8 Tropomyosin, at 93 kd, sediments at 2.6 S, whereas 
the 65-kd protein, hemoglobin, sediments at 4.3 S. (The sed- 
imentation coefficient S is a linear measure of the rate of 
sedimentation: both increase or decrease in parallel.) These 
two proteins are shown as a-carbon backbone models in 
Figure Q8-1. How is it that the bigger protein sediments 
more slowly than the smaller one? Can you think of an anal- 
ogy from everyday experience that might help you with this 
problem? 

^^^™^&W^. tropomyosin 

hemoglobin 

Figure Q8-1 Backbone models of tropomyosin and hemoglobin 
(Problem 8-8). 

8-9 In the classic paper that demonstrated the semi-con- 
servative replication of DNA, Meselson and Stahl began by 
showing that DNA itself will form a band when subjected to 
equilibrium sedimentation. They mixed randomly frag- 
mented E. coli DNA with a solution of CsCl so that the final 
solution had a density of 1.71 g/mL. As shown in Figure 
Q8-2, with increasing length of centrifugation at 70,000 
times gravity, the DNA, which was initially dispersed 
throughout the centrifuge tube, became concentrated over 
time into a discrete band in the middle. 
A. Describe what is happening with time and explain why 
the DNA forms a discrete band. 



hours 



centrifugal field 




Figure Q8-2 Ultraviolet 
absorption photographs 
showing successive stages 
in the banding of E. coli 
DNA (Problem 8-9). DNA, 
which absorbs UV light, 
shows up as dark regions in 
the photographs. The 
bottom of the centrifuge 
tube is on the right. (From 
M. Meselson and F.W. Stahl, 
Proc Natl Acad. Sci U.S.A. 
44:67 1-682,1 958. With 
permission from National 
Academy of Sciences.) 



B. What is the buoyant density of the DNA? (The density of 
the solution at which DNA "floats" at equilibrium defines 
the "buoyant density" of the DNA.) 

C Even if the DNA were centrifuged for twice as long — or 
even longer — the width of the band remains about what is 
shown at the bottom of Figure Q8-2. Why does the band not 
become even more compressed? Suggest some possible rea- 
sons to explain the thickness of the DNA band at equilib- 
rium. 

8-10 Hybridoma technology allows one to generate mon- 
oclonal antibodies to virtually any protein. Why is it then 
that tagging proteins with epitopes is such a commonly 
used technique, especially since an epitope tag has the 
potential to interfere with the function of the protein? 

8-1 1 How many copies of a protein need to be present in a 
cell in order for it to be visible as a band on a gel? Assume 
that you can load 100 u.g of cell extract onto a gel and that 
you can detect 10 ng in a single band by silver staining. The 
concentration of protein in cells is about 200 mg/mL, and a 
typical mammalian cell has a volume of about 1000 jam 3 and 
a typical bacterium a volume of about 1 urn 3 . Given these 
parameters, calculate the number of copies of a 120-kd pro- 
tein that would need to be present in a mammalian cell and 
in a bacterium in order to give a detectable band on a gel. 
You might try an order-of-magnitude guess before you make 
the calculations. 

8-12 You want to amplify the DNA between the two 
stretches of sequence shown in Figure Q8-3. Of the listed 
primers choose the pair that will allow you to amplify the 
DNA by PCR. 



DNA to be amplified 



5'-GACCTGTGGAAGC - 
3'-CTGGACACCTTCG - 



(1) 5'-GACCTGTCCAAGC-3' 

(2) 5'-CTGGACACCTTCG-3' 

(3) 5'-CGAAGGTGTCCAG-3' 

(4) 5'-GCTTCCACAGGTC-3' 



- CATACGGGATTGA- 3' 
-GTATGCCCTAACT- 5' 



primers 



(5) 5'-CATACGGGATTGA-3' 

(6) 5'-GTATGCCCTAACT-3' 

(7) 5 / -TGTTAGGGCATAC-3 / 

(8) 5'-TCAATCCCGTATG-3' 



Figure Q8-3 DNA to be amplified and potential PCR primers 
(Problem 8-12). 



8-13 In the very first round of PCR using genomic DNA, 
the DNA primers prime synthesis that terminates only when 
the cycle ends (or when a random end of DNA is encoun- 
tered). Yet, by the end of 20 to 30 cycles— a typical amplifi- 
cation—the only visible product is defined precisely by the 
ends of the DNA primers. In what cycle is a double- stranded 
fragment of the correct size first generated? 

8-14 Explain the difference between a gain-of-function 
mutation and a dominant- negative mutation. Why are both 
these types of mutation usually dominant? 

8-1 5 Discuss the following statement: "We would have no 
idea today of the importance of insulin as a regulatory hor- 
mone if its absence were not associated with the devastating 
human disease diabetes. It is the dramatic consequences of 
its absence that focused early efforts on the identification of 
insulin and the study of its normal role in physiology." 
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Chapter 9 



Visualizing Cells 




Because cells are small and complex, it is hard to see their structure, hard to dis- 
cover their molecular composition, and harder still to find out how their various 
components function. The tools at our disposal determine what we can learn 
about cells, and the introduction of new techniques has frequently resulted in 
major advances in cell biology. To understand contemporary cell biology, there- 
fore, it is necessary to know something of its methods. 

In this chapter, we briefly describe some of the principal microscopy meth- 
ods used to study cells. Understanding the structural organization of cells is an 
essential prerequisite for learning how cells function. Optical microscopy will be 
our starting point because cell biology began with the light microscope, and it is 
still an essential tool. In recent years optical microscopy has become ever more 
important, largely owing to the development of methods for the specific label- 
ing and imaging of individual cellular constituents and the reconstruction of 
their three-dimensional architecture. An important advantage of optical 
microscopy is that light is relatively nondestructive. By tagging specific cell com- 
ponents with fluorescent probes, such as intrinsically fluorescent proteins, we 
can thus watch their movement, dynamics, and interactions in living cells. Opti- 
cal microscopy is limited in resolution by the wavelength of visible light. By 
using a beam of electrons instead, electron microscopy can image the macro- 
molecular complexes within cells at almost atomic resolution, and in three 
dimensions. 

Although optical microscopy and electron microscopy are important meth- 
ods, it is what they have enabled scientists to discover about the structural archi- 
tecture of the cell that makes them interesting. Use this chapter as a reference 
and read it in conjunction with the later chapters of the book rather than view- 
ing it as an introduction to them. 
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LOOKING AT CELLS IN THE LIGHT MICROSCOPE 

A typical animal cell is 10-20 jim in diameter, which is about one-fifth the size of 
the smallest particle visible to the naked eye. Only after good light microscopes 
became available in the early part of the nineteenth century did Schleiden and 
Schwann propose that all plant and animal tissues were aggregates of individual 
cells. Their discovery in 1838, known as the cell doctrine, marks the formal birth 
of cell biology. 

Animal cells are not only tiny, but they are also colorless and translucent. 
Consequendy, the discovery of their main internal features depended on the 
development, in the latter part of the nineteenth century, of a variety of stains 
that provided sufficient contrast to make those features visible. Similarly, the far 
more powerful electron microscope introduced in the early 1940s required the 
development of new techniques for preserving and staining cells before the full 
complexities of their internal fine structure could begin to emerge. To this day, 
microscopy relies as much on techniques for preparing the specimen as on the 
performance of the microscope itself. In the following discussions, we therefore 
consider both instruments and specimen preparation, beginning with the light 
microscope. 
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The series of images in Figure 9-1 illustrate an imaginary progression from 
a thumb to a cluster of atoms. Each successive image represents a tenfold 
increase in magnification. The naked eye could see features in the first two pan- 
els, the resolution of the light microscope would extend to about the fourth 
panel, and the electron microscope to between about the seventh and eighth 
panel. Figure 9-2 shows the sizes of various cellular and subcellular structures 
and the ranges of size that different types of microscopes can visualize. 



The Light Microscope Can Resolve Details 0.2 ^m Apart 

A fundamental limitation of all microscopes is that a given type of radiation can- 
not be used to probe structural details much smaller than its own wavelength. 
The ultimate limit to the resolution of a light microscope is therefore set by the 
wavelength of visible light, which ranges from about 0.4 |im (for violet) to 0.7 jam 



Figure 9-1 A sense of scale between 
living cells and atoms. Each diagram 
shows an image magnified by a factor of 
ten in an imaginary progression from a 
thumb, through skin cells, to a ribosome, 
to a cluster of atoms forming part of one 
of the many protein molecules in our 
body. Atomic details of macromolecules, 
as shown in the last two panels, are 
usually beyond the power of the electron 
microscope. 
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(for deep red). In practical terms, bacteria and mitochondria, which are about 
500 nm (0.5 jam) wide, are generally the smallest objects whose shape we can 
clearly discern in the light microscope; smaller details than this are obscured by 
effects resulting from the wavelike nature of light. To understand why this 
occurs, we must follow the path of a beam of light waves as it passes through the 
lenses of a microscope (Figure 9-3). 

Because of its wave nature, light does not follow exactly the idealized 
straight ray paths that geometrical optics predict. Instead, light waves travel 
through an optical system by several slightly different routes, so that they inter- 
fere with one another and cause optical diffraction effects. If two trains of waves 
reaching the same point by different paths are precisely in phase, with crest 
matching crest and trough matching trough, they will reinforce each other so as 
to increase brightness. In contrast, if the trains of waves are out of phase, they 
will interfere with each other in such a way as to cancel each other partly or 
entirely (Figure 9-4). The interaction of light with an object changes the phase 
relationships of the light waves in a way that produces complex interference 
effects. At high magnification, for example, the shadow of an edge that is evenly 
illuminated with light of uniform wavelength appears as a set of parallel lines 
(Figure 9-5), whereas that of a circular spot appears as a set of concentric rings. 
For the same reason, a single point seen through a microscope appears as a 
blurred disc, and two point objects close together give overlapping images and 
may merge into one. No amount of refinement of the lenses can overcome this 
limitation imposed by the wavelike nature of light. 

The limiting separation at which two objects appear distinct — the so-called 
limit of resolution — depends on both the wavelength of the light and the 
numerical aperture of the lens system used. The numerical aperture is a measure 
of the width of the entry pupil of the microscope, scaled according to its distance 
from the object; the wider the microscope opens its eye, so to speak, the more 
sharply it can see (Figure 9-6). Under the best conditions, with violet light 
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Figure 9-2 Resolving power. Sizes of 

cells and their components are drawn on 

a logarithmic scale, indicating the range 

of objects that can be readily resolved by 

the naked eye and in the light and 

electron microscopes. The following units 

of length are commonly employed in 

microscopy: 

jxm (micrometer) = 1 0~ 6 m 

nm (nanometer) = 10" 9 m 

A (Angstrom unit) = 10 -10 m 



specimen - 
condenser 



light 
source 



Figure 9-3 A light microscope. (A) Diagram showing the light path in a 
compound microscope. Light is focused on the specimen by lenses in the 
condenser. A combination of objective lenses and eyepiece lenses are 
arranged to focus an image of the illuminated specimen in the eye. 
(B) A modern research light microscope. (B, courtesy of Andrew Davies.) 
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TWO WAVES IN PHASE 



TWO WAVES OUT OF PHASE 






DIM 



BRIGHT 

Figure 9-4 Interference between light waves. When two light waves 
combine in phase, the amplitude of the resultant wave is larger and the 
brightness is increased. Two light waves that are out of phase cancel each 
other partly and produce a wave whose amplitude, and therefore 
brightness, is decreased. 



(wavelength = 0.4 fim) and a numerical aperture of 1.4, the light microscope can 
theoretically achieve a limit of resolution of just under 0.2 Lim. Microscope mak- 
ers at the end of the nineteenth century achieved this resolution and it is only 
rarely matched in contemporary, factory-produced microscopes. Although it is 
possible to enlarge an image as much as we want — for example, by projecting it 
onto a screen — it is never possible to resolve two objects in the light microscope 
that are separated by less than about 0.2 jum; they will appear as a single object. 
Notice the difference between resolution, discussed above, and detection. If a 
small object, below the resolution limit, itself emits light, then we may still be 
able to see or detect it. Thus, we can see a single fluorescently labeled micro- 
tubule even though it is about ten times thinner than the resolution limit of the 
light microscope. Diffraction effects, however, will cause it to appear blurred and 
at least 0.2 urn thick (see Figure 9-17). Because of the bright light they emit we 
can detect or see the stars in the night sky, even though they are far below the 
angular resolution of our unaided eyes. They all appear as similar points of light, 




Figure 9-5 Images of an edge and of a 
point of light (A) The interference 
effects, or fringes, seen at high 
magnification when light of a specific 
wavelength passes the edge of a solid 
object placed between the light source 
and the observer. (B) The image of a 
point source of light. Diffraction spreads 
this out into a complex, circular pattern, 
whose width depends on the numerical 
aperture of the optical system: the 
smaller the aperture the bigger (more 
blurred) the diffracted image. Two point 
sources can be just resolved when the 
center of the image of one lies on the 
first dark ring in the image of the other: 
this defines the limit of resolution. 
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RESOLUTION: the resolving power of the 
microscope depends on the width of the 
cone of illumination and therefore on both 
the condenser and the objective lens. It is 
calculated using the formula 



resolution = 



0.61 X 
n sin 6 



where: 



0= half the angular width of the cone of 
rays collected by the objective lens 
from a typical point in the specimen 
(since the maximum width is 180°, 
sin 6 has a maximum value of 1) 

n = the refractive index of the medium 
(usually air or oil) separating the 
specimen from the objective and 
condenser lenses 

X = the wavelength of light used (for white 
light a figure of 0.53 jam is commonly 
assumed) 



RE: n sin 9 in the 
equation above is called the numerical aperture 
of the lens (NA) and is a function of its light- 
collecting ability. For dry lenses this cannot be 
more than 1, but for oil-immersion lenses it can 
be as high as 1 4. The higher the numerical 



aperture, the greater the resolution and the 
brighter the image (brightness is important in 
fluorescence microscopy) However, this advan- 
tage is obtained at the expense of very short 
working distances and a very small depth of field. 



Figure 9-6 Numerical aperture. The path 
of light rays passing through a 
transparent specimen in a microscope 
illustrates the concept of numerical 
aperture and its relation to the limit of 
resolution. 



LOOKING AT CELLS IN THE LIGHT MICROSCOPE 



583 



differing only in their color or brightness. Using sensitive detection methods, we 
can detect and follow the behavior of even a single fluorescent protein molecule 
with a light microscope. 

We see next how we can exploit interference and diffraction to study 
unstained cells in the living state. 



Living Cells Are Seen Clearly in a Phase-Contrast or a 
Differential-lnterference-Contrast Microscope 

Microscopists have always been challenged by the possibility that some compo- 
nents of the cell may be lost or distorted during specimen preparation. The only 
certain way to avoid the problem is to examine cells while they are alive, without 
fixing or freezing. For this purpose, light microscopes with special optical sys- 
tems are especially useful. 

When light passes through a living cell, the phase of the light wave is 
changed according to the cell's refractive index: a relatively thick or dense part of 
the cell, such as a nucleus, retards light passing through it. The phase of the light, 
consequently, is shifted relative to light that has passed through an adjacent 
thinner region of the cytoplasm. The phase-contrast microscope and, in a more 
complex way, the differential-interference-contrast microscope exploit the 
interference effects produced when these two sets of waves recombine, thereby 
creating an image of the cell's structure (Figure 9-7). Both types of light 
microscopy are widely used to visualize living cells. <TCAA> 

A simpler way to see some of the features of a living cell is to observe the 
light that is scattered by its various components. In the dark-field microscope, 
the illuminating rays of light are directed from the side so that only scattered 
light enters the microscope lenses. Consequently, the cell appears as a bright 
object against a dark background. With a normal bright-field microscope, light 
passing through a cell in culture forms the image directly. Figure 9-8 compares 
images of the same cell obtained by four kinds of light microscopy. 

Phase-contrast, differential-interference-contrast, and dark-field microsc- 
opy make it possible to watch the movements involved in such processes as mito- 
sis and cell migration. Since many cellular motions are too slow to be seen in real 
time, it is often helpful to make time-lapse movies. Here, the camera records suc- 
cessive frames separated by a short time delay, so that when the resulting picture 
series is played at normal speed, events appear greatly speeded up. 



Images Can Be Enhanced and Analyzed by Digital Techniques 

In recent years electronic, or digital, imaging systems, and the associated tech- 
nology of image processing, have had a major impact on light microscopy. Cer- 
tain practical limitations of microscopes, relating to imperfections in the optical 



(A) incident light 
(white) 



(B) incident light 
(green) 




unstained 
cell 




Figure 9-7 Two ways to obtain contrast 
in light microscopy. (A) The stained 
portion of the cell will absorb light of 
some wavelengths, which depend on the 
stain, but will allow other wavelengths to 
pass through it. A colored image of the 
cell is thereby obtained that is visible in 
the normal bright-field light microscope. 
(B) Light passing through the unstained, 
living cell experiences very little change 
in amplitude, and the structural details 
cannot be seen even if the image is 
highly magnified. The phase of the light, 
however, is altered by its passage 
through either thicker or denser parts of 
the cell, and small phase differences can 
be made visible by exploiting 
interference effects using a phase- 
contrast or a differential-interference- 
contrast microscope. 
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Figure 9-8 Four types of light microscopy. Four images are shown of the same fibroblast cell in culture. All images can be 
obtained with most modern microscopes by interchanging optical components. (A) Bright-field microscopy. (B) Phase- 
contrast microscopy. (C) Nomarski differential-interference-contrast microscopy. (D) Dark-field microscopy. 



system have been largely overcome. Electronic imaging systems have also cir- 
cumvented two fundamental limitations of the human eye: the eye cannot see 
well in extremely dim light, and it cannot perceive small differences in light 
intensity against a bright background. To increase our ability to observe cells in 
low light conditions, we can attach a sensitive digital camera to a microscope. 
These cameras contain a charge-coupled device (CCD), similar to those found in 
consumer digital cameras. Such CCD cameras are often cooled to reduce image 
noise. It is then possible to observe cells for long periods at very low light levels, 
thereby avoiding the damaging effects of prolonged bright light (and heat) . Such 
low-light cameras are especially important for viewing fluorescent molecules in 
living cells, as explained below. 

Because images produced by CCD cameras are in electronic form, they can 
be readily digitized, fed to a computer, and processed in various ways to extract 
latent information. Such image processing makes it possible to compensate for 
various optical faults in microscopes to attain the theoretical limit of resolution. 
Moreover, by digital image processing, contrast can be greatly enhanced to over- 
come the eye's limitations in detecting small differences in light intensity. 
Although this processing also enhances the effects of random background irreg- 
ularities in the optical system, digitally subtracting an image of a blank area of 
the field removes such defects. This procedure reveals small transparent objects 
that were previously impossible to distinguish from the background. 

The high contrast attainable by computer-assisted differential-interference- 
contrast microscopy makes it possible to see even very small objects such as sin- 
gle microtubules (Figure 9-9), which have a diameter of 0.025 jam, less than one- 
tenth the wavelength of light. Individual microtubules can also be seen in a flu- 
orescence microscope if they are fluorescently labeled (see Figure 9-15). In both 
cases, however, the unavoidable diffraction effects badly blur the image so that 
the microtubules appear at least 0.2 urn wide, making it impossible to distin- 
guish a single microtubule from a bundle of several microtubules. 



Figure 9-9 Image processing. (A) Unstained microtubules are shown here 
in an unprocessed digital image, captured using differential-interference- 
contrast microscopy. (B) The image has now been processed, first by 
digitally subtracting the unevenly illuminated background, and second by 
digitally enhancing the contrast. The result of this image processing is a 
picture that is easier to interpret. Note that the microtubules are dynamic 
and some have changed length or position between the before-and-after 
images. (Courtesy of Viki Allan.) 
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Intact Tissues Are Usually Fixed and Sectioned before Microscopy 

Because most tissue samples are too thick for their individual cells to be exam- 
ined directly at high resolution, they must be cut into very thin transparent 
slices, or sections. To first immobilize, kill, and preserve the cells within the tis- 
sue they must be treated with a fixative. Common fixatives include formalde- 
hyde and glutaraldehyde, which form covalent bonds with the free amino 
groups of proteins, cross-linking them so they are stabilized and locked into 
position. 

Because tissues are generally soft and fragile, even after fixation, they need 
to be embedded in a supporting medium before sectioning. The usual embed- 
ding media are waxes or resins. In liquid form these media both permeate and 
surround the fixed tissue; they can then be hardened (by cooling or by polymer- 
ization) to form a solid block, which is readily sectioned with a microtome. This 
is a machine with a sharp blade that operates like a meat slicer (Figure 9-10). 
The sections (typically 1-10 jum thick) are then laid flat on the surface of a glass 
microscope slide. 

There is little in the contents of most cells (which are 70% water by weight) 
to impede the passage of light rays. Thus, most cells in their natural state, even 
if fixed and sectioned, are almost invisible in an ordinary light microscope. 
There are three main approaches to working with thin tissue sections that reveal 
the cells themselves or specific components within them. 

First, and traditionally, sections can be stained with organic dyes that have 
some specific affinity for particular subcellular components. The dye hema- 
toxylin, for example, has an affinity for negatively charged molecules and there- 
fore reveals the distribution of DNA, RNA, and acidic proteins in a cell (Figure 
9-11). The chemical basis for the specificity of many dyes, however, is not 
known. 

Second, sectioned tissues can be used to visualize specific patterns of differ- 
ential gene expression. In situ hybridization, discussed earlier (p. 573), reveals 
the cellular distribution and abundance of specific expressed RNA molecules in 
sectioned material or in whole mounts of small organisms or organs (Figure 
9-12). A third and very sensitive approach, generally and widely applicable for 
localizing proteins of interest, depends on using fluorescent probes and mark- 
ers, as we explain next. 
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a glass cover slip 



Figure 9-10 Making tissue sections. This 
illustration shows how an embedded 
tissue is sectioned with a microtome in 
preparation for examination in the light 
microscope. 
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Figure 9-11 Staining of cellular 
components. (A) This section of 
cells in the urine-collecting ducts 
of the kidney was stained with a 
combination of dyes, hematoxylin 
and eosin, commonly used in 
histology. Each duct is made of 
closely packed cells (with nuclei 
stained red) that form a ring. The 
ring is surrounded by extracellular 
matrix, stained purple. (B)This 
section of a young plant root is 
stained with two dyes, safranin 
and fast green. The fast green 
stains the cellulosic cell walls while 
the safranin stains the lignified 
xylem cell walls bright red. 
(A, from P.R. Wheater et al., 
Functional Histology, 2nd ed. 
London: Churchill Livingstone, 
1 987; B, courtesy of 
Stephen Grace.) 
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Specific Molecules Can Be Located in Cells by Fluorescence 
Microscopy 

Fluorescent molecules absorb light at one wavelength and emit it at another, 
longer wavelength. If we illuminate such a compound at its absorbing wave- 
length and then view it through a filter that allows only light of the emitted wave- 
length to pass, it will glow against a dark background. Because the background 
is dark, even a minute amount of the glowing fluorescent dye can be detected. 
The same number of molecules of an ordinary stain viewed conventionally 
would be practically invisible because the molecules would give only the faintest 
tinge of color to the light transmitted through this stained part of the specimen. 

The fluorescent dyes used for staining cells are visualized with a fluores- 
cence microscope. This microscope is similar to an ordinary light microscope 
except that the illuminating light, from a very powerful source, is passed through 
two sets of filters — one to filter the light before it reaches the specimen and one 
to filter the light obtained from the specimen. The first filter passes only the 
wavelengths that excite the particular fluorescent dye, while the second filter 
blocks out this light and passes only those wavelengths emitted when the dye 
fluoresces (Figure 9-13). 

Fluorescence microscopy is most often used to detect specific proteins or 
other molecules in cells and tissues. A very powerful and widely used technique 
is to couple fluorescent dyes to antibody molecules, which then serve as highly 
specific and versatile staining reagents that bind selectively to the particular 
macromolecules they recognize in cells or in the extracellular matrix. Two fluo- 
rescent dyes that have been commonly used for this purpose are fluorescein, 
which emits an intense green fluorescence when excited with blue light, and 



Figure 9-1 2 RNA in situ hybridization. 

As described in chapter 8 (see Figure 
8-71), it is possible to visualize the 
distribution of different RNAs in tissues 
using in situ hybridization. Here, the 
transcription pattern of five different 
genes involved in patterning the early fly 
embryo is revealed in a single embryo. 
Each RNA probe has been fluorescently 
labeled in a different way, some directly 
and some indirectly, and the resulting 
images false-colored and combined to 
see each individual transcript most 
clearly. The genes whose expression 
pattern is revealed here are wingless 
(yellow), engrailed (blue), short 
gastrulation (red), intermediate 
neuroblasts defective (green), and muscle 
specific homeobox (purple). (From 
D. Kosman et al., Science 305:846, 2004. 
With permission from AAAS.) 
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Figure 9-1 3 The optical system of a 
fluorescence microscope. A filter set 
consists of two barrier filters (1 and 3) 
and a dichroic (beam-splitting) mirror (2). 
This example shows the filter set for 
detection of the fluorescent molecule 
fluorescein. High-numerical-aperture 
objective lenses are especially important 
in this type of microscopy because, for a 
given magnification, the brightness of 
the fluorescent image is proportional to 
the fourth power of the numerical 
aperture (see also Figure 9-6). 
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Figure 9-14 Fluorescent probes. The maximum excitation and emission 
wavelengths of several commonly used fluorescent probes are shown in 
relation to the corresponding colors of the spectrum. The photon emitted 
by a fluorescent molecule is necessarily of lower energy (longer 
wavelength) than the photon absorbed and this accounts for the 
difference between the excitation and emission peaks. CFP, GFP, YFP and 
RFP are cyan, green, yellow and red fluorescent proteins respectively. These 
are not dyes, and are discussed in detail later in the chapter. DAPI is widely 
used as a general fluorescent DNA probe, which absorbs UV light and 
fluoresces bright blue. FITC is an abbreviation for fluorescence 
isothiocyanate, a widely used derivative of fluorescein, which fluoresces 
bright green. The other probes are all commonly used to fluorescently label 
antibodies and other proteins. 



rhodamine, which emits a deep red fluorescence when excited with green-yel- 
low light (Figure 9-14). By coupling one antibody to fluorescein and another to 
rhodamine, the distributions of different molecules can be compared in the 
same cell; the two molecules are visualized separately in the microscope by 
switching back and forth between two sets of filters, each specific for one dye. As 
shown in Figure 9-15, three fluorescent dyes can be used in the same way to dis- 
tinguish between three types of molecules in the same cell. Many newer fluo- 
rescent dyes, such as Cy3, Cy5, and the Alexa dyes, have been specifically devel- 
oped for fluorescence microscopy (see Figure 9-14). These organic fluo- 
rochromes have some disadvantages. They are excited only by light of precise, 
but different, wavelengths, and additionally they fade fairly rapidly when con- 
tinuously illuminated. More stable inorganic fluorochromes have recently been 
developed, however. Tiny crystals of semiconductor material, called nanoparti- 
cles, or quantum dots, can all be excited to fluoresce by a broad spectrum of blue 
light. Their emitted light has a color that depends on the exact size of the 
nanocrystal, between 2 and 10 nm in diameter, and additionally the fluores- 
cence fades only slowly with time (Figure 9-16). These nanoparticles, when cou- 
pled to other probes such as antibodies, are therefore ideal for tracking 
molecules over time. If introduced into a living cell, in an embryo for example, 
the progeny of that cell can be followed many days later by their fluorescence, 
allowing cell lineages to be tracked. 

Fluorescence microscopy methods, discussed later in the chapter, can be 
used to monitor changes in the concentration and location of specific molecules 
inside living cells (see p. 592). 
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Figure 9-15 Multiple-fluorescent-probe 
microscopy. In this composite 
micrograph of a cell in mitosis, three 
different fluorescent probes have been 
used to stain three different cellular 
components. <GTCT> The spindle 
microtubules are revealed with a green 
fluorescent antibody, centromeres with a 
reef fluorescent antibody and the DNA of 
the condensed chromosomes with the 
blue fluorescent dye DAPI. {Courtesy of 
Kevin F.Sullivan.) 
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Antibodies Can Be Used to Detect Specific Molecules 

Antibodies are proteins produced by the vertebrate immune system as a defense 
against infection (discussed in Chapter 24). They are unique among proteins 
because they are made in billions of different forms, each with a different bind- 
ing site that recognizes a specific target molecule (or antigen). The precise anti- 
gen specificity of antibodies makes them powerful tools for the cell biologist. 
When labeled with fluorescent dyes, antibodies are invaluable for locating spe- 
cific molecules in cells by fluorescence microscopy (Figure 9-17); labeled with 
electron-dense particles such as colloidal gold spheres, they are used for similar 
purposes in the electron microscope (discussed below). 

When we use antibodies as probes to detect and assay specific molecules in 
cells we frequently amplify the fluorescent signal they produce by chemical 
methods. For example, although a marker molecule such as a fluorescent dye 
can be linked directly to an antibody used for specific recognition — the primary 
antibody—a stronger signal is achieved by using an unlabeled primary antibody 
and then detecting it with a group of labeled secondary antibodies that bind to it 
(Figure 9-18). This process is called indirect immunocytochemistry 

The most sensitive amplification methods use an enzyme as a marker 
molecule attached to the secondary antibody. The enzyme alkaline phos- 
phatase, for example, in the presence of appropriate chemicals, produces inor- 
ganic phosphate that in turn leads to the local formation of a colored precipitate. 
This reveals the location of the secondary antibody and hence the location of the 
antibody-antigen complex. Since each enzyme molecule acts catalytically to 
generate many thousands of molecules of product, even tiny amounts of antigen 
can be detected. An enzyme-linked immunosorbent assay (ELISA) based on this 
principle is frequently used in medicine as a sensitive test— for pregnancy or for 
various types of infections, for example. Although the enzyme amplification 
makes enzyme-linked methods very sensitive, diffusion of the colored precipi- 
tate away from the enzyme limits the spatial resolution of this method for 



Figure 9-16 Fluorescent nanoparticles 
or quantum dots. Quantum dots are tiny 
nanoparticles of cadmium selenide, a 
semiconductor, with a coating to make 
them water-soluble (A). They can be 
coupled to protein probes such as 
antibodies or streptavidin and, when 
introduced into a cell, will bind to a 
protein of interest. Different-sized 
quantum dots emit light of different 
colors — the larger the dot the longer the 
wavelength — but they are all excited by 
the same blue light. (B) Quantum dots 
can keep shining for weeks, unlike most 
fluorescent organic dyes. In this cell, a 
nuclear protein is labeled (green) with an 
organic fluorescent dye (Alexa 488), while 
microtubules are stained (red) with 
quantum dots bound to streptavidin. On 
continuous exposure to blue light the 
fluorescent dye fades quickly while the 
quantum dots continue to fluoresce. 
(B, from X. Wu et al., Nat. Biotechnol. 
21:41-46, 2003. With permission from 
Macmillan Publishers Ltd.) 
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Figure 9-1 7 Immunofluorescence. 

(A) A transmission electron micrograph of 
the periphery of a cultured epithelial cell 
showing the distribution of microtubules 
and other filaments. (B) The same area 
stained with fluorescent antibodies 
against tubulin, the protein that 
assembles to form microtubules, using 
the technique of indirect 
immunocytochemistry (see Figure 9-18). 
Red arrows indicate individual 
microtubules that are readily 
recognizable in both images. Note that, 
because of diffraction effects, the 
microtubules in the light microscope 
appear 0.2 \im wide rather than their true 
width of 0.025 |im. (From M. Osborn, 
R. Webster and K. Weber, 1 Cell Biol. 
77:R27-R34, 1978. With permission from 
The Rockefeller University Press.) 
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microscopy, and fluorescent labels are usually used for the most precise optical 
localization. 

Antibodies are made most simply by injecting a sample of the antigen sev- 
eral times into an animal such as a rabbit or a goat and then collecting the anti- 
body-rich serum. This antiserum contains a heterogeneous mixture of antibod- 
ies, each produced by a different antibody-secreting cell (a B lymphocyte). The 
different antibodies recognize various parts of the antigen molecule (called an 
antigenic determinant, or epitope), as well as impurities in the antigen prepara- 
tion. Removing the unwanted antibody molecules that bind to other molecules 
sharpens the specificity of an antiserum for a particular antigen; an antiserum 
produced against protein X, for example, when passed through an affinity col- 
umn of antigens X, will bind to these antigens, allowing other antibodies to pass 
through the column. Purified anti-X antibody can subsequently be eluted from 
the column. Even so, the heterogeneity of such antisera sometimes limits their 
usefulness. The use of monoclonal antibodies largely overcomes this problem 
(see Figure 8-8). However, monoclonal antibodies can also have problems. Since 
they are single-antibody protein species, they show almost perfect specificity for 
a single site or epitope on the antigen, but the accessibility of the epitope, and 
thus the usefulness of the antibody, may depend on the specimen preparation. 
For example, some monoclonal antibodies will react only with unfixed antigens, 
others only after the use of particular fixatives, and still others only with proteins 
denatured on SDS polyacrylamide gels, and not with the proteins in their native 
conformation. 



Figure 9-18 Indirect immunocyto- 
chemistry. This detection method is very 
sensitive because many molecules of the 
secondary antibody recognize each 
primary antibody. The secondary antibody 
is covalently coupled to a marker 
molecule that makes it readily detectable. 
Commonly used marker molecules 
include fluorescent dyes (for fluorescence 
microscopy), the enzyme horseradish 
peroxidase (for either conventional light 
microscopy or electron microscopy), 
colloidal gold spheres (for electron 
microscopy), and the enzymes alkaline 
phosphatase or peroxidase (for 
biochemical detection). 



Imaging of Complex Three-Dimensional Objects Is Possible with 
the Optical Microscope 

For ordinary light microscopy, as we have seen, a tissue has to be sliced into thin 
sections to be examined; the thinner the section, the crisper the image. The pro- 
cess of sectioning loses information about the third dimension. How, then, can 
we get a picture of the three-dimensional architecture of a cell or tissue, and how 
can we view the microscopic structure of a specimen that, for one reason or 
another, cannot first be sliced into sections? Although an optical microscope is 
focused on a particular focal plane within complex three-dimensional speci- 
mens, all the other parts of the specimen, above and below the plane of focus, 
are also illuminated and the light originating from these regions contributes to 
the image as "out- of- focus" blur. This can make it very hard to interpret the 
image in detail and can lead to fine image structure being obscured by the out- 
of-focus light. 

Two distinct but complementary approaches solve this problem: one is 
computational, the other is optical. These three-dimensional microscopic imag- 
ing methods make it possible to focus on a chosen plane in a thick specimen 
while rejecting the light that comes from out-of-focus regions above and below 
that plane. Thus one sees a crisp, thin optical section. From a series of such opti- 
cal sections taken at different depths and stored in a computer, it is easy to 
reconstruct a three-dimensional image. The methods do for the microscopist 
what the CT scanner does (by different means) for the radiologist investigating a 
human body: both machines give detailed sectional views of the interior of an 
intact structure. 
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The computational approach is often called image deconvolution. To under- 
stand how it works, remember that the wavelike nature of light means that the 
microscope lens system produces a small blurred disc as the image of a point 
light source (see Figure 9-5), with increased blurring if the point source lies 
above or below the focal plane. This blurred image of a point source is called the 
point spread function. An image of a complex object can then be thought of as 
being built up by replacing each point of the specimen by a corresponding 
blurred disc, resulting in an image that is blurred overall. For deconvolution, we 
first obtain a series of (blurred) images, usually with a cooled CCD camera, 
focusing the microscope in turn on a series of focal planes — in effect, a (blurred) 
three-dimensional image. The stack of digital images is then processed by com- 
puter to remove as much of the blur as possible. Essentially the computer pro- 
gram uses the microscope's point spread function to determine what the effect 
of the blurring would have been on the image, and then applies an equivalent 
"deblurring" (deconvolution), turning the blurred three-dimensional image into 
a series of clean optical sections. The computation required is quite complex, 
and used to be a serious limitation. However, with faster and cheaper comput- 
ers, the image deconvolution method is gaining in power and popularity. Figure 
9-19 shows an example. 



Figure 9-19 Image deconvolution. 

(A) A light micrograph of the large 
polytene chromosomes from Drosopht 
stained with a fluorescent DNA-bindin 
dye. (B) The same field of view after 
image deconvolution clearly reveals tr 
banding pattern on the chromosomes 
Each band is about 0.25 |im thick, 
approaching the resolution limit of the 
light microscope. (Courtesy of the Joh 
Sedat Laboratory.) 



The Confocal Microscope Produces Optical Sections by Excluding 
Out-of-Focus Light 

The confocal microscope achieves a result similar to that of deconvolution, but 
does so by manipulating the light before it is measured; thus it is an analog tech- 
nique rather than a digital one. The optical details of the confocal microscope are 
complex, but the basic idea is simple, as illustrated in Figure 9-20, and the results 
are far superior to those obtained by conventional light microscopy (Figure 
9-21). 

The microscope is generally used with fluorescence optics (see Figure 9-13), 
but instead of illuminating the whole specimen at once, in the usual way, the 
optical system at any instant focuses a spot of light onto a single point at a spe- 
cific depth in the specimen. It requires a very bright source of pinpoint illumi- 
nation that is usually supplied by a laser whose light has been passed through a 
pinhole. The fluorescence emitted from the illuminated material is collected 
and brought to an image at a suitable light detector. A pinhole aperture is placed 
in front of the detector, at a position that is confocal with the illuminating pin- 
hole — that is, precisely where the rays emitted from the illuminated point in the 
specimen come to a focus. Thus, the light from this point in the specimen con- 
verges on this aperture and enters the detector. 

By contrast, the light from regions out of the plane of focus of the spotlight 
is also out of focus at the pinhole aperture and is therefore largely excluded from 
the detector (see Figure 9-20). To build up a two-dimensional image, data from 
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Figure 9-20 The confocal fluorescence 
microscope. This simplified diagram 
shows that the basic arrangement of 
optical components is similar to that of 
the standard fluorescence microscope 
shown in Figure 9-1 3, except that a laser 
is used to illuminate a small pinhole 
whose image is focused at a single point 
in the specimen (A). Emitted fluorescence 
from this focal point in the specimen is 
focused at a second (confocal) pinhole 
(B). Emitted light from elsewhere in the 
specimen is not focused at the pinhole 
and therefore does not contribute to the 
final image (C). By scanning the beam of 
light across the specimen, a very sharp 
two-dimensional image of the exact 
plane of focus is built up that is not 
significantly degraded by light from other 
regions of the specimen. 



each point in the plane of focus are collected sequentially by scanning across the 
field in a raster pattern (as on a television screen) and are displayed on a video 
screen. Although not shown in Figure 9-20, the scanning is usually done by 
deflecting the beam with an oscillating mirror placed between the dichroic mir- 
ror and the objective lens in such a way that the illuminating spotlight and the 
confocal pinhole at the detector remain strictly in register. 

The confocal microscope has been used to resolve the structure of numer- 
ous complex three-dimensional objects (Figure 9-22), including the networks of 
cytoskeletal fibers in the cytoplasm and the arrangements of chromosomes and 
genes in the nucleus. 

The relative merits of deconvolution methods and confocal microscopy for 
three-dimensional optical microscopy are still the subject of debate. Confocal 
microscopes are generally easier to use than deconvolution systems and the 
final optical sections can be seen quickly. In contrast, the cooled CCD (charge- 
coupled device) cameras used for deconvolution systems are extremely efficient 
at collecting small amounts of light, and they can be used to make detailed 
three-dimensional images from specimens that are too weakly stained or too 
easily damaged by the bright light used for confocal microscopy. 

Both methods, however, have another drawback; neither is good at coping 
with thick specimens. Deconvolution methods quickly become ineffective any 
deeper than about 40 |im into a specimen, while confocal microscopes can only 
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Figure 9-21 Conventional and confocal 
fluorescence microscopy compared. 

These two micrographs are of the same 
intact gastrula-stage Drosophila embryo 
that has been stained with a fluorescent 
probe for actin filaments. (A) The 
conventional, unprocessed image is 
blurred by the presence of fluorescent 
structures above and below the plane of 
focus. (B) In the confocal image, this out- 
of-focus information is removed, 
resulting in a crisp optical section of the 
cells in the embryo. (Courtesy of Richard 
Warn and Peter Shaw.) 
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obtain images up to a depth of about 150 |im. Special confocal microscopes can 
now take advantage of the way in which fluorescent molecules are excited, to 
probe even deeper into a specimen. Fluorescent molecules are usually excited 
by a single high-energy photon, of shorter wavelength than the emitted light, 
but they can in addition be excited by the absorption of two (or more) photons 
of lower energy, as long as they both arrive within a femtosecond or so of each 
other. The use of this longer-wavelength excitation has some important advan- 
tages. In addition to reducing background noise, red or near infrared light can 
penetrate deeper within a specimen. Multiphoton confocal microscopes, con- 
structed to take advantage of this "two-photon" effect, can typically obtain 
sharp images even at a depth of 0.5 mm within a specimen. This is particularly 
valuable for studies of living tissues, notably in imaging the dynamic activity of 
synapses and neurons just below the surface of living brains (Figure 9-23). 

Fluorescent Proteins Can Be Used to Tag Individual Proteins in 
Living Cells and Organisms 

Even the most stable cellular structures must be assembled, disassembled, and 
reorganized during the cell's life cycle. Other structures, often enormous on the 
molecular scale, rapidly change, move, and reorganize themselves as the cell 
conducts its internal affairs and responds to its environment. Complex, highly 
organized pieces of molecular machinery move components around the cell, 
controlling traffic into and out of the nucleus, from one organelle to another, 
and into and out of the cell itself. 

Various techniques have been developed to make specific components of 
living cells visible in the microscope. Most of these methods use fluorescent pro- 
teins, and they require a trade-off between structural preservation and efficient 
labeling. All of the fluorescent molecules discussed so far are made outside the 
cell and then artificially introduced into it. Now new opportunities have been 
opened up by the discovery of genes coding for protein molecules that are them- 
selves inherently fluorescent. Genetic engineering then enables the creation of 
lines of cells or organisms that make their own visible tags and labels, without 
the introduction of foreign molecules. These cellular exhibitionists display their 
inner workings in glowing fluorescent color. 

Foremost among the fluorescent proteins used for these purposes by cell 
biologists is the green fluorescent protein (GFP), isolated from the jellyfish 
Aequoria victoria. This protein is encoded in the normal way by a single gene 
that can be cloned and introduced into cells of other species. The freshly trans- 
lated protein is not fluorescent, but within an hour or so (less for some alleles of 
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Figure 9-22 Three-dimensional 
reconstruction from confocal 
microscope images. Pollen grains, in this 
case from a passion flower, have a 
complex sculptured cell wall that 
contains fluorescent compounds. Images 
obtained at different depths through the 
grain, using a confocal microscope, can 
be recombined to give a three- 
dimensional view of the whole grain, 
shown on the right. Three selected 
individual optical sections from the full 
set of 30, each of which shows little 
contribution from its neighbors, are 
shown on the left. (Courtesy of 
Brad Amos.) 




Figure 9-23 Multi-photon imaging. 
Infrared laser light causes less damage to 
living cells and can also penetrate further, 
allowing microscopists to peer deeper 
into living tissues. The two-photon effect, 
in which a fluorochrome can be excited 
by two coincident infrared photons 
instead of a single high-energy photon, 
allows us to see nearly 0.5 mm inside the 
cortex of a live mouse brain. A dye, 
whose fluorescence changes with the 
calcium concentration, reveals active 
synapses (yellow) on the dendritic spines 
(red) that change as a function of time. 
(Courtesy of Karel Svoboda.) 
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Figure 9-24 Green fluorescent protein (GFP). The structure of GFP, shown 
here schematically, highlights the eleven (3 strands that form the staves of a 
barrel. Buried within the barrel is the active chromophore (dark green) that is 
formed post-translationally from the protruding side chains of three amino 
acid residues. (Adapted from M. Ormo et al., Science 273:1392-1395, 1996. 
With permission from AAAS.) 

the gene, more for others) it undergoes a self-catalyzed post-translational mod- 
ification to generate an efficient and bright fluorescent center, shielded within 
the interior of a barrel-like protein (Figure 9-24). Extensive site-directed muta- 
genesis performed on the original gene sequence has resulted in useful fluores- 
cence in organisms ranging from animals and plants to fungi and microbes. The 
fluorescence efficiency has also been improved, and variants have been gener- 
ated with altered absorption and emission spectra in the blue-green-yellow 
range. Recently a family of related fluorescent proteins discovered in corals, has 
extended the range into the red region of the spectrum (see Figure 9-14). 

One of the simplest uses of GFP is as a reporter molecule, a fluorescent 
probe to monitor gene expression. A transgenic organism can be made with the 
GFP-coding sequence placed under the transcriptional control of the promoter 
belonging to a gene of interest, giving a directly visible readout of the gene's 
expression pattern in the living organism (Figure 9-25). In another application, 
a peptide location signal can be added to the GFP to direct it to a particular cel- 
lular compartment, such as the endoplasmic reticulum or a mitochondrion, 
lighting up these organelles so they can be observed in the living state (see Fig- 
ure 12-35B). 

The GFP DNA-coding sequence can also be inserted at the beginning or end 
of the gene for another protein, yielding a chimeric product consisting of that 
protein with a GFP domain attached. In many cases, this GFP-fusion protein 
behaves in the same way as the original protein, directly revealing its location 
and activities by means of its genetically encoded contrast (Figure 9-26). 
<TAAT> It is often possible to prove that the GFP-fusion protein is functionally 
equivalent to the untagged protein, for example by using it to rescue a mutant 
lacking that protein. GFP tagging is the clearest and most unequivocal way of 
showing the distribution and dynamics of a protein in a living organism (Figure 
9-27). <TTCT> 




Protein Dynamics Can Be Followed in Living Cells 

Fluorescent proteins are now exploited, not just to see where in a cell a particu- 
lar protein is located, but also to uncover its kinetic properties and to find out 
whether it might interact with other proteins. We now describe three techniques 
in which GFP and its relatives are used in this way. 

The first is the monitoring of interactions between one protein and another 
by fluorescence resonance energy transfer (FRET). In this technique, whose 
principles have been described earlier (see Figure 8-26), the two molecules of 
interest are each labeled with a different fTuorochrome, chosen so that the emis- 
sion spectrum of one fluorochrome overlaps with the absorption spectrum of 
the other. If the two proteins bind so as to bring their fluorochromes into very 
close proximity (closer than about 5 nm), one fluorochrome transfers the energy 
of the absorbed light directly to the other. Thus, when the complex is illuminated 
at the excitation wavelength of the first fluorochrome, fluorescent light is pro- 



Figure 9-25 Green fluorescent protein (GFP) as a reporter. For this 
experiment, carried out in the fruit fly, the GFP gene was joined (using 
recombinant DNA techniques) to a fly promoter that is active only in a 
specialized set of neurons. This image of a live fly embryo was captured by a 
fluorescence microscope and shows approximately 20 neurons, each with 
long projections (axons and dendrites) that communicate with other 
(nonfluorescent) cells. These neurons are located just under the surface of the 
animal and allow it to sense its immediate environment. (From W.B. Grueber et 
al., Curr. Biol. 13:618-626, 2003. With permission from Elsevier.) 
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Figure 9-26 GFP-tagged proteins. 

(A) The upper surface of the leaves of 
Arabidopsis plants are covered with huge 
branched single-cell hairs that rise up 
from the surface of the epidermis. These 
hairs, or trichomes, can be imaged in the 
scanning electron microscope. (B) If an 
Arabidopsis plant is transformed with a 
DNA sequence coding for talin {an actin- 
binding protein), fused to a DNA 
sequence coding for GFR the fluorescent 
talin protein produced binds to actin 
filaments in all the living cells of the 
transgenic plant. Confocal microscopy 
can reveal the dynamics of the entire 
actin cytoskeleton of thetrichome 
(green). The red fluorescence arises from 
chlorophyll in cells within the leaf below 
the epidermis. (A, courtesy of Paul 
Linstead; B, courtesy of Jaideep Mathur.) 
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duced at the emission wavelength of the second. This method can be used with 
two different spectral variants of GFP as fluorochromes to monitor processes 
such as the interaction of signaling molecules with their receptors, or proteins in 
macromolecular complexes (Figure 9-28). 

The complexity and rapidity of many intracellular processes, such as the 
actions of signaling molecules or the movements of cytoskeletal proteins, make 
them difficult to study at a single-cell level. Ideally, we would like to be able to 
introduce any molecule of interest into a living cell at a precise time and location 
and follow its subsequent behavior, as well as the response of the cell to that 
molecule. Microinjection is limited by the difficulty of controlling the place and 
time of delivery. A more powerful approach involves synthesizing an inactive 
form of the fluorescent molecule of interest, introducing it into the cell, and then 
activating it suddenly at a chosen site in the cell by focusing a spot of light on it. 
This process is referred to as photoactivation. Inactive photosensitive precur- 
sors of this type, often called caged molecules, have been made for many fluo- 
rescent molecules. A microscope can be used to focus a strong pulse of light 
from a laser on any tiny region of the cell, so that the experimenter can control 
exactly where and when the fluorescent molecule is photoactivated. 

One class of caged fluorescent proteins is made by attaching a photoactivat- 
able fluorescent tag to a purified protein. It is important that the modified pro- 
tein remain biologically active: labeling with a caged fluorescent dye adds a 
bulky group to the surface of a protein, which can easily change the protein's 
properties. A satisfactory labeling protocol is usually found by trial and error. 
Once a biologically active labeled protein has been produced, it needs to be 
introduced into the living cell (see Figure 9-34), where its behavior can be fol- 
lowed. Tubulin, labeled with caged fluorescein for example, when injected into 
a dividing cell, can be incorporated into microtubules of the mitotic spindle. 
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Figure 9-27 Dynamics of GFP tagging. 

This sequence of micrographs shows a 
set of three-dimensional images of a 
living nucleus taken over the course of an 
hour. Tobacco cells have been stably 
transformed with GFP fused to a 
spliceosomal protein that is concentrated 
in small nuclear bodies called Cajal 
bodies (see Figure 6-48). The fluorescent 
Cajal bodies, easily visible in a living cell 
with confocal microscopy, are dynamic 
structures that move around within the 
nucleus. (Courtesy of Kurt Boudonck, 
Liam Dolan, and Peter Shaw.) 
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When a small region of the spindle is illuminated with a laser, the labeled tubu- 
lin becomes fluorescent, so that its movement along the spindle microtubules 
can be readily followed (Figure 9-29). 

A more recent development in photoactivation is the discovery that the genes 
encoding GFP and related fluorescent proteins can be mutated to produce pro- 
tein variants, usually with a single amino acid change, that fluoresce only weakly 
under normal excitation conditions, but can be induced to fluoresce strongly by 
activating them with a strong pulse of light at a different wavelength. In principle 
the microscopist can then follow the local in vivo behavior of any protein that can 
be expressed as a fusion with one of these GFP variants. These genetically 
encoded, photoactivateable fluorescent proteins thus avoid the need to intro- 
duce the probe into the cell, and allow the lifetime and behaviour of any protein 
to be studied independently of other newly synthesized proteins (Figure 9-30). 

A third way to exploit GFP fused to a protein of interest is to use a strong 
focussed beam of light from a laser to extinguish the GFP fluorescence in a spec- 
ified region of the cell. By analyzing the way in which the remaining fluorescent 
protein molecules move into the bleached area as a function of time, we can 
obtain information about the protein's kinetic parameters. This technique, usu- 
ally carried out with a confocal microscope, is known as fluorescence recovery 
after photobleaching (FRAP) and, like photoactivation, can deliver valuable 
quantitative data about the protein of interest, such as diffusion coefficients 
<ATGT>, active transport rates, or binding and dissociation rates from other pro- 
teins (Figure 9-31). 



Figure 9-29 Determining microtubule flux in the mitotic spindle with 
caged fluorescein linked to tubulin. (A) A metaphase spindle formed in 
Wfrofrom an extract of Xenopus eggs has incorporated three fluorescent 
markers: rhodamine-labeled tubulin (red) to mark all the microtubules, a 
blue DNA-binding dye that labels the chromosomes, and caged- 
fluorescein-labeled tubulin, which is also incorporated into all the 
microtubules but is invisible because it is nonfluorescent until activated by 
ultraviolet light. (B) A beam of UV light uncages the caged-fluorescein- 
labeled tubulin locally, mainly just to the left side of the metaphase plate. 
Over the next few minutes (after 1 .5 minutes in C, after 2.5 minutes in D), 
the uncaged-fluorescein-tubulin signal moves toward the left spindle pole, 
indicating that tubulin is continuously moving poleward even though the 
spindle (visualized by the red rhodamine-labeled tubulin fluorescence) 
remains largely unchanged. (From K.E. Sawin andTJ. Mitchison, J. Cell Biol. 
1 1 2:941 -954, 1 991 . With permission from The Rockefeller University Press.) 



Figure 9-28 Fluorescence resonance 
energy transfer (FRET) imaging. This 
experiment shows that a protein called 
Slal p can interact tightly with another 
protein, called Abpl p, which is involved 
in cortical actin attachment at the surface 
of a budding yeast cell. Slal pis 
expressed in the yeast cell (A) as a fusion 
protein with a yellow variant of GFP 
(YFP), while Abp1 p is expressed as a 
fusion protein (B) with a cyan variant of 
GFP (CFP). The FRET signal (see also 
Figure 8-26), displayed here in red (C), is 
obtained by exciting the CFP but 
recording only the fluorescence emitted 
from the YFP, which will occur only when 
the two molecules are tightly associated 
(within 0.5 nm). The spots at the cortex 
(D), seen when (A), (B), and (C) are 
superimposed, are of three sorts, those 
where Slal p is found alone (arrows in A), 
those where Abp1 p is found alone 
(arrowhead in B), and those where they 
are closely associated and generate a 
FRET signal, shown in the false-colored 
and corrected image (E). Since Slal p was 
already known to form part of the 
endocytic machinery, this experiment 
physically connects that process with the 
process of actin attachment to the cell 
cortex. (From D.T.Warren et al., J. CeliSci. 
1 1 5:1 703-1 71 5, 2002. With permission 
from The Company of Biologists.) 
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Light-Emitting Indicators Can Measure Rapidly Changing 
Intracellular Ion Concentrations 

One way to study the chemistry of a single living cell is to insert the tip of a fine, 
glass, ion-sensitive microelectrode directly into the cell interior through the 
plasma membrane. This technique is used to measure the intracellular concen- 
trations of common inorganic ions, such as H + , Na + , K\ CI", and Ca 2+ . However, 
ion-sensitive microelectrodes reveal the ion concentration only at one point in 
a cell, and for an ion present at a very low concentration, such as Ca 2+ , their 
responses are slow and somewhat erratic. Thus, these microelectrodes are not 
ideally suited to record the rapid and transient changes in the concentration of 
cytosolic Ca 2+ that have an important role in allowing cells to respond to extra- 
cellular signals. Such changes can be analyzed with ion-sensitive indicators, 
whose light emission reflects the local concentration of the ion. Some of these 
indicators are luminescent (emitting light spontaneously), while others are flu- 
orescent (emitting light on exposure to light). 

Aequorin is a luminescent protein isolated from a marine jellyfish; it emits 
light in the presence of Ca 2+ and responds to changes in Ca 2+ concentration in 
the range of 0.5-10 |iM. If micro injected into an egg, for example, aequorin emits 
a flash of light in response to the sudden localized release of free Ca 2+ into the 
cytoplasm that occurs when the egg is fertilized (Figure 9-32). Aequorin has also 
been expressed transgenically in plants and other organisms to provide a 
method of monitoring Ca 2+ in all their cells without the need for microinjection, 
which can be a difficult procedure. 
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Figure 9-30 Photoactivation. 

Photoactivation is the light-induced 
activation of an inert molecule to an 
active state. In this experiment a 
photoactivatable variant of GFP is 
expressed in a cultured animal cell. 
Before activation (time 0), little or no GFP 
fluorescence is detected in the selected 
region (red circle) when excited by blue 
light at 488 nm. After activation of the 
GFP however, using a UV laser pulse at 
41 3 nm, it rapidly fluoresces brightly in 
the selected region (green). The 
movement of GFP, as it diffuses out of 
this region, can be measured. Since only 
the photoactivated proteins are 
fluorescent within the cell, the trafficking, 
turnover and degradative pathways of 
proteins can be monitored. (B, from 
J. Lippincott-Schwartz and G.H. Patterson, 
Science 300:87-91 , 2003. With permission 
from AAAS.) 
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Figure 9-31 Fluorescence recovery after 
photobleaching (FRAP). A strong focused 
pulse of laser light will extinguish, or 
bleach, the fluorescence of GFP. By 
selectively photobleaching a set of 
fluorescently tagged protein molecules 
within a defined region of a cell, the 
microscopist can monitor recovery over 
time, as the remaining fluorescent 
molecules move into the bleached region. 
The experiment shown in (A) uses monkey 
cells in culture that express 
galactosyltransferase, an enzyme that 
constantly recycles between the Golgi 
apparatus and the endoplasmic reticulum. 
The Golgi apparatus in one of the two cells 
is selectively photobleached, while the 
production of new fluorescent protein is 
blocked by treating the cells with 
cycloheximide.The recovery, resulting from 
fluorescent enzyme molecules moving 
from the ER to the Golgi, can then be 
followed over a period of time. 
(B) Schematic diagram of the experiment 
shown in (A). (A, from J. Lippincott- 
Schwartz et al., Histochem. Cell Biol. 
1 16:97-107, 2001 . With permission from 
Springer-Verlag.) 
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Figure 9-32 Aequorin, a luminescent protein. The luminescent protein 
aequorin emits light in the presence of free Ca 2+ . Here, an egg of the 
medaka fish has been injected with aequorin, which has diffused 
throughout the cytosol, and the egg has then been fertilized with a sperm 
and examined with the help of a very sensitive camera. The four 
photographs were taken looking down on the site of sperm entry at 
intervals of 10 seconds and reveal a wave of release of free Ca 2+ into the 
cytosol from internal stores just beneath the plasma membrane. This wave 
sweeps across the egg starting from the site of sperm entry, as indicated in 
the diagrams on the left. (Photographs reproduced from J.C. Gilkey, 
L.F. Jaffe, E.B. Ridgway and G.T. Reynolds, J. Ceil Biol. 76:448-466, 1 978. With 
permission from The Rockefeller University Press.) 



Bioluminescent molecules like aequorin emit tiny amounts of light — at best, 
a few photons per indicator molecule — that are difficult to measure. Fluorescent 
indicators produce orders of magnitude more photons per molecule; they are 
therefore easier to measure and can give better spatial resolution. Fluorescent 
Ca 2+ indicators have been synthesized that bind Ca 2+ tightly and are excited by 
or emit light at slightly different wavelengths when they are free of Ca 2+ than 
when they are in their Ca 2+ -bound form. By measuring the ratio of fluorescence 
intensity at two excitation or emission wavelengths, we can determine the con- 
centration ratio of the Ca 2+ -bound indicator to the Ca 2+ -free indicator, thereby 
providing an accurate measurement of the free Ca 2+ concentration. <CGTC> 
Indicators of this type are widely used for second-by-second monitoring of 
changes in intracellular Ca 2+ concentrations in the different parts of a cell 
viewed in a fluorescence microscope (Figure 9-33). <AGGA> 

Similar fluorescent indicators measure other ions; some detect H + , for 
example, and hence measure intracellular pH. Some of these indicators can 
enter cells by diffusion and thus need not be microinjected; this makes it pos- 
sible to monitor large numbers of individual cells simultaneously in a fluores- 
cence microscope. New types of indicators, used in conjunction with modern 
image-processing methods, are leading to similarly rapid and precise methods 
for analyzing changes in the concentrations of many types of small molecules 
in cells. 
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Several Srategies Are Available by Which Membrane-lmpermeant 
Substances Can Be Introduced into Cells 

It is often useful to introduce membrane-impermeant molecules into a living 
cell, whether they are antibodies that recognize intracellular proteins, normal 
cell proteins tagged with a fluorescent label, or molecules that influence cell 
behavior. One approach is to microinject the molecules into the cell through a 
glass micropipette. 

When microinjected into a cell, antibodies can block the function of the 
molecule that they recognize. Anti-myosin-II antibodies injected into a fertilized 
sea urchin egg, for example, prevent the egg cell from dividing in two, even 
though nuclear division occurs normally. This observation demonstrates that 
this myosin has an essential role in the contractile process that divides the cyto- 
plasm during cell division, but that it is not required for nuclear division. 



Figure 9-33 Visualizing intracellular Ca 2+ concentrations by using a 
fluorescent indicator. The branching tree of dendrites of a Purkinje cell in 
the cerebellum receives more than 1 00,000 synapses from other neurons. 
The output from the cell is conveyed along the single axon seen leaving 
the cell body at the bottom of the picture. This image of the intracellular 
Ca 2+ concentration in a single Purkinje cell (from the brain of a guinea pig) 
was taken with a low-light camera and the Ca 2+ -sensitive fluorescent 
indictorfura-2. The concentration offreeCa 2+ is represented by different 
colors, red being the highest and blue the lowest. The highest Ca 2+ levels 
are present in the thousands of dendritic branches. (Courtesy of 
D.W. Tank, J.A. Connor, M. Sugimori and R.R. Llinas.) 
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Microinjection, although widely used, demands that each cell be injected 
individually; therefore, it is possible to study at most only a few hundred cells at 
a time. Other approaches allow large populations of cells to be permeabilized 
simultaneously. Partly disrupting the structure of the cell plasma membrane, for 
example, makes it more permeable; this is usually accomplished by using a pow- 
erful electric shock or a chemical such as a low concentration of detergent. The 
electrical technique has the advantage of creating large pores in the plasma 
membrane without damaging intracellular membranes. Depending on the cell 
type and the size of the electric shock, the pores allow even macromolecules to 
enter (and leave) the cytosol rapidly. This process of electroporation is valuable 
also in molecular genetics, as a way of introducing DNA molecules into cells. 
With a limited treatment, a large fraction of the cells repair their plasma mem- 
brane and survive. 

A third method for introducing large molecules into cells is to cause mem- 
brane-enclosed vesicles that contain these molecules to fuse with the cell's 
plasma membrane thus delivering their cargo. Thus method is used routinely to 
deliver nucleic acids into mammalian cells, either DNA for transfection studies 
or RNA for RNAi experiments (discussed in Chapter 8). In the medical field it is 
also being explored as a method for the targeted delivering of new pharmaceu- 
ticals. 

Finally, DNA and RNA can also be physically introduced into cells by simply 
blasting them in at high velocity, coated onto tiny gold particles. Living cells, 
shot with these nucleic-acid-coated gold particles (typically less than 1 um in 
diameter) can successfully incorporate the introduced RNA (used for transient 
expression studies or RNAi, for example) or DNA (for stable transfection). All 
four of these methods, illustrated in Figure 9-34, are used widely in cell biology. 



Figure 9-34 Methods of introducing a 
membrane-impermeant substance into a 
cell. (A) The substance is injected through a 
micropipette, either by applying pressure 
or, if the substance is electrically charged, 
by applying a voltage that drives the 
substance into the cell as an ionic current 
(a technique called iontophoresis). (B) The 
cell membrane is made transiently 
permeable to the substance by disrupting 
the membrane structure with a brief but 
intense electric shock (2000 V/cm for 
200 jusec, for example).(C) Membrane- 
enclosed vesicles are loaded with the 
desired substance and then induced to 
fuse with the target cells. (D) Gold particles 
coated with DNA are used to introduce a 
novel gene into the nucleus. 



Light Can Be Used to Manipulate Microscopic Objects As Well As 
to Image Them 

Photons carry a small amount of momentum. This means that an object that 
absorbs or deflects a beam of light experiences a small force. With ordinary light 
sources, this radiation pressure is too small to be significant. But it is important 
on a cosmic scale (helping prevent gravitational collapse inside stars), and, more 
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modestly, in the cell biology lab, where an intense focused laser beam can exert 
large enough forces to push small objects around inside a cell. If the laser beam 
is focused on an object having a higher refractive index than its surroundings, 
the beam is refracted, causing very large numbers of photons to change direc- 
tion. The pattern of photon deflection holds the object at the focus of the beam; 
if it begins to drift away from this position, radiation pressure pushes it back by 
acting more strongly on one side than the other. Thus, by steering a focused laser 
beam, usually an infrared laser, which is minimally absorbed by the cellular con- 
stituents, one can create "optical tweezers" to move subcellular objects like 
organelles and chromosomes around. This method, sometimes referred to as 
laser tweezers <CGCG> <CACA>, has been used to measure the forces exerted by 
single actin-myosin molecules, by single microtubule motors, and by RNA poly- 
merase (Figure 9-35). 

Intense focused laser beams that are more strongly absorbed by biological 
material can also be used more straightforwardly as optical knives — to kill indi- 
vidual cells, to cut or burn holes in them, or to detach one intracellular compo- 
nent from another. In these ways, optical devices can provide a basic toolkit for 
cellular microsurgery. 



Figure 9-35 Optical tweezers. A focused 
laser beam can be used to trap 
microscopic particles and move them 
about at will. <CGCG> In this experiment, 
such optical tweezers are used to pick up 
a small silica bead (0.2 nm, arrow), coated 
with few kinesin molecules {0 sec), and 
place it on an isolated ciliary axoneme 
that is built from microtubules (30 sec). 
The bright halo seen here is the reflection 
of the laser at the interface between the 
water and the coverslip. The kinesin on 
the released bead (1 min) couples ATP 
hydrolysis to movement along the 
microtubules of the axoneme, and 
powers the transport of the bead along it 
(3 min). (From S.M. Block et al., Nature 
348:348-352, 1990. With permission from 
Macmillan Publishers Ltd.) 



Single Molecules Can Be Visualized by Using Total Internal 
Reflection Fluorescence Microscopy 

While beads can be used as markers to track protein movements, it is clearly 
preferable to be able to visualize the proteins themselves. In principle this can 
be accomplished by labeling the protein with a fluorescent molecule, either by 
chemically attaching a small fluorescent molecule to isolated protein molecules 
or by expressing fluorescent protein fusion constructs (see p. 593). In ordinary 
microscopes, however, single fluorescent molecules cannot be reliably detected. 
The limitation has nothing to do with the resolution limit, but instead arises 
from the interference of light emitted by out-of-focus molecules that tends to 
blot out the fluorescence from the particular molecule of interest. This problem 
can be solved by the use of a specialized optical technique called total internal 
reflectance fluorescence (TIRF) microscopy. In a TIRF microscope, laser light 
shines onto the coverslip surface at the precise critical angle at which total inter- 
nal reflection occurs (Figure 9-36A). Because of total internal reflection, the 
light does not enter the sample, and the majority of fluorescent molecules are 
not, therefore, illuminated. However, electromagnetic energy does extend, as an 
evanescent field, for a very short distance beyond the surface of the coverslip 
and into the specimen, allowing just those molecules in the layer closest to the 
surface to become excited. When these molecules fluoresce, their emitted light 
is no longer competing with out-of-focus light from the overlying molecules, 
and can now be detected. TIRF has allowed several dramatic experiments, for 
instance imaging of single motor proteins moving along microtubules or single 
actin filaments forming and branching, although at present the technique is 
restricted to a thin layer within only 100-200 nm of the cell surface (Figure 9-36B 
andC). 
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Individual Molecules Can Be Touched and Moved Using Atomic 
Force Microscopy 

While TIRF allows single molecules to be visualized, it is strictly a passive obser- 
vation method. In order to probe molecular function, it is ultimately useful to be 
able to manipulate individual molecules themselves, and atomic force 
microscopy (AFM) provides a method to do just that. In an AFM device, an 
extremely small and sharply pointed tip, of silicon or silicon nitride, is made 
using nanofabrication methods similar to those used in the semiconductor 
industry. The tip of the AFM is attached to a springy cantilever arm mounted on 
a highly precise positioning system that allows it to be moved over very small dis- 
tances. In addition to this precise movement capability, the AFM is able to mea- 
sure the mechanical force felt by its tip as it moves over the surface (Figure 
9-37A). When AFM was first developed, it was intended as an imaging technol- 
ogy to measure molecular-scale features on a surface. When used in this mode, 
the probe is scanned over the surface, moving up and down as necessary to main- 
tain a constant interaction force with the surface, thus revealing any objects such 
as proteins that might be present on the otherwise flat surface (see Figures 10-14 
and 10-32). AFM is not limited to simply imaging surfaces, however, and can also 
be used to pick up and move single molecules, in a molecular-scale version of the 
optical tweezers described above. Using this technology, the mechanical proper- 
ties of individual protein molecules can be measured in detail. For example, AFM 
has been used to unfold a single protein molecule in order to measure the ener- 
getics of domain folding (Figure 9-37B). The full potential to probe proteins 
mechanically, as well as to assemble individual proteins into defined arrange- 
ments using AFM, is only now starting to be explored, but it seems likely that this 
tool will become increasingly important in the future. 



Figure 9-36 TIRF microscopy allows the 
detection of single fluorescent 
molecules. (A) TIRF microscopy uses 
excitatory laser light to illuminate the 
coverslip surface at the critical angle at 
which all the light is reflected by the 
glass-water interface. Some 
electromagnetic energy extends a short 
distance across the interface as an 
evanescent wave that excites just those 
molecules that are very close to the 
surface. (B)TIRF microscopy is used here 
to image individual myosin-GFP 
molecules (green dots) attached to non- 
fluorescent actin filaments (C), which are 
invisible but stuck to the surface of the 
coverslip. (Courtesy of Dmitry Cherny and 
Clive R. Bagshaw.) 



Molecules Can Be Labeled with Radioisotopes 

As we have just seen, in cell biology it is often important to determine the quan- 
tities of specific molecules and to know where they are in the cell and how their 
level or location changes in response to extracellular signals. The molecules of 
interest range from small inorganic ions, such as Ca 2+ or H + , to large macro- 
molecules, such as specific proteins, RNAs, or DNA sequences. We have so far 
described how sensitive fluorescence methods can be used for assaying these 
types of molecules, as well as for following the dynamic behavior of many of 
them in living cells. In ending this section, we describe how radioisotopes are 
used to trace the path of specific molecules through the cell. 
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Figure 9-37 Single protein molecules can be manipulated by 
atomic force microscopy. (A) Schematic diagram of the key 
components of an atomic force microscope (AFM), showing the 
force-sensing tip attached to one end of a single protein molecule in 
the experiment described in (B). (B) Titin is an enormous protein 
molecule that provides muscle with its passive elasticity (see Figure 
16-76). The extensibility of this protein can be tested directly, using a 
short artificially produced protein that contains eight repeated Ig- 
domainsfrom one region of the titin protein. In this experiment the 
tip of the AFM is used to pick up, and progressively stretch, a single 
molecule until it eventually ruptures. As force is applied, each Ig- 
domain suddenly begins to unfold, and the force needed in each 
case (about 200 pN) can be recorded. The region of the 
force-extension curve shown in green records the sequential 
unfolding event for each of the eight protein domains. (Adapted 
from W.A. Linke et al., J. Struct Biol. 1 37:1 94-205, 2002. With 
permission from Elsevier.) 
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Most naturally occurring elements are a mixture of slightly different iso- 
topes. These differ from one another in the mass of their atomic nuclei, but 
because they have the same number of protons and electrons, they have the 
same chemical properties. In radioactive isotopes, or radioisotopes, the nucleus 
is unstable and undergoes random disintegration to produce a different atom. 
In the course of these disintegrations, either energetic subatomic particles, such 
as electrons, or radiations, such as gamma- rays, are given off. By using chemical 
synthesis to incorporate one or more radioactive atoms into a small molecule of 
interest, such as a sugar or an amino acid, the fate of that molecule (and of spe- 
cific atoms in it) can be traced during any biological reaction. 

Although naturally occurring radioisotopes are rare (because of their insta- 
bility), they can be produced in large amounts in nuclear reactors, where stable 
atoms are bombarded with high-energy particles. As a result, radioisotopes of 
many biologically important elements are readily available (Table 9-1). The 
radiation they emit is detected in various ways. Electrons ((3 particles) can be 
detected in a Geiger counter by the ionization they produce in a gas, or they can 
be measured in a scintillation counter by the small flashes of light they induce in 
a scintillation fluid. These methods make it possible to measure accurately the 
quantity of a particular radioisotope present in a biological specimen. Using 
either light or electron microscopy, it is also possible to determine the location 
of a radioisotope in a specimen by autoradiography, as we describe below All of 
these methods of detection are extremely sensitive: in favorable circumstances, 
nearly every disintegration — and therefore every radioactive atom that decays — 
can be detected. 



Table 9-1 Some 


Radioisotopes in 


Common Use in 


Biological Research 


ISOTOPE 


HALF-LIFE 


32p 


14 days 


131, 


8.1 days 


35 S 


87 days 


14 C 


5570 years 


45 Ca 


1 64 days 


3 H 


12.3 years 



The isotopes are arranged in decreasing 
order of the energy of the p radiation 
(electrons) they emit. 131 l also emits 
y radiation. The half-life is the time 
required for 50% of the atoms of an 
isotope to disintegrate. 
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Radioisotopes Are Used to Trace Molecules in Cells and 
Organisms 

One of the earliest uses of radioactivity in biology was to trace the chemical 
pathway of carbon during photosynthesis. Unicellular green algae were main- 
tained in an atmosphere containing radioactively labeled C0 2 ( 14 C0 2 ), and at 
various times after they had been exposed to sunlight, their soluble contents 
were separated by paper chromatography. Small molecules containing 14 C 
atoms derived from C0 2 were detected by a sheet of photographic film placed 
over the dried paper chromatogram. In this way most of the principal compo- 
nents in the photosynthetic pathway from C0 2 to sugar were identified. 

Radioactive molecules can be used to follow the course of almost any process 
in cells. In a typical experiment the cells are supplied with a precursor molecule in 
radioactive form. The radioactive molecules mix with the preexisting unlabeled 
ones; both are treated identically by the cell as they differ only in the weight of 
their atomic nuclei. Changes in the location or chemical form of the radioactive 
molecules can be followed as a function of time. The resolution of such experi- 
ments is often sharpened by using a pulse-chase labeling protocol, in which the 
radioactive material (the pulse) is added for only a very brief period and then 
washed away and replaced by nonradioactive molecules (the chase). Samples are 
taken at regular intervals, and the chemical form or location of the radioactivity is 
identified for each sample (Figure 9-38). Pulse-chase experiments, combined 
with autoradiography, have been important, for example, in elucidating the path- 
way taken by secreted proteins from the ER to the cell exterior. 

Radioisotopic labeling is a uniquely valuable way of distinguishing between 
molecules that are chemically identical but have different histories— for exam- 
pie, those diat differ in their time of synthesis. In this way, for example, it was 
shown diat almost all of the molecules in a living cell are continually being 
degraded and replaced, even when the cell Is not growing and is apparently in a 
steady stale. This "turnover," which sometimes takes place very slowly would be 
almost impossible to detect without radioisotopes. 

Today, nearly all common small molecules are available in radioactive form 
from commercial sources, and virtually any biological molecule, no matter how 
complicated, can be radioactively labeled. Compounds can be made with 
radioactive atoms incorporated at particular positions in their structure, 
enabling the separate fates of different parts of the same molecule to be followed 
during biological reactions (Figure 9-39). 

As mentioned previously, one of the important uses of radioactivity in cell 
biology is to localize a radioactive compound in sections of whole cells or tissues 
by autoradiography, In this procedure, living cells are briefly exposed to a pulse 
of a specific radioactive compound and then incubated for a variable period— 
to allow them time to incorporate the compound— before being fixed and pro- 
cessed for light or electron microscopy Bach preparation is then overlaid with a 
thin FUm of photographic emulsion and left in the dark for several days, during 
which die radioisotope decays. The emulsion is then developed, and the posi- 
tion of the radioactivity in each cell is indicated by the position of the developed 
silver grains (see Figure .3-29). If cells are exposed'to A H- thymidine, a radioactive 
precursor of DNA, for example, it can be shown that DNA is made in the nucleus 



PULSE 



CHASE 




Figure 9-38 The logic of a typical pulse- 
chase experiment using radioisotopes. 

The chambers labeled A, B, C, and D 
represent either different compartments 
in the cell (detected by autoradiography 
or by cell-fractionation experiments) or 
different chemical compounds (detected 
by chromatography or other chemical 
methods). 
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Figure 9-39 Radioisotopically labeled 
molecules. Three commercially available 
radioactive forms of ATP, with the 
radioactive atoms shown in red. The 
nomenclature used to identify the 
position and type of the radioactive 
atoms is also shown. 
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and remains there (Figure 9-40). By contrast, if cells are exposed to 3 H-uridine, 
a radioactive precursor of RNA, it is found that RNA is initially made in the 
nucleus (see Figure 4-62) and then moves rapidly into the cytoplasm. Radiola- 
beled molecules can also be detected by autoradiography after they are sepa- 
rated from other molecules by gel electrophoresis: the positions of both proteins 
(see Figure 8-23) and nucleic acids (see Figure 8-33A) are commonly detected 
on gels in this way. 



Summary 

Many light-microscope techniques are available for observing cells. Cells that have 
been fixed and stained can be studied in a conventional light microscope, whereas 
antibodies coupled to fluorescent dyes can be used to locate specific molecules in 
cells in a fluorescence microscope. Living cells can be seen with phase-contrast, dif- 
ferential-interference-contrast, dark-field, or bright-field microscopes. All forms of 
light microscopy are facilitated by digital image-processing techniques, which 
enhance sensitivity and refine the image. Confocal microscopy and image deconvo- 
lution both provide thin optical sections and can be used to reconstruct three- 
dimensional images. 

Techniques are now available for detecting, measuring, and following almost any 
desired molecule in a living cell Fluorescent indicator dyes can be introduced to mea- 
sure the concentrations of specific ions in individual cells or in different parts of a cell 
Fluorescent proteins are especially versatile probes that can be attached to other pro- 
teins by genetic manipulation. Virtually any protein of interest can be genetically engi- 
neered as a fluorescent- fusion protein, and then imaged in living cells by fluorescence 
microscopy. The dynamic behavior and interactions of many molecules can now be fol- 
lowed in living cells by variations on the use of fluorescent protein tags, in some cases at 
the level of single molecules. Radioactive isotopes of various elements can also be used 
to follow the fate of specific molecules both biochemically and microscopically. 




20 nm 

Figure 9-40 Autoradiography. This tissue 
has been exposed for a short period to 
3 H-thymidine. Cells that are replicating 
their DNA incorporate this radioactively 
labeled DNA precursor into their nuclei 
and can subsequently be visualized by 
autoradiography. The silver grains, seen 
here as black dots in the photographic 
emulsion over the section, reveal which 
cell was making new DNA. The labeled 
nucleus shown here is in the sensory 
epithelium from the inner ear of a 
chicken. (Courtesy of Mark Warchol and 
Jeffrey Corwin.) 
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LOOKING AT CELLS AND MOLECULES IN THE 
ELECTRON MICROSCOPE 

Light microscopy is limited in the fineness of detail that it can reveal. Micro- 
scopes using other types of radiation— in particular electron microscopes — can 
resolve much smaller structures than is possible with visible light. This higher 
resolution comes at a cost: specimen preparation for electron microscopy is 
much more complex and il is harder to be sure that what we see in the image 
corresponds precisely to the actual structure being examined, It is now possible, 
liowever, to use very rapid freezing to preserve structures faithfully for electron 
microscopy Digital image analysis can be used to reconstruct three-dimen- 
sional objects by combining information either from many individual particles 
or from multiple tilted views of a single object. Together these approaches are 
extending the resolution and scope of electron microscopy to the point at which 
we can begin to faithfully image the structures of individual macromolecules 
and the complexes they form. 

The Electron Microscope Resolves the Fine Structure of the Cell 

The relationship between the limit of resolution and the wavelength of the illu- 
minating radiation (see Figure 9-6) holds true for any form of radiation, whether 
it is a beam of light or a beam of electrons. With electrons, however, the limit of 
resolution can be made very small. The wavelength of an electron decreases as 
its velocity increases. In an electron microscope with an accelerating voltage of 
100,000 y the wavelength of an electron is 0.004 nm. In theory the resolution of 
such a microscope should be about 0.002 nm, which is 100,000 times that of the 
light microscope. Because the aberrations of an electron lens are considerably 
harder to correct than those of a glass fens, however, the practical resolving 
power of most modern electron microscopes is P at best, 0.1 nm (1 A) (Figure 
9-41). This is because only the very center of the electron lenses can be used, 
and the effective numerical aperture is tiny. Furthermore, problems of specimen 
preparation, contrast, and radiation damage have generally limited the normal 
effective resolution for biological objects to 1 nm (10 A)' This is nonetheless 
about 200 rimes better than the resolution of the light microscope. Moreover, in 
recent years, the performance of electron microscopes has been improved by 
the development of electron ill u initiation sources called held emission guns. 
These very bright and coherent sources can substantially improve the resolution 
achieved. 

In overall design the transmission electron microscope (TEM) is similar to a 
light microscope, although it is much larger and "upside down" (Figure 9-42). 
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Figure 9-41 The limit of resolution of 
the electron microscope. This 
transmission electron micrograph of a 
thin layer of gold shows the individual 
files of atoms in the crystal as bright 
spots. The distance between adjacent 
files of gold atoms is about 0.2 nm (2 A). 
{Courtesy of Graham Hills.) 



Figure 9-42 The principal features of a 
light microscope and a transmission 
electron microscope. These drawings 
emphasize the similarities of overall 
design. Whereas the lenses in the light 
microscope are made of glass, those in 
the electron microscope are magnetic 
coils. The electron microscope requires 
that the specimen be placed in a vacuum. 
The inset shows a transmission electron 
microscope in use. (Photograph courtesy 
of FEI Company Ltd.) 
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The source of illumination is a filament or cathode that emits electrons at the 
top of a cylindrical column about 2 m high. Since electrons are scattered by col- 
lisions with air molecules, air must first be pumped out of the column to create 
a vacuum. The electrons are then accelerated from the filament by a nearby 
anode and allowed to pass through a tiny hole to form an electron beam that 
travels down the column. Magnetic coils placed at intervals along the column 
focus the electron beam, just as glass lenses focus the light in a light microscope. 
The specimen is put into the vacuum, through an airlock, into the path of the 
electron beam. As in light microscopy the specimen is usually stained — in this 
case, with electron-dense material, as we see in the next section. Some of the 
electrons passing through the specimen are scattered by structures stained with 
the electron-dense material; the remainder are focused to form an image, in a 
manner analogous to the way an image is formed in a light microscope. The 
image can be observed on a phosphorescent screen or recorded, either on a 
photographic plate or with a high-resolution digital camera. Because the scat- 
tered electrons are lost from the beam, the dense regions of the specimen show 
up in the image as areas of reduced electron flux, which look dark. 

Biological Specimens Require Special Preparation for the Electron 
Microscope 

In the early days of its application to biological materials, the electron micro- 
scope revealed many previously unimagined structures in cells. But before these 
discoveries could be made, electron microscopists had to develop new proce- 
dures for embedding, cutting, and staining tissues. 

Since the specimen is exposed to a very high vacuum in the electron micro- 
scope, living tissue is usually killed and preserved by fixation — first with glu- 
taraldehyde, which covalently cross-links protein molecules to their neighbors, 
and then with osmium tetroxide, which binds to and stabilizes lipid bilayers as 
well as proteins (Figure 9-43). Because electrons have very limited penetrating 
power, the fixed tissues normally have to be cut into extremely thin sections 
(50-100 nm thick, about 1/200 the thickness of a single cell) before they are 
viewed. This is achieved by dehydrating the specimen and permeating it with a 
monomeric resin that polymerizes to form a solid block of plastic; the block is 
then cut with a fine glass or diamond knife on a special microtome. These thin 
sections, free of water and other volatile solvents, are placed on a small circular 
metal grid for viewing in the microscope (Figure 9-44). <AACA> 

The steps required to prepare biological material for viewing in the electron 
microscope have challenged electron microscopists from the beginning. How 
can we be sure that the image of the fixed, dehydrated, resin-embedded speci- 
men finally seen bears any relation to the delicate aqueous biological system 
that was originally present in the living cell? The best current approaches to this 
problem depend on rapid freezing. If an aqueous system is cooled fast enough 
to a low enough temperature, the water and other components in it do not have 
time to rearrange themselves or crystallize into ice. Instead, the water is super- 
cooled into a rigid but noncrystalline state— a "glass"— called vitreous ice. This 
state can be achieved by slamming the specimen onto a polished copper block 
cooled by liquid helium, by plunging it into or spraying it with a jet of a coolant 
such as liquid propane, or by cooling it at high pressure. 

Some frozen specimens can be examined directly in the electron micro- 
scope using a special, cooled specimen holder. In other cases the frozen block 
can be fractured to reveal interior surfaces, or the surrounding ice can be sub- 
limed away to expose external surfaces. However, we often want to examine thin 
sections, and stain them to yield adequate contrast in the electron microscope 
image (discussed further below). A compromise is therefore to rapid-freeze the 
tissue, then replace the water, maintained in the vitreous (glassy) state, by 
organic solvents, and finally embed the tissue in plastic resin, cut sections, and 
stain. Although technically still difficult, this approach stabilizes and preserves 
the tissue in a condition very close to its original living state (Figure 9-45). 
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Figure 9-43 Two common chemical 
fixatives used for electron microscopy. 

The two reactive aldehyde groups of 
glutaraldehyde enable it to cross-link 
various types of molecules, forming 
covalent bonds between them. Osmium 
tetroxide forms cross-linked complexes 
with many organic compounds, and in 
the process becomes reduced. This 
reaction is especially useful for fixing cell 
membranes, since the C=C double bonds 
present in many fatty acids react with 
osmium tetroxide. 



copper grid covered with carbon 
and/or plastic film 

specimen in ribbon 
of thin sections 




Figure 9-44 The copper grid that 
supports the thin sections of a 
specimen inaTEM. 
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Figure 9-45 Thin section of a cell. This 
thin section is of a yeast ceil that has 
been very rapidly frozen and the vitreous 
ice replaced by organic solvents and then 
by plastic resin. The nucleus, 
mitochondria, cell wall, Golgi stacks, and 
ribosomes can all be readily seen in a 
state that is presumed to be as life-like as 
possible. {Courtesy of Andrew Staehelin.) 
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Contrast in the electron microscope depends on the atomic number of the 
atoms in the specimen: the higher the atomic number, the more electrons are 
scattered and the greater the contrast. Biological tissues are composed of atoms 
of very low atomic number (mainly carbon, oxygen, nitrogen, and hydrogen). To 
make them visible, they are usually impregnated (before or after sectioning) 
with the salts of heavy metals such as uranium and lead. The degree of impreg- 
nation, or "staining," vviih these salts reveals different cellular constituents with 
various degrees of contrast. Lipids, for example, tend to stain darkly after 
osmium fixation, revealing the location of cell membranes. 



Specific Macromolecules Can Be Localized by Immunogold 
Electron Microscopy 

We have seen how antibodies can be used in conjunction with fluorescence 
microscopy to localize specific macromolecules. An analogous method— 
immunogold electron microscopy— can be used in the electron microscope. 
The usual procedure is to incubate a thin section with a specific: primary ami- 
body and then with a secondary antibody to which a colloidal gold particle has 
been attached. The gold particle is electron -dense and can be seen as a black dot 
in the electron microscope (Figure 9-46). 

Thin sections often fail to convey the three-dimensional arrangement of cel- 
lular components in the TEM and can be very misleading: a linear structure such 
as a microtubule may appear in section as a pointlike object, for example, and a 
section through protruding parts of a single irregularly shaped solid body may give 
the appearance of two or more separate objects. The third dimension can be 
reconstructed from serial sections (Figure 9-47), but this is still a lengthy and 
tedious process. 

Even thin sections, however, have a significant depth compared with the res- 
olution of die electron microscope, so they can also be misleading in an oppo- 
site way. The optical design of the electron microscope— die very small aperture 
used— produces a large depth of field, so the image seen corresponds to a super- 
imposition (a projection) of the structures at different depths. A further compli- 
cation for immunogold labeling is that the antibodies and colloidal gold parti- 
cles do not penetrate into the resin used for embedding; therefore, they deteel 
antigens only at the surface of the section. This means that first, the sensitivity 
of detection is low t since antigen molecules present in the deeper parts of the 
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section are not detected, and second, we may get a false impression of which 
structures contain the antigen and which do not. A solution to this problem is to 
label the specimen before embedding it in plastic, when cells and tissues are still 
fully accessible to labeling reagents. Extremely small gold particles, about 1 nm 
in diameter, work best for this procedure. Such small gold particles are usually 
not directly visible in the final sections, so additional silver or gold is nucleated 
around the tiny 1 nm gold particles in a chemical process very much like photo- 
graphic development. 



Figure 9-46 Localizing proteins in the 
electron microscope. Immunogold 
electron microscopy is used here to 
localize four different protein 
components to particular locations 
within the spindle pole body of yeast. 
At the top is a thin section of a yeast 
mitotic spindle showing the spindle 
microtubules that cross the nucleus, and 
connect at each end to spindle pole 
bodies embedded in the nuclear 
envelope. A diagram of the components 
of a single spindle pole body is shown 
below. Antibodies against four different 
proteins of the spindle pole body are 
used, together with colloidal gold 
particles (black dots), to reveal where 
within the complex structure each 
protein is located. (Courtesy of John 
Kilmartin.) 



Images of Surfaces Can Be Obtained by Scanning Electron 
Microscopy 

A scanning electron microscope (SEM) directly produces an image of the three- 
dimensional structure of the surface of a specimen. The SEM is usually a smaller, 
simpler, and cheaper device than a transmission electron microscope. Whereas 
the TEM uses the electrons that have passed through the specimen to form an 




2 nm 




Figure 9-47 A three-dimensional 
reconstruction from serial sections. 

(A) A three-dimensional reconstruction of 
the mitochondrial compartment of a live 
yeast cell, assembled from a stack of 
optical sections, shows its complex 
branching structure. Single thin sections 
of such a structure in the electron 
microscope sometimes give misleading 
impressions. In this example (B), most 
sections through a cell containing a 
branched mitochondrion seem to 
contain two or three separate 
mitochondria (compare Figure 9-45). 
Sections 4 and 7, moreover, might be 
interpreted as showing a mitochondrion 
in the process of dividing. The true three- 
dimensional shape, however, can be 
reconstructed from serial sections. 
(A, courtesy of Stefan Hell.) 
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image, the SEM uses electrons that are scattered or emitted from the specimen's 
surface. The specimen to be examined is fixed, dried, and coated with a thin 
layer of heavy metal. Alternatively, it can be rapidly frozen, and then transferred 
to a cooled specimen stage for direct examination in the microscope. Often an 
entire plant part or small animal can be put into the microscope with very little 
preparation (Figure 9-48). The specimen, prepared in any of these ways, is then 
scanned with a very narrow beam of electrons. The quantity of electrons scat- 
tered or emitted as this primary beam bombards each successive point of the 
metallic surface is measured and used to control the intensity of a second beam, 
which moves in synchrony with the primary beam and forms an image on a tele- 
vision screen. In this way, a highly enlarged image of the surface as a whole is 
built up (Figure 9-49). 

The SEM technique provides great depth of field; moreover, since the 
amount of electron scattering depends on the angle of the surface relative to the 
beam, the image has highlights and shadows that give it a three-dimensional 
appearance (see Figure 9-48 and Figure 9-50). Only surface features can be 
examined, however, and in most forms of SEM, the resolution attainable is not 
very high (about 10 nm, with an effective magnification of up to 20,000 times). 
As a result, the technique is usually used to study whole cells and tissues rather 
than subcellular organelles. <AACG> Very high-resolution SEMs have, however, 
been developed with a bright coherent-field emission gun as the electron 
source. This type of SEM can produce images that rival TEM images in resolu- 
tion (Figure 9-51). 



Metal Shadowing Allows Surface Features to Be Examined at 
High Resolution by Transmission Electron Microscopy 

The TEM can also be used to study the surface of a specimen— and generally at 
a higher resolution than in the SEM— to reveal the shape of individual macro- 
molecules for example. As in scanning electron microscopy, a thin film of a 
heavy metal such as platinum is evaporated onto the dried specimen. In this 
case, however, the metal is sprayed from an oblique angle so as to deposit a coat- 
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Figure 9-48 A developing wheat flower, 
or spike. This delicate flower spike was 
rapidly frozen, coated with a thin metal 
film, and examined in the frozen state in 
a SEM. This micrograph, which is at a low 
magnification, demonstrates the large 
depth of focus of the SEM. (Courtesy of 
Kim Findlay.) 
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Figure 9-49 The scanning electron microscope. In a SEM, the specimen is scanned by a beam of electrons brought to a 
focus on the specimen by the electromagnetic coils that act as lenses. The detector measures the quantity of electrons 
scattered or emitted as the beam bombards each successive point on the surface of the specimen and controls the 
intensity of successive points in an image built up on a video screen. The SEM creates striking images of three-dimensional 
objects with great depth of focus and a resolution between 3 nm and 20 nm depending on the instrument. (Photograph 
courtesy of Andrew Davies.) 
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Figure 9-50 Scanning electron 
microscopy. (A) A scanning electron 
micrograph of the stereocilia projecting 
from a hair cell in the inner ear of a 
bullfrog. <CATA> For comparison, the 
same structure is shown by (B) 
differential-interference-contrast light 
microscopy and (C) thin-section 
transmission electron microscopy. 
(Courtesy of Richard Jacobs and James 
Hudspeth.) 
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ing that is thicker in some places than others — a process known as metal shad- 
owing because a shadow effect is created that gives the image a three-dimen- 
sional appearance. 

Some specimens coated in this way are thin enough or small enough for the 
electron beam to penetrate them directly. This is the case for individual 
molecules, macromolecular complexes, and viruses — all of which can be dried 
down, before shadowing, onto a flat supporting film made of a material that is 
relatively transparent to electrons, such as carbon or plastic. The internal struc- 
ture of cells can also be imaged using metal shadowing. In this case samples are 
very rapidly frozen (as described above) and then cracked open with a knife 
blade. The ice level at the fractured surface is lowered by the sublimation of ice 
in a vacuum as the temperature is raised— in a process called freeze-drying. The 
parts of the cell exposed by Lhis etching process are then shadowed as before to 
make a metal replica. The organic material of the cell remains must be dissolved 
away after shadowing to leave only the thin metal replica of die surface of the 
specimen. The replica is then reinforced with a film of carbon so it can he placed 
on a grid and examined in the transmission electron microscope in the ordinary 
way (Figure 9-52), This technique exposes structures in the interior of the cell 
and can reveal their tliree-dimensional organization with exceptional clarity 
(Figure 9-53). 
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Figure 9-51 The nuclear pore. Rapidly 
frozen nuclear envelopes were imaged in 
a high-resolution SEM, equipped with a 
field emission gun as the source of 
electrons. These views of each side of a 
nuclear pore represent the limit of 
resolution of the SEM, and should be 
compared with Figure 12-9. (Courtesy of 
Martin Goldberg and Terry Allen.) 
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Negative Staining and Cryoelectron Microscopy Both Allow 
Macromolecules to Be Viewed at High Resolution 

Although isolated macromolecules, such as DNA or large proteins, can be visu- 
alized readily in the electron microscope if they are shadowed with a heavy 
metal to provide contrast, finer detail can be seen by using negative staining. In 
this technique, the molecules, supported on a thin film of carbon, are mixed 
with a solution of a heavy-metal salt such as uranyl acetate. After the sample has 
dried, a very thin film of metal salt covers the carbon film everywhere except 
where it has been excluded by the presence of an adsorbed macromolecule. 
Because the macromolecule allows electrons to pass through it much more 
readily than does the surrounding heavy-metal stain, a reversed or negative 
image of the molecule is created. Negative staining is especially useful for view- 
ing large macromolecular aggregates such as viruses or ribosomes, and for see- 
ing the suhunit structure of protein filaments (Figure 9-54). 

Shadowing and negative staining can provide high- contrast surface views of 
small macromolecular assemblies, but the size of the smallest metal panicles in 
the shadow or stain used limits the resolution of both techniques. Recent meth- 
ods provide an alternative that has allowed us to visualize directly at high reso- 
lution even the interior features of three-dimensional structures such as viruses 
and organelles. In this technique, called cryoelectron microscopy, rapid freez- 
ing to form vitreous ice is again the key. A very thin (about 100 nm) film of an 
aqueous suspension of vims or purified macromolecular complex is prepared 
on a microscope grid. The specimen is then rapidly frozen by plunging it into a 
coolant. A special sample holder is used to keep this hyd rated specimen at 
-L60°C in die vacuum of the microscope, where it can be viewed directly with- 
out fixation, staining, or drying. Unlike negative staining, in which what we see 
is the envelope of stain exclusion around the particle, hyd rated cryoelectron 
microscopy produces an image from the macromolecular structure itself. How- 
ever, to extract the maximum amount of structural information, special image- 
processing techniques must be used, as we describe next. 
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Multiple Images Can Be Combined to Increase Resolution 

Any image, whether produced by an electron microscope or by an optical micro- 
scope, is made by particles— electrons or photons— striking a detector of some 
sort. But these particles are governed by quantum mechanics, so the numbers 
reaching the detector are predictable only in a statistical sense. In the limit of 
very large numbers of particles, the distribution at the detector is accurately 
determined by the imaged specimen. However, with smaller numbers of parti'^ 
cles, this underlying structure in the image is obscured by the statistical fluctua- 
tions in the numbers of particles detected in each region. The term noise 
describes the random variability that confuses the underlying image of the spec- 



Figure 9-52 The preparation of a metal- 
shadowed replica of the surface of a 
specimen. Note that the thickness of the 
metal reflects the surface contours of the 
original specimen. 
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Figure 9-53 A regular array of protein 
filaments in an insect muscle. To obtain 
this image, the muscle cells were rapidly 
frozen to liquid helium temperature, 
fractured through the cytoplasm, and 
subjected to deep etching. A metal- 
shadowed replica was then prepared and 
examined at high magnification. (Courtesy 
of Roger Cooke and John Heuser.) 
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Figure 9-54 Negatively stained actin 
filaments. In this transmission electron 
micrograph, each filament is about 8 nm 
in diameter and is seen, on close 
inspection, to be composed of a helical 
chain of globular actin molecules. 
(Courtesy of Roger Craig.) 



100 nm 



imen itself. Noise is important in light microscopy at low light levels, but it is a 
particularly severe problem for electron microscopy of unstained macro- 
molecules. A protein molecule can tolerate a dose of only a few tens of electrons 
per square nanometer without damage, and this dose is orders of magnitude 
below what is needed to define an image at atomic resolution. 

The solution is to obtain images of many identical molecules — perhaps tens 
of thousands of individual images— and combine them to produce an averaged 
image, revealing structural details that were hidden by the noise in the original 
images. This procedure is called single-particle reconstruction. Before combin- 
ing all the individual images, however, they must be aligned with each other. 
Sometimes it is possible to induce proteins and complexes to form crystalline 
arrays, in which each molecule is held in the same orientation in a regular lat- 
tice. In this case, the alignment problem is easily solved, and several protein 
structures have been determined at atomic resolution by this type of electron 
crystallography. In principle, however, crystalline arrays are not absolutely 
required. With the help of a computer, the digital images of randomly dis- 
tributed and unaligned molecules can be processed and combined to yield 
high-resolution reconstructions. <[ Vn> Although structures that have some 
intrinsic symmetry make the task of alignment easier and more accurate, this 
technique has has also been used for objects, like ribosomes, with no symmetry. 
Figure 9-55 shows the structure of an icosahedral virus that has been deter- 
mined at high resolution by the combination of many particles and multiple 
views. 

With well-ordered crystalline arrays, a resolution of 0.3 nm has been 
achieved by electron microscopy — enough to begin to see the internal atomic 



Figure 9-55 Single-particle 
reconstruction. Spherical protein shells of 
the hepatitis B virus are preserved in a thin 
film of ice (A) and imaged in the 
transmission electron microscope. 
Thousands of individual particles were 
combined by single-particle 
reconstruction to produce the three- 
dimensional map of the icosahedral 
particle shown in (B). The two views of a 
single protein dimer (C), forming the 
spikes on the surface of the shell, show 
that the resolution of the reconstruction 
(0.74 nm) is sufficient to resolve the 
complete fold of the polypeptide chain. 
(A, courtesy of B. Bottcher, S.A. Wynne, and 
R.A. Crowther; B and C, from B. Bottcher, 
SA Wynne, and R.A. Crowther, Nature 
386:88-91, 1997. With permission from 
Macmillan Publishers Ltd.) 
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arrangements in a protein and to rival x-ray crystallography in resolution. With 
single-particle reconstruction, the present limit is about 0.5 nm, enough to iden- 
tify protein su bun its and domains, and limited protein secondary structure. 
Although electron microscopy is unlikely to supersede x-ray crystallography 
(discussed in Chapter 8) as a method for macromolecular structure determina- 
tion, it has some very clear advanlages. First, it does not absolutely require crys- 
talline specimens. Second, it can deal with extremely large complexes—struc- 
tures that may be too large or too variable to crystallize satisfactorily: 

The analysis oi large and complex macromolecular structures is helped con- 
siderably if the atomic structure of one or more of the suhunits is known, for 
example from x-ray crystallography. Molecular models can then be mathemati- 
cally "fitted" into the envelope of the structure determined at lower resolution 
using the electron microscope. Figure 9-56 shows the structure of a ribosome 
with the location of a bound release factor displayed in this way (see also Figures 
6-74 and 6-75). 




Different Views of a Single Object Can Be Combined to Give a 
Three-dimensional Reconstruction 

The detectors used to record images from electron microscopes produce two- 
dimensional pictures. Because of the large depth of field of the microscope, all 
the parts of the three-dimensional specimen are in focus, and the resulting 
image is a projection of the structure along the viewing direction. The lost infor- 
mation in the third dimension can be recovered if we have views of the same 
specimen from many different directions. The computational methods for this 
technique were worked out in the 1960s, and they are widely used in medical 
computed tomography (CT) scans. In a CT scan, the imaging equipment is 
moved around the patient to generate the different views. In electron-micro- 
scope (EM) tomography, the specimen holder is tilted in the microscope, which 
achieves the same result. In this way, we can arrive at a three-dimensional 
reconstruction, in a chosen standard orientation, by combining a set of different 
views of a single object in the microscope's field of view. Each individual view 
will be very noisy, but by combining them in three dimensions and taking an 
average, the noise can be largely eliminated, yielding a clear view of the molec- 
ular structure. Starting with thick plastic sections of embedded material, three- 
dimensional reconstructions, or tomograms, <ATCC> <CGAT> are used exten- 
sively to describe the detailed anatomy or small regions of the cell, such as the 
Golgi apparatus (Figure 9-57) or the cytoskeleton. Increasingly, however, micro- 
scopists are applying EM tomography lo unstained frozen hyd rated sections, 
and even to rapidly frozen whole celis or organelles [Figure 9-58). electron 
microscopy now provides a robust bridge between the scale of the single 
molecule and that of the whole cell 







Figure 9-56 Single-particle reconstruction 
and molecular model fitting. Bacterial 
ribosomes, with and without the release 
factor required for peptide release from the 
ribosome, were used here to derive high- 
resolution three-dimensional cryo-EM maps 
at a resolution of better than 1 nm. Images 
of nearly 20,000 separate ribosomes, 
preserved in ice, were used to produce 
single particle reconstructions. In (A) the 
30S ribosomal subunit (yellow) and the 50S 
subunit (blue) can be distinguished from 
the additional electron density that can be 
attributed to the release factor RF2 (pink). 
The known molecular structure of RF2 has 
then been modeled into this electron 
density (B). (From U.B.S. Rawat et al., Nature 
421:87-90, 2003. With permission from 
Macmillan Publishers Ltd.) 



Summary 

Determining the detailed structure of the membranes and organelles in cells requires 
the higher resolution attainable in a transmission electron microscope. Specific 
macromolecules can be localized with colloidal gold linked to antibodies. Three- 
dimensional views of the surfaces of cells and tissues are obtained by scanning electron 
microscopy. The shapes of isolated macromolecules that have been shadowed with a 
heavy metal or outlined by negative staining can also he readily deter mined by elec- 
tron microscopy Using computational methods, either multiple images or views from 
different directions can be combined to produce detailed reconstructions of macro- 
molecules and molecular complexes through the techniques oj electron tomography 
and single-panicle reconstruction, often applied to cryo- preserved specimens. The res- 
olution obtained with these methods means that atomic structures of individual 
macromolecules can often he 'fmecrto the Images derived by electron microscopy, and 
thai the TEM is increasingly able to completely bridge the gap between stntctures 
determined by x-ray crystallography and those determined in the light microscope. 
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Figure 9-57 Electron microscope (EM) 
tomography. Samples that have been 
rapidly frozen, and then freeze- 
substituted and embedded in plastic, 
preserve their structure in a condition 
that is very close to their original living 
state. This experiment shows an analysis 
of the three-dimensional structure of the 
Golgi apparatus from a rat kidney cell 
prepared in this way. Several thick 
sections (250 nm) of the cell have been 
tilted in a high-voltage electron 
microscope, along two different axes, and 
about 160 different views recorded. The 
digital data were combined using EM 
tomography methods to produce a final 
three-dimensional reconstruction at a 
resolution of about 7 nm. The computer 
can then present very thin slices of the 
complete three-dimensional data set, or 
tomogram, and two serial slices, each 
only 4 nm thick, are shown here (A) and 
(B). Very little changes from one slice to 
the next, but using the full data set, and 
by manually color coding the membranes 
(B), a full three-dimensional picture of the 
complete Golgi complex and its 
associated vesicles can be presented (C). 
(From M.S. Ladinsky et alJ.Cel! Biol. 
1 44;1 1 35-1 1 49, 1 999. With permission 
from The Rockefeller University Press.) 



Figure 9-58 Combining cryo-EM 
tomography and single-particle 
reconstruction. In addition to sections, 
the technique of EM tomography may 
also be applied to small unfixed 
specimens that are rapidly frozen and 
examined, while still frozen, using a 
tilting stage in the microscope. In this 
experiment the small nuclei of 
Dictyostelium are gently isolated and 
then very rapidly frozen before a series of 
tilted views of them is recorded. These 
different digital views are combined by 
EM tomography methods to produce a 
three-dimensional tomogram. Two thin 
digital slices (10 nm) through this 
tomogram are shown, in which top views 
(A) and side views (B) of individual 
nuclear pores can be seen. In the three- 
dimensional model (C), a surface 
rendering of the pores (blue) can be seen 
embedded in the nuclear envelope 
(yellow). From a series of tomograms it 
was possible to extract data sets for 
nearly 300 separate nuclear pores, whose 
structures could then be averaged using 
the techniques of single particle 
reconstruction. The surface-rendered 
view of one of these reconstructed pores 
is shown from the nuclear face in (D) and 
in section in (E) and should be compared 
with Figure 12-10. The pore complex is 
colored blue and the nuclear basket 
brown. (From M. Beck et al., Science 
306:1 387-1 390, 2004. With permission 
from AAAS.) 
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PROBLEMS 

Which statements are true? Explain why or why not. 

9-1 Because the DNA double helix is only 10 nm wide- 
well below the resolution of the light microscope— it is 
impossible to see chromosomes in living cells without spe- 
cial stains. 

9-2 A fluorescent molecule, having absorbed a single 
photon of light at one wavelength, always emits it at a longer 
wavelength. 

9-3 Caged molecules can be introduced into a cell and 
then activated by a strong pulse of laser light at the precise 
time and cellular location chosen by the experimenter. 

Discuss the following problems. 

9-4 The diagrams in Figure Q9-1 show the paths of light 
rays passing through a specimen with a dry lens and with an 
oil-immersion lens. Offer an explanation for why oil-immer- 
sion lenses should give better resolution. Air, glass, and oil 
have refractive indices of 1.00, 1.51, and 1.51, respectively 
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Figure Q9-1 Paths of light rays through dry and oil-immersion 
lenses (Problem 9-4). The white circle at the origin of the light rays is 
the specimen. 

9-5 Figure Q9-2 shows a diagram of the human eye. The 
refractive indices of the components in the light path are: 
cornea 1.38, aqueous humor 1.33, crystalline lens 1.41, and 
vitreous humor 1.38. Where does the main refraction— the 
main focusing— occur? What role do you suppose the lens 
plays? 
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Figure Q9-2 Diagram of the 

human eye (Problem 9-5). 



\ retina 






9-6 Why do humans see so poorly underwater? And why 
do goggles help? 

9-7 Explain the difference between resolution and mag- 
nification. 

9-8 Antibodies that bind to specific proteins are impor- 
tant tools for defining the locations of molecules in cells. 
The sensitivity of the primary antibody— the antibody that 
reacts with the target molecule— is often enhanced by using 
labeled secondary antibodies that bind to it. What are the 
advantages and disadvantages of using secondary antibod- 
ies that carry fluorescent tags versus those that carry bound 
enzymes? 



9-9 Figure Q9-3 shows a series of modified GFPs that 
emit light in a range of colors. How do you suppose the exact 
same chromophore can fluoresce at so many different wave- 
lengths? 



uttftmi 



Figure Q9-3 A rainbow of colors produced by modified GFPs 
(Problem 9-9). (From R.F. Service, Science 306:1457, 2004. With 
permission from AAAS.) 

9-1 Consider a fluorescent detector designed to report the 
cellular location of active protein tyrosine kinases. A blue 
(cyan) fluorescent protein (CFP) and a yellow fluorescent pro- 
tein (YFP) were fused to either end of a hybrid protein 
domain. The hybrid protein segment consisted of a substrate 
peptide recognized by the Abl protein tyrosine kinase and a 
phosphotyrosine binding domain (Figure Q9-4A). Stimula- 
tion of the CFP domain does not cause emission by the YFP 
domain when the domains are separated. When the CFP and 
YFP domains are brought close together, however, fluores- 
cence resonance energy transfer (FRET) allows excitation of 
CFP to stimulate emission by YFP FRET shows up experimen- 
tally as an increase in the ratio of emission at 526 nm versus 
476 nm (YFP/CFP) when CFP is excited by 434-nm light. 

Incubation of the reporter protein with Abl protein tyro- 
sine kinase in the presence of ATP gave an increase in 
YFP/CFP emission (Figure Q9-4B). In the absence of ATP or 
the Abl protein, no FRET occurred. FRET was also elimi- 
nated by addition of a tyrosine phosphatase (Figure Q9-4B). 
Describe as best you can how the reporter protein detects 
active Abl protein tyrosine kinase. 
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Figure Q9-4 Fluorescent 
reporter protein designed 
to detect tyrosine 
phosphorylation (Problem 
9-10). (A) Domain structure 
of reporter protein. Four 
domains are indicated: CFP, 
YFP, tyrosine kinase substrate 
peptide, and a 
phosphotyrosine-binding 
domain. (B) FRET assay. 
YFP/CFP is normalized to 
1 .0 at time zero. The reporter 
was incubated in the presence 
(or absence) of Abl and ATP 
for the indicated times. 
Arrow indicates time of 
addition of a tyrosine 
phosphatase. (From 
A.Y.Ting,K.H.KIain, 
R.L KlemkeandR.Y.Tsien, 
Proc Natl Acad. Sci. U.S.A. 
98:15003-15008,2001. 
With permission from National 
Academy of Sciences.) 



9-11 The practical resolving power of modern electron 
microscopes is around 0.1 nm. The major reason for this 
constraint is the small numerical aperture (n sin 6), which is 
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limited by (half the angular width of rays collected at the 
objective lens). Assuming that the wavelength (A) of the 
electron is 0.004 nm and that the refractive index {n) is 1.0, 
calculate the value for 6. How does that value compare with 
a 9 of 60°, which is typical for light microscopes? 



resolution = 



0.61 A 
nsin 6 



9-12 It is difficult to tell bumps from pits just by looking at 
the pattern of shadows. Consider Figure Q9-5, which shows a 
set of shaded circles. In Figure Q9-5A the circles appear to be 
bumps; however, when the picture is simply turned upside 
down (Figure Q9-5B), the circles seem to be pits. This is a clas- 
sic illusion. The same illusion is present in metal shadowing, 
as shown in the two electron micrographs in Figure Q9-5. In 
one the membrane appears to be covered in bumps, while in 
the other the membrane looks heavily pitted. Is it possible for 
an electron microscopist to be sure that one view is correct, or 
is it all arbitrary? Explain your reasoning. 
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Membrane Structure 




Cell membranes are crucial to the life of the cell The plasma membrane 
encloses the cell, defines its boundaries, and maintains the essential differences 
between the cytosol and the extracellular environment. Inside eucaryotic cells, 
the membranes of the endoplasmic reticulum, Golgi apparatus, mitochondria, 
and other membrane-enclosed organelles maintain the characteristic differ- 
ences between the contents of each organelle and the cytosol. Ion gradients 
across membranes, established by the activities of specialized membrane pro- 
teins, can be used to synthesize ATR to drive the transmembrane movement of 
selected solutes, or, as in nerve and muscle cells, to produce and transmit elec- 
trical signals. In all cells, the plasma membrane also contains proteins that act 
as sensors of external signals, allowing the cell to change its behavior in 
response to environmental cues, including signals from other cells; these pro- 
tein sensors, or receptors, transfer information — rather than molecules — across 
the membrane. 

Despite their differing functions, all biological membranes have a common 
general structure: each is a very thin film of lipid (fatty) and protein molecules, 
held together mainly by noncovalent interactions (Figure 10-1). Cell mem- 
branes are dynamic, fluid structures, and most of their molecules move about in 
the plane of the membrane. The lipid molecules are arranged as a continuous 
double layer about 5 nm thick. This lipid bilayer provides the basic fluid struc- 
ture of the membrane and serves as a relatively impermeable barrier to the pas- 
sage of most water-soluble molecules. Protein molecules that span the lipid 
bilayer {transmembrane proteins) see Figure 10-1) mediate nearly all of the other 
functions of the membrane, transporting specific molecules across it, for exam- 
ple, or catalyzing membrane-associated reactions such as ATP synthesis. In the 
plasma membrane, some transmembrane proteins serve as structural links that 
connect the cytoskeleton through the lipid bilayer to either the extracellular 
matrix or an adjacent cell, while others serve as receptors to detect and trans- 
duce chemical signals in the cell's environment. As would be expected, it takes 
many different membrane proteins to enable a cell to function and interact with 
its environment, and it is estimated that about 30% of the proteins encoded in 
an animal cell's genome are membrane proteins. 

In this chapter, we consider the structure and organization of the two main 
constituents of biological membranes — the lipids and the proteins. Although we 
focus mainly on the plasma membrane, most concepts discussed apply to the 
various internal membranes in cells as well. The functions of cell membranes 
are considered in later chapters: their role in ATP synthesis, for example, is dis- 
cussed in Chapter 14; their role in the transmembrane transport of small 
molecules in Chapter 11; and their roles in cell signaling and cell adhesion in 
Chapters 15 and 19, respectively. In Chapters 12 and 13, we discuss the internal 
membranes of the cell and the protein traffic through and between them. 
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THE LIPID BILAYER 

The lipid bilayer provides the basic structure for all cell membranes. It is easily 
seen by electron microscopy, and its structure is attributable exclusively to the 
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special properties of the lipid molecules, which assemble spontaneously into 
bilayers even under simple artificial conditions. 



Phosphoglycerides, Sphingolipids, and Sterols Are the Major 
Lipids in Cell Membranes 

Lipid molecules constitute about 50% of the mass of most animal cell mem- 
branes, nearly all of the remainder being protein. There are approximately 5 x 
10 6 lipid molecules in a 1 |im x 1 jam area of lipid bilayer, or about 10 9 lipid 
molecules in the plasma membrane of a small animal cell. All of the lipid 
molecules in cell membranes are amphiphilic — that is, they have a hydrophilic 
("water-loving") or polar end and a hydrophobic ("water-fearing") or nonpolar 
end. 

The most abundant membrane lipids are the phospholipids. These have a 
polar head group and two hydrophobic hydrocarbon tails. In animal, plant, and 
bacterial cells, the tails are usually fatty acids, and they can differ in length (they 
normally contain between 14 and 24 carbon atoms). One tail typically has one 
or more ds-double bonds (that is, it is unsaturated), while the other tail does not 
(that is, it is saturated). As shown in Figure 10-2, each c/5-double bond creates a 
small kink in the tail. Differences in the length and saturation of the fatty acid 
tails influence how phospholipid molecules pack against one another, thereby 
affecting the fluidity of the membrane, as we discuss later. 

The main phospholipids in most animal cell membranes are the phospho- 
glycerides, which have a three-carbon glycerol backbone (see Figure 10-2). Two 
long-chain fatty acids are linked through ester bonds to adjacent carbon atoms 
of the glycerol, and the third carbon atom is attached to a phosphate group, 
which in turn is linked to one of several different types of head group. By com- 
bining several different fatty acids and head groups, cells make many different 
phosphoglycerides. Phosphatidylethanolamine, phosphatidylserine, and phos- 
phatidylcholine are the main ones in mammalian cell membranes (Figure 
10-3A-C) 

Another important phospholipid, called sphingomyelin, is built from sphin- 
gosine rather than glycerol (Figure 10-3D-E). Sphingosine is a long acyl chain 
with an amino group (NH 2 ) and two hydroxyl groups (OH) at one end of the 
molecule. In sphingomyelin, a fatty acid tail is attached to the amino group, and 
a phosphocholine group is attached to the terminal hydroxyl group, leaving one 
hydroxyl group free. The free hydroxyl group contributes to the polar properties 
of the adjacent head group, as it can form hydrogen bonds with the head group 
of a neighboring lipid, with a water molecule, or with a membrane protein. 
Together, the phospholipids phosphatidylcholine, phosphatidylethanolamine, 
phosphatidylserine, and sphingomyelin constitute more than half the mass of 
lipid in most mammalian cell membranes (see Table 10-1). 



Figure 1 0-1 Three views of a cell 
membrane. <TAGC> (A) An electron 
micrograph of a plasma membrane (of a 
human red blood cell) seen in cross 
section. (B and C) These drawings show 
two-dimensional and three-dimensional 
views of a cell membrane and the 
general disposition of its lipid and 
protein constituents. (A, courtesy of 
Daniel S. Friend.) 
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Figure 1 0-2 The parts of a phosphoglyceride molecule. This example is a phosphatidylcholine, 
represented (A) schematically, (B) by a formula, (C) as a space-filling model, and (D) as a symbol. The kink 
resulting from the c/'s-double bond is exaggerated for emphasis. 
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Figure 10-3 Four major phospholipids in mammalian plasma membranes. Different head groups are represented by different colors in the 
symbols. The lipid molecules shown in (A-C) are phosphoglycerides, which are derived from glycerol. The molecule in (D) is sphingomyelin, 
which is derived from sphingosine (E) and is therefore a sphingolipid. Note that only phosphatidylserine carries a net negative charge, the 
importance of which we discuss later; the other three are electrically neutral at physiological pH, carrying one positive and one negative charge. 
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In addition to phospholipids, the lipid bilayers in many cell membranes 
contain cholesterol and glycolipids. Eucaryotic plasma membranes contain 
especially large amounts of cholesterol (Figure 10-4)— up to one molecule for 
every phospholipid molecule. Cholesterol is a sterol. It contains a rigid ring 
structure, to which is attached a single polar hydroxyl group and a short non- 
polar hydrocarbon chain. The cholesterol molecules orient themselves in the 
bilayer with their hydroxyl group close to the polar head groups of adjacent 
phospholipid molecules (Figure 10-5). 



Figure 10-4 The structure of cholesterol. 

Cholesterol is represented (A) by a 
formula, (B) by a schematic drawing, and 
(C) as a space-filling model. 



Phospholipids Spontaneously Form Bilayers 

The shape and amphiphilic nature of the phospholipid molecules cause them to 
form bilayers spontaneously in aqueous environments. As discussed in Chapter 
2, hydrophilic molecules dissolve readily in water because they contain charged 
groups or uncharged polar groups that can form either favorable electrostatic 
interactions or hydrogen bonds with water molecules. Hydrophobic molecules, 
by contrast, are insoluble in water because all, or almost all, of their atoms are 
uncharged and nonpolar and therefore cannot form energetically favorable 
interactions with water molecules. If dispersed in water, they force the adjacent 
water molecules to reorganize into ice-like cages diat surround the hydrophobic 
molecule (Figure 10-6), Because these cage structures are more ordered than 
the surrounding water, Lheir formation increases the free energy. This free 
energy cost is minimized, however, if the hydrophobic molecules (or the 
hydrophobic portions of amphiphilic molecules] cluster together so that the 
smallest number of water molecules is affected. 

The hydrophobic and hydrophilic regions of lipid molecules behave in the 
same way Thus, lipid molecules spontaneously aggregate to bury their 
hydrophobic hydrocarbon tails in the interior and expose their hydrophilic 
heads to water. Depending on their shape, they can do this in either of two ways: 
they can form spherical micelles, with the tails inward, or they can form double- 
layered sheets, or bilayers, with the hydrophobic tails sandwiched between the 
hydrophilic head groups (Figure 10-7), 

Being cylindrical, phospholipid molecules spontaneously form bilayers in 
aqueous environments. In this energetically most favorable arrangement, the 
hydrophilic heads face the water at each surface of the bilayer, and the 
hydrophobic tails are shielded from the water in the interior. The same forces 
that drive phospholipids to form bilayers also provide a self-healing property. A 
small tear in the bilayer creates a free edge with water; because this is energeti- 
cally unfavorable, the lipids tend to rearrange spontaneously to eliminate the free 
edge. (In eucaryotic plasma membranes, the fusion of intracellular vesicles 
repairs larger tears.) The prohibition of free edges has a profound consequence: 
the only way for a bilayer to avoid having edges is by closing in on itself and form- 
ing a sealed compartment (Figure 10-8). This remarkable behavior, fundamental 



t • 




polar 
head 
groups 

cholesteroj- 

stiffened 

region 



more 
fluid 
region 



Figure 10-5 Cholesterol in a lipid 
bilayer. Schematic drawing of a 
cholesterol molecule interacting with two 
phospholipid molecules in one 
monolayer of a lipid bilayer. 
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Figure 10-6 How hydrophilic and hydrophobic molecules interact differently with water. (A) Because 
acetone is polar, it can form favorable electrostatic interactions with water molecules, which are also polar. 
Thus, acetone readily dissolves in water. (B) By contrast, 2-methyl propane is entirely hydrophobic. Because 
it cannot form favorable interactions with water, it would force adjacent water molecules to reorganize into 
icelike cage structures, which increases the free energy. This compound is therefore virtually insoluble in 
water. The symbol 8" indicates a partial negative charge, and 6 + indicates a partial positive charge. Polar 
atoms are shown in color and nonpolar groups are shown in gray. 



to the creation of a living cell, follows directly from the shape and amphiphilic 
nature of the phospholipid molecule. 

A lipid bilayer also has other characteristics that make it an ideal structure 
for cell membranes. One of the most important of these is its fluidity, which is 
crucial to many membrane functions. 



The Lipid Bilayer Is a Two-dimensional Fluid 

Around 1970, researchers first recognized that individual lipid molecules are 
able to diffuse freely within lipid bilayers. The initial demonstration came from 
studies of synthetic lipid bilayers. Two types of preparations have been very use- 
ful in such studies: (1) bilayers made in the form of spherical vesicles, called 
liposomes, which can vary in size from about 25 nm to 1 \im in diameter 
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Figure 10-7 Packing arrangements of lipid molecules in an aqueous 
environment. (A) Cone-shaped lipid molecules (above) form micelles, whereas 
cylinder-shaped phospholipid molecules (below) form bilayers. (B) A lipid micelle 
and a lipid bilayer seen in cross section. Lipid molecules spontaneously form one 
or the other structure in water, depending on their shape. 
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Figure 1 0-8 The spontaneous closure of 
a phospholipid bilayer to form a sealed 
compartment. The closed structure is 
stable because it avoids the exposure of 
the hydrophobic hydrocarbon tails to 
water, which would be energetically 
unfavorable. 
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depending on how they are produced (Figure 10-9); and (2) planar bilayers, 
called black membranes, formed across a hole in a partition between two aque- 
ous compartments (Figure 10-10). 

Various techniques have been used to measure the motion of individual lipid 
molecules and their components. One can construct a lipid molecule, for exam- 
ple, with a fluorescent dye or a small gold particle attached to its polar head group 
and follow the diffusion of even individual molecules in a membrane. Alterna- 
tively, one can modify a lipid head group to carry a "spin label/' such as a nitroxyl 
group £:N-0); this contains an unpaired electron whose spin creates a param- 
agnetic signal that can be detected by electron spin resonance [ESR) spec- 
troscopy. (The principles of this technique are similar to those of nuclear mag- 
netic resonance, discussed in Chapter 80 The motion and orientation of a spin- 
labeled lipid in a bilayer can be deduced from the ESR spectrum. Such studies 
show that phospholipid molecules in synthetic bilayers very rarely migrate from 
the monolayer (also called a leaflet) on one side to that on the other. This process, 
known as "flip-flop," occurs less than once a month for any individual molecule, 
although cholesterol is an exception to this rule and can flip-flop rapidly. In con- 
trast, lipid molecules readily exchange places with their neighbors within a 
monolayer (~10 7 times per second). This gives rise to a rapid lateral diffusion, 
with a diffusion coefficient (D) of about 10" 8 cm 2 /sec, which means that an aver- 
age lipid molecule diffuses the length of a large bacterial cell (~2 urn) in about 1 
second. These studies have also shown that individual lipid molecules rotate very 
rapidly about their long axis and have flexible hydrocarbon chains. Computer 
simulations show that lipid molecules in membranes are very 1 disordered, pre- 
senting an irregular surface of variously spaced and oriented head groups to the 
water phase on either side of the bilayer [Hgurc 10-1 1 .). 

Similar mobility studies on labeled lipid molecules in isolated biological 
membranes and in living cells give results similar to those in synthetic bilayers. 
They demonstrate that the lipid component of a biological membrane is a two- 
dimensional liquid in which the constituent molecules are free to move laterally. 
As in synthetic bilayers. individual phospholipid molecules are normally con- 
fined to their own monolayer, This confinement creates a problem for their syn- 
thesis. Phospholipid molecules are manufactured in only one monolayer of a 
membrane, mainly in the cytosalic monolayer of the endoplasmic reticulum 
membrane. If none of these newly made molecules could migrate reasonably 
promptly to the noncytosolic monolayer, new lipid bilayer could not be made. 
The problem is solved by a special class of transmembrane enzymes called phos- 
pholipid translocators, which catalyze the rapid flip-flop of phospholipids from 
one monolayer to the other, as discussed in Chapter 12. 



The Fluidity of a Lipid Bilayer Depends on Its Composition 

The fluidity of cell membranes has to be precisely regulated. Certain membrane 
transport processes and enzyme activities, for example, cease when the bilayer 
viscosity is experimentally increased beyond a threshold level. 

The fluidity of a lipid bilayer depends on both its composition and its tem- 
perature, as is readily demonstrated in studies of synthetic bilayers. A synthetic 
bilayer made from a single type of phospholipid changes from a liquid state to a 
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Figure 10-9 Liposomes. (A) An electron 
micrograph of unfixed, unstained 
phospholipid vesicles — liposomes — in 
water rapidly frozen to liquid nitrogen 
temperature. (B) A drawing of a small 
spherical liposome seen in cross section. 
Liposomes are commonly used as model 
membranes in experimental studies. 
(A, from P. Frederik and W. Hubert, Meth. 
Enzymol. 391 :431, 2005. With permission 
from Elsevier.) 




lipid bilayer (black membrane) 



Figure 1 0-1 A cross-sectional view of 
a black membrane, a synthetic lipid 
bilayer. This planar bilayer appears black 
when it forms across a small hole in a 
partition separating two aqueous 
compartments. Black membranes are 
used to measure the permeability 
properties of synthetic membranes. 
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Figure 10-1 1 The mobility of phospholipid molecules in an artificial lipid bilayer- <CACA> Starting with 
a model of 1 00 phosphatidylcholine molecules arranged in a regular bilayer, a computer calculated the 
position of every atom after 300 picoseconds of simulated time. From these theoretical calculations (taking 
weeks of processor time in 1995), a model of the lipid bilayer emerges that accounts for almost all of the 
measurable properties of a synthetic lipid bilayer, such as its thickness, number of lipid molecules per 
membrane area, depth of water penetration, and unevenness of the two surfaces. Note that the tails in one 
monolayer can interact with those in the other monolayer, if the tails are long enough. (B) The different 
motions of a lipid molecule in a bilayer. (A, based on 5.W. Chiu et al., Biophys. J. 69:1 230-1 245, 1 995. With 
permission from the Biophysical Society.) 



two-dimensional rigid crystalline (or gel) state at a characteristic freezing point. 
This change of state is called a phase transition, and the temperature at which it 
occurs is lower (that is, the membrane becomes more difficult to freeze) if the 
hydrocarbon chains are short or have double bonds. A shorter chain length 
reduces the tendency of the hydrocarbon tails to interact with one another, in 
both the same and opposite monolayer, and ds-double bonds produce kinks in 
the hydrocarbon chains that make them more difficult to pack together, so that 
the membrane remains fluid at lower temperatures (Figure 10-12). Bacteria, 
yeasts, and other organisms whose temperature fluctuates with that of their 
environment adjust the fatty acid composition of their membrane lipids to 
maintain a relatively constant fluidity. As the temperature falls, for instance, the 
cells of those organisms synthesize fatty acids with more ds-double bonds, and 
they avoid the decrease in bilayer fluidity that would otherwise result from the 
temperature drop. 

Cholesterol modulates the properties of lipid bilayers. When mixed with 
phospholipids, it enhances the permeability-barrier properties of the lipid 
bilayer. It inserts into the bilayer with its hydroxyl group close to the polar head 
groups of the phospholipids, so that its rigid, platelike steroid rings interact 
with— and partly immobilize— those regions of the hydrocarbon chains closest 
to the polar head groups (see Figure 10-5). By decreasing the mobility of the first 
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Figure 1 0-1 2 The influence of cis- 
double bonds in hydrocarbon chains. 

The double bonds make it more difficult 
to pack the chains together, thereby 
making the lipid bilayer more difficult to 
freeze. In addition, because the 
hydrocarbon chains of unsaturated lipids 
are more spread apart, lipid bilayers 
containing them are thinner than bilayers 
formed exclusively from saturated lipids. 
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Table 1 0-1 Approximate Lipid Compositions of Different Cell Membranes 
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few CH 2 groups of the hydrocarbon chains of the phospholipid molecules, 
cholesterol makes the lipid bilayer less deformable in this region and thereby 
decreases the permeability of the bilayer to small water-soluble molecules. 
Although cholesterol tightens the packing of the lipids in a bilayer, it does not 
make membranes any less fluid. At the high concentrations found in most 
eucaryotic plasma membranes, cholesterol also prevents the hydrocarbon 
chains from coming together and crystallizing. 

Table 10-1 compares the lipid compositions of several biological mem- 
branes. Note that bacterial plasma membranes are often composed of one main 
type of phospholipid and contain no cholesterol; their mechanical stability is 
enhanced by an overlying cell wall (see Figure 11-18). In archaea, lipids usually 
contain 20-25-carbon-long prenyl chains instead of fatty acids, prenyl and fatty 
acid chains are similarly hydrophobic and flexible (see Figure 10-20F) . Thus, lipid 
bilayers can be built from molecules with similar features but different molecu- 
lar designs. The plasma membranes of most eucaryotic cells are more varied than 
those of procaryotes and archaea, not only in containing large amounts of 
cholesterol but also in containing a mixture of different phospholipids. 

Analysis of membrane lipids by mass spectrometry has revealed that the 
lipid composition of a typical cell membrane is much more complex than origi- 
nally thought. According to these studies, membranes are composed of a bewil- 
dering variety of 500-1000 different lipid species. While some of this complexity 
reflects the combinatorial variation in head groups, hydrocarbon chain lengths, 
and desaturation of the major phospholipid classes, membranes also contain 
many structurally distinct minor lipids, at least some of which have important 
functions. The inositol phospholipids, for example, are present in small quanti- 
ties but have crucial functions in guiding membrane traffic and in cell signaling 
(discussed in Chapters 13 and 15, respectively). Their local synthesis and 
destruction are regulated by a large number of enzymes, which create both small 
intracellular signaling molecules and lipid docking sites on membranes that 
recruit specific proteins from the cytosol, as we discuss later. 



Despite Their Fluidity, Lipid Bilayers Can Form Domains of 
Different Compositions 

Because a lipid bilayer is a two-dimensional fluid, we might expect most types of 
lipid molecules in it to be randomly distributed in their own monolayer. The van 
der Waals attractive forces between neighboring hydrocarbon tails are not selec- 
tive enough to hold groups of phospholipid molecules together. With certain 
lipid mixtures, however, different lipids can come together transiently, creating 
a dynamic patchwork of different domains. In synthetic lipid bilayers composed 
of phosphatidylcholine, sphingomyelin, and cholesterol, van der Waals forces 
between the long and saturated hydrocarbon chains of the sphingomyelin 
molecules can be just strong enough to hold the adjacent molecules together 
transiently (Figure 10-13). 
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There has been a long debate among scientists whether the lipid molecules 
in the plasma membrane of animal cells can transiently assemble into special- 
ized domains, called lipid rafts. Certain specialized regions of the plasma mem- 
brane, such as the caveolae involved in endocytosis (discussed in Chapter 13), 
are enriched in sphingolipids and cholesterol, and it is thought that the specific 
proteins that assemble there help stabilize these rafts. Because the hydrocarbon 
chains of sphingolipids are longer and straighter than those of other membrane 
lipids, raft domains are thicker than other parts of the bilayer (see Figure 10-12) 
and better accommodate certain membrane proteins (Figure 10-14). Thus, the 
lateral segregation of proteins and of lipids into raft domains would, in principle, 
be a mutually stabilizing process. In this way, lipid rafts could help organize 
membrane proteins — concentrating them either for transport in membrane vesi- 
cles (discussed in Chapter 13) or for working together in protein assemblies, as 
when they convert extracellular signals into intracellular ones (discussed in 
Chapter 15). 



Figure 10-13 Lateral phase separation 
in artifical lipid bilayers. (A) Giant 
liposomes produced from a 1:1 mixture 
of phosphatidylcholine and spingomyelin 
form uniform bilayers, whereas (B) 
liposomes produced from a 1:1:1 mixture 
of phosphatidylcholine, spingomyelin, 
and cholesterol form bilayers with two 
immiscible phases. The liposomes are 
stained with trace concentrations of a 
fluorescent dye that preferentially 
partitions into one of the phases. The 
average size of the domains formed in 
these giant artificial liposomes is much 
larger than that expected in biological 
membranes, where rafts may be as small 
as a few nanometers in diameter. (A, from 
N. Kahya et al., J. Struct. Biol. 147:77-89, 
2004. With permission from Elsevier; 
B, courtesy of Petra Schwille.) 



Lipid Droplets Are Surrounded by a Phospholipid Monolayer 

Most cells store an excess of lipids in lipid droplets, from where they can be 
retrieved as building blocks for membrane synthesis or as a food source. Fat 
cells, also called adipocytes, are specialized for lipid storage (see Figure 14-34). 
They contain vast numbers of large lipid droplets, from which fatty acids can be 
liberated on demand and exported to other cells through the bloodstream. 
Lipid droplets store neutral lipids, such as triacylglycerides and cholesterol 
esters, which are synthesized from fatty acids and cholesterol by enzymes in the 
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500 nm 

Figure 10-1 4 The effects of lipid rafts in artificial lipid bilayers. (A) The surface contours of a synthetic bilayer containing lipid 
rafts, analyzed by atomic force microscopy. Note that the raft areas, shown in orange, are thicker than the rest of the bilayer; as 
in Figure 1 0-1 3, the rafts primarily contain sphingomyelin and cholesterol. The sharp, yellow spikes are incorporated protein 
molecules, which are attached to the bilayer by a glycosylphosphatidyl-inositol (GPI) anchor (illustrated in Figure 1 0-1 9, 
example 6), and preferentially partition into the raft domains. (B) Because of both their increased thickness and lipid 
composition, rafts are thought to concentrate specific membrane proteins (dark green). (A, from D.E. Saslowsky et al., J. Biol. 
Chem. 277:26966-26970, 2002. With permission from the American Society for Biochemistry and Molecular Biology.) 
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Figure 10-1 5 A model for the formation 
of lipid droplets. Neutral lipids are 
deposited between the two monolayers of 
the endoplasmic reticulum membrane. 
There, they aggregate into a three- 
dimensional droplet which buds and 
pinches off from the endoplasmic 
reticulum membrane as a unique 
organelle, surrounded by a single 
monolayer of phospholipids and 
associated proteins. (Adapted from 
S. Martin and R.G. Parton, Nat Rev. Mol. 
Cell Biol. 7:373-378, 2006. With permission 
from Macmillan Publishers Ltd.) 



endoplasmic reticulum membrane. Because these lipids do not contain 
hydrophilic head groups, they are exclusively hydrophobic molecules, which 
aggregate into three-dimensional droplets rather than into bilayers. 

Lipid droplets are unique organelles because they are surrounded by a single 
monolayer of phospholipids, which contains a large variety of proteins. Some of 
the proteins are enzymes involved in lipid metabolism, but the functions of most 
are unknown. Lipid droplets form rapidly when cells are exposed to high con- 
centrations of fatty acids. They form from discrete regions of the endoplasmic 
reticulum membrane where many enzymes of lipid metabolism are concen- 
trated. Figure 10-15 shows one model of how lipid droplets may form and 
acquire their surrounding monolayer of phospholipids and proteins. 

The Asymmetry of the Lipid Bilayer Is Functionally Important 

The lipid compositions of the two monolayers of the lipid bilayer in many mem- 
branes are strikingly different. In the human red blood cell membrane, for exam- 
ple, almost all of the phospholipid molecules that have choline — 
(CH 3 )3N + CH 2 CH20H — in their head group (phosphatidylcholine and sphin- 
gomyelin) are in the outer monolayer, whereas almost all that contain a termi- 
nal primary amino group (phosphatidylethanolamine and phosphatidylserine) 
are in the inner monolayer (Figure 10-16). Because the negatively charged 
phosphatidylserine is located in the inner monolayer, there is a significant dif- 
ference in charge between the two halves of the bilayer. We discuss in Chapter 12 
how membrane-bound phospholipid translocators generate and maintain lipid 
asymmetry. 

Lipid asymmetry is functionally important, especially in converting extra- 
cellular signals into intracellular ones (discussed in Chapter 15). Many cytosolic 
proteins bind to specific lipid head groups found in the cytosolic monolayer of 
the lipid bilayer. The enzyme protein kinase C (PKC), for example, is activated in 
response to various extracellular signals. It binds to the cytosolic face of the 
plasma membrane, where phosphatidylserine is concentrated, and requires this 
negatively charged phospholipid for its activity. 
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Figure 1 0-1 6 The asymmetrical 
distribution of phospholipids and 
glycolipids in the lipid bilayer of human 
red blood cells. The colors used for the 
phospholipid head groups are those 
introduced in Figure 10-3. In addition, 
glycolipids are drawn with hexagonal 
polar head groups (blue). Cholesterol 
(not shown) is thought to be distributed 
roughly equally in both monolayers. 
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Figure 10-1 7 Two signaling functions of inositol phospholipids in the 
cytosolic leaflet of the plasma membrane. (A) Some extracellular signals 
activate PI 3-kinase, which phosphorylates inositol phospholipids, creating 
docking sites for various intracellular signaling proteins. (B) Some 
extracellular signals activate phospholipases that cleave inositol 
phospholipids, generating fragments that help relay the signal into the cell 
(see also Figure 1 5-38). (C) The sites where different classes of 
phospholipases cleave phospholipids. The structure of phosphatidylinositol 
(4,5) diphosphate is shown. Phospholipases C operate in the signaling 
pathways shown in (B). 



In other cases, specific lipid head groups must first be modified to create 
protein-binding sites at a particular time and place. Phosphatidylinositol, for 
instance, is one of the minor phospholipids that are concentrated in the cytoso- 
lic monolayer of cell membranes. Various lipid kinases can add phosphate 
groups at distinct positions in the inositol ring, creating binding sites that recruit 
specific proteins from the cytosol to the membrane. An important example of 
such a lipid kinase is phosphoinositide 3-kinase (PI 3-kinase) ', which is activated 
in response to extracellular signals and helps to recruit specific intracellular sig- 
naling proteins to the cytosolic face of the plasma membrane (Figure 10-17A). 
Similar lipid kinases phosphorylate inositol phospholipids in intracellular mem- 
branes and thereby help to recruit proteins that guide membrane transport. 

Phospholipids in the plasma membrane are used in yet another way to con- 
vert extracellular signals into intracellular ones. The plasma membrane contains 
various phospholipases that are activated by extracellular signals to cleave spe- 
cific phospholipid molecules, generating fragments of these molecules that act 
as short-lived intracellular mediators. Phospholipase C, for example, cleaves an 
inositol phospholipid in the cytosolic monolayer of the plasma membrane to 
generate two fragments, one of which remains in the membrane and helps acti- 
vate protein kinase C, while the other is released into the cytosol and stimulates 
the release of Ca 2+ from the endoplasmic reticulum (Figure 10-17B-C). 

Animals exploit the phospholipid asymmetry of their plasma membranes to 
distinguish between live and dead cells. When animal cells undergo apoptosis (a 
form of programmed cell death, discussed in Chapter 18), phosphatidylserine, 
which is normally confined to the cytosolic monolayer of the plasma membrane 
lipid bilayer, rapidly translocates to the extracellular monolayer. The phos- 
phatidylserine exposed on the cell surface signals neighboring cells, such as 
macrophages, to phagocytose the dead cell and digest it. The translocation of the 
phosphatidylserine in apoptotic cells is thought to occur by two mechanisms: 

1. The phospholipid translocator that normally transports this lipid from the 
noncytosolic monolayer to the cytosolic monolayer is inactivated. 

2. A "scramblase" that transfers phospholipids nonspecifically in both direc- 
tions between the two monolayers is activated. 
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Figure 10-18 Glycolipid molecules. 

(A) Galactocerebroside is called a neutral 
glycolipid because the sugar that forms its 
head group is uncharged. (B) A 
ganglioside always contains one or more 
negatively charged sialic acid residues 
(also called A/-acetylneuraminic acid, or 
NANA), whose structure is shown in (C). 
Whereas in bacteria and plants almost all 
glycolipids are derived from glycerol, as 
are most phospholipids, in animal cells 
almost all glycolipids are based on 
sphingosine, as is the case for 
sphingomyelin (see Figure 10-3). 
Gal = galactose; Glc = glucose, 
GalNAc = ^-acetylgalactosamine; these 
three sugars are uncharged. 



(C) sialic acid (NANA) 



Glycolipids Are Found on the Surface of All Plasma Membranes 

Sugar-containing lipid molecules called glycolipids, found exclusively in the 
noncytosolic monolayer of the lipid bilayer, have the most extreme asymmetry 
in their membrane distribution. In animal cells they are made from sphingosine, 
just like sphingomyelin. These intriguing molecules tend to self-associate, partly 
through hydrogen bonds between their sugars and partly through van der Waals 
forces between their long and straight hydrocarbon chains, and they may pref- 
erentially partition into lipid rafts. The asymmetric distribution of glycolipids in 
the bilayer results from the addition of sugar groups to the lipid molecules in the 
lumen of the Golgi apparatus. Thus, the compartment in which they are manu- 
factured is topologically equivalent to the exterior of the cell (discussed in Chap- 
ter 12). As they are delivered to the plasma membrane, the sugar groups are 
exposed at the cell surface (see Figure 10-16), where they have important roles 
in interactions of the cell with its surroundings. 

Glycolipids probably occur in all animal cell plasma membranes, where they 
generally constitute about 5% of the lipid molecules in the outer monolayer. 
They are also found in some intracellular membranes. The most complex of the 
glycolipids, the gangliosides, contain oligosaccharides with one or more sialic 
acid residues, which give gangliosides a net negative charge (Figure 10-18). The 
most abundant of the more than 40 different gangliosides that have been iden- 
tified are in the plasma membrane of nerve cells, where gangliosides constitute 
5-10% of the total lipid mass; they are also found in much smaller quantities in 
other cell types. 

Hints as to the functions of glycolipids come from their localization. In the 
plasma membrane of epithelial cells, for example, glycolipids are confined to 
the exposed apical surface, where they may help to protect the membrane 
against the harsh conditions frequently found there (such as low pH and high 
concentrations of degradative enzymes). Charged glycolipids, such as ganglio- 
sides, may be important because of their electrical effects: their presence alters 
the electrical field across the membrane and the concentrations of ions — espe- 
cially Ca 2+ — at the membrane surface. Glycolipids are also thought to function 
in cell-recognition processes, in which membrane-bound carbohydrate-bind- 
ing proteins (lectins) bind to the sugar groups on both glycolipids and glycopro- 
teins in the process of cell-cell adhesion (discussed in Chapter 19). Surprisingly, 
however, mutant mice that are deficient in all of their complex gangliosides 
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show no obvious abnormalities, although the males cannot transport testos- 
terone normally in the testes and are consequently sterile. 

Whatever their normal function, some glycolipids provide entry points for 
certain bacterial toxins. The ganglioside G M i (see Figure 10-18), for example, 
acts as a cell-surface receptor for the bacterial toxin that causes the debilitating 
diarrhea of cholera. Cholera toxin binds to and enters only those cells that have 
Gmi on their surface, including intestinal epithelial cells. Its entry into a cell 
leads to a prolonged increase in the concentration of intracellular cyclic AMP 
(discussed in Chapter 15), which in turn causes a large efflux of Na + and water 
into the intestine. 

Summary 

Biological membranes consist of a continuous double layer of lipid molecules in which 
membrane proteins are embedded. This lipid bilayer is fluid, with individual lipid 
molecules able to diffuse rapidly within their own monolayer. The membrane lipid 
molecules are amphiphilic. When placed in water they assemble spontaneously into 
bilayers, which form sealed compartments. 

Cells contain 500-1000 different lipid species. There are three major classes of 
membrane lipids— phospholipids, cholesterol and glycolipids— and hundreds of 
minor classes. The lipid compositions of the inner and outer monolayers are different, 
reflecting the different functions of the two faces of a cell membrane. Different mix- 
tures of lipids are found in the membranes of cells of different types, as well as in the 
various membranes of a single eucaryotic cell Inositol phospholids are a minor class 
of phospholipids, which in the cytosolic leaflet of the plasma membrane lipid bilayer 
play an important part in cell signaling: in response to extracellular signals, specific 
lipid kinases phosphorylate the head groups of these lipids to form docking sites for 
cytosolic signaling proteins, whereas specific phospholipases cleave certain inositol 
phospholipids to generate small intracellular signaling molecules. 

MEMBRANE PROTEINS 

Although the lipid bilayer provides the basic structure of biological membranes, 
the membrane proteins perform most of the membrane's specific tasks and there- 
fore give each type of cell membrane its characteristic functional properties. 
Accordingly, the amounts and types of proteins in a membrane are highly variable. 
In the myelin membrane, which serves mainly as electrical insulation for nerve 
cell axons, less than 25% of the membrane mass is protein. By contrast, in the 
membranes involved in ATP production (such as the internal membranes of mito- 
chondria and chloroplasts), approximately 75% is protein. A typical plasma mem- 
brane is somewhere in between, with protein accounting for about half of its mass. 
Because lipid molecules are small compared with protein molecules, there are 
always many more lipid molecules than protein molecules in cell membranes — 
about 50 lipid molecules for each protein molecule in cell membranes that are 
50% protein by mass. Membrane proteins vary widely in structure and in the way 
they associate with the lipid bilayer, which reflects their diverse functions. 

Membrane Proteins Can Be Associated with the Lipid Bilayer in 
Various Ways 

Figure 10-19 shows the different ways in which membrane proteins can associ- 
ate with the membrane. Many extend through the lipid bilayer, with part of their 
mass on either side (Figure 10-19, examples 1, 2, and 3). Like their lipid neigh- 
bors, these transmembrane proteins are amphiphilic, having hydrophobic and 
hydrophilic regions. Their hydrophobic regions pass through the membrane 
and interact with the hydrophobic tails of the lipid molecules in the interior of 
the bilayer, where they are sequestered away from water. Their hydrophilic 
regions are exposed to water on either side of the membrane. The covalent 
attachment of a fatty acid chain that inserts into the cytosolic monolayer of the 
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lipid bilayer increases the hydrophobicity of some of these transmembrane pro- 
teins (see Figure 10-19, example 1). 

Other membrane proteins are located entirely in the cytosol and are associ- 
ated with the cytosolic monolayer of the lipid bilayer, either by an amphiphilic a 
helix exposed on the surface of the protein (Figure 10-19, example 4) or by one 
or more covalently attached lipid chains (Figure 10-19, example 5). Yet other 
membrane proteins are entirely exposed at the external cell surface, being 
attached to the lipid bilayer only by a covalent linkage (via a specific oligosac- 
charide) to phosphatidylinositol in the outer lipid monolayer of the plasma 
membrane (Figure 10-19, example 6). 

The lipid-linked proteins in example 5 in Figure 10-19 are made as soluble 
proteins in the cytosol and are subsequently anchored to the membrane by the 
covalent attachment of a lipid group. The proteins in example 6, however, are 
made as single-pass transmembrane proteins in the endoplasmic reticulum 
(ER). While still in the ER, the transmembrane segment of the protein is cleaved 
off and a glycosylphosphatidylinositol (GPI) anchor is added, leaving the pro- 
tein bound to the noncytosolic surface of the membrane solely by this anchor 
(discussed in Chapter 12). Transport vesicles eventually deliver the protein to 
the plasma membrane (discussed in Chapter 13). Proteins bound to the plasma 
membrane by a GPI anchor can be readily distinguished by the use of an enzyme 
called phosphatidylinositol-specific phospholipase C. This enzyme cuts these 
proteins free from their anchors, thereby releasing them from the membrane. 

Some membrane proteins do not extend into the hydrophobic interior of 
the lipid bilayer at all; they are instead bound to either face of the membrane by 
noncovalent interactions with other membrane proteins (Figure 10-19, exam- 
ples 7 and 8). Many of the proteins of this type can be released from the mem- 
brane by relatively gentle extraction procedures, such as exposure to solutions of 
very high or low ionic strength or of extreme pH, which interfere with pro- 
tein-protein interactions but leave the lipid bilayer intact; these proteins are 
referred to as peripheral membrane proteins. Transmembrane proteins and 
many proteins held in the bilayer by lipid groups or hydrophobic polypeptide 
regions that insert into the hydrophobic core of the lipid bilayer cannot be 
released in these ways. These proteins are called integral membrane proteins. 

Lipid Anchors Control the Membrane Localization of Some 
Signaling Proteins 

How a membrane protein is associated with the lipid bilayer reflects the func- 
tion of the protein. Only transmembrane proteins can function on both sides of 
the bilayer or transport molecules across it. Cell-surface receptors, for example, 
are transmembrane proteins that bind signal molecules in the extracellular 
space and generate different intracellular signals on the opposite side of the 
plasma membrane. To transfer small hydrophilic molecules across a membrane, 
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Figure 1 0-19 Various ways in which 
membrane proteins associate with the 
lipid bilayer. Most transmembrane 
proteins are thought to extend across the 
bilayer as (1) a single a helix, (2) as 
multiple a helices, or (3) as a rolled-up 
P sheet (a P barrel). Some of these 
"single-pass" and "multipass" proteins 
have a covalently attached fatty acid 
chain inserted in the cytosolic lipid 
monolayer (1). Other membrane proteins 
are exposed at only one side of the 
membrane. (4) Some of these are 
anchored to the cytosolic surface by an 
amphiphilic a helix that partitions into 
the cytosolic monolayer of the lipid 
bilayer through the hydrophobic face of 
the helix. (5) Others are attached to the 
bilayer solely by a covalently attached 
lipid chain — either a fatty acid chain or a 
prenyl group (see Figure 10-20)— in the 
cytosolic monolayer or, (6) via an 
oligosaccharide linker, to phosphatidyl- 
inositol in the noncytosolic monolayer — 
called a GPI anchor. (7, 8) Finally, many 
proteins are attached to the membrane 
only by noncovalent interactions with 
other membrane proteins. The way in 
which the structure in (5) is formed is 
illustrated in Figure 1 0-20, while the way 
in which the GPI anchor shown in (6) is 
formed is illustrated in Figure 12-56. The 
details of how membrane proteins 
become associated with the lipid bilayer 
are discussed in Chapter 12. 
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a membrane transport protein must provide a path for the molecules to cross 
the hydrophobic permeability barrier of the lipid bilayer; the molecular archi- 
tecture of multipass membrane proteins is ideally suited for this task, as we dis- 
cuss in Chapter 11. 

Proteins that function on only one side of the lipid bilayer, by contrast, are 
often associated exclusively with either the lipid monolayer or a protein domain 
on that side. Some intracellular signaling proteins, for example, that are involved 
in converting extracellular signals into intracellular ones are bound to the 
cytosolic half of the plasma membrane by one or more covalently attached lipid 
groups, which can be fatty acid chains or prenyl groups (Figure 10-20). In some 
cases, myristic acid, a saturated 14-carbon fatty acid, is added to the N-terminal 
amino group of the protein during its synthesis on the ribosome. All members of 
the Src family of cytoplasmic protein tyrosine kinases (discussed in Chapter 15) 
are myristoylated in this way. Membrane attachment through a single lipid 
anchor is not very strong, however, and a second lipid group is often added to 
anchor proteins more firmly to a membrane. For most Src kinases, the second 
lipid modification is the attachment of palmitic acid, a saturated 16-carbon fatty 
acid, to a cysteine side chain of the protein. This modification occurs in 
response to an extracellular signal and helps recruit the kinases to the plasma 
membrane. When the signaling pathway is turned off, the palmitic acid is 
removed, allowing the kinase to return to the cytosol. Other intracellular signal- 
ing proteins, such as the Ras family small GTPases (discussed in Chapter 15), use 
a combination of prenyl group and palmitic acid attachment to recruit the pro- 
teins to the plasma membrane. 



In Most Transmembrane Proteins the Polypeptide Chain Crosses 
the Lipid Bilayer in an a-Helical Conformation 

A transmembrane protein always has a unique orientation in the membrane. 
This reflects both the asymmetric manner in which it is inserted into the lipid 
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Figure 1 0-20 Membrane protein attachment by a fatty acid chain or a prenyl group. The covalent attachment of either 
type of lipid can help localize a water-soluble protein to a membrane after its synthesis in the cytosol. (A) A fatty acid chain 
(myristic acid) is attached via an amide linkage to an N-terminal glycine. (B) A fatty acid chain (palmitic acid) is attached via 
a thioester linkage to a cysteine. (C) A prenyl group (either farnesyl or a longer geranylgeranyl group) is attached via a 
thioether linkage to a cysteine residue that is initially located four residues from the protein's C-terminus. After prenylation, 
the terminal three amino acids are cleaved off, and the new C-terminus is methylated before insertion of the anchor into 
the membrane (not shown). The structures of the lipid anchors are shown below: (D) a myristoyl anchor (a 14-carbon 
saturated fatty acid chain), (E) a palmitoyl anchor (a 16-carbon saturated fatty acid chain), and (F) a farnesyl anchor 
(a 15-carbon unsaturated hydrocarbon chain). 
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Figure 1 0-2 1 A segment of a transmembrane polypeptide chain crossing the 
lipid bilayer as an a helix. <GTAG> Only the a-carbon backbone of the 
polypeptide chain is shown, with the hydrophobic amino acids in green and 
yellow. The polypeptide segment shown is part of the bacterial photosynthetic 
reaction center illustrated in Figure 10-34, the structure of which was 
determined by x-ray diffraction. (Based on data from J. Deisenhofer et al., Nature 
318:618-624, 1985, and H. Michel et al., EMBOJ. 5:1149-1158, 1986. All with 
permission from Macmillan Publishers Ltd.) 

bilayer in the ER during its biosynthesis (discussed in Chapter 12) and the differ- 
ent functions of its cytosolic and noncytosolic domains. These domains are sep- 
arated by the membrane-spanning segments of the polypeptide chain, which 
contact the hydrophobic environment of the lipid bilayer and are composed 
largely of amino acids with nonpolar side chains. Because the peptide bonds 
themselves are polar and because water is absent, all peptide bonds in the bilayer 
are driven to form hydrogen bonds with one another. The hydrogen-bonding 
between peptide bonds is maximized if the polypeptide chain forms a regular a 
helix as it crosses the bilayer, and this is how most membrane-spanning seg- 
ments of polypeptide chains traverse the bilayer (Figure 10-21). 

In single-pass transmembrane proteins, the polypeptide chain crosses 
only once (see Figure 10-19, example 1), whereas in multipass transmembrane 
proteins, the polypeptide chain crosses multiple times (see Figure 10-19, exam- 
ple 2) . An alternative way for the peptide bonds in the lipid bilayer to satisfy their 
hydrogen-bonding requirements is for multiple transmembrane strands of a 
polypeptide chain to be arranged as a (3 sheet that is rolled up into a closed bar- 
rel (a so-called p barrel; see Figure 10-19, example 3). This form of multipass 
transmembrane structure is seen in the porin proteins that we discuss later. 

Rapid progress in the x-ray crystallography of membrane proteins has 
enabled us to determine the three-dimensional structure of many of them. The 
structures confirm that it is often possible to predict from the protein's amino 
acid sequence which parts of the polypeptide chain extend across the lipid 
bilayer. Segments containing about 20-30 amino acids with a high degree of 
hydrophobicity are long enough to span a lipid bilayer as an a helix, and they 
can often be identified in hydropathy plots (Figure 10-22). From such plots, it is 
estimated that about 20% of the kind of an organism's proteins are transmem- 
brane proteins, emphasizing their importance. Hydropathy plots cannot iden- 
tify the membrane-spanning segments of a p barrel, as 10 amino acids or fewer 
are sufficient to traverse a lipid bilayer as an extended p strand and only every 
other amino acid side chain is hydrophobic. 

The strong drive to maximize hydrogen-bonding in the absence of water 
means that a polypeptide chain that enters the bilayer is likely to pass entirely 
through it before changing direction, since chain bending requires a loss of reg- 
ular hydrogen-bonding interactions. But multipass membrane proteins can also 
contain regions that fold into the membrane from either side, squeezing into 
spaces between transmembrane a helices without contacting the hydrophobic 
core of the lipid bilayer. Because such regions of the polypeptide chain interact 
only with other polypeptide regions, they do not need to maximize hydrogen- 
bonding; they can therefore have a variety of secondary structures, including 
helices that extend only part way across the lipid bilayer (Figure 10-23). Such 
regions are important for the function of some membrane proteins, including 
the K + and water channels; the regions contribute to the walls of the pores 
traversing the membrane and confer substrate specificity on the channels, as we 
discuss in Chapter 11. These regions cannot be identified in hydropathy plots 
and are only revealed by x-ray crystallography electron diffraction (a technique 
similar to x-ray diffraction but performed on two-dimensional arrays of pro- 
teins), or NMR studies of the protein's three-dimensional structure. 
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Transmembrane a Helices Often Interact with One Another 

The transmembrane a helices of many single-pass membrane proteins do not 
contribute to the folding of the protein domains on either side of the membrane. 
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As a consequence, it is often possible to engineer cells to produce the cytosolic 
or extracellular domains of these proteins as water-soluble protein. This 
approach has been invaluable to study the structure and function of these 
domains, especially of those in transmembrane receptor proteins (discussed in 
Chapter 15) . A transmembrane a helix, even in a single-pass membrane protein, 
however, often does more than just anchor the protein to the lipid bilayer. Many 
single-pass membrane proteins form homodimers, which are held together by 
strong and highly specific interactions between the two transmembrane a 
helices; the sequence of the hydrophobic amino acids of these helices contains 
the information that directs the protein-protein interaction. 

Similarly, the transmembrane a helices in multipass membrane proteins 
occupy specific positions in the folded protein structure that are determined by 
interactions between the neighboring helices. These interactions are crucial for 
the structure and function of the many channels and transporters that move 
molecules across lipid bilayers. In many cases, one can use proteases to cut the 
loops of the polypeptide chain that link the transmembrane a helices on either 
side of the bilayer and the helices stay together and function normally. In some 

Figure 10-23 Two a helices in the aquaporin water channel, each of 
which spans only halfway through the lipid bilayer. In the membrane, the 
protein forms a tetramer of four such two-helix segments, such that the 
colored surface shown here is buried at an interface formed by 
protein-protein interactions. The mechanism by which the channel allows 
the passage of water molecules across the lipid bilayer is discussed in more 
detail in Chapter 11, 



Figure 1 0-22 Using hydropathy plots to 
localize potential ct-helical membrane- 
spanning segments in a polypeptide 
chain. The free energy needed to transfer 
successive segments of a polypeptide 
chain from a nonpolar solvent to water is 
calculated from the amino acid 
composition of each segment using data 
obtained from model compounds. This 
calculation is made for segments of a 
fixed size (usually around 1 0-20 amino 
acids), beginning with each successive 
amino acid in the chain. The "hydropathy 
index" of the segment is plotted on the Y 
axis as a function of its location in the 
chain. A positive value indicates that free 
energy is required for transfer to water 
(i.e., the segment is hydrophobic), and 
the value assigned is an index of the 
amount of energy needed. Peaks in the 
hydropathy index appear at the positions 
of hydrophobic segments in the amino 
acid sequence. (A and B) Two examples of 
membrane proteins discussed later in this 
chapter are shown. Glycophorin (A) has a 
single membrane-spanning a helix and 
one corresponding peak in the 
hydropathy plot. Bacteriorhodopsin (B) 
has seven membrane-spanning a helices 
and seven corresponding peaks in the 
hydropathy plot. (C) The proportion of 
predicted membrane proteins encoded 
by the genomes of £ coll, S. cerevisiae, 
and human. The area shaded in green 
indicates the fraction of proteins that 
contain at least one predicted 
transmembrane helix. The data for 
£ coll and 5. cerevisiae represent the 
whole genome; the data for human 
represent only part of the genome; in 
each case, the area under the curve is 
proportional to the number of genes 
analyzed. (A, adapted from D. Eisenberg, 
Annu. Rev. Biochem. 53:595-624, 1984. 
With permission from Annual Reviews; 
C, adapted from D. Boyd et al., Protein Sci. 
7:201-205, 1998. With permission from 
The Protein Society.) 
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Figure 10-24 Converting a single-chain 
multipass protein into a two-chain 
multipass protein. (A) Proteolytic 
cleavage of one loop to create two 
fragments that stay together and 
function normally. (B) Expression of the 
same two fragments from separate genes 
gives rise to a similar protein that 
functions normally. 
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cases, one can even express engineered genes encoding separate pieces of a mul- 
tipass protein in living cells, and one finds that the separate pieces assemble 
properly to form a functional transmembrane protein (Figure 10-24), emphasiz- 
ing the exquisite specificity with which transmembrane a helices can interact. 

In multipass membrane proteins, neighboring transmembrane helices in 
the folded structure of the protein shield many of the transmembrane helices 
from the membrane lipids. Why, then, are these shielded helices nevertheless 
composed primarily of hydrophobic amino acids? The answer lies in the way in 
which multipass proteins are integrated into the membrane during their biosyn- 
thesis. As we discuss in Chapter 12, transmembrane a helices are inserted into 
the lipid bilayer sequentially by a protein translocator. After leaving the translo- 
cator, each helix is transiently surrounded by lipids in the bilayer, which requires 
that the helix be hydrophobic. It is only as the protein folds up into its final struc- 
ture that contacts are made between adjacent helices and protein-protein con- 
tacts replace some of the protein-lipid contacts (Figure 10-25). 



Some P Barrels Form Large Transmembrane Channels 

Multipass transmembrane proteins that have their transmembrane segments 
arranged as a p barrel rather than as an a helix are comparatively rigid and 
tend to crystallize readily Thus, some of them were among the first multipass 
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Figure 1 0-25 Steps in the folding of a 
multipass transmembrane protein. 

When the newly synthesized 
transmembrane a helices are released 
into the lipid bilayer, they are initially 
surrounded by lipid molecules. As the 
protein folds, contacts between the 
helices displace some of the lipid 
molecules surrounding the helices. 
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membrane protein structures to be determined by x-ray crystallography. The 
number of p strands in a p barrel varies widely, from as few as 8 strands to as 
many as 22 (Figure 10-26). 

p barrel proteins are abundant in the outer membrane of mitochondria, 
chloroplasts, and many bacteria. Some are pore-forming proteins, which create 
water-filled channels that allow selected small hydrophilic molecules to cross 
the lipid bilayer of the bacterial outer membrane. The porins are well-studied 
examples (example 3 in Figure 10-26). The porin barrel is formed from a 16- 
strand, antiparallel P sheet, which is sufficiently large to roll up into a cylindrical 
structure. Polar amino acid side chains line the aqueous channel on the inside, 
while nonpolar side chains project from the outside of the barrel to interact with 
the hydrophobic core of the lipid bilayer. Loops of the polypeptide chain often 
protrude into the lumen of the channel, narrowing it so that only certain solutes 
can pass. Some porins are therefore highly selective: maltoporin, for example, 
preferentially allows maltose and maltose oligomers to cross the outer mem- 
brane of E. coli. 

The FepA protein is a more complex example of a P barrel transport protein 
(example 4 in Figure 10-26). It transports iron ions across the bacterial outer 
membrane. It is constructed from 22 P strands, and a large globular domain 
completely fills the inside of the barrel. Iron ions bind to this domain, which is 
thought to undergo a large conformational change to transfer the iron across the 
membrane. 

Not all P barrel proteins are transport proteins. Some form smaller barrels 
that are completely filled by amino acid side chains that project into the center 
of the barrel. These proteins function as receptors or enzymes (examples 1 and 
2 in Figure 10-26), and the barrel serves as a rigid anchor, which holds the pro- 
tein in the membrane and orients the cytosolic loops that form binding sites for 
specific intracellular molecules. 

Although P barrel proteins have various functions, they are largely restricted 
to bacterial, mitochondrial, and chloroplast outer membranes. Most multipass 
transmembrane proteins in eucaryotic cells and in the bacterial plasma mem- 
brane are constructed from transmembrane a helices. The helices can slide 
against each other, allowing conformational changes in the protein that can 
open and shut ion channels, transport solutes, or transduce extracellular signals 
into intracellular ones. In P barrel proteins, by contrast, hydrogen bonds bind 
each P strand rigidly to its neighbors, making conformational changes within 
the wall of the barrel unlikely. 



Many Membrane Proteins Are Glycosylated 

Most transmembrane proteins in animal cells are glycosylated. As in glycolipids, 
the sugar residues are added in the lumen of the ER and the Golgi apparatus 



Figure 1 0-26 p barrels formed from 
different numbers of p strands. <TGCT> 
(1 ) The £ coli OmpA protein serves as a 
receptor for a bacterial virus. (2) The £ 
coli OMPLA protein is an enzyme (a 
lipase) that hydrolyzes lipid molecules. 
The amino acids that catalyze the 
enzymatic reaction (shown in red) 
protrude from the outside surface of the 
barrel. (3) A porin from the bacterium 
Rhodobacter copsulatus forms a water- 
filled pore across the outer membrane. 
The diameter of the channel is restricted 
by loops (shown in blue) that protrude 
into the channel. (4) The £ coli FepA 
protein transports iron ions. The inside of 
the barrel is completely filled by a 
globular protein domain (shown in blue) 
that contains an iron-binding site (not 
shown). This domain is thought to 
change its conformation to transport the 
bound iron, but the molecular details of 
the changes are not known. 
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Figure 10-27 A single-pass transmembrane protein. Note that the 
polypeptide chain traverses the lipid bilayer as a right-handed a helix and 
that the oligosaccharide chains and disulfide bonds are all on the 
noncytosolic surface of the membrane. The sulfhydryl groups in the 
cytosolic domain of the protein do not normally form disulfide bonds 
because the reducing environment in the cytosol maintains these groups 
in their reduced (-SH) form. 



(discussed in Chapters 12 and 13). For this reason, the oligosaccharide chains 
are always present on the noncytosolic side of the membrane. Another impor- 
tant difference between proteins (or parts of proteins) on the two sides of the 
membrane results from the reducing environment of the cytosol. This environ- 
ment decreases the likelihood that intrachain or interchain disulfide (S-S) 
bonds will form between cysteines on the cytosolic side of membranes. These 
bonds form on the noncytosolic side, where they can help stabilize either the 
folded structure of the polypeptide chain or its association with other polypep- 
tide chains (Figure 10-27). 

Because most plasma membrane proteins are glycosylated, carbohydrates 
extensively coat the surface of all eucaryotic cells. These carbohydrates occur as 
oligosaccharide chains covalently bound to membrane proteins (glycoproteins) 
and lipids (glycolipids). They also occur as the polysaccharide chains of integral 
membrane proteoglycan molecules. Proteoglycans, which consist of long 
polysaccharide chains linked covalently to a protein core, are found mainly out- 
side the cell, as part of the extracellular matrix (discussed in Chapter 19). But, for 
some proteoglycans, the protein core either extends across the lipid bilayer or is 
attached to the bilayer by a glycosylphosphatidylinositol (GPI) anchor. 

The terms cell coat or glycocalyx are sometimes used to describe the carbo- 
hydrate-rich zone on the cell surface. This carbohydrate layer can be visualized 
by various stains, such as ruthenium red (Figure 10-28A), as well as by its affin- 
ity for carbohydrate-binding proteins called lectins, which can be labeled with a 
fluorescent dye or some other visible marker. Although most of the sugar groups 
are attached to intrinsic plasma membrane molecules, the carbohydrate layer 
also contains both glycoproteins and proteoglycans that have been secreted into 
the extracellular space and then adsorbed onto the cell surface (Figure 10-28B). 
Many of these adsorbed macromolecules are components of the extracellular 
matrix, so that the boundary between the plasma membrane and the extracel- 
lular matrix is often not sharply defined. One of the many functions of the car- 
bohydrate layer is to protect cells against mechanical and chemical damage; it 
also keeps various other cells at a distance, preventing unwanted protein-pro- 
tein interactions. 

The oligosaccharide side chains of glycoproteins and glycolipids are enor- 
mously diverse in their arrangement of sugars. Although they usually contain 
fewer than 15 sugars, they are often branched, and the sugars can be bonded 
together by various covalent linkages— unlike the amino acids in a polypeptide 
chain, which are all linked by identical peptide bonds. Even three sugars can be 
put together to form hundreds of different trisaccharides. Both the diversity and 
the exposed position of the oligosaccharides on the cell surface make them 
especially well suited to function in specific cell-recognition processes. As we 
discuss in Chapter 19, plasma membrane-bound lectins that recognize specific 
oligosaccharides on cell-surface glycolipids and glycoproteins mediate a variety 
of transient cell-cell adhesion processes, including those occurring in 
sperm-egg interactions, blood clotting, lymphocyte recirculation, and inflam- 
matory responses. 
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Membrane Proteins Can Be Solubilized and Purified in Detergents 

In general, only agents that disrupt hydrophobic associations and destroy the 
lipid bilayer can solubilize transmembrane proteins (and some other tightly 
bound membrane proteins). The most useful of these for the membrane bio- 
chemist are detergents, which are small amphiphilic molecules of variable 
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Figure 1 0-28 The carbohydrate layer on 
the cell surface. This electron micrograph 
of the surface of a lymphocyte stained 
with ruthenium red emphasizes the thick 
carbohydrate-rich layer surrounding the 
cell. (B) The carbohydrate layer is made 
up of the oligosaccharide side chains of 
glycolipidsand integral membrane 
glycoproteins and the polysaccharide 
chains on integral membrane 
proteoglycans. In addition, adsorbed 
glycoproteins, and adsorbed 
proteoglycans (not shown) contribute to 
the carbohydrate layer in many cells. 
Note that all of the carbohydrate is on 
the noncytosolic surface of the 
membrane. (A, courtesy of Audrey M. 
Glauert and G.M.W. Cook.) 
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structure. Detergents are much more soluble in water than lipids. Their polar 
(hydrophilic) ends can be either charged (ionic), as in sodium dodecyl sulfate 
(SDS), or uncharged (nonionic), as in octylglucoside and Triton (Figure 10-29A). 
At low concentration, detergents are monomeric in solution, but when their 
concentration is increased above a threshold, called the critical micelle concen- 
tration or CMC, they aggregate to form micelles (Figure 10-29B-C). Detergent 
molecules rapidly diffuse in and out of micelles, keeping the concentration of 
monomer in the solution constant, no matter how many micelles are present. 
Both the CMC and the average number of detergent molecules in a micelle are 
characteristic properties of each detergent, but they also depend on the tem- 
perature, pH, and salt concentration. Detergent solutions are therefore complex 
systems and are difficult to study. 

When mixed with membranes, the hydrophobic ends of detergents bind to 
the hydrophobic regions of the membrane proteins, where they displace lipid 
molecules with a collar of detergent molecules. Since the other end of the deter- 
gent molecule is polar, this binding tends to bring the membrane proteins into 
solution as detergent-protein complexes (Figure 10-30). Usually, some lipid 
molecules also remain attached to the protein. 

Strong ionic detergents, such as SDS, can solubilize even the most 
hydrophobic membrane proteins. This allows the proteins to be analyzed by 
SDS polyacrylamide-gel electrophoresis (discussed in Chapter 8), a procedure 
that has revolutionized the study of membrane proteins. Such strong detergents 
unfold (denature) proteins by binding to their internal "hydrophobic cores," 
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Figure 10-29 The structure and function of detergent micelles. <GCGA> (A) Three commonly used detergents are sodium 
dodecyl sulfate (SDS), an anionic detergent, and Triton X-100 and fi-octylglucoside, two nonionic detergents. Triton X-100 is 
a mixture of compounds in which the region in brackets is repeated between 9 and 1 times. The hydrophobic portion of 
each detergent is shown in yellow, and the hydrophilic portion is shown in orange. (B) At low concentration, detergent 
molecules are monomeric in solution. As their concentration is increased beyond the critical micelle concentration (CMC), 
some of the detergent molecules form micelles. Note that the concentration of detergent monomer stays constant above 
the CMC. (C) Because they have both polar and nonpolar ends, detergent molecules are amphiphilic; and because they are 
cone-shaped, they form micelles rather than bilayers (see Figure 1 0-7). Detergent micelles have irregular shapes, and, due 
to packing constraints, the hydrophobic tails are partially exposed to water. The space-filling model shows the structure of 
a micelle composed of 20 pi-octylglucoside molecules, predicted by molecular dynamics calculations (B, adapted from 
G. Gunnarsson, B. Jonsson and H. Wennerstrom, J. Phys. Chem. 84:3114-31 21, 1980; C, from S. Bogusz, R.M. Venable and 
R.W. Pastor, 1 Phys. Chem. B. 1 04:5462-5470, 2000. With permission from the American Chemical Society.) 



thereby rendering the proteins inactive and unusable for functional studies. 
Nonetheless, proteins can be readily separated and purified in their SDS-dena- 
tured form. In some cases, removal of the detergent allows the purified protein 
to renature, with recovery of functional activity. 

Many hydrophobic membrane proteins can be solubilized and then puri- 
fied in an active form by the use of mild detergents. These detergents cover the 
hydrophobic regions on membrane-spanning segments that become exposed 
after lipid removal but do not unfold the protein. If the detergent concentration 
of a solution of solubilized membrane proteins is reduced (by dilution, for 
example), membrane proteins do not remain soluble. In the presence of an 
excess of phospholipid molecules in such a solution, membrane proteins incor- 
porate into small liposomes that form spontaneously. In this way, functionally 
active membrane protein systems can be reconstituted from purified compo- 
nents, providing a powerful means of analyzing the activities of membrane 
transporters, ion channels, signaling receptors, and so on (Figure 10-31). Such 
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Figure 10-30 Solubilizing membrane 
proteins with a mild nonionic 
detergent. The detergent disrupts the 
lipid bilayer and brings the proteins into 
solution as protein-lipid- detergent 
complexes. The phospholipids in the 
membrane are also solubilized by the 
detergent. 
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Figure 10-31 The use of mild nonionic 
detergents for solubilizing, purifying, 
and reconstituting functional 
membrane protein systems. In this 
example, functional Na + -K + pump 
molecules are purified and incorporated 
into phospholipid vesicles. The Na + -K + 
pump is an ion pump that is present in 
the plasma membrane of most animal 
cells; it uses the energy of ATP hydrolysis 
to pump Na + out of the cell and K + in, as 
discussed in Chapter 11. 
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functional reconstitution, for example, provided proof for the hypothesis that 
the transmembrane ATPases use H + gradients in mitochondrial, chloroplast, 
and bacterial membranes to synthesize ATE 

Detergents have also played a crucial part in the purification and crystal- 
lization of membrane proteins. The development of new detergents and new 
expression systems producing large quantities of membrane proteins from 
cDNA clones has led to a rapid increase in the number of structures of mem- 
brane proteins and protein complexes that are known. 



Bacteriorhodopsin Is a Light-Driven Proton Pump That Traverses 
the Lipid Bilayer as Seven a Helices 

In Chapter 11, we consider how multipass transmembrane proteins mediate the 
selective transport of small hydrophilic molecules across cell membranes. But a 
detailed understanding of how a membrane transport protein actually works 
requires precise information about its three-dimensional structure in the bilayer. 
Bacteriorhodopsin was the first membrane transport protein whose structure was 
determined. It has remained the prototype of many multipass membrane pro- 
teins with a similar structure, and it merits a brief digression here. 

The "purple membrane" of the archaean Halobacterium salinarum is a spe- 
cialized patch in the plasma membrane that contains a single species of protein 
molecule, bacteriorhodopsin (Figure 10-32). Each bacteriorhodopsin molecule 
contains a single light-absorbing group, or chromophore (called retinal), which 
gives the protein its purple color. Retinal is vitamin A in its aldehyde form and is 
identical to the chromophore found in rhodopsin of the photoreceptor cells of 
the vertebrate eye (discussed in Chapter 15). Retinal is covalently linked to a 
lysine side chain of the bacteriorhodopsin protein. When activated by a single 
photon of light, the excited chromophore changes its shape and causes a series 
of small conformational changes in the protein, resulting in the transfer of one 
H + from the inside to the outside of the cell (Figure 10-33). In bright light, each 
bacteriorhodopsin molecule can pump several hundred protons per second. 
The light-driven proton transfer establishes an H + gradient across the plasma 
membrane, which in turn drives the production of ATP by a second protein in 
the cell's plasma membrane. The energy stored in the H + gradient also drives 
other energy-requiring processes in the cell. Thus, bacteriorhodopsin converts 
solar energy into a proton gradient, which provides energy to the archaeal cell. 
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Figure 10-32 Patches of purple 
membrane, which contain 
bacteriorhodopsin in the archaean 
Halobacterium salinarum. (A) These 
archaea live in saltwater pools, where 
they are exposed to sunlight. They have 
evolved a variety of light-activated 
proteins, including bacteriorhodopsin, 
which is a light-activated proton pump in 
the plasma membrane. (B)The 
bacteriorhodopsin molecules in the 
purple membrane patches are tightly 
packed into two-dimensional crystalline 
arrays. (C) Details of the molecular 
surface visualized by atomic force 
microscopy. With this technique 
individual bacteriorhodopsin molecules 
can be seen. (D) Outline of the 
approximate locations of three 
bacteriorhodopsin monomers and their 
individual a helices in the images shown 
in (B). (B-D, courtesy of Dieter 
Oesterhelt.) 
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The numerous bacteriorhodopsin molecules in the purple membrane are 
arranged as a planar two-dimensional crystal. The regular packing has made it 
possible to determine the three-dimensional structure and orientation of bacte- 
riorhodopsin in the membrane to moderate resolution (3 A) by an approach that 
uses a combination of electron microscopy and electron diffraction analysis. This 
procedure, known as electron crystallography, is analogous to the study of 
three-dimensional crystals of soluble proteins by x-ray diffraction analysis. It has 
provided the first structural views of many membrane proteins that were found 
to be difficult to crystallize from detergent solutions. For bacteriorhodopsin, the 
structure obtained by electron crystallography was later confirmed and extended 
to very high resolution by x-ray crystallography. Each bacteriorhodopsin 
molecule is folded into seven closely packed a helices (each containing about 25 
amino acids), which pass through the lipid bilayer at slightly different angles. By 
obtaining very well ordered protein crystals and freezing them at very low tem- 
peratures, it has also been possible to determine the structures of some of the 
protein's intermediate conformations during its H + pumping cycle. 

Bacteriorhodopsin is a member of a large superfamily of membrane pro- 
teins with similar structures but different functions. For example, rhodopsin in 
rod cells of the vertebrate retina and many cell-surface receptor proteins that 
bind extracellular signal molecules are also built from seven transmembrane a 
helices. These proteins function as signal transducers rather than as trans- 
porters: each responds to an extracellular signal by activating a GTP-binding 
protein (G protein) inside the cell and are therefore called G-protein- coupled 
receptors [GPCRs), as we discuss in Chapter 15. Although the structures of bac- 
teriorhodopsins and GPCRs are strikingly similar, they show no sequence simi- 
larity and thus probably belong to two evolutionary distant branches of an 
ancient protein family. 

The high-resolution crystal structure of bacteriorhodopsin reveals many 
lipid molecules that are bound in specific places on the protein surface (Figure 
10-33B). Interactions with specific lipids are thought to help stabilize many 
membrane proteins, which work best and crystallize more readily if some of the 
lipids remain bound during detergent extraction, or if specific lipids are added 
back to the proteins in detergent solutions. The specificity of these lipid-protein 
interactions helps explain why eucaryotic membranes contain such a variety of 
lipids, with head groups that differ in size, shape, and charge. We can think of the 
membrane lipids as constituting a two-dimensional solvent for the proteins in 
the membrane, just as water constitutes a three-dimensional solvent for proteins 



Figure 10-33 The three-dimensional 
structure of a bacteriorhodopsin 

molecule. <TTAA> (A) The polypeptide 
chain crosses the lipid bilayer seven times 
as a helices. The location of the retinal 
chromophore (purple) and the probable 
pathway taken by protons during the 
light-activated pumping cycle are shown. 
The first and key step is the passing of a 
H + from the chromophore to the side 
chain of aspartic acid 85 (red, located next 
to the chromophore) that occurs upon 
absorption of a photon by the 
chromophore. Subsequently, other 
H + transfers— in the numerical order 
indicated and utilizing the hydrophilic 
amino acid side chains that line a path 
through the membrane — complete the 
pumping cycle and return the enzyme to 
its starting state. Color code: glutamic acid 
(orange), aspartic acid (red), arginine (blue). 
(B) The high-resolution crystal structure of 
bacteriorhodopsin shows many lipid 
molecules [yellow with redhead groups) 
that are tightly bound to specific places 
on the surface of the protein. (A, adapted 
from H. Luecke et al., Science 286:255-260, 
1 999. With permission from AAAS; 
B, from H. Luecke et aL, J. Mol. Biol 
291:899-911, 1999. With permission from 
Academic Press.) 
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in an aqueous solution. Some membrane proteins can function only in the pres- 
ence of specific lipid head groups, just as many enzymes in aqueous solution 
require a particular ion for activity. 



Membrane Proteins Often Function as Large Complexes 

Many membrane proteins function as part of multicomponent complexes, sev- 
eral of which have been studied by x-ray crystallography. One is a bacterial pho- 
tosynthetic reaction center, which was the first transmembrane protein complex 
to be crystallized and analyzed by x-ray diffraction. The results of this analysis 
were of general importance to membrane biology because they showed for the 
first time how multiple polypeptides associate in a membrane to form a com- 
plex protein machine (Figure 10-34). In Chapter 14, we discuss how such pho- 
tosynthetic complexes function to capture light energy and use it to pump pro- 
tons across the membrane. Many of the membrane protein complexes involved 
in photosynthesis, proton pumping, and electron transport are even larger than 
the photosynthetic reaction center. The enormous photosystem II complex from 
cyanobacteria, for example, contains 19 protein subunits and well over 60 trans- 
membrane helices. Membrane proteins are often arranged in large complexes, 
not only for harvesting various forms of energy, but also for transducing extra- 
cellular signals into intracellular ones (discussed in Chapter 15). 



Many Membrane Proteins Diffuse in the Plane of the Membrane 

Like most membrane lipids, membrane proteins do not tumble (flip-flop) across 
the lipid bilayer, but they do rotate about an axis perpendicular to the plane of 
the bilayer (rotational diffusion). In addition, many membrane proteins are able 
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Figure 10-34 The three-dimensional 
structure of the photosynthetic reaction 
center of the bacterium 
Rhodopseudomonas viridis. <ATCA> 
The structure was determined by x-ray 
diffraction analysis of crystals of this 
transmembrane protein complex. The 
complex consists of four subunits L, M, H ; 
and a cytochrome. The L and M subunits 
form the core of the reaction center, and 
each contains five a helices that span the 
lipid bilayer. The locations of the various 
electron carrier coenzymes are shown in 
black. Note that the coenzymes are 
arranged in the spaces between the 
helices. The special pair of chlorophyll 
molecules (discussed in Chapter 14) is 
shown in turquoise. (Adapted from a 
drawing by J. Richardson based on data 
from J. Deisenhofer et al., Nature 
318:618-624, 1985. With permission from 
Macmillan Publishers Ltd.) 
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Figure 10-35 An experiment 
demonstrating the diffusion of proteins 
in the plasma membrane of 
mouse-human hybrid cells. The mouse 
and human proteins are initially confined 
to their own halves of the newly formed 
heterocaryon plasma membrane, but they 
intermix over time. The two antibodies 
used to visualize the proteins can be 
distinguished in a fluorescence microscope 
because fluorescein is green and 
rhodamine is red. (Based on L.D. Frye and 
M. EdidinJ. CellSci. 7:319-335, 1970. With 
permission from The Company of 
Biologists.) 



to move laterally within the membrane (lateral diffusion). An experiment in 
which mouse cells were artificially fused with human cells to produce hybrid 
cells (heterocaryons) provided the first direct evidence that some plasma mem- 
brane proteins are mobile in the plane of the membrane. Two differently 
labeled antibodies were used to distinguish selected mouse and human plasma 
membrane proteins. Although at first the mouse and human proteins were con- 
fined to their own halves of the newly formed heterocaryon, the two sets of pro- 
teins diffused and mixed over the entire cell surface in about half an hour (Fig- 
ure 10-35). 

The lateral diffusion rates of membrane proteins can be measured by using 
the technique of fluorescence recovery after photobleaching (FRAP). The method 
usually involves marking the membrane protein of interest with a specific fluores- 
cent group. This can be done either with a fluorescent ligand such as a fluo- 
rophore-labeled antibody that binds to the protein or with recombinant DNA 
technology to express the protein fused to green fluorescent protein (GFP) (dis- 
cussed in Chapter 9). The fluorescent group is then bleached in a small area of 
membrane by a laser beam, and the time taken for adjacent membrane proteins 
carrying unbleached ligand or GFP to diffuse into the bleached area is measured 
(Figure 10-36A). A complementary technique is fluorescence loss in photo- 
bleaching (FLIP). Here, a laser beam continuously irradiates a small area of 
membrane to bleach all the fluorescent molecules that diffuse into it, thereby 
gradually depleting the surrounding membrane of fluorescently labeled 
molecules (Figure 10-36B). From such FRAP and FLIP measurements, we can 
calculate the diffusion coefficient for the marked cell- surface protein. The values 
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of the diffusion coefficients for different membrane proteins in different cells are 
highly variable, because interactions with other proteins impede the diffusion of 
the proteins to varying degrees. Measurements of proteins that are minimally 
impeded in this way indicate that cell membranes have a viscosity comparable 
to that of olive oil. 

One drawback to the FRAP and FLIP techniques is that they monitor the 
movement of large populations of molecules in a relatively large area of mem- 
brane; one cannot follow individual protein molecules. If a protein fails to 
migrate into a bleached area, for example, one cannot tell whether the molecule 
is truly immobile or just restricted in its movement to a very small region of 
membrane — perhaps by cytoskeletal proteins. Single-particle tracking tech- 
niques overcome this problem by labeling individual membrane molecules with 
antibodies coupled to fluorescent dyes or tiny gold particles and tracking their 



Figure 10-36 Measuring the rate of lateral 
diffusion of a membrane protein by photo- 
bleaching techniques. <ATGT> A specific 
protein of interest can be expressed as a 
fusion protein with green fluorescent 
protein (GFP), which is intrinsically 
fluorescent. (A) In the FRAP technique, 
fluorescent molecules are bleached in a 
small area using a laser beam. The 
fluorescence intensity recovers as the 
bleached molecules diffuse away and 
unbleached molecules diffuse into the 
irradiated area (shown here in side and top 
views). The diffusion coefficient is calculated 
from a graph of the rate of recovery: the 
greater the diffusion coefficient of the 
membrane protein, the faster the recovery, 
(B) In the FLIP technique, a small area of 
membrane is irradiated continuously, and 
fluorescence is measured in a separate area. 
Fluorescence in the second area 
progressively decreases as fluorescent 
proteins diffuse out and bleached molecules 
diffuse in; eventually, all of the fluorescent 
protein molecules are bleached, as long as 
they are mobile and not permanently 
anchored to the cytoskeleton or 
extracellular matrix. 
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movement by video microscopy. Using single-particle tracking, one can record 
the diffusion path of a single membrane protein molecule over time. Results 
from all of these techniques indicate that plasma membrane proteins differ 
widely in their diffusion characteristics, as we now discuss. 



Figure 10-37 How membrane molecules 
can be restricted to a particular 
membrane domain. In this drawing of an 
epithelial cell, protein A (in the apical 
membrane) and protein B (in the basal 
and lateral membranes) can diffuse 
laterally in their own domains but are 
prevented from entering the other 
domain, at least partly by the specialized 
cell junction called a tight junction. Lipid 
molecules in the outer (noncytosolic) 
monolayer of the plasma membrane are 
likewise unable to diffuse between the 
two domains; lipids in the inner 
(cytosolic) monolayer, however, are able 
to do so (not shown). The basal lamina is 
a thin mat of extracellular matrix that 
separates epithelial sheets from other 
tissues (discussed in Chapter 19). 



Cells Can Confine Proteins and Lipids to Specific Domains Within 
a Membrane 

The recognition that biological membranes are two-dimensional fluids was a 
major advance in understanding membrane structure and function. It has 
become clear, however, that the picture of a membrane as a lipid sea in which all 
proteins float freely is greatly oversimplified. Many cells confine membrane pro- 
teins to specific regions in a continuous lipid bilayer. We have already discussed 
how bacteriorhodopsin molecules in the purple membrane of Halobacterium 
assemble into large two-dimensional crystals, in which individual protein 
molecules are relatively fixed in relationship to one another (see Figure 10-32); 
large aggregates of this kind diffuse very slowly. 

In epithelial cells, such as those that line the gut or the tubules of the kidney, 
certain plasma membrane enzymes and transport proteins are confined to the 
apical surface of the cells, whereas others are confined to the basal and lateral 
surfaces (Figure 10-37). This asymmetric distribution of membrane proteins is 
often essential for the function of the epithelium, as we discuss in Chapters 11 
and 19. The lipid compositions of these two membrane domains are also differ- 
ent, demonstrating that epithelial cells can prevent the diffusion of lipid as well 
as protein molecules between the domains. Experiments with labeled lipids, 
however, suggest that only lipid molecules in the outer monolayer of the mem- 
brane are confined in this way. The barriers set up by a specific type of intercel- 
lular junction (called a tight junction, discussed in Chapter 19) maintain the sep- 
aration of both protein and lipid molecules. Clearly, the membrane proteins that 
form these intercellular junctions cannot be allowed to diffuse laterally in the 
interacting membranes. 

A cell can also create membrane domains without using intercellular junc- 
tions. The mammalian spermatozoon, for instance, is a single cell that consists 
of several structurally and functionally distinct parts covered by a continuous 
plasma membrane. When a sperm cell is examined by immunofluorescence 
microscopy with a variety of antibodies, each of which react with a specific cell- 
surface molecule, the plasma membrane is found to consist of at least three dis- 
tinct domains (Figure 10-38). Some of the membrane molecules are able to dif- 
fuse freely within the confines of their own domain. The molecular nature of the 
"fence" that prevents the molecules from leaving their domain is not known. 
Many other cells have similar membrane fences that confine membrane protein 
diffusion to certain membrane domains. The plasma membrane of nerve cells, 
for example, contains a domain enclosing the cell body and dendrites, and 
another enclosing the axon. In this case, it is thought that a belt of actin fila- 
ments tightly associated with the plasma membrane at the cell-body-axon junc- 
tion forms part of the barrier. 



646 Chapter 10: Membrane Structure 




anterior head 
posterior head 




20 }*m 

Figure 10-39 shows four common ways of immobilizing specific membrane 
proteins through protein-protein interactions. 



The Cortical Cytoskeleton Gives Membranes Mechanical Strength 
and Restricts Membrane Protein Diffusion 

As shown in Figure 10-39B and C, a common way in which a cell restricts the lat- 
eral mobility of specific membrane proteins is to tether them to macromolecu- 
lar assemblies on either side of the membrane. The characteristic biconcave 
shape of a red blood cell (Figure 10-40), for example, results from interactions 
of its plasma membrane proteins with an underlying cytoskeleton, which con- 
sists mainly of a meshwork of the filamentous protein spectrin. Spectrin is a 
long, thin, flexible rod about 100 nm in length. Being the principal component 
of the red cell cytoskeleton, it maintains the structural integrity and shape of the 
plasma membrane, which is the red cell's only membrane, as the cell has no 
nucleus or other organelles. The spectrin cytoskeleton is riveted to the mem- 
brane through various membrane proteins. The final result is a deformable, net- 
like meshwork that covers the entire cytosolic surface of the red cell membrane 
(Figure 10-41). This spectrin-based cytoskeleton enables the red cell to with- 
stand the stress on its membrane as it is forced through narrow capillaries. Mice 
and humans with genetic abnormalities in spectrin are anemic and have red 
cells that are spherical (instead of concave) and fragile; the severity of the ane- 
mia increases with the degree of spectrin deficiency. 

An analogous but much more elaborate and complicated cytoskeletal net- 
work exists beneath the plasma membrane of most other cells in our body This 
network, which constitutes the cortical region (or cortex) of the cytoplasm, is 
rich in actin filaments, which are attached to the plasma membrane in numer- 
ous ways. The cortex of nucleated cells contains proteins that are structurally 
homologous to spectrin and the other components of the red cell cytoskeleton. 
We discuss the cortical cytoskeleton in nucleated cells and its interactions with 
the plasma membrane in Chapter 16. 



Figure 1 0-38 Three domains in the 
plasma membrane of a guinea pig 
sperm. (A) A drawing of a guinea pig 
sperm. In the three pairs of micrographs, 
phase-contrast micrographs are on the 
left, and the same cell is shown with 
cell-surface immunofluorescence staining 
on the right. Different monoclonal 
antibodies selectively label cell-surface 
molecules on (B) the anterior head, 
(C) the posterior head, and (D) the tail. 
(Micrographs courtesy of Selena Carroll 
and Diana Myles.) 
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Figure 10-39 Four ways of restricting the 
lateral mobility of specific plasma 
membrane proteins. (A) The proteins can 
self-assemble into large aggregates (as 
seen for bacteriorhodopsin in the purple 
membrane of Halobacterium); they can be 
tethered by interactions with assemblies 
of macromolecules (B) outside or (C) inside 
the cell; or they can interact with proteins 
on the surface of another cell (D). 
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The cortical cytoskeletal network underlying the plasma membrane restricts 
diffusion of not only the proteins that are directly anchored to it. Because the 
cytoskeletal filaments are often closely apposed to the cytosolic membrane sur- 
face, they can form mechanical barriers that obstruct the free diffusion of mem- 
brane proteins. These barriers partition the membrane into small domains, or cor- 
rals (Figure 10-42), which can be either permanent, as in the sperm (see Figure 
10-38) , or transient. The barriers can be detected when the diffusion of individual 
membrane proteins is followed by high-speed, single -particle tracking. The pro- 
teins diffuse rapidly but are confined within an individual corral; occasionally 



Figure 10-40 A scanning electron 
micrograph of human red blood cells. 

<GTAC> The cells have a biconcave shape 
and lack a nucleus and other organelles. 
(Courtesy of Bernadette Chailley.) 
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Figure 1 0-41 The spectrin-based cytoskeleton on the cytosolic side of the human red 
blood cell plasma membrane. (A) The arrangement shown in the drawing has been 
deduced mainly from studies on the interactions of purified proteins in vitro. Spectrin 
dimers (enlarged in the box on the right) are linked together into a netlike meshwork by 
"junctional complexes" (enlarged in the box on the left). Each spectrin heterodimer 
consists of two antiparallel, loosely intertwined, flexible polypeptide chains called a and 
P. The two chains are attached noncovalently to each other at multiple points, including 
at both ends. Both the a and pi chains are composed largely of repeating domains. The 
junctional complexes are composed of short actin filaments (containing 1 3 actin 
monomers), band 4. 1, adducin, and a tropomyosin molecule that probably determines 
the length of the actin filaments. The cytoskeleton is linked to the membrane through 
two transmembrane proteins— a multipass protein called band 3 and a single-pass 
protein called glycophorin. The spectrin tetramers bind to some band 3 proteins via 
ankyrin molecules, and to glycophorin and band 3 (not shown) via band 4.1 proteins. 
(B) The electron micrograph shows the cytoskeleton on the cytosolic side of a red blood 
cell membrane after fixation and negative staining. The spectrin meshwork has been 
purposely stretched out to allow the details of its structure to be seen. In a normal cell, 
the meshwork shown would be much more crowded and occupy only about one-tenth 
of this area. (B, courtesy of T. Byers and D. Branton, Proc Natl Acad. Sci. U.S.A. 
82:61 53-61 57, 1 985. With permission from National Academy of Sciences.) 
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however, thermal motions cause a few cortical filaments to detach transiently 
from the membrane, allowing the protein to escape into an adjacent corral. 

The extent to which a transmembrane protein is confined within a corral 
depends on its association with other proteins and the size of its cytoplasmic 
domain; proteins with a large cytusolic domain will have a harder time passing 
through barriers. When a cell- surface receptor binds its extracellular signal 
molecules, for example, large protein complexes build up on the cytosnlk: 
domain of the receptor, making it more difficult for the receptor to escape from 
its corral. It is thought that corraling helps concentrate activated signaling com- 
plexes, increasing the speed and efficiency of the signaling process (discussed in 
Chapter ]5). 



Figure 10-42 Corraling of membrane 
proteins by cortical cytoskeletal 
filaments. (A) How cytoskeletal filaments 
are thought to provide diffusion barriers 
that divide the membrane into small 
domains, or corrals. (B) High-speed, 
single-particle tracking was used to 
follow the paths of a fluorescently- 
labeled membrane protein over time. The 
trace shows that membrane proteins 
diffuse within a tightly delimited 
membrane domain (shown by different 
colors of the trace) and only infrequently 
escape into a neighboring domain. 
(Adapted from A. Kusumi et at, Annu. Rev. 
Biophys. Biomol. Struct. 34:351-378, 2005. 
With permission from Annual Reviews.) 



Summary 

Whereas the lipid bilayer determines the basic structure of biological membranes, pro- 
teins are responsible for most membrane functions, serving as specific receptors, 
enzymes, transport proteins, and so on. Many membrane proteins extend across the 
lipid bilayer. Some of these transmembrane proteins are single-pass proteins, in which 
the polypeptide chain crosses the bilayer as a single a helix. Others are multipass pro- 
teins, in which the polypeptide chain crosses the bilayer multiple times— either as a 
series of a helices or as a p sheet in the form of a closed barrel All proteins responsible 
for the transmembrane transport of ions and other small water-soluble molecules are 
multipass proteins. Some membrane- associated proteins do not span the bilayer but 
instead are attached to either side of the membrane. Many of these are bound by non- 
covalent interactions with transmembrane proteins, but others are bound via cova- 
lently attached lipid groups. In the plasma membrane of all eucaryotic cells, most of 
the proteins exposed on the cell surface and some of the lipid molecules in the outer 
lipid monolayer have oligosaccharide chains covalently attached to them. Like the 
lipid molecules in the bilayer, many membrane proteins are able to diffuse rapidly in 
the plane of the membrane. However, cells have ways of immobilizing specific mem- 
brane proteins, as well as ways of confining both membrane protein and lipid 
molecules to particular domains in a continuous lipid bilayer. 



PROBLEMS 

Which statements are true? Explain why or why not. 

10-1 Although lipid molecules are free to diffuse in the 
plane of the bilayer, they cannot flip-flop across the bilayer 
unless enzyme catalysts called phospholipid translocators 
are present in the membrane. 

10-2 Whereas all the carbohydrate in the plasma mem- 
brane faces outward on the external surface of the cell, all the 
carbohydrate on internal membranes faces toward the 
cytosol. 



10-3 Although membrane domains with different protein 
compositions are well known, there are at present no exam- 
ples of membrane domains that differ in lipid composition. 

Discuss the following problems. 

1 0-4 When a lipid bilayer is torn, why does it not seal itself 
by forming a "hemi-micelle" cap at the edges, as shown in 
Figure Q10-1? 

10-5 Margarine is made from vegetable oil by a chemical 
process. Do you suppose this process converts saturated fatty 
acids to unsaturated ones, or vice versa? Explain vour answer. 



END-OF-CHAPTER PROBLEMS 
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Figure Q10-1 A torn lipid bilayer sealed with a hypothetical "hemi- 
micelle" cap (Problem 10-4). 

1 0-6 If a lipid raft is typically 70 nm in diameter and each 
lipid molecule has a diameter of 0.5 nm, about how many 
lipid molecules would there be in a lipid raft composed 
entirely of lipid? At a ratio of 50 lipid molecules per protein 
molecule (50% protein by mass) how many proteins would 
be in a typical raft? (Neglect the loss of lipid from the raft 
that would be required to accommodate the protein.) 

10-7 A classic paper studied the behavior of lipids in the 
two monolayers of a membrane by labeling individual 
molecules with nitroxide groups, which are stable free radi- 
cals (Figure Q10-2). These spin-labeled lipids can be 
detected by electron spin-resonance (ESR) spectroscopy, a 
technique that does not harm living cells. Spin-labeled lipids 
are introduced into small lipid vesicles, which are then fused 
with cells, thereby transferring the labeled lipids into the 
plasma membrane. 

The two spin-labeled phospholipids shown in Figure 
Q10-2 were incorporated into intact human red cell mem- 
branes in this way To determine whether they were intro- 
duced equally into the two monolayers of the bilayer, ascor- 
bic acid (vitamin C), which is a water-soluble reducing agent 
that does not cross membranes, was added to the medium 
to destroy any nitroxide radicals exposed on the outside of 
the cell. The ESR signal was followed as a function of time in 
the presence and absence of ascorbic acid as indicated in 
Figure Q10-3A and B. 

A. Ignoring for the moment the difference in extent of loss 
of ESR signal, offer an explanation for why phospholipid 1 
(Figure Q10-3A) reacts faster with ascorbate than does 
phospholipid 2 (Figure Q10-3B). Note that phospholipid 1 
reaches a plateau in about 15 minutes, whereas phospho- 
lipid 2 takes almost an hour. 

B. To investigate the difference 
in extent of loss of ESR signal with 
the two phospholipids, the exper- 
iments were repeated using red J#ft \^h nitr0Xlde 
cell ghosts that had been resealed 
to make them impermeable to 
ascorbate (Figure Q10-3C and D). 
Resealed red cell ghosts are miss- 
ing all of their cytoplasm but have 
an intact plasma membrane. In 
these experiments the loss of ESR 

signal for both phospholipids was phospholipid i 

negligible in the absence of ascor- 
bate and reached a plateau at 50% 
in the presence of ascorbate. What 

Figure Q1 0-2 Structures of two 
nitroxide-labeled lipids (Problem 
10-7). The nitroxide radical is shown at 
the top, and its position of attachment 
to the phospholipids is shown below. 




do you suppose might account for the difference in extent of 

loss of ESR signal in experiments with red cell ghosts (Figure 

Q10-3C and D) versus those with normal red cells (Figure 

Q10-3AandB). 

C Were the spin-labeled phospholipids introduced 

equally into the two monolayers of the red cell membrane? 

10-8 Monomeric single-pass transmembrane proteins 
span a membrane with a single a helix that has characteris- 
tic chemical properties in the region of the bilayer. Which of 
the three 20-amino acid sequences listed below is the most 
likely candidate for such a transmembrane segment? 
Explain the reasons for your choice. (See back of book for 
one-letter amino acid code; FAMILY VW is a convenient 
mnemonic for hydrophobic amino acids.) 

A. ITLIYFGVMAGVIGTILLIS 

B. ITPIYFGPMAGVIGTPLLIS 

C. ITEIYFGRMAGVIGTDLLIS 

1 0-9 You are studying the binding of proteins to the cyto- 
plasmic face of cultured neuroblastoma cells and have 
found a method that gives a good yield of inside-out vesicles 
from the plasma membrane. Unfortunately, your prepara- 
tions are contaminated with variable amounts of right-side- 
out vesicles. Nothing you have tried avoids this problem. A 
friend suggests that you pass your vesicles over an affinity 
column made of lectin coupled to solid beads. What is the 
point of your friend's suggestion? 

10-10 Glycophorin, a protein in the plasma membrane of 
the red blood cell, normally exists as a homodimer that is 
held together entirely by interactions between its trans- 
membrane domains. Since transmembrane domains are 
hydrophobic, how is it that they can associate with one 
another so specifically? 
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Figure Q10-3 Decrease in ESR signal intensity as a function of time 
in intact red cells and red cell ghosts in the presence and absence of 
ascorbate (Problem 10-7). (A and B) Phospholipid 1 and 
phospholipid 2 in intact red cells. (C and D) Phospholipid 1 and 
phospholipid 2 in red cell ghosts. 
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Chapter 1 1 



Membrane Transport of Small 
Molecules and the Electrical 
Properties of Membranes 




Because of its hydrophobic interior, the lipid bilayer of cell membranes prevents 
the passage of most polar molecules. This barrier function allows the cell to 
maintain concentrations of solutes in its cytosol that differ from those in the 
extracellular fluid and in each of the intracellular membrane-enclosed compart- 
ments. To benefit from this barrier, however, cells have had to evolve ways of 
transferring specific water-soluble molecules and ions across their membranes 
in order to ingest essential nutrients, excrete metabolic waste products, and reg- 
ulate intracellular ion concentrations. Cells use specialized transmembrane pro- 
teins to transport inorganic ions and small water-soluble organic molecules 
across the lipid bilayer. Cells can also transfer macromolecules and even large 
particles across their membranes, but the mechanisms involved in most of these 
cases differ from those used for transferring small molecules, and they are dis- 
cussed in Chapters 12 and 13. The importance of membrane transport is 
reflected in the large number of genes in all organisms that code for transport 
proteins, which make up 15-30% of the membrane proteins in all cells. Some 
specialized mammalian cells devote up to two-thirds of their total metabolic 
energy consumption to membrane transport processes. 

We begin this chapter by describing some general principles of how small 
water-soluble molecules traverse cell membranes. We then consider, in turn, 
the two main classes of membrane proteins that mediate this traffic of 
molecules back and forth across lipid bilayers: transporters, which have moving 
parts to transport specific molecules across membranes, and channels, which 
form a narrow hydrophilic pore, allowing passive transmembrane movement, 
primarily of small inorganic ions. Transporters can be coupled to a source of 
energy to catalyze active transport, and a combination of selective passive per- 
meability and active transport creates large differences in the composition of 
the cytosol compared with that of either the extracellular fluid (Table 11-1) or 
the fluid within membrane-enclosed organelles. By generating ionic concen- 
tration differences across the lipid bilayer, cell membranes can store potential 
energy in the form of electrochemical gradients, which drive various transport 
processes, convey electrical signals in electrically excitable cells, and (in mito- 
chondria, chloroplasts, and bacteria) make most of the cell's ATP. We focus our 
discussion mainly on transport across the plasma membrane, but similar 
mechanisms operate across the other membranes of the eucaryotic cell, as dis- 
cussed in later chapters. 

In the last part of the chapter, we concentrate mainly on the functions of ion 
channels in neurons (nerve cells). In these cells, channel proteins perform at 
their highest level of sophistication, enabling networks of neurons to carry out 
all the human brain's astonishing feats. 
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PRINCIPLES OF MEMBRANETRANSPORT 

We begin this section by describing the permeability properties of protein-free, 
synthetic lipid bilayers. We then introduce some of the terms used to describe 
the various forms of membrane transport and some strategies for characterizing 
the proteins and processes involved. 
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Table 1 1 -1 A Comparison of Ion Concentrations Inside and Outside a Typical 
Mammalian Cell 



COMPONENT 


INTRACELLULAR 


EXTRACELLULAR 




CONCENTRATION (mM) 


CONCENTRATION (mM) 


Cations 




145 


Na + 


5-15 


K+ 


140 


5 


Mg 2+ 


0.5 


1-2 


Ca 2+ 


10" 4 


1-2 


H+ 


7x10- 5 (10" 72 MorpH7.2} 


4x 1(r 5 (1Cr 7 - 4 MorpH7.4) 


Anions* 


ci- 


5-15 


110 



*The cell must contain equal quantities of positive and negative charges (that is, it must be 
electrically neutral). Thus, in addition to CI", the cell contains many other anions not listed in this 
table; in fact, most cell constituents are negatively charged (HCtV, P0 4 3 ~, proteins, nucleic acids, 
metabolites carrying phosphate and carboxyl groups, etc). The concentrations of Ca 2+ and Mg 2 ^ 
given are for the free ions There is a total of about 20 mM Mg 2+ and 1 -2 mM Ca 24 in cells, but 
both are mostly bound to proteins and other substances and, for Ca 2f , stored within various 
organelles, 



Protein-Free Lipid Bilayers Are Highly Impermeable to Ions 

Given enough time, virtually any molecule will diffuse across a protein-free 
lipid bilayer down its concentration gradient. The rate of diffusion, however, 
varies enormously, depending partly on the size of the molecule but mostly on 
its relative solubility in oil. In general, the smaller the molecule and the more 
soluble it is in oil (the more hydrophobic, or nonpolar, it is), the more rapidly it 
will diffuse across a lipid bilayer. Small nonpolar molecules, such as O2 and 
CO2, readily dissolve in lipid bilayers and therefore diffuse rapidly across them. 
Small uncharged polar molecules, such as water or urea, also diffuse across a 
bilayer, albeit much more slowly (Figure 11-1). By contrast, lipid bilayers are 
highly impermeable to charged molecules (ions), no matter how small: the 
charge and high degree of hydration of such molecules prevents them from 
entering the hydrocarbon phase of the bilayer. Thus, synthetic lipid bilayers are 
10 9 times more permeable to water than to even such small ions as Na + or K + 
(Figure 11-2). 



There Are Two Main Classes of Membrane Transport Proteins: 
Transporters and Channels 

Like synthetic lipid bilayers, cell membranes allow water and nonpolar 
molecules to permeate by simple diffusion. Cell membranes, however, also have 
to allow the passage of various polar molecules, such as ions, sugars, amino 
acids, nucleotides, and many cell metabolites that cross synthetic lipid bilayers 
only very slowly. Special membrane transport proteins transfer such solutes 
across cell membranes. These proteins occur in many forms and in all types of 
biological membranes. Each protein transports a particular class of molecule 
(such as ions, sugars, or amino acids) and often only certain molecular species 
of the class. Studies in the 1950s found that bacteria with a single-gene mutation 
were unable to transport sugars across their plasma membrane, thereby demon- 
strating the specificity of membrane transport proteins. We now know that 
humans with similar mutations suffer from various inherited diseases that hin- 
der the transport of a specific solute in the kidney, intestine, or other cell type. 
Individuals with the inherited disease cystinuria, for example, cannot transport 
certain amino acids (including cystine, the disulfide-linked dimer of cysteine) 
from either the urine or the intestine into the blood; the resulting accumulation 
of cystine in the urine leads to the formation of cystine stones in the kidneys. 
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Figure 1 1 -1 The relative permeability of 
a synthetic lipid bilayer to different 
classes of molecules. The smaller the 
molecule and, more importantly, the less 
strongly it associates with water, the 
more rapidly the molecule diffuses across 
the bilayer. 
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Figure 11-2 Permeability coefficients for the passage of various 
molecules through synthetic lipid bilayers. The rate of flow of a solute 
across the bilayer is directly proportional to the difference in its 
concentration on the two sides of the membrane. Multiplying this 
concentration difference (in mol/cm 3 ) by the permeability coefficient 
(in cm/sec) gives the flow of solute in moles per second per square 
centimeter of bilayer. A concentration difference of tryptophan of 
1 -4 mol/cm 3 (1 _4 /1 (T 3 L = 0.1 M), for example, would cause a flow of 
1CT 4 mol/cm 3 x 1CT 7 cm/sec = 1CT 11 mol/sec through 1 cm 2 of bilayer, or 
6 x 10 4 molecules/sec through 1 (im 2 of bilayer. 



All membrane transport proteins that have been studied in detail have been 
found to be multipass transmembrane proteins — that is, their polypeptide 
chains traverse the lipid bilayer multiple times. By forming a continuous protein 
pathway across the membrane, these proteins enable specific hydrophilic 
solutes to cross the membrane without coming into direct contact with the 
hydrophobic interior of the lipid bilayer. 

Transporters and channels are the two major classes of membrane transport 
proteins (Figure 1 1-3). Transporters (also called carriers, or permeases) bind the 
specific solute to be transported and undergo a series of conformational 
changes to transfer the bound solute across the membrane. Channels, in con- 
trast, interact with the solute to be transported much more weakly. They form 
aqueous pores that extend across the lipid bilayer; when open, these pores allow 
specific solutes (usually inorganic ions of appropriate size and charge) to pass 
through them and thereby cross the membrane. Not surprisingly, transport 
through channels occurs at a much faster rate than transport mediated by trans- 
porters. Although water can diffuse across synthetic lipid bilayers, all cells con- 
tain specific channel proteins (called water channels, or aquaporins) that greatly 
increase the permeability of these membranes to water, as we discuss later. 
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Active Transport Is Mediated by Transporters Coupled to an 
Energy Source 

All channels and many transporters allow solutes to cross the membrane only 
passively ("downhill"), a process called passive transport, or facilitated diffu- 
sion. In the case of transport of a single uncharged molecule, the difference in 
the concentration on the two sides of the membrane — its concentration gradi- 
ent—drives passive transport and determines its direction (Figure 11-4A). 

If the solute carries a net charge, however, both its concentration gradient 
and the electrical potential difference across the membrane, the membrane 
potential influence its transport. The concentration gradient and the electrical 
gradient combine to form a net driving force, the electrochemical gradient, for 
each charged solute (Figure 11-4B). We discuss electrochemical gradients in 
more detail in Chapter 14. In fact, almost all plasma membranes have an elec- 
trical potential difference (voltage gradient) across them, with the inside usu- 
ally negative with respect to the outside. This potential difference favors the 
entry of positively charged ions into the cell but opposes the entry of negatively 
charged ions. 



Figure 1 1 -3 Transporters and channel 
proteins. (A) A transporter alternates 
between two conformations, so that the 
solute-binding site is sequentially 
accessible on one side of the bilayer and 
then on the other. (B) In contrast, a 
channel protein forms a water-filled pore 
across the bilayer through which specific 
solutes can diffuse. 
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Cells also require transport proteins that will actively pump certain solutes 
across the membrane against their electrochemical gradients ("uphill"); this 
process, known as active transport, is mediated by transporters, which are also 
called pumps. In active transport, the pumping activity of the transporter is 
directional because it is tightly coupled to a source of metabolic energy, such as 
ATP hydrolysis or an ion gradient, as discussed later. Thus, transmembrane 
movement of small molecules mediated by transporters can be either active or 
passive, whereas that mediated by channels is always passive. 

Summary 

Lipid bilayers are highly impermeable to most polar molecules. To transport small 
water-soluble molecules into or out of cells or intracellular membrane- enclosed com- 
partments, cell membranes contain various membrane transport proteins, each of 
which is responsible for transferring a particular solute or class of solutes across the 
membrane. There are two classes of membrane transport proteins— transporters and 
channels. Both form continuous protein pathways across the lipid bilayer. Whereas 
transmembrane movement mediated by transporters can be either active or passive, 
solute flow through channel proteins is always passive. 



Figure 11-4 Passive and active 
transport compared. (A) Passive 
transport down an electrochemical 
gradient occurs spontaneously, either by 
simple diffusion through the lipid bilayer 
or by facilitated diffusion through 
channels and passive transporters. By 
contrast, active transport requires an 
input of metabolic energy and is always 
mediated by transporters that harvest 
metabolic energy to pump the solute 
against its electrochemical gradient. 
(B) An electrochemical gradient 
combines the membrane potential and 
the concentration gradient; they can 
work additively to increase the driving 
force on an ion across the membrane 
(middle) or can work against each 
other (right). 



TRANSPORTERS AND ACTIVE MEMBRANE 
TRANSPORT 

The process by which a transporter transfers a solute molecule across the lipid 
bilayer resembles an enzyme-substrate reaction, and in many ways transporters 
behave like enzymes. In contrast to ordinary enzyme-substrate reactions, how- 
ever, the transporter does not modify the transported solute but instead delivers 
it unchanged to the other side of the membrane. 

Each type of transporter has one or more specific binding sites for its solute 
(substrate). It transfers the solute across the lipid bilayer by undergoing 
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reversible conformational changes that alternately expose the solute-binding 
site first on one side of the membrane and then on the other. Figure 11-5 shows 
a schematic model of how a transporter operates. When the transporter is satu- 
rated (that is, when all solute-binding sites are occupied), the rate of transport is 
maximal. This rate, referred to as V max {V for velocity), is characteristic of the 
specific carrier. Vmax measures the rate with which the carrier can flip between 
its two conformational states. In addition, each transporter has a characteristic 
affinity for its solute, reflected in the K m of the reaction, which is equal to the 
concentration of solute when the transport rate is half its maximum value (Fig- 
ure 1 1-6) . As with enzymes, the binding of solute can be blocked specifically by 
either competitive inhibitors (which compete for the same binding site and may 
or may not be transported) or noncompetitive inhibitors (which bind elsewhere 
and specifically alter the structure of the transporter). 

As we discuss below, it requires only a relatively minor modification of the 
model shown in Figure 11-5 to link a transporter to a source of energy in order 
to pump a solute uphill against its electrochemical gradient. Cells carry out such 
active transport in three main ways (Figure 11-7): 

1. Coupled transporters couple the uphill transport of one solute across the 
membrane to the downhill transport of another. 

2. ATP-driven pumps couple uphill transport to the hydrolysis of ATP. 

3. Light-driven pumps, which are found mainly in bacteria and archaea, cou- 
ple uphill transport to an input of energy from light, as with bacterio- 
rhodopsin (discussed in Chapter 10). 

Amino acid sequence comparisons suggest that, in many cases, there are 
strong similarities in molecular design between transporters that mediate active 
transport and those that mediate passive transport. Some bacterial transporters, 
for example that use the energy stored in the H + gradient across the plasma 
membrane to drive the active uptake of various sugars are structurally similar to 
the transporters that mediate passive glucose transport into most animal cells. 
This suggests an evolutionary relationship between various transporters. Given 
the importance of small metabolites and sugars as energy sources, it is not sur- 
prising that the superfamily of transporters is an ancient one. 

We begin our discussion of active transport by considering transporters that 
are driven by ion gradients. These proteins have a crucial role in the transport of 
small metabolites across membranes in all cells. We then discuss ATP-driven 
pumps, including the Na + pump that is found in the plasma membrane of 
almost all cells. 



Figure 1 1 -6 The kinetics of simple diffusion and transporter-mediated 
diffusion. Whereas the rate of simple diffusion is always proportional to 
the solute concentration, the rate of transporter-mediated diffusion 
reaches a maximum (V max ) when the transporter is saturated. The solute 
concentration when transport is at half its maximal value approximates the 
binding constant (K m ) of the transporter for the solute and is analogous to 
the K m of an enzyme for its substrate. The graph applies to a transporter 
moving a single solute; the kinetics of coupled transport of two or more 
solutes is more complex. 



Figure 11-5 A model of how a 
conformational change in a transporter 
could mediate the passive movement of 
a solute. The transporter shown can exist 
in two conformational states: in state A, 
the binding sites for solute are exposed on 
the outside of the lipid bilayer; in state B, 
the same sites are exposed on the other 
side of the bilayer. The transition between 
the two states can occur randomly. It is 
completely reversible and does not 
depend on whether the solute binding site 
is occupied. Therefore, if the solute 
concentration is higher on the outside of 
the bilayer, more solute binds to the 
transporter in the A conformation than in 
the B conformation, and there is a net 
transport of solute down its concentration 
gradient (or, if the solute is an ion, down 
its electrochemical gradient). 
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Figure 11 -7 Three ways of driving active 
transport. The actively transported 
molecule is shown in yellow, and the 
energy source is shown in red. 



Active Transport Can Be Driven by Ion Gradients 

Some transporters simply mediate the movement of a single solute from one 
side of the membrane to the other at a rate determined hy their 1W and K m ; 
they are called uniporters. Others function as coupled transporters* in which the 
transfer of one solute strictly depends on the transport of a second. Coupled 
transport involves either the simultaneous transfer of a second solute in the 
same direction, performed by symporters (also called co-transporters), or the 
transfer of a second solute in the opposite direction, performed by antiporters 
(also called exchangers) (Figure 11-8). 

The tight coupling between the transfer of two solutes allows these coupled 
transporters to harvest the energy stored in the electrochemical gradient of one 
solute, typically an ion. to transport the other, in this way the free energy 
released during the movement of an inorganic ion down an electrochemical gra- 
dient is used as the driving force to pump other solutes uphill, against their elec- 
trochemical gradient, This principle can work in either direction; some coupled 
transporters function as syraponers, others as antiporters, in the plasma mem- 
brane of animal cells, Na + is the usual co-transported ion, the electrochemical 
gradient of which provides a large driving force for the active transport of a sec- 
ond molecule. The Na + that enters the cell during transport is subsequently 
pumped out by an ATP-driven Na + pump in the plasma membrane (as we dis- 
cuss later), which, by maintaining the Na + gradient, indirectly drives the trans- 
port. (For this reason ion-driven carriers are said to mediate secondary active 
transport, whereas ATP-driven carriers are said to mediate primary active trans- 
port) 

Intestinal and kidney epithelial cells, for example, contain a variety of sym- 
porters that are driven by the Na + gradient across the plasma membrane. Each 
Na + -driven symporter is specific for importing a small group of related sugars or 
amino acids into the cell, and the solute and Na + bind to different sites on the 
transporter. Because the Na + tends to move into the cell down its electrochemi- 
cal gradient, the sugar or amino acid is, in a sense, "dragged" into the cell with it. 
The greater the electrochemical gradient for Na + , the greater the rate of solute 
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Figure 11-8 Three types of transporter- 
mediated movement. <ACCC> This 
schematic diagram shows transporters 
functioning as uniporters, symporters, 
and antiporters. 
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Figure 11-9 One way in which a glucose transporter can be driven by a Na + gradient. As in the model 
shown in Figure 1 1 -5, the transporter oscillates between two alternate states, A and B. In the A state, the 
protein is open to the extracellular space; in the B state, it is open to the cytosol. Binding of Na + and 
glucose is cooperative— that is, the binding of either ligand induces a conformational change that 
increases the protein's affinity for the other ligand. Since the Na + concentration is much higher in the 
extracellular space than in the cytosol, glucose is more likely to bind to the transporter in the A state. 
Therefore, both Na + and glucose enter the cell (via an A -> B transition) much more often than they leave it 
(via aB^A transition). The overall result is the net transport of both Na + and glucose into the cell. Note 
that, because the binding is cooperative, if one of the two solutes is missing, the other fails to bind to the 
transporter. Thus, the transporter undergoes a conformational switch between the two states only if both 
solutes or neither are bound. 



entry; conversely, if the Na + concentration in the extracellular fluid is reduced, 
solute transport decreases (Figure 11-9). 

In bacteria and yeasts, as well as in many membrane-enclosed organelles of 
animal cells, most active transport systems driven by ion gradients depend on 
H + rather than Na + gradients, reflecting the predominance of H + pumps and the 
virtual absence of Na + pumps in these membranes. The electrochemical H + gra- 
dient drives the active transport of many sugars and amino acids across the 
plasma membrane and into bacterial cells. One well-studied H + -driven sym- 
porter is lactose permease, which transports lactose across the plasma mem- 
brane of E. coll Structural and biophysical studies of the permease, as well as 
extensive analyses of mutant forms of the protein, have led to a detailed model 
of how the symporter works. The permease consists of 12 loosely packed trans- 
membrane a helices. During the transport cycle, some of the helices undergo 
sliding motions that cause them to tilt. These motions alternately open and 
close a crevice between the helices, exposing the binding sites for lactose and 
H + , first on one side of the membrane and then on the other (Figure 11-10). 



Transporters in the Plasma Membrane Regulate Cytosolic pH 

Most proteins operate optimally at a particular pH. Lysosomal enzymes, for 
example, function best at the low pH (-5) found in lysosomes, whereas cytoso- 
lic enzymes function best at the close to neutral pH (-7.2) found in the cytosol. 
It is therefore crucial that cells control the pH of their intracellular compart- 
ments. 

Most cells have one or more types of Na + -driven antiporters in their plasma 
membrane that help to maintain the cytosolic pH at about 7.2. These trans- 
porters use the energy stored in the Na + gradient to pump out excess H + , which 
either leaks in or is produced in the cell by acid-forming reactions. Two mecha- 
nisms are used: either H + is directly transported out of the cell or HC0 3 " is 
brought into the cell to neutralize H + in the cytosol (according to the reaction 
HC0 3 - + H + -> H 2 + CO2). One of the antiporters that uses the first mechanism 
is a Na + -H + exchanger, which couples an influx of Na + to an efflux of H + . Another, 
which uses a combination of the two mechanisms, is a Na + -driven Ct-HCChr 
exchanger that couples an influx of Na + and HCO3" to an efflux of CI" and H + (so 
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Figure 11-10The molecular mechanism 
of the bacterial lactose permease 
suggested from its crystal structure. 

(A) The 1 2 transmembrane helices of the 
permease are clustered into two lobes, 
shown in two shades of green. The loops 
that connect the helices on either side of 
the membrane are omitted for clarity. 
During transport, the helices slide and tilt 
in the membrane, exposing binding sites 
for the di saccharide lactose (yellow) and 
H + to either side of the membrane. (B) In 
one conformational state, the H + - and 
lactose-binding sites are accessible to the 
extracellular space (top row); in the other, 
they are exposed to the cytosol (bottom 
row). Loading the solutes on the 
extracellular side is favored because 
arginine(R) 144 forms a bond with 
glutamic acid (E) 1 26, leaving E269 free to 
accept H + . Unloading the solutes on the 
cytosolic side is favored because R144 
forms a bond with E269, which 
destabilizes the bound H + . In addition, 
the lactose-binding site is partially 
disrupted due to the rearrangement of 
the helices. Because the transition 
between the two protonated states 
(middle) is forbidden, H + can only be 
transported when a lactose is also 
transported. In this way, the 
electrochemical H + gradient drives 
lactose import. (Adapted from 
J. Abramson et al., Science 301 : 
610-615, 2003. With permission from 
AAAS.) 



that NaHCO : s comes in and 1 1C1 goes out). The Na + -d riven CI--HCO3- exchanger 
is twice as effective as the Na + -H + exchanger: it pumps out one H + and neutral- 
izes another for each Na + that enters the cell. If HC0 3 " is available, as is usually 
the case, this aniiporter is the most important transporter regulating the cytoso- 
lic pR The pi I inside the cell regulates both exchangers; when the pH In the 
cytosol falls, both exchangers increase their activity: 

A AW"- independent Ct-HCQ A - exchanger adjusts the cytosolic pH in the 
reverse direction, Like the Na H -dependent transporters, pH regulates the Na + - 
independent C{--HC0 3 - exchanger, but the exchangers activity increases as the 
cytosol becomes too alkaline. The movement of HCO3- in this case is normally 
out of the cell, down its electrochemical gradient, which decreases the pH of the 
cytosol A INkr-independent Cl--HCC) : r exchanger in the membrane of red blood 
cells (called band 3 protein— see Figure 10-41) facilitates the quick discharge of 
C0 2 (as HCO : f 1 as the cells pass dirough capillaries in the lung. 

The intracellular pH is not entirely regulated by these coupled transporters: 
ATP-driven rT pumps are also used to control the pH of many intracellular com- 
partments. As discussed in Chapter 13, H* pumps maintain "the low pH in lyso- 
somes, as well as in endosomes and secretory vesicles. These H 4 pumps use the 
energy of ATP hydrolysis to pump H + into these organelles from the cytosol. 



An Asymmetric Distribution of Transporters in Epithelial Cells 
Underlies the Transcellular Transport of Solutes 

In epithelial cells, such as those that absorb nutrients from the gut, trans- 
porters are distributed nonuniformly in the plasma membrane and thereby 
contribute to the transcellular transport of absorbed solutes. By the action of 
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the transporters in these cells, solutes are moved across the epithelial cell layer 
into the extracellular fluid from where they pass into the blood. As shown in Fig- 
ure 11-11, Na + -linked symporters located in the apical (absorptive) domain of 
the plasma membrane actively transport nutrients into the cell, building up sub- 
stantial concentration gradients for these solutes across the plasma membrane. 
Na + -independent transport proteins in the basal and lateral (basolateral) 
domain allow the nutrients to leave the cell passively down these concentration 
gradients. 

In many of these epithelial cells, the plasma membrane area is greatly 
increased by the formation of thousands of microvilli, which extend as thin, fin- 
gerlike projections from the apical surface of each cell. Such microvilli can 
increase the total absorptive area of a cell as much as 25-fold, thereby enhanc- 
ing its transport capabilities. 

As we have seen, ion gradients have a crucial role in driving many essential 
transport processes in cells. Ion pumps that use the energy of ATP hydrolysis 
establish and maintain these gradients, as we discuss next. 



Figure 11-11 Transcellular transport. 

<GGAT> The transcellular transport of 
glucose across an intestinal epithelial cell 
depends on the nonuniform distribution 
of transporters in the cell's plasma 
membrane. The process shown here 
results in the transport of glucose from 
the intestinal lumen to the extracellular 
fluid {from where it passes into the blood). 
Glucose is pumped into the cell through 
the apical domain of the membrane by a 
Na + -powered glucose symporter. Glucose 
passes out of the cell (down its 
concentration gradient) by passive 
movement through a different glucose 
transporter in the basal and lateral 
membrane domains. The Na + gradient 
driving the glucose symport is maintained 
by a Na + pump in the basal and lateral 
plasma membrane domains, which keeps 
the internal concentration of Na + low. 
Adjacent cells are connected by 
impermeable tight junctions, which have 
a dual function in the transport process 
illustrated: they prevent solutes from 
crossing the epithelium between cells, 
allowing a concentration gradient of 
glucose to be maintained across the cell 
sheet, and they also serve as diffusion 
barriers within the plasma membrane, 
which help confine the various 
transporters to their respective membrane 
domains (see Figure 10-37). 



There Are Three Classes of ATP-Driven Pumps 

ATP-driven pumps are often called transport ATPases because they hydrolyze 
ATP to ADP and phosphate and use the energy released to pump ions or other 
solutes across a membrane. There are three principal classes of ATP-driven 
pumps (Figure 11-12), and representatives of each are found in all procaryotic 
and eucaryotic cells. 

1. P-type pumps are structurally and functionally related multipass trans- 
membrane proteins. They are called "P-type" because they phosphorylate 
themselves during the pumping cycle. This class includes many of the ion 
pumps that are responsible for setting up and maintaining gradients of 
Na + , K + , H + , and Ca 2+ across cell membranes. 

2. F-type pumps are turbine-like proteins, constructed from multiple differ- 
ent subunits. They differ structurally from P-type ATPases and are found in 
the plasma membrane of bacteria, the inner membrane of mitochondria, 
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and the thylakoid membrane of chloroplasts. They are often called ATP 
synthases because they normally work in reverse: instead of using ATP 
hydrolysis to drive H + transport, they use the H + gradient across the mem- 
brane to drive the synthesis of ATP from ADP and phosphate. The H + gra- 
dient is generated either during the electron-transport steps of oxidative 
phosphorylation (in aerobic bacteria and mitochondria), during photo- 
synthesis (in chloroplasts), or by the light-activated H + pump (bacterio- 
rhodopsin) in Halobacterium. We discuss these proteins in detail in Chap- 
ter 14. 

Structurally related to the F-type ATPases is a distinct family of V-type 
ATPases that normally pump H + rather than synthesize ATP. They pump H + 
into organelles, such as lysosomes, synaptic vesicles, and plant vacuoles to 
acidify the interior of these organelles (see Figure 13-36). 
ABC transporters primarily pump small molecules across cell mem- 
branes, in contrast to P-type and the F- or V-type ATPases, which exclu- 
sively transport ions. 

For the remainder of this section, we focus on P-type pumps and ABC trans- 
porters. 



Figure 1 1 -12 Three types of ATP-driven 
pumps. The different molecular designs 
of the pumps are cartooned here. Like 
any enzyme, pumps can work in reverse: 
when the electrochemical gradients of 
the solutes are reversed and the ATP/ADP 
ratio is low, they can synthesize ATP from 
ADR as shown for the F-type ATPase, 
which normally works in this mode. 



The Ca 2+ Pump Is the Best-Understood P-type ATPase 

Eucaryotic cells maintain very low concentrations of free Ca 2+ in their cytosol 
(~10~ 7 M) in the face of a very much higher extracellular Ca 2+ concentration 
(~ 10" 3 M) . Even a small influx of Ca 2+ significantly increases the concentration of 
free Ca 2+ in the cytosol, and the flow of Ca 2+ down its steep concentration gradi- 
ent in response to extracellular signals is one means of transmitting these signals 
rapidly across the plasma membrane (discussed in Chapter 15). It is important, 
therefore, that the cell maintain a steep Ca 2+ gradient across its plasma mem- 
brane. Ca 2+ transporters that actively pump Ca 2+ out of the cell help maintain 
the gradient. One of these is a P-type Ca 2+ ATPase; the other is an antiporter 
(called a Na + -Ca 2+ exchanger) that is driven by the Na + electrochemical gradient 
across the membrane (see Figure 15-41). 

The best-understood P-type transport ATPase is the Ca 2+ pump, or Ca 2+ 
ATPase, in the sarcoplasmic reticulum (SR) membrane of skeletal muscle cells. 
The SR is a specialized type of endoplasmic reticulum that forms a network of 
tubular sacs in the muscle cell cytoplasm and serves as an intracellular store of 
Ca 2+ . (When an action potential depolarizes the muscle cell plasma membrane, 
Ca 2+ is released into the cytosol from the SR through Ca 2 ^ -release channels, stim- 
ulating the muscle to contract, as discussed in Chapter 16.) The Ca 2+ pump, 
which accounts for about 90% of the membrane protein of the SR, moves Ca 2+ 
from the cytosol back into the SR. The endoplasmic reticulum of nonmuscle 
cells contains a similar Ca 2+ pump, but in smaller quantities. 

The three-dimensional structure of the SR Ca 2+ pump has been determined 
by x-ray crystallography. This structure and the analysis of a related fungal H + 
pump have provided the first views of P-type transport ATPases, which are all 
thought to have similar structures. They contain 10 transmembrane a helices, 
three of which line a central channel that spans the lipid bilayer. In the unphos- 
phorylated state, two helices are disrupted and form a cavity that binds two Ca 2+ 
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Figure 1 1 -1 3 A model of how the 
sarcoplasmic reticulum Ca 2+ pump moves 
Ca 2+ . (A) The structures of the 
unphosphorylated, Ca 2+ -bound state (left) 
and the phosphorylated, Ca 2+ -free state 
(right) were determined by x-ray 
crystallography. (B)The model shows how 
ATP binding and hydrolysis cause drastic 
conformational changes, bringing the 
nucleotide-binding and phosphorylation 
domains into close proximity. This change is 
thought to cause a 90° rotation of the 
activator domain, which leads to a 
rearrangement of the transmembrane 
helices. The rearrangement of the helices 
disrupts the Ca 2+ -binding cavity and releases 
the Ca 2+ into the lumen of the sarcoplasmic 
reticulum. (Adapted from C.Toyoshima et al., 
Nature 405:647-655, 2000. With permission 
from Macmillan Publishers Ltd.) 
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ions and is accessible from the cytosolic side of the membrane. The binding of 
ATP to a binding site on the same side of the membrane and the subsequent 
transfer of the terminal phosphate group of the ATP to an aspartic acid of an 
adjacent domain lead to a drastic rearrangement of the transmembrane helices. 
The rearrangement disrupts the Ca 2+ -binding site and releases the Ca 2+ ions on 
the other side of the membrane, into the lumen of the SR (Figure 11-13). An 
essential characteristic of all P-type pumps is that the pump transiently phos- 
phorylates itself during the pumping cycle. 



The Plasma Membrane P-type Na + -K + Pump Establishes the Na + 
Gradient Across the Plasma Membrane 

The concentration of K + is typically 10-30 times higher inside cells than outside, 
whereas the reverse is true of Na + (see Table 1 1-1, p. 652) . A Na + -K + pump, or Na + 
pump, found in the plasma membrane of virtually all animal cells, maintains 
these concentration differences. The pump operates as an ATP-driven 
antiporter, actively pumping Na + out of the cell against its steep electrochemical 
gradient and pumping K + in (Figure 11-14). Because the pump hydrolyzes ATP 
to pump Na + out and K + in, it is also known as a Na + -K + ATPase. The pump 
belongs to the family of P-type ATPases and functions very similarly to the Ca 2+ 
pump (Figure 11-15). 

We mentioned earlier that the Na + gradient produced by the Na + -K + pump 
drives the transport of most nutrients into animal cells and also has a crucial 
role in regulating cytosolic pH. A typical animal cell devotes almost one-third of 
its energy to fueling this pump, and the pump consumes even more energy in 
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electrically active nerve cells, which, as we shall see, repeatedly gain small 
amounts of Na + and lose small amounts of K + during the propagation of nerve 
impulses. 

Like any enzyme, the Na + -K + pump can be driven in reverse, in this case to 
produce ATP. When the Na + and K + gradients are experimentally increased to 
such an extent that the energy stored in their electrochemical gradients is 
greater than the chemical energy of ATP hydrolysis, these ions move down their 
electrochemical gradients and ATP is synthesized from ADP and phosphate by 
the Na + -K + pump. Thus, the phosphorylated form of the pump (step 2 in Figure 
11-15) can relax by either donating its phosphate to ADP (step 2 to step 1) or 
changing its conformation (step 2 to step 3). Whether the overall change in free 
energy is used to synthesize ATP or to pump Na + out of the cell depends on the 
relative concentrations of ATP, ADP, and phosphate, as well as on the electro- 
chemical gradients for Na + and K + . 

Since the Na + -K + pump drives three positively charged ions out of the cell 
for every two it pumps in, it is electrogenic. It drives a net current across the 



Figure 11 -14 The Na + -K + pump. <GAGT> 
This transporter actively pumps Na + out of 
and K + into a cell against their 
electrochemical gradients. For every 
molecule of ATP hydrolyzed inside the cell, 
three Na + are pumped out and two K + are 
pumped in. The specific inhibitor ouabain 
and K + compete for the same site on the 
extracellular side of the pump. 



EXTRACELLULAR SPACE 




Figure 11-15 A model of the pumping 
cycle of the Na + -K + pump. (1 ) The binding 
of intracellular Na + and the subsequent 
phosphorylation by ATP of the 
cytoplasmic face of the pump induce a 
conformational change in the protein that 

(2) transfers the Na + across the membrane 
and releases it on the outside of the cell. 

(3) Then, the binding of K + on the 
extracellular surface and the subsequent 
dephosphorylation of the pump return 
the protein to its original conformation, 
which (4) transfers the K + across the 
membrane and releases it into the cytosol. 
These changes in conformation are 
analogous to the A o B transitions shown 
in Figure 1 1-5, except that here the Na + - 
dependent phosphorylation and the In- 
dependent dephosphorylation of the 
protein cause the conformational 
transitions to occur in an orderly manner, 
enabling the protein to do useful work. 
Although for simplicity the diagram shows 
only one Na + - and one K + -binding site, in 
the real pump there are three Na + - and 
two K + -binding sites. 
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membrane, tending to create an electrical potential, with the cell's inside being 
negative relative to the outside. This electrogenic effect of the pump, however, 
seldom contributes more than 10% to the membrane potential. The remaining 
90%, as we discuss later, depends only indirectly on the Na + -K + pump. 

On the other hand, the Na + -K + pump does have a direct role in controlling 
the solute concentration inside the cell and thereby helps regulate osmolality 
(or tonicity) of the cytosol. All cells contain specialized water channel proteins 
called aquaporins (discussed in detail on p. 673) in their plasma membrane to 
facilitate water flow across this membrane. Thus, water moves into or out of cells 
down its concentration gradient, a process called osmosis. As explained in Panel 
11-1, cells contain a high concentration of solutes, including numerous nega- 
tively charged organic molecules that are confined inside the cell (the so-called 
fixed anions) and their accompanying cations that are required for charge bal- 
ance. This tends to create a large osmotic gradient that tends to "pull" water into 
the cell. Animal cells counteract this effect by an opposite osmotic gradient due 
to a high concentration of inorganic ions — chiefly Na + and CI"— in the extracel- 
lular fluid. The Na + -K + pump helps maintain osmotic balance by pumping out 
the Na + that leaks in down its steep electrochemical gradient. The CI" is kept out 
by the membrane potential. 

In the special case of human red blood cells, which lack a nucleus and other 
organelles and have a plasma membrane that has an unusually high permeabil- 
ity to water, osmotic water movements can greatly influence cell volume, and 
the Na + -K + pump plays an important part in maintaining red cell volume. If 
these cells are placed in a hypotonic solution (that is, a solution having a low 
solute concentration and therefore a high water concentration), there is net 
movement of water into the cells, causing them to swell and burst (lyse); con- 
versely, if the cells are placed in a hypertonic solution, they shrink (Figure 
11-16). The role of the Na + -K + pump in controlling red cell volume is indicated 
by the observation that the cells swell, and may eventually burst, if they are 
treated with ouabain, which inhibits the Na + -K + pump. For most animal cells, 
however, osmosis and the Na + -K + pump have only minor roles in regulating cell 
volume. This is because most of the cytoplasm is in a gel-like state and resists 
large changes in its volume in response to changes in osmolarity. 

Nonanimal cells cope with their osmotic problems in various ways. Plant 
cells and many bacteria are prevented from bursting by the semirigid cell wall 
that surrounds their plasma membrane. In amoebae, the excess water that flows 
in osmotically is collected in contractile vacuoles, which periodically discharge 
their contents to the exterior (see Panel 11-1). Bacteria have also evolved strate- 
gies that allow them to lose ions, and even macromolecules, quickly when sub- 
jected to an osmotic shock. 

ABC Transporters Constitute the Largest Family of Membrane 
Transport Proteins 

The last type of carrier protein that we discuss is the family of the ABC trans- 
porters, so named because each member contains two highly conserved 
ATPase domains or ATP-binding "cassettes" (Figure 11-17). ATP binding leads 
to dimerization of the two ATP-binding domains, and ATP hydrolysis leads to 
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Figure 11-16 Response of a human red 
blood cell to changes in osmolarity of the 
extracellular fluid. <GTAC> The cell swells 
or shrinks as water moves into or out of the 
cell down its concentration gradient. 
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SOURCES OF INTRACELLULAR OSMOLARITY 




1 Macromolecules themselves contribute 
very little to the osmolarity of the cell 
interior since, despite their large size, each 
one counts only as a single molecule and 
there are relatively few of them compared 
to the number of small molecules in the 
cell. However, most biological macro- 
molecules are highly charged, and they 
attract many inorganic ions of opposite 
charge. Because of their large numbers, 
these counterions make a major 
contribution to intracellular osmolarity. 
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2 As the result of active transport and 
metabolic processes, the cell contains 
a high concentration of small organic 
molecules, such as sugars, amino acids, 
and nucleotides, to which its plasma 
membrane is impermeable. Because 
most of these metabolites are charged, 
they also attract counterions. Both the 
small metabolites and their counterions 
make a further major contribution to 
intracellular osmolarity. 
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3 The osmolarity of the extracellular fluid is 
usually due mainly to small inorganic ions. 
These leak slowly across the plasma membrane 
into the cell. If they were not pumped out, 
and if there were no other molecules inside 
the cell that interacted with them so as to 
influence their distribution, they would 
eventually come to equilibrium with equal 
concentrations inside and outside the cell. 
However, the presence of charged macro- 
molecules and metabolites in the cell that 
attract these ions gives rise to the Donnan 
effect: it causes the total concentration of 
inorganic ions (and therefore their 
contribution to the osmolarity) to be greater 
inside than outside the cell at equilibrium. 



H2O THE PROBLEM 



Because of the above factors, a cell that does nothing to 
control its osmolarity will have a higher concentration of 
solutes inside than outside. As a result, water will be higher in 
concentration outside the cell than inside. This difference 
in water concentration across the plasma membrane will cause 
water to move continuously into the cell by osmosis. 



THE SOLUTION 

Animal cells and bacteria control their 
intracellular osmolarity by actively 
pumping out inorganic ions, such as Na + , 
so that their cytoplasm contains a lower 
total concentration of inorganic ions than 
the extracellular fluid, thereby compensating 
for their excess of organic solutes. 
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Plant cells are prevented from swelling by 
their rigid walls and so can tolerate an 
osmotic difference across their plasma 
membranes: an internal turgor pressure is 
built up, which at equilibrium forces out as 
much water as enters. 
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Many protozoa avoid becoming 
swollen with water, despite an 
osmotic difference across the 
plasma membrane, by periodically 
extruding water from 
special contractile vacuoles. 
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Figure 11-17 Typical ABC transporters in procaryotes (A) and eucaryotes (B). Transporters consist of multiple domains: 
typically, two hydrophobic domains, each built of six membrane-spanning segments that form the translocation pathway 
and provide substrate specificity, and two ATPase domains (also called ATP-binding cassettes) protruding into the cytosol. 
In some cases, the two halves of the transporter are formed by a single polypeptide, whereas in other cases they are 
formed by two or more separate polypeptides that assemble into a similar structure (see Figure 1 0-24). Without ATP 
bound, the transporter exposes a substrate-binding site to either the extracellular space (in procaryotes) or the intracellular 
space (in eucaryotes or procaryotes). ATP binding induces a conformational change that exposes the substrate-binding 
pocket to the opposite face; ATP hydrolysis followed by ADP dissociation returns the transporter to its original 
conformation. Most individual ABC transporters are unidirectional. Both importing and exporting ABC transporters are 
found in bacteria, but in eucaryotes almost all ABC transporters export substances from the cytosol— either to the 
extracellular space or to a membrane-bound intracellular compartment such as the ER or the mitochondria. 



their dissociation. These structural changes in the cytosolic domains are 
thought to be transmitted to the transmembrane segments, driving cycles of 
conformational changes that alternately expose substrate-binding sites on one 
side of the membrane and then on the other. In this way, ABC transporters use 
ATP binding and hydrolysis to transport small molecules across the bilayer. 

ABC transporters constitute the largest family of membrane transport pro- 
teins and are of great clinical importance. The first of these proteins to be char- 
acterized was found in bacteria. We have already mentioned that the plasma 
membranes of all bacteria contain transporters that use the H + gradient across 
the membrane to pump a variety of nutrients into the cell. Bacteria also have 
transport ATPases that use the energy of ATP hydrolysis to import certain small 
molecules. In bacteria such as E. coli, which have double membranes (Figure 
11-18), the transport ATPases are located in the inner membrane, and an auxil- 
iary mechanism operates to capture the nutrients and deliver them to the trans- 
porters (Figure 11-19}. 

In K coll 78 genes (an amazing 5% of the bacterium's genes) encode ABC 
transporters, and animal genomes encode more. Although each transporter is 
thought to be specific for a particular molecule or class of molecules, the variety 
of substrates transported by this superfamily is great and includes inorganic 
ions, amino acids, mono- and polysaccharides, peptides, and even proteins. 
Whereas bacterial ABC transporters are used for both import and export, those 
identified in eucaryotes seem mostly specialized for export. 

Indeed, the first eucaryotic ABC transporters identified were discovered 
because of their ability to pump hydrophobic drugs out of the cytosol. One of 
these transporters is the multidrug resistance (MDR) protein, the overexpression 
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Figure 11 -1 8 A small section of the 
double membrane of an E. coli bacterium. 

The inner membrane is the cell's plasma 
membrane. Between the inner and outer 
lipid bilayer membranes is a highly porous, 
rigid peptidoglycan layer, composed of 
protein and polysaccharide, that 
constitutes the bacterial cell wall. It is 
attached to lipoprotein molecules in the 
outer membrane and fills the periplasmic 
space (only a little of the peptidoglycan 
layer is shown). This space also contains a 
variety of soluble protein molecules. The 
dashed threads (shown in green) at the top 
represent the polysaccharide chains of the 
special lipopolysaccharide molecules that 
form the external monolayer of the outer 
membrane; for clarity, only a few of these 
chains are shown. Bacteria with double 
membranes are called Gram-negative 
because they do not retain the dark blue 
dye used in Gram staining. Bacteria with 
single membranes (but thicker cell walls), 
such as staphylococci and streptococci, 
retain the blue dye and are therefore called 
Gram-positive] their single membrane is 
analogous to the inner (plasma) 
membrane of Gram-negative bacteria. 
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Figure 11-19 The auxiliary transport 
system associated with transport 
ATPases in bacteria with double 
membranes. The solute diffuses through 
channel-forming proteins (porins) in the 
outer membrane and binds to a 
periplasm ic substrate-binding protein, 
which undergoes a conformational 
change that enables it to bind to an ABC 
transporter in the plasma membrane. The 
ABC transporter then picks up the solute 
and actively transfers it across the plasma 
membrane in a reaction driven by ATP 
hydrolysis. The peptidoglycan is omitted 
for simplicity; its porous structure allows 
the substrate-binding proteins and 
water-soluble solutes to move through it 
by simple diffusion. 
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of which in human cancer cells can make the cells simultaneously resistant to a 
variety of chemically unrelated cytotoxic drugs that are widely used in cancer 
chemotherapy. Treatment with any one of these drugs can result in the selective 
survival and overgrowth of those cancer cells that express more of the MDR 
transporter. These cells can pump drugs out of the cell very efficiently and are 
therefore relatively resistant to the toxic effects of the anticancer drugs. Selection 
for cancer cells with resistance to one drug can thereby lead to resistance to a 
wide variety of anti-cancer drugs. Some studies indicate that up to 40% of 
human cancers develop multidrug resistance, making it a major hurdle in the 
battle against cancer, 

A related and equally sinister phenomenon occurs in the protisi Plasmo- 
dium falciparum, which causes malaria. More than 200 million people are 
infected worldwide with this parasite, which remains a major cause of human 
death, killing more than a million people every yean The development of resis- 
tance to the antimalarial drug chioroquine hampers the control of malaria. The 
resistant H falciparum have amplified a gene encoding an ABC transporter that 
pumps out the chioroquine. 

In most vertebrate cells, an ABC transporter in the endoplasmic reticulum 
(ER) membrane actively transports a wide variety of peptides from the cytosol 
intij the ER lumen. These peptides are produced by protein degradation in pro- 
teasomes (discussed in Chapter 6). They are carried from the ER to the cell sur- 
face, where they are displayed for scrutiny by cytotoxic T lymphocytes, which 
will kill the cell if the peptides are derived from a virus or other microorganisms 
lurking in the cytosol of an infected cell (discussed in Chapter 25). 

Yet another member of the ABC transporter family is the cystic fibrosis trans- 
membrane conductance regulator protein (CFTR), which was discovered 
through studies of the common genetic disease cystic fibrosis. This disease is 
caused by a mutation in the gene encoding CFTR. which functions as a CI" chan- 
nel m the plasma membrane of epithelial cells. CFTR regulates ion concentra- 
tions in the extracellular fluid, especially in the lung. One in 27 Caucasians car- 
Ties a gene encoding a mutant form of this protein; in 1 in 2900. both copies of 
the gene are mutated, causing the disease. In contrast to other ABC transporters, 
ATP binding and hydrolysis do not drive the transport process. Instead, they 
control the opening and closing of the CI" channel, which provides a passive 
conduit for CI" to move down its electrochemical gradient. Thus, ABC proteins 
can apparently function as either transporters or channels. 
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Summary 

Transporters bind specific solutes and transfer them across the lipid bilayer by under- 
going conformational changes that expose the solute-binding site sequentially on one 
side of the membrane and then on the other. Some transporters simply move a single 
solute "downhill/' whereas others can act as pumps to move a solute "uphill" against 
its electrochemical gradient, using energ\> provided by ATP hydrolysis, by a downhill 
flow of another solute (such as Na + orH+)> or by light to drive the requisite series of con- 
formational changes in an orderly manner. Transporters belong to a small number of 
protein families. Each family contains proteins of similar amino acid sequences that 
are thought to have evolved from a common ancestral protein and to operate by a sim- 
ilar mechanism. The family of P- type transport ATPases, which includes Ca 2 ^ and 
Na + -K + pumps, is an important example; each of these ATPases sequentially phospho- 
rylates and dephosphorylates itself during the pumping cycle. The superfamily of ABC 
transporters is the largest family of membrane transport proteins and is especially 
important clinically. It includes proteins that are responsible for cystic fibrosis, as well 
as for drug resistance in both cancer cells and malaria-causing parasites. 

ION CHANNELS AND THE ELECTRICAL PROPERTIES 
OF MEMBRANES 

Unlike carrier proteins, channel proteins form hydrophilic pores across mem- 
branes. One class of channel proteins found in virtually all animals forms gap 
junctions between two adjacent cells; each plasma membrane contributes 
equally to the formation of the channel, which connects the cytoplasm of the 
two cells. These channels are discussed in Chapter 19 and will not be considered 
further here. Both gap junctions and porins, the channel-forming proteins of the 
outer membranes of bacteria, mitochondria, and chloroplasts (discussed in 
Chapter 10), have relatively large and permissive pores, which would be disas- 
trous if they directly connected the inside of a cell to an extracellular space. 
Indeed, many bacterial toxins do exactly that to kill other cells (discussed in 
Chapter 24). 

In contrast, most channel proteins in the plasma membrane of animal and 
plant cells that connect the cytosol to the cell exterior necessarily have narrow, 
highly selective pores that can open and close rapidly. Because these proteins 
are concerned specifically with inorganic ion transport, they are referred to as 
ion channels. For transport efficiency, ion channels have an advantage over car- 
riers in that up to 100 million ions can pass through one open channel each sec- 
ond—a rate 10 5 times greater than the fastest rate of transport mediated by any 
known carrier protein. However, channels cannot be coupled to an energy 
source to perform active transport, so the transport that they mediate is always 
passive (downhill). Thus, the function of ion channels is to allow specific inor- 
ganic ions— primarily Na + , K + , Ca 2+ , or CI"— to diffuse rapidly down their elec- 
trochemical gradients across the lipid bilayer. As we shall see, the ability to con- 
trol ion fluxes through these channels is essential for many cell functions. Nerve 
cells (neurons), in particular, have made a specialty of using ion channels, and 
we shall consider how they use many different ion channels to receive, conduct, 
and transmit signals. 

Ion Channels Are Ion-Selective and Fluctuate Between Open and 
Closed States 

Two important properties distinguish ion channels from simple aqueous pores. 
First, they show ion selectivity, permitting some inorganic ions to pass, but not 
others. This suggests that their pores must be narrow enough in places to force 
permeating ions into intimate contact witii the walls of the channel so that only 
ions of appropriate size and charge can pass. The permeating ions have to shed 
most or all of their associated water molecules to pass t often in single file, 
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through the narrowest part of the channel, which is called the selectivity filter; 
this limits their rate of passage (Figure 11-20). Thus, as the ion concentration 
increases, the flux of the ion through a channel increases proportionally but 
then levels off (saturates) at a maximum rate. 

The second important distinction between ion channels and simple aque- 
ous pores is that ion channels are not continuously open. Instead, they are 
gated, which allows them to open briefly and then close again (Figure 11-21). 
Moreover, with prolonged (chemical or electrical) stimulation, most channels go 
into a closed "desensitized" or "inactivated" state, in which they are refractory to 
further opening until the stimulus has been removed, as we discuss later. In 
most cases, the gate opens in response to a specific stimulus. The main types of 
stimuli that are known to cause ion channels to open are a change in the voltage 
across the membrane (voltage-gated channels), a mechanical stress (mechani- 
cally gated channels), or the binding of a ligand (ligand-gated channels). The lig- 
and can be either an extracellular mediator— specifically, a neurotransmitter 
(transmitter-gated channels) — or an intracellular mediator such as an ion (ion- 
gated channels) or a nucleotide (nucleotide-gated channels). In addition, protein 
phosphorylation and dephosphorylation regulates the activity of many ion 
channels; this type of channel regulation is discussed, together with nucleotide- 
gated ion channels, in Chapter 15. 

More than 100 types of ion channels have been described thus far, and new 
ones are still being discovered, each characterized by the ions it conducts, the 
mechanism by which it is gated, and its abundance and localization in the cell. 
Ion channels are responsible for the electrical excitability of muscle cells, and 
they mediate most forms of electrical signaling in the nervous system. A single 
neuron might typically contain 10 or more kinds of ion channels, located in dif- 
ferent domains of its plasma membrane. But ion channels are not restricted to 
electrically excitable cells. They are present in all animal cells and are found in 
plant cells and microorganisms: they propagate the leaf closing response of the 
mimosa plant, for example, and allow the single-celled Paramecium to reverse 
direction after a collision. 

Perhaps the most common ion channels are those that are permeable 
mainly to K + . These channels are found in the plasma membrane of almost all 
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Figure 1 1-20 A typical ion channel, 
which fluctuates between closed and 
open conformations. The channel 
protein shown here in cross section forms 
a hydrophilic pore across the lipid bilayer 
only in the "open" conformational state. 
Polar groups are thought to line the wall 
of the pore, while hydrophobic amino 
acid side chains interact with the lipid 
bilayer (not shown). The pore narrows to 
atomic dimensions in one region (the 
selectivity filter), where the ion selectivity 
of the channel is largely determined. 



Figure 1 1-21 The gating of ion 
channels. This drawing shows different 
kinds of stimuli that open ion 
channels. Mechanically gated channels 
often have cytoplasmic extensions 
CYTOSOL that link the channel to the 
cytoskeleton (not shown). 
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animal cells. An important subset of K + channels opens even in an unstimulated 
or "resting" cell, and hence these channels are sometimes called K + leak chan- 
nels. Although this term applies to many different K + channels, depending on 
the cell type, they serve a common purpose. By making the plasma membrane 
much more permeable to K + than to other ions, they have a crucial role in main- 
taining the membrane potential across all plasma membranes. 

The Membrane Potential in Animal Cells Depends Mainly on K + 
Leak Channels and the K + Gradient Across the Plasma Membrane 

A membrane potential arises when there is a difference in the electrical charge 
on the two sides of a membrane, due to a slight excess of positive ions over neg- 
ative ones on one side and a slight deficit on the other. Such charge differences 
can result both from active electrogenic pumping (see p. 662) and from passive 
ion diffusion. As we discuss in Chapter 14, electrogenic H + pumps in the mito- 
chondrial inner membrane generate most of the membrane potential across 
this membrane. Electrogenic pumps also generate most of the electrical poten- 
tial across the plasma membrane in plants and fungi. In typical animal cells, 
however, passive ion movements make the largest contribution to the electrical 
potential across the plasma membrane. 

As explained earlier, the Na + -K + pump helps maintain an osmotic balance 
across the animal cell membrane by keeping the intracellular concentration of 
Na + low. Because there is little Na + inside the cell, other cations have to be plen- 
tiful there to balance the charge carried by the cell's fixed anions — the negatively 
charged organic molecules that are confined inside the cell. This balancing role 
is performed largely by K + , which is actively pumped into the cell by the Na + -K + 
pump and can also move freely in or out through the K* leak channels in the 
plasma membrane. Because of the presence of these channels, K + comes almost 
to equilibrium, where an electrical force exerted by an excess of negative charges 
attracting K + into the cell balances the tendency of K + to leak out down its con- 
centration gradient. The membrane potential is the manifestation of this elec- 
trical force, and we can calculate its equilibrium value from the steepness of the 
K + concentration gradient. The following argument may help to make this clear. 

Suppose that initially there is no voltage gradient across the plasma mem- 
brane (the membrane potential is zero) but the concentration of K + is high 
inside the cell and low outside. K + will tend to leave the cell through the K + leak 
channels, driven by its concentration gradient. As K + begins to move out, each 
ion leaves behind an unbalanced negative charge, thereby creating an electrical 
field, or membrane potential, which will tend to oppose the further efflux of K + . 
The net efflux of K + halts when the membrane potential reaches a value at which 
this electrical driving force on K + exactly balances the effect of its concentration 
gradient — that is, when the electrochemical gradient for K + is zero. Although CI" 
ions also equilibrate across the membrane, the membrane potential keeps most 
of these ions out of the cell because their charge is negative. 

The equilibrium condition, in which there is no net flow of ions across the 
plasma membrane, defines the resting membrane potential for this idealized cell. 
A simple but very important formula, the Nernst equation, quantifies the equilib- 
rium condition and, as explained in Panel 11-2, makes it possible to calculate the 
theoretical resting membrane potential if we know the ratio of internal and exter- 
nal ion concentrations. As the plasma membrane of a real cell is not exclusively 
permeable to K + and CI", however, the actual resting membrane potential is usu- 
ally not exactly equal to that predicted by the Nernst equation for K + or CI". 

The Resting Potential Decays Only Slowly When the Na + -K + Pump 
Is Stopped 

Only a minute number of ions must move across the plasma membrane to set up 
the membrane potential. Thus, we can think of the membrane potential as arising 
from movements of charge that leave ion concentrations practically unaffected 
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THE NERNST EQUATION AND ION FLOW 



The flow of any ion through a membrane channel protein 
is driven by the electrochemical gradient for that ion. This 
gradient represents the combination of two influences: 
the voltage gradient and the concentration gradient of 
the ion across the membrane. When these two influences 
just balance each other the electrochemical gradient for 
the ion is zero and there is no net flow of the ion through 
the channel. The voltage gradient (membrane potential) 
at which this equilibrium is reached is called the 
equilibrium potential for the ion. It can be calculated 
from an equation that will be derived below, called the 
Nernst equation. 



At the point where the concentration and voltage 
gradients just balance, 



AG conc + AG vo , t = 



and the ion distribution is at equlibrium across the 
membrane. 



Thus, 



and, therefore, 



zFV-RT ln^°. 



V= 2L|n-^ 



zF 



The Nernst equation is 



zF C. 



where 



l/=the equilibrium potential in volts (internal 

potential minus external potential) 
C and Cj = outside and inside concentrations of the 

ion, respectively 
R = the gas constant (2 cal mol" 1 K~ 1 ) 
T= the absolute temperature (K) 
F = Faraday's constant (2.3 x 10 4 cal V" 1 mol" 1 ) 
z = the valence (charge) of the ion 
In = logarithm to the base e 



The Nernst equation is derived as follows: 

A molecule in solution (a solute) tends to move from a 
region of high concentration to a region of low 
concentration simply due to the random movement of 
molecules, which results in their equilibrium. 
Consequently, movement down a concentration gradient 
is accompanied by a favorable free-energy change 
(AG < 0), whereas movement up a concentration gradient 
is accompanied by an unfavorable free-energy change 
(AG > 0). (Free energy is introduced in Chapter 2, and 
discussed in the context of redox reactions in 
Panel 14—1, p. 830.) 

The free-energy change per mole of solute moved across 
the plasma membrane (AG conc ) is equal to -RT In C / Cj. 



or, using the constant that converts natural logarithms to 
base 10, 

V=2.3^log 10 -^ 



zF 



For a univalent ion, 
RT 



2.3 — = 58 mV at 20°C and 61.5 mVat 37°C 



Thus, for such an ion at 37°C, 

V= + 61.5 mV for C /Q = 10, 

V=0iorC n /G='\. 



whereas 



The K + equilibrium potential (V K ), for example, is 
61.5 log 10 ([K + ] o /[K + ]i) millivolts 
(-89 mV for a typical cell where [K + ] = 5 mM 
and[K + ] i= 140 mM). 

At V Kl there is no net flow of K + across the membrane. 

Similarly, when the membrane potential has a value of 
61.5log l0 ([Na + ] o /[Na + ]i), 
the Na + equilibrium potential (V Na ), 
there is no net flow of Na + . 

For any particular membrane potential, l/ M , the net 
force tending to drive a particular type of ion out of the 
cell, is proportional to the difference between V M and the 
equilibrium potential for the ion: hence, 
for K + it is V M - V K 
and for Na + it is V M - t/ Na . 
The number of ions that go to form the layer of charge 
adjacent to the membrane is minute compared with the 
total number inside the cell. For example, the movement 
of 6000 Na + ions across 1 ^m 2 of membrane will carry 
sufficient charge to shift the membrane potential by 
about 100 mV. 



If the solute is an ion, moving it into a cell across a 
membrane whose inside is at a voltage V relative to the 
outside will cause an additional free-energy change (per 
mole of solute moved) of AG VO | t = zFV. 



Because there are about 3 x 10 7 Na + ions in a typical cell 
(1 urn 3 of bulk cytoplasm), such a movement of charge 
will generally have a negligible effect on the ion 
concentration gradients across the membrane. 
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exact balance of charges on each side of the 
membrane; membrane potential = 



a few of the positive ions (red) cross the 
membrane from right to left, leaving their 
negative counterions (red) behind; this sets 
up a nonzero membrane potential 



and result in only a very slight discrepancy in the number of positive and nega- 
tive ions on the two sides of the membrane (Figure 11-22). Moreover, these 
movements of charge are generally rapid, taking only a few milliseconds or less. 

Consider the change in the membrane potential in a real cell after the sud- 
den inactivation of the Na + -K + pump. A slight drop in the membrane potential 
occurs immediately. This is because the pump is electrogenic and, when active, 
makes a small direct contribution to the membrane potential by pumping out 
three Na + for every two K + that it pumps in. However, switching off the pump 
does not abolish the major component of the resting potential, which is gener- 
ated by the K + equilibrium mechanism described above. This component of the 
membrane potential persists as long as the Na + concentration inside the cell 
stays low and the K + ion concentration high — typically for many minutes. But 
the plasma membrane is somewhat permeable to all small ions, including Na + . 
Therefore, without the Na + -K + pump, the ion gradients set up by the pump will 
eventually run down, and the membrane potential established by diffusion 
through the K + leak channels will fall as well. As Na + enters, the osmotic balance 
is upset, and water seeps into the cell (see Panel 11-1, p. 664), and the cell even- 
tually comes to a new resting state where Na + , K + , and CI" are all at equilibrium 
across the membrane. The membrane potential in this state is much less than it 
was in the normal cell with an active Na + -K + pump. 

The resting potential of an animal cell varies between -20 mV and -120 mV; 
depending on the organism and cell type. Although the K + gradient always has a 
major influence on this potential, the gradients of other ions (and the disequili- 
brating effects of ion pumps) also have a significant effect: the more permeable 
the membrane for a given ion, the more strongly the membrane potential tends 
to be driven toward the equilibrium value for that ion. Consequently, changes in 
a membrane's permeability to ions can cause significant changes in the mem- 
brane potential. This is one of the key principles relating the electrical excitabil- 
ity of cells to the activities of ion channels. 

To understand how ion channels select their ions and how they open and 
close, we need to know their atomic structure. The first ion channel to be crys- 
tallized and studied by x-ray diffraction was a bacterial K + channel. The details 
of its structure revolutionized our understanding of ion channels. 



Figure 1 1 -22 The ionic basis of a 
membrane potential. A small flow of 
ions carries sufficient charge to cause a 
large change in the membrane potential. 
The ions that give rise to the membrane 
potential lie in a thin (< 1 nm) surface 
layer close to the membrane, held there 
by their electrical attraction to their 
oppositely charged counterparts 
(counterions) on the other side of the 
membrane. For a typical cell, 
1 microcoulombofcharge(6x10 12 
monovalent ions) per square centimeter 
of membrane, transferred from one side 
of the membrane to the other, changes 
the membrane potential by roughly 1 V. 
This means, for example, that in a 
spherical cell of diameter 1 jam, the 
number of K + ions that have to flow out 
to alter the membrane potential by 
100 mV is only about 1/100,000 of the 
total number of K + ions in the cytosol. 



The Three-Dimensional Structure of a Bacterial K + Channel Shows 
How an Ion Channel Can Work 

Scientists were puzzled by the remarkable ability of ion channels to combine 
exquisite ion selectivity with a high conductance. K + leak channels, for example, 
conduct K + 10,000-fold better than Na + , yet the two ions are both featureless 
spheres and have similar diameters (0.133 nm and 0.095 nm, respectively). A sin- 
gle amino acid substitution in the pore of an animal cell K + channel can result in 
a loss of ion selectivity and cell death. We cannot explain the normal K + selectiv- 
ity by pore size, because Na + is smaller than K + . Moreover, the high conductance 
rate is incompatible with the channel's having selective, high-affinity K + -binding 
sites, as the binding of K + ions to such sites would greatly slow their passage. 
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Figure 1 1 -23 The structure of a bacterial K + channel. <ATTA> (A) Two transmembrane a helices from only two of the four 
identical subunits are shown. From the cytosolic side, the pore opens up into a vestibule in the middle of the membrane. 
The vestibule facilitates transport by allowing the K + ions to remain hydrated even though they are halfway across the 
membrane. The narrow selectivity filter links the vestibule to the outside of the cell. Carbonyl oxygens line the walls of the 
selectivity filter and form transient binding sites for dehydrated K + ions. The positions of the K + ions in the pore were 
determined by soaking crystals of the channel protein in a solution containing rubidium ions, which are more electron- 
dense but only slightly larger than K + ions; from the differences in the diffraction patterns obtained with K + ions and with 
rubidium ions in the channel, the positions of the ions could be calculated. Two K + ions occupy sites in the selectivity filter, 
while a third K + ion is located in the center of the vestibule, where it is stabilized by electrical interactions with the more 
negatively charged ends of the pore helices. The ends of the four pore helices (only two of which are shown) point 
precisely toward the center of the vestibule, thereby guiding K + ions into the selectivity filter. Negatively charged amino 
acids (indicated by red minus signs) are concentrated near the channel entrance and exit. (B) Because of the polarity of the 
hydrogen bonds {red) that link adjacent turns of an a helix, every a helix has an electric dipole along its axis, with a more 
negatively charged C-terminal end (5 _ ) and a more positively charged N-terminal end (5 + ). (A, adapted from D.A. Doyle et 
al., Science 280:69-77, 1998. With permission from AAAS.) 

The puzzle was solved when the structure of a bacterial K + channel was 
determined by x-ray crystallography. The channel is made from four identical 
transmembrane subunits, which together form a central pore through the mem- 
brane (Figure 11-23). Negatively charged amino acids concentrated at the 
cytosolic entrance to the pore are thought to attract cations and repel anions, 
making the channel cation- selective. Each subunit contributes two transmem- 
brane a helices, which are tilted outward in the membrane and together form a 
cone, with its wide end facing the outside of the cell where K + ions exit from the 
channel. The polypeptide chain that connects the two transmembrane helices 
forms a short a helix (the pore helix) and a crucial loop that protrudes into the 
wide section of the cone to form the selectivity filter. The selectivity loops from 
the four subunits form a short, rigid, narrow pore, which is lined by the carbonyl 
oxygen atoms of their polypeptide backbones. Because the selectivity loops of all 
known K + channels have similar amino acid sequences, it is likely that they form 
a closely similar structure. The crystal structure shows two K + ions in single file 
within the selectivity filter, separated by about 0.8 nm. Mutual repulsion 
between the two ions is thought to help move them through the pore into the 
extracellular fluid. 

The structure of the selectivity filter explains the ion selectivity of the chan- 
nel. A K + ion must lose almost all of its bound water molecules to enter the filter, 
where it interacts instead with the carbonyl oxygens lining the filter; the oxygens 
are rigidly spaced at the exact distance to accommodate a K + ion. A Na + ion, in 
contrast, cannot enter the filter because the carbonyl oxygens are too far away 
from the smaller Na + ion to compensate for the energy expense associated with 
the loss of water molecules required for entry (Figure 11-24). 

Structural studies of K + channels and other channels have also indicated 
some general principles of how channels may open and close. This gating seems 
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Figure 1 1 -24 K + specificity of the 
selectivity filter in a K + channel. The 

drawing shows K + and Na + ions (A) in the 
vestibule and (B) in the selectivity filter of 
the pore, viewed in cross section. In the 
vestibule, the ions are hydrated. In the 
selectivity filter, they have lost their 
water, and the carbonyl oxygens are 
placed precisely to accommodate a 
dehydrated K + ion. The dehydration of 
the K + ion requires energy, which is 
precisely balanced by the energy 
regained by the interaction of the ion 
with the carbonyl oxygens that serve as 
surrogate water molecules. Because the 
Na + ion is too small to interact with the 
oxygens, it can enter the selectivity filter 
only at a great energetic expense. The 
filter therefore selects K + ions with high 
specificity. (Adapted from D.A. Doyle et 
al., Science 280:69-77, 1 998. With 
permission from AAAS.) 



to involve movement of the helices in the membrane so that they either obstruct 
(in the closed state) or free (in the open state) the path for ion movement. 
Depending on the particular type of channel, helices are thought to tilt, rotate, 
or bend during gating. The structure of a closed K + channel shows that by tilting 
the inner helices, the pore constricts like a diaphragm at its cytosolic end (Fig- 
ure 11-25). Bulky hydrophobic amino acid side chains block the small opening 
that remains, preventing the entry of ions. 

Most ion channels are constructed from multiple identical subunits, each of 
which contributes to a common central pore. A recently determined crystal 
structure of a CI" channel, however, has revealed that some ion channels are 
built very differently. Although the protein is a dimer formed by two identical 
subunits, each of the subunits contains its own pore through which CI" ions 
move. In the center of the membrane, amino acid side chains form a selectivity 
filter, which is conceptually similar to that in K + channels. But, unlike the filter in 
K + channels, different regions of the protein contribute the side chains, and they 
are not symmetrically arranged (Figure 11-26). 



Aquaporins Are Permeable to Water But Impermeable to Ions 

We discussed earlier that procaryotic and eucaryotic cells have water chan- 
nels, or aquaporins, embedded in their plasma membrane to allow water to 
move readily across this membrane. Aquaporins are especially abundant in 
cells that must transport water at particularly high rates, such as the epithelial 
cells of the kidney. 



inner helix 





CLOSED 



OPEN 



Figure 1 1 -25 A model for the gating of 
a bacterial K + channel. The channel is 
viewed in cross section. To adopt the 
closed conformation, the four inner 
transmembrane helices that line the pore 
on the cytosolic side of the selectivity 
filter (see Figure 1 1 -23) rearrange to 
close the cytosolic entrance to the 
channel. (Adapted from E. Perozo et al., 
Science 285:73-78, 1999. With permission 
from AAAS.) 
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Aquaporins must solve a problem that is opposite to that facing ion chan- 
nels. To avoid disrupting ion gradients across membranes, they have to allow the 
rapid passage of water molecules while completely blocking the passage of ions. 
The crystal structure of an aquaporin reveals how it achieves this remarkable 
selectivity. The channels have a narrow pore that allows water molecules to tra- 
verse the membrane in single file, following the path of carbonyl oxygens that 
line one side of the pore (Figure 11-27A and B). Hydrophobic amino acids line 
the other side of the pore. The pore is too narrow for any hydrated ion to enter, 
and the energy cost of dehydrating an ion would be enormous because the 
hydrophobic wall of the pore cannot interact with a dehydrated ion to compen- 
sate for the loss of water. This design readily explains why the aquaporins can- 
not conduct K + , Na + , Ca 2+ , or CI" ions. To understand why these channels are 
also impermeable to H + , recall that most protons are present in cells as H 3 + , 
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Figure 1 1 -26 Comparison of CI" and K + 
channel architectures. (A) The CI" channel is 
a"double-channel"dimer built of two 
identical subunits, each of which contains its 
own ion-conducting pore. The upper 
cartoon is a schematic view of the 
extracellular face of the channel, showing 
the two identical ion-conducting pores. The 
lower cartoon shows a cross section through 
one subunit, viewed from within the 
membrane. {Dotted black line in top cartoon 
indicates the plane of section.) The subunit 
is a single polypeptide chain consisting of 
two portions, which, though similar, span 
the membrane with opposite orientations 
(white arrows): each portion contributes one 
pore helix oriented such that its positively 
charged end points towards a centrally 
positioned selectivity filter. Both elements, 
the selectivity filter and the helix dipoles, 
contribute to the selectivity of the channel 
for negatively charged CI" ions. (B) By 
contrast, the K + channel is a tetramer built 
of four identical subunits, each of which 
contributes to a centrally located pore. All 
four subunits have the same orientation in 
the membrane (white arrows). Four pore 
helices, one contributed by each subunit, 
point their negatively charged ends towards 
a vestibule, stabilizing a positively charged 
K + ion there (also see Figure 1 1-23). (Lower 
cartoons in A and B, adapted from R. Dutzler 
et al., Nature 41 5:287-294, 2002. With 
permission from Macmillan Publishers Ltd.) 

Figure 1 1 -27 The structure of aquaporins. 

(A) A ribbon diagram of an aquaporin 
monomer. In the membrane, aquaporins 
form tetramers, with each monomer 
containing a pore in its center (not shown). 

(B) A space-filling model of an aquaporin 
monomer, which has been cut and opened 
like a book, so that the inside of the pore is 
visible. Hydrophilic amino acids lining the 
pores are colored red and blue, whereas 
hydrophobic amino acids lining the pore 
are colored yellow. The amino acids not 
involved in forming the pore are shown in 
green. Note that one face of the pore is 
lined with hydrophilic amino acids, which 
provide transient hydrogen bonds to water 
molecules; these bonds help line up the 
transiting water molecules in a single row 
and orient them as they traverse the 
membrane. By contrast, the other side of 
the pore is devoid of such amino acids, 
providing a hydrophobic slide that does not 
allow hydrogen bonds to form. (C and D) A 
model explaining why aquaporins are 
impermeable to H + . (C) In water, H + diffuses 
extremely rapidly by being relayed from 
one water molecule to the next. (D) Two 
strategically placed asparagines in the 
center of each aquaporin pore help tether 

a central water molecule such that both 
valencies on its oxygen are occupied, 
thereby preventing an H + relay. (A and B, 
adapted from R.M. Stroud et al., Curr. Opin. 
Struct. Biol. 1 3:424-431 , 2003. With 
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which diffuses through water extremely rapidly, using a molecular relay mecha- 
nism that requires the making and breaking of hydrogen bonds between adja- 
cent water molecules (Figure 11-27C). Aquaporins contain two strategically 
placed asparagines, which bind to the oxygen atom of the central water 
molecule in the line of water molecules traversing the pore. Because both 
valences of this oxygen are unavailable for hydrogen bonding, the central water 
molecule cannot participate in an H + relay, and the pore is therefore imperme- 
able to protons (Figure 11-27C and D). 

Some bacterial water channels similar to aquaporins also conduct glycerol 
and small sugars, which interact with similarly positioned carbonyl oxygens lin- 
ing the pore. Such transient contacts that solutes make with the pore walls 
ensure that the transport is highly specific, without significantly impeding the 
speed with which the solute passes. Each individual aquaporin channel passes 
about 10 9 water molecules per second. 

The cells that make most sophisticated use of channels are neurons. Before 
discussing how they do so, we digress briefly to describe how a typical neuron is 
organized. 



The Function of a Neuron Depends on Its Elongated Structure 

The fundamental task of a neuron, or nerve cell, is to receive, conduct, and 
transmit signals. To perform these functions, neurons are often extremely elon- 
gated. A single neuron in a human, extending, for example, from the spinal cord 
to a muscle in the foot, may be as long as 1 meter. Every neuron consists of a cell 
body (containing the nucleus) with a number of thin processes radiating out- 
ward from it. Usually one long axon conducts signals away from the cell body 
toward distant targets, and several shorter branching dendrites extend from the 
cell body like antennae, providing an enlarged surface area to receive signals 
from the axons of other neurons (Figure 11-28), although the cell body itself 
also receives signals. A typical axon divides at its far end into many branches, 
passing on its message to many target cells simultaneously. Likewise, the extent 
of branching of the dendrites can be very great — in some cases sufficient to 
receive as many as 100,000 inputs on a single neuron. 

Despite the varied significance of the signals carried by different classes of 
neurons, the form of the signal is always the same, consisting of changes in the 
electrical potential across the neuron's plasma membrane. The signal spreads 
because an electrical disturbance produced in one part of the cell spreads to 
other parts, although the disturbance becomes weaker with increasing distance 
from its source, unless the neuron expends energy to amplify it as it travels. Over 
short distances this attenuation is unimportant; in fact, many small neurons 
conduct their signals passively, without amplification. For long-distance com- 
munication, however, such passive spread is inadequate. Thus, larger neurons 
employ an active signaling mechanism, which is one of their most striking fea- 
tures. An electrical stimulus that exceeds a certain threshold strength triggers an 
explosion of electrical activity that propagates rapidly along the neuron's plasma 
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Figure 1 1-28 A typical vertebrate 

neuron. The arrows indicate the direction 
in which signals are conveyed. The single 
axon conducts signals away from the cell 
body, while the multiple dendrites (and 
the cell body) receive signals from the 
axons of other neurons. The nerve 
terminals end on the dendrites or cell 
body of other neurons or on other cell 
types, such as muscle or gland cells. 
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membrane and is sustained by automatic amplification all along the way. This 
traveling wave of electrical excitation, known as an action potential, or nerve 
impulse, can carry a message without attenuation from one end of a neuron to 
the other at speeds of 100 meters per second or more. Action potentials are the 
direct consequence of the properties of voltage-gated cation channels, as we 
now discuss. 



Voltage-Gated Cation Channels Generate Action Potentials in 
Electrically Excitable Cells 

The plasma membrane of all electrically excitable cells— not only neurons, but 
also muscle, endocrine, and egg cells — contains voltage-gated cation channels, 
which are responsible for generating the action potentials. An action potential is 
triggered by a depolarization of the plasma membrane— that is, by a shift in the 
membrane potential to a less negative value inside. (We shall see later how the 
action of a neurotransmitter causes depolarization.) In nerve and skeletal mus- 
cle cells, a stimulus that causes sufficient depolarization promptly opens the 
voltage-gated Na + channels, allowing a small amount of Na + to enter the cell 
down its electrochemical gradient. The influx of positive charge depolarizes the 
membrane further, thereby opening more Na + channels, which admit more Na + 
ions, causing still further depolarization. This self-amplification process (an 
example of positive feedback, discussed in Chapter 15), continues until, within a 
fraction of a millisecond, the electrical potential in the local region of membrane 
has shifted from its resting value of about -70 mV to almost as far as the Na + 
equilibrium potential of about +50 mV (see Panel 11-2, p. 670). At this point, 
when the net electrochemical driving force for the flow of Na + is almost zero, the 
cell would come to a new resting state, with all of its Na + channels permanently 
open, if the open conformation of the channel were stable. Two mechanisms 
that act in concert to save the cell from such a permanent electrical spasm: the 
Na + channels inactivate and voltage-gated K + channels open. 

The Na + channels have an automatic inactivating mechanism, which causes 
the channels to reclose rapidly even though the membrane is still depolarized. 
The Na + channels remain in this inactivated state, unable to reopen, until after 
the membrane potential has returned to its initial negative value. The whole 
cycle from initial stimulus to the return to the original resting state takes a few 
milliseconds or less. The Na + channel can therefore exist in three distinct 
states — closed, open, and inactivated. Figure 11-29 shows how the changes in 
state contribute to the rise and fall of the action potential. 
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Figure 11-29 An action potential. <CGAG> 
(A) An action potential is triggered by a 
brief pulse of current, which (B) partially 
depolarizes the membrane, as shown in the 
plot of membrane potential versus time. 
The green curve shows how the membrane 
potential would have simply relaxed back 
to the resting value after the initial 
depolarizing stimulus if there had been no 
voltage-gated Na + channels in the 
membrane; this relatively slow return of the 
membrane potential to its initial value of 
-70 mV in the absence of open Na + 
channels occurs because of the efflux of K + 
through voltage-gated K + channels, which 
open in response to membrane 
depolarization and drive the membrane 
back toward the K + equilibrium potential. 
The red curve shows the course of the 
action potential that is caused by the 
opening and subsequent inactivation of 
voltage-gated Na + channels, whose state is 
shown in (C). The membrane cannot fire a 
second action potential until the Na + 
channels have returned to the closed 
conformation; until then, the membrane is 
refractory to stimulation. 
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Figure 1 1 -30 The propagation of an 
action potential along an axon. (A) The 

voltages that would be recorded from a 
set of intracellular electrodes placed at 
intervals along the axon. (B) The changes 
in the Na + channels and the current flows 
(orange arrows) that give rise to the 
traveling disturbance of the membrane 
potential. The region of the axon with a 
depolarized membrane is shaded in blue. 
Note that an action potential can only 
travel away from the site of 
depolarization, because Na + -channel 
inactivation prevents the depolarization 
from spreading backward. 
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This description of an action potential applies only to a small patch of 
plasma membrane. The self- amplifying depolarization of the patch, however, is 
sufficient to depolarize neighboring regions of membrane, which then go 
through the same cycle. In this way the action potential sweeps like a wave from 
the initial site of depolarization over the entire plasma membrane, as shown in 
Figure 11-30. 

Voltage-gated K + channels provide a second mechanism in most nerve cells 
to help bring the activated plasma membrane more rapidly back toward its orig- 
inal negative potential, ready to transmit a second impulse. These channels 
open in response to membrane depolarization in much the same way that the 
Na + channels do, but with slightly slower kinetics; for this reason they are some- 
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times called delayed K + channels. Once the K + channels open, the efflux of K + 
rapidly overwhelms the transient influx of Na + and quickly drives the membrane 
back toward the K + equilibrium potential, even before the inactivation of the Na + 
channels is complete. 

Like the Na + channel, the voltage-gated K + channels automatically inacti- 
vate. Studies of mutant voltage-gated K + channels show that the N-terminal 20 
amino acids of the channel protein are required for rapid inactivation: altering 
this region changes the kinetics of channel inactivation, and removal of the 
region abolishes inactivation. Amazingly, in the latter case, exposing the cyto- 
plasmic face of the plasma membrane to a small synthetic peptide correspond- 
ing to the missing N-terminus restores inactivation. These findings suggest that 
the N-terminus of each K + channel subunit acts like a tethered ball that occludes 
the cytoplasmic end of the pore soon after it opens, thereby inactivating the 
channel (Figure 11-31). A similar mechanism is thought to operate in the rapid 
inactivation of voltage-gated Na + channels (which we discuss later), although a 
different segment of the protein seems to be involved. 

The electrochemical mechanism of the action potential was first established 
by a famous series of experiments carried out in the 1940s and 1950s. Because 
the techniques for studying electrical events in small cells had not yet been 
developed, the experiments exploited the giant neurons in the squid. Despite 
the many technical advances made since then, the logic of the original analysis 
continues to serve as a model for present-day work. Panel 11-3 summarizes 
some of the key original experiments. 



Myelination Increases the Speed and Efficiency of Action 
Potential Propagation in Nerve Cells 

The axons of many vertebrate neurons are insulated by a myelin sheath, which 
greatly increases the rate at which an axon can conduct an action potential. The 
importance of myelination is dramatically demonstrated by the demyelinating 
disease multiple sclerosis, in which the immune system destroys myelin sheaths 
in some regions of the central nervous system; in the affected regions, the prop- 
agation of nerve impulses is greatly slowed, often with devastating neurological 
consequences. 

Myelin is formed by specialized supporting cells, called glial cells. Schwann 
cells myelinate axons in peripheral nerves, and oligodendrocytes do so in the 
central nervous system. These glial cells wrap layer upon layer of their own 
plasma membrane in a tight spiral around the axon (Figure 11-32A and B), 
thereby insulating the axonal membrane so that little current can leak across it. 
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Figure 11-31 The"ball-and-chain" 
model of rapid inactivation of a voltage- 
gated K + channel. When the membrane 
is depolarized, the channel opens and 
begins to conduct ions. If the 
depolarization is maintained, the open 
channel adopts an inactive conformation, 
in which the pore is occluded by the N- 
terminal 20 amino acid "ball," which is 
linked to the channel proper by a 
segment of unfolded polypeptide chain 
that serves as the "chain." For simplicity, 
only two balls are shown; in fact, there 
are four, one from each subunit. A similar 
mechanism, using a different segment of 
the polypeptide chain, is thought to 
operate in Na + channel inactivation. 
Internal forces stabilize each state against 
small disturbances, but a sufficiently 
violent collision with other molecules can 
cause the channel to flip from one of 
these states to another. The state of 
lowest energy depends on the 
membrane potential because the 
different conformations have different 
charge distributions. When the 
membrane is at rest (highly polarized), 
the closed conformation has the lowest 
free energy and is therefore most stable; 
when the membrane is depolarized, the 
energy of the open conformation is lower, 
so the channel has a high probability of 
opening. But the free energy of the 
inactivated conformation is lower still; 
therefore, after a randomly variable 
period spent in the open state, the 
channel becomes inactivated. Thus, the 
open conformation corresponds to a 
metastable state that can exist only 
transiently. The red arrows indicate the 
sequence that follows a sudden 
depolarization; the black arrow indicates 
the return to the original conformation as 
the lowest energy state after the 
membrane is reoolarized. 



PANEL 11-3: Some Classical Experiments on the Squid Giant Axon 



1. Action potentials are recorded with an intracellular electrode 

The squid giant axon is about 0.5-1 mm in diameter and several centi- 
meters long. An electrode in the form of a glass capillary tube containing 
a conducting solution can be thrust down the axis of the axon so that its 
tip lies deep in the cytoplasm. With its help, one can measure the voltage . 
difference between the inside and the outside of the axon— that is, the 
membrane potential— as an action potential sweeps past the electrode. 
The action potential is triggered by a brief electrical stimulus to one end 
of the axon. It does not matter which end, because the excitation can 
travel in either direction; and it does not matter how big the stimulus is, 
as long as it exceeds a certain threshold: the action potential is all or none 
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2. Action potentials depend only on the neuronal plasma 
membrane and on gradients of Na + and K + across it 

The three most plentiful ions, both inside and outside the axon, 
are Na + , K + , and CI". As in other cells, the Na + -K + pump maintains 
a concentration gradient: the concentration of Na + is about 9 times 
lower inside the axon than outside, while the concentration of K + 
is about 20 times higher inside than outside. Which ions are 
important for the action potential? 

The squid giant axon is so large and robust that it is possible to 
extrude the gel-like cytoplasm from it, like toothpaste from a tube, 
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and then to perfuse it internally with pure artificial solutions 
of Na + , K + , and CI" or S0 4 2 ~. Remarkably, if (and only if) the 
concentrations of Na + and K + inside and outside approximate 
those found naturally, the axon will still propagate action 
potentials of the normal form. The important part of the cell 
for electrical signaling, therefore, must be the plasma 
membrane; the important ions are Na + and K + ; and a sufficient 
source of free energy to power the action potential must be 
provided by the concentration gradients of these ions across 
the membrane, because all other sources of metabolic energy 
have presumably been removed by the perfusion. 
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At rest, the membrane is chiefly permeable to K + ; during the 
action potential, it becomes transiently permeable to Na + 

At rest the membrane potential is close to the equilibrium potential 
for K + . When the external concentration of K + is changed, the 
resting potential changes roughly in accordance with the Nernst 
equation for K + (see Panel 11-2). At rest, therefore, the membrane 
is chiefly permeable to K + : K + leak channels provide the main ion 
pathway through the membrane. 

If the external concentration of Na + is varied, there is no effect 
on the resting potential. However, the height of the peak of the 
action potential varies roughly in accordance with the Nernst 
equation for Na + . During the action potential, therefore, the 
membrane appears to be chiefly permeable to Na + : Na + channels 
have opened. In the aftermath of the action potential, the 



membrane potential reverts to a negative value that depends 
on the external concentration of K + and is even closer to 
the K + equilibrium potential than the resting potential is: the 
membrane has lost most of its permeability to Na + and has 
become even more permeable to K + than before— that is, Na + 
channels have closed, and additional K + channels have opened. 



The form of the action 
potential when the 
external medium contains 
100%, 50%, or 33% of the 
normal concentration 
of Na + . 
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4. Voltage clamping reveals how the membrane potential 
controls opening and closing of ion channels 

The membrane potential can be held constant ("voltage 
clamped") throughout the axon by passing a suitable current 
through a bare metal wire inserted along the axis of the axon 
while monitoring the membrane potential with another 
intracellular electrode. When the membrane is abruptly shifted 
from the resting potential and held in a depolarized state (A), 
Na + channels rapidly open until the Na + permeability of the 
membrane is much greater than the K + permeability; they then 
close again spontaneously, even though the membrane potential 
is clamped and unchanging. K + channels also open but with a 
delay, so that the K + permeability increases as the Na + permeability 
falls (B). If the experiment is now very promptly repeated, by 
returning the membrane briefly to the resting potential and then 
quickly depolarizing it again, the response is different: prolonged 
depolarization has caused the Na + channels to enter an inactivated 
state, so that the second depolarization fails to cause a rise and 
fall similar to the first. Recovery from this state requires a 



relatively long time— about 10 milliseconds — spent at the 
repolarized (resting) membrane potential. 

In a normal undamped axon, an inrush of Na + through the 
opened Na + channels produces the spike of the action potential; 
inactivation of Na + channels and opening of K + channels bring 
the membrane rapidly back down to the resting potential. 
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Figure 1 1-32 Myelination. (A) A myelinated axon from a peripheral nerve. Each Schwann cell wraps its 
plasma membrane concentrically around the axon to form a segment of myelin sheath about 1 mm long. 
For clarity, the membrane layers of the myelin in this drawing are shown less compacted than they are in 
reality (see part B). (B) An electron micrograph of a section from a nerve in the leg of a young rat. Two 
Schwann cells can be seen: one near the bottom is just beginning to myelinate its axon; the one above it 
has formed an almost mature myelin sheath. (C) Fluorescence micrograph and diagram of individual 
myelinated axons teased apart in a nerve. Three different proteins are detected by staining with antibodies. 
Voltage-gated Na + channels (stained in green) are concentrated in the axonal membrane at the nodes of 
Ranvier. An extracellular protein (called Caspr, stained in red) marks the end of each myelin sheath. Caspr 
assembles at the junctions where the glial cell plasma membrane tightly abuts the axon to provide the 
electrical seal. Voltage-gated K + channels (stained in blue) localize to regions in the axon plasma membrane 
that are close to the nodes. (B, from Cedric S. Raine, in Myelin [P. Morell, ed.]. New York: Plenum, 1976; 
C, from M.N. Rasband and P. Shrager, J. Physiol. 525:63-73, 2000. With permission from Blackwell 
Publishing.) 



The myelin sheath is interrupted at regularly spaced nodes of Ranvier, where 
almost all the Na + channels in the axon are concentrated (see Figure 11-32C). 
Because the ensheathed portions of the axonal membrane have excellent cable 
properties (in other words, they behave electrically much like well-designed 
underwater telegraph cables), a depolarization of the membrane at one node 
almost immediately spreads passively to the next node. Thus, an action poten- 
tial propagates along a myelinated axon by jumping from node to node, a pro- 
cess called saltatory conduction. This type of conduction has two main advan- 
tages: action potentials travel faster, and metabolic energy is conserved because 
the active excitation is confined to the small regions of axonal plasma mem- 
brane at nodes of Ranvier. 



Patch-Clamp Recording Indicates That Individual Gated Channels 
Open in an AII-or-Nothing Fashion 

Neuron and skeletal muscle cell plasma membranes contain many thousands of 
voltage-gated Na + channels, and the current crossing the membrane is the sum 
of the currents flowing through all of these. An intracellular microelectrode can 
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record this aggregate current, as shown in Figure 1 1-30. Remarkably, however, it 
is also possible to record current flowing through individual channels. Patch- 
clamp recording, developed in the 1970s and 80s has revolutionized the study of 
ion channels, making it possible to examine transport through a single molecule 
of channel protein in a small patch of membrane covering the mouth of a 
micropipette (Figure 11-33). With this simple but powerful technique, one can 
study the detailed properties of ion channels in all sorts of cell types. This work 
has led to the discovery that even cells that are not electrically excitable usually 
have a variety of gated ion channels in their plasma membrane. Many of these 
cells, such as yeasts, are too small to be investigated by the traditional electro- 
physiologist's method of impalement with an intracellular microelectrode. 

Patch-clamp recording indicates that individual voltage-gated Na + channels 
open in an all-or-nothing fashion. A channel opens and closes at random, but 
when open, the channel always has the same large conductance, allowing more 
than 1000 ions to pass per millisecond. Therefore, the aggregate current crossing 
the membrane of an entire cell does not indicate the degree to which a typical 
individual channel is open but rather the total number of channels in its mem- 
brane that are open at any one time (Figure 11-34). 

Some simple physical principles allow us to understand voltage-gating. The 
interior of the resting neuron or muscle cell is at an electrical potential about 
50-100 mVmore negative than the external medium. Although this potential dif- 
ference seems small, it exists across a plasma membrane only about 5 nm thick, 
so that the resulting voltage gradient is about 100,000 V/cm. Proteins in the 



Figure 1 1-33 The technique of patch- 
clamp recording. Because of the 
extremely tight seal between the 
micropipette and the membrane, current 
can enter or leave the micropipette only 
by passing through the channels in the 
patch of membrane covering its tip. The 
term clamp is used because an electronic 
device is employed to maintain, or 
"clamp," the membrane potential at a set 
value while recording the ionic current 
through individual channels. The current 
through these channels can be recorded 
with the patch still attached to the rest of 
the cell, as in (A), or detached, as in (B). 
The advantage of the detached patch is 
that it is easy to alter the composition of 
the solution on either side of the 
membrane to test the effect of various 
solutes on channel behavior. A detached 
patch can also be produced with the 
opposite orientation, so that the 
cytoplasmic surface of the membrane 
faces the inside of the pipette. 



Figure 1 1-34 Patch-clamp measurements for a single voltage-gated Na + 
channel. A tiny patch of plasma membrane was detached from an 
embryonic rat muscle cell, as in Figure 1 1-33. (A) The membrane was 
depolarized by an abrupt shift of potential. (B) Three current records from 
three experiments performed on the same patch of membrane. Each major 
current step in (B) represents the opening and closing of a single channel. 
A comparison of the three records shows that, whereas the durations of 
channel opening and closing vary greatly, the rate at which current flows 
through an open channel is practically constant. The minor fluctuations in 
the current records arise largely from electrical noise in the recording 
apparatus. Current is measured in picoamperes (pA). By convention, the 
electrical potential on the outside of the cell is defined as zero. (C)The sum 
of the currents measured in 144 repetitions of the same experiment. This 
aggregate current is equivalent to the usual Na + current that would be 
observed flowing through a relatively large region of membrane 
containing 144 channels. A comparison of (B) and (C) reveals that the time 
course of the aggregate current reflects the probability that any individual 
channel will be in the open state; this probability decreases with time as 
the channels in the depolarized membrane adopt their inactivated 
conformation. (Data from J. Patlak and R. Horn, J. Gen. Physiol. 79:333-351, 
1982. With permission from The Rockefeller University Press.) 
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membrane are thus subjected to a very large electrical field that can profoundly 
affect their conformation. These proteins, like all others, have many charged 
groups, as well as polarized bonds between their various atoms. The electrical 
field therefore exerts forces on the molecular structure. For many membrane 
proteins the effects of changes in the membrane electrical field are probably 
insignificant, but voltage-gated ion channels can adopt alternative conforma- 
tions whose stabilities depend on the strength of the field. Voltage-gated Na + , K + , 
and Ca 2+ channels, for example, have characteristic positively charged amino 
acids in one of their transmembrane segments that respond to depolarization by 
moving outward, triggering conformational changes that open the channel. 
Each conformation can "flip" to another conformation if given a sufficient jolt by 
the random thermal movements of the surroundings, and it is the relative sta- 
bility of the closed, open, and inactivated conformations against flipping that is 
altered by changes in the membrane potential (see legend to Figure 11-31). 

Voltage-Gated Cation Channels Are Evolutionary and 
Structurally Related 

Na + channels are not the only kind of voltage-gated cation channel that can gen- 
erate an action potential. The action potentials in some muscle, egg, and 
endocrine cells, for example, depend on voltage-gated Ca 2+ channels rather than 
on Na + channels. 

There is a surprising amount of structural and functional diversity within 
each of these three classes, generated both by multiple genes and by the alter- 
native splicing of RNA transcripts produced from the same gene. Nonetheless, 
the amino acid sequences of the known voltage-gated Na + , K + , and Ca 2+ chan- 
nels show striking similarities, demonstrating that they all belong to a large 
superfamily of evolutionary and structurally related proteins and share many 
of the design principles. Whereas the single-celled yeast S. cerevisiae contains a 
single gene that codes for a voltage-gated K + channel, the genome of the worm 
C. elegans contains 68 genes that encode different but related K + channels. This 
complexity indicates that even a simple nervous system made up of only 302 
neurons uses a large number of different ion channels to compute its responses. 

Humans who inherit mutant genes encoding ion channel proteins can suf- 
fer from a variety of nerve, muscle, brain, or heart diseases, depending in which 
cells the channel encoded by the mutant gene normally functions. Mutations in 
genes that encode voltage-gated Na + channels in skeletal muscle cells, for exam- 
ple, can cause myotonia, a condition in which there is a delay in muscle relax- 
ation after voluntary contraction, causing painful muscle spasms. In some 
cases, this occurs because the abnormal channels fail to inactivate normally; as 
a result, Na + entry persists after an action potential finishes and repeatedly reini- 
tiates membrane depolarization and muscle contraction. Similarly, mutations 
that affect Na + or K + channels in the brain can cause epilepsy, in which excessive 
synchronized firing of large groups of neurons cause epileptic seizures (convul- 
sions, or fits). 

Transmitter-Gated Ion Channels Convert Chemical Signals into 
Electrical Ones at Chemical Synapses 

Neuronal signals are transmitted from cell to cell at specialized sites of contact 
known as synapses. The usual mechanism of transmission is indirect. The cells 
are electrically isolated from one another, the presynaptic cell being separated 
from the postsynaptic cell by a narrow synaptic cleft. A change of electrical 
potential in the presynaptic cell triggers it to release small signal molecules 
known as neurotransmitters, which are stored in membrane-enclosed synaptic 
vesicles and released by exocytosis. The neurotransmitter diffuses rapidly across 
the synaptic cleft and provokes an electrical change in the postsynaptic cell by 
binding to transmitter- gated ion channels (Figure 11-35) and opening them. 
After the neurotransmitter has been secreted, it is rapidly removed: it is either 
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destroyed by specific enzymes in the synaptic cleft or taken up by the nerve ter- 
minal that released it or by surrounding glial cells. Reuptake is mediated by a 
variety of Na + -dependent neurotransmitter transporters; in this way neuro- 
transmitters are recycled, allowing cells to keep up with high rates of release. 
Rapid removal ensures both spatial and temporal precision of signaling at a 
synapse. It decreases the chances that the neurotransmitter will influence 
neighboring cells, and it clears the synaptic cleft before the next pulse of neuro- 
transmitter is released, so that the timing of repeated, rapid signaling events can 
be accurately communicated to the postsynaptic cell. As we shall see, signaling 
via such chemical synapses is far more versatile and adaptable than direct elec- 
trical coupling via gap junctions at electrical synapses (discussed in Chapter 19), 
which are also used by neurons but to a much smaller extent. 

Transmitter-gated ion channels are specialized for rapidly converting 
extracellular chemical signals into electrical signals at chemical synapses. The 
channels are concentrated in the plasma membrane of the postsynaptic cell in 
the region of the synapse and open transiently in response to the binding of neu- 
rotransmitter molecules, thereby producing a brief permeability change in the 
membrane (see Figure 11-35A). Unlike the voltage-gated channels responsible 
for action potentials, transmitter-gated channels are relatively insensitive to the 
membrane potential and therefore cannot by themselves produce a self-ampli- 
fying excitation. Instead, they produce local permeability changes, and hence 
changes of membrane potential, that are graded according to the amount of 
neurotransmitter released at the synapse and how long it persists there. An 
action potential can be triggered from this site only if the local membrane 
potential depolarizes enough to open a sufficient number of nearby voltage- 
gated cation channels that are present in the same target cell membrane. 



Figure 1 1 -35 A chemical synapse. 

<CTGA> (A) When an action potential 
reaches the nerve terminal in a 
presynaptic cell, it stimulates the terminal 
to release its neurotransmitter. The 
neurotransmitter molecules are 
contained in synaptic vesicles and are 
released to the cell exterior when the 
vesicles fuse with the plasma membrane 
of the nerve terminal. The released 
neurotransmitter binds to and opens the 
transmitter-gated ion channels 
concentrated in the plasma membrane of 
the postsynaptic target cell at the 
synapse. The resulting ion flows alter the 
membrane potential of the target cell, 
thereby transmitting a signal from the 
excited nerve. (B) A thin section electron 
micrograph of two nerve terminal 
synapses on a dendrite of a postsynaptic 
cell. (B, courtesy of Cedric Raine.) 
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Chemical Synapses Can Be Excitatory or Inhibitory 

Transmitter-gated ion channels differ from one another in several important 
ways. First, as receptors, they have a highly selective binding site for the neuro- 
transmitter that is released from the presynaptic nerve terminal. Second, as 
channels, they are selective in the type of ions that they let pass across the 
plasma membrane; this determines the nature of the postsynaptic response. 
Excitatory neurotransmitters open cation channels, causing an influx of Na + 
that depolarizes the postsynaptic membrane toward the threshold potential for 
firing an action potential. Inhibitory neurotransmitters, by contrast, open 
either CI" channels or K + channels, and this suppresses firing by making it harder 
for excitatory influences to depolarize the postsynaptic membrane. Many trans- 
mitters can be either excitatory or inhibitory, depending on where they are 
released, what receptors they bind to, and the ionic conditions that they 
encounter. Acetylcholine, for example, can either excite or inhibit, depending on 
the type of acetylcholine receptors it binds to. Usually, however, acetylcholine, 
glutamate, and serotonin are used as excitatory transmitters, and y-aminobu- 
tyric acid (GABA) and glycine are used as inhibitory transmitters. Glutamate, for 
instance, mediates most of the excitatory signaling in the vertebrate brain. 

We have already discussed how the opening of cation channels depolarizes 
a membrane. We can understand the effect of opening Cl~ channels as follows. 
The concentration of CI" is much higher outside the cell than inside (see Table 
11-1, p. 652), but the membrane potential opposes its influx. In fact, for many 
neurons, the equilibrium potential for CI" is close to the resting potential — or 
even more negative. For this reason, opening CI" channels tends to buffer the 
membrane potential; as the membrane starts to depolarize, more negatively 
charged CI" ions enter the cell and counteract the depolarizatrion. Thus, the 
opening of CI" channels makes it more difficult to depolarize the membrane and 
hence to excite the cell. The opening of K + channels has a similar effect. The 
effects of toxins that block their action demonstrate the importance of 
inhibitory neurotransmitters: strychnine, for example, by binding to glycine 
receptors and blocking the inhibitory action of glycine, causes muscle spasms, 
convulsions, and death. 

However, not all chemical signaling in the nervous system operates through 
ligand-gated ion channels. Many of the signaling molecules that are secreted by 
nerve terminals, including a large variety of neuropeptides, bind to receptors 
that regulate ion channels only indirectly. We discuss these so-called G-protein- 
coupled receptors and enzyme-coupled receptors in detail in Chapter 15. Whereas 
signaling mediated by excitatory and inhibitory neurotransmitters binding to 
transmitter-gated ion channels is generally immediate, simple, and brief, signal- 
ing mediated by ligands binding to G-protein- coupled receptors and enzyme- 
coupled receptors tends to be far slower and more complex, and longer lasting 
in its consequences. 



The Acetylcholine Receptors at the Neuromuscular Junction Are 
Transmitter-Gated Cation Channels 

The best-studied example of a transmitter-gated ion channel is the acetylcholine 
receptor of skeletal muscle cells. This channel is opened transiently by acetyl- 
choline released from the nerve terminal at a neuromuscular junction — the 
specialized chemical synapse between a motor neuron and a skeletal muscle 
cell (Figure 11-36). This synapse has been intensively investigated because it is 
readily accessible to electrophysiological study, unlike most of the synapses in 
the central nervous system. 

The acetylcholine receptor has a special place in the history of ion channels. 
It was the first ion channel to be purified, the first to have its complete amino acid 
sequence determined, the first to be functionally reconstituted in synthetic lipid 
bilayers, and the first for which the electrical signal of a single open channel was 
recorded. Its gene was also the first ion channel gene to be cloned and 
sequenced, and its three-dimensional structure has been determined, albeit at 



muscle cell 



myelinated axon nerve 




body of 
Schwann ceft 



axon terminals 



10 Mm 



Figure 1 1 -36 A low-magnification 
scanning electron micrograph of a 
neuromuscular junction in a frog. The 

termination of a single axon on a skeletal 
muscle cell is shown. {From J. Desaki and 
Y. Uehara, J. Neurocytol. 10:101-110,1981. 
With permission from Kluwer Academic 
Publishers.) 
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only moderate resolution. There were at least two reasons for the rapid progress 
in purifying and characterizing this receptor. First, an unusually rich source of 
the acetylcholine receptors exists in the electric organs of electric fish and rays 
(these organs are modified muscles designed to deliver a large electric shock to 
prey). Second, certain neurotoxins (such as a-bungarotoxin) in the venom of 
certain snakes bind with high affinity (K& - 10 9 liters/mole) and specificity to the 
receptor and can therefore be used to purify it by affinity chromatography. Flu- 
orescent or radiolabeled a-bungarotoxin can also be used to localize and count 
acetylcholine receptors. In this way, researchers have shown that the receptors 
are densely packed in the muscle cell plasma membrane at a neuromuscular 
junction (about 20,000 such receptors per u;m 2 ), with relatively few receptors 
elsewhere in the same membrane. 

The acetylcholine receptor of skeletal muscle is composed of five transmem- 
brane polypeptides, two of one kind and three others, encoded by four separate 
genes. The four genes are strikingly similar in sequence, implying that they 
evolved from a single ancestral gene. The two identical polypeptides in the pen- 
tamer each contribute to one of two binding sites for acetylcholine that are nes- 
tled between adjoining subunits. When two acetylcholine molecules bind to the 
pentameric complex, they induce a conformational change: the helices that line 
the pore rotate to disrupt a ring of hydrophobic amino acids that blocks ion flow 
in the closed state. With ligand bound, the channel still flickers between open and 
closed states, but now it has a 90% probability of being open. This state contin- 
ues until hydrolysis by a specific enzyme (acetylcholinesterase) located at the neu- 
romuscular junction lowers the concentration of acetylcholine sufficiently. Once 
freed of its bound neurotransmitter, the acetylcholine receptor reverts to its ini- 
tial resting state. If the presence of acetylcholine persists for a prolonged time as 
a result of excessive nerve stimulation, the channel inactivates (Figure 11-37). 

The general shape of the acetylcholine receptor and the likely arrangement 
of its subunits have been determined by electron microscopy (Figure 11-38). 
The five subunits are arranged in a ring, forming a water-filled transmembrane 
channel that consists of a narrow pore through the lipid bilayer, which widens 
into vestibules at both ends. Clusters of negatively charged amino acids at either 
end of the pore help to exclude negative ions and encourage any positive ion of 
diameter less than 0.65 nm to pass through. The normal traffic consists chiefly 
of Na + and K + , together with some Ca 2+ . Thus, unlike voltage-gated cation chan- 
nels, such as the K + channel discussed earlier, there is little selectivity among 
cations, and the relative contributions of the different cations to the current 
through the channel depend chiefly on their concentrations and on the electro- 
chemical driving forces. When the muscle cell membrane is at its resting poten- 
tial, the net driving force for K + is near zero, since the voltage gradient nearly bal- 
ances the K + concentration gradient across the membrane (see Panel 11-2, 
p. 670). For Na + , in contrast, the voltage gradient and the concentration gradient 
both act in the same direction to drive the ion into the cell. (The same is true for 
Ca 2+ , but the extracellular concentration of Ca 2+ is so much lower than that of 
Na + that Ca 2+ makes only a small contribution to the total inward current.) 
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Figure 1 1 -37 Three conformations of 
the acetylcholine receptor. The binding 
of two acetylcholine molecules opens 
this transmitter-gated ion channel. It 
then maintains a high probability of 
being open until the acetylcholine has 
been hydrolyzed. In the persistent 
presence of acetylcholine, however, the 
channel inactivates (desensitizes). 
Normally, the acetylcholine is rapidly 
hydrolyzed and the channel closes within 
about 1 millisecond, well before 
significant desensitization occurs. 
Desensitization would occur after about 
20 milliseconds in the continued 
presence of acetylcholine. 
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Therefore, the opening of the acetylcholine receptor channels leads to a large 
net influx of Na + (a peak rate of about 30,000 ions per channel each millisecond) . 
This influx causes a membrane depolarization that signals the muscle to con- 
tract, as discussed below. 



Transmitter-Gated Ion Channels Are Major Targets for 
Psychoactive Drugs 

The ion channels that open directly in response to the neurotransmitters acetyl- 
choline, serotonin, GABA, and glycine contain subunits that are structurally 
similar and probably form transmembrane pores in the same way even though 
they have distinct neurotransmitter-binding specificities and ion selectivities. 
These channels are all built from homologous polypeptide subunits, which 
probably assemble as a pentamer resembling the acetylcholine receptor. Gluta- 
mate-gated ion channels are constructed from a distinct family of subunits and 
are thought to form tetramers resembling the K + channels discussed earlier. 

For each class of transmitter-gated ion channel, there are alternative forms of 
each type of subunit, either encoded by distinct genes or generated by alternative 
RNA splicing of the same gene product. The subunits assemble in different com- 
binations to form an extremely diverse set of distinct channel subtypes, with dif- 
ferent ligand affinities, different channel conductances, different rates of opening 
and closing, and different sensitivities to drugs and toxins. Vertebrate neurons, 
for example, have acetylcholine -gated ion channels that differ from those of mus- 
cle cells in that they are usually formed from two subunits of one type and three 
of another; but there are at least nine genes coding for different versions of the 
first type of subunit and at least three coding for different versions of the second, 
with further diversity due to alternative RNA splicing. Subsets of acetylcholine - 
sensitive neurons performing different functions in the brain express different 
combinations of these subunits. This, in principle, and already to some extent in 
practice, makes it possible to design drugs targeted against narrowly defined 
groups of neurons or synapses, thereby specifically influencing particular brain 
functions. 

Indeed, transmitter- gated ion channels have for a long time been important 
targets for drugs. A surgeon, for example, can relax muscles for the duration of 
an operation by blocking the acetylcholine receptors on skeletal muscle cells 
with curare, a drug from a plant that was originally used by South American Indi- 
ans to make poison arrows. Most drugs used to treat insomnia, anxiety, depres- 
sion, and schizophrenia exert their effects at chemical synapses, and many of 
these act by binding to transmitter- gated channels. Both barbiturates and tran- 
quilizers, such as Valium and Librium, for example, bind to GABA receptors, 
potentiating the inhibitory action of GABA by allowing lower concentrations of 
this neurotransmitter to open CI" channels. The new molecular biology of ion 



Figure 11-38 A model for the structure 
of the acetylcholine receptor. (A) Five 
homologous subunits (a, a, p, y, 5) 
combine to form a transmembrane 
aqueous pore. The pore is lined by a ring 
of five transmembrane a helices, one 
contributed by each subunit. In its closed 
conformation, the pore is thought to be 
occluded by the hydrophobic side chains 
of five leucines, one from each a helix, 
which form a gate near the middle of the 
lipid bilayer. The negatively charged side 
chains at either end of the pore ensure 
that only positively charged ions pass 
through the channel. (B) Both of the 
a subunits contribute to an 
acetylcholine-binding site nestled 
between adjoining subunits; when 
acetylcholine binds to both sites, the 
channel undergoes a conformational 
change that opens the gate, possibly by 
rotating the helices containing the 
occluding leucines to move outward. In 
the structural drawing (right), the parts of 
the channel that move in response to 
AChR binding to open the pore are 
colored in blue. (Adapted from N. Unwin, 
Ce//72[Suppl.]:31-41, 1993. With 
permission from Elsevier.) 
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channels, by revealing both their diversity and the details of their structure, 
holds out the hope of designing a new generation of psychoactive drugs that will 
act still more selectively to alleviate the miseries of mental illness. 

In addition to ion channels, many other components of the synaptic signal- 
ing machinery are potential targets for psychoactive drugs. As mentioned ear- 
lier, after release into the synaptic cleft, many neurotransmitters are cleared by 
reuptake mechanisms mediated by Na + -driven transporters. The inhibition of 
such a transporter prolongs the effect of the transmitter and thereby strengthens 
synaptic transmission. Many antidepressant drugs, including Prozac, for exam- 
ple, inhibit the uptake of serotonin; others inhibit the uptake of both serotonin 
and norepinephrine. 

Ion channels are the basic molecular components from which neuronal 
devices for signaling and computation are built. To provide a glimpse of how 
sophisticated the functions of these devices can be, we consider several exam- 
ples that demonstrate how groups of ion channels work together in synaptic 
communication between electrically excitable cells. 



Neuromuscular Transmission Involves the Sequential Activation 
of Five Different Sets of Ion Channels 
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The following process, in which a nerve impulse stimulates a muscle cell to con- 
tract, illustrates the importance of ion channels to electrically excitable cells. 
This apparently simple response requires the sequential activation of at least 
five different sets of ion channels, all within a few milliseconds (Figure 11-39). 

1 . The process is initiated when the nerve impulse reaches the nerve terminal 
and depolarizes the plasma membrane of the terminal. The depolarization 
transiently opens voltage-gated Ca 2+ channels in this membrane. As the 
Ca 2+ concentration outside cells is more than 1000 times greater than the 
free Ca 2+ concentration inside, Ca 2+ flows into the nerve terminal. The 
increase in Ca 2+ concentration in the cytosol of the nerve terminal triggers 
the local release of acetylcholine into the synaptic cleft. 

2. The released acetylcholine binds to acetylcholine receptors in the muscle 
cell plasma membrane, transiently opening the cation channels associated 
with them. The resulting influx of Na + causes a local membrane depolar- 
ization. 

3. The local depolarization of the muscle cell plasma membrane opens volt- 
age-gated Na + channels in this membrane, allowing more Na + to enter, 
which further depolarizes the membrane. This, in turn, opens neighboring 
voltage-gated Na + channels and results in a self-propagating depolariza- 
tion (an action potential) that spreads to involve the entire plasma mem- 
brane (see Figure 11-30). 



Figure 1 1-39 The system of ion 
channels at a neuromuscular junction. 

These gated ion channels are essential for 
the stimulation of muscle contraction by 
a nerve impulse. The various channels are 
numbered in the sequence in which they 
are activated, as described in the text. 
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4. The generalized depolarization of the muscle cell plasma membrane acti- 
vates voltage-gated Ca 2+ channels in specialized regions (the transverse [T] 
tubules — discussed in Chapter 16) of this membrane. 

5. This, in turn, causes Ca 2+ -gated Ca 2+ release channels in an adjacent region 
of the sarcoplasmic reticulum (SR) membrane to open transiently and 
release the Ca 2+ stored in the SR into the cytosol. The T-tubule and SR mem- 
branes are closely apposed with the two types of channels joined together in 
a specialized structure (see Figure 16-77). It is the sudden increase in the 
cytosolic Ca 2+ concentration that causes the myofibrils in the muscle cell to 
contract. 

Whereas the activation of muscle contraction by a motor neuron is complex, 
an even more sophisticated interplay of ion channels is required for a neuron to 
integrate a large number of input signals at synapses and compute an appropri- 
ate output, as we now discuss. 



Single Neurons Are Complex Computation Devices 

In the central nervous system, a single neuron can receive inputs from thou- 
sands of other neurons, and can in turn form synapses with many thousands of 
other cells. Several thousand nerve terminals, for example, make synapses on an 
average motor neuron in the spinal cord; its cell body and dendrites are almost 
completely covered with them (Figure 1 1-40). Some of these synapses transmit 
signals from the brain or spinal cord; others bring sensory information from 
muscles or from the skin. The motor neuron must combine the information 
received from all these sources and react either by firing action potentials along 
its axon or by remaining quiet. 

Of the many synapses on a neuron, some tend to excite it, others to inhibit it. 
Neurotransmitter released at an excitatory synapse causes a small depolarization 
in the postsynaptic membrane called an excitatory postsynaptic potential (excita- 
tory PSP), while neurotransmitter released at an inhibitory synapse generally 
causes a small hyperpolarization called an inhibitory PSP The membrane of the 
dendrites and cell body of most neurons contains a relatively low density of volt- 
age-gated Na + channels, and an individual excitatory PSP is generally too small to 



Figure 1 1-40 A motor neuron cell body in 
the spinal cord. (A) Many thousands of 
nerve terminals synapse on the cell body 
and dendrites. These deliver signals from 
other parts of the organism to control the 
firing of action potentials along the single 
axon of this large cell. (B) Micrograph 
showing a nerve cell body and its dendrites 
stained with a fluorescent antibody that 
recognizes a cytoskeletal protein (green). 
Thousands of axon terminals (red) from 
other nerve cells (not visible) make 
synapses on the cell body and dendrites; 
they are stained with a fluorescent antibody 
that recognizes a protein in synaptic 
vesicles. (B, courtesy of Olaf Mundigl and 
PietrodeCamilli.) 
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Figure 1 1 -41 The magnitude of the combined postsynaptic potential (PSP) is reflected in the frequency of firing of 
action potentials. When successive action potentials arrive at the same synapse, each PSP produced adds to the preceding 
one to produce a larger combined PSP. A comparison of (A) and (B) shows how the firing frequency of an axon increases 
with an increase in the combined PSP, while (C) summarizes the general relationship. 

trigger an action potential. Instead, each incoming signal is reflected in a local 
PSP of graded magnitude, which decreases with distance from the site of the 
synapse. If signals arrive simultaneously at several synapses in the same region of 
the dendritic tree, the total PSP in that neighborhood will be roughly the sum of 
the individual PSPs, with inhibitory PSPs making a negative contribution to the 
total. The PSPs from each neighborhood spread passively and converge on the 
cell body. For long-distance transmission, the combined magnitude of the PSP is 
then translated, or encoded, into the frequency of firing of action potentials (Fig- 
ure 11-41). This encoding is achieved by a special set of gated ion channels that 
are present at high density at the base of the axon, adjacent to the cell body, in a 
region known as the initial segment, or axon hillock (see Figure 11-40). 



Neuronal Computation Requires a Combination of at Least Three 
Kinds of K + Channels 

We have seen that the intensity of stimulation a neuron receives is encoded for 
long-distance transmission by the frequency of action potentials that the neuron 
fires: the stronger the stimulation, the higher the frequency of action potentials. 
Action potentials are initiated at the initial segment, a unique region of each 
neuron with plentiful voltage-gated Na + channels. But to perform its special 
function of encoding, the membrane of the initial segment also contains at least 
four other classes of ion channels — three selective for K + and one selective for 
Ca 2+ . The three varieties of K + channels have different properties; we shall refer to 
them as delayed, rapidly inactivating, and Ca 2+ -activated K + channels. 

To understand the need for multiple types of channels, consider first what 
would happen if the only voltage -gated ion channels present in the nerve cell 
were the Na + channels. Below a certain threshold level of synaptic stimulation, 
the depolarization of the initial segment membrane would be insufficient to 
trigger an action potential. With gradually increasing stimulation, the threshold 
would be crossed, the Na + channels would open, and an action potential would 
fire. The action potential would be terminated in the usual way by inactivation 
of the Na + channels. Before another action potential could fire, these channels 
would have to recover from their inactivation. But that would require a return of 
the membrane voltage to a very negative value, which would not occur as long 
as the strong depolarizing stimulus (from PSPs) was maintained. An additional 
channel type is needed, therefore, to repolarize the membrane after each action 
potential to prepare the cell to fire again. 

The delayed K + channels perform this task, as discussed previously in rela- 
tion to the propagation of the action potential (see p. 677). They are voltage- 
gated, but because of their slower kinetics they open only during the falling phase 
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of the action potential, when the Na + channels are inactive. Their opening per- 
mits an efflux of K + that drives the membrane back toward the K + equilibrium 
potential, which is so negative that the Na + channels rapidly recover from their 
inactivated state. Repolarization of the membrane also closes the delayed K + 
channels. The initial segment is now reset so that the depolarizing stimulus from 
synaptic inputs can fire another action potential. In this way sustained stimula- 
tion of the dendrites and cell body leads to repetitive firing of the axon. 

Repetitive firing in itself, however, is not enough. The frequency of the firing 
has to reflect the intensity of the stimulation, and a simple system of Na + channels 
and delayed K + channels is inadequate for this purpose. Below a certain threshold 
level of steady stimulation, the cell will not fire at all; above that threshold level, it 
will abruptly begin to fire at a relatively rapid rate. The rapidly inactivating K + 
channels solve the problem. These, too, are voltage-gated and open when the 
membrane is depolarized, but their specific voltage sensitivity and kinetics of 
inactivation are such that they act to reduce the rate of firing at levels of stimula- 
tion that are only just above the threshold required for firing. Thus, they remove 
the discontinuity in the relationship between the firing rate and the intensity of 
stimulation. The result is a firing rate that is proportional to the strength of the 
depolarizing stimulus over a very broad range (see Figure 11-41C). 

The process of encoding is usually further modulated by the two other types 
of ion channels in the initial segment that were mentioned at the outset, namely 
voltage-gated Ca 2+ channels and Ca 2+ -activated K + channels. They act together 
to decrease the response of the cell to an unchanging, prolonged stimulation — 
a process called adaptation. These Ca 2+ channels are similar to the Ca 2+ chan- 
nels that mediate the release of neurotransmitter from presynaptic axon termi- 
nals; they open when an action potential fires, transiently allowing Ca 2+ into the 
initial segment. 

The Ca 2+ -activated K + channel is both structurally and functionally differ- 
ent from any of the channel types described earlier. It opens in response to a 
raised concentration of Ca 2+ at the cytoplasmic face of the nerve cell membrane. 
Suppose we apply a strong depolarizing stimulus for a long time, triggering a 
long train of action potentials. Each action potential permits a brief influx of 
Ca 2+ through the voltage-gated Ca 2+ channels, so that the intracellular Ca 2+ con- 
centration gradually builds up to a level high enough to open the Ca 2+ - activated 
K + channels. Because the resulting increased permeability of the membrane to 
K + makes the membrane harder to depolarize, it increases the delay between 
one action potential and the next. In this way, a neuron that is stimulated con- 
tinuously for a prolonged period becomes gradually less responsive to the con- 
stant stimulus. 

Such adaptation, which can also occur by other mechanisms, allows a neu- 
ron — indeed, the nervous system generally — to react sensitively to change, even 
against a high background level of steady stimulation. It is one of the strategies 
that help us, for example, to feel a light touch on the shoulder and yet ignore the 
constant pressure of our clothing. We discuss adaptation as a general feature in 
cell signaling processes in more detail in Chapter 15. 

Other neurons do different computations, reacting to their synaptic inputs 
in myriad ways, reflecting the different assortments of members of the various 
ion channel families that reside in their membranes. There are several hundred 
genes that code for ion channels in the human genome, with over 150 encoding 
voltage-gated channels alone. Further complexity is introduced by alternative 
splicing of RNAs and assembling channels from different combinations of 
diverse subunits. The multiplicity of ion channels evidently allows for many dif- 
ferent types of neurons, the electrical behavior of which is specifically tuned to 
the particular tasks that they must perform. 

One of the crucial properties of the nervous system is its ability to learn and 
remember, which seems to depend largely on long-term changes in specific 
synapses. We end this chapter by considering a remarkable type of ion channel 
that is thought to have a special role in some forms of learning and memory. It 
is located at many synapses in the central nervous system, where it is gated by 
both voltage and the excitatory neurotransmitter glutamate. It is also the site of 
action of the psychoactive drug phencyclidine, or angel dust. 
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Long-Term Potentiation (LTP) in the Mammalian Hippocampus 
Depends on Ca 2+ Entry Through NMDA-Receptor Channels 

Practically all animals can learn, but mammals seem to learn exceptionally well 
(or so we like to think). In a mammal's brain, the region called the hippocampus 
has a special role in learning. When it is destroyed on both sides of the brain, the 
ability to form new memories is largely lost, although previous long-established 
memories remain. Correspondingly, some synapses in the hippocampus show 
marked functional alterations with repeated use: whereas occasional single 
action potentials in the presynaptic cells leave no lasting trace, a short burst of 
repetitive firing causes long-term potentiation (LTP), such that subsequent sin- 
gle action potentials in the presynaptic cells evoke a greatly enhanced response 
in the postsynaptic cells. The effect lasts hours, days, or weeks, according to the 
number and intensity of the bursts of repetitive firing. Only the synapses that 
were activated exhibit LTP; synapses that have remained quiet on the same post- 
synaptic cell are not affected. However, while the cell is receiving a burst of 
repetitive stimulation via one set of synapses, if a single action potential is deliv- 
ered at another synapse on its surface, that latter synapse also will undergo LTP, 
even though a single action potential delivered there at another time would 
leave no such lasting trace. 

The underlying rule in such synapses seems to be that LTP occurs on any occa- 
sion when a presynaptic cell fires (once or more) at a time when the postsynaptic 
membrane is strongly depolarized (either through recent repetitive firing of the 
same presynaptic cell or by other means). This rule reflects the behavior of a par- 
ticular class of ion channels in the postsynaptic membrane. Glutamate is the main 
excitatory neurotransmitter in the mammalian central nervous system, and gluta- 
mate-gated ion channels are the most common of all transmitter-gated channels 
in the brain. In the hippocampus, as elsewhere, most of the depolarizing current 
responsible for excitatory PSPs is carried by glutamate-gated ion channels, called 
AMPA receptors, that operate in the standard way. But the current has, in addition, 
a second and more intriguing component, which is mediated by a separate sub- 
class of glutamate-gated ion channels known as NMDA receptors, so named 
because they are selectively activated by the artificial glutamate analog N-methyl- 
D-aspartate. The NMDA- receptor channels are doubly gated, opening only when 
two conditions are satisfied simultaneously: glutamate must be bound to the 
receptor, and the membrane must be strongly depolarized. The second condition 
is required for releasing the Mg 2+ that normally blocks the resting channel. This 
means that NMDA receptors are normally activated only when AMPA receptors 
are activated as well and depolarize the membrane. The NMDA receptors are crit- 
ical for LTP. When they are selectively blocked with a specific inhibitor, or in trans- 
genic animals in which the gene has been knocked out, LTP does not occur, even 
though ordinary synaptic transmission continues. Such animals exhibit specific 
deficits in their learning abilities but behave almost normally otherwise. 

How do NMDA receptors mediate such a remarkable effect? The answer is that 
these channels, when open, are highly permeable to Ca 2+ , which acts as an intra- 
cellular mediator in the postsynaptic cell, triggering a cascade of changes that are 
responsible for LTP Thus, LTP is prevented when Ca 2+ levels are held artificially low 
in the postsynaptic cell by injecting the Ca 2+ chelator EGTA into it, and LTP can be 
induced by artificially raising intracellular Ca 2+ levels. Among the long-term 
changes that increase the sensitivity of the postsynaptic cell to glutamate is the 
insertion of new AMPA receptors into the plasma membrane (Figure 11-42). Evi- 
dence also indicates that changes can occur in the presynaptic cell as well, so that 
it releases more glutamate than normal when it is activated subsequently 

If synapses expressed only LTP they would quickly become saturated and, 
thus, be of limited value as an information storage device. In fact, synapses also 
exhibit long-term depression (LTD), which surprisingly also requires NMDA 
receptor activation and a rise in Ca 2+ . How does Ca 2+ trigger opposite effects at 
the same synapse? It turns out that this bidirectional control of synaptic 
strength depends on the magnitude of the rise in Ca 2+ : high Ca 2+ levels activate 
protein kinases and LTP, whereas modest Ca 2+ levels activate protein phos- 
phatases and LTD. 
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There is evidence that NMDA receptors have an important role in learning 
and related phenomena in other parts of the brain, as well as in the hippocam- 
pus. In Chapter 21 we see, moreover, that NMDA receptors have a crucial role in 
adjusting the anatomical pattern of synaptic connections in the light of experi- 
ence during the development of the nervous system. 

Thus, neurotransmitters released at synapses, besides relaying transient 
electrical signals, can also alter concentrations of intracellular mediators that 
bring about lasting changes in the efficacy of synaptic transmission. However, it 
is still uncertain how these changes endure for weeks, months, or a lifetime in 
the face of the normal turnover of cell constituents. 

Some of the ion channels that we have discussed are summarized in Table 
11-2. 



Summary 

Ion channels form aqueous pores across the lipid hilayer and allow inorganic ions of 
appropriate size and charge to cross the membrane down their electrochemical gradi- 
ents at rates about 1000 times greater than those achieved by any known transporter 
The channels are "gated" and usually open transiently in response to a specific pertur- 
bation in the membrane, such as a change in membrane potential (voltage-gated chan- 
nels) or the binding of a neurotransmitter (transmitter- gated channels). 

K + -selective leak channels have an important role in determining the resting 
membrane potential across the plasma membrane in most animal cells. Voltage-gated 
cation channels are responsible for the generation of self-amplifying action potentials 
in electrically excitable cells, such as neurons and skeletal muscle cells. Transmitter- 
gated ion channels convert chemical signals to electrical signals at chemical synapses. 
Excitatory neurotransmitters, such as acetylcholine and glutamate, open transmitter- 
gated cation channels and thereby depolarize the postsynaptic membrane toward the 
threshold level for firing an action potential Inhibitory neurotransmitters, such as 
GABA and glycine, open transmitter- gated CT or K + channels and thereby suppress 
firing by keeping the postsynaptic membrane polarized. A subclass of glutamate -gated 
ion channels, called NMDA-receptor channels, is highly permeable to Ca 2+ , which can 
trigger the long-term changes in synapses such as LTP and LTD that are thought to be 
involved in some forms of learning and memory. 

Table 1 1 -2 Some Ion Channel Families 



CHANNEL TYPE 

Voltage-gated 
cation channels 



Transmitter-gated 
ion channels 



REPRESENTATIVE EXAA 

voltage-gated Na + channels 

voltage-gated K + channels (including delayed and early) 

voltage-gated Ca 2+ channels 

acetylcholine-gated cation channels 

glutamate-gated Ca 2+ channels 

serotonin-gated cation channels 

GABA-gated CI~ channels 

glycine-gated CI" channels 



- excitatory 



- inhibitory 




AfiU 



increased Ca 2+ in the cytosol 
induces postsynaptic cell to 
insert new AMPA receptor in the 
plasma membrane, increasing 
the cell's sensitivity to 
glutamate 




Figure 1 1 -42 The signaling events in 
long-term potentiation. Although not 
shown, evidence suggests that changes 
can also occur in the presynaptic nerve 
terminals in LTP, which may be stimulated 
by retrograde signals from the 
postsynaptic cell. 
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Ion channels work together in complex ways to control the behavior of electrically 
excitable cells. A typical neuron, for example, receives thousands of excitatory and 
inhibitory inputs, which combine by spatial and temporal summation to produce a 
postsynaptic potential (PSP) in the cell body The magnitude of the PSP is translated 
into the rate of firing of action potentials by a mixture of cation channels in the mem- 
brane of the initial segment. 



PROBLEMS 

Which statements are true? Explain why or why not. 

1 1 -1 Transport by transporters can be either active or pas- 
sive, whereas transport by channels is always passive. 

11-2 Transporters saturate at high concentrations of the 
transported molecule when all their binding sites are occu- 
pied; channels, on the other hand, do not bind the ions they 
transport and thus the flux of ions through a channel does 
not saturate. 

11-3 The membrane potential arises from movements of 
charge that leave ion concentrations practically unaffected, 
causing only a very slight discrepancy in the number of pos- 
itive and negative ions on the two sides of the membrane. 

Discuss the following problems. 

11-4 Order Ca 2+ , C0 2 , ethanol, glucose, RNA, and H 2 
according to their ability to diffuse through a lipid bilayer, 
beginning with the one that crosses the bilayer most readily. 
Explain your order. 

11-5 How is it possible for some molecules to be at equi- 
librium across a biological membrane and yet not be at the 
same concentration on both sides? 

11-6 Ion transporters are "linked" together — not physi- 
cally, but as a consequence of their actions. For example, 
cells can raise their intracellular pH, when it becomes too 
acidic, by exchanging external Na + for internal H + , using a 
Na + -H + antiporter. The change in internal Na + is then 
redressed using the Na + -K + pump. 

A. Can these two transporters, operating together, normal- 
ize both the H + and the Na + concentrations inside the cell? 

B. Does the linked action of these two pumps cause imbal- 
ances in either the K + concentration or the membrane 
potential? Why or why not? 

1 1 -7 Microvilli increase the surface area of intestinal cells, 
providing more efficient absorption of nutrients. Microvilli 
are shown in profile and cross section in Figure Ql 1-1. From 
the dimensions given in the figure, estimate the increase in 



profile 



cross-section 




surface area that microvilli provide (for the portion of the 
plasma membrane in contact with the lumen of the gut) rel- 
ative to the corresponding surface of a cell with a "flat" 
plasma membrane. 

11-8 According to Newton's laws of motion, an ion 
exposed to an electric field in a vacuum would experience a 
constant acceleration from the electric driving force, just as 
a falling body in a vacuum constantly accelerates due to 
gravity. In water, however, an ion moves at constant velocity 
in an electric field. Why do you suppose that is? 

1 1 -9 The "ball-and-chain" model for the rapid inactivation 
of voltage-gated K + channels has been elegantly confirmed 
for the shaker K + channel from Drosophila melanogaster. 
(The shaker K + channel in Drosophila is named after a 
mutant form that causes excitable behavior— even anes- 
thetized flies keep twitching.) Deletion of the N-terminal 
amino acids from the normal shaker channel gives rise to a 
channel that opens in response to membrane depolariza- 
tion, but stays open instead of rapidly closing as the normal 
channel does. A peptide (MAAVAGLYGLGEDRQHRKKQ) that 
corresponds to the deleted N-terminus can inactivate the 
open channel at 100 uJVI. 

Is the concentration of free peptide (100 jxM) that is 
required to inactivate the defective K + channel anywhere 
near the normal local concentration of the tethered ball on 
a normal channel? Assume that the tethered ball can explore 
a hemisphere [volume = (2/3)71^] with a radius of 21.4 nm, 
the length ofthe polypeptide "chain" (Figure Q 11-2). Calcu- 
late the concentration for one ball in this hemisphere. How 
does that value compare with the concentration of free pep- 
tide needed to inactivate the channel? 




1 nm 



0.1 urn 



Figure Ql 1-2 A "ball" tethered by a "chain" to a voltage-gated K + 
channel (Problem 11-9). 

11-10 The squid giant axon occupies a unique position in 
the history of our understanding of cell membrane poten- 
tials and nerve action. When an electrode is stuck into an 
intact giant axon, the membrane potential registers -70 mV 
When the axon, suspended in a bath of seawater, is stimu- 
lated to conduct a nerve impulse, the membrane potential 
changes transiently from -70 mV to +40 mV 

Figure Ql 1-1 Microvilli of intestinal epithelial cells in profile and 
cross section (Problem 1 1 -7). (Left panel, from Rippel Electron 
Microscope Facility, Dartmouth College; Right panel, from David 
Burgess.) 



694 Chapter 1 1 : Membrane Transport of Small Molecules and the Electrical Properties of Membranes 



For univalent ions and at 20°C (293 K), the Nernst equa- 
tion reduces to 

y=58mVxlog(C /Ci) 

where C and Q are the concentrations outside and inside, 
respectively. 

Using this equation, calculate the potential across the 
resting membrane (1) assuming that it is due solely to K + 
and (2) assuming that it is due solely to Na + . (The Na + and K + 
concentrations in axon cytoplasm and in seawater are given 
in Table Ql 1-1 .) Which calculation is closer to the measured 



Table Q11-1 Ionic composition of seawater and of cytoplasm from 
the squid giant axon (Problem 11-10). 



ION 

Na + 
K + 



65 mM 
344 mM 



430 mM 
9mM 



resting potential? Which calculation is closer to the mea- 
sured action potential? Explain why these assumptions 
approximate the measured resting and action potentials. 
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Intracellular Compartments 
and Protein Sorting 




Unlike a bacterium, which generally consists of a single intracellular compart- 
ment surrounded by a plasma membrane, a eucaryotic cell is elaborately sub- 
divided into functionally distinct, membrane- enclosed compartments. Each 
compartment, or organelle, contains its own characteristic set of enzymes and 
other specialized molecules, and complex distribution systems transport spe- 
cific products from one compartment to another. To understand the eucaryotic 
cell, it is essential to know how the cell creates and maintains these compart- 
ments, what occurs in each of them, and how molecules move between them. 

Proteins confer upon each compartment its characteristic structural and 
functional properties. They catalyze the reactions that occur in each organelle 
and selectively transport small molecules into and out of its interior, or lumen. 
Proteins also serve as organelle-specific surface markers that direct new deliver- 
ies of proteins and lipids to the appropriate organelle. 

An animal cell contains about 10 billion (10 10 ) protein molecules of perhaps 
10,000 kinds, and the synthesis of almost all of them begins in the cytosol. Each 
newly synthesized protein is then delivered specifically to the cell compartment 
that requires it. The intracellular transport of proteins is the central theme of 
both this chapter and the next. By tracing the protein traffic from one compart- 
ment to another, one can begin to make sense of the otherwise bewildering 
maze of intracellular membranes. 
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THE COMPARTMENTALIZATION OF CELLS 

In this brief overview of the compartments of the cell and the relationships 
between them, we organize the organelles conceptually into a small number of 
discrete families, discuss how proteins are directed to specific organelles, and 
explain how proteins cross organelle membranes. 



All Eucaryotic Cells Have the Same Basic Set of Membrane- 
Enclosed Organelles 

Many vital biochemical processes take place in or on membrane surfaces. Mem- 
brane-bound enzymes, for example, catalyze lipid metabolism, and oxidative 
phosphorylation and photosynthesis both require a membrane to couple the 
transport of H + to the synthesis of ATP. In addition to providing increased mem- 
brane area to host biochemical reactions, intracellular membrane systems form 
enclosed compartments that are separate from the cytosol, thus creating func- 
tionally specialized aqueous spaces within the cell. Because the lipid bilayer of 
organelle membranes is impermeable to most hydrophilic molecules, the mem- 
brane of each organelle must contain membrane transport proteins to import 
and export specific metabolites. Each organelle membrane must also have a 
mechanism for importing, and incorporating into the organelle, the specific 
proteins that make the organelle unique. 

Figure 12-1 illustrates the major intracellular compartments common to 
eucaryotic cells. The nucleus contains the genome (aside from mitochondrial 
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endosome 



peroxisome 



free 
polyribosomes 




cytosol 

lysosome 
Golgi apparatus 
mitochondrion 

endoplasmic reticulum 
with membrane-bound 
polyribosomes 

nucleus 
asma membrane 



Figure 12-1 The major intracellular 
compartments of an animal cell. <ATCC> 
The cytosol (gray), endoplasmic reticulum, 
Golgi apparatus, nucleus, mitochondrion, 
endosome, lysosome, and peroxisome are 
distinct compartments isolated from the 
rest of the cell by at least one selectively 
permeable membrane. 
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and chloroplast DNA) and it is the principal site of DNA and RNA synthesis. The 
surrounding cytoplasm consists of the cytosol and the cytoplasmic organelles 
suspended in it. The cytosol constitutes a little more than half the total volume 
of the cell, and it is the site of protein synthesis and degradation. It also per- 
forms most of the cell's intermediary metabolism — that is, the many reactions 
that degrade some small molecules and synthesize others to provide the build- 
ing blocks for macromolecules (discussed in Chapter 2). 

About half the total area of membrane in a eucaryotic cell encloses the 
labyrinthine spaces of the endoplasmic reticulum (ER). The rough Eft has many 
ribosomes bound to its cytosolic surface; these synthesize both soluble and inte- 
gral membrane proteins, most of which are destined either for secretion to the 
cell exterior or for other organelles. We shall see that, whereas proteins are trans- 
ported into other organelles only after their synthesis is complete, they are 
transported into the ER as they are synthesized. This explains why the ER mem- 
brane is unique in having ribosomes tethered to it. The ER also produces most 
of the lipid for the rest of the cell and functions as a store for Ca 2+ ions. Regions 
of the ER that lack bound ribosomes are called smooth ER. The ER sends many 
of its proteins and lipids to the Golgi apparatus, which consists of organized 
stacks of disclike compartments called Golgi cisternae. The Golgi apparatus 
receives lipids and proteins from the ER and dispatches them to various desti- 
nations, usually covalently modifying them en route. 

Mitochondria and (in plants) chloroplasts generate most of the ATP that cells 
use to drive reactions requiring an input of free energy; chloroplasts are a spe- 
cialized version of plastids, which can also have other functions in plant cells, 
such as the storage of food or pigment molecules. Lysosomes contain digestive 
enzymes that degrade defunct intracellular organelles, as well as macro- 
molecules and particles taken in from outside the cell by endocytosis. On their 
way to lysosomes, endocytosed material must first pass through a series of 
organelles called endosomes. Finally, peroxisomes are small vesicular compart- 
ments that contain enzymes used in various oxidation reactions. 

In general, each membrane-enclosed organelle performs the same set of 
basic functions in all cell types. But to serve the specialized functions of cells, 
these organelles vary in abundance and can have additional properties that dif- 
fer from cell type to cell type. 

On average, the membrane-enclosed compartments together occupy nearly 
half the volume of a cell (Table 12-1), and a large amount of intracellular mem- 
brane is required to make them all. In liver and pancreatic cells, for example, the 
endoplasmic reticulum has a total membrane surface area that is, respectively, 25 
times and 12 times that of the plasma membrane (Table 12-2). In terms of its area 
and mass, the plasma membrane is only a minor membrane in most eucaryotic 
cells, and organelles are packed tightly in the cytosol (Figure 12-2). 
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Table 12-1 Relative Volumes Occupied by the Major Intracellular Compartments in 
a Liver Cell (Hepatocyte) 



INTRACELLULAR COMPARTMENT 


PERCENTAGE OF TOTAL 




CELL VOLUME 


Cytosol 


54 


Mitochondria 


22 


Rough ER cisternae 


9 


Smooth ER cisternae plus Golgi cisternae 


6 


Nucleus 


6 


Peroxisomes 


1 


Lysosomes 


1 


Endosomes 


1 



Membrane-enclosed organelles often have characteristic positions in the 
cytosol. In most cells, for example, the Golgi apparatus is located close to the 
nucleus, whereas the network of ER tubules extends from the nucleus through- 
out the entire cytosol. These characteristic distributions depend on interactions 
of the organelles with the cytoskeleton. The localization of both the ER and the 
Golgi apparatus, for instance, depends on an intact microtubule array; if the 
microtubules are experimentally depolymerized with a drug, the Golgi appara- 
tus fragments and disperses throughout the cell, and the ER network collapses 
toward the cell center (discussed in Chapter 16). 



Evolutionary Origins Explain the Topological Relationships of 
Organelles 

To understand the relationships between the compartments of the cell, it is help- 
ful to consider how they might have evolved. The precursors of the first eucary- 
otic cells are thought to have been simple organisms that resembled bacteria, 
which generally have a plasma membrane but no internal membranes. The 
plasma membrane in such cells therefore provides all membrane-dependent 
functions, including the pumping of ions, ATP synthesis, protein secretion, and 
lipid synthesis. Typical present-day eucaryotic cells are 10-30 times larger in lin- 
ear dimension and 1000-10,000 times greater in volume than a typical bacterium 



Table 12-2 Relative Amounts of Membrane Types ir 


Two Kinds of Eucaryotic Cells 


MEMBRANE TYPE 


PERCENTAGE OF TOTAL CELL MEMBRANE 


LIVER 


PANCREATIC 


^H^^^H 






Plasma membrane 






Rough ER membrane 


35 


60 


Smooth ER membrane 


16 


<1 


Golgi apparatus membrane 


7 


10 


Mitochondria 






Outer membrane 


7 


4 


Inner membrane 


32 


17 


Nucleus 






Inner membrane 


0.2 


0.7 


Secretory vesicle membrane 


not determined 


3 


Lysosome membrane 


0.4 


not determined 


Peroxisome membrane 


0.4 


not determined 


Endosome membrane 


0.4 


not determined 



*These two cells are of very different sizes: the average hepatocyte has a volume of about 
5000 Jim 3 compared with 1 000 Jim 3 for the pancreatic exocrine cell. Total cell membrane areas 
are estimated at about 1 10,000 Jim 2 and 13,000 Jim 2 , respectively. 
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such as E. coll The profusion of internal membranes can be seen, in part, as an 
adaptation to this increase in size: the eucaryotic cell has a much smaller ratio of 
surface area to volume, and its plasma membrane therefore presumably has too 
small an area to sustain the many vital functions that membranes perform. The 
extensive internal membrane systems of a eucaryotic cell alleviate this problem. 

The evolution of internal membranes evidently accompanied the special- 
ization of membrane function. Consider, for example, the generation of thy- 
lakoid vesicles in chloroplasts. These vesicles form when chloroplasts in green 
leaf cells develop from proplastids, small precursor organelles that are present in 
all immature plant cells. These organelles are surrounded by a double mem- 
brane and develop according to the needs of the differentiated cells: they turn 
into chloroplasts in leaf cells, for example, but into organelles that store starch, 
fat, or pigments in other cell types (Figure 12-3A). When they convert into 
chloroplasts, specialized membrane patches form, invaginate, and pinch off 
from the inner membrane of the proplastid. The vesicles that result form a new 
specialized compartment, the thylakoid, which harbors all of the chloroplast's 
photosynthetic machinery (Figure 12-3B). 

Other compartments in eucaryotic cells may have originated in a conceptu- 
ally similar way (Figure 12-4). The invagination and pinching off of specialized 
intracellular membrane structures from the plasma membrane creates 
organelles with an interior that is topologically equivalent to the exterior of the 
cell. We shall see that this topological relationship holds for all of the organelles 
involved in the secretory and endocytic pathways, including the ER, Golgi appa- 
ratus, endosomes, and lysosomes. We can therefore think of all of these 
organelles as members of the same family. As we discuss in detail in the next 
chapter, their interiors communicate extensively with one another and with the 



Figure 12-2 An electron micrograph of 
part of a liver cell seen in cross section. 

Examples of most of the major 
intracellular compartments are indicated. 
(Courtesy of Daniel S. Friend.) 
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outside of the cell via transport vesicles, which bud off from one organelle and 
fuse with another (Figure 12-5). 

As described in Chapter 14, mitochondria and plastids differ from the other 
membrane-enclosed organelles because they contain their own genomes. The 
nature of these genomes, and the close resemblance of the proteins in these 
organelles to those in some present-day bacteria, strongly suggest that mito- 
chondria and plastids evolved from bacteria that were engulfed by other cells 
with which they initially lived in symbiosis (discussed in Chapters 1 and 14). 
According to the hypothetical scheme shown in Figure 12-4B, the inner mem- 
brane of mitochondria and plastids corresponds to the original plasma mem- 
brane of the bacterium, while the lumen of these organelles evolved from the 
bacterial cytosol. As we might expect from such an endocytic origin, these two 
organelles are surrounded by a double membrane, and they remain isolated from 
the extensive vesicular traffic that connects the interiors of most of the other 
membrane-enclosed organelles to each other and to the outside of the cell. 

The evolutionary scheme described above groups the intracellular compart- 
ments in eucaryotic cells into four distinct families: (1) the nucleus and the 
cytosol, which communicate with each other through nuclear pore complexes 
and are thus topologically continuous (although functionally distinct); (2) all 
organelles that function in the secretory and endocytic pathways — including the 
ER, Golgi apparatus, endosomes, and lysosomes, the numerous classes of trans- 
port intermediates such as transport vesicles that move between them, and pos- 
sibly peroxisomes; (3) the mitochondria; and (4) the plastids (in plants only). 



Proteins Can Move Between Compartments in Different Ways 

The synthesis of all proteins begins on ribosomes in the cytosol, except for the 
few that are synthesized on the ribosomes of mitochondria and plastids. Their 
subsequent fate depends on their amino acid sequence, which can contain sort- 
ing signals that direct their delivery to locations outside the cytosol. Most pro- 
teins do not have a sorting signal and consequently remain in the cytosol as per- 
manent residents. Many others, however, have specific sorting signals that direct 
their transport from the cytosol into the nucleus, the ER, mitochondria, plastids, 
or peroxisomes; sorting signals can also direct the transport of proteins from the 
ER to other destinations in the cell. 



(A) 



plastid 
family 



proplastid 




storage plastid 


chromoplast 


chloroplast 


(stores fat, oil 


(stores 


(carries out 


or starch) 


pigments) 


photosynthesis) 



(B) outer 

membrane 



inner 
membrane 



specialized region 
of proplastid 
inner membrane 



thylakoid space 

thylakoid 
membrane 





proplastid 



thylakoid 
vesicles 
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Figure 12-3 Development of plastids. (A) Proplastids are inherited with the cytoplasm of plant egg cells. As immature 
plant cells differentiate, the proplastids develop according to the needs of the specialized cell: they can become 
chloroplasts (in green leaf cells), storage plastids that accumulate starch (e.g., in potato tubers) or oil and lipid droplets 
(e.g., in fatty seeds), or chromoplasts that harbor pigments (e.g., in flower petals). (B) Development of the thylakoid. As 
chloroplasts develop, patches of specialized membrane in the proplastid inner membrane invaginateand pinch off to form 
thylakoid vesicles, which then develop into the mature thylakoid. The thylakoid membrane forms a separate compartment, 
the thylakoid space, which is structurally and functionally distinct from the rest of the chloroplast. Thylakoids can grow and 
divide autonomously as chloroplasts proliferate. 
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To understand the general principles by which sorting signals operate, it is 
important to distinguish three fundamentally different ways by which proteins 
move from one compartment to another. These three mechanisms are described 
below, and their sites of action in the cell are outlined in Figure 12-6. We discuss 
the first two mechanisms in this chapter, and the third (green arrows in Figure 
12-6) in Chapter 13. 

1. In gated transport, proteins move between the cytosol and the nucleus 
(which are topologically equivalent) through nuclear pore complexes in 
the nuclear envelope. The nuclear pore complexes function as selective 
gates that actively transport specific macromolecules and macromolecular 
assemblies, although they also allow free diffusion of smaller molecules. 

2. In transmembrane transport, transmembrane protein translocators 
directly transport specific proteins across a membrane from the cytosol 
into a space that is topologically distinct. The transported protein molecule 



plasma membrane 

A 



lysosome 



rough ER 



nuclear 
envelope 




Figure 12-4 Hypothetical schemes for 
the evolutionary origins of some 
organelles. The origins of mitochondria, 
chloroplasts, ER, and the cell nucleus 
might explain the topological 
relationships of these compartments in 
eucaryotic cells. 

(A) A possible pathway for the 
evolution of the cell nucleus and the ER. 
In some bacteria the single DNA 
molecule that comprises the bacterial 
chromosome is attached to an 
invagination of the plasma membrane. 
Such an invagination in a very ancient 
procaryotic cell could have rearranged to 
form an envelope around the DNA, while 
still allowing the DNA access to the cell 
cytosol (as is required for DNA to direct 
protein synthesis). This envelope is 
presumed to have eventually pinched off 
completely from the plasma membrane, 
producing a nuclear compartment 
surrounded by a double membrane. 

As illustrated, gated passageways 
called nuclear pore complexes (NPCs) 
penetrate the nuclear envelope. Because 
it is surrounded by two membranes that 
are in continuity where they are 
penetrated by NPCs, the nuclear 
compartment is topologically equivalent 
to the cytosol; in fact, during mitosis the 
nuclear contents mix with the cytosol. 
The lumen of the ER is continuous with 
the space between the inner and outer 
nuclear membranes, and it is 
topologically equivalent to the 
extracellular space (see Figure 12-5). 

(B) Mitochondria (and plastids) are 
thought to have originated when a 
bacterium was engulfed by a larger pre- 
eucaryotic cell. This could explain why 
they contain their own genomes and 
why the lumens of these organelles 
remain isolated from the membrane 
traffic that interconnects the lumens of 
many other intracellular compartments. 



Golgi endosome 
apparatus 



Figure 12-5 Topological relationships 
between compartments of the 
secretory and endocytic pathways in a 
eucaryotic cell. Topologically equivalent 
spaces are shown in red. In principle, 
cycles of membrane budding and fusion 
permit the lumen of any of these 
organelles to communicate with any 
other and with the cell exterior by 
means of transport vesicles. Blue arrows 
indicate the extensive outbound and 
inbound vesicular traffic (discussed in 
Chapter 13). Some organelles, most 
notably mitochondria and (in plant 
cells) plastids, do not take part in this 
communication and are isolated 
from the traffic between organelles 
shown here. 
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Figure 12-6 A simplified "roadmap"of protein traffic. Proteins can move 
from one compartment to another by gated transport (red), 
transmembrane transport (blue), or vesicular transport (green). The sorting 
signals that direct a given protein's movement through the system, and 
thereby determine its eventual location in the cell, are contained in each 
protein's amino acid sequence. The journey begins with the synthesis of a 
protein on a ribosome in the cytosol and terminates when the protein 
reaches its final destination. At each intermediate station (boxes), a decision 
is made as to whether the protein is to be retained in that compartment or 
transported further. In principle, a sorting signal could be required for 
either retention in or exit from a compartment. 

We shall refer to this figure often as a guide in this chapter and the next, 
highlighting in color the particular pathway being discussed. 



usually must unfold to snake through the translocator. The initial transport 
of selected proteins from the cytosol into the ER lumen or mitochondria, 
for example, occurs in this way 
3. In vesicular transport, membrane-enclosed transport intermediates — 
which may be small, spherical transport vesicles or larger, irregularly 
shaped organelle fragments — ferry proteins from one compartment to 
another. The transport vesicles and fragments become loaded with a cargo 
of molecules derived from the lumen of one compartment as they bud and 
pinch off from its membrane; they discharge their cargo into a second 
compartment by fusing with the membrane enclosing that compartment 
(Figure 12-7). The transfer of soluble proteins from the ER to the Golgi 
apparatus, for example, occurs in this way. Because the transported pro- 
teins do not cross a membrane, vesicular transport can move proteins only 
between compartments that are topologically equivalent (see Figure 12-5). 

Each mode of protein transfer is usually guided by sorting signals in the 
transported protein, which are recognized by complementary sorting receptors. 
If a large protein is to be imported into the nucleus, for example, it must possess 
a sorting signal that receptor proteins recognize to guide it through the nuclear 
pore complex. If a protein is to be transferred directly across a membrane, it 
must possess a sorting signal that the membrane translocator recognizes. Like- 
wise, if a protein is to be loaded into a certain type of vesicle or retained in cer- 
tain organelles, a complementary receptor in the appropriate membrane must 
recognize its sorting signal. 



Signal Sequences Direct Proteins to the Correct Cell Address 

Most protein sorting signals reside in a stretch of amino acid sequence, typically 
15-60 residues long. These signal sequences are often found at the N-terminus; 
specialized signal peptidases remove the signal sequence from the finished pro- 
tein once the sorting process is complete. Signal sequences can also be internal 
stretches of amino acids, which remain part of the protein. In some cases, sort- 
ing signals are composed of multiple internal amino acid sequences that form a 
specific three-dimensional arrangement of atoms on the protein's surface, called 
a signal patch. 

Each signal sequence specifies a particular destination in the cell. Proteins 
destined for initial transfer to the ER usually have a signal sequence at their N- 
terminus, which characteristically includes a sequence composed of about 5-10 



Figure 1 2-7 Vesicle budding and fusion during vesicular transport. 

Transport vesicles bud from one compartment (donor) and fuse with 
another (target) compartment. In the process, soluble components (red 
dots) are transferred from lumen to lumen. Note that membrane is also 
transferred and that the original orientation of both proteins and lipids in 
the donor-compartment membrane is preserved in the target- 
compartment membrane. Thus, membrane proteins retain their 
asymmetric orientation, with the same domains always facing the cytosol. 
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hydrophobic amino acids. Many of these proteins will in turn pass from the ER 
to the Golgi apparatus, but those with a specific signal sequence of four amino 
acids at their C-terminus are recognized as ER residents and are returned to the 
ER. Proteins destined for mitochondria have signal sequences of yet another 
type, in which positively charged amino acids alternate with hydrophobic ones. 
Finally, many proteins destined for peroxisomes have a signal sequence of three 
characteristic amino acids at their C-terminus. 

Table 12-3 presents some specific signal sequences. Experiments in which 
the peptide is transferred from one protein to another by genetic engineering 
techniques have demonstrated the importance of each of these signal sequences 
for protein targeting. Placing the N-terminal ER signal sequence at the beginning 
of a cytosolic protein, for example, redirects the protein to the ER. Signal 
sequences are therefore both necessary and sufficient for protein targeting. Even 
though their amino acid sequences can vary greatly, the signal sequences of all 
proteins having the same destination are functionally interchangeable; physical 
properties, such as hydrophobicity, often seem to be more important in the sig- 
nal-recognition process than the exact amino acid sequence. 

Signal sequences are recognized by complementary sorting receptors that 
guide proteins to their appropriate destination, where the receptors unload 
their cargo. The receptors function catalytically: after completing one round of 
targeting, they return to their point of origin to be reused. Most sorting receptors 
recognize classes of proteins rather than an individual protein species. They can 
therefore be viewed as public transportation systems, dedicated to delivering 
different components to their correct location in the cell. 

Panel 12-1 describes the main ways of studying how proteins are directed 
from the cytosol to a specific compartment and the mechanism of their translo- 
cation across membranes. 



Most Organelles Cannot Be Constructed De Novo:They Require 
Information in the Organelle Itself 

When a cell reproduces by division, it has to duplicate its organelles. In general, 
cells do this by incorporating new molecules into the existing organelles, 
thereby enlarging them; the enlarged organelles then divide and are distributed 
to the two daughter cells. Thus, each daughter cell inherits a complete set of spe- 
cialized cell membranes from its mother. This inheritance is essential because a 
cell could not make such membranes from scratch. If the ER were completely 
removed from a cell, for example, how could the cell reconstruct it? As we dis- 
cuss later, the membrane proteins that define the ER and perform many of its 
functions are themselves products of the ER. A new ER could not be made with- 
out an existing ER or, at least, a membrane that specifically contains the protein 



Table 12-3 Some Typical Signal Sequences 



FUNCTION OF SIGNAL SEQUENCE 


EXAMPLE OF SIGNAL SEQUENCE 


Import into nucleus 
Export from nucleus 
Import into mitochondria 

Import into plastid 

Import into peroxisomes 
Import into ER 

Return to ER 


-Pro-Pro-Lys-Lys-Lys-Arg-Lys-Val- 

-Ala- -Lys- -Ala-Gly- -Asp- - 
+ H 3 N-Met-Leu-Ser-Leu-Arg-Gln-Ser-lle-Arg-Phe-Phe-Lys-Pro-Ala-Thr-Arg-Thr-Leu-Cys-Ser- 

Ser-Arg-Tyr-Leu-Leu- 
+ H 3 N-Met-Val-Ala-Met-Ala-IVIet-Ala-Ser-Leu-Gln-Ser-Ser-l\/let-Ser-Ser-Leu-Ser-Leu-Ser-Ser- 

Asn-Ser-Phe-Leu-Gly-Gln-Pro-Leu-Ser-Pro-lle-Thr-Leu-Ser-Pro-Phe-Leu-Gln-Gly- 
-Ser-Lys- -COCT 
+ H 3 N-Met-Met-Ser-Phe-Val-Ser- -Thr-Glu-Ala-Glu- 

Gln-Leu-Thr-Lys-Cys-Glu-Val-Phe-Gln- 
-Lys-Asp-Glu- -COO" 



Some characteristic features of the different classes of signal sequences are highlighted in color. Where they are known to be important for the 
function of the signal sequence, positively charged amino acids are shown in red and negatively charged amino acids are shown in green. Similarly, 
important hydrophobic amino acids are shown in white and important hydroxylated amino acids are shown in blue. + U^ indicates the N-terminus 
of a protein; COO" indicates the C-terminus. 



PANEL 1 2-1 : Approaches to Studying Signal Sequences and Protein Translocation Across Membranes 









ATRANSFECTION APPROACH FOR 
DEFINING SIGNAL SEQUENCES 

One way to show that a signal sequence is required 
and sufficient to target a protein to a specific 
intracellular compartment is to create a fusion protein 
in which the signal sequence is attached by genetic 
engineering techniques to a protein that normally 
resides in the cytosol. After the cDNA encoding this 
protein is transfected into cells, the location of the 
fusion protein is determined by immunostaining or 
by cell fractionation. 

signal sequence a or b gene encoding cytosolic protein 



A 



plasmid used to transfect cells 




signal sequence a ( ■ 
directs fusion protein 
to organelle A 



signal sequence b ( IB ) 
directs fusion protein 
to organelle B 



By altering the signal sequence using site-directed 
mutagenesis, we can determine which structural 
features are important for its function. 









A BIOCHEMICAL APPROACH FOR STUDYING THE 

MECHANISM OF PROTEIN TRANSLOCATION 

In this approach, a labeled protein containing a specific signal sequence is 
transported into isolated organelles in vitro. The labeled protein is usually 
produced by cell-free translation of a purified mRNA encoding the protein; 
radioactive amino acids are used to label the newly synthesized protein so 
that it can be distinguished from the many other proteins that are present 
in the in vitro translation system. 

Three methods are commonly used to test if the labeled protein has been 
translocated into the organelle: 



1. The labeled protein co-fractionates 
with the organelle during centrifugation. 



signal sequence 





radioactively 
labeled protein 



2. The signal sequence is removed by 
a specific protease that is present 
inside the organelle. 

incubated without organelle 
I incubated with organelle 




& 



protein transported 
into isolated organelle 



3. The protein is protected from 
digestion when proteases are 
added to the incubation medium, 
but is susceptible if a detergent is 
first added to disrupt the 
organelle membrane. 

By exploiting such in vitro assays, 
(proteins, ATP, GTP, etc.) are requ 



radioactive proteins on SDS gel 
C <) * ^ protease * . 







no detergent plus detergent 

one can determine what components 
ired for the translocation process. 



GENETIC APPROACHES FOR STUDYING THE 
MECHANISM OF PROTEIN TRANSLOCATION 

wild-type yeast cell 



enzyme in cytosol: 
- cell lives without 
histidine as nutrient 




translocation 
apparatus 

engineered yeast cell 
histidinol 




enzyme targeted to 

- ER: cell dies without 

histidine as nutrient 



mutant engineered cell 




not all enzyme taken 
up into ER: cell lives 
without histidine as 
nutrient 



Yeast cells with mutations in genes that encode components of the 
translocation machinery have been useful for studying protein 
translocation. Because mutant cells that cannot translocate any proteins 
across their membranes will die, the challenge is to find mutations that 
cause only a partial defect in protein translocation. 

One experimental strategy uses genetic engineering to design special 
yeast cells. The enzyme histidinol dehydrogenase, for example, normally 
resides in the cytosol, where it is required to produce the essential amino 
acid histidine from its precursor histidinol. A yeast strain is constructed in 
which the histidinol dehydrogenase gene is replaced by a re-engineered 
gene encoding a fusion protein with an added signal sequence that 
misdirects the enzyme into the endoplasmic reticulum (ER). When such 
cells are grown without histidine, they die because all of the histidinol 
dehydrogenase is sequestered in the ER, where it is of no use. Cells with a 
mutation that only partially inactivates the mechanism for translocating 
proteins from the cytosol to the ER, however, will survive because the 
cytosol retains enough of the dehydrogenase to produce histidine. 

Often one obtains a cell in which the mutant protein in the translocation 
machinery still functions partially at normal temperature but is completely 
inactive at higher temperature. A cell carrying such a temperature- 
sensitive mutation dies at higher temperature, whether or not histidine is 
present, as it cannot transport any protein into the ER. The normal gene 
that was disabled by the temperature-sensitive mutation can be identified 
by transfecting the mutant cells with a yeast plasmid vector into which 
random yeast genomic DNA fragments have been cloned: the specific 
DNA fragment that rescues the mutant cells when they are grown at high 
temperature should encode the wild-type version of the mutant gene. 



mutant translocation apparatus 
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translocators required to import selected proteins into the ER from the cytosol 
(including the ER- specific translocators themselves). The same is true for mito- 
chondria and plastids (see Figure 12-6). 

Thus, it seems that the information required to construct an organelle does 
not reside exclusively in the DNA that specifies the organelle's proteins. Infor- 
mation in the form of at least one distinct protein that preexists in the organelle 
membrane is also required, and this information is passed from parent cell to 
progeny cell in the form of the organelle itself. Presumably, such information is 
essential for the propagation of the cell's compartmental organization, just as 
the information in DNA is essential for the propagation of the cell's nucleotide 
and amino acid sequences. 

As we discuss in more detail in Chapter 13, however, the ER sheds a constant 
stream of membrane vesicles that incorporate only a subset of ER proteins and 
therefore have a different composition from the ER itself. Similarly, the plasma 
membrane constantly produces various types of specialized endocytic vesicles. 
Thus, some organelles can form from other organelles and do not have to be 
inherited at cell division. 



Summary 

Eucaryotic cells contain intracellular membranes that enclose nearly half the cell's 
total volume in separate intracellular compartments called organelles. The main types 
of organelles present in all eucaryotic cells are the endoplasmic reticulum, Golgi appa- 
ratus, nucleus, mitochondria, lysosomes, endosomes, and peroxisomes; plant cells also 
contain plastids, such as chloroplasts. Each organelle contains a distinct set of pro- 
teins, which mediate its unique functions. 

Each newly synthesized organelle protein must find its way from a ribosome in the 
cytosol, where the protein is made, to the organelle where it functions. It does so by fol- 
lowing a specific pathway, guided by sorting signals in its amino acid sequence that 
function as signal sequences or signal patches. Sorting signals are recognized by com- 
plementary sorting receptors, which deliver the protein to the appropriate target 
organelle. Proteins that function in the cytosol do not contain sorting signals and 
therefore remain there after they are synthesized. 

During cell division, organelles such as the ER and mitochondria are distributed 
intact to each daughter cell. These organelles contain information that is required for 
their construction, and so they cannot be made de novo. 



CYTOSOL 



THE TRANSPORT OF MOLECULES BETWEEN THE 
NUCLEUS AND THE CYTOSOL 

The nuclear envelope encloses the DNA and defines the nuclear compartment. 
This envelope consists of two concentric membranes, which are penetrated by 
nuclear pore complexes (Figure 12-8). Although the inner and outer nuclear 
membranes are continuous, they maintain distinct protein compositions. The 
inner nuclear membrane contains specific proteins that act as anchoring sites 
for chromatin and for the nuclear lamina, a protein meshwork that provides 
structural support for the nuclear envelope. The inner membrane is surrounded 
by the outer nuclear membrane, which is continuous with the membrane of the 
ER. Like the membrane of the ER (discussed later), the outer nuclear membrane 
is studded with ribosomes engaged in protein synthesis. The proteins made on 
these ribosomes are transported into the space between the inner and outer 
nuclear membranes (the perinuclear space), which is continuous with the ER 
lumen. 

Bidirectional traffic occurs continuously between the cytosol and the 
nucleus. The many proteins that function in the nucleus — including histones, 
DNA and RNA polymerases, gene regulatory proteins, and RNA-processing pro- 
teins — are selectively imported into the nuclear compartment from the cytosol, 
where they are made. At the same time, tRNAs and mRNAs are synthesized in the 
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nuclear compartment and then exported to the cytosol. Like the import process, 
the export process is selective; mRNAs, for example, are exported only after they 
have been properly modified by RNA-processing reactions in the nucleus. In 
some cases, the transport process is complex. Ribosomal proteins, for instance, 
are made in the cytosol and imported into the nucleus, where they assemble 
with newly made ribosomal RNA into particles. The particles are then exported 
to the cytosol, where they assemble into ribosomes. Each of these steps requires 
selective transport across the nuclear envelope. 



Nuclear Pore Complexes Perforate the Nuclear Envelope 

Large, elaborate structures known as nuclear pore complexes (NPCs) perforate 
the nuclear envelope of all eucaryotes. In animal cells, each NPC has an esti- 
mated molecular mass of about 125 million daltons and is composed of about 
30 different NPC proteins, or nucleoporins, which are present in multiple copies 
and are arranged with a striking octagonal symmetry (Figure 12-9). 

The nuclear envelope of a typical mammalian cell contains 3000-4000 
NPCs, and the total traffic that passes through each NPC is enormous: each NPC 
can transport up to 500 macromolecules per second and can transport in both 
directions at the same time. How it coordinates the bidirectional flow of macro- 
molecules to avoid congestion and head-on collisions is not known. 

Each NPC contains one or more aqueous passages, through which small 
water-soluble molecules can diffuse passively. Researchers have determined the 
effective size of these passages by injecting labeled water-soluble molecules of 
different sizes into the cytosol and then measuring their rate of diffusion into the 
nucleus. Small molecules (5000 daltons or less) diffuse in so fast that we can con- 
sider the nuclear envelope freely permeable to them. Large proteins, however, 
traverse the NPC much more slowly; the larger a protein, the more slowly it 
passes through the NPC. Proteins larger than 60,000 daltons can barely enter by 
passive diffusion. This size cut-off to free diffusion is thought to result from the 
NPC structure (Figure 12-10) . Many NPC proteins that line the central pore con- 
tain extensive unstructured regions, which are thought to form a disordered tan- 
gle (much like a kelp bed in the ocean), blocking the central opening in the NPC 
to the passage of large macromolecules, but leaving small openings to allow the 
diffusion of smaller molecules. 

Because many cell proteins are too large to diffuse passively through the 
NPCs, the nuclear compartment and the cytosol can maintain different comple- 
ments of proteins. Mature cytosolic ribosomes, for example, are about 30 nm in 
diameter and thus cannot diffuse through the NPC, confining protein synthesis 
to the cytosol. But how does the nucleus export newly made ribosomal subunits 
or import large molecules, such as DNA and RNA polymerases, which have sub- 
unit molecular weights of 100,000-200,000 daltons? As we discuss next, these 
and many other protein and RNA molecules bind to specific receptor proteins 
that ferry large molecules actively through NPCs. 




perinuclear 
space 



Figure 1 2-8 The nuclear envelope. The 

double-membrane envelope is 
penetrated by pores in which NPCs are 
positioned and is continuous with the 
endoplasmic reticulum. The ribosomes 
that are normally bound to the cytosolic 
surface of the ER membrane and outer 
nuclear membrane are not shown. The 
nuclear lamina is a fibrous meshwork 
underlying the inner membrane. 



Nuclear Localization Signals Direct Nuclear Proteins to the 
Nucleus 

When proteins are experimentally extracted from the nucleus and reintro- 
duced into the cytosol, even the very large ones reaccumulate efficiently in the 
nucleus. Sorting signals called nuclear localization signals are responsible for 
the selectivity of this active nuclear import process. The signals have been pre- 
cisely defined by using recombinant DNA technology for numerous nuclear 
proteins, as well as for proteins that enter the nucleus only transiently (Figure 
12-11). In many proteins, signals consist of one or two short sequences that 
are rich in the positively charged amino acids lysine and arginine (see Table 
12-3, p. 702), with the precise sequence varying for different nuclear proteins. 
Other nuclear proteins contain different signals, some of which are not yet 
characterized. 
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Figure 1 2-9 The arrangement of NPCs in the nuclear envelope. (A) A small region of the nuclear envelope. In cross section, an NPC seems to 
have four structural building blocks: column subunits, which form the bulk of the pore wall; annular subunits, which are centrally located; 
lumenal subunits, which contain transmembrane proteins that anchor the complex to the nuclear membrane; and ring subunits, which form 
the cytosolic and nuclear faces of the complex. In addition, fibrils protrude from both the cytosolic and the nuclear sides of the NPC. On the 
nuclear side, the fibrils converge to form basketlike structures. Immunoelectron microscopic studies show that the proteins that make up the 
core of the NPC are oriented symmetrically across the nuclear envelope, so that the nuclear and cytosolic sides of the core look identical. In 
contrast, the proteins that make up the fibrils are different on the cytosolic and nuclear sides of the NPC. The eight-fold rotational and two-fold 
transverse symmetry of the core NPC explains how such a huge structure can be formed from only about 30 different proteins: many of these 
proteins are present in 16 copies (or multiples of 16). Disordered domains of the core proteins (not shown) are thought to extend toward the 
center of the NPC, blocking the passive diffusion of large macromolecules. (B) A scanning electron micrograph of the nuclear side of the 
nuclear envelope of an oocyte (see also Figure 9-55). (C) An electron micrograph showing a side view of two NPCs (brackets); note that the 
inner and outer nuclear membranes are continuous at the edges of the pore. (D) An electron micrograph showing face-on views of negatively 
stained NPCs. The membrane has been removed by detergent extraction. Note that some of the NPCs contain material in their center, which is 
thought to be macromolecules in transit through these NPCs. (B, from M.W. Goldberg and T.D. Allen, J. Cell Biol. 1 1 9:1 429-1 440, 1 992. With 
permission from The Rockefeller University Press; C, courtesy of Werner Franke and Ulrich Scheer; D, courtesy of Ron Milligan.) 

Nuclear localization signals can be located almost anywhere in the amino 
acid sequence and are thought to form loops or patches on the protein surface. 
Many function even when linked as short peptides to lysine side chains on the 
surface of a cytosolic protein, suggesting that the precise location of the signal 
within the amino acid sequence of a nuclear protein is not important. Moreover, 
as long as one of the protein subunits of a multicomponent complex displays a 
nuclear localization signal, the complex can be imported into the nucleus. 
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Figure 1 2-1 A model for the gated 
diffusion barrier of the NPC. The 

drawing shows a side view of an NPC. 
Unstructured regions of the proteins 
lining the central pore form a tangled 
meshwork, which blocks the passive 
diffusion of large macromolecules. 
During active transport, however, even 
particles up to 39 nm in diameter can 
pass through NPCs. 
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(A) LOCALIZATION OF T-ANTIGEN CONTAINING 
ITS NORMAL NUCLEAR IMPORT SIGNAL 
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One can make the transport of nuclear proteins through NPCs visible by 
coating gold particles with a nuclear localization signal, injecting the particles 
into the cytosol, and then following their fate by electron microscopy (Figure 
12-12). Transport begins when the particles bind to tentaclelike fibrils that 
extend from the rim of the NPC into the cytoplasm, and then proceed through 
the center of the NPC. Presumably, the unstructured regions of the NPC proteins 
that form a diffusion barrier for large molecules (mentioned earlier) are pushed 
away to allow the coated gold particles to squeeze through. 

Macromolecular transport across NPCs differs fundamentally from the 
transport of proteins across the membranes of other organelles, in that it occurs 
through a large aqueous pore rather than through a protein transporter span- 
ning one or more lipid bilayers. For this reason, fully folded nuclear proteins can 
be transported into the nucleus through an NPC, and newly formed ribosomal 
subunits are transported out of the nucleus as an assembled particle. By con- 
trast, proteins have to be extensively unfolded to be transported into most other 
organelles, as we discuss later. In the electron microscope, however, very large 
particles traversing the NPC seem to become compressed as they squeeze 
through the pore, indicating that they undergo restructuring during transport. 
The export of some very large mRNAs has been extensively studied, as discussed 
in Chapter 6 (see Figure 6-39). 



Nuclear Import Receptors Bind to Both Nuclear Localization 
Signals and NPC proteins 

To initiate nuclear import, most nuclear localization signals must be recognized 
by nuclear import receptors, which are encoded by a family of related genes. 
Each family member encodes a receptor protein that is specialized for the trans- 
port of a subset of cargo proteins — those that contain nuclear localization sig- 
nals to which the receptor can bind (Figure 12-13A). 

The import receptors are soluble cytosolic proteins that bind both to the 
nuclear localization signal on the protein to be transported and to NPC proteins, 
some of which form the fibrils of the NPC that extend into the cytosol. These fib- 
rils, as well as many of the NPC proteins that line the center of the NPC and con- 
tribute to the diffusion barrier, include a large number of short amino-acid 
repeats that contain phenylalanine and glycine and are therefore called FG- 
repeats (named after the one-letter code for these amino acids, discussed in 
Chapter 3). FG-repeats serve as binding sites for the import receptors. They are 
thought to line the path through the NPCs taken by the import receptors and 
their bound cargo proteins. The receptor-cargo complexes move along the 
transport path by repeatedly binding, dissociating, and then re-binding to adja- 
cent FG-repeat sequences. In this way, the complexes hop from one NPC protein 
to another to traverse the tangled interior of the NPC. Once inside the nucleus, 
the import receptors dissociate from their cargo and return to the cytosol. 

Nuclear import receptors do not always bind to nuclear proteins directly. 
Additional adaptor proteins sometimes form a bridge between the import recep- 
tors and the nuclear localization signals on the proteins to be transported (Figure 
12-13B). Some adaptor proteins are structurally related to nuclear import recep- 
tors, suggesting a common evolutionary origin. By using a variety of different 



Figure 1 2-1 1 The function of a nuclear 
localization signal. Immunofluorescence 
micrographs showing the cell location of 
SV40 virus T-antigen containing or 
lacking a short sequence that serves as a 
nuclear localization signal. (A) The normal 
T-antigen protein contains the lysine-rich 
sequence indicated and is imported to its 
site of action in the nucleus, as indicated 
by immunofluorescence staining with 
antibodies against the T-antigen. 
(B) T-antigen with an altered nuclear 
localization signal (a threonine replacing 
a lysine) remains in the cytosol. (From 
D. Kalderon, B. Roberts, W. Richardson 
and A. Smith, Cell 39:499-509, 1984. 
With permission from Elsevier.) 
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Figure 1 2-1 2 Visualizing active import 
through NPCs. This series of electron 
micrographs shows colloidal gold spheres 
(arrowheads) coated with peptides 
containing nuclear localization signals 
entering the nucleus through NPCs. Gold 
particles were injected into living cells, 
which then were fixed and prepared for 
electron microscopy at various times after 
injection. (A) Gold particles are first seen 
in proximity to the cytosolic fibrils of the 
NPCs. (B, C) They migrate to the center of 
the NPCs, where they are first seen 
exclusively on the cytosolic face. (D) They 
then appear on the nuclear face. These 
gold particles have much larger diameters 
than the diffusion channel in the NPC and 
are imported by active transport. (From 
N. Pante and U. Aebi, Science 
273:1729-1732, 1996. With permission 
from AAAS.) 
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import receptors and adaptors, cells are able to recognize the broad repertoire of 
nuclear localization signals that are displayed on nuclear proteins. 



Nuclear Export Works Like Nuclear Import, But in Reverse 

The nuclear export of large molecules, such as newribosomal subunits and RNA 
molecules, occurs through NPCs and also depends on a selective transport sys- 
tem. The transport system relies on nuclear export signals on the macro- 
molecules to be exported, as well as on complementary nuclear export recep- 
tors. These receptors bind to both the export signal and NPC proteins to guide 
their cargo through the NPC to the cytosol. 

Many nuclear export receptors are structurally related to nuclear import 
receptors, and they are encoded by the same gene family of nuclear transport 
receptors, or karyopherins. In yeast, there are 14 genes encoding members of 
this family; in animal cells, the number is significantly larger. It is often not pos- 
sible to tell from their amino acid sequence alone whether a particular family 
member works as a nuclear import or nuclear export receptor. As might be 
expected, therefore, the import and export transport systems work in similar 
ways but in opposite directions: the import receptors bind their cargo molecules 
in the cytosol, release them in the nucleus, and are then exported to the cytosol 
for reuse, while the export receptors function in the opposite fashion. 



Figure 12-13 Nuclear import receptors. 

(A) Many nuclear import receptors bind 
both to NPC proteins and to a nuclear 
localization signal on the cargo proteins 
they transport. Cargo proteins 1, 2, and 3 
in this example contain different nuclear 
localization signals, and therefore each 
binds to different nuclear import 
receptors. (B) Cargo protein 4 requires an 
adaptor protein to bind to its nuclear 
import receptor. The adaptors are 
structurally related to nuclear import 
receptors and recognize nuclear 
localization signals on cargo proteins. 
They also contain a nuclear localization 
signal that binds them to an import 
receptor. 



The Ran GTPase Imposes Directionality on Transport Through 
NPCs 

The import of nuclear proteins through the NPC concentrates specific proteins 
in the nucleus and thereby increases order in the cell. The cell obtains the energy 
it needs for this process through the hydrolysis of GTP by the monomeric 
GTPase Ran. Ran is found in both the cytosol and the nucleus, and it is required 
for both nuclear import and export. 

Like other GTPases, Ran is a molecular switch that can exist in two confor- 
mational states, depending on whether GDP or GTP is bound (discussed in 
Chapter 3) . Two Ran-specific regulatory proteins trigger the conversion between 
the two states: a cytosolic GTPase- activating protein (GAP) triggers GTP hydrol- 
ysis and thus converts Ran-GTP to Ran-GDP, and a nuclear guanine exchange 
factor (GEF) promotes the exchange of GDP for GTP and thus converts Ran-GDP 
to Ran-GTP. Because Ran-GAP is located in the cytosol and Ran-GEF is located 
in the nucleus, the cytosol contains mainly Ran-GDP, and the nucleus contains 
mainly Ran-GTP (Figure 12-14). 

This gradient of the two conformational forms of Ran drives nuclear trans- 
port in the appropriate direction (Figure 12-15). Docking of nuclear import 
receptors to FG-repeats on the cytosolic side of the NPC, for example, occurs 
whether or not these receptors are loaded with an appropriate cargo. Import 
receptors then hop from FG-repeat to FG-repeat. If they reach the nuclear side 
of the pore complex, Ran-GTP binds to them, and, if they arrive loaded with 
cargo molecules, the Ran-GTP binding causes the import receptors to release 
their cargo (Figure 12-16). Because the Ran-GDP in the cytosol does not bind to 
the cargo receptors, unloading occurs only on the nuclear side of the NPC. In 
this way, the nuclear localization of Ran-GTP creates the directionality. 
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Having discharged its cargo in the nucleus, the empty import receptor with 
Ran-GTP bound is transported back through the pore complex to the cytosol. 
There, Ran-GAP triggers Ran-GTP to hydrolyze its bound GTP, thereby convert- 
ing it to Ran- GDP. The import receptor is then ready for another cycle of nuclear 
import. 

Nuclear export occurs by a similar mechanism, except that Ran-GTP in the 
nucleus promotes cargo binding to the export receptor, rather than dissociating 
it. Once the export receptor moves through the pore to the cytosol, its Ran-GTP 
encounters Ran-GAP and hydrolyzes GTP. As a result, the export receptor 
releases both its cargo and Ran-GDP in the cytosol. Free export receptors are 
then returned to the nucleus to complete the cycle (see Figure 12-15). 



Figure 12-1 4 The compartmentalization 
of Ran-GDP and Ran-GTP. Localization of 
Ran-GDP in the cytosol and Ran-GTP in 
the nucleus results from the localization 
of two Ran regulatory proteins: Ran 
GTPase-activating protein (Ran-GAP) is 
located in the cytosol and Ran guanine 
nucleotide exchange factor (Ran-GEF) 
binds to chromatin and is therefore 
located in the nucleus. 



Transport Through NPCs Can Be Regulated by Controlling Access 
to the Transport Machinery 

Some proteins, such as those that bind newly made mRNAs in the nucleus, con- 
tain both nuclear localization signals and nuclear export signals. These proteins 
continually shuttle back and forth between the nucleus and the cytosol. The rel- 
ative rates of their import and export determine the steady-state localization of 
such shuttling proteins. If the rate of import exceeds the rate of export, a protein 
will be located mainly in the nucleus. Conversely, if the rate of export exceeds 
the rate of import, a protein will be located mainly in the cytosol. Thus, chang- 
ing the rate of import, export, or both, can change the location of a protein. 
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Figure 12-15 A model explaining how 
GTP hydrolysis by Ran in the cytosol 
provides directionality to nuclear 
transport. Movement through the NPC of 
loaded nuclear transport receptors 
occurs along the FG-repeats displayed by 
certain NPC proteins. The differential 
localization of Ran-GTP in the nucleus 
and Ran-GDP in the cytosol provides 
directionality (red arrows) to both nuclear 
import (left) and nuclear export (right). 
Ran-GAP stimulates the hydrolysis of GTP 
to produce Ran-GDP on the cytosolic side 
of the NPC (see Figure 12-14). 

Ran-GDP is imported into the nucleus 
by its own import receptor, which is 
specific for the GDP-bound conformation 
of Ran. The Ran-GDP receptor is 
structurally unrelated to the main family 
of nuclear transport receptors. However, 
it also binds to FG-repeats in NPC 
proteins and hops across the NPC. 
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Some shuttling proteins move continuously into and out of the nucleus. In 
other cases, however, the transport is stringently controlled. As discussed in 
Chapter 7, cells control the activity of some gene regulatory proteins by keeping 
them out of the nucleus until they are needed there (Figure 12-17). In many 
cases, cells control transport by regulating nuclear localization and export sig- 
nals; these can be turned on or off, often by phosphorylation of amino acids 
close to the signal sequences (Figure 12-18). 

Other gene regulatory proteins are bound to inhibitory cytosolic proteins 
that either anchor them in the cytosol (through interactions with the cytoskele- 
ton or specific organelles) or mask their nuclear localization signals so that they 
cannot interact with nuclear import receptors. An appropriate stimulus releases 
the gene regulatory protein from its cytosolic anchor or mask, and it is then 
transported into the nucleus. One important example is the latent gene regula- 
tory protein that controls the expression of proteins involved in cholesterol 
metabolism. The protein is made and stored in an inactive form as a transmem- 
brane protein in the ER. When deprived of cholesterol, the protein exits the ER 
to the Golgi apparatus where it meets specific proteases that cleave it, releasing 
its cytosolic domain into the cytosol. This domain is then imported into the 
nucleus, where it activates the transcription of genes required for both choles- 
terol import and synthesis. 

As we discussed in detail in Chapter 6, cells control the export of mRNA from 
the nucleus in a similar way. Proteins that guide the mRNA out of the nucleus load 
onto the RNA as transcription and splicing proceed. Upon entry into the cytosol, 
the proteins are stripped off and rapidly returned to the nucleus. Other RNAs, such 
as snRNAs and tRNAs, are exported by different nuclear export receptors. 
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Figure 12-16 How the binding of 
Ran-GTP can cause nuclear import 
receptors to release their cargo. (A) The 

structure of a nuclear transport receptor 
with bound Ran-GTP. The receptor 
contains repeated a-helical motifs that 
stack on top of each other, forming a 
flexible springlike structure, which can 
adopt multiple conformations in 
response to the binding of cargo proteins 
and Ran-GTP. Note that cargo proteins 
and Ran-GTP bind to different regions of 
the coiled spring. The Ran-GTP partly 
covers a conserved loop (red) of the 
receptor, which, in the Ran-free state, is 
thought to be important for cargo 
binding. (B) The cycle of loading in the 
cytosol and unloading in the nucleus of a 
nuclear import receptor. (A, adapted from 
Y.M. Chook and G. Blobel, Nature 
399:230-237, 1999. With permission from 
Macmillan Publishers Ltd.) 



During Mitosis the Nuclear Envelope Disassembles 

The nuclear lamina, located on the nuclear side of the inner nuclear membrane, 
is a meshwork of interconnected protein subunits called nuclear lamins. The 
lamins are a special class of intermediate filament proteins (discussed in Chapter 
16) that polymerize into a two-dimensional lattice (Figure 12-19). The nuclear 
lamina gives shape and stability to the nuclear envelope, to which it is anchored 
by attachment to both the NPCs and integral membrane proteins of the inner 
nuclear membrane. The lamina also interacts directly with chromatin, which 

Figure 1 2-1 7 The control of nuclear transport in the fly embryo. The 

gene regulatory protein dorsal is expressed uniformly throughout this early 
Drosophila embryo, which is shown in cross section. The protein has been 
stained with an enzyme-coupled antibody that yields a brown product. It is 
active only in cells at the ventral side (bottom) of the embryo, where it is 
found in nuclei. (Courtesy of Siegfried Roth.) 
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itself interacts with integral membrane proteins of the inner nuclear membrane. 
Together with the lamina, these inner membrane proteins provide structural 
links between the DNA and the nuclear envelope. 

When a nucleus disassembles during mitosis, the nuclear lamina depoly- 
merizes. The disassembly is at least partly a consequence of direct phosphoryla- 
tion of the nuclear lamins by the cyclin- dependent protein kinase Cdk that is 
activated at the onset of mitosis (discussed in Chapter 16). At the same time, 
proteins of the inner nuclear membrane are phosphorylated, and the NPCs dis- 
assemble and disperse in the cytosol. During this process, some NPC proteins 
become bound to nuclear import receptors, which play an important part in the 
reassembly of NPCs at the end of mitosis. Nuclear envelope membrane pro- 
teins — no longer tethered to the pore complexes, lamina, or chromatin — dis- 
perse throughout the ER membrane. The dynein motor protein, which moves 
along microtubules (discussed in Chapter 17), actively participates in tearing 
the nuclear envelope off the chromatin. Together, these processes break down 
the barriers that normally separate the nucleus and cytosol, and the nuclear pro- 
teins that are not bound to membranes or chromosomes intermix completely 
with those of the cytosol (Figure 12-20). 

Later in mitosis, the nuclear envelope reassembles on the surface of the 
chromosomes. In addition to its crucial role in nuclear transport, Ran-GTPase 
also acts as a positional marker for chromatin during cell division, when the 
nuclear and cytosolic components intermix. Because Ran-GEF remains bound 
to chromatin when the nuclear envelope breaks down, Ran molecules close to 
chromatin are mainly in their GTP-bound conformation. By contrast, Ran 
molecules further away have a high likelihood of encountering Ran- GAP, which 
is distributed throughout the cytosol; these Ran molecules are therefore mainly 
in their GDP-bound conformation. 

Chromatin in mitotic cells is therefore surrounded by a cloud of Ran-GTP. 
This cloud locally displaces nuclear import receptors from NPC proteins, which 
starts the assembly process of NPCs attached to the chromosome surface. At the 
same time, inner nuclear membrane proteins and dephosphorylated lamins 
bind again to chromatin. ER membranes wrap around groups of chromosomes 
and continue fusing until they form a sealed nuclear envelope. During this pro- 
cess, the NPCs start actively reimporting proteins that contain nuclear localiza- 
tion signals. Because the nuclear envelope is initially closely applied to the sur- 
face of the chromosomes, the newly formed nucleus excludes all proteins except 
those initially bound to the mitotic chromosomes and those that are selectively 
imported through NPCs. In this way, all other large proteins are kept out of the 
newly assembled nucleus. 



Figure 1 2-1 8 The control of nuclear 
import during T cell activation. <AGTT> 
The nuclear factor of activated T cells 
(NF-AT) is a gene regulatory protein that, 
in the resting T cell, is found in the 
cytosol in a phosphorylated state. When 
T cells are activated by foreign antigen 
(discussed in Chapter 25), the 
intracellular Ca 2+ concentration increases. 
In high Ca 2+ , the protein phosphatase 
calcineurin binds to NF-AT and 
dephosphorylates it. The 
dephosphorylation exposes nuclear 
import signals and blocks a nuclear 
export signal. The complex of NF-AT and 
calcineurin is then imported into the 
nucleus, where NF-AT activates the 
transcription of numerous genes required 
for T cell activation. 

The response shuts off when Ca 2+ levels 
decrease, releasing NF-AT from 
calcineurin. Rephosphorylation of NF-AT 
inactivates the nuclear import signals 
and re-exposes the nuclear export signal, 
causing NF-AT to relocate to the cytosol. 
Some of the most potent 
immunosuppressive drugs, including 
cyclosporin A and FK506, inhibit the 
ability of calcineurin to dephosphorylate 
NF-AT and thereby block the nuclear 
accumulation of NF-AT and T cell 
activation. 
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Figure 1 2-1 9 The nuclear lamina. An 

electron micrograph of a portion of the 
nuclear lamina in a Xenopus oocyte 
prepared by freeze-drying and metal 
shadowing. The lamina is formed by a 
regular lattice of specialized intermediate 
filaments. (Courtesy of Ueli Aebi.) 



71 2 Chapter 1 2: Intracellular Compartments and Protein Sorting 



nuclear pore lamins DNA 
complex 



nuclear envelope 



inner nuclear membrane 
outer nuclear membrane 



FUSION OF ENVELOPED 
CHROMOSOMES 



nuclear pore 

complex 

proteins 




nuclear envelope 
fragment 



LATE 
TELOPHASE 



FUSION OF NUCLEAR 
ENVELOPE FRAGMENTS 



EARLY TELOPHASE 



Nuclear localization signals are not cleaved off after transport into the 
nucleus, presumably because nuclear proteins need to be imported repeatedly, 
once after every cell division. In contrast, once a protein molecule has been 
imported into any of the other membrane-enclosed organelles, it is passed on 
from generation to generation within that compartment and does not need to 
be translocated ever again; the signal sequence on these molecules is often 
removed after protein translocation. 

As we discuss in Chapter 17, the cloud of Ran-GTP surrounding chromatin 
is also important in assembling the mitotic spindle in a dividing cell. 



Summary 

The nuclear envelope consists of an inner and an outer nuclear membrane. The outer 
membrane is continuous with the ER membrane, and the space between it and the 
inner membrane is continuous with theER lumen. RNA molecules, which are made in 
the nucleus, and ribosomal subunits, which are assembled there, are exported to the 
cytosol; in contrast, all the proteins that function in the nucleus are synthesized in the 
cytosol and are then imported. The extensive traffic of materials between the nucleus 
and cytosol occurs through nuclear pore complexes (NPCs), which provide a direct 
passageway across the nuclear envelope. Small molecules diffuse passively through the 
NPCs, but large macromolecules have to be actively transported. 

Proteins containing nuclear localization signals are actively transported inward 
through NPCs, while RNA molecules and newly made ribosomal subunits contain 
nuclear export signals, which direct their outward active transport through NPCs. Some 
proteins, including the nuclear import and export receptors, continually shuttle between 
the cytosol and nucleus. Ran-GTPase provides both the free energy and the directionality 
for nuclear transport. Cells regulate the transport of nuclear proteins and RNA molecules 
through the NPCs by controlling the access of these molecules to the transport machinery. 
Because nuclear localization signals are not removed, nuclear proteins can be imported 
repeatedly, as is required each time that the nucleus reassembles after mitosis. 



Figure 12-20 The breakdown and 
re-formation of the nuclear envelope 
during mitosis. Phosphorylation of the 
lamins is thought to trigger the 
disassembly of the nuclear lamina, which 
helps the nuclear envelope to break up. 
Dephosphorylation of the lamins is 
thought to help reverse the process. An 
analogous phosphorylation and 
dephosphorylation cycle occurs for some 
nucleoporins and proteins of the inner 
nuclear membrane, and some of these 
dephosphorylations are also involved in 
the reassembly process shown. 

As indicated, the nuclear envelope 
initially reforms around individual 
decondensing chromatids. Eventually as 
decondensation progresses, these 
structures fuse to form a single complete 
nucleus. 
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THE TRANSPORT OF PROTEINS INTO 
MITOCHONDRIA AND CHLOROPLASTS 

Mitochondria and chloroplasts are double-membrane-enclosed organelles (dis- 
cussed in Chapter 14). They specialize in ATP synthesis, using energy derived 
from electron transport and oxidative phosphorylation in mitochondria and 
from photosynthesis in chloroplasts. Although both organelles contain their 
own DNA, ribosomes, and other components required for protein synthesis, 
most of their proteins are encoded in the cell nucleus and imported from the 
cytosol. Each imported protein must reach the particular organelle subcom- 
partment in which it functions. 

There are two subcompartments in mitochondria: the internal matrix space 
and the intermembrane space. These compartments are formed by the two 
concentric mitochondrial membranes: the inner membrane, which encloses 
the matrix space and forms extensive invaginations called cristae, and the outer 
membrane, which is in contact with the cytosol (Figure 12-21A). Chloroplasts 
have the same two subcompartments plus an additional sub compartment, the 
thylakoid space, which is surrounded by the thylakoid membrane (Figure 
12-2 IB). Each of the subcompartments in mitochondria and chloroplasts con- 
tains a distinct set of proteins. 

New mitochondria and chloroplasts are produced by the growth of preexist- 
ing organelles, followed by fission (discussed in Chapter 14). The growth depends 
mainly on the import of proteins from the cytosol. The imported proteins must 
be transported across a number of membranes in succession and end up in the 
appropriate place. The process of protein movement across membranes is often 
called protein translocation. This section explains how it occurs. 
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Translocation into Mitochondria Depends on Signal Sequences 
and Protein Translocators 

Proteins imported into mitochondria are usually taken up from the cytosol 
within seconds or minutes of their release from ribosomes. Thus, in contrast to 
the protein translocation into the ER, described later, mitochondrial proteins are 
first fully synthesized as mitochondrial precursor proteins in the cytosol and 
then translocated into mitochondria by a post-translational mechanism. One or 
more signal sequences direct all mitochondrial precursor proteins to their appro- 
priate mitochondrial sub compartment. Many proteins entering the matrix space 
contain a signal sequence at their N-terminus that a signal peptidase rapidly 
removes after import. Others, including all outer membrane and many inner 
membrane and intermembrane space proteins, have an internal signal sequence 
that is not removed. The signal sequences are both necessary and sufficient for 
the import and correct localization of the proteins: when genetic engineering 
techniques are used to link these signals to a cytosolic protein, the signals direct 
the protein to the correct mitochondrial subcompartment. 
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Figure 12-21 The subcompartments of 
mitochondria and chloroplasts. In 

contrast to the cristae of mitochondria 
(A), the thylakoids of chloroplasts (B) are 
not connected to the inner membrane 
and therefore form a compartment with a 
separate internal space (see Figure 12-3). 
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Figure 12-22 A signal sequence for mitochondrial protein import. Cytochrome oxidase is a large multiprotein complex 
located in the inner mitochondrial membrane, where it functions as the terminal enzyme in the electron-transport chain 
(discussed in Chapter 14). (A) The first 18 amino acids of the precursor to subunit IV of this enzyme serve as a signal 
sequence for import of the subunit into the mitochondrion. (B) When the signal sequence is folded as an a helix, the 
positively charged residues (red) are clustered on one face of the helix, while the nonpolar residues (yellow) are clustered 
primarily on the opposite face. Amino acids with uncharged polar side chains are shaded blue. Signal sequences that direct 
proteins into the matrix space always have the potential to form such an amphiphilic a helix, which is recognized by specific 
receptor proteins on the mitochondrial surface. (C) The structure of a signal sequence of alcohol dehydrogenase, another 
mitochondrial matrix enzyme, bound to an import receptor was determined by NMR.The amphiphilic a helix binds with its 
hydrophobic face to a hydrophobic groove in the receptor. (C, adapted from Y. Abe et al., Cell 1 00:55 1 -560, 2000. With 
permission from Elsevier.) 

The signal sequences that direct precursor proteins into the mitochondrial 
matrix space are best understood. They all form an amphiphilic a helix, in which 
positively charged residues cluster on one side of the helix, while uncharged 
hydrophobic residues cluster on the opposite side. Specific receptor proteins 
that initiate protein translocation recognize this configuration rather than the 
precise amino acid sequence of the signal sequence (Figure 12-22). 

Multisubunit protein complexes that function as protein translocators 
mediate protein translocation across mitochondrial membranes. The TOM 
complex transfers proteins across the outer membrane, and two TIM com- 
plexes (TIM23 and TIM22) transfer proteins across the inner membrane (Figure 
12-23). (TOM and TIM stand for translocase of the outer and inner mitochon- 
drial membranes, respectively.) These complexes contain some components 
that act as receptors for mitochondrial precursor proteins, and other compo- 
nents that form the translocation channels. 
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Figure 12-23 The protein translocators 
in the mitochondrial membranes. The 

TOM, TIM, SAM, and OXA complexes are 
multimeric membrane protein assemblies 
that catalyze protein transport across 
mitochondrial membranes. The protein 
components of the TIM22 and TIM23 
complexes that line the import channel 
are structurally related, suggesting a 
common evolutionary origin of both TIM 
complexes. As indicated, one of the core 
components of theTIM23 complex 
contains a hydrophobic a-helical 
extension that is inserted into the outer 
mitochondrial membrane; the complex is 
therefore unusual in that it 
simultaneously spans two membranes. 

On the matrix side, theTIM23 complex 
is bound to a protein complex containing 
mitochondrial Hsp70, which acts as an 
import ATPase using ATP hydrolysis to 
pull proteins through the pore. 
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The TOM complex is required for the import of all nucleus- encoded mito- 
chondrial proteins. It initially transports their signal sequences into the inter- 
membrane space and helps to insert transmembrane proteins into the outer 
membrane. (3-Barrel proteins, which are particularly abundant in the outer mem- 
brane, are then passed on to an additional translocator, the SAM complex, which 
helps them to fold properly in the outer membrane. The TIM23 complex trans- 
ports some soluble proteins into the matrix space and helps to insert transmem- 
brane proteins into the inner membrane. The TIM22 complex mediates the inser- 
tion of a subclass of inner membrane proteins, including the transporter that 
moves ADP, ATP, and phosphate in and out of mitochondria. Yet another protein 
translocator in the inner mitochondrial membrane, the OXA complex, mediates 
the insertion of those inner membrane proteins that are synthesized within mito- 
chondria. It also helps to insert some imported inner membrane proteins that are 
initially transported into the matrix space by the other complexes. 



Mitochondrial Precursor Proteins Are Imported as Unfolded 
Polypeptide Chains 

We have learned almost everything we know about the molecular mechanism of 
protein import into mitochondria from analyses of cell-free, reconstituted trans- 
port systems, in which purified mitochondria in a test tube import radiolabeled 
mitochondrial precursor proteins. By changing the conditions in the test tube, it 
is possible to establish the biochemical requirements for the import. 

Mitochondrial precursor proteins do not fold into their native structures 
after they are synthesized; instead, they remain unfolded in the cytosol through 
interactions with other proteins. Some of these interacting proteins are general 
chaperone proteins of the Hsp70 family (discussed in Chapter 6), whereas others 
are dedicated to mitochondrial precursor proteins and bind directly to their sig- 
nal sequences. All the interacting proteins help to prevent the precursor proteins 
from aggregating or folding up spontaneously before they engage with the TOM 
complex in the outer mitochondrial membrane. As a first step in the import pro- 
cess, the import receptors of the TOM complex bind the signal sequence of the 
mitochondrial precursor protein. The interacting proteins are then stripped off, 
and the unfolded polypeptide chain is fed — signal sequence first — into the 
translocation channel. 

In principle, a protein could reach the mitochondrial matrix space by either 
crossing the two membranes all at once or crossing one at a time. One can dis- 
tinguish between these possibilities by cooling a cell-free mitochondrial import 
system to arrest the proteins at an intermediate step in the translocation pro- 
cess. The result is that the arrested proteins no longer contain their N-terminal 
signal sequence, indicating that the N-terminus must be in the matrix space 
where the signal peptidase is located, but the bulk of the protein can still be 
attacked from outside the mitochondria by externally added proteolytic 
enzymes (Figure 12-24). Clearly, the precursor proteins can pass through both 
mitochondrial membranes at once to enter the matrix space (Figure 12-25). It is 
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Figure 12-24 Proteins transiently span 
the inner and outer mitochondrial 
membranes during their translocation 
into the matrix space. When isolated 
mitochondria are incubated with a 
precursor protein at 5°C, the precursor is 
only partly translocated across 
mitochondrial membranes. The N-terminal 
signal sequence (red) is cleaved off in the 
matrix space; but most of the polypeptide 
chain remains outside the mitochondrion, 
where it is accessible to externally added 
proteolytic enzymes. Upon warming to 
25°C, the translocation is completed. Once 
inside the mitochondrion, the polypeptide 
chain is protected from added proteolytic 
enzymes. As a control, when detergents 
are added to disrupt the mitochondrial 
membranes, the imported proteins can be 
readily digested by the same protease 
treatment. 
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thought that the TOM complex first transports the signal sequence across the 
outer membrane to the intermembrane space, where it binds to a TIM complex, 
opening the channel in the complex. The polypeptide chain then either enters 
the matrix space or inserts into the inner membrane. 

Although the TOM and TIM complexes usually work together to transport 
precursor proteins across both membranes at the same time, they can work 
independently. In isolated outer membranes, for example, the TOM complex 
can translocate the signal sequence of precursor proteins across the membrane. 
Similarly, if the outer membrane is experimentally disrupted in isolated mito- 
chondria, the exposed TIM23 complex can efficiently import precursor proteins 
into the matrix space. 



Figure 12-25 Protein import by 
mitochondria. <ACGG>The N-terminal 
signal sequence of the mitochondrial 
precursor protein is recognized by 
receptors of the TOM complex. The protein 
is then translocated through theTIM23 
complex so that it transiently spans both 
mitochondrial membranes. The signal 
sequence is cleaved off by a signal 
peptidase in the matrix space to form the 
mature protein. The free signal sequence is 
then rapidly degraded (not shown). 



ATP Hydrolysis and a Membrane Potential Drive Protein Import 
Into the Matrix Space 

Directional transport requires energy, which in most biological systems is sup- 
plied by ATP hydrolysis. ATP hydrolysis fuels mitochondrial protein import at 
two discrete sites, one outside the mitochondria and one in the matrix space 
(Figure 12-26). In addition, protein import requires another energy source, 
which is the membrane potential across the inner mitochondrial membrane. 

The first requirement for energy occurs at the initial stage of the transloca- 
tion process, when the unfolded precursor protein, associated with chaperone 
proteins, interacts with the import receptors of the TOM complex. As discussed 
in Chapter 6, the binding and release of newly synthesized polypeptides from 
the Hsp70 family of chaperone proteins requires ATP hydrolysis. The require- 
ment for Hsp70 and ATP in the cytosol can be bypassed if the precursor protein 
is artificially unfolded prior to adding it to purified mitochondria. 

Once the signal sequence has passed through the TOM complex and is 
bound to a TIM complex, further translocation through the TIM translocation 
channel requires the membrane potential, which is the electrical component of 
the electrochemical H + gradient across the inner membrane (see Figure 11-4). 
Pumping of H + from the matrix space to the intermembrane space, driven by 
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Figure 1 2-26 The role of energy in 
protein import into the mitochondrial 
matrix space. (1) Bound cytosolic Hsp70 is 
released from the protein in a step that 
depends on ATP hydrolysis. After initial 
insertion of the signal sequence and of 
adjacent portions of the polypeptide 
chain into the TOM complex, the signal 
sequence interacts with a TIM complex. 

(2) The signal sequence is then 
translocated into the matrix space in a 
process that requires a membrane 
potential across the inner membrane. 

(3) Mitochondrial Hsp70, which is part of 
an import ATPase complex, binds to 
regions of the polypeptide chain as they 
become exposed in the matrix space, 
pulling the protein through the 
translocation channel. 
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electron transport processes in the inner membrane (discussed in Chapter 14), 
maintains the electrochemical gradient. The energy in the electrochemical H + 
gradient across the inner membrane not only helps drive most of the cell's ATP 
synthesis, but it also drives the translocation of the positively charged signal 
sequences through the TIM complexes by electrophoresis. 

Mitochondrial Hsp70 also plays a crucial part in the import process. Mito- 
chondria containing mutant forms of the protein fail to import precursor pro- 
teins. The Hsp70 is part of a multisubunit protein assembly that is bound to the 
matrix side of the TIM23 complex and acts as a motor to pull the precursor pro- 
tein into the matrix space. Like its cytosolic cousin, mitochondrial Hsp70 has a 
high affinity for unfolded polypeptide chains, and it binds tightly to an imported 
protein as soon as the protein emerges from the TIM translocator in the matrix 
space. The Hsp70 then releases the protein in an ATP-dependent step. This 
energy-driven cycle of binding and subsequent release is thought to provide the 
final driving force needed to complete protein import after a protein has initially 
inserted into the TIM23 complex (see Figure 12-26). 

After the initial interaction with mitochondrial Hsp70, many imported 
matrix proteins are passed on to another chaperone protein, mitochondrial 
Hsp60. As discussed in Chapter 6, Hsp60 helps the unfolded polypeptide chain 
to fold by binding and releasing it through cycles of ATP hydrolysis. 



Bacteria and Mitochondria Use Similar Mechanisms to Insert 
Porins into their Outer Membrane 

The outer mitochondrial membrane, like the outer membrane of Gram- negative 
bacteria (see Figure 11-18), contains abundant pore-forming proteins called 
porins and is thus freely permeable to inorganic ions and metabolites (but not 
to most proteins). Porins are (3-barrel proteins (see Figure 10-26) and are first 
imported through the TOM complex (Figure 12-27). In contrast to other outer 
membrane proteins, which are anchored in the membrane through a-helical 
regions, the TOM complex cannot integrate porins into the lipid bilayer. Instead, 
porins are first transported into the intermembrane space, where they tran- 
siently bind specialized chaperone proteins, which keep the porins from aggre- 
gating. They then bind to the SAM complex in the outer membrane, which both 
inserts them into the outer membrane and helps them fold properly. 

One of the central subunits of the SAM complex is homologous to a bacterial 
outer membrane protein that helps insert (3-barrel proteins into the bacterial 
outer membrane from the periplasmic space (the topological equivalent of the 
intermembrane space in mitochondria). This conserved pathway for inserting (3- 
barrel proteins is further evidence for the endosymbiotic origin of mitochondria. 



Transport Into the Inner Mitochondrial Membrane and 
Intermembrane Space Occurs Via Several Routes 

The same mechanism that transports proteins into the matrix space, using the 
TOM and TIM23 translocators (see Figure 12-25), also mediates the initial 
translocation of many proteins that are destined for the inner mitochondrial 
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Figure 12-27 Integration of porins into 
the outer mitochondrial membrane. 

After translocation through the TOM 
complex, p-barrel proteins bind to 
chaperones in the intermembrane space. 
The SAM complex then inserts the 
unfolded polypeptide chain into the 
outer membrane. 
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Figure 1 2-28 Protein import from the cytosol into the inner mitochondrial membrane and 
intermembrane space. (A) The N-terminal signal sequence [red) initiates import into the matrix space (see 
Figure 1 2-25). A hydrophobic sequence [orange) that follows the matrix-targeting signal binds to the TIM23 
translocator in the inner membrane and stops translocation. The remainder of the protein is then pulled into 
the intermembrane space through the TOM translocator in the outer membrane, and the hydrophobic 
sequence is released into the inner membrane. (B) A second route for protein integration into the inner 
membrane first delivers the protein completely into the matrix space. Cleavage of the signal sequence (red) 
used for the initial translocation unmasks an adjacent hydrophobic signal sequence (orange) at the new 
N-terminus. This signal then directs the protein into the inner membrane, using the same OXA-dependent 
pathway that inserts proteins that are encoded by the mitochondrial genome and translated in the matrix 
space. (C) Some soluble proteins of the intermembrane space also use the pathways shown in (A) and (B) 
before they are released into the intermembrane space by a second signal peptidase, which has its active site 
in the intermembrane space and removes the hydrophobic signal sequence. (D) Metabolite transporters 
contain internal signal sequences and snake through the TOM complex as a loop. They then bind to the 
chaperones in the intermembrane space, which guide the proteins to the TIM22 complex. The TIM22 
complex is specialized for the insertion of multipass inner membrane proteins. 



membrane or the intermembrane space. In the most common translocation 
route taken, only the N-terminal signal sequence of the transported protein 
actually enters the matrix space (Figure 12-28A). A hydrophobic amino acid 
sequence, strategically placed after the N-terminal signal sequence, acts as a 
stop-transfer sequence, preventing further translocation across the inner mem- 
brane. The TOM complex pulls the remainder of the protein through the outer 
membrane into the intermembrane space; the signal sequence is cleaved off in 
the matrix; and the hydrophobic sequence, released from TIM23, remains 
anchored in the inner membrane. 

In another transport route to the inner membrane or intermembrane space, 
the TIM23 complex initially translocates the entire protein into the matrix space 
(Figure 12-28B). A matrix signal peptidase then removes the N-terminal signal 
sequence, exposing a hydrophobic sequence at the new N-terminus. This signal 
sequence guides the protein to the OXA complex, which inserts the protein into 
the inner membrane (see Figure 12-23). As mentioned earlier, the OXA complex 
is primarily used to insert proteins that are encoded and translated in the mito- 
chondrion into the inner membrane, and only a few imported proteins use this 
pathway. Translocators that are closely related to the OXA complex are found in 
the plasma membrane of bacteria and in the thylakoid membrane of chloro- 
plasts, where they are thought to help insert membrane proteins by a similar 
mechanism. 
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Many proteins that use these pathways to the inner membrane remain 
anchored there through their hydrophobic signal sequence (see Figure 12-28A 
and B). Others, however, are released into the intermembrane space by a pro- 
tease that removes the membrane anchor (Figure 12-28C). Many of these 
cleaved proteins remain attached to the outer surface of the inner membrane as 
peripheral subunits of protein complexes that also contain transmembrane pro- 
teins. 

Mitochondria are the principal sites of ATP synthesis in the cell, but they also 
contain many metabolic enzymes, such as those of the citric acid cycle. Thus, in 
addition to proteins, mitochondria must also transport small metabolites across 
their membranes. While the outer membrane contains porins, which make the 
membrane freely permeable to such small molecules, the inner membrane does 
not. Instead, a family of metabolite- specific transporters transfers a vast number 
of small molecules across the inner membrane. In yeast cells, these transporters 
comprise a family of 35 different proteins, the most abundant of which transport 
ATP, ADP, and phosphate. These are multipass transmembrane proteins, which 
do not have cleavable signal sequences at their N-termini but instead contain 
internal signal sequences. They cross the TOM complex in the outer membrane, 
and intermembrane space chaperones guide them to the TIM22 complex, which 
inserts them into the inner membrane by a process that requires the membrane 
potential, but not mitochondrial Hsp70 or ATP (Figure 12-28D). An energetically 
favorable partitioning of the hydrophobic transmembrane regions into the inner 
membrane is also likely to help drive this process. 

Two Signal Sequences Direct Proteins to theThylakoid Membrane 
in Chloroplasts 

Protein transport into chloroplasts resembles transport into mitochondria. 
Both processes occur post-translationally, use separate translocation complexes 
in each membrane, require energy, and use amphiphilic N-terminal signal 
sequences that are removed after use. With the exception of some of the chaper- 
one molecules, however, the protein components that form the translocation 
complexes differ. Moreover, whereas mitochondria harness the electrochemical 
H + gradient across their inner membrane to drive transport, chloroplasts, which 
have an electrochemical H + gradient across their thylakoid membrane but not 
their inner membrane, use GTP and ATP hydrolysis to power import across their 
double membrane. The functional similarities may thus result from convergent 
evolution, reflecting the common requirements for translocation across a dou- 
ble membrane. 

Although the signal sequences for import into chloroplasts superficially 
resemble those for import into mitochondria, the same plant cells have both 
mitochondria and chloroplasts, so proteins must partition appropriately 
between them. In plants, for example, a bacterial enzyme can be directed specif- 
ically to mitochondria if it is experimentally joined to an N-terminal signal 
sequence of a mitochondrial protein; the same enzyme joined to an N-terminal 
signal sequence of a chloroplast protein ends up in chloroplasts. Thus, the 
import receptors on each organelle distinguish between the different signal 
sequences. 

Chloroplasts have an extra membrane-enclosed compartment, the thy- 
lakoid. Many chloroplast proteins, including the protein subunits of the photo- 
synthetic system and of the ATP synthase (discussed in Chapter 14) are located 
in the thylakoid membrane. Like the precursors of some mitochondrial proteins, 
the transport of these precursor proteins from the cytosol to their final destina- 
tion occurs in two steps. First, they pass across the double membrane at special 
contact sites into the matrix space (called the stroma in chloroplasts) and then 
they translocate either into the thylakoid membrane or into the thylakoid space 
(Figure 12-29A). The precursors of these proteins have a hydrophobic thylakoid 
signal sequence following the N-terminal chloroplast signal sequence. After the 
N-terminal signal sequence has been used to import the protein into the stroma, 
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Figure 12-29 Translocation of 
chloroplast precursor proteins into the 
thylakoid space. (A) The precursor 
protein contains an N-terminal 
chloroplast signal sequence (red), 
followed immediately by a thylakoid 
signal sequence (orange). The chloroplast 
signal sequence initiates translocation 
into the stroma through a membrane 
contact site by a mechanism similar to 
that used for the translocation of 
mitochondrial precursor proteins into the 
matrix space. The signal sequence is then 
cleaved off, unmasking the thylakoid 
signal sequence, which initiates 
translocation across the thylakoid 
membrane. (B) Translocation into the 
thylakoid space or thylakoid membrane 
can occur by any one of at least four 
routes: (1) a Sec pathway, so called 
because it uses components that are 
homologs of Sec proteins, which mediate 
protein translocation across the bacterial 
plasma membrane (discussed later), 
(2) an SRP-like pathway, so called because 
it uses a chloroplast homolog of the 
signal recognition particle, or SRP 
(discussed later); (3) a 7">A7"(twin arginine 
translocation) pathway, so called because 
two arginines are critical in the signal 
sequences that direct proteins into this 
pathway, which depends on the H + 
gradient across the thylakoid membrane; 
and (4) a spontaneous insertion pathway 
that seems not to require any protein 
translocator. 



a stromal signal peptidase removes it, unmasking the thylakoid signal sequence 
that initiates transport across the thylakoid membrane. There are at least four 
routes by which proteins cross or become integrated into the thylakoid mem- 
brane, distinguished by their need for different stromal chaperones and energy 
sources (Figure 12-29B). 



Summary 

Although mitochondria and chloroplasts have their own genetic systems, they produce 
only a small proportion of their own proteins. Instead, the two organelles import most 
of their proteins from the cytosol, using similar mechanisms. In both cases, proteins are 
transported in an unfolded state across both outer and inner membranes simultane- 
ously into the matrix space or stroma. Both ATP hydrolysis and a membrane potential 
across the inner membrane drive translocation into mitochondria, whereas GTP and 
ATP hydrolysis drive translocation into chloroplasts. Chaperone proteins of the cytoso- 
lic Hsp70 family maintain the precursor proteins in an unfolded state, and a second 
set ofHsp70 proteins in the matrix space or stroma pull the polypeptide chain into the 
organelle. Only proteins that contain a specific signal sequence are translocated. The 
signal sequence can either be located at the N-terminus and cleaved off after import or 
be internal and retained. Transport into the inner membrane sometimes uses a second, 
hydrophobic signal sequence that is unmasked when the first signal sequence is 
removed. In chloroplasts, import from the stroma into the thylakoid can occur by sev- 
eral routes, distinguished by the chaperones and energy source used. 
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PEROXISOMES 

Peroxisomes differ from mitochondria and chloroplasts in many ways. Most 
notably, they are surrounded by only a single membrane, and they do not contain 
DNA or ribosomes. Thus, lacking a genome, all of their proteins are encoded in the 
nucleus. Peroxisomes acquire most of these proteins by selective import from the 
cytosol, although some of them enter the peroxisome membrane via the ER. 

Because we do not discuss peroxisomes elsewhere, we shall digress to con- 
sider some of the functions of this diverse family of organelles, before discussing 
their biosynthesis. All eucaryotic cells have peroxisomes. They contain oxidative 
enzymes, such as catalase and urate oxidase, at such high concentrations that, in 
some cells, the peroxisomes stand out in electron micrographs because of the 
presence of a crystalloid core (Figure 12-30). 

Like mitochondria, peroxisomes are major sites of oxygen utilization. One 
hypothesis is that peroxisomes are a vestige of an ancient organelle that per- 
formed all the oxygen metabolism in the primitive ancestors of eucaryotic cells. 
When the oxygen produced by photo synthetic bacteria first accumulated in the 
atmosphere, it would have been highly toxic to most cells. Peroxisomes might 
have lowered the intracellular concentration of oxygen, while also exploiting its 
chemical reactivity to perform useful oxidation reactions. According to this view, 
the later development of mitochondria rendered peroxisomes largely obsolete 
because many of the same biochemical reactions — which had formerly been 
carried out in peroxisomes without producing energy — were now coupled to 
ATP formation by means of oxidative phosphorylation. The oxidation reactions 
performed by peroxisomes in present-day cells would therefore be those whose 
functions were not taken over by mitochondria. 
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Peroxisomes Use Molecular Oxygen and Hydrogen Peroxide to 
Perform Oxidation Reactions 



Peroxisomes are so named because they usually contain one or more enzymes 
that use molecular oxygen to remove hydrogen atoms from specific organic sub- 
strates (designated here as R) in an oxidation reaction that produces hydrogen 
peroxide (H2O2): 

RH 2 + 2 -> R + H 2 2 

Catalase uses the H2O2 generated by other enzymes in the organelle to oxi- 
dize a variety of other substrates — including phenols, formic acid, formalde- 
hyde, and alcohol — by the "peroxidation" reaction: H2O2 + R'H 2 -> R' + 2H 2 0. 
This type of oxidation reaction is particularly important in liver and kidney cells, 
where the peroxisomes detoxify various toxic molecules that enter the blood- 
stream. About 25% of the ethanol we drink is oxidized to acetaldehyde in this 
way In addition, when excess H2O2 accumulates in the cell, catalase converts it 
to H2O through the reaction 

2H 2 2 -> 2H 2 + 2 

A major function of the oxidation reactions performed in peroxisomes is the 
breakdown of fatty acid molecules. The process called ft oxidation shortens the 
alkyl chains of fatty acids sequentially in blocks of two carbon atoms at a time, 
thereby converting the fatty acids to acetyl CoA. The peroxisomes then export 
the acetyl CoA to the cytosol for reuse in biosynthetic reactions. In mammalian 
cells, (3 oxidation occurs in both mitochondria and peroxisomes; in yeast and 
plant cells, however, this essential reaction occurs exclusively in peroxisomes. 

An essential biosynthetic function of animal peroxisomes is to catalyze the 
first reactions in the formation of plasmalogens, which are the most abundant 
class of phospholipids in myelin (Figure 12-31). Plasmalogen deficiencies cause 
profound abnormalities in the myelination of nerve cell axons, which is why 
many peroxisomal disorders lead to neurological disease. 

Peroxisomes are unusually diverse organelles, and even in the various cell 
types of a single organism they may contain different sets of enzymes. They also 
adapt remarkably to changing conditions. Yeasts grown on sugar, for example, 
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Figure 12-30 An electron micrograph of 
three peroxisomes in a rat liver cell. The 

paracrystalline, electron-dense inclusions 
are composed of the enzyme urate 
oxidase. (Courtesy of Daniel S. Friend.) 
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Figure 1 2-31 The structure of a plasmalogen. Plasmalogens are very 
abundant in the myelin sheaths that insulate the axons of nerve cells. They 
make up some 80-90% of the myelin membrane phospholipids. In 
addition to an ethanolamine head group and a long-chain fatty acid 
attached to the same glycerol phosphate backbone used for 
phospholipids, plasmalogens contain an unusual fatty alcohol that is 
attached through an ether linkage (bottom left). 

have small peroxisomes. But when some yeasts are grown on methanol they 
develop large peroxisomes that oxidize methanol; and when grown on fatty 
acids they develop large peroxisomes that break down fatty acids to acetyl CoA 
by (3 oxidation. 

Peroxisomes are also important in plants. Two types of plant peroxisomes 
have been studied extensively. One is present in leaves, where it participates in 
photorespiration (discussed in Chapter 14) (Figure 12-32A). The other type of 
peroxisome is present in germinating seeds, where it converts the fatty acids 
stored in seed lipids into the sugars needed for the growth of the young plant. 
Because this conversion of fats to sugars is accomplished by a series of reactions 
known as the glyoxylate cycle, these peroxisomes are also called glyoxysomes 
(Figure 12-32B). In the glyoxylate cycle, two molecules of acetyl CoA produced 
by fatty acid breakdown in the peroxisome are used to make succinic acid, which 
then leaves the peroxisome and is converted into glucose in the cytosol. The gly- 
oxylate cycle does not occur in animal cells, and animals are therefore unable to 
convert the fatty acids in fats into carbohydrates. 
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A Short Signal Sequence Directs the Import of Proteins into 
Peroxisomes 

A specific sequence of three amino acids (Ser-Lys-Leu) located at the C-termi- 
nus of many peroxisomal proteins functions as an import signal (see Table 12-3, 
p. 702). Other peroxisomal proteins contain a signal sequence near the N-termi- 
nus. If either sequence is attached to a cytosolic protein, the protein is imported 
into peroxisomes. The import process is still poorly understood, although it is 
known to involve both soluble receptor proteins in the cytosol, which recognize 
the targeting signals, and docking proteins on the cytosolic surface of the perox- 
isome. At least 23 distinct proteins, called peroxins, participate in the import 
process, which is driven by ATP hydrolysis. A complex of at least six different per- 
oxins forms a membrane translocator. Because even oligomeric proteins do not 
have to unfold to be imported into peroxisomes, the mechanism differs from 
that used by mitochondria and chloroplasts. At least one soluble import recep- 
tor, the peroxin Pex5, accompanies its cargo all the way into peroxisomes and, 
after cargo release, cycles back to the cytosol. These aspects of peroxisomal pro- 
tein import resemble protein transport into the nucleus. 



Figure 12-32 Electron micrographs of 
two types of peroxisomes found in 
plant cells. (A) A peroxisome with a 
paracrystalline core in a tobacco leaf 
mesophyll cell. Its close association with 
chloroplasts is thought to facilitate the 
exchange of materials between these 
organelles during photorespiration. 
(B) Peroxisomes in a fat-storing cotyledon 
cell of a tomato seed 4 days after 
germination. Here the peroxisomes 
(glyoxysomes) are associated with the 
lipid droplets that store fat, reflecting 
their central role in fat mobilization and 
gluconeogenesis during seed 
germination. (A, from S.E. Frederick and 
E.H. Newcomb, J. Cell Biol. 43:343-353, 
1 969. With permission from The 
Rockefeller Press; B, from W.P. Wergin, 
P.J. Gruber and E.H. Newcomb, 
J. Ultrastruct. Res. 30:533-557, 1970. With 
permission from Academic Press.) 
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The importance of this import process and of peroxisomes is demonstrated 
by the inherited human disease Zellweger syndrome, in which a defect in import- 
ing proteins into peroxisomes leads to a profound peroxisomal deficiency. These 
individuals, whose cells contain "empty" peroxisomes, have severe abnormalities 
in their brain, liver, and kidneys, and they die soon after birth. A mutation in the 
gene encoding peroxin Pex2, a peroxisomal integral membrane protein involved 
in protein import, causes one form of the disease. A defective receptor for the N- 
terminal import signal causes a milder inherited peroxisomal disease. 

It has long been debated whether new peroxisomes arise from preexisting 
ones by organelle growth and fission — and therefore replicate in an autonomous 
way as mentioned earlier for mitochondria and plastids — or whether they derive 
as a specialized compartment from the endoplasmic reticulum (ER). Aspects of 
both views may be true (Figure 12-33). Most peroxisomal membrane proteins 
are made in the cytosol and insert into the membrane of preexisting ones, yet 
others are first integrated into the ER membrane from where they may bud in 
specialized peroxisomal precursor vesicles. New precursor vesicles may then 
fuse with one another and begin importing additional peroxisomal proteins 
using their own protein import machinery to grow into mature peroxisomes, 
which can enter into a cycle of growth and fission. 



Figure 12-33 A model that explains how 
peroxisomes proliferate and how new 
peroxisomes may arise. Peroxisome 
precursor vesicles are thought to bud 
from the ER. The machinery that drives 
the budding reaction and that selects 
only peroxisomal proteins for packaging 
into these vesicles — and not ER proteins 
or proteins destined for other locations in 
the cell — is not known. Peroxisome 
precursor vesicles may then fuse with 
one another or with preexisting 
peroxisomes. The peroxisome membrane 
contains import receptor proteins. 
Cytosolic ribosomes synthesize 
peroxisomal proteins, including new 
copies of the import receptor, and then 
import them into the organelle. 
Presumably, the lipids required for 
growth are also imported, although some 
may derive directly from the ER in the 
membrane of peroxisome precursor 
vesicles. (We discuss later the transport of 
lipids made in the ER through the cytosol 
to other organelles.) 



Summary 

Peroxisomes are specialized for carrying out oxidation reactions using molecular oxy- 
gen. They generate hydrogen peroxide, which they employ for oxidative purposes — and 
contain catalase to destroy the excess. Like mitochondria and plastids, peroxisomes are 
self -replicating organelles. Because they do not contain DNA or ribosomes, however, all 
of their proteins are encoded in the cell nucleus. Some of these proteins are conveyed to 
peroxisomes via the ER, but most are synthesized in the cytosol. A specific sequence of 
three amino acids near the C-terminus of many of the cytosolic proteins functions as a 
peroxisomal import signal. The mechanism of protein import differs from that of 
mitochondria and chloroplasts, in that even oligomeric proteins are imported from 
the cytosol without unfolding. 



THE ENDOPLASMIC RETICULUM 

All eucaryotic cells have an endoplasmic reticulum (ER). Its membrane typi- 
cally constitutes more than half of the total membrane of an average animal cell 
(see Table 12-2, p. 697). The ER is organized into a netlike labyrinth of branching 
tubules and flattened sacs that extends throughout the cytosol (Figure 12-34). 
The tubules and sacs interconnect, and their membrane is continuous with the 
outer nuclear membrane. Thus, the ER and nuclear membranes form a contin- 
uous sheet enclosing a single internal space, called the ER lumen or the ER cis- 
ternal space, which often occupies more than 10% of the total cell volume (see 
Table 12-1, p. 697). 
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Figure 12-34 Fluorescent micrographs 
of the endoplasmic reticulum. <TCGT> 
(A) Part of the ER network in a cultured 
mammalian cell, stained with an 
antibody that binds to a protein retained 
in the ER.The ER extends as a network 
throughout the entire cytosol, so that all 
regions of the cytosol are close to some 
portion of the ER membrane. (B) Part of 
an ER network in a living plant cell that 
was genetically engineered to express a 
fluorescent protein in the ER. (A, courtesy 
of Hugh Pelham; B, courtesy of Petra 
Boevink and Chris Hawes.) 
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The ER has a central role in both lipid and protein biosynthesis, and it also 
serves as an intracellular Ca 2+ store that is used in many cell signaling responses 
(discussed in Chapter 15). The ER membrane is the site of production of all the 
transmembrane proteins and lipids for most of the cell's organelles, including 
the ER itself, the Golgi apparatus, lysosomes, endosomes, secretory vesicles, and 
the plasma membrane. The ER membrane also makes most of the lipids for 
mitochondrial and peroxisomal membranes. In addition, almost all of the pro- 
teins that will be secreted to the cell exterior — plus those destined for the lumen 
of the ER, Golgi apparatus, or lysosomes — are initially delivered to the ER lumen. 



The ER Is Structurally and Functionally Diverse 

While the various functions of the ER are essential to every cell, their relative 
importance varies greatly between individual cell types. To meet different func- 
tional demands, distinct regions of the ER become highly specialized. We 
observe such functional specialization as dramatic changes in ER structure, and 
different cell types can therefore possess characteristically different types of ER 
membrane. One of the most remarkable ER specializations is the rough ER. 

Mammalian cells begin to import most proteins into the ER before complete 
synthesis of the polypeptide chain — that is, import is a co-translational process 
(Figure 12-35A). In contrast, the import of proteins into mitochondria, chloro- 
plasts, nuclei, and peroxisomes is a post-translational process (Figure 12-35B). 
In co-translational transport, the ribosome that is synthesizing the protein is 
attached directly to the ER membrane, enabling one end of the protein to be 
translocated into the ER while the rest of the polypeptide chain is being assem- 
bled. These membrane-bound ribosomes coat the surface of the ER, creating 
regions termed rough endoplasmic reticulum, or rough ER (Figure 12-36A). 

Regions of ER that lack bound ribosomes are called smooth endoplasmic 
reticulum, or smooth ER. Most cells have scanty regions of smooth ER, and the 
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Figure 12-35 Co-translational and post- 
translational protein translocation. 

(A) Ribosomes bind to the ER membrane 
during co-translational translocation. 

(B) By contrast, ribosomes complete the 
synthesis of a protein and release it prior 
to post-translational translocation. 
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Figure 12-36 The rough and smooth ER. (A) An electron micrograph of 
the rough ER in a pancreatic exocrine cell that makes and secretes large 
amounts of digestive enzymes every day. The cytosol is filled with closely 
packed sheets of ER membrane studded with ribosomes. At the top left is 
a portion of the nucleus and its nuclear envelope; note that the outer 
nuclear membrane, which is continuous with the ER, is also studded with 
ribosomes. (B) Abundant smooth ER in a steroid-hormone-secreting cell. 
This electron micrograph is of a testosterone-secreting Leydig cell in the 
human testis. (C) A three-dimensional reconstruction of a region of 
smooth ER and rough ER in a liver cell. The rough ER forms oriented stacks 
of flattened cisternae, each having a lumenal space 20-30 nm wide. The 
smooth ER membrane is connected to these cisternae and forms a fine 
network of tubules 30-60 nm in diameter. (A, courtesy of Lelio Orci; 
B, courtesy of Daniel S. Friend; C, after R.V. Krstic, Ultrastructure of the 
Mammalian Cell. New York: Springer-Verlag, 1979.) 



ER is often partly smooth and partly rough. Areas of smooth ER from which 
transport vesicles carrying newly synthesized proteins and lipids bud off for 
transport to the Golgi apparatus are called transitional ER. In certain specialized 
cells, the smooth ER is abundant and has additional functions. It is prominent, 
for example, in cells that specialize in lipid metabolism, such as cells that syn- 
thesize steroid hormones from cholesterol; the expanded smooth ER accommo- 
dates the enzymes that make cholesterol and modify it to form the hormones 
(Figure 12-36B). 

The main cell type in the liver, the hepatocyte, also has abundant smooth ER. 
It is the principal site of production of lipoprotein particles, which carry lipids 
via the bloodstream to other parts of the body. The enzymes that synthesize the 
lipid components of the particles are located in the membrane of the smooth 
ER, which also contains enzymes that catalyze a series of reactions to detoxify 
both lipid-soluble drugs and various harmful compounds produced by 
metabolism. The most extensively studied of these detoxification reactions are 
carried out by the cytochrome P450 family of enzymes, which catalyze a series of 
reactions in which water- insoluble drugs or metabolites that would otherwise 
accumulate to toxic levels in cell membranes are rendered sufficiently water-sol- 
uble to leave the cell and be excreted in the urine. Because the rough ER alone 
cannot house enough of these and other necessary enzymes, a major portion of 
the membrane in a hepatocyte normally consists of smooth ER (Figure 12-36C; 
see Table 12-2). 

Another crucially important function of the ER in most eucaryotic cells is to 
sequester Ca 2+ from the cytosol. The release of Ca 2+ into the cytosol from the ER, 
and its subsequent reuptake, occurs in many rapid responses to extracellular 
signals, as discussed in Chapter 15. A Ca 2+ pump transports Ca 2+ from the 
cytosol into the ER lumen. A high concentration of Ca 2+ -binding proteins in the 
ER facilitates Ca 2+ storage. In some cell types, and perhaps in most, specific 
regions of the ER are specialized for Ca 2+ storage. Muscle cells have an abun- 
dant, modified smooth ER, called the sarcoplasmic reticulum. The release and 
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reuptake of Ca 2+ by the sarcoplasmic reticulum trigger myofibril contraction 
and relaxation, respectively, during each round of muscle contraction (dis- 
cussed in Chapter 16). 

To study the functions and biochemistry of the ER, it is necessary to isolate 
it. This may seem to be a hopeless task because the ER is intricately interleaved 
with other components of the cytoplasm. Fortunately, when tissues or cells are 
disrupted by homogenization, the ER breaks into fragments, which reseal to 
form small (-100-200 nm in diameter) closed vesicles called microsomes. 
Microsomes are relatively easy to purify. To the biochemist, microsomes repre- 
sent small authentic versions of the ER, still capable of protein translocation, 
protein glycosylation, Ca 2+ uptake and release, and lipid synthesis. Microsomes 
derived from rough ER are studded with ribosomes and are called rough micro- 
somes. The ribosomes are always found on the outside surface, so the interior of 
the microsome is biochemically equivalent to the lumenal space of the ER (Fig- 
ure 12-37A). 

Many vesicles similar in size to rough microsomes, but lacking attached 
ribosomes, are also found in cell homogenates. Such smooth microsomes are 
derived in part from smooth portions of the ER and in part from vesiculated 
fragments of the plasma membrane, Golgi apparatus, endosomes, and mito- 
chondria (the ratio depending on the tissue). Thus, whereas rough microsomes 
are clearly derived from rough portions of ER, it is not easy to determine the ori- 
gins of "smooth microsomes" prepared from disrupted cells. The smooth micro- 
somes prepared from liver or muscle cells are an exception. Because of the 
unusually large quantities of smooth ER or sarcoplasmic reticulum, respectively, 
most of the smooth microsomes in homogenates of these tissues are derived 
from the smooth ER. The ribosomes attached to rough microsomes make them 
more dense than smooth microsomes. As a result, we can use equilibrium cen- 
trifugation to separate the rough and smooth microsomes (Figure 12-37B). 
Microsomes have been invaluable in elucidating the molecular aspects of ER 
function, as we discuss next. 



Figure 1 2-37 The isolation of purified 
rough and smooth microsomes from 

the ER. (A) A thin section electron 
micrograph of the purified rough ER 
fraction shows an abundance of 
ribosome-studded vesicles. (B) When 
sedimented to equilibrium through a 
gradient of sucrose, the two types of 
microsomes separate from each other on 
the basis of their different densities. 
(A, courtesy of George Palade.) 



Signal Sequences Were First Discovered in Proteins Imported into 
the Rough ER 

The ER captures selected proteins from the cytosol as they are being synthe- 
sized. These proteins are of two types: transmembrane proteins, which are only 
partly translocated across the ER membrane and become embedded in it, and 
water-soluble proteins, which are fully translocated across the ER membrane 
and are released into the ER lumen. Some of the transmembrane proteins func- 
tion in the ER, but many are destined to reside in the plasma membrane or the 
membrane of another organelle. The water-soluble proteins are destined either 
for secretion or for residence in the lumen of an organelle. All of these proteins, 
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Figure 12-38 The signal hypothesis. 

A simplified view of protein translocation 
across the ER membrane, as originally 
proposed. When the ER signal sequence 
emerges from the ribosome, it directs the 
ribosome to a translocator on the ER 
membrane that forms a pore in the 
membrane through which the 
polypeptide is translocated. A signal 
peptidase is closely associated with the 
translocator and clips off the signal 
sequence during translation, and the 
mature protein is released into the lumen 
of the ER immediately after synthesis. The 
translocator is closed until the ribosome 
has bound, so that the permeability 
barrier of the ER membrane is maintained 
at all times. 



regardless of their subsequent fate, are directed to the ER membrane by an ER 
signal sequence, which initiates their translocation by a common mechanism. 
Signal sequences (and the signal sequence strategy of protein sorting) were 
first discovered in the early 1970s in secreted proteins that are translocated 
across the ER membrane as a first step toward their eventual discharge from the 
cell. In the key experiment, the mRNA encoding a secreted protein was trans- 
lated by ribosomes in vitro. When microsomes were omitted from this cell-free 
system, the protein synthesized was slightly larger than the normal secreted pro- 
tein, the extra length being the N-terminal leader peptide. In the presence of 
microsomes derived from the rough ER, however, a protein of the correct size 
was produced. According to the signal hypothesis, the leader is a signal sequence 
that directs the secreted protein to the ER membrane and is then cleaved off by 
a signal peptidase in the ER membrane before the polypeptide chain has been 
completed (Figure 12-38) . Cell-free systems in which proteins are imported into 
microsomes have provided powerful procedures for identifying, purifying, and 
studying the various components of the molecular machinery responsible for 
the ER import process. 



A Signal-Recognition Particle (SRP) Directs ER Signal Sequences 
to a Specific Receptor in the Rough ER Membrane 

The ER signal sequence is guided to the ER membrane by at least two compo- 
nents: a signal-recognition particle (SRP), which cycles between the ER mem- 
brane and the cytosol and binds to the signal sequence, and an SRP receptor in 
the ER membrane. The SRP is a complex particle, consisting of six different 
polypeptide chains bound to a single small RNA molecule (Figure 12-39). SRP 
and its receptor are found in all cells, indicating that this protein-targeting 
mechanism arose early in evolution and has been conserved. 

ER signal sequences vary greatly in amino acid sequence, but each has 
eight or more nonpolar amino acids at its center (see Table 12-3, p. 702). How 
can the SRP bind specifically to so many different sequences? The answer has 
come from the crystal structure of the SRP protein, which shows that the sig- 
nal-sequence-binding site is a large hydrophobic pocket lined by methion- 
ines. Because methionines have unbranched, flexible side chains, the pocket 
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Figure 12-39 The signal-recognition particle (SRP). (A) A mammalian SRP is a rodlike complex containing six protein 
subunits and one RNA molecule (shown in red). The SRP RNA forms the backbone that links the domain of the SRP 
containing the signal sequence binding pocket to the domain responsible for pausing translation. The three-dimensional 
outline of the SRP (shown in gray) was determined by cryo-electron microscopy. Where known, crystal structures of 
individual SRP pieces are fitted into the envelope and shown as ribbon diagrams. A bound signal sequence is shown as a 
green helix. (B) SRP bound to the ribsosome visualized by cryo-electron microscopy. SRP binds to the large ribosomal 
subunit so that its signal sequence binding pocket is positioned near the nascent chain exit site and its translational pause 
domain is positioned at the interface between the ribosomal subunits, where it interferes with elongation factor binding. 
(Adapted from M. Halic et al., Nature 427:808-8 14, 2004. With permission from Macmillan Publishers Ltd.) 



is sufficiently plastic to accommodate hydrophobic signal sequences of differ- 
ent sequences, sizes, and shapes. 

The SRP is a rodlike structure that wraps around the large ribosomal sub- 
unit, with one end binding to the ER signal sequence as it emerges as part of the 
newly made polypeptide chain from the ribosome; the other end blocks the 
elongation factor binding site at the interface between the large and small ribo- 
somal subunits (Figure 12-39). This block halts protein synthesis as soon as the 
signal peptide has emerged from the ribosome. The transient pause presumably 
gives the ribosome enough time to bind to the ER membrane before completion 
of the polypeptide chain, thereby ensuring that the protein is not released into 
the cytosol. This safety device may be especially important for secreted and lyso- 
somal hydrolases that could wreak havoc in the cytosol; cells that secrete large 
amounts of hydrolases, however, take the added precaution of having high con- 
centrations of hydrolase inhibitors in their cytosol. The pause also ensures that 
large portions of a protein that could fold into a compact structure are not made 
before reaching the translocator in the ER membrane. Thus, in contrast to the 
post-translational import of proteins into mitochondria and chloroplasts, chap- 
erone proteins are not required to keep the protein unfolded. 

Once formed, the SRP-ribosome complex binds to the SRP receptor, which 
is an integral membrane protein complex embedded in the rough ER mem- 
brane. This interaction brings the SRP-ribosome complex to a protein translo- 
cator. The SRP and SRP receptor are then released, and the translocator transfers 
the growing polypeptide chain across the membrane (Figure 12-40). 

This co-translational transfer process creates two spatially separate popula- 
tions of ribosomes in the cytosol. Membrane-bound ribosomes, attached to the 
cytosolic side of the ER membrane, are engaged in the synthesis of proteins that 
are being concurrently translocated into the ER. Free ribosomes, unattached to 
any membrane, synthesize all other proteins encoded by the nuclear genome. 
Membrane-bound and free ribosomes are structurally and functionally identi- 
cal. They differ only in the proteins they are making at any given time. 

Since many ribosomes can bind to a single mRNA molecule, a polyribosome 
is usually formed, which becomes attached to the ER membrane, directed there 
by the signal sequences on multiple growing polypeptide chains (Figure 
12-41A). The individual ribosomes associated with such an mRNA molecule can 
return to the cytosol when they finish translation and intermix with the pool of 
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Figure 1 2-40 How ER signal sequences and SRP direct ribosomes to the ER membrane. <TTCC> The SRP and its receptor are thought to act 
in concert. The SRP binds to both the exposed ER signal sequence and the ribosome, thereby inducing a pause in translation. The SRP receptor 
in the ER membrane, which is composed of two different polypeptide chains, binds the SRP-ribosome complex and directs it to the 
translocator. In a poorly understood reaction, the SRP and SRP receptor are then released, leaving the ribosome bound to the translocator in 
the ER membrane. The translocator then inserts the polypeptide chain into the membrane and transfers it across the lipid bilayer. Because one 
of the SRP proteins and both chains of the SRP receptor contain GTP-binding domains, it is thought that conformational changes that occur 
during cycles of GTP binding and hydrolysis (discussed in Chapter 1 5) ensure that SRP release occurs only after the ribosome has become 
properly engaged with the translocator in the ER membrane. The translocator is closed until the ribosome has bound, so that the permeability 
barrier of the ER membrane is maintained at all times. 
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Figure 1 2-41 Free and membrane-bound ribosomes. (A) A common pool of ribosomes synthesizes the proteins that stay in 
the cytosol and those that are transported into the ER. The ER signal sequence on a newly formed polypeptide chain binds to 
SRP, which directs the translating ribosome to the ER membrane. The mRNA molecule remains permanently bound to the ER 
as part of a polyribosome, while the ribosomes that move along it are recycled; at the end of each round of protein synthesis 
the ribosomal subunits are released and rejoin the common pool in the cytosol. (B) A thin-section electron micrograph of 
polyribosomes attached to the ER membrane. The plane of section in some places cuts through the ER roughly parallel to the 
membrane, giving a face-on view of the rosettelike pattern of the polyribosomes. (B, courtesy of George Palade.) 
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free ribosomes. The mRNA itself, however, remains attached to the ER mem- 
brane by a changing population of ribosomes, each transiently held at the mem- 
brane by the translocator (Figure 12-41B). 



The Polypeptide Chain Passes Through an Aqueous Pore in the 
Translocator 

It has long been debated whether polypeptide chains are transferred across the 
ER membrane in direct contact with the lipid bilayer or through a pore in a pro- 
tein translocator. The debate ended with the identification of the translocator, 
which was shown to form a water-filled pore in the membrane through which 
the polypeptide chain passes. The core of the translocator, called the Sec61 
complex, is built from three subunits that are highly conserved from bacteria to 
eucaryotic cells. Recently, the structure of the Sec61 complex was determined by 
x-ray crystallography. The structure suggests that a helices contributed by the 
largest subsunit surround a central pore through which the polypeptide chain 
may traverse the membrane (Figure 12-42). The pore is gated by a short helix 
that is thought to keep the translocator closed when it is idle and to move aside 
when it is engaged in passing a polypeptide chain. According to this view, the 
pore is a dynamic gated structure that opens only transiently when a polypep- 
tide chain traverses the membrane. In an idle translocator it is important to keep 
the pore closed, so that the membrane remains impermeable to ions, such as 
Ca 2+ , which otherwise would leak out of the ER. 

The structure of the Sec61 complex suggests that the pore can also open 
along a seam on its side. This opening allows a translocating peptide chain lat- 
eral access into the hydrophobic core of the membrane, a process that is impor- 
tant both for the release of a cleaved signal peptide into the membrane (see Fig- 
ure 12-38) and for the integration of membrane proteins into the bilayer, as we 
discuss later. 

In eucaryotic cells, four Sec61 complexes form a large translocator assembly 
that can be visualized on the ribosomes after detergent solubilization of ER 
membranes (Figure 12-43). It is likely that not all four Sec61 complexes in the 
eucaryotic translocator participate in protein translocation directly. Some of 
them may be inactive, providing binding sites for the ribosome and for acces- 
sory proteins that help polypeptide chains fold as they enter the ER. According 
to one view, the bound ribosome forms a tight seal with the translocator, with 
the space inside the ribosome continuous with the lumen of the ER so that no 
molecules can escape from the ER. Alternatively, the structure of the Sec61 com- 
plex suggests that the pore in the translocator could form a tight-fitting 
diaphragm around the translocating chain that prevents the escape of other 
molecules. 
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Figure 12-42 Structure of the Sec61 
complex. (A) A top view of the structure of 
the Sec61 complex of the archae 
Methanococcus jannaschii seen from the 
cytosol side of the membrane. The Sec61a 
subunit is shown in blue and red; the two 
smaller (3- and y-subunits are shown in 
gray. The yellow short helix is thought to 
form a plug that seals the pore when the 
translocator is closed. To open, the 
complex rearranges itself to move the 
plug helix out of the way. It is thought that 
the pore of the Sec61 complex can also 
open sideways at a seam. (B) Model of the 
closed and open states of the translocator 
are shown, illustrating how a signal 
sequence could be released into the 
membrane after opening of the seam. 
(A, from B. van der Berg et al., Nature 
427:36-44, 2004. With permission from 
Macmillan Publishers Ltd.) 
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Figure 12-43 A ribosome bound to the 
eucaryotic protein translocator. 

(A) A reconstruction of the complex from 
electron microscopic images viewed from 
the side. (B) A view of the translocator 
seen from the ER lumen. The translocator 
is thought to contain four copies of the 
Sec61 complex. (C) A schematic drawing 
of a membrane-bound ribosome 
attached to the translocator with the 
tunnel in the large ribosomal subunit, 
through which the growing polypeptide 
chain exits from the ribosome (see Figure 
6-70). (D) A schematic model of how four 
Sec61 complexes and various accessory 
proteins may be arranged in the 
translocator. The red dots indicate 
ribosome attachment sites that are seen 
in the electron microscopic 
reconstructions and the red circle shows 
the position of the end of the ribosome 
exit tunnel. It is not known which of the 
fourSec61 complexes are actively 
participating in the protein translocation 
event. Domains of the accessory proteins 
extend across the membrane and form 
the lumenal bulge seen in the side view 
in (A). (A, B, and C, adapted from 
J.F. Menetret et al., J. Mol. Biol. 
348:445-457, 2005. With permission 
from Academic Press.) 



Translocation Across the ER Membrane Does Not Always Require 
Ongoing Polypeptide Chain Elongation 

As we have seen, translocation of proteins into mitochondria, chloroplasts, and 
peroxisomes occurs post-translationally, after the protein has been made and 
released into the cytosol, whereas translocation across the ER membrane usu- 
ally occurs during translation (co-translationally). This explains why ribosomes 
are bound to the ER but usually not to other organelles. 

Some completely synthesized proteins, however, are imported into the ER, 
demonstrating that translocation does not always require ongoing translation. 
Post-translational protein translocation is especially common across the yeast ER 
membrane and the bacterial plasma membrane (which is thought to be evolu- 
tionarily related to the ER; see Figure 12-4). To function in post-translational 
translocation, the translocator needs accessory proteins that feed the polypep- 
tide chain into the pore and drive translocation (Figure 12-44). In bacteria, a 
translocation motor protein, the SecA ATPase, attaches to the cytosolic side of the 
translocator, where it undergoes cyclic conformational changes driven by ATP 
hydrolysis. Each time an ATP is hydrolyzed, a portion of the SecA protein inserts 
into the pore of the translocator, pushing a short segment of the passenger pro- 
tein with it. As a result of this ratchet mechanism, the SecA protein progressively 
pushes the polypeptide chain of the transported protein across the membrane. 

Eucaryotic cells use a different set of accessory proteins that associate with 
the Sec61 complex. These proteins span the ER membrane and use a small 
domain on the lumenal side of the ER membrane to deposit a Hsp70-like chap- 
erone protein (called BiP, for Ending protein) onto the polypeptide chain as it 
emerges from the pore into the ER lumen. Cycles of BiP binding and release 
drive unidirectional translocation, as described earlier for the mitochondrial 
Hsp70 proteins that pull proteins across mitochondrial membranes. 

Proteins that are transported into the ER by a post-translational mechanism 
are first released into the cytosol, where they bind to chaperone proteins to pre- 
vent folding, as discussed earlier for proteins destined for mitochondria and 
chloroplasts. 
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Figure 12-44 Three ways in which protein translocation can be driven through structurally similar translocators. 

(A) Co-translational translocation. The ribosome is brought to the membrane by the SRP and SRP receptor and engages with 
the Sec61 protein translocator.The growing polypeptide chain is threaded across the membrane as it is made. No additional 
energy is needed, as the only path available to the growing chain is to cross the membrane. (B) Post-translational 
translocation in eucaryotic cells. An additional complex composed of the Sec62, Sec63, Sec71, and Sec72 proteins is attached 
to the Sec61 translocator and deposits BiP molecules onto the translocating chain as it emerges into the ER lumen. 
ATP-driven cycles of BiP binding and release pull the protein into the lumen, a mechanism that closely resembles the 
mechanism of mitochondrial import in Figure 1 2-26. (C) Post-translational translocation in bacteria. The completed 
polypeptide chain is fed from the cytosolic side into a translocator in the plasma membrane by the SecA ATPase. 
ATP-hydrolysis-driven conformational changes drive a pistonlike motion in SecA, each cycle pushing about 20 amino acids of 
the protein chain through the pore of the translocator.The Sec pathway used for protein translocation across the thylakoid 
membrane in chloroplasts uses a similar mechanism (see Figure 1 2-29B). 

Whereas the Sec61 translocator, SRP, and SRP receptor are found in all organisms, SecA is found exclusively in bacteria, and 
the Sec62, Sec63, Sec71 , and Sec72 proteins are found exclusively in eucaryotic cells. (Adapted from P. Walter and 
A.E. Johnson, Annu. Rev. Cell Biol. 1 0:87-1 1 9, 1 994. With permission from Annual Reviews.) 



In Single-Pass Transmembrane Proteins, a Single Internal ER 
Signal Sequence Remains in the Lipid Bilayer as a Membrane- 
Spanning a Helix 

The signal sequence in the growing polypeptide chain is thought to trigger the 
opening of the pore in the protein translocator: after the signal sequence is 
released from the SRP and the growing chain has reached a sufficient length, the 
signal sequence binds to a specific site inside the pore itself, thereby opening the 
pore. An ER signal sequence is therefore recognized twice: first by an SRP in the 
cytosol and then by a binding site in the pore of the protein translocator, where 
it serves as a start-transfer signal (or start- transfer peptide) that opens the pore 
(illustrated for an ER soluble protein in Figure 12-45). Dual recognition may 
help to ensure that only appropriate proteins enter the lumen of the ER. 

While bound in the translocation pore, a signal sequence is in contact not 
only with the Sec61 complex, which forms the walls of the pore, but also with 
the hydrophobic lipid core of the membrane. This was shown in chemical cross- 
linking experiments in which signal sequences and the hydrocarbon chains of 
lipids could be covalently linked together. When the nascent polypeptide chain 
grows long enough, an ER signal peptidase cleaves off signal sequences and 
releases them from the pore into the membrane, where they are rapidly 
degraded to amino acids by other proteases in the ER membrane. To release the 
signal sequence into the membrane, the translocator has to open laterally. The 
translocator is therefore gated in two directions: it can open to form a pore across 
the membrane to let the hydrophilic portions of proteins cross the lipid bilayer, 
and it can open laterally within the membrane to let hydrophobic portions of 
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proteins partition into the lipid bilayer. Lateral gating of the pore is an essential 
step during the integration of membrane proteins. 

The integration of membrane proteins requires that some parts of the 
polypeptide chain be translocated across the lipid bilayer whereas others are not. 
Despite this additional complexity, all modes of insertion of membrane proteins 
are variants of the sequence of events just described for transferring a soluble 
protein into the lumen of the ER. We begin by describing the three ways in which 
single-pass transmembrane proteins (see Figure 10-19) become inserted into 
the ER. 

In the simplest case, an N-terminal signal sequence initiates translocation, 
just as for a soluble protein, but an additional hydrophobic segment in the 
polypeptide chain stops the transfer process before the entire polypeptide chain 
is translocated. This stop-transfer signal anchors the protein in the membrane 
after the ER signal sequence (the start-transfer signal) has been released from 
the translocator and has been cleaved off (Figure 12-46). The lateral gating 
mechanism transfers the stop -transfer sequence into the bilayer, and it remains 
there as a single a-helical membrane-spanning segment, with the N-terminus of 
the protein on the lumenal side of the membrane and the C-terminus on the 
cytosolic side. 

In the other two cases, the signal sequence is internal, rather than at the N-ter- 
minal end of the protein. Like the N-terminal ER signal sequences, the SRP also 



Figure 12-45 A model to explain how a 
soluble protein is translocated across 
the ER membrane. On binding an ER 
signal sequence (which acts as a start- 
transfer signal), the translocator opens its 
pore, allowing the transfer of the 
polypeptide chain across the lipid bilayer 
as a loop. After the protein has been 
completely translocated, the pore closes, 
but the translocator now opens laterally 
within the lipid bilayer, allowing the 
hydrophobic signal sequence to diffuse 
into the bilayer, where it is rapidly 
degraded. (In this figure and the three 
figures that follow, the ribosomes have 
been omitted for clarity.) 
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Figure 12-46 How a single-pass 
transmembrane protein with a cleaved 
ER signal sequence is integrated into 
the ER membrane. In this protein the 
co-translational translocation process is 
initiated by an N-terminal ER signal 
sequence (red) that functions as a 
start-transfer signal, as in Figure 1 2-45. In 
addition to this start-transfer sequence, 
however, the protein also contains a 
stop-transfer sequence (orange). When 
the stop-transfer sequence enters the 
translocator and interacts with a binding 
site, the translocator changes its 
conformation and discharges the protein 
laterally into the lipid bilayer. 
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binds to an internal signal sequence. The SRP brings the ribosome making the 
protein to the ER membrane and serves as a start-transfer signal that initiates the 
protein's translocation. After release from the translocator, the internal start-trans- 
fer sequence remains in the lipid bilayer as a single membrane-spanning a helix. 
Internal start- transfer sequences can bind to the translocation apparatus in 
either of two orientations; this in turn determines which protein segment (the 
one preceding or the one following the start- transfer sequence) is moved across 
the membrane into the ER lumen. In one case, the resulting membrane protein 
has its C-terminus on the lumenal side (Pathway A in Figure 12-47), while in the 
other, it has its N-terminus on the lumenal side (Pathway B in Figure 12-47). The 
orientation of the start-transfer sequence depends on the distribution of nearby 
charged amino acids, as described in the figure legend. 



Combinations of Start-Transfer and Stop-Transfer Signals 
Determine the Topology of Multipass Transmembrane Proteins 

In multipass transmembrane proteins, the polypeptide chain passes back 
and forth repeatedly across the lipid bilayer (see Figure 10-19). It is thought 
that an internal signal sequence serves as a start-transfer signal in these pro- 
teins to initiate translocation, which continues until the translocator encoun- 
ters a stop-transfer sequence. In double-pass transmembrane proteins, for 
example, the polypeptide can then be released into the bilayer (Figure 12-48). 
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Figure 12-47 Integration of a single- 
pass transmembrane protein with an 
internal signal sequence into the ER 
membrane. An internal ER signal 
sequence that functions as a start- 
transfer signal can bind to the 
translocator in one of two different ways, 
leading to a membrane protein that has 
either its C-terminus (pathway A) or its 
N-terminus (pathway B) in the ER lumen. 
Proteins are directed into either pathway 
by features in the polypeptide chain 
flanking the internal start transfer 
sequence: if there are more positively 
charged amino acids immediately 
preceding the hydrophobic core of the 
start-transfer sequence than there are 
following it, the membrane protein is 
inserted into the translocator in the 
orientation shown in pathway A, whereas 
if there are more positively charged 
amino acids immediately following the 
hydrophobic core of the start-transfer 
sequence than there are preceding it, the 
membrane protein is inserted into the 
translocator in the orientation shown in 
pathway B. Because translocation cannot 
start before a start-transfer sequence 
appears outside the ribosome, 
translocation of the N-terminal portion of 
the protein shown in (B) can occur only 
after this portion has been fully 
synthesized. 

Note that there are two ways to insert a 
single-pass membrane-spanning protein 
whose N-terminus is located in the ER 
lumen: that shown in Figure 1 2-46 and 
that shown in (B) here. 



THE ENDOPLASMIC RETICULUM 



735 



stop-transfer 
sequence 




ER LUMEN 



hydrophobic 
stop-transfer- 
peptide- 
binding site 



hydrophobic 
start-transfer- 
peptide- 
binding site 



translocator protein 



Figure 12-48 Integration of a double- 
pass transmembrane protein with an 
internal signal sequence into the ER 
membrane. In this protein, an internal ER 
signal sequence acts as a start-transfer 
signal (as in Figure 12-47) and initiates 
the transfer of the C-terminal part of the 
protein. At some point after a stop- 
transfer sequence has entered the 
translocator, the translocator discharges 
the sequence laterally into the 
membrane. 



mature transmembrane protein 
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In more complex multipass proteins, in which many hydrophobic a helices span 
the bilayer, a second start- transfer sequence reinitiates translocation further 
down the polypeptide chain until the next stop-transfer sequence causes 
polypeptide release, and so on for subsequent start-transfer and stop-transfer 
sequences (Figure 12-49). 

Whether a given hydrophobic signal sequence functions as a start- transfer 
or stop-transfer sequence must depend on its location in a polypeptide chain, 
since its function can be switched by changing its location in the protein by 
using recombinant DNA techniques. Thus, the distinction between start-trans- 
fer and stop -transfer sequences results mostly from their relative order in the 
growing polypeptide chain. It seems that the SRP begins scanning an unfolded 
polypeptide chain for hydrophobic segments at its N-terminus and proceeds 
toward the C-terminus, in the direction that the protein is synthesized. By rec- 
ognizing the first appropriate hydrophobic segment to emerge from the ribo- 
some, the SRP sets the "reading frame" for membrane integration: after the SRP 
initiates translocation, the translocator recognizes the next appropriate 
hydrophobic segment in the direction of transfer as a stop-transfer sequence, 
causing the region of the polypeptide chain in between to be threaded across the 
membrane. A similar scanning process continues until all of the hydrophobic 
regions in the protein have been inserted into the membrane. 
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Figure 12-49 The insertion of the 
multipass membrane protein rhodopsin 
into the ER membrane. Rhodopsin is the 
light-sensitive protein in rod 
photoreceptor cells in the mammalian 
retina (discussed in Chapter 15). 

(A) A hydrophobicity plot identifies seven 
short hydrophobic regions in rhodopsin. 

(B) The hydrophobic region nearest the 
N-terminus serves as a start-transfer 
sequence that causes the preceding 
N-terminal portion of the protein to pass 
across the ER membrane. Subsequent 
hydrophobic sequences function in 
alternation as start-transfer and stop- 
transfer sequences. (C) The final 
integrated rhodopsin has its N-terminus 
located in the ER lumen and its 
C-terminus located in the cytosol.The 
blue hexagons represent covalently 
attached oligosaccharides. Arrows 
indicate the paired start and stop signals 
inserted into the translocator. 
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Because membrane proteins are always inserted from the cytosolic side of 
the ER in this programmed manner, all copies of the same polypeptide chain will 
have the same orientation in the lipid bilayer. This generates an asymmetrical 
ER membrane in which the protein domains exposed on one side are different 
from those exposed on the other side. This asymmetry is maintained during the 
many membrane budding and fusion events that transport the proteins made in 
the ER to other cell membranes (discussed in Chapter 13). Thus, the way in 
which a newly synthesized protein is inserted into the ER membrane determines 
the orientation of the protein in all of the other membranes as well. 

When proteins dissociate from a membrane and are then reconstituted into 
artificial lipid vesicles, a random mixture of right- side -out and inside-out pro- 
tein orientations usually results. Thus, the protein asymmetry observed in cell 
membranes seems not to be an inherent property of the protein, but instead 
results solely from the process by which proteins are inserted into the ER mem- 
brane from the cytosol. 



Translocated Polypeptide Chains Fold and Assemble in the 
Lumen of the Rough ER 

Many of the proteins in the lumen of the ER are in transit, en route to other des- 
tinations; others, however, normally reside there and are present at high con- 
centrations. These ER resident proteins contain an ER retention signal of four 
amino acids at their C-terminus that is responsible for retaining the protein in 
the ER (see Table 12-3; discussed in Chapter 13). Some of these proteins func- 
tion as catalysts that help the many proteins that are translocated into the ER to 
fold and assemble correctly. 

One important ER resident protein is protein disulfide isomerase (PDI), which 
catalyzes the oxidation of free sulfhydryl (SH) groups on cysteines to form disul- 
fide (S-S) bonds. Almost all cysteines in protein domains exposed to either the 
extracellular space or the lumen of organelles in the secretory and endocytic 
pathways are disulfide-bonded. By contrast, disulfide bonds form only very rarely 
in domains exposed to the cytosol because of the reducing environment there. 

Another ER resident protein is the chaperone protein BiP. We have already 
discussed how BiP pulls proteins post-translationally into the ER through the ER 
translocator. Like other chaperones, BiP recognizes incorrectly folded proteins, as 
well as protein subunits that have not yet assembled into their final oligomeric 
complexes. It does so by binding to exposed amino acid sequences that would 
normally be buried in the interior of correctly folded or assembled polypeptide 
chains. An example of a BiP-binding site is a stretch of alternating hydrophobic 
and hydrophilic amino acids that would normally be buried in a (3 sheet. The 
bound BiP both prevents the protein from aggregating and helps to keep it in the 
ER (and thus out of the Golgi apparatus and later parts of the secretory pathway). 
Like some other members of the Hsp70 family of proteins, which bind unfolded 
proteins and facilitate their import into mitochondria and chloroplasts, BiP 
hydrolyzes ATP to provide the energy needed to help proteins translocate post- 
translationally into the ER. It also helps these and other proteins fold. 



Most Proteins Synthesized in the Rough ER Are Glycosylated by 
the Addition of a Common A/-Linked Oligosaccharide 

The covalent addition of sugars to proteins is one of the major biosynthetic 
functions of the ER. About half of all eucaryotic proteins are glycosylated. Most 
of the soluble and membrane-bound proteins that are made in the ER — includ- 
ing those destined for transport to the Golgi apparatus, lysosomes, plasma 
membrane, or extracellular space — are glycoproteins. In contrast, very few pro- 
teins in the cytosol are glycosylated, and those that are carry a much simpler 
sugar modification, in which a single Af-acetylglucosamine group is added to a 
serine or threonine residue of the protein. 
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Figure 12-50 The asparagine-linked (N-\ inked) precursor oligosaccharide 
that is added to most proteins in the rough ER membrane. The five 
sugars in the gray box form the "core region" of this oligosaccharide. For 
many glycoproteins, only the core sugars survive the extensive 
oligosaccharide trimming process that takes place in the Golgi apparatus. 
Only asparagines in the sequences Asn-X-Ser and Asn-X-Thr (where X is any 
amino acid except proline) become glycosylated. These two sequences 
occur much less frequently in glycoproteins than in nonglycosylated 
cytosolic proteins; evidently there has been selective pressure against 
these sequences during protein evolution, presumably because 
glycosylation at too many sites would interfere with protein folding. 



An important advance in understanding the process of protein glycosyla- 
tion was the discovery that a preformed precursor oligosaccharide (composed of 
iV-acetylglucosamine, mannose, and glucose and containing a total of 14 sugars) 
is transferred en bloc to proteins in the ER. Because this oligosaccharide is trans- 
ferred to the side-chain NH 2 group of an asparagine amino acid in the protein, 
it is said to be N-linked or asparagine-linked (Figure 12-50). The transfer is cat- 
alyzed by a membrane-bound enzyme complex, an oligosaccharyl transferase, 
which has its active site exposed on the lumenal side of the ER membrane; this 
explains why cytosolic proteins are not glycosylated in this way A special lipid 
molecule called dolichol holds the precursor oligosaccharide in the ER mem- 
brane. It transfers its oligosaccharide chain to the target asparagine in a single 
enzymatic step immediately after that amino acid has reached the ER lumen 
during protein translocation (Figure 12-51). One copy of oligosaccharyl trans- 
ferase is associated with each protein translocator, allowing it to scan and glyco- 
sylate the incoming polypeptide chains efficiently. 

The precursor oligosaccharide is linked to the dolichol lipid by a high- 
energy pyrophosphate bond, which provides the activation energy that drives 
the glycosylation reaction illustrated in Figure 12-51. The entire precursor 
oligosaccharide is built up sugar by sugar on this membrane-bound lipid 
molecule and is then transferred to a protein. The sugars are first activated in the 
cytosol by the formation of nucleotide- sugar intermediates, which then donate 
their sugar (directly or indirectly) to the lipid in an orderly sequence. Partway 
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Figure 12-51 Protein glycosylation in 
the rough ER. Almost as soon as a 
polypeptide chain enters the ER lumen, it 
is glycosylated on target asparagine 
amino acids. The precursor 
oligosaccharide shown in Figure 12-50 is 
transferred to the asparagine as an intact 
unit in a reaction catalyzed by a 
membrane-bound oligosaccharyl 
transferase enzyme. As with signal 
peptidase, one copy of this enzyme is 
associated with each protein translocator 
in the ER membrane. (The ribosome is 
not shown.) 
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Figure 1 2-52 Synthesis of the lipid- 
linked precursor oligosaccharide in the 
rough ER membrane. The oligosaccharide 
is assembled sugar by sugar onto the 
carrier lipid dolichol (a polyisoprenoid; see 
Panel 2-5, pp. 114-115). Dolichol is long 
and very hydrophobic: its 22 five-carbon 
units can span the thickness of a lipid 
bilayer more than three times, so that the 
attached oligosaccharide is firmly 
anchored in the membrane. The first sugar 
is linked to dolichol by a pyrophosphate 
bridge. This high-energy bond activates 
the oligosaccharide for its eventual 
transfer from the lipid to an asparagine 
side chain of a nascent polypeptide on the 
lumenal side of the rough ER. As indicated, 
the synthesis of the oligosaccharide starts 
on the cytosolic side of the ER membrane 
and continues on the lumenal face after 
the (Man) 5 (GlcNAc) 2 lipid intermediate is 
flipped across the bilayer by a transporter. 
All the subsequent glycosyl transfer 
reactions on the lumenal side of the ER 
involve transfers from dolichol-P-glucose 
and dolichol-P-mannose; these activated, 
lipid-linked monosaccharides are 
synthesized from dolichol phosphate and 
UDP-glucose or GDP-mannose (as 
appropriate) on the cytosolic side of the 
ER and are then thought to be flipped 
across the ER membrane. GlcNAc = 
A/-acetylglucosamine; Man = mannose; 
Glc = glucose. 



through this process, the lipid-linked oligosaccharide is flipped, with the help of 
a transporter, from the cytosolic to the lumenal side of the ER membrane (Fig- 
ure 12-52). 

All of the diversity of the N- linked oligosaccharide structures on mature gly- 
coproteins results from the later modification of the original precursor oligosac- 
charide. While still in the ER, three glucoses (see Figure 12-50) and one mannose 
are quickly removed from the oligosaccharides of most glycoproteins. We shall 
return to the importance of glucose trimming shortly. This oligosaccharide 
"trimming" or "processing" continues in the Golgi apparatus and is discussed in 
Chapter 13. 

The AMinked oligosaccharides are by far the most common oligosaccha- 
rides, being found on 90% of all glycoproteins. Less frequently, oligosaccharides 
are linked to the hydroxyl group on the side chain of a serine, threonine, or 
hydroxylysine amino acid. These O-linked oligosaccharides are formed in the 
Golgi apparatus. 



Oligosaccharides Are Used as Tags to Mark the State of Protein 
Folding 

It has long been debated why glycosylation is such a common modification of 
proteins that enter the ER. One particularly puzzling observation has been that 
some proteins require AZ-linked glycosylation for proper folding in the ER, yet 
the precise location of the oligosaccharides attached to the protein's surface 
does not seem to matter. A clue to the role of glycosylation in protein folding 
came from studies of two ER chaperone proteins, which are called calnexin and 
calreticulin because they require Ca 2+ for their activities. These chaperones are 
carbohydrate-binding proteins, or lectins, which bind to oligosaccharides on 
incompletely folded proteins and retain them in the ER. Like other chaperones, 
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they prevent incompletely folded proteins from becoming irreversibly aggre- 
gated. Both calnexin and calreticulin also promote the association of incom- 
pletely folded proteins with another ER chaperone, which binds to cysteines 
that have not yet formed disulfide bonds. 

Calnexin and calreticulin recognize AMinked oligosaccharides that contain 
a single terminal glucose, and therefore bind proteins only after two of the three 
glucoses on the precursor oligosaccharide have been removed by ER glucosi- 
dases. When the third glucose has been removed, the protein dissociates from its 
chaperone and can leave the ER. 

How, then, do calnexin and calreticulin distinguish properly folded from 
incompletely folded proteins? The answer lies in yet another ER enzyme, a glu- 
cosyl transferase that keeps adding a glucose to those oligosaccharides that have 
lost their last glucose. It adds the glucose, however, only to oligosaccharides that 
are attached to unfolded proteins. Thus, an unfolded protein undergoes contin- 
uous cycles of glucose trimming (by glucosidase) and glucose addition (by gly- 
cosyl transferase), maintaining an affinity for calnexin and calreticulin until it 
has achieved its fully folded state (Figure 12-53). 



Improperly Folded Proteins Are Exported from the ER and 
Degraded in the Cytosol 

Despite all the help from chaperones, many protein molecules (more than 80% 
for some proteins) translocated into the ER fail to achieve their properly folded 
or oligomeric state. Such proteins are exported from the ER back into the 
cytosol, where they are degraded. The mechanism of retrotranslocation, also 
called dislocation, is still unknown but is likely to be similar to other post-trans- 
lational modes of translocation. For example, like translocation into mitochon- 
dria or chloroplasts, chaperone proteins are probably necessary to keep the 
polypeptide chain in an unfolded state prior to and during transport. Similarly, 
a source of energy is required to provide directionality to the transport and to 
pull the protein into the cytosol. Finally, a translocator, possibly composed of 
some of the same components that are used for forward transport into the ER 
(such as Sec61), is presumably necessary. 

Selecting proteins from the ER for degradation is a challenging process. Mis- 
folded proteins or unassembled protein subunits should be degraded, but folding 



UNFOLDED 



' 



NCOMPLETELY 
FOLDED 



NORMALLY FOLDED 



I I^glucose / glue 

\ * I udp 



GLUCOSE > 
TRIMMING 




"ft 



ER membrane 



CYTOSOL 



Figure 1 2-53 The role of AMinked 
glycosylation in ER protein folding. The 

ER-membrane-bound chaperone protein 
calnexin binds to incompletely folded 
proteins containing one terminal glucose 
on AMinked oligosaccharides, trapping 
the protein in the ER. Removal of the 
terminal glucose by a glucosidase 
releases the protein from calnexin. A 
glucosyl transferase is the crucial enzyme 
that determines whether the protein is 
folded properly or not: if the protein is 
still incompletely folded, the enzyme 
transfers a new glucose from 
UDP-glucose to the AMinked 
oligosaccharide, renewing the protein's 
affinity for calnexin and retaining it in the 
ER.The cycle repeats until the protein has 
folded completely. Calreticulin functions 
similarly, except that it is a soluble ER 
resident protein. Another ER chaperone, 
ERp57 (not shown), collaborates with 
calnexin and calreticulin in retaining an 
incompletely folded protein in the ER. 
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Figure 1 2-54 The export and 
degradation of misfolded ER proteins. 

Misfolded soluble proteins in the ER 
lumen are translocated back into the 
cytosol, where they are deglycosylated, 
ubiquitylated, and degraded in 
proteasomes. Misfolded membrane 
proteins follow a similar pathway. 



chaperone 



intermediates of newly made proteins should not. Help in making this distinc- 
tion comes from the A/-linked oligosaccharides, which serve as timers that mea- 
sure how long a protein has spent in the ER. The slow trimming of a particular 
mannose on the core-oligosaccharide tree by an enzyme (a mannosidase) in the 
ER is thought to create a new oligosaccharide structure that the retrotransloca- 
tion apparatus recognizes. Proteins that fold and exit from the ER faster than the 
action of the mannosidase would therefore escape degradation. 

Once a misfolded protein has been retrotranslocated into the cytosol, an A/- 
glycanase removes its oligosaccharide chains en bloc. The deglycosylated 
polypeptide is rapidly ubiquitylated by ER-bound ubiquitin-conjugating 
enzymes and is then fed into proteasomes (discussed in Chapter 6), where it is 
degraded (Figure 12-54). 



Misfolded Proteins in the ER Activate an Unfolded Protein 
Response 

Cells carefully monitor the amount of misfolded protein in various compart- 
ments. An accumulation of misfolded proteins in the cytosol, for example, trig- 
gers a heat-shock response (discussed in Chapter 6), which stimulates the tran- 
scription of genes encoding cytosolic chaperones that help to refold the pro- 
teins. Similarly, an accumulation of misfolded proteins in the ER triggers an 
unfolded protein response, which includes an increased transcription of genes 
encoding ER chaperones, proteins involved in retro translocation and protein 
degradation in the cytosol, and many other proteins that help to increase the 
protein folding capacity of the ER. 

How do misfolded proteins in the ER signal to the nucleus? There are three 
parallel pathways that execute the unfolded protein response (Figure 12-55A). 
The first pathway, which was initially discovered in yeast cells, is particularly 
remarkable. Misfolded proteins in the ER activate a transmembrane protein 
kinase in the ER, which causes the kinase to oligomerize and phosphorylate 
itself. (Some cell- surface receptors in the plasma membrane are activated in a 
similar way, as discussed in Chapter 15). The oligomerization and autophos- 
phorylation activates an endoribonuclease domain in the cytosolic portion of 
the same molecule, which cleaves a specific, cytosolic RNA molecule at two 
positions, excising an intron. The separated exons are then joined by an RNA 
ligase, generating a spliced mRNA, which is translated to produce an active 
gene regulatory protein. This protein activates the transcription of the genes 
encoding the proteins that mediate the unfolded protein response (Figure 
12-55B). 
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Figure 12-55 The unfolded protein 
response. (A) By three parallel 
intracellular signaling pathways, the 
accumulation of misfolded proteins in 
the ER lumen signals to the nucleus to 
activate the transcription of genes that 
encode proteins that help the cell to 
cope with the abundance of misfolded 
proteins in the ER. (B) Regulated mRNA 
splicing is a key regulatory switch in 
Pathway 1 of the unfolded protein 
response. 
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Misfolded proteins also activate a second transmembrane kinase in the ER, 
which inhibits a translation initiation factor by phosphorylating it and thereby 
reduces the production of new proteins throughout the cell. One consequence 
of the reduction in protein translation is to reduce the flux of proteins into the 
ER, thereby limiting the load of proteins that need to be folded there. Some pro- 
teins, however, are preferentially translated when translation initiation factors 
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are scarce (see p. 490), and one of these is a gene regulatory protein that helps 
activate the transcription of the genes encoding proteins active in the unfolded 
protein response. 

Finally, a third gene regulatory protein is initially synthesized as an integral 
ER membrane protein. Because it is covalently tethered to the membrane, it can- 
not activate the transcription of genes in the nucleus. When misfolded proteins 
accumulate in the ER, the transmembrane protein is transported to the Golgi 
apparatus, where it encounters proteases that cleave off its cytosolic domain, 
which can now migrate to the nucleus and help activate the transcription of the 
genes encoding proteins involved in the unfolded protein response. The relative 
importance of each of these three pathways differs in different cell types, 
enabling each cell type to tailor the unfolded protein response to its particular 
needs. 



Some Membrane Proteins Acquire a Covalently Attached 
Glycosylphosphatidylinositol (GPI) Anchor 

As discussed in Chapter 10, several cytosolic enzymes catalyze the covalent addi- 
tion of a single fatty acid chain or prenyl group to selected proteins. The attached 
lipids help to direct these proteins to cell membranes. A related process is cat- 
alyzed by ER enzymes, which covalently attach a glycosylphosphatidyl-inositol 
(GPI) anchor to the C -terminus of some membrane proteins destined for the 
plasma membrane. This linkage forms in the lumen of the ER, where, at the same 
time, the transmembrane segment of the protein is cleaved off (Figure 12-56). A 
large number of plasma membrane proteins are modified in this way. Since they 
are attached to the exterior of the plasma membrane only by their GPI anchors, 
they can in principle be released from cells in soluble form in response to signals 
that activate a specific phospholipase in the plasma membrane. Trypanosome 
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Figure 1 2-56 The attachment of a GPI anchor to a protein in the ER. Immediately after the 
completion of protein synthesis, the precursor protein remains anchored in the ER membrane by 
a hydrophobic C-terminal sequence of 15-20 amino acids; the rest of the protein is in the ER 
lumen. Within less than a minute, an enzyme in the ER cuts the protein free from its membrane- 
bound C-terminus and simultaneously attaches the new C-terminus to an amino group on a 
preassembled GPI intermediate. The sugar chain contains an inositol attached to the lipid from 
which the GPI anchor derives its name. It is followed by a glucosamine and three mannoses. The 
terminal mannose links to a phosphoethanolamine that provides the amino group to attach the 
protein. The signal that specifies this modification is contained within the hydrophobic 
C-terminal sequence and a few amino acids adjacent to it on the lumenal side of the ER 
membrane; if this signal is added to other proteins, they too become modified in this way. 
Because of the covalently linked lipid anchor, the protein remains membrane-bound, with all of its 
amino acids exposed initially on the lumenal side of the ER and eventually on the cell exterior. 
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Figure 1 2-57 The synthesis of phosphatidylcholine. As illustrated, this phospholipid is synthesized from 
glycerol 3-phosphate, cytidine-diphosphocholine (CDP-choline), and fatty acids delivered to the ER by fatty 
acid binding protein. 



parasites, for example, use this mechanism to shed their coat of GPI-anchored 
surface proteins when attacked by the immune system. GPI anchors may also be 
used to direct plasma membrane proteins into lipid rafts and thus segregate the 
proteins from other membrane proteins, as we discuss in Chapter 13. 



The ER Assembles Most Lipid Bilayers 

The ER membrane synthesizes nearly all of the major classes of lipids, including 
both phospholipids and cholesterol, required for the production of new cell 
membranes. The major phospholipid made is phosphatidylcholine (also called 
lecithin), which can be formed in three steps from choline, two fatty acids, and 
glycerol phosphate (Figure 12-57). Each step is catalyzed by enzymes in the ER 
membrane that have their active sites facing the cytosol, where all of the 
required metabolites are found. Thus, phospholipid synthesis occurs exclusively 
in the cytosolic leaflet of the ER membrane. Because fatty acids are not soluble 
in water, they are shepherded from their sites of synthesis to the ER by a fatty 
acid binding protein in the cytosol. After arrival in the ER membrane and acti- 
vation with CoA, acyl transferases successively add two fatty acids to glycerol 
phosphate to produce phosphatidic acid. Phosphatidic acid is sufficiently 
water-insoluble to remain in the lipid bilayer, and it cannot be extracted from 
the bilayer by the fatty acid binding proteins. It is therefore this first step that 
enlarges the ER lipid bilayer. The later steps determine the head group of a newly 
formed lipid molecule and therefore the chemical nature of the bilayer, but they 
do not result in net membrane growth. The two other major membrane phos- 
pholipids — phosphatidyl ethanolamine and phosphatidylserine — as well as the 
minor phospholipid phosphatidylinositol (PI), are all synthesized in this way. 

Because phospholipid synthesis takes place in the cytosolic half of the ER 
lipid bilayer, there needs to be a mechanism that transfers some of the newly 
formed phospholipid molecules to the lumenal leaflet of the bilayer. In synthetic 
lipid bilayers, lipids do not "flip-flop" in this way. In the ER, however, phospho- 
lipids equilibrate across the membrane within minutes, which is almost 100,000 
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Figure 1 2-58 The role of phospholipid translocators in lipid bilayer synthesis. (A) Because 
new lipid molecules are added only to the cytosolic half of the bilayer and lipid molecules do 
not flip spontaneously from one monolayer to the other, a membrane-bound phospholipid 
translocator (called a scramblase) is required to transfer lipid molecules from the cytosolic half 
to the lumenal half so that the membrane grows as a bilayer. The scramblase is not specific for 
particular phospholipid head groups and therefore equilibrates the different phospholipids 
between the two monolayers. (B) Fueled by ATP hydrolysis, a head-group-specific flippase in the 
plasma membrane actively flips phosphatidylserineand phosphatidyl-ethanolamine 
directionally from the extracellular to the cytosolic leaflet, creating the characteristically 
asymmetric lipid bilayer of the plasma membrane of animal cells (see Figure 1 0-16). 



times faster than can be accounted for by spontaneous "flip-flop." This rapid 
trans-bilayer movement is mediated by a poorly characterized phospholipid 
translocator called a scramblase, which equilibrates phospholipids between the 
two leaflets of the lipid bilayer (Figure 12-58). Thus, the different types of phos- 
pholipids are thought to be equally distributed between the two leaflets of the 
ER membrane. 

The plasma membrane contains a different type of phospholipid transloca- 
tor that belongs to the family of P- type pumps (discussed in Chapter 11). These 
flippases specifically remove phospholipids containing in their head groups free 
amino groups (phosphatidylserine and phosphatidylethanolamine — see Figure 
10-3) from the extracellular leaflet and use the energy of ATP hydrolysis to flip 
them directionally into the leaflet facing the cytosol. The plasma membrane 
therefore has a highly asymmetric phospholipid composition, which is actively 
maintained by the flippases (see Figure 10-16). The plasma membrane also con- 
tains a scramblase but, in contrast to the ER scramblase, which is always active, 
the plasma membrane enzyme is regulated and only activated in some situa- 
tions, such as in apoptosis and in activated platelets, where it acts to abolish 
lipid asymmetry; the resulting exposure of phosphotidylserine on the surface of 
apoptotic cells serves as a signal for phagocytic cells to ingest and degrade the 
dead cell. 

The ER also produces cholesterol and ceramide (Figure 12-59). Ceramide is 
made by condensing the amino acid serine with a fatty acid to form the amino 
alcohol sphingosine (see Figure 10-3); a second fatty acid is then added to form 
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Figure 12-59 The structure of ceramide. 
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ceramide. The ceramide is exported to the Golgi apparatus, where it serves as a 
precursor for the synthesis of two types of lipids: oligosaccharide chains are 
added to form glycosphingolipids (glycolipids; see Figure 10-18), and phospho- 
choline head groups are transferred from phosphatidylcholine to other 
ceramide molecules to form sphingomyelin (discussed in Chapter 10). Thus, 
both glycolipids and sphingomyelin are produced relatively late in the process 
of membrane synthesis. Because they are produced by enzymes exposed to the 
Golgi lumen, they are found exclusively in the noncytosolic leaflet of the lipid 
bilayers that contain them. 

As discussed in Chapter 13, the plasma membrane and the membranes of 
the Golgi apparatus, lysosomes, and endosomes all form part of a membrane 
system that communicates with the ER by means of transport vesicles, which 
transfer both proteins and lipids. Mitochondria and plastids, however, do not 
belong to this system, and they therefore require different mechanisms to 
import proteins and lipids for growth. We have already seen that they import 
most of their proteins from the cytosol. Although mitochondria modify some of 
the lipids they import, they do not synthesize lipids de novo; instead, their lipids 
have to be imported from the ER, either directly, or indirectly by way of other cell 
membranes. In either case, special mechanisms are required for the transfer. 

The details of how lipid distribution between different membranes is cat- 
alyzed and regulated are not known. Water-soluble carrier proteins called phos- 
pholipid exchange proteins (or phospholipid transfer proteins) are thought to 
transfer individual phospholipid molecules between membranes, functioning 
much like fatty acid binding proteins that shepherd fatty acids through the 
cytosol. In addition, mitochondria are often seen in close juxtaposition to ER 
membranes in electron micrographs, and there may be specific lipid transfer 
mechanisms that operate between adjacent membranes. 

Summary 

The extensive ER network serves as a factory for the production of almost all of the cell's 
lipids. In addition, a major portion of the cell's protein synthesis occurs on the cytoso- 
lic surface of the ER: all proteins destined for secretion and all proteins destined for the 
ER itself the Golgi apparatus, the lysosomes, the endosomes, and the plasma mem- 
brane are first imported into the ERfrom the cytosol. In the ER lumen, the proteins fold 
and oligomerize, disulfide bonds are formed, and N -linked oligosaccharides are 
added. The pattern ofN-linked glycosylation is used to indicate the extent of protein 
folding, so that proteins leave the ER only when they are properly folded. Proteins that 
do not fold or oligomerize correctly are translocated back into the cytosol, where they 
are deglycosylated, ubiquitylated, and degraded in proteasomes. Ifmisfolded proteins 
accumulate in excess in the ER, they trigger an unfolded protein response, which acti- 
vates appropriate genes in the nucleus to help the ER to cope. 

Only proteins that carry a special ER signal sequence are imported into the ER. The 
signal sequence is recognized by a signal recognition particle (SRP), which binds both 
the growing polypeptide chain and a ribosome and directs them to a receptor protein 
on the cytosolic surface of the rough ER membrane. This binding to the ER membrane 
initiates the translocation process by threading a loop of polypeptide chain across the 
ER membrane through the hydrophilic pore in a transmembrane protein translocator. 

Soluble proteins — destined for the ER lumen, for secretion, or for transfer to the 
lumen of other organelles — pass completely into the ER lumen. Transmembrane pro- 
teins destined for the ER or for other cell membranes are translocated partway across 
the ER membrane and remain anchored there by one or more membrane- spanning a- 
helical regions in their polypeptide chains. These hydrophobic portions of the protein 
can act either as start-transfer or stop-transfer signals during the translocation pro- 
cess. When a polypeptide contains multiple, alternating start-transfer and stop-trans- 
fer signals, it will pass back and forth across the bilayer multiple times as a multipass 
transmembrane protein. 

The asymmetry of protein insertion and glycosylation in the ER establishes the sid- 
edness of the membranes of all the other organelles that the ER supplies with mem- 
brane proteins. 
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PROBLEMS 

Which statements are true? Explain why or why not. 

12-1 Like the lumen of the ER, the interior of the nucleus 
is topologically equivalent to the outside of the cell. 

12-2 Membrane -bound and free ribosomes, which are 
structurally and functionally identical, differ only in the pro- 
teins they happen to be making at a particular time. 

1 2-3 To avoid the inevitable congestion that would occur if 
two-way traffic through a single pore were allowed, nuclear 
pore complexes are specialized so that some mediate import 
while others mediate export. 

1 2-4 Peroxisomes are found in only a few specialized types 
of eucaryotic cell. 

1 2-5 In multipass transmembrane proteins the odd-num- 
bered transmembrane segments (counting from the N-ter- 
minus) act as start-transfer signals and the even-numbered 
segments act as stop-transfer signals. 

Discuss the following problems. 

1 2-6 What is the fate of a protein with no sorting signal? 

12-7 Imagine that you have engineered a set of genes, 
each encoding a protein with a pair of conflicting signal 
sequences that specify different compartments. If the genes 
were expressed in a cell, predict which signal would win out 
for the following combinations. Explain your reasoning. 

A. Signals for import into nucleus and import into ER. 

B. Signals for import into peroxisomes and import into ER. 

C. Signals for import into mitochondria and retention in ER. 

D. Signals for import into nucleus and export from nucleus. 

1 2-8 The rough ER is the site of synthesis of many differ- 
ent classes of membrane protein. Some of these proteins 
remain in the ER, whereas others are sorted to compart- 
ments such as the Golgi apparatus, lysosomes, and the 
plasma membrane. One measure of the difficulty of the sort- 
ing problem is the degree of "purification" that must be 
achieved during transport from the ER. Are proteins bound 
for the plasma membrane common or rare among all ER 
membrane proteins? 

A few simple considerations allow one to answer this 
question. In a typical growing cell that is dividing once every 
24 hours, the equivalent of one new plasma membrane 
must transit the ER every day. If the ER membrane is 20 
times the area of a plasma membrane, what is the ratio of 
plasma membrane proteins to other membrane proteins in 
the ER? (Assume that all proteins on their way to the plasma 
membrane remain in the ER for 30 minutes on average 
before exiting, and that the ratio of proteins to lipids in the 
ER and plasma membranes is the same.) 

12-9 Before nuclear pore complexes were well under- 
stood, it was unclear whether nuclear proteins diffused pas- 
sively into the nucleus and accumulated there by binding to 
residents of the nucleus such as chromosomes, or were 
actively transported and accumulated regardless of their 
affinity for nuclear components. 

A classic experiment that addressed this problem used 
several forms of radioactive nucleoplasmin, which is a large 
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Figure Q1 2-1 Cell location of injected nucleoplasmin and 
nucleoplasmin components (Problem 12-9). Schematic diagrams of 
autoradiographs show the cytoplasm and nucleus with the location 
of nucleoplasmin indicated by the red areas. 

pentameric protein involved in chromatin assembly. In this 
experiment, either the intact protein or the nucleoplasmin 
heads, tails, or heads with a single tail were injected into the 
cytoplasm of a frog oocyte or into the nucleus (Figure 
Q12-1). All forms of nucleoplasmin, except heads, accumu- 
lated in the nucleus when injected into the cytoplasm, and 
all forms were retained in the nucleus when injected there. 

A. What portion of the nucleoplasmin molecule is respon- 
sible for localization in the nucleus? 

B. How do these experiments distinguish between active 
transport, in which a nuclear localization signal triggers 
transport by the nuclear pore complex, and passive diffu- 
sion, in which a binding site for a nuclear component allows 
accumulation in the nucleus? 

12-10 Assuming that 32 million histone octamers are 
required to package the human genome, how many histone 
molecules must be transported per second per pore com- 
plex in cells whose nuclei contain 3000 nuclear pores and 
are dividing once per day? 

12-11 The structures of Ran-GDP and Ran-GTP (actually 
Ran-GppNp, a stable GTP analog) are strikingly different, as 
shown in Figure Q12-2. Not surprisingly, Ran-GDP binds to 
a different set of proteins than does Ran-GTP. 

To look at the uptake of Ran itself into the nuclei of per- 
meabilized cells, you attach a red fluorescent tag to a cys- 
teine side chain in Ran to make it visible. This modified Ran 
supports normal nuclear uptake. Fluorescent Ran-GDP is 
taken up by nuclei only if cytoplasm is added, whereas a 
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Figure Q12-2 Structures 
of Ran-GDP and Ran- 
GppNp (Problem 12-11). 
The red portions of the 
structures show the 
segment of Ran that 
differs most dramatically 
when GDP or the GTP 
analog is bound. 
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Figure Q12-3 Proteins eluted 
from Ran-GDP (lane 1) and 
RanQ69L-GTP (lane 2) affinity 
columns (Problem 12-11). The 
molecular weights of marker 
proteins are shown on the left. 
(From K. Ribbeck et al., EMBOJ. 
17:6587-6598, 1998. With 
permission from Macmillan 
Publishers Ltd.) 



mutant form, RanQ69L-GTP, which is unable to hydrolyze 
GTP, is not taken up in the presence or absence of cyto- 
plasm. To identify the cytoplasmic protein that is crucial for 
Ran-GDP uptake, you construct affinity columns with 
bound Ran-GDP or bound RanQ69L-GTP and pass cyto- 
plasm through them. Cytoplasm passed over a Ran-GDP 
column no longer supports nuclear uptake of Ran-GDP, 
whereas cytoplasm passed over a column of RanQ69L-GTP 
retains this activity. You elute the bound proteins from each 
column and analyze them on an SDS polyacrylamide gel, 
looking for differences that might identify the factor that is 
required for nuclear uptake of Ran (Figure Q12-3). 

A. Why did you use RanQ69L-GTP instead of Ran- GTP in 
these experiments? Would Ran-GppNp in place of RanQ69L- 
GTP have achieved the same purpose? 

B. Which of the many proteins eluted from the two differ- 
ent affinity columns is a likely candidate for the factor that 
promotes nuclear import of Ran-GDP? 

C. What other protein or proteins would you predict that 
the Ran-GDP import factor would be likely to bind in order 
to carry out its function? 

D. How might you confirm that the factor you have identi- 
fied is necessary for promoting the nuclear uptake of Ran? 

12-12 Components of the TIM complex, the multisubunit 
protein translocator in the mitochondrial inner membrane, 
are much less abundant than those of the TOM complex. 
They were initially identified by using a genetic trick. The 
yeast Ura3 gene, whose product is normally located in the 
cytosol where it is essential for synthesis of uracil, was mod- 
ified to carry an import signal for the mitochondrial matrix. 
A population of cells carrying the modified Ura3 gene was 
then grown in the absence of uracil. Most cells died, but the 
rare cells that grew were shown to be defective for mitochon- 
drial import. Explain how this selection identifies cells with 
defects in components required for mitochondrial import. 
Why do normal cells with the modified Ura3 gene not grow 
in the absence of uracil? Why do cells that are defective for 
mitochondrial import grow in the absence of uracil? 

12-13 If the enzyme dihydrofolate reductase (DHFR), 
which is normally located in the cytosol, is engineered to 
carry a mitochondrial targeting sequence at its N-terminus, 
it is efficiently imported into mitochondria. If the modified 
DHFR is first incubated with methotrexate, which binds 
tightly to the active site, the enzyme remains in the cytosol. 
How do you suppose that the binding of methotrexate inter- 
feres with mitochondrial import? 



12-14 Why do mitochondria need a fancy translocator to 
import proteins across the outer membrane? Their outer 
membranes already have large pores formed by porins. 

1 2-1 5 Catalase, an enzyme normally found in peroxisomes, 
is present in normal amounts in cells that do not have visi- 
ble peroxisomes. It is possible to determine the location of 
catalase in such cells using immunofluorescence 
microscopy with antibodies specific for catalase. Fluores- 
cence micrographs of normal cells and peroxisome-defi- 
cient cells are shown in Figure Q12-4. Where is catalase 
located in cells without peroxisomes (Figure Q12-4B)? Why 
does catalase show up as small dots of fluorescence in nor- 
mal cells (Figure Q12-4A)? 




10 urn 
Figure Q12-4 Location of catalase in cells as determined by 
immunofluorescence microscopy (Problem 12-15). (A) Normal cells. 
(B) Peroxisome-deficient cells. Cells were reacted with antibodies 
specific for catalase, washed, and then stained with a fluorescein- 
labeled second antibody that is specific for the catalase-specific 
antibody. The two panels are at the same magnification. (From 
N. Kinoshita et al., J. Biol. Chem. 273:241 22-241 30, 1 998. With 
permission from the American Society for Biochemistry and 
Molecular Biology.) 



12-16 Examine the multipass transmembrane protein 
shown in Figure Q12-5. What would you predict would be 
the effect of converting the first hydrophobic transmem- 
brane segment to a hydrophilic segment? Sketch the 
arrangement of the modified protein in the ER membrane. 

COOH 
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Figure Q12-5 Arrangement of a multipass transmembrane protein in 
the ER membrane (Problem 12-16). Hexagons represent covalently 
attached oligosaccharides. 



12-17 All new phospholipids are added to the cytoplasmic 
leaflet of the ER membrane, yet the ER membrane has a 
symmetrical distribution of different phospholipids in its 
two leaflets. By contrast, the plasma membrane, which 
receives all its membrane components ultimately from the 
ER, has a very asymmetrical distribution of phospholipids in 
the two leaflets of its lipid bilayer. How is symmetry gener- 
ated in the ER membrane, and how is asymmetry generated 
in the plasma membrane? 
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lapter 1 3 



Intracellular Vesicular Traffic 




Every cell must eat, communicate with the world around it, and quickly respond 
to changes in its environment. To help accomplish these tasks, cells continually 
adjust the composition of their plasma membrane in rapid response to need. 
They use an elaborate internal membrane system to add and remove cell-surface 
proteins embedded in the membrane, such as receptors, ion channels, and trans- 
porters. Through the process of exocytosis, the biosynthetic-secretory pathway 
delivers newly synthesized proteins, carbohydrates, and lipids to either the 
plasma membrane or the extracellular space. By the converse process of endocy- 
tosis (Figure 13-1) cells remove plasma membrane components and deliver 
them to internal compartments called endosomes, from where they can be recy- 
cled to the same or different regions of the plasma membrane or can be delivered 
to lysosomes for degradation. Cells also use endocytosis to capture important 
nutrients, such as vitamins, lipids, cholesterol, and iron; these are taken up 
together with the macromolecules to which they bind and are then released in 
endosomes or lysosomes and transported into the cytosol, where they are used in 
various biosynthetic processes. 

The interior space, or lumen, of each membrane-enclosed compartment 
along the biosynthetic-secretory and endocytic pathways is topologicaJly equiv- 
alent to the lumen of most other membrane-enclosed compartments and to the 
cell exterior. Proteins can travel in this space without having to cross a mem- 
brane, being passed from one compartment to another by means of numerous 
membrane-enclosed transport containers. Some of these containers are small 
spherical vesicles, while others are larger irregular vesicles or tubules formed 
from the donor compartment. We shall use the term transport vesicle to apply 
to all forms of these containers. 

Within a eucaryotic cell, transport vesicles continually bud off from one 
membrane and fuse with another, carrying membrane components and soluble 
molecules, which are referred to as cargo (Figure 13-2). This membrane traffic 
flows along highly organized, directional routes, which allow the cell to secrete, 
eat, and remodel its plasma membrane. The biosynthetic-secretory pathway 
leads outward from the endoplasmic reticulum (ER) toward the Golgi apparatus 
and cell surface, with a side route leading to lysosomes, while the endocytic 
pathway leads inward from the plasma membrane. In each case, the flow of 
membrane between compartments is balanced, with retrieval pathways balanc- 
ing the flow in the opposite direction, bringing membrane and selected proteins 
back to the compartment of origin (Figure 13-3). 

To perform its function, each transport vesicle that buds from a compart- 
ment must be selective. It must take up only the appropriate molecules and 
must fuse only with the appropriate target membrane. A vesicle carrying cargo 
from the Golgi apparatus to the plasma membrane, for example, must exclude 
proteins that are to stay in the Golgi apparatus, and it must fuse only with the 
plasma membrane and not with any other organelle. 

We begin this chapter by considering the molecular mechanisms of budding 
and fusion that underlie all vesicular transport. We then discuss the fundamen- 
tal problem of how, in the face of this transport, the cell maintains the differ- 
ences between the compartments. Finally, we consider the function of the Golgi 
apparatus, lysosomes, secretory vesicles, and endosomes, as we trace the path- 
ways that connect these organelles. 
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(A) exocytosis 




(B) endocytosis 



Figure 13-1 Exocytosis and endocytosis. 

(A) In exocytosis, a transport vesicle fuses 
with the plasma membrane. Its content is 
released into the extracellular space, 
while the vesicle membrane {red) 
becomes continuous with the plasma 
membrane. (B) In endocytosis, a plasma 
membrane patch {red) is internalized 
forming a transport vesicle. Its content 
derives from the extracellular space. 



THE MOLECULAR MECHANISMS OF MEMBRANE 
TRANSPORT AND THE MAINTENANCE OF 
COMPARTMENTAL DIVERSITY 

Vesicular transport mediates a continuous exchange of components between 
the ten or more chemically distinct, membrane-enclosed compartments that 
collectively comprise the biosynthetic-secretory and endocytic pathways. With 
this massive exchange, how can each compartment maintain its special iden- 
tity? To answer this question, we must first consider what defines the character 
of a compartment. Above all, it is the composition of the enclosing membrane: 
molecular markers displayed on the cytosolic surface of the membrane serve as 
guidance cues for incoming traffic to ensure that transport vesicles fuse only 
with the correct compartment. Many of these membrane markers, however, are 
found on more than one compartment, and it is the specific combination of 
marker molecules that gives each compartment its unique molecular address. 

How are these membrane markers kept at high concentration on one com- 
partment and at low concentration on another? To answer this question, we 
need to consider how patches of membrane, enriched or depleted in specific 
membrane components, bud off from one compartment and transfer to 
another. Panel 13-1 outlines some of the genetic and biochemical strategies that 
have been used to study the molecular machinery involved in vesicular trans- 
port. 

We begin by discussing how cells segregate proteins into separate mem- 
brane domains by assembling a special protein coat on the membrane's cytoso- 
lic face. We consider how coats form, what they are made of, and how they are 
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Figure 1 3-2 Vesicular transport. 

Transport vesicles bud off from one 
compartment and fuse with another. As 
they do so, they carry material as cargo 
from the lumen (the space within a 
membrane-enclosed compartment) and 
membrane of the donor compartment to 
the lumen and membrane of the target 
compartment, as shown. Note that the 
processes of vesicle budding and fusion 
are not symmetrical: budding requires a 
membrane fusion event initiated from 
the lumenal side of the membrane, while 
vesicle fusion requires a membrane 
fusion event initiated from the 
cytoplasmic side of both donor and 
target membranes. 
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Figure 13-3 A "road-map" of the biosynthetic-secretory and endocytic pathways. (A) In this diagram, which was 
introduced in Chapter 12, the endocytic and biosynthetic-secretory pathways are illustrated with green and red arrows, 
respectively. In addition, blue arrows denote retrieval pathways for the backflow of selected components. (B) The 
compartments of the eucaryotic cell involved in vesicular transport. The lumen of each membrane-enclosed compartment 
is topologically equivalent to the outside of the cell. All compartments shown communicate with one another and the 
outside of the cell by means of transport vesicles. In the biosynthetic-secretory pathway (red arrows), protein molecules are 
transported from the ER to the plasma membrane or (via endosomes) to lysosomes. In the endocytic pathway (green 
arrows), molecules are ingested in vesicles derived from the plasma membrane and delivered to early endosomes and then 
(via late endosomes) to lysosomes. Many endocytosed molecules are retrieved from early endosomes and returned to the 
cell surface for reuse; similarly, some molecules are retrieved from the early and late endosomes and returned to the Golgi 
apparatus, and some are retrieved from the Golgi apparatus and returned to the ER. All of these retrieval pathways are 
shown with blue arrows, as in part (A). 

used to extract specific components from a membrane for delivery to another 
compartment. Finally, we discuss how transport vesicles dock at the appropriate 
target membrane and fuse with it to deliver their cargo. 



There Are Various Types of Coated Vesicles 

Most transport vesicles form from specialized, coated regions of membranes. 
They bud off as coated vesicles, which have a distinctive cage of proteins cover- 
ing their cytosolic surface. Before the vesicles fuse with a target membrane, they 
discard their coat, as is required for the two cytosolic membrane surfaces to 
interact directly and fuse. 

The coat performs two main functions. First, it concentrates specific mem- 
brane proteins in a specialized patch, which then gives rise to the vesicle mem- 
brane. In this way, it selects the appropriate molecules for transport. Second, the 
coat molds the forming vesicle. Coat proteins assemble into a curved, basketlike 
lattice that deforms the membrane patch and thereby shapes the vesicle. This 
may explain why vesicles with the same type of coat often have a relatively uni- 
form size and shape. 

There are three well- characterized types of coated vesicles, distinguished by 
their coat proteins: clathrin-coated, COPI-coated, and COPII-coated (Figure 
13-4). Each type is used for different transport steps. Clathrin-coated vesicles, 
for example, mediate transport from the Golgi apparatus and from the plasma 
membrane, whereas COPI- and COPII-coated vesicles most commonly mediate 
transport from the ER and from the Golgi cisternae (Figure 13-5). There is, how- 
ever, much more variety in coated vesicles and their functions than this short list 
suggests. As we discuss below, there are several types of clathrin-coated vesicles, 



PANEL 13-1: Strategies Used to Study the Molecular Mechanisms fnvolved in Vesicular Transport 






CELL-FREE SYSTEMS FOR STUDYING THE COMPONENTS AND MECHANISM OF VESICULAR 
TRANSPORT 



Vesicular transport can be reconstituted in cell-free systems. 
This was first achieved for the Golgi stack. When Golgi stacks 
are isolated from cells and incubated with cytosol and with 
ATP as a source of energy, transport vesicles bud from their 
rims and appear to transport proteins between cisternae. By 
following the progressive processing of the oligosaccharides 
on a glycoprotein as it moves from one Golgi compartment 
to the next, it is possible to follow the process of vesicular 
transport. 

To follow the transport, two distinct populations of Golgi 
stacks are incubated together. The "donor" population is 
isolated from mutant cells that lack the enzyme 
/V-acetylglucosamine (GlcNAc) transferase I and that have 
been infected with a virus; because of the mutation, the 
major viral glycoprotein fails to be modified with GlcNAc in 



the Golgi apparatus of the mutant cells. The "acceptor" 
Golgi stacks are isolated from uninfected wild-type cells and 
thus contain a good copy of GlcNAc transferase I, but lack 
the viral glycoprotein. In the mixture of Golgi stacks the viral 
glycoprotein acquires GlcNAc, indicating that it must have 
been transported between the Golgi stacks— presumably by 
vesicles that bud from the cis compartment of the donor 
Golgi and fuse with the medial compartment of the acceptor 
Golgi or by homotypic fusion between the cisternae of two 
Golgi stacks. This transport-dependent glycosylation is 
monitored by measuring the transfer of 3 H-GlcNAc from 
UDP- 3 H-GlcNAc to the viral glycoprotein. Transport occurs 
only when ATP and cytosol are added. By fractionating the 
cytosol, a number of specific cytosolic proteins have been 
identified that are required for the budding and fusion of 
transport vesicles. 
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Similar cell-free systems have been used to study transport from the medial to the trans Golgi network, from the trans Golgi 
network to the plasma membrane, from endosomes to lysosomes, from the trans Golgi network to late endosomes, and 
to study homotypic fusion between like compartments — such as endosomes and immature secretory vesicles. 



GENETIC APPROACHES FOR STUDYING VESICULAR TRANSPORT 



Genetic studies of mutant yeast cells defective for secretion 
have identified more than 25 genes that are involved in the 
secretory pathway. Many of the mutant genes encode 
temperature-sensitive proteins. These function normally at 
25°C, but when the mutant cells (A-l) are shifted to an 
elevated temperature, such as 35°C, they fail to transport 
proteins from the ER to the Golgi apparatus, others from one 
Golgi cisterna to another, and still others from the Golgi 
apparatus to the vacuole (the yeast lysosome) or to the 
plasma membrane. 

Once a protein required for secretion has been identified in 
this way, a phenomenon called multicopy suppression can be 
used to identify genes that encode other proteins that 
interact with it. At high temperatures, a 
temperature-sensitive mutant protein often has too low an 
affinity for its normal interaction partners. If the interacting 
proteins are produced at much higher concentration than 
normal, however, sufficient binding occurs to cure the defect. 



To create an experimental paradigm in which such high 
concentrations of ligand are present, mutant yeast cells (with 
a temperature-sensitive mutation in a gene involved in 
vesicular transport) are transfected with a yeast plasmid 
vector into which random normal yeast genomic DNA 
fragments have been cloned. Because these plasmids are 
maintained in cells at high copy number, any cells that 
happen to carry plasmids with intact genes will overproduce 
the normal gene product, allowing rare cells to survive at the 
high temperature. The relevant DNA fragments, which 
presumably encode proteins that interact with the original 
mutant protein, can then be isolated from the surviving cell 
clones. 

The genetic and biochemical approaches complement each 
other, and many of the proteins involved in vesicular 
transport have been identified independently by biochemical 
studies of mammalian cell-free systems and by genetic studies 
in yeast. 



GFP FUSION PROTEINS HAVE REVOLUTIONIZED 
THE STUDY OF INTRACELLULAR TRANSPORT 

One way to follow the whereabouts of a protein in living 
cells is to construct fusion proteins, in which green 
fluorescent protein (GFP) is attached by genetic 
engineering techniques to the protein of interest. When 
a cDNA encoding such a fusion protein is expressed in a 
cell, the protein is readily visible in a fluorescent 
microscope, so that it can be followed in living cells in 
real time. Fortunately for most proteins studied the 
addition of GFP does not perturb the protein's function. 

GFP fusion proteins are widely used to study the location 
and movement of proteins in cells. GFP fused to proteins 
that shuttle in and out of the nucleus, for example, 
facilitates studies of nuclear transport and its regulation. 
GFP fused to mitochondrial or Golgi proteins is used to 
study the behavior of these organelles. GFP fused to 
plasma membrane proteins allows measurement of the 
kinetics of their movement from the ER through the 
secretory pathway. Dramatic examples of such 
experiments can be seen as movies on the DVD that 
accompanies this book. 

The study of GFP fusion proteins is often combined with 
FRAP and FLIP techniques (discussed in Chapter 10), in 
which the GFP in selected regions of the cell is bleached 
by strong laser light. The rate of diffusion of unbleached 
GFP fusion proteins into that area can then be 
determined to provide measurement of the protein's 
diffusion or transport in the cell. In this way, for example, 
it was determined that many Golgi enzymes recycle 
between the Golgi apparatus and the ER. <GCTG> 

(A-D, courtesy of Jennifer Lippincott-Schwartz Lab.) 
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(A) In this experiment, cultured cells express a GFP 
fusion protein consisting of GFP attached to a viral 
coat protein — called vesicular stomatitis virus coat 
protein. The viral protein is an integral membrane 
protein that normally moves through the secretory 
pathway from the ER to the cell surface, where the 
virus would be assembled if cells also expressed 
the other viral components. The viral protein 
contains a mutation that allows export from the 
ER only at a low temperature. Thus, at the high 
temperature shown, the fusion protein labels the ER. 




(B) As the temperature is lowered, the GFP fusion 
protein rapidly accumulates at ER exit sites. 




(C) The fusion protein then moves to the Golgi apparatus. 
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(D) Finally, the fusion protein is delivered to the plasma 
membrane where the delivered protein diffuses into the 
plasma membrane (the arrows bracket a fusion event). 
From such studies the kinetics of each step in the 
pathway can be determined. 
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Figure 13-4 Electron micrograph of 
clathrin-coated, COPI-coated, and COPII- 
coated vesicles. All are shown in electron 
micrographs at the same scale. 
(A) Clathrin-coated vesicles. (B) Golgi 
cisternae {arrows) from a eel I -free system 
in which COPI-coated vesicles bud in the 
test tube. (C) COPII-coated vesicles. 
(A and B, from L Orci, B. Glick and 
J. Rothman, Cell 46:1 71 -1 84, 1 986. 
With permission from Elsevier; C, courtesy 
of Charles Barlowe and Lelio Orci.) 
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each specialized for a different transport step, and the COPI- and COPII-coated 
vesicles may be similarly diverse. 



The Assembly of a Clathrin Coat Drives Vesicle Formation 

Clathrin-coated vesicles, the first coated vesicles to be identified, transport 
material from the plasma membrane and between endosomal and Golgi com- 
partments. COPI-coated vesicles and COPII-coated vesicles transport material 
early in the secretory pathway: COPII-coated vesicles bud from the ER, and 
COPI-coated vesicles bud from Golgi compartments (see Figure 13-5). We dis- 
cuss clathrin-coated vesicles first, as they provide a good example of how vesi- 
cles form. 

The major protein component of clathrin-coated vesicles is clathrin itself. 
Each clathrin subunit consists of three large and three small polypeptide chains 
that together form a three-legged structure called a triskelion, Clathrin triske- 
lions assemble into a basketlike convex framework of hexagons and pentagons 
to form coated pits on the cytosolic surface of membranes (Figure 13-6). Under 
appropriate conditions, isolated triskelions spontaneously self-assemble into 
typical polyhedral cages in a test tube, even in the absence of the membrane 
vesicles that these baskets normally enclose (Figure 13-7). Thus, the clathrin 
triskelions determine the geometry of the clathrin cage. 

Adaptor proteins, another major coat component in clathrin-coated vesi- 
cles, form a discrete second layer of the coat, positioned between the clathrin 
cage and the membrane. They bind the clathrin coat to the membrane and trap 
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Figure 1 3-5 Use of different coats in 
vesicular traffic Different coat proteins 
select different cargo and shape the 
transport vesicles that mediate the various 
steps in the biosynthetic- secretory and 
endocytic pathways. When the same coats 
function in different places in the cell, they 
usually incorporate different coat protein 
subunits that modify their properties (not 
shown). Many differentiated cells have 
additional pathways beside those shown 
here, including a sorting pathway from the 
trans Golgi network to the apical surface 
of epithelial cells and a specialized 
recycling pathway for proteins of synaptic 
vesicles in the synapses of neurons. 
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Figure 13-6 Clathrin-coated pits and vesicles. This rapid-freeze, deep-etch 
electron micrograph shows numerous clathrin-coated pits and vesicles on 
the inner surface of the plasma membrane of cultured fibroblasts. The cells 
were rapidly frozen in liquid helium, fractured, and deep-etched to expose 
the cytoplasmic surface of the plasma membrane. (From J. Heuser, J. Cell Bioi. 
84:560-583, 1980. With permission from The Rockefeller University Press.) 



various transmembrane proteins, including transmembrane receptors that cap- 
ture soluble cargo molecules inside the vesicle — so-called cargo receptors. In this 
way, a selected set of transmembrane proteins, together with the soluble pro- 
teins that interact with them, are packaged into each newly formed clathrin- 
coated transport vesicle (Figure 13-8). 

There are several types of adaptor proteins. The best characterized have four 
different protein subunits; others are single-chain proteins. Each type of adap- 
tor protein is specific for a different set of cargo receptors and its use leads to the 
formation of distinct clathrin-coated vesicles. Clathrin-coated vesicles budding 
from different membranes use different adaptor proteins and thus package dif- 
ferent receptors and cargo molecules. 

The sequential assembly of adaptor complexes and the clathrin coat on the 
cytosolic surface of the membrane generate forces that result in the formation of 
a clathrin-coated vesicle. Lateral interactions between adaptor complexes and 
between clathrin molecules aid in forming the vesicle. 
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Not All Coats Form Basketlike Structures 

Not all coats are as regular and universal as the examples of clathrin or COP 
coats suggest. Some coats maybe better described as specialized protein assem- 
blies that form patches dedicated to specific cargo proteins. An example is a coat 
called retromer, which assembles on endosomes and forms vesicles that return 
acid hydrolase receptors, such as the mannose-6-phosphate receptor, to the Golgi 
apparatus (Figure 13-9). We discuss later the important role of these receptors 
in delivering enzymes into new lysosomes. 
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Figure 13-7 The structure of a clathrin coat <TATT> (A) Electron micrographs of clathrin triskelions shadowed with platinum. 
Each triskelion is composed of three clathrin heavy chains and three clathrin light chains as shown in (B). (C and D) A cryoelectron 
micrograph taken of a clathrin coat composed of 36 triskelions organized in a network of 1 2 pentagons and 6 hexagons, with 
heavy chains (C) and light chains (D) highlighted. The interwoven legs of the clathrin triskelions form an outer shell into which the 
N-terminal domains of the triskelions protrude to form an inner layer visible through the openings. It is this inner layer that 
contacts the adaptor proteins shown in Figure 13-8. Although the coat shown is too small to enclose a membrane vesicle, the 
clathrin coats on vesicles are constructed in a similar way, from 1 2 pentagons and a larger number of hexagons, resembling the 
architecture of a soccer ball. (A, from E. Ungewickell and D. Branton, Nature 289:420-422, 1 981 ; C and D, from A. Fotin et al., 
Nature 432:573-579, 2004. All with permission from Macmillan Publishers Ltd.) 
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Retromer is a multiprotein complex that assembles into a coat on endoso- 
mal membranes only when: 

it can bind to the cytoplasmic tails of the cargo receptors, 
it can interact directly with a curved phospholipid bilayer, and 
it can bind to a specific phosphorylated phosphatidylinositol lipid (a phos- 
phoinositide), which acts as an endosomal marker, as we discuss next. 
Because these three requirements must be met simultaneously, retromer is 
thought to act as a coincidence detector and only assemble at the right time and 
place. Upon binding as a dimer, it stabilizes the membrane curvature, which 
makes the binding of additional retromers in its proximity more likely. The coop- 
erative assembly of retromer then leads to the formation and budding of a trans- 
port vesicle, which delivers its cargo to the Golgi apparatus. 

The adaptor proteins found in clathrin coats also bind to phosphoinositides, 
which not only have a major role in directing when and where coats assemble in 
the cell, but are used much more widely as molecular markers of compartment 
identity. This helps to control membrane trafficking events, as we discuss next. 
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Figure 1 3-8 The assembly and 
disassembly of a clathrin coat. The 

assembly of the coat introduces 
curvature into the membrane, which 
leads in turn to the formation of 
uniformly sized coated buds. The adaptor 
proteins bind clathrin triskelions and 
membrane-bound cargo receptors, 
thereby mediating the selective 
recruitment of both membrane and 
cargo molecules into the vesicle. The 
clathrin coat is rapidly lost shortly after 
the vesicle forms. 
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Figure 13-9 A model for retromer 
assembly on endosomal membranes. The 

four retromer subunits, SNX1, VPS29, 
VPS35, and VPS26, form coated domains on 
endosome membranes that capture cargo 
molecules, including transmembrane 
proteins such as acid hydrolase receptors, 
into vesicles that return to the trans Golgi 
network. VPS35 binds to the cytoplasmic 
tails of the transmembrane cargo proteins. 
SNX1 protein contains different protein 
modules: a PX domain that binds to the 
phosphorylated phosphatidylinositol PI(3)P, 
and a BAR domain that mediates 
dimerization and attachment to highly 
curved membranes. Both PX and BAR 
domains are protein modules that are 
found in many proteins, where they carry 
out similar functions. With the exception of 
the PI(3)P, which is enlarged for visibility, 
the membrane and other components are 
drawn approximately to scale. (Adapted 
from J.S. Bonifacino and R. Rojas, Nat Rev. 
MoL Cell Biol. 7:568-579, 2006. With 
permission from Macmillan Publishers Ltd.) 
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Phosphoinositides Mark Organelles and Membrane Domains 

Although inositol phospholipids typically comprise less than 10% of the total 
phospholipids in a membrane, they have important regulatory functions. They 
can undergo rapid cycles of phosphorylation and dephosphorylation at the 3', 
4', and 5' positions of their inositol sugar head groups to produce various types 
of phosphoinositides (PIPs). The interconversion of phosphatidylinositol (PI) 
and PIPs is highly compartmentalized: different organelles in the endocytic and 
biosynthetic-secretory pathways have distinct sets of PI and PIP kinases and PIP 
phosphatases (Figure 13-10). The distribution, regulation, and local balance of 
these enzymes determine the steady-state distribution of each PIP species. As a 
consequence, the distribution of PIPs varies from organelle to organelle, and 
often within a continuous membrane from one region to another, thereby defin- 
ing specialized membrane domains. 

Many proteins involved at different steps in vesicular transport contain 
domains that bind with high specificity to the head groups of particular PIPs, 
distinguishing one phosphorylated form from another. Local control of the PI 
and PIP kinases and PIP phosphatases can therefore be used to rapidly control 
the binding of proteins to a membrane or membrane domain. The production 
of a particular type of PIP recruits proteins containing matching PIP-binding 
domains. The PIP-binding proteins then help regulate vesicle formation and 
other steps in membrane transport (Figure 13-11). The same strategy is widely 
used to recruit specific intracellular signaling proteins to the plasma membrane 
in response to extracellular signals (discussed in Chapter 15). 



Figure 13-10 Phosphatidylinositol (PI) 
and phosphoinositides (PIPs). (A, B) The 

structure of PI shows the free hydroxyl 
groups in the inositol sugar that can in 
principle be modified. (C) Phosphorylation 
of one, two, or three of the hydroxyl 
groups on PI by PI and PIP kinases 
produces a variety of PIP species. They are 
named according to the ring position (in 
parentheses) and the number of 
phosphate groups (subscript) added to PI. 
PI(3,4)P 2 is shown. (D) Animal cells have 
several PI and PIP kinases and a similar 
number of PIP phosphatases, which are 
localized to different organelles, where 
they are regulated to catalyze the 
production of particular PIPs. The red and 
green arrows show the kinase and 
phosphatase reactions, respectively. 
(E, F) Phosphoinositide head groups are 
recognized by protein domains that 
discriminate between the different forms. 
In this way, selected groups of proteins 
containing such domains are recruited to 
regions of membrane in which these 
phosphoinositides are present. PI(3)P and 
PI(4,5)P 2 are shown. (D, modified from 
M.A. de Matteis and A. Godi, Nat Cell Biol. 
6:487-492, 2004. With permission from 
Macmillan Publishers Ltd.) 



Cytoplasmic Proteins Regulate the Pinching-Off and Uncoating of 
Coated Vesicles 

As a clathrin-coated bud grows, soluble cytoplasmic proteins, including dynamin, 
assemble as a ring around the neck of each bud (Figure 13-12). Dynamin contains 
a PI(4,5)P 2 -binding domain, which tethers the protein to the membrane, and a 
GTPase domain, which regulates the rate at which vesicles pinch off from the 
membrane. The pinching-off process brings the two noncytosolic leaflets of the 
membrane into close proximity and fuses them, sealing off the forming vesicle. To 
perform this task, dynamin recruits other proteins to the neck of the budding vesi- 
cle, which, together with dynamin, help to bend the patch of membrane— by 
directly distorting the bilayer structure, by changing its lipid composition through 
the recruitment of lipid-modifying enzymes, or by both mechanisms. 

Once released from the membrane, the vesicle rapidly loses its clathrin coat. 
A PIP phosphatase that is co-packaged into clathrin-coated vesicles depletes 
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PI(4,5)P 2 from the membrane, which weakens the binding of the adaptor pro- 
teins. In addition, an Hsp70 chaperone protein functions as an uncoating 
ATPase, using the energy of ATP hydrolysis to peel off the clathrin coat. Auxillin, 
another vesicle protein, is believed to activate the ATPase. Because the coated 
bud persists much longer than the coat on the vesicle, additional control mech- 
anisms must somehow prevent the clathrin from being removed before it has 
formed a complete vesicle (discussed below). 



Figure 13-1 1 The intracellular location of 
phosphoinositides. Different types of PIPs 
are located in different membranes and 
membrane domains, where they are often 
associated with specific vesicular transport 
events. The membrane of secretory 
vesicles, for example, contains PI(4)P. When 
the vesicles fuse with the plasma 
membrane, a PI 5-kinase that is localized 
there converts the PI(4)P into PI(4,5)P 2 .The 
PI(4,5)P 2 , in turn, helps recruit adaptor 
proteins, which initiate the formation of a 
clathrin-coated pit, as the first step in 
clathrin-mediated endocyosis. Once the 
clathrin-coated vesicle buds off from the 
plasma membrane, a PI(5)P phosphatase 
hydrolyzes PI(4,5)P 2 , which weakens the 
binding of the adaptor proteins, 
promoting vesicle uncoating. We discuss 
phagocytosis and the distinction between 
regulated and constitutive exocytosis later 
in the chapter. (Modified from 
MA de Matteis and A. Godi, Nat. Cell Biol. 
6:487-492, 2004. With permission from 
Macmillan Publishers Ltd.) 



Monomeric GTPases Control Coat Assembly 

TO balance the vesicular traffic to and from a compartment, coat proteins must 
assemble only when and where they are needed, While local production of PIPs 
plays a major part in regulating the assembly of clathrin coats on the plasma 
membrane and Golgi apparatus, cells have additional ways of regulating coat 
formation. Coat-recruitment Gl Pases, for example, control the assembly of 
clathrin coats on endosomes and the CO PI and COPII coats on Golgi and EH 
membranes. 
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Figure 13-1 2 The role of dynamin in 
pinching off clathrin-coated vesicles. 

{A) The dynamin assembles into a ring 
around the neck of the forming bud. The 
dynamin ring is thought to recruit other 
proteins to the vesicle neck, which, 
together with dynamin, destabilize the 
interacting lipid bilayers so that the 
noncytoplasmic leaflets flow together. 
The newly formed vesicle then pinches 
off from the membrane. Specific 
mutations in dynamin can either enhance 
or block the pinching-off process. 
(B) Dynamin was discovered as the 
protein defective in the shibire mutant of 
Drosophila. These mutant flies become 
paralyzed because clathrin-mediated 
endocytosis stops, and the synaptic 
vesicle membrane fails to recycle, 
blocking neurotransmitter release. 
Deeply invaginated clathrin-coated pits 
form in the fly's nerve cells, with a ring of 
mutant dynamin assembled around the 
neck, as shown in this thin-section 
electron micrograph. The pinching-off 
process fails because membrane fusion 
does not take place. (B, from J.H. Koenig 
and K. Ikeda, J. Neurosci. 9:3844-3860, 
1 989. With permission from the Society of 
Neuroscience.) 
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Many steps in vesicular transport depend on a variety of GTP-binding pro- 
teins that control both the spatial and temporal aspects of membrane exchange. 
As discussed in Chapter 3, GTP-binding proteins regulate most processes in 
eucaryotic cells. They act as molecular switches, which flip between an active 
state with GTP bound and an inactive state with GDP bound. Two classes of pro- 
teins regulate the flipping: guanine-nucleotide-exchange factors (GEFs) activate 
the proteins by catalyzing the exchange of GDP for GTP, and GTPase-activating 
proteins (GAPs) inactivate the proteins by triggering the hydrolysis of the bound 
GTP to GDP (see Figure 3-71) Although both monomeric GTP-binding proteins 
(monomeric GTPases) and trimeric GTP-binding proteins (G proteins) have 
important roles in vesicular transport, the roles of the monomeric GTPases are 
better understood, and we focus on them here. 

Coat-recruitment GTPases are members of a family of monomeric GTPases. 
They include the Arf proteins, which are responsible for both COPI coat assembly 
and clathrin coat assembly at Golgi membranes, and the Sari protein, which is 
responsible for COPII coat assembly at the ER membrane. Coat-recruitment 
GTPases are usually found in high concentration in the cytosol in an inactive, 
GDP-bound state. When a COPII-coated vesicle is to bud from the ER membrane, 
a specific Sarl-GEF embedded in the ER membrane binds to cytosolic Sari, caus- 
ing the Sari to release its GDP and bind GTP in its place. (Recall that GTP is pre- 
sent in much higher concentration in the cytosol than GDP and therefore will 
spontaneously bind after GDP is released.) In its GTP-bound state, the Sari pro- 
tein exposes an amphiphilic helix, which inserts into the cytoplasmic leaflet of the 
lipid bilayer of the ER membrane. The tightly bound Sari now recruits coat pro- 
tein subunits to the ER membrane to initiate budding (Figure 13-13). Other GEFs 
and coat-recruitment GTPases operate in a similar way on other membranes. 

Some coat protein subunits also interact, albeit more weakly, with the head 
groups of certain lipid molecules, in particular phosphatidic acid and phospho- 
inositides, as well as with the cytoplasmic tail of some of the transmembrane 
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Figure 1 3-1 3 Formation of a COPII- 
coated vesicle. (A) The Sari protein is a 
coat-recruitment GTPase. Inactive, 
soluble Sari -GDP binds to a Sar1-GEF in 
the ER membrane, causing the Sari to 
release its GDP and bind GTP. The 
Sar1-GEF (also called Sec12) was 
discovered as one of the temperature- 
sensitive mutants that block ER to Golgi 
transport (see Panel 13-1). A GTP- 
triggered conformational change in Sari 
exposes an amphipathic helix, which 
inserts into the cytoplasmic leaflet of the 
ER membrane, initiating a process of 
membrane curvature. (B) GTP-bound Sari 
binds to a complex of two COPII coat 
proteins, called Sec23/24.The crystal 
structure of the Sec23/24 proteins 
together with Sari predicts how the 
amphiphilic helix of Sari anchors the 
complex to the membrane. Sec24 has 
several different binding sites for 
cytoplasmic tails of cargo receptors. The 
entire surface of the complex that 
attaches to the membrane is gently 
curved to match the diameter of 
COPII-coated vesicles. (C) A complex of 
two additional COPII coat proteins, called 
Seel 3/31 , forms the outer shell of the 
coat. Like clathrin, Seel 3/31 alone can 
assemble into symmetrical cages with 
appropriate dimensions to enclose a 
COPII-coated vesicle. (D) Membrane- 
bound, active Sar1-GTP recruits COPII 
subunits to the membrane. This causes 
the membrane to form a bud, which 
includes selected transmembrane 
proteins. A subsequent membrane-fusion 
event pinches off the coated vesicle. 

Other coated vesicles are thought to 
form in a similar way. The coat- 
recruitment GTPase Arf also contains a 
regulated amphiphilic helix but in 
contrast to Sari, the helix also has a 
covalently attached fatty acid chain that 
contributes to its hydrophobicity. As in 
Sari, the regulated amphiphilic helix is 
retracted in the GDP-bound state and 
exposed in the GTP-bound state. As we 
discuss later, Rob GTPases regulate their 
membrane attachment in a similar 
manner (see Figure 13-14). (C, modified 
from S.M. Stagg et al., Nature 
439:234-238, 2006. With permission from 
Macmillan Publishers Ltd.) 
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proteins they recruit into the bud. Together, these protein-lipid and 
protein-protein and interactions tightly bind the coat to the membrane, causing 
the membrane to deform into a bud, which then pinches off as a coated vesicle. 
The coat- recruitment GTPases also have a role in coat disassembly. The 
hydrolysis of bound GTP to GDP causes the GTPase to change its conformation 
so that its hydrophobic tail pops out of the membrane, causing the vesicle's coat 
to disassemble. Although it is not known what triggers the GTP hydrolysis pro- 
cess, it has been proposed that the GTPases work like timers, which hydrolyze 
GTP at a slow but predictable rate. COPII coats, for example, accelerate GTP 
hydrolysis by Sari, thereby triggering coat disassembly at a certain time after 
coat assembly has begun. Thus, a fully formed vesicle will be produced only 
when bud formation occurs faster than the timed disassembly process; other- 
wise, disassembly will be triggered before a vesicle pinches off, and the process 
will have to start again at a more appropriate time and place. 

Not All Transport Vesicles Are Spherical 

Although vesicle budding at various locations in the cell has many similarities, 
each cell membrane poses its own special challenges. The plasma membrane, 
for example, is comparatively flat and stiff, owing to its cholesterol-rich lipid 
composition and underlying cortical cytoskeleton. Thus, clathrin coats have to 
produce considerable force to introduce curvature, especially at the neck of the 
bud where dynamin and its associated proteins facilitate the sharp bends 
required for the pinching-off process. In contrast, vesicle budding from many 
intracellular membranes occurs preferentially at regions where the membranes 
are already curved, such as the rims of the Golgi cisternae or ends of membrane 
tubules. In these places, the primary function of the coats is to capture the 
appropriate cargo proteins rather than to deform the membrane. 

Transport vesicles occur in various sizes and shapes. When living cells are 
genetically engineered to express fluorescent membrane components, the 
endosomes and trans Golgi network are seen in a fluorescence microscope to 
continually send out long tubules. Coat proteins assemble onto the tubules and 
help recruit specific cargo. The tubules then either regress or pinch off with the 
help of dynamin-like proteins to form transport vesicles. Depending on the rel- 
ative efficiencies of tubule formation and the pinching-off process, vesicles of 
different sixes and shapes are produced. Thus, vesicular transport does not nec- 
essarily occur only through uniformly sized spherical vesicles, but can involve 
larger portions of a donor organelle. 

Tubules have a much higher surface-to-volume ratio than the organelles 
from which they form. They are therefore relatively enriched in membrane pro- 
teins compared with soluble cargo proteins. As we discuss later, this property of 
tubules is used for sorting proteins in endosomes. 

Rab Proteins Guide Vesicle Targeting 

To ensure an orderly flow of vesicular traffic, transport vesicles must be highly 
selective in recognizing the correct target membrane with which to fuse. 
Because of the diversity and crowding of membrane systems in the cytoplasm, a 
vesicle is likely to encounter many potential target membranes before it finds 
the correct one. Specificity in targeting is ensured because all transport vesicles 
display surface markers that identify them according to their origin and type of 
cargo, and target membranes display complementary receptors that recognize 
the appropriate markers. This crucial process depends on two types of proteins: 
Rah proteins direct the vesicle to specific spots on the correct target membrane, 
and then SNARE proteins mediate the fusion of the lipid bilayers. 

Rab proteins play a central part in the specificity of vesicular transport. Like 
the coat-recruitment GTPases discussed earlier {see Figure 13-13], tiiey also are 
monomerie GTPases. With over 60 known members, they are the largest subfam- 
ily of such GTPases. tiach Rab protein is associated with one or more membrane- 
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Table 13-1 Subcellular Locations of Some Rab Proteins 



PROTEIN 


ORGANELLE 


Rab1 


ERand Golgi complex 


Rab2 


cis Golgi network 


Rab3A 


synaptic vesicles, secretory granules 


Rab4/Rab11 


recycling endosomes 


RabSA 


plasma membrane, clathrin-coated vesicles, early endosomes 


Rab5C 


early endosomes 


Rab6 


medial and trans Golgi cisternae 


Rab7 


late endosomes 


Rab8 


early endosomes 


Rab9 


late endosomes, trans Golgi network 



enclosed organelles of the biosynthetic-secretory or endocytic pathways, and 
each of these organelles has at least one Rab protein on its cytosolic surface 
(Table 13-1). Their highly selective distribution on these membrane systems 
makes Rab proteins ideal molecular markers for identifying each membrane 
type and guiding vesicular traffic between them. Rab proteins can function on 
transport vesicles, on target membranes, or both. 

Like the coat-recruitment GTPases, Rab proteins cycle between a membrane 
and the cytosol and regulate the reversible assembly of protein complexes on 
the membrane. In their GDP-bound state they are inactive and bound to 
another protein [Rab-GDP dissociation inhibitor ox GDI) that keeps them solu- 
ble in the cytosol, while in their GTP-bound state they are active and tightly 
associated with the membrane of an organelle or transport vesicle. Membrane 
bound Rab-GEFs activate Rab proteins on both transport vesicle and target 
membranes, as activate Rab molecules are usually required on both sides. Once 
in its GTP-bound state and membrane-bound through a hydrophobic lipid 
anchor, Rab proteins bind to other proteins, called Rab effectors, which facili- 
tate vesicle transport, membrane tethering, and fusion (Figure 13-14). The GTP 
hydrolysis sets the concentration of active Rab and, consequently, the concen- 
tration of its effectors on the membrane. 

In contrast to the highly conserved structure of Rab proteins, the structures 
of Rab effectors vary greatly. Some Rab effectors, for example, are motor proteins 
that propel vesicles along actin filaments or microtubules to their target mem- 
brane. Others are tethering proteins, some of which have long threadlike 
domains that serve as "fishing lines" that can extend to link two membranes 
more than 200 nm apart; other tethering proteins are large protein complexes 
that link two membranes that are closer together. Rab effectors can also interact 
with SNAREs, thus coupling membrane tethering to fusion. 
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Figure 13-14 Tethering of a vesicle to a 
target membrane. Rab effector proteins 
interact via active Rab proteins (Rab-GTPs, 
yellow) located on the target membrane, 
vesicle membrane, or both, to establish 
the first connection between the two 
membranes that are going to fuse. In the 
example shown here, the Rab effector is a 
filamentous tethering protein {green). 
Next, SNARE proteins on the two 
membranes {red and blue) pair to dock the 
vesicle to the target membrane and 
catalyze the fusion of the two apposed 
lipid bilayers. 
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The same Rab proteins can bind to multiple effectors. The assembly of Rab 
proteins and their effectors on a membrane is cooperative and results in the for- 
mation of large, specialized membrane patches. Rab5, for example, assembles 
on endosomal membranes and mediates the capture of clathrin-coated vesicles 
arriving from the plasma membrane. It recruits tethering proteins to catch the 
incoming vesicles. Rab5-GEF initially recruits Rab5 to the endosome and con- 
verts it to its active GTP-bound form, which becomes anchored to the mem- 
brane (Figure 13-15). Active Rab5 recruits more Rab5-GEF to the endosome, 
thereby stimulating the recruitment of more Rab5 to the same site. In addition, 
active Rab5 activates a PI 3-kinase, which locally converts PI to PI(3)F! which in 
turn binds some of the Rab effectors. This type of positive feedback greatly 
enhances the assembly process and helps to establish functionally distinct 
membrane domains within a continuous membrane. 

The endosomal membrane provides a striking example of how different Rab 
proteins and their effectors help to create multiple specialized membrane 
domains, each fulfilling a particular set of functions. Thus, while the Rab5 
domain receives incoming vesicles from the plasma membrane, distinct Rabll 
and Rab4 domains in the same membrane are thought to organize the budding 
of recycling vesicles that return proteins from the endosome to the plasma 
membrane. Once assembled, these domains coexist in the same membrane over 
extended periods. 



Figure 1 3-1 5 The formation of a Rab5 
domain on the endosome membrane. 

A Rab5-specific GEF in the endosome 
membrane binds a Rab5 protein and 
induces it to exchange GDP for GTP. GTP 
binding alters the conformation of the 
Rab protein, exposing an amphiphilic 
helix and a covalently attached lipid 
group, which together anchor the 
Rab5-GTP to the membrane. Active Rab5 
activates PI 3-kinase, which converts PI 
into PI(3)P. PI(3)P and active Rab5 
together bind a variety of Rab effector 
proteins that contain PI(3)P-binding sites, 
including filamentous tethering proteins 
that catch incoming clathrin-coated 
vesicles from the plasma membrane. 
Active Rab5 also recruits more Rab5-GEF, 
further enhancing the assembly of the 
Rab5 domain on the membrane. 

Controlled cycles of GTP hydrolysis and 
GDP-GTP exchange are thought to 
regulate the size and activity of such Rab 
domains dynamically. Unlike SNAREs, 
which are integral membrane proteins, 
the GDP/GTP cycle coupled to the 
membrane/cytosol translocation cycle 
endows the Rab machinery with the 
ability to undergo assembly and 
disassembly on the membrane. For 
example, during transport from early to 
late endosomes, Rab5 can be moved and 
replaced by Rab7, thus committing cargo 
to degradation. (Adapted from M. Zerial 
and H. McBride, Nat. Rev. Mot. Cell Biol. 
2:1 07-1 1 7, 2001 . With permission from 
Macmillan Publishers Ltd.) 



SNAREs Mediate Membrane Fusion 

Once a transport vesicle has been tethered to its target membrane, it unloads its 
cargo by membrane fusion. Membrane fusion requires bringing the lipid bilay- 
ers of two membranes to within 1.5 nm of each other so that they can join. When 
the membranes are in such close apposition, lipids can flow from one bilayer to 
the other. For this close approach, water must be displaced from the hydrophilic 
surface of the membrane — a process that is highly energetically unfavorable. It 
seems likely that specialized fusion proteins that overcome this energy barrier 
catalyze all membrane fusions in cells. We have already discussed the role of 
dynamin in a related task during the pinching-off of clathrin-coated vesicles 
(see Figure 13-12). 

The SNARE proteins (also called SNAREs, for short) catalyze the membrane 
fusion reactions in vesicular transport. They also provide an additional layer of 
specificity in the transport process by helping to ensure that only correctly tar- 
geted vesicles fuse. There are at least 35 different SNAREs in an animal cell, each 
associated with a particular organelle in the biosynthetic-secretory or endocytic 
pathway. These transmembrane proteins exist as complementary sets — with v- 
SNAREs usually found on vesicle membranes and t-SNAREs usually found on 
target membranes (see Figure 13-14). A v- SNARE is a single polypeptide chain, 
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Figure 1 3-1 6 The structure of a trans-SNARE complex. The SNAREs responsible for docking 
synaptic vesicles at the plasma membrane of nerve terminals consist of three proteins. The 
v-SNARE synaptobrevin and the t-SNARE syntaxin are both transmembrane proteins, and each 
contributes one a helix to the complex. The t-SNARE Snap25 is a peripheral membrane protein 
that contributes two a helices to the four-helix bundle. They are connected by a loop (omitted 
from the figure) that lies parallel to the membrane and has fatty acyl chains attached to anchor 
it there. In many other t-SNARES, each a helix is a separate protein anchored in the membrane 
by a transmembrane helix. Trans-SNARE complexes always consist of four tightly intertwined 
a helices, three contributed by a t-SNARE and one by a v-SNARE. The t-SNAREs are composed of 
multiple chains, one of which is always a transmembrane protein and contributes one helix, and 
one or two additional light chains that may or may not be transmembrane proteins and that 
contribute the remaining two helices to the four-helix bundle of the trans-SNARE complex. The 
crystal structure of a stable complex of the four intertwining a helices contributed by these 
proteins is modeled here in the context of the whole proteins. The a helices are shown as rods 
for simplicity. (Adapted from R.B. Sutton et al., Nature 395:347-353, 1 998. With permission from 
Macmillan Publishers Ltd.) 

whereas a t-SNARE is composed of two or three proteins (Figure 13-16). The v- 
SNAREs and t-SNAREs have characteristic helical domains, and when a v- 
SNARE interacts with a t-SNARE, the helical domains of one wrap around the 
helical domains of the other to form a stable four-helix bundle. The resulting 
trans-SNARE complexes lock the two membranes together. 

SNAREs have been best characterized in neurons, where they mediate the 
docking and fusion of synaptic vesicles at the nerve terminal's plasma mem- 
brane in the process of neurotransmitter release. The bacteria that cause tetanus 
and botulism secrete powerful proteolytic neurotoxins that enter specific neu- 
rons and cleave SNARE proteins in the nerve terminals. In this way, the toxins 
block synaptic transmissions, often leading to death. 

It is thought that the trans-SNARE complexes catalyze membrane fusion by 
using the energy that is freed when the interacting helices wrap around each 
other to pull the membrane faces together, simultaneously squeezing out water 
molecules from the interface (Figure 13-17). When liposomes containing puri- 
fied v-SNAREs are mixed with liposomes containing complementary t-SNAREs, 
their membranes fuse, albeit slowly. In the cell, other proteins recruited to the 
fusion site, presumably Rab effectors, cooperate with SNAREs to accelerate 
fusion. 

Fusion does not always follow immediately after v-SNAREs and t-SNAREs 
pair. As we discuss later, in the process of regulated exocytosis, fusion is delayed 
until secretion is triggered by a specific extracellular signal. In this case, a local- 
ized influx of Ca 2+ triggers fusion, presumably by releasing inhibitory proteins 
that prevent the complete zipping-up of the trans-SNARE complexes. 

Figure 1 3-1 7 A model for how SNARE proteins may catalyze membrane 
fusion. Bilayer fusion occurs in multiple steps. A tight pairing between 
v- and t-SNAREs forces lipid bilayers into close apposition and expels water 
molecules from the interface. Lipid molecules in the two interacting 
leaflets of the bilayers then flow between the membranes to form a 
connecting stalk. Lipids of the other two leaflets then contact each other, 
forming a new bilayer, which widens the fusion zone (hemifusion, or half- 
fusion). Rupture of the new bilayer completes the fusion reaction. 
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Rab proteins, which can regulate the availability of SNARE proteins, exert an 
additional layer of control. t-SNAREs in target membranes are often associated 
with inhibitory proteins that must be released before the t-SNARE can function. 
Rab proteins and their effectors trigger the release of such SNARE inhibitory 
proteins. In this way, SNARE proteins are concentrated and activated in the cor- 
rect location on the membrane, where tethering proteins capture incoming vesi- 
cles. Rab proteins thus speed up the process by which appropriate SNARE pro- 
teins in two membranes find each other. 

For vesicular transport to operate normally, transport vesicles must incor- 
porate the appropriate SNARE and Rab proteins. Not surprisingly therefore, 
many transport vesicles will form only if they incorporate the appropriate com- 
plement of SNARE and Rab proteins in their membrane. How this crucial con- 
trol process operates during vesicle budding remains a mystery 



Interacting SNAREs Need to Be Pried Apart Before They Can 
Function Again 

Most SNARE proteins in cells have already participated in multiple rounds of 
vesicular transport and are sometimes present in a membrane as stable com- 
plexes with partner SNAREs. The complexes have to disassemble before the 
SNAREs can mediate new rounds of transport. A crucial protein called NSF cycles 
between membranes and the cytosol and catalyzes the disassembly process. It is 
an ATPase that uses the energy of ATP hydrolysis to unravel the intimate interac- 
tions between the helical domains of paired SNARE proteins (Figure 13-18). The 
requirement for NSF-mediated reactivation of SNAREs by SNARE complex disas- 
sembly helps prevent membranes from fusing indiscriminately: if the t-SNAREs 
in a target membrane were always active, any membrane containing an appro- 
priate v- SNARE might fuse whenever the two membranes made contact. It is not 
known how the activity of NSF is controlled so that the SNARE machinery is acti- 
vated at the right time and place, but Rab effectors are likely candidates to play 
a part in this process. 



Viral Fusion Proteins and SNAREs May Use Similar Fusion 
Mechanisms 

Membrane fusion is important in other processes beside vesicular transport. The 
plasma membranes of a sperm and an egg fuse during fertilization (discussed in 
Chapter 21), and myoblasts fuse with one another during the development of 
multinucleate muscle fibers (discussed in Chapter 22). Likewise, mitochondria 
fuse and fragment in a dynamic way (discussed in Chapter 14). All cell membrane 
fusions require special proteins and are tightly regulated to ensure that only 
appropriate membranes fuse. The controls are crucial for maintaining both the 
identity of cells and the individuality of each type of intracellular compartment. 
The membrane fusions catalyzed by viral fusion proteins are the best under- 
stood. These proteins have a crucial role in permitting the entry of enveloped 
viruses (which have a lipid-bilayer-based membrane coat) into the cells that 
they infect (discussed in Chapters 5 and 24). For example, viruses such as the 
human immunodeficiency virus (HIV), which causes AIDS, bind to cell-surface 
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Figure 13-18 Dissociation of SNARE 
pairs by NSF after a membrane fusion 
cycle. After a v-SNARE and t-SNARE have 
mediated the fusion of a transport vesicle 
on a target membrane, the NSF binds to 
the SNARE complex and, with the help of 
two accessory proteins, hydrolyzes ATP to 
pry the SNAREs apart. 
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Figure 13-1 9 The entry of enveloped viruses into cells. (A) Electron micrographs showing how HIV enters a cell by fusing 
its membrane with the plasma membrane of the cell. (B) A model for the fusion process. HIV binds first to the CD4 protein 
on the surface of the target cell. The viral gpl 20 protein, which is bound to the HIV fusion protein, mediates this 
interaction. A second cell-surface protein on the target cell, which normally serves as a receptor for chemokines (discussed 
in Chapters 24 and 25), now interacts with gpl 20. This interaction releases the HIV fusion protein from gpl 20, allowing the 
previously buried hydrophobic fusion peptide to insert into the plasma membrane of the target cell. The fusion protein, 
which is a trimer (not shown), thus becomes transiently anchored as an integral membrane protein in the two opposing 
membranes. The fusion protein then spontaneously rearranges, collapsing into a tightly packed six-helix bundle. The 
energy released by this rearrangement in multiple copies of the fusion protein pulls the two membranes together, 
overcoming the high activation energy barrier that normally prevents membrane fusion. Thus, like a mouse trap, the HIV 
fusion protein contains a reservoir of potential energy, which is released and harnessed to do mechanical work. (A, from 
B.S. Stein et al., Cell 49:659-668, 1987. With permission from Elsevier; B, adapted from a drawing by Wayne Hendrickson.) 

receptors and then fuse with the plasma membrane of the target cell (Figure 
13-19). This fusion event allows the viral nucleic acid inside the nucleocapsid to 
enter the cytosol, where it replicates. Other viruses, such as the influenza virus, 
first enter the cell by receptor-mediated endocytosis (discussed later) and are 
delivered to endosomes; the low pH in endosomes activates a fusion protein in 
the viral envelope that catalyzes the fusion of the viral and endosomal mem- 
branes, releasing the viral nucleic acid into the cytosol. 

The three-dimensional structures of the fusion proteins of HIV and influenza 
virus provide valuable insights into the molecular mechanism of the membrane 
fusion catalyzed by these proteins. Exposure of the HIV fusion protein to recep- 
tors on the target cell membrane, or exposure of the influenza fusion protein to 
low pH, uncovers previously buried hydrophobic regions. These regions, called 
fusion peptides, then insert directly into the lipid bilayer of the target cell mem- 
brane. In this way, the fusion proteins transiently become integral membrane 
proteins in two separate lipid bilayers. Structural rearrangements in the fusion 
proteins then bring the two lipid bilayers into very close apposition and destabi- 
lize them so that the bilayers fuse (see Figure 13-19). Thus, viral fusion proteins 
and SNAREs promote lipid bilayer fusion in similar ways. 
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Summary 

Directed and selective transport of particular membrane components from one mem- 
brane-enclosed compartment of a eucaryotic cell to another maintains the differences 
between those compartments. Transport vesicles, which can be spherical, tubular, or 
irregularly shaped, bud from specialized coated regions of the donor membrane. The 
assembly of the coat helps to collect specific membrane and soluble cargo molecules for 
transport and to drive the formation of the vesicle. 

There are various types of coated vesicles. The best characterized are clathrin- 
coated vesicles, which mediate transport from the plasma membrane and the trans 
Golgi network, and COPI- and COPII-coated vesicles, which mediate transport between 
Golgi cisternae and between the ER and the Golgi apparatus, respectively. In clathrin- 
coated vesicles, adaptor proteins link the clathrin to the vesicle membrane and also trap 
specific cargo molecules for packaging into the vesicle. The coat is shed rapidly after 
budding, enabling the vesicle to fuse with its appropriate target membrane. 

Local synthesis of phosphoinositides creates binding sites that trigger coat assem- 
bly and vesicle budding. In addition, monomeric GTPases help regulate various steps in 
vesicular transport, including both vesicle budding and docking. The coat-recruitment 
GTPases, including Sari and the Arf proteins, regulate coat assembly and disassembly. 
A large family ofRab proteins functions as vesicle targeting GTPases. Rab proteins are 
recruited to transport vesicles and target membranes. The assembly and disassembly of 
Rab proteins and their effectors in specialized membrane domains are dynamically 
controlled by GTP binding and hydrolysis. Active Rab proteins recruit Rab effectors, 
such as motor proteins, which transport vesicles on actin filaments or microtubules, 
and filamentous tethering proteins, which help ensure that the vesicles deliver their 
contents only to the appropriate membrane-enclosed compartment. Complementary v- 
SNARE proteins on transport vesicles and t-SNARE proteins on the target membrane 
form stable trans-SNARE complexes, which force the two membranes into close apposi- 
tion so that their lipid bilayers can fuse. 



TRANSPORT FROM THE ER THROUGH THE GOLGI 
APPARATUS 

As discussed in Chapter 12, newly synthesized proteins cross the ER membrane 
from the cytosol to enter the biosynthetic-secretory pathway During their sub- 
sequent transport, from the ER to the Golgi apparatus and from the Golgi appa- 
ratus to the cell surface and elsewhere, these proteins are successively modified 
as they pass through a series of compartments. Transfer from one compartment 
to the next involves a delicate balance between forward and backward (retrieval) 
transport pathways. Some transport vesicles select cargo molecules and move 
them to the next compartment in the pathway while others retrieve escaped 
proteins and return them to a previous compartment where they normally func- 
tion. Thus, the pathway from the ER to the cell surface consists of many sorting 
steps, which continuously select membrane and soluble lumenal proteins for 
packaging and transport— in vesicles or organelle fragments that bud from the 
ER and Golgi apparatus. 

In this section we focus mainly on the Golgi apparatus (also called the Golgi 
complex). It is a major site of carbohydrate synthesis, as well as a sorting and 
dispatching station for products of the ER. The cell makes many polysaccharides 
in the Golgi apparatus, including the pectin and hemicellulose of the cell wall in 
plants and most of the glycosaminoglycans of the extracellular matrix in animals 
(discussed in Chapter 19). The Golgi apparatus also lies on the exit route from 
the ER, and a large proportion of the carbohydrates that it makes are attached as 
oligosaccharide side chains to the many proteins and lipids that the ER sends to 
it. A subset of these oligosaccharide groups serve as tags to direct specific pro- 
teins into vesicles that then transport them to lysosomes. But most proteins and 
lipids, once they have acquired their appropriate oligosaccharides in the Golgi 
apparatus, are recognized in other ways for targeting into the transport vesicles 
going to other destinations. 
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Proteins Leave the ER in COPII-Coated Transport Vesicles 

To initiate their journey along the biosynthetic-secretory pathway, proteins that 
have entered the ER and are destined for the Golgi apparatus or beyond are first 
packaged into small COPII-coated transport vesicles. These vesicles bud from 
specialized regions of the ER called ER exit sites, whose membrane lacks bound 
ribosomes. Most animal cells have ER exit sites dispersed throughout the ER 
network. 

Originally it was thought that all proteins that are not tethered in the ER 
enter transport vesicles by default. It is now clear, however, that entry into vesi- 
cles that leave the ER is usually a selective process. Many membrane proteins are 
actively recruited into such vesicles, where they become concentrated. It is 
thought that these cargo proteins display exit (transport) signals on their cytoso- 
lic surface that components of the COPII coat recognize (Figure 13-20); these 
coat components act as cargo receptors and are recycled back to the ER after 
they have delivered their cargo to the Golgi apparatus. Soluble cargo proteins in 
the ER lumen, by contrast, have exit signals that attach them to transmembrane 
cargo receptors, which in turn bind through exit signals in their cytoplasmic tails 
to components of the COPII coat. At a lower rate, proteins without exit signals 
can also enter transport vesicles, so that even proteins that normally function in 
the ER (so-called ER resident proteins) slowly leak out of the ER and are delivered 
to the Golgi apparatus. Similarly secretory proteins that are made in high con- 
centrations may leave the ER without the help of exit signals or cargo receptors. 

The exit signals that direct soluble proteins out of the ER for transport to the 
Golgi apparatus and beyond are not well understood. Some transmembrane 
proteins that serve as cargo receptors for packaging some secretory proteins into 
COPII-coated vesicles are lectins that bind to oligosaccharides. The ERGIC53 
lectin, for example, binds to mannose and is thought to recognize this sugar on 
two secreted blood-clotting factors (Factor V and Factor VIII), thereby packaging 
the proteins into transport vesicles in the ER. ERGIC53's role in protein transport 
was identified because humans who lack it owing to an inherited mutation have 
lowered serum levels of Factors V and VIII, and they therefore bleed excessively. 



Only Proteins That Are Properly Folded and Assembled Can Leave 
theER 

To exit from the ER, proteins must be properly folded and, if they are subunits of 
multimeric protein complexes, they may need to be completely assembled. Those 
that are misfolded or incompletely assembled remain in the ER, where they are 
bound to chaperone proteins (discussed in Chapter 6), such as BiPox calnexin. 
The chaperones may cover up the exit signals or somehow anchor the proteins in 
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Figure 1 3-20 The recruitment of cargo 
molecules into ER transport vesicles. By 

binding directly or indirectly to the COPII 
coat, membrane and soluble cargo 
proteins, respectively, become 
concentrated in the transport vesicles as 
they leave the ER. Membrane proteins are 
packaged into budding transport vesicles 
through interactions of exit signals on 
their cytosolic tails with the COPII coat. 
Some of the membrane proteins that the 
coat traps function as cargo receptors, 
binding soluble proteins in the lumen 
and helping to package them into 
vesicles. A typical 50-nm transport vesicle 
contains about 200 membrane proteins, 
which can be of many different types. As 
indicated, unfolded or incompletely 
assembled proteins are bound to 
chaperones and retained in the ER 
compartment. 
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Figure 13-21 Retention of incompletely 
assembled antibody molecules in the 

ER. Antibodies are made up of two heavy 
and two light chains (discussed in 
Chapter 25), which assemble in the ER. 
The chaperone BiP is thought to bind to 
all incompletely assembled antibody 
molecules and to cover up an exit signal. 
Thus, only completely assembled 
antibodies leave the ER and are secreted. 
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the ER (Figure 13-21). Such failed proteins are eventually transported back into 
the cytosol, where they are degraded by proteasomes (discussed in Chapters 6 and 
12). This quality-control step prevents the onward transport of misfolded or mis- 
assembled proteins that could potentially interfere with the functions of normal 
proteins. There is a surprising amount of corrective action. More than 90% of the 
newly synthesized subunits of the T cell receptor (discussed in Chapter 25) and of 
the acetylcholine receptor (discussed in Chapter 11), for example, are normally 
degraded without ever reaching the cell surface where they function. Thus, cells 
must make a large excess of many protein molecules to produce a select few that 
fold, assemble, and function properly 

The process of continual degradation of a portion of ER proteins also pro- 
vides an early warning system to alert the immune system when a virus infects 
cells. Using specialized ABC-type transporters, the ER imports peptide frag- 
ments of viral proteins produced by proteases in the proteasome. The foreign 
peptides are loaded onto class I MHC proteins in the ER lumen and then trans- 
ported to the cell surface. T lymphocytes then recognize the peptides as non-self 
antigens and kill the infected cells (discussed in Chapter 25). 

Sometimes, however, there are drawbacks to the stringent quality- control 
mechanism. The predominant mutations that cause cystic fibrosis, a common 
inherited disease, result in the production of a slightly misfolded form of a 
plasma membrane protein important for CI" transport. Although the mutant 
protein would function perfectly normally if it reached the plasma membrane, it 
remains in the ER. This devastating disease thus results not because the muta- 
tion inactivates the protein but because the active protein is discarded before it 
reaches the plasma membrane. 



Vesicular Tubular Clusters Mediate Transport from the ER to the 
Golgi Apparatus 

After transport vesicles have budded from ER exit sites and have shed their coat, 
they begin to fuse with one another. This fusion of membranes from the same 
compartment is called homotypic fusion, to distinguish it from heterotypic 
fusion, in which a membrane from one compartment fuses with the membrane 
of a different compartment. As with heterotypic fusion, homotypic fusion 
requires a set of matching SNAREs. In this case, however, the interaction is sym- 
metrical, with both membranes contributing v-SNAREs and t-SNAREs (Figure 
13-22). 

The structures formed when ER-derived vesicles fuse with one another are 
called vesicular tubular clusters, because they have a convoluted appearance in 
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the electron microscope (Figure 13-23A). These clusters constitute a new com- 
partment that is separate from the ER and lacks many of the proteins that function 
in the ER. They are generated continually and function as transport containers 
that bring material from the ER to the Golgi apparatus. The clusters are relatively 
short-lived because they move quickly along microtubules to the Golgi apparatus, 
with which they fuse to deliver their contents (Figure 13-23B). 

As soon as vesicular tubular clusters form, they begin to bud off transport 
vesicles of their own. Unlike the COPII-coated vesicles that bud from the ER, 
these vesicles are COPI-coated. They carry back to the ER resident proteins that 
have escaped, as well as proteins such as cargo receptors that participated in the 
ER budding reaction and are being returned. This retrieval process demon- 
strates the exquisite control mechanisms that regulate coat assembly reactions. 
The COPI coat assembly begins only seconds after the COPII coats have been 
shed. It remains a mystery how this switch in coat assembly is controlled. 

The retrieval (or retrograde) transport continues as the vesicular tubular 
clusters move towards the Golgi apparatus. Thus, the clusters continuously 
mature, gradually changing their composition as selected proteins are returned 
to the ER. A similar retrieval process continues from the Golgi apparatus, after 
the vesicular tubular clusters have delivered their cargo. 



The Retrieval Pathway to the ER Uses Sorting Signals 

The retrieval pathway for returning escaped proteins back to the ER depends on 
ER retrieval signals. Resident ER membrane proteins, for example, contain sig- 
nals that bind directly to COPI coats and are thus packaged into COPI-coated 



Figure 13-22 Homotypic membrane 
fusion. <AAAA> In step 1, NSF pries apart 
identical pairs of v-SNAREs and t-SNAREs 
in both membranes (see Figure 13-18). In 
steps 2 and 3, the separated matching 
SNAREs on adjacent identical membranes 
interact, which leads to membrane fusion 
and the formation of one continuous 
compartment called a vesicular tubular 
cluster. Subsequently, the compartment 
grows by further homotypic fusion with 
vesicles from the same kind of membrane, 
displaying matching SNAREs. Homotypic 
fusion is not restricted to the formation of 
vesicular tubular clusters; in a similar 
process, endosomes fuse to generate 
larger endosomes. Rab proteins help 
regulate the extent of homotypic fusion 
and hence the size of the compartments 
in a cell (not shown). 



Figure 1 3-23 Vesicular tubular clusters. 

(A) An electron micrograph section of 
vesicular tubular clusters forming from 
the ER membrane. Many of the vesicle- 
like structures seen in the micrograph are 
cross sections of tubules that extend 
above and below the plane of this thin 
section and are interconnected. 

(B) Vesicular tubular clusters move along 
microtubules to carry proteins from the 
ER to the Golgi apparatus. COPI coats 
mediate the budding of vesicles that 
return to the ER from these clusters. As 
indicated, the coats quickly disassemble 
after the vesicles have formed. 

(A, courtesy of William Balch.) 
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transport vesicles for retrograde delivery to the ER. The best-characterized 
retrieval signal of this type consists of two lysines, followed by any two other 
amino acids, at the extreme C-terminal end of the ER membrane protein. It is 
called a KKXX sequence, based on the single-letter amino acid code. 

Soluble ER resident proteins, such as BiP, also contain a short retrieval signal 
at their C-terminal end, but it is different: it consists of a Lys-Asp-Glu-Leu or a 
similar sequence. If this signal (called the KDEL sequence) is removed from BiP 
by genetic engineering, the protein is slowly secreted from the cell. If the signal 
is transferred to a protein that is normally secreted, the protein is now efficiently 
returned to the ER, where it accumulates. 

Unlike the retrieval signals on ER membrane proteins, which can interact 
directly with the COPI coat, soluble ER resident proteins must bind to special- 
ized receptor proteins such as the KDEL receptor— a multipass transmembrane 
protein that binds to the KDEL sequence and packages any protein displaying it 
into COPI-coated retrograde transport vesicles (Figure 13-24). To accomplish 
this task, the KDEL receptor itself must cycle between the ER and the Golgi 
apparatus, and its affinity for the KDEL sequence must differ in these two com- 
partments. The receptor must have a high affinity for the KDEL sequence in 
vesicular tubular clusters and the Golgi apparatus, so as to capture escaped, sol- 
uble ER resident proteins that are present there at low concentration. It must 
have a low affinity for the KDEL sequence in the ER, however, to unload its cargo 
in spite of the very high concentration of KDEL-containing resident proteins in 
the ER. 

How does the affinity of the KDEL receptor change depending on the com- 
partment in which it resides? The answer is not known, but it may be related to 
the different ionic conditions and pH in the different compartments, which are 
regulated by ion transporters in the compartment membrane. As we discuss 
later, pH-sensitive protein-protein interactions form the basis for many of the 
sorting steps in the cell. 

Most membrane proteins that function at the interface between the ER and 
Golgi apparatus, including v- and t-SNAREs and some cargo receptors, enter the 
retrieval pathway back to the ER. Whereas the recycling of some of these pro- 
teins is mediated by signals, as just described, for others no specific signal seems 
to be required. Thus, while retrieval signals increase the efficiency of the 
retrieval process, some proteins randomly enter budding vesicles destined for 
the ER and are returned to the ER at a slower rate. Many Golgi enzymes cycle 
constantly between the ER and the Golgi, but their rate of return to the ER is slow 
enough for most of the protein to be found in the Golgi apparatus. 



Figure 13-24 A model for the retrieval 
of soluble ER resident proteins. ER 

resident proteins that escape from the ER 
are returned by vesicular transport. 
(A) The KDEL receptor present in vesicular 
tubular clusters and the Golgi apparatus 
captures the soluble ER resident proteins 
and carries them in COPI-coated 
transport vesicles back to the ER. Upon 
binding its ligands in this environment, 
the KDEL receptor may change 
conformation, so as to facilitate its 
recruitment into budding COPI-coated 
vesicles. (B) The retrieval of ER proteins 
begins in vesicular tubular clusters and 
continues from all parts of the Golgi 
apparatus. In the environment of the ER, 
the ER resident proteins dissociate from 
the KDEL receptor, which is then returned 
to the Golgi apparatus for reuse. 
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Many Proteins Are Selectively Retained in the Compartments in 
Which They Function 

The KDEL retrieval pathway only partly explains how ER resident proteins are 
maintained in the ER. As expected, cells that express genetically modified ER 
resident proteins, from which the KDEL sequence has been experimentally 
removed, secrete these proteins. But the rate of secretion is much slower than for 
a normal secretory protein. It seems that a mechanism that is independent of 
their KDEL signal anchors ER resident proteins and that only those proteins that 
escape this retention mechanism are captured and returned via the KDEL recep- 
tor. A suggested retention mechanism is that ER resident proteins bind to one 
another, thus forming complexes that are too big to enter transport vesicles effi- 
ciently. Because ER resident proteins are present in the ER at very high concen- 
trations (estimated to be millimolar), relatively low-affinity interactions would 
suffice to tie up most of the proteins in such complexes. 

Aggregation of proteins that function in the same compartment — called kin 
recognition — is a general mechanism that compartments use to organize and 
retain their resident proteins. Golgi enzymes that function together, for example, 
also bind to each other and are thereby restrained from entering transport vesi- 
cles leaving the Golgi apparatus. 



The Golgi Apparatus Consists of an Ordered Series of 
Compartments 

Because of its large and regular structure, the Golgi apparatus was one of the first 
organelles described by early light microscopists. It consists of a collection of 
flattened, membrane-enclosed compartments, called cisternae, that somewhat 
resemble a stack of pita breads. Each Golgi stack usually consists of four to six 
cisternae (Figure 13-25), although some unicellular flagellates can have up to 
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Figure 1 3-25 The Golgi apparatus. 

(A) Three-dimensional reconstruction 
from electron micrographs of the 
Golgi apparatus in a secretory animal 
cell. The cis face of the Golgi stack is 
that closest to the ER. (B) A thin- 
section electron micrograph 
emphasizing the transitional zone 
between the ER and the Golgi 
apparatus in an animal cell. (C) An 
electron micrograph of a Golgi 
apparatus in a plant cell (the green 
alga Chlamydomonas) seen in cross 
section. In plant cells, the Golgi 
apparatus is generally more distinct 
and more clearly separated from 
other intracellular membranes than in 
animal cells. (A, redrawn from 
A. Rambourg and Y. Clermont, Eur. 
J. Cell Biol. 51:1 89-200, 1 990. With 
permission from Wissenschaftliche 
Verlagsgesellschaft; B, courtesy of 
Brij J. Gupta; C, courtesy of George 
Palade.) 




1 p.m 



200 nm 



772 Chapter 1 3: Intracellular Vesicular Traffic 




60. In animal cells, tubular connections between corresponding cisternae link 
many stacks, thus forming a single complex, which is usually located near the 
cell nucleus and close to the centrosome (Figure 13-26A). This localization 
depends on microtubules. If microtubules are experimentally depolymerized, 
the Golgi apparatus reorganizes into individual stacks that are found throughout 
the cytoplasm, adjacent to ER exit sites. Some cells, including most plant cells, 
have hundreds of individual Golgi stacks dispersed throughout the cytoplasm 
(Figure 13-26B). 

During their passage through the Golgi apparatus, transported molecules 
undergo an ordered series of covalent modifications. Each Golgi stack has two 
distinct faces: a cis face (or entry face) and a trans face (or exit face) . Both cis and 
trans faces are closely associated with special compartments, each composed of 
a network of interconnected tubular and cisternal structures: the cis Golgi net- 
work (CGN) and the trans Golgi network (TGN), respectively. The CGN is a col- 
lection of fused vesicular tubular clusters arriving from the ER. Proteins and 
lipids enter the cis Golgi network and exit from the trans Golgi network bound 
for the cell surface or another compartment. Both networks are important for 
protein sorting: proteins entering the CGN can either move onward in the Golgi 
apparatus or be returned to the ER. Similarly, proteins exiting from the TGN 
move onward and are sorted according to their next destination: lysosomes, 
secretory vesicles, or the cell surface. They also can be returned to an earlier 
compartment. 

As described in Chapter 12, a single species of N-linked oligosaccharide is 
attached en bloc to many proteins in the ER and then trimmed while the protein 
is still in the ER. The oligosaccharide intermediates created by the trimming 
reactions serve to help proteins fold and to help transport misfolded proteins to 
the cytosol for degradation. Thus, they play an important role in controlling the 
quality of proteins exiting from the ER. Once these ER functions have been ful- 
filled, the cell is free to redesign the oligosaccharides for new functions. This 
happens in the Golgi apparatus, which generates the heterogeneous oligosac- 
charide structures seen in mature proteins. Upon arrival in the CGN, proteins 
pass through the cis Golgi network, before entering the first of the Golgi pro- 
cessing compartments (the cis Golgi cisternae). They then move to the next 
compartment (the medial cisternae) and finally to the trans cisternae, where gly- 
cosylation is completed. The lumen of the trans cisternae is thought to be con- 
tinuous with the TGN, the place where proteins are segregated into different 
transport packages and dispatched to their final destinations. 

The oligosaccharide processing steps occur in an organized sequence in the 
Golgi stack, with each cisterna containing a characteristic abundance of pro- 
cessing enzymes. Proteins are modified in successive stages as they move from 
cisterna to cisterna across the stack, so that the stack forms a multistage pro- 
cessing unit. This compartmentalization might seem unnecessary, since each 
oligosaccharide processing enzyme can accept a glycoprotein as a substrate 
only after it has been properly processed by the preceding enzyme. Nonetheless, 
it is clear that processing occurs in a spatial as well as a biochemical sequence: 
enzymes catalyzing early processing steps are concentrated in the cisternae 
toward the cis face of the Golgi stack, whereas enzymes catalyzing later process- 
ing steps are concentrated in the cisternae toward the trans face. 



Figure 1 3-26 Localization of the Golgi 
apparatus in animal and plant cells. 

(A) The Golgi apparatus in a cultured 
fibroblast stained with a fluorescent 
antibody that recognizes a Golgi resident 
protein (red). The Golgi apparatus is 
polarized, facing the direction in which 
the cell was crawling before fixation. 

(B) The Golgi apparatus in a plant cell that 
is expressing a fusion protein consisting 
of a resident Golgi enzyme fused to 
green fluorescent protein. The bright 
orange spots (false color) are Golgi stacks. 
(A, courtesy of John Henley and Mark 
McNiven; B, courtesy of Chris Hawes.) 
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Figure 13-27 Molecular compartmentalization of the Golgi apparatus. 

A series of electron micrographs shows the Golgi apparatus (A) unstained, 
(B) stained with osmium, which is preferentially reduced by the cisternae of 
the cis compartment, and (C and D) stained to reveal the location of 
specific enzymes. Nucleoside diphosphatase is found in the trans Golgi 
cisternae (C), while acid phosphatase is found in the trans Golgi network 
(D). Note that usually more than one cisterna is stained. The enzymes are 
therefore thought to be highly enriched rather than precisely localized to a 
specific cisterna. (Courtesy of Daniel S. Friend.) 



Investigators discovered the functional differences between the cis, medial, 
and trans subdivisions of the Golgi apparatus by localizing the enzymes involved 
in processing N- linked oligosaccharides in distinct regions of the organelle, both 
by physical fractionation of the organelle and by labeling the enzymes in elec- 
tron microscope sections with antibodies. The removal of mannose residues and 
the addition of N-acetylglucosamine, for example, were shown to occur in the 
medial compartment, while the addition of galactose and sialic acid was found 
to occur in the trans compartment and the trans Golgi network (Figure 13-27). 
Figure 13-28 summarizes the functional compartmentalization of the Golgi 
apparatus. 

The Golgi apparatus is especially prominent in cells that are specialized for 
secretion of glycoproteins, such as the goblet cells of the intestinal epithelium, 
which secrete large amounts of polysaccharide-rich mucus into the gut (Figure 
13-29). In such cells, unusually large vesicles are found on the trans side of the 
Golgi apparatus, which faces the plasma membrane domain where secretion 
occurs. 



Oligosaccharide Chains Are Processed in the Golgi Apparatus 

Whereas the ER lumen is full of soluble lumenal resident proteins and enzymes, 
the resident proteins in the Golgi apparatus are all membrane bound. The enzy- 
matic reactions in the Golgi apparatus seem to be carried out entirely on its 
membrane surface. All of the Golgi glycosidases and glycosyl transferases are 
single-pass transmembrane proteins, many of which are organized in multi- 
enzyme complexes. Thus, the two synthetic organelles in the biosynthetic- 
secretory pathway, the ER and the Golgi apparatus, are organized in fundamen- 
tally different ways. 
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Figure 13-28 Oligosaccharide 
processing in Golgi compartments. The 
localization of each processing step 
shown was determined by a combination 
of techniques, including biochemical 
subfractionation of the Golgi apparatus 
membranes and electron microscopy 
after staining with antibodies specific for 
some of the processing enzymes. 
Processing enzymes are not restricted to 
a particular cisterna; instead, their 
distribution is graded across the stack- 
such that early-acting enzymes are 
present mostly in the cis Golgi cisternae 
and later-acting enzymes are mostly in 
the trans Golgi cisternae. 
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Figure 1 3-29 A goblet cell of the small intestine. This cell is specialized for 
secreting mucus, a mixture of glycoproteins and proteoglycans synthesized 
in the ER and Golgi apparatus. Like all epithelial cells, goblet cells are highly 
polarized, with the apical domain of their plasma membrane facing the 
lumen of the gut and the basolateral domain facing the basal lamina. The 
Golgi apparatus is also highly polarized, which facilitates the discharge of 
mucus by exocytosis at the apical domain of the plasma membrane. (After 
R.V. Krstic, Illustrated Encyclopedia of Human Histology. New York: 
Springer-Verlag, 1984. With permission from Springer-Verlag.) 

Two broad classes of iV-linked oligosaccharides, the complex oligosaccha- 
rides and the high-mannose oligosaccharides, are attached to mammalian gly- 
coproteins (Figure 13-30). Sometimes, both types are attached (in different 
places) to the same polypeptide chain. 

Complex oligosaccharides are generated when the original A/-linked oligosac- 
charide added in the ER is trimmed and further sugars are added. By contrast, 
high-mannose oligosaccharides are trimmed but have no new sugars added to 
them in the Golgi apparatus. They contain just two A/-acetylglucosamines and 
many mannose residues, often approaching the number originally present in the 
lipid-linked oligosaccharide precursor added in the ER. Complex oligosaccharides 
can contain more than the original two iV-acetylglucosamines, as well as a variable 
number of galactose and sialic acids and, in some cases, fucose. Sialic acid is of 
special importance because it is the only sugar in glycoproteins that bears a neg- 
ative charge. Whether a given oligosaccharide remains high-mannose or is pro- 
cessed depends largely on its position in the protein. If the oligosaccharide is 
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Figure 13-30 The two main classes of asparagine-linked (AMinked) oligosaccharides found in 
mature mammalian glycoproteins. (A) Both complex oligosaccharides and high-mannose 
oligosaccharides share a common core region derived from the original AMinked oligosaccharide 
added in the ER (see Figure 12-50) and typically containing two N-acetylglucosamines (GlcNAc) 
and three mannoses (Man). (B) Each complex oligosaccharide consists of a core region, together 
with a terminal region that contains a variable number of copies of a special trisaccharide unit 
(N-acetylglucosamine-gaiactose-sialic acid) linked to the core mannoses. Frequently, the 
terminal region is truncated and contains only GlcNAc and galactose (Gal) or just GlcNAc. In 
addition, a fucose residue may be added, usually to the core GlcNAc attached to the asparagine 
(Asn). Thus, although the steps of processing and subsequent sugar addition are rigidly 
ordered, complex oligosaccharides can be heterogeneous. Moreover, although the complex 
oligosaccharide shown has three terminal branches, two and four branches are also common, 
depending on the glycoprotein and the cell in which it is made. (C) High-mannose 
oligosaccharides are not trimmed back all the way to the core region and contain additional 
mannose residues. Hybrid oligosaccharides with one Man branch and one GlcNAc and Gal 
branch are also found (not shown). 

The three amino acids indicated in (A) constitute the sequence recognized by the 
oligosaccharyl transferase enzyme that adds the initial oligosaccharide to the protein. 
Ser = serine; Thr = threonine; X = any amino acid, except proline. 
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accessible to the processing enzymes in the Golgi apparatus, it is likely to be con- 
verted to a complex form; if it is inaccessible because its sugars are tightly held to 
the protein's surface, it is likely to remain in a high-mannose form. The process- 
ing that generates complex oligosaccharide chains follows the highly ordered 
pathway shown in Figure 13-31. 

Beyond these commonalities in oligosaccharide processing that are shared 
among most cells, the products of the carbohydrate modifications carried out in 
the Golgi apparatus are highly complex and have given rise to a new field called 
glycobiology. The human genome, for example, encodes hundreds of different 
Golgi glycosyl transferases, which are expressed differently from one cell type to 
another, resulting in a variety of glycosylated forms of a given protein or lipid in 
different cell types and at varying stages of differentiation, depending on the 
spectrum of enzymes expressed by the cell The complexity of modifications is 
not limited to AMinked oligosaccharides but also occurs on O-linked sugars, as 
we discuss next. 



Proteoglycans Are Assembled in the Golgi Apparatus 

In addition to the AMinked oligosaccharide alterations made to proteins as 
they pass through the Golgi cisternae en route from the ER to their final desti- 
nations, many proteins are also modified in other ways. Some proteins have 
sugars added to the hydroxyl groups of selected serine or threonine side 
chains. This O-linked glycosylation (Figure 13-32), like the extension of N- 
linked oligosaccharide chains, is catalyzed by a series of glycosyl transferase 
enzymes that use the sugar nucleotides in the lumen of the Golgi apparatus to 
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Figure 13-31 Oligosaccharide processing in the ER and the Golgi apparatus. The processing pathway is highly ordered, so that each step 
shown depends on the previous one. Step 1 : Processing begins in the ER with the removal of the glucoses from the oligosaccharide initially 
transferred to the protein. Then a mannosidase in the ER membrane removes a specific mannose. Step 2: The remaining steps occur in the 
Golgi stack, where Golgi mannosidase I first removes three more mannoses. Step 3: /V-acetylglucosamine transferase I then adds an 
/V-acetylglucosamine. Step 4: Mannosidase II then removes two additional mannoses. This yields the final core of three mannoses that is 
present in a complex oligosaccharide. At this stage, the bond between the two N-acetylglucosamines in the core becomes resistant to attack 
by a highly specific endoglycosidase (Endo H). Since all later structures in the pathway are also Endo H-resistant, treatment with this enzyme is 
widely used to distinguish complex from high-mannose oligosaccharides. Step 5: Finally, as shown in Figure 1 3-30, additional 
/V-acetylglucosamines, galactoses, and sialic acids are added. These final steps in the synthesis of a complex oligosaccharide occur in the 
cisternal compartments of the Golgi apparatus. Three types of glycosyl transferase enzymes act sequentially, using sugar substrates that have 
been activated by linkage to the indicated nucleotide. The membranes of the Golgi cisternae contain specific carrier proteins that allow each 
sugar nucleotide to enter in exchange for the nucleoside phosphates that are released after the sugar is attached to the protein on the lumenal 
face. 

Note that as a biosynthetic organelle, the Golgi apparatus differs fundamentally from the ER: all sugars are assembled inside the lumen from 
sugar nucleotides. By contrast, in the ER, the /V-linked precursor oligosaccharide is assembled partly in the cytosol and partly in the lumen, and 
all lumenal reactions use dolichol-linked sugars as their substrates (see Figure 12-52). 
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add sugar residues to a protein one at a time. Usually, ^acetylgalactosamine is 
added first, followed by a variable number of additional sugar residues, ranging 
from just a few to 10 or more. 

The Golgi apparatus confers the heaviest O-linked glycosylation of all on 
mucins, the glycoproteins in mucus secretions (see Figure 13-29), and on proteo- 
glycan core proteins, which it modifies to produce proteoglycans. As discussed in 
Chapter 19, this process involves the polymerization of one or more glycosamino- 
glycan chains (long unbranched polymers composed of repeating disaccharide 
units; see Figure 19-58) via a xylose link onto serines on a core protein. Many pro- 
teoglycans are secreted and become components of the extracellular matrix, while 
others remain anchored to the extracellular face of the plasma membrane. Still 
others form a major component of slimy materials, such as the mucus that is 
secreted to form a protective coating on the surface of many epithelia. 

The sugars incorporated into glycosaminoglycans are heavily sulfated in the 
Golgi apparatus immediately after these polymers are made, thus adding a sig- 
nificant portion of their characteristically large negative charge. Some tyrosines 
in proteins also become sulfated shortly before they exit from the Golgi appara- 
tus. In both cases, the sulfation depends on the sulfate donor 3'-phosphoadeno- 
sine-5'-phosphosulfate (PAPS) (Figure 13-33), which is transported from the 
cytosol into the lumen of the trans Golgi network. 



Figure 1 3-32 N- and O-linked 
glycosylation. In each case, only the 
single sugar group that is directly 
attached to the protein chain is shown. 



What Is the Purpose of Glycosylation? 

There is an important difference between the construction of an oligosaccha- 
ride and the synthesis of other macromolecules such as DNA, RNA, and pro- 
tein. Whereas nucleic acids and proteins are copied from a template in a 
repeated series of identical steps using the same enzyme or set of enzymes, 
complex carbohydrates require a different enzyme at each step, each product 
being recognized as the exclusive substrate for the next enzyme in the series. 
The vast abundance of glycoproteins and the complicated pathways that have 
evolved to synthesize them suggest that the oligosaccharides on glycoproteins 
and glycosphingolipids have very important functions. 

AT- linked glycosylation, for example, is prevalent in all eucaryotes, including 
yeasts. AMinked oligosaccharides also occur in a very similar form in archaeal 
cell wall proteins, suggesting that the whole machinery required for their syn- 
thesis is evolutionarily ancient. N- linked glycosylation promotes protein folding 
in two ways. First, it has a direct role in making folding intermediates more sol- 
uble, thereby preventing their aggregation. Second, the sequential modifications 
of the Af-linked oligosaccharide establish a "glyco-code" that marks the progres- 
sion of protein folding and mediates the binding of the protein to chaperones 
(discussed in Chapter 12) and lectins— for example, in guiding ER-to-Golgi 
transport. As we discuss later, lectins also participate in protein sorting in the 
trans Golgi network. 
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Because chains of sugars have limited flexibility, even a small AMinked 
oligosaccharide protruding from the surface of a glycoprotein (Figure 13-34) 
can limit the approach of other macromolecules to the protein surface. In this 
way for example, the presence of oligosaccharides tends to make a glycoprotein 
more resistant to digestion by proteolytic enzymes. It may be that the oligosac- 
charides on cell-surface proteins originally provided an ancestral cell with a pro- 
tective coat. Compared to the rigid bacterial cell wall, a mucus coat has the 
advantage that it leaves the cell with the freedom to change shape and move. 

The sugar chains have since become modified to serve other purposes as 
well. The mucus coat of lung and intestinal cells, for example, protects against 
many pathogens. The recognition of sugar chains by lectins in the extracellular 
space is important in many developmental processes and in cell-cell recogni- 
tion: selectins, for example, are lectins that function in cell-cell adhesion during 
lymphocyte migration, as discussed in Chapter 19. The presence of oligosaccha- 
rides may modify a protein's antigenic properties, making glycosylation an 
important factor in the production of proteins for pharmaceutical purposes. 

Glycosylation can also have important regulatory roles. Signaling through 
the cell-surface signaling receptor Notch, for example, determines the cell's fate 
in development. Notch is a transmembrane protein that is O-glycosylated by 
addition of a single fucose to some serines, threonines, and hydroxyzines. 
Some cell types express an additional glycosyl transferase that adds an AT-acetyl- 
glucosamine to each of these fucoses in the Golgi apparatus. This addition 
changes the specificity of the Notch receptor for the cell-surface signal proteins 
that activate the receptor. 



Figure 13-34 The three-dimensional 
structure of a small N-linked 
oligosaccharide. The structure was 
determined by x-ray crystallographic 
analysis of a glycoprotein. This 
oligosaccharide contains only 6 sugars, 
whereas there are 14 sugars in the 
/V-linked oligosaccharide that is initially 
transferred to proteins in the ER (see 
Figures 12-50 and 12-51). (A) A 
backbone model showing all atoms 
except hydrogens. (B) A space-filling 
model, with the asparagine indicated by 
dark atoms. (B, courtesy of Richard 
Feldmann.) 



Transport Through the Golgi Apparatus May Occur by Vesicular 
Transport or by Cisternal Maturation 

It is still uncertain how the Golgi apparatus achieves and maintains its polarized 
structure and how molecules move from one cisterna to another. Functional evi- 
dence from in vitro transport assays, and the finding of abundant transport vesi- 
cles in the vicinity of Golgi cisternae, initially led to the view that these vesicles 
transport proteins between the cisternae, budding from one cisterna and fusing 
with the next. According to this vesicular transport model, the Golgi apparatus 
is a relatively static structure, with its enzymes held in place, while the molecules 
in transit move through the cisternae in sequence, carried by transport vesicles 
(Figure 13-35A). A retrograde flow of vesicles retrieves escaped ER and Golgi 
proteins and returns them to preceding compartments. Directional flow could 
be achieved because forward-moving cargo molecules are selectively allowed 
access to forward-moving vesicles. Although both forward-moving and retro- 
grade types of vesicles are likely to be COPI-coated, the coats may contain dif- 
ferent adaptor proteins that confer selectivity on the packaging of cargo 
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molecules. An alternative possibility is that the transport vesicles shuttling 
between Golgi cisternae are not directional at all, instead transporting cargo 
material randomly back and forth; directional flow would then occur because of 
the continual input at the cis cisterna and output at the trans cisterna. 

A different hypothesis, called the cisternal maturation model, views the 
Golgi as a dynamic structure in which the cisternae themselves move. The vesic- 
ular tubular clusters that arrive from the ER fuse with one another to become a 
cis Golgi network. According to this model, this network then progressively 
matures to become a cis cisterna, then a medial cisterna, and so on. Thus, at the 
cis face of a Golgi stack, new cis cisternae would continually form and then 
migrate through the stack as they mature (Figure 13-35B). This model is sup- 
ported by microscopic observations demonstrating that large structures such as 
collagen rods in fibroblasts and scales in certain algae — which are much too 
large to fit into classical transport vesicles — move progressively through the 
Golgi stack. 

In the cisternal maturation model, retrograde flow explains the characteris- 
tic distribution of Golgi enzymes. Everything moves continuously forward with 
the maturing cisterna, including the processing enzymes that belong in the early 
Golgi apparatus. But budding COPI-coated vesicles continually collect the appro- 
priate enzymes, almost all of which are membrane proteins, and carry them back 
to the earlier cisterna where they function. A newly formed cis cisterna would 
therefore receive its normal complement of resident enzymes primarily from the 
cisterna just ahead of it and would later pass those enzymes back to the next cis 
cisterna that forms. 

As we discuss later, when a cisterna finally moves forward to become part of 
the trans Golgi network, various types of coated vesicles bud off it until this net- 
work disappears, to be replaced by a maturing cisterna just behind. At the same 
time, other transport vesicles are continually retrieving membrane from post- 
Golgi compartments and returning this membrane to the trans Golgi network. 

The vesicular transport and the cisternal maturation models are not mutu- 
ally exclusive. Indeed, evidence suggests that transport may occur by a combi- 
nation of the two mechanisms, in which some cargo is moved forward rapidly in 
transport vesicles, whereas other cargo is moved forward more slowly as the 
Golgi apparatus constantly renews itself through cisternal maturation. 



Figure 13-35 Two possible models 
explaining the organization of the Golgi 
apparatus and the transport of proteins 
from one cisterna to the next. It is likely 
that the transport through the Golgi 
apparatus in the forward direction (red 
arrows) involves elements of both 
models. (A) In the vesicular transport 
model, Golgi cisternae are static 
organelles, which contain a characteristic 
complement of resident enzymes. The 
passing of molecules from cis to trans 
through the Golgi is accomplished by 
forward-moving transport vesicles, which 
bud from one cisterna and fuse with the 
next in a cis-Xo-trans direction. 
(B) According to the alternative cisternal 
maturation model, each Golgi cisterna 
matures as it migrates outward through 
the stack. At each stage, the Golgi 
resident proteins that are carried forward 
in a cisterna are moved backward to an 
earlier compartment in COPI-coated 
vesicles. When a newly formed cisterna 
moves to a medial position, for example, 
"left-over" cis Golgi enzymes would be 
extracted and transported retrogradely 
to a new cis cisterna behind. Likewise, the 
medial enzymes would be received by 
retrograde transport from the cisternae 
just ahead. In this way, a cis cisterna 
would mature to a medial cisterna as it 
moves outward. 



Golgi Matrix Proteins Help Organize the Stack 

The unique architecture of the Golgi apparatus depends on both the micro- 
tubule cytoskeleton, as already mentioned, and cytoplasmic Golgi matrix pro- 
teins, which form a scaffold between adjacent cisternae and give the Golgi stack 
its structural integrity. Some of the matrix proteins form long, filamentous teth- 
ers, which are thought to help retain Golgi transport vesicles close to the 
organelle. When the cell prepares to divide, mitotic protein kinases phosphory- 
late the Golgi matrix proteins, causing the Golgi apparatus to fragment and dis- 
perse throughout the cvtosol. The Golgi fragments are then distributed evenlvto 
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the two daughter cells, where the matrix proteins are dephosphorylated, leading 
to the reassembly of the Golgi apparatus. 

Remarkably, the Golgi matrix proteins can assemble into appropriately 
localized stacks near the centrosome even when Golgi membrane proteins are 
experimentally prevented from leaving the ER. This observation suggests that 
the matrix proteins are largely responsible for both the structure and location of 
the Golgi apparatus. 



Summary 

Correctly folded and assembled proteins in the ER are packaged into COPIl-coated 
transport vesicles that pinch off from the ER membrane. Shortly thereafter, the vesicles 
shed their coat and fuse with one another to form vesicular tubular clusters. The clus- 
ters then move on microtubule tracks to the Golgi apparatus, where they fuse with one 
another to form the cis-Golgi network. Any resident ER proteins that escape from the 
ER are returned therefrom the vesicular tubular clusters and Golgi apparatus by ret- 
rograde transport in COPI-coated vesicles. 

The Golgi apparatus, unlike the ER, contains many sugar nucleotides, which gly- 
cosyl transferase enzymes use to perform glycosylation reactions on lipid and protein 
molecules as they pass through the Golgi apparatus. The mannoses on the N-linked 
oligosaccharides that are added to proteins in the ER are often initially removed, and 
further sugars are added. Moreover, the Golgi apparatus is the site where O-linked gly- 
cosylation occurs and where glycosaminoglycan chains are added to core proteins to 
form proteoglycans. Sulfation of the sugars in proteoglycans and of selected tyrosines 
on proteins also occurs in a late Golgi compartment. 

The Golgi apparatus modifies the many proteins and lipids that it receives from 
the ER and then distributes them to the plasma membrane, lysosomes, and secretory 
vesicles. The Golgi apparatus is a polarized organelle, consisting of one or more stacks 
of disc- shaped cisternae, each stack organized as a series of at least three functionally 
distinct compartments, termed cis, medial, and trans cisternae. The cis and trans cis- 
ternae are both connected to special sorting stations, called the cis Golgi network and 
the trans Golgi network, respectively. Proteins and lipids move through the Golgi stack 
in the cis-fo-trans direction. This movement may occur by vesicular transport, by pro- 
gressive maturation of the cis cisternae as they migrate continuously through the 
stack, or, most likely, by a combination of these two mechanisms. Continual retrograde 
vesicular transport from more distal cisternae is thought to keep the enzymes concen- 
trated in the cisternae where they are needed. The finished new proteins end up in the 
trans Golgi network, which packages them in transport vesicles and dispatches them 
to their specific destinations in the cell. 



TRANSPORT FROM THE TRANS GOLGI NETWORK 
TO LYSOSOMES 

The trans Golgi network sorts all of the proteins that pass through the Golgi 
apparatus (except those that are retained there as permanent residents) accord- 
ing to their final destination. The sorting mechanism is especially well under- 
stood for those proteins destined for the lumen of lysosomes, and in this section 
we consider this selective transport process. We begin with a brief account of 
lysosome structure and function. 

Lysosomes Are the Principal Sites of Intracellular Digestion 

Lysosomes are membrane-enclosed compartments filled with soluble 
hydrolytic enzymes that control intracellular digestion of macromolecules. 
Lysosomes contain about 40 types of hydrolytic enzymes, including proteases, 
nucleases, glycosidases, lipases, phospholipases, phosphatases, and sulfa- 
tases. All are acid hydrolases. For optimal activity, they need to be activated by 
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proteolytic cleavage and require an acid environment, which the lysosome pro- 
vides by maintaining a pH of about 4.5-5.0 in its interior. By this arrangement, 
the contents of the cytosol are doubly protected against attack by the cell's own 
digestive system: the membrane of the lysosome keeps the digestive enzymes 
out of the cytosol, but even if they leak out, they can do little damage at the 
cytosolic pH of about 7.2. 

Like all other intracellular organelles, the lysosome not only contains a 
unique collection of enzymes, but also has a unique surrounding membrane. 
Most of the lysosomal membrane proteins, for example, are unusually highly 
glycosylated, which helps to protect them from the lysosomal proteases in the 
lumen. Transport proteins in the lysosomal membrane carry the final products 
of the digestion of macromolecules — such as amino acids, sugars, and 
nucleotides — to the cytosol, where the cell can either reuse or excrete them. 

A vacuolar H + ATPase in the lysosomal membrane uses the energy of ATP 
hydrolysis to pump H + into the lysosome, thereby maintaining the lumen at its 
acidic pH (Figure 13-36) . The lysosomal H + pump belongs to the family of V-type 
ATPases and has a similar architecture to the mitochondrial and chloroplast ATP 
synthases (F-type ATPases), which convert the energy stored in H + gradients into 
ATP (see Figure 11-12). By contrast to these enzymes, however, the vacuolar H + 
ATPase exclusively works in reverse, pumping H + into the organelle. Similar or 
identical V-type ATPases acidify all endocytic and exocytic organelles, including 
lysosomes, endosomes, selected compartments of the Golgi apparatus, and 
many transport and secretory vesicles. In addition to providing a low-pH envi- 
ronment that is suitable for reactions occurring in the organelle lumen, the H + 
gradient provides a source of energy that drives the transport of small metabo- 
lites across the organelle membrane. 
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Figure 1 3-36 Lysosomes. The acid 
hydrolases are hydrolytic enzymes that 
are active under acidic conditions. A 
V-type ATPase in the membrane pumps 
H + into the lysosome, maintaining its 
lumen at an acidic pH. 



Lysosomes Are Heterogeneous 

Lysosomes were initially discovered by the biochemical fractionation of cell 
extracts; only later were they seen clearly in the electron microscope. Although 
extraordinarily diverse in shape and size, staining them with specific antibodies 
shows they are members of a single family of organelles. They can also be iden- 
tified by histochemistry using the precipitate produced by the action of an acid 
hydrolase on its substrate to indicate which organelles contain the hydrolase 
(Figure 13-37). By this criterion, lysosomes are found in all eucaryotic cells. 

The heterogeneity of lysosomal morphology contrasts with the relatively 
uniform structures of most other cell organelles. The diversity reflects the wide 
variety of digestive functions that acid hydrolases mediate, including the break- 
down of intra- and extracellular debris, the destruction of phagocytosed 
microorganisms, and the production of nutrients for the cell. The diversity of 
lysosomal morphology, however, also reflects the way lysosomes form: late 
endosomes contain material received from both the plasma membrane by 
endocytosis and newly synthesized lysosomal hydrolases, and they therefore 
already bear a resemblance to lysosomes. Late endosomes fuse with preexisting 
lysosomes to form structures that are sometimes referred to as endolysosomes, 
which then fuse with one another (Figure 13-38). When the majority of the 
endocytosed material within an endolysosome has been digested so that only 
resistant or slowly digestable residues remain, these organelles become "classi- 
cal" lysosomes. These are relatively dense, round, and small, but they can enter 

Figure 1 3-37 Histochemical visualization of lysosomes. These electron 
micrographs show two sections of a cell stained to reveal the location of 
acid phosphatase, a marker enzyme for lysosomes. The larger membrane- 
enclosed organelles, containing dense precipitates of lead phosphate, are 
lysosomes. Their diverse morphology reflects variations in the amount and 
nature of the material they are digesting. The precipitates are produced 
when tissue fixed with glutaraldehyde (to fix the enzyme in place) is 
incubated with a phosphatase substrate in the presence of lead ions. Red 
arrows in the top panel indicate two small vesicles thought to be carrying 
acid hydrolases from the Golgi apparatus. (Courtesy of Daniel S. Friend.) 
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Figure 13-38 A model for lysosome 
maturation. The heterogeneity of 
lysosomal morphology reflects, in part, 
the different nature of the materials 
delivered to the organelle, as well as the 
different stages in the maturation cycle 
shown here. 



late endosome 



endolysosome 



lysosome 



the cycle again by fusing with late endosomes or endolysosomes. Thus, there is 
no real distinction between late endosomes and lysosomes: they are the same 
except that they are in different stages of a maturation cycle. For this reason, 
lysosomes are sometimes viewed as a heterogeneous collection of distinct 
organelles, the common feature of which is a high content of hydrolytic 
enzymes. It is especially hard to apply a narrower definition than this in plant 
cells, as we discuss next. 



Plant and Fungal Vacuoles Are Remarkably Versatile Lysosomes 

Most plant and fungal cells (including yeasts) contain one or several very large, 
fluid-filled vesicles called vacuoles. They typically occupy more than 30% of the 
cell volume, and as much as 90% in some cell types (Figure 13-39). Vacuoles are 
related to animal cell lysosomes and contain a variety of hydrolytic enzymes, but 
their functions are remarkably diverse. The plant vacuole can act as a storage 
organelle for both nutrients and waste products, as a degradative compartment, 
as an economical way of increasing cell size (Figure 13-40), and as a controller 
of turgor pressure (the osmotic pressure that pushes outward on the cell wall and 
keeps the plant from wilting). The same cell may have different vacuoles with 
distinct functions, such as digestion and storage. 

The vacuole is important as a homeostatic device, enabling plant cells to 
withstand wide variations in their environment. When the pH in the environ- 
ment drops, for example, the flux of H + into the cytosol is balanced, at least in 
part, by an increased transport of H + into the vacuole, which tends to keep the 
pH in the cytosol constant. Similarly, many plant cells maintain an almost con- 
stant turgor pressure despite large changes in the tonicity of the fluid in their 
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Figure 1 3-39 The plant cell vacuole. This 
electron micrograph of cells in a young 
tobacco leaf shows the cytosol as a thin 
layer, containing chloroplasts, pressed 
against the cell wall by the enormous 
vacuole. The vacuole membrane is called 
the tonoplast. (Courtesy of J. Burgess.) 
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immediate environment. They do so by changing the osmotic pressure of the 
cytosol and vacuole — in part by the controlled breakdown and resynthesis of 
polymers such as polyphosphate in the vacuole, and in part by altering the 
transport rates of sugars, amino acids, and other metabolites across the plasma 
membrane and the vacuolar membrane. The turgor pressure regulates the activ- 
ities of distinct transporters in each membrane to control these fluxes. 

Humans often harvest substances stored in plant vacuoles. In different 
species, these range from rubber to opium to the flavoring of garlic. Many stored 
products have a metabolic function. Proteins, for example, can be preserved for 
years in the vacuoles of the storage cells of many seeds, such as those of peas 
and beans. When the seeds germinate, these proteins are hydrolyzed and the 
resulting amino acids provide a food supply for the developing embryo. Antho- 
cyanin pigments stored in vacuoles color the petals of many flowers so as to 
attract pollinating insects, while noxious molecules released from vacuoles 
when a plant is eaten or damaged provide a defense against predators. 



Figure 1 3-40 The role of the vacuole in 
controlling the size of plant cells. 

A plant cell can achieve a large increase 
in volume without increasing the volume 
of the cytosol. Localized weakening of 
the cell wall orients a turgor-driven cell 
enlargement that accompanies the 
uptake of water into an expanding 
vacuole. The cytosol is eventually 
confined to a thin peripheral layer, which 
is connected to the nuclear region by 
strands of cytoplasm stabilized by 
bundles of actin filaments (not shown). 



Multiple Pathways Deliver Materials to Lysosomes 

Lysosomes are usually meeting places where several streams of intracellular traf- 
fic converge. A route that leads outwards from the ER via the Golgi apparatus 
delivers most digestive enzymes, while at least three paths from different 
sources feed substances into lysosomes for digestion. 

The best studied of these degradation paths in lysosomes is the one followed 
by macromolecules taken up from extracellular fluid by endocytosis. As dis- 
cussed in detail later, endocytosed molecules are initially delivered in vesicles to 
small, irregularly shaped intracellular organelles called early endosomes. Here 
endocytosed materials first meet the lysosomal hydrolases, which are delivered 
to the endosome from the Golgi apparatus. Some of the ingested molecules are 
selectively retrieved and recycled to the plasma membrane, while others pass on 
into late endosomes. The interior of the late endosomes is mildly acidic (pH -6), 
and it is the site where the hydrolytic digestion of the endocytosed molecules 
begins. As discussed above, mature lysosomes form by a maturation process 
from late endosomes, accompanied by a further decrease in internal pH. As 
lysosomes mature, endosomal membrane proteins are selectively retrieved from 
the developing lysosome by transport vesicles that deliver these proteins back to 
endosomes or the TGN. 

All cell types use a second degradation pathway in lysosomes to dispose of 
obsolete parts of the cell itself— a process called autophagy. In a liver cell, for 
example, an average mitochondrion has a lifetime of about 10 days, and elec- 
tron microscopic images of normal cells reveal lysosomes containing (and pre- 
sumably digesting) mitochondria, as well as other organelles. The process 
seems to begin with the enclosure of an organelle by a double membrane of 
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unknown origin, creating an autophagosome, which then fuses with a lysosome 
(or a late endosome). The process is highly regulated, and selected cell compo- 
nents can somehow be marked for lysosomal destruction during cell remodel- 
ing. For example, the smooth ER that proliferates in a liver cell in a detoxification 
response to lipid-soluble drugs such as phenobarbital (discussed in Chapter 12) 
is selectively removed by autophagy after the drug is withdrawn. 

Similarly other obsolete organelles, including senescent peroxisomes or 
mitochondria, can be selectively targeted for degradation by autophagy. Under 
starvation conditions, large portions of the cytosol are nonselectively captured 
into autophagosomes. Metabolites derived from the digestion of the captured 
material help the cell survive when external nutrients are limiting. 

In addition to maintaining basic cell functions in balance and helping to dis- 
pose of obsolete parts, autophagy also has a role in development and health. It 
helps restructure differentiating cells by disposing of no longer needed parts and 
helps defend against invading viruses and bacteria. Autophagy is uniquely suited 
as a mechanism that can remove whole organelles or large protein aggregates, 
which other mechanisms such as proteasomal degradation cannot handle. 

We still know very little about the events that lead to the formation of 
autophagosomes, or how the process of autophagy is controlled and targeted at 
specific organelles. More than 25 different proteins have been identified in yeast 
and animal cells that participate in the process. Autophagy can be divided into 
four general steps: (1) nucleation and extension of a delimiting membrane into 
a crescent-shaped structure that engulfs a portion of the cytoplasm, (2) closure 
of the autophagosome into a sealed double-membrane-bounded compartment, 
(3) fusion of the new compartment with lysosomes, and (4) digestion of the 
inner membrane of the autophagosome and its contents (Figure 13-41). Many 
mysteries remain to be solved, including identifying the membrane system from 
which the vesicles that form the autophagosomal envelope derive, and how 
some target organelles can be enclosed so selectively 

As we discuss later, the third pathway that brings materials to lysosomes for 
degradation is found mainly in cells specialized for the phagocytosis of large par- 
ticles and microorganisms. Such professional phagocytes (macrophages and 
neutrophils in vertebrates) engulf objects to form a phagosome, which is then 
converted to a lysosome in the manner described for the autophagosome. Fig- 
ure 13-42 summarizes the three pathways. 



A Mannose 6-Phosphate Receptor Recognizes Lysosomal Proteins 
in the Trans Golgi Network 

We now consider the pathway that delivers lysosomal hydrolases and mem- 
brane proteins to lysosomes. Both classes of proteins are co-translationally 
transported into the rough ER and then transported through the Golgi appara- 
tus to the TGN. The transport vesicles that deliver these proteins to endosomes 
(from where the proteins are moved on to lysosomes) bud from the TGN. The 
vesicles incorporate the lysosomal proteins and exclude the many other pro- 
teins being packaged into different transport vesicles for delivery elsewhere. 



Figure 1 3-41 A model of autophagy. 

After a nucleation event in the 
cytoplasm, a crescent of autophagosomal 
membrane grows by fusion of vesicles of 
unknown origin that extend its edges. 
Eventually, a membrane fusion event 
closes the autophagosome, sequestering 
a portion of the cytoplasm of the cell in a 
double membrane. The autophagosome 
then fuses with lysosomes containing 
acid hydrolases that digest its content. 
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How are lysosomal proteins recognized and selected in the TGN with the 
required accuracy? We know the answer for the lysosomal hydrolases. They carry 
a unique marker in the form of mannose 6-phosphate (M6P) groups, which are 
added exclusively to the N-linked oligosaccharides of these soluble lysosomal 
enzymes as they pass through the lumen of the ds Golgi network (Figure 13-43). 
Transmembrane M6P receptor proteins, which are present in the TGN, recog- 
nize the M6P groups. The receptor proteins bind to lysosomal hydrolases on the 
lumenal side of the membrane and to adaptor proteins in assembling clathrin 
coats on the cytosolic side. In this way, they help package the hydrolases into 
clathrin-coated vesicles that bud from the TGN. The vesicles shed their coat and 
deliver their contents to early endosomes. 



Figure 1 3-42 Three pathways to 
degradation in lysosomes. (A) Materials 
in each pathway are derived from a 
different source. Note that the 
autophagosome has a double 
membrane. (B) An electron micrograph of 
an autophagosome containing a 
mitochondrion and a peroxisome. 
(B, courtesy of Daniel S. Friend, from 
D.W. Fawcett, A Textbook of Histology, 
1 2th ed. New York: Chapman and 
Hall, 1994. With permission from Kluwer.) 
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The M6P Receptor Shuttles Between Specific Membranes 

The M6P receptor protein binds its specific oligosaccharide at pH 6.5-6.7 in the 
TGN and releases it at pH 6, which is the pH in the interior of late endosomes. 
Thus, as the pH drops during endosomal maturation, the lysosomal hydrolases 
dissociate from the M6P receptor and eventually begin to digest the material 
delivered by endocytosis. An acid phosphatase removes the phosphate group 
from the mannose, thereby destroying the sorting signal and contributing to the 
release of the lysosomal hydrolases from the M6P receptor. Having released their 
bound enzymes, the M6P receptors are retrieved into retromer-coated transport 
vesicles that bud from endosomes; the receptors are then returned to the mem- 
brane of the TGN for reuse (Figure 13-44). Transport in either direction requires 
signals in the cytoplasmic tail of the M6P receptor that direct this protein to the 
endosome or back to the Golgi apparatus. These signals are recognized by the 
retromer complex (see Figure 13-9) that recruits M6P receptors into vesicles in 
endosomes. The recycling of the M6P receptor resembles the recycling of the 
KDEL receptor discussed earlier, although it differs in the type of coated vesicles 
that mediate the transport. 

Not all the hydrolase molecules that are tagged with M6P get to lysosomes. 
Some escape the normal packaging process in the trans Golgi network and are 
transported "by default" to the cell surface, where they are secreted into the 
extracellular fluid. Some M6P receptors, however, also take a detour to the 
plasma membrane, where they recapture the escaped lysosomal hydrolases and 
return them by receptor-mediated endocytosis to lysosomes via early and late 
endosomes. As lysosomal hydrolases require an acidic milieu to work, they can 
do little harm in the extracellular fluid, which usuallv has a neutral dH of 7.4. 



mannose 6-phosphate 
(M6P) 



p -O-CH 



HO 

A/-I inked — ^ 
oligosaccharide 



lysosomal 
hydrolase 




Figure 1 3-43 The structure of mannose 
6-phosphate on a lysosomal hydrolase. 
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A Signal Patch in the Hydrolase Polypeptide Chain Provides the 
Cue for M6P Addition 

The sorting system that segregates lysosomal hydrolases and dispatches them to 
endosomes works because M6P groups are added only to the appropriate glyco- 
proteins in the Golgi apparatus. This requires specific recognition of the hydro- 
lases by the Golgi enzymes responsible for adding M6R Since all glycoproteins 
leave the ER with identical iV-linked oligosaccharide chains, the signal for 
adding the M6P units to oligosaccharides must reside somewhere in the 
polypeptide chain of each hydrolase. Genetic engineering experiments have 
revealed that the recognition signal is a cluster of neighboring amino acids on 
each protein's surface, known as a signal patch. 

Two enzymes act sequentially to catalyze the addition of M6P groups to 
lysosomal hydrolases. The first is a GlcNAc phosphotransferase in the cis Golgi 
that specifically binds the hydrolase and adds GlcNAc-phosphate to one or two 
of the mannose residues on each oligosaccharide chain (Figure 13-45). A sec- 
ond enzyme in the trans Golgi then cleaves off the GlcNAc residue, leaving 
behind a newly created M6P marker. Since most lysosomal hydrolases contain 
multiple oligosaccharides, they acquire many M6P residues, providing a high- 
affinity signal for the M6P receptor. 



Defects in the GlcNAc Phosphotransferase Cause a Lysosomal 
Storage Disease in Humans 

Genetic defects that affect one or more of the lysosomal hydrolases cause a 
number of human lysosomal storage diseases. The defects result in an accu- 
mulation of undigested substrates in lysosomes, with severe pathological con- 
sequences, most often in the nervous system. In most cases, there is a mutation 
in a structural gene that codes for an individual lysosomal hydrolase. This occurs 
in Hurler's disease, for example, in which the enzyme required for the breakdown 
of certain types of glycosaminoglycan chains is defective or missing. The most 
severe form of lysosomal storage disease, however, is a very rare disorder called 
inclusion- cell disease {I-cell disease). In this condition, almost all of the 
hydrolytic enzymes are missing from the lysosomes of fibroblasts, and their 
undigested substrates accumulate in lysosomes, which consequently form large 
inclusions in the patients' cells. 



Figure 1 3-44 The transport of newly 
synthesized lysosomal hydrolases to 
lysosomes. The sequential action of two 
enzymes in the cis and trans Golgi 
network adds mannose 6-phosphate 
(M6P) groups to the precursors of 
lysosomal enzymes (see Figure 13-45). 
They then segregate from all other types 
of proteins in the TGN because 
monomeric adaptor proteins in the 
clathrin coat bind the M6P receptors, 
which, in turn, bind the modified 
lysosomal hydrolases. The clathrin-coated 
vesicles bud off from the TGN, shed their 
coat, and fuse with early endosomes. At 
the lower pH of the endosome, the 
hydrolases dissociate from the M6P 
receptors, and the empty receptors are 
recycled in retromer-coated vesicles to 
the Golgi apparatus for further rounds of 
transport. In the endosomes, the 
phosphate is removed from the mannose 
sugars attached to the hydrolases, further 
ensuring that the hydrolases do not 
return to the Golgi apparatus with the 
receptor. 
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Figure 13-45 The recognition of a lysosomal hydrolase. 

The GlcNAc phosphotransferase enzyme that recognizes 
lysosomal hydrolases in the Golgi apparatus has separate 
catalytic and recognition sites. The catalytic site binds both 
high-mannose /V-linked oligosaccharides and UDP-GlcNAc. 
The recognition site binds to a signal patch that is present 
only on the surface of lysosomal hydrolases. A second 
enzyme cleaves off the GlcNAc, leaving the M6P exposed. 
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I-cell disease is due to a single gene defect and, like most genetic enzyme 
deficiencies, it is recessive — that is, it occurs only in individuals having two 
copies of the defective gene. In patients with I-cell disease, all the hydrolases 
missing from lysosomes are found in the blood. Because they fail to sort prop- 
erly in the Golgi apparatus, the hydrolases are secreted rather than transported 
to lysosomes. The missorting has been traced to a defective or missing GlcNAc - 
phosphotransferase. Because lysosomal enzymes are not phosphorylated in the 
cis Golgi network, the M6P receptors do not segregate them into the appropriate 
transport vesicles in the TGN. Instead, the lysosomal hydrolases are carried to 
the cell surface and secreted by a default pathway 

In I-cell disease, the lysosomes in some cell types, such as hepatocytes, con- 
tain a normal complement of lysosomal enzymes, implying that there is another 
pathway for directing hydrolases to lysosomes that is used by some cell types but 
not others. The nature of this M6P- independent pathway is unknown. Similarly, 
an M6P- independent pathway in all cells sorts the membrane proteins of lyso- 
somes from the TGN for transport to late endosomes, and those proteins are 
therefore normal in I-cell disease. These membrane proteins exit from the TGN 
in clathrin- coated vesicles that are distinct from those that transport the M6P- 
tagged hydrolases and use different adaptor proteins. 

It is unclear why cells need more than one sorting pathway to construct 
lysosomes, although it is perhaps not surprising that different mechanisms 
operate for soluble and membrane-bound lysosomal proteins, especially 
since — unlike M6P receptors — those membrane proteins are lysosomal resi- 
dents and need not be returned to the TGN. 
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Some Lysosomes Undergo Exocytosis 

Targeting of material to lysosomes is not necessarily the end of the pathway. 
Lysosomal secretion of undigested content enables all cells to eliminate undi- 
gestible debris. For most cells, this seems to be a minor pathway, used only when 
the cells are stressed. Some cell types, however, contain specialized lysosomes 
that have acquired the necessary machinery for fusion with the plasma mem- 
brane. Melanocytes in the skin, for example, produce and store pigments in their 
lysosomes. These pigment-containing melanosomes release their pigment into 
the extracellular space of the epidermis by exocytosis. The pigment is then taken 
up by keratinocytes, leading to normal skin pigmentation. In some genetic dis- 
orders, defects in melanosome exocytosis block this transfer process, leading to 
forms of hypopigmentation (albinism). 



Summary 

Lysosomes are specialized for the intracellular digestion ofmacromolecules. They con- 
tain unique membrane proteins and a wide variety of soluble hydrolytic enzymes that 
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operate best atpH5, which is the internal pH oftysosomes. An ATP-driven H + pump in 
the lysosomal membrane maintains this low pH. Newly synthesized lysosomal pro- 
teins are transferred into the lumen of the ER, transported through the Golgi appara- 
tus, and then carried from the trans Golgi network to late endosomes by means of 
clathrin- coated transport vesicles. 

The lysosomal hydrolases contain N-linked oligosaccharides that are covalently 
modified in a unique way in the cis Golgi network so that their mannose residues are 
phosphorylated. These mannose 6-phosphate (M6P) groups are recognized by an M6P 
receptor protein in the trans Golgi network that segregates the hydrolases and helps 
package them into budding transport vesicles that deliver their contents to endosomes. 
The M6P receptors shuttle back and forth between the trans Golgi network and these 
endosomes. The low pH in endosomes and removal of the phosphate from the M6P 
group cause the lysosomal hydrolases to dissociate from these receptors, making the 
transport of the hydrolases unidirectional. A separate transport system uses clathrin- 
coated vesicles to deliver resident lysosomal membrane proteins from the trans Golgi 
network. 



TRANSPORT INTO THE CELL FROM THE PLASMA 
MEMBRANE: ENDOCYTOSIS 

The routes that lead inward from the cell surface to lysosomes start with the pro- 
cess of endocytosis, by which cells take up macromolecules, particulate sub- 
stances, and, in specialized cases, even other cells. In this process, the material 
to be ingested is progressively enclosed by a small portion of the plasma mem- 
brane, which first invaginates and then pinches off to form an endocytic vesicle 
containing the ingested substance or particle. Two main types of endocytosis 
differ according to the size of the endocytic vesicles formed. In phagocytosis 
("cell eating"), large particles are ingested via large vesicles called phagosomes 
(generally >250 nm in diameter). In pinocytosis ("cell drinking"), fluid and 
solutes are ingested via small pinocytic vesicles (about 100 nm in diameter). Most 
eucaryotic cells are continually ingesting fluid and solutes by pinocytosis; large 
particles are ingested most efficiently by specialized phagocytic cells. 

Specialized Phagocytic Cells Can Ingest Large Particles 

Phagocytosis is a special form of endocytosis in which a cell uses large endo- 
cytic vesicles called phagosomes to ingest large particles such as microorgan- 
isms and dead cells. <TCAT> In protozoa, phagocytosis is a form of feeding: large 
particles taken up into phagosomes end up in lysosomes, and the products of 
the subsequent digestive processes pass into the cytosol to be used as food. 
However, few cells in multicellular organisms are able to ingest such large parti- 
cles efficiently. In the gut of animals, for example, extracellular processes break 
down food particles, and cells import the small hydrolysis products. 

Phagocytosis is important in most animals for purposes other than nutri- 
tion, and it is carried out mainly by specialized cells — so-called professional 
phagocytes. In mammals, two classes of white blood cells act as professional 
phagocytes — macrophages and neutrophils. These cells develop from 
hemopoietic stem cells (discussed in Chapter 23), and they ingest invading 
microorganisms to defend us against infection. Macrophages also have an 
important role in scavenging senescent cells and cells that have died by apopto- 
sis (discussed in Chapter 18). In quantitative terms, the clearance of senescent 
and dead cells is by far the most important: our macrophages phagocytose more 
than 10 11 senescent red blood cells in each of us every day, for example. 

Whereas the endocytic vesicles involved in pinocytosis are small and uni- 
form, phagosomes have diameters that are determined by the size of the 
ingested particle, and they can be almost as large as the phagocytic cell itself 
(Figure 13-46). The phagosomes fuse with lysosomes inside the cell, and the 
ingested material is then degraded. Any indigestible substances will remain in 
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lysosomes, forming residual bodies, which can be excreted from cells by exocy- 
tosis, as we have previously discussed. Some of the internalized plasma mem- 
brane components never reach the lysosome, because they are retrieved from 
the phagosome in transport vesicles and returned to the plasma membrane. 

To be phagocytosed, particles must first bind to the surface of the phago- 
cyte. Not all particles that bind are ingested, however. Phagocytes have a variety 
of specialized surface receptors that are functionally linked to the phagocytic 
machinery of the cell Phagocytosis is a triggered process. That is, it requires the 
activation of receptors that transmit signals to the cell interior and initiate the 
response. By contrast, pinocytosis is a constitutive process. It occurs continu- 
ously, regardless of the needs of the cell. The best-characterized triggers of 
phagocytosis are antibodies, which protect us by binding to the surface of infec- 
tious microorganisms to form a coat that exposes the tail region on the exterior 
of each antibody molecule. This tail region is called the Fc region (discussed in 
Chapter 25). The antibody coat is recognized by specific Fc receptors on the sur- 
face of macrophages and neutrophils, whose binding induces the phagocytic 
cell to extend pseudopods that engulf the particle and fuse at their tips to form 
a phagosome (Figure 13-47A). Localized actin polymerization, initiated by Rho- 
family GTPases and their activating Rho-GEFs (discussed in Chapters 15 and 
16), shapes the pseudopods. An active Rho GTPase switches on the kinase activ- 
ity of local PI kinases, and initial actin polymerization occurs in response to an 
accumulation of PI(4,5)P 2 in the membrane (see Figure 13-11). To seal off the 
phagosome and complete its engulfment, actin is depolymerized at its base as 
PI(4,5)P 2 is subjected to a PI 3-kinase, which converts it to PI(3,4,5)P 3 . PI(3,4,5)P 3 
is required for closure of the phagosome and may also contribute to reshaping 
the actin network to help drive the invagination of the forming phagosome (Fig- 
ure 13-47B). In this way, the ordered generation and consumption of specific 
phosphoinositides guides sequential steps in phagosome formation. 

Several other classes of receptors that promote phagocytosis have been 
characterized. Some recognize complement components, which collaborate 
with antibodies in targeting microbes for destruction (discussed in Chapter 24). 
Others directly recognize oligosaccharides on the surface of certain microor- 
ganisms. Still others recognize cells that have died by apoptosis. Apoptotic cells 
lose the asymmetric distribution of phospholipids in their plasma membrane. 
As a consequence, negatively charged phosphatidylserine, which is normally 
confined to the cytosolic leaflet of the lipid bilayer, is now exposed on the out- 
side of the cell, where it helps to trigger the phagocytosis of the dead cell. 
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Figure 13-46 Phagocytosis by a 
macrophage. A scanning electron 
micrograph of a mouse macrophage 
phagocytosing two chemically altered 
red blood cells. The red arrows point to 
edges of thin processes (pseudopods) of 
the macrophage that are extending as 
collars to engulf the red cells. (Courtesy 
of Jean Paul Revel.) 
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Figure 1 3-47 A neutrophil reshaping a plasma membrane during 
phagocytosis. (A) An electron micrograph of a neutrophil 
phagocytosing a bacterium, which is in the process of dividing. 
(B) Pseudopod extension and phagosome formation are driven by 
actin polymerization and reorganization, which respond to the 
accumulation of specific phosphoinositides in the membrane of the 
forming phagosome. (A, courtesy of Dorothy F. Bainton, Phagocytic 
Mechanisms in Health and Disease. New York: Intercontinental Medical 
Book Corporation, 1971.) 
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Remarkably, macrophages will also phagocytose a variety of inanimate par- 
ticles — such as glass or latex beads and asbestos fibers — yet they do not phago- 
cytose live animal cells. Living animal cells seem to display "dorit-eat-me" sig- 
nals in the form of cell-surface proteins that bind to inhibiting receptors on the 
surface of macrophages. The inhibitory receptors recruit tyrosine phosphatases 
that antagonize the intracellular signaling events required to initiate phagocyto- 
sis, thereby locally inhibiting the phagocytic process. Thus phagocytosis, like 
many other cell processes, depends on a balance between positive signals that 
activate the process and negative signals that inhibit it. Apopototic cells are 
thought both to gain "eat-me" signals (such as extracellularly exposed phos- 
phatidylserine) and to lose their "dorit-eat-me" signals, causing them to be very 
rapidly phagocytosed by macrophages. 



Pinocytic Vesicles Form from Coated Pits in the Plasma 
Membrane 

Virtually all eucaryotic cells continually ingest bits of their plasma membrane in 
the form of small pinocytic (endocytic) vesicles, which are later returned to the 
cell surface. The rate at which plasma membrane is internalized in this process 
of pinocytosis varies between cell types, but it is usually surprisingly large. A 
macrophage, for example, ingests 25% of its own volume of fluid each hour. This 
means that it must ingest 3% of its plasma membrane each minute, or 100% in 
about half an hour. Fibroblasts endocytose at a somewhat lower rate (1% of their 
plasma membrane per minute), whereas some amoebae ingest their plasma 
membrane even more rapidly. Since a cell's surface area and volume remain 
unchanged during this process, it is clear that the same amount of membrane 
being removed by endocytosis is being added to the cell surface by the converse 
process of exocytosis. In this sense, endocytosis and exocytosis are linked pro- 
cesses that can be considered to constitute an endocytic-exocytic cycle. The cou- 
pling between exocytosis and endocytosis is particularly strict in specialized 
structures characterized by high membrane turnover, such as the neuronal 
synapse. 

The endocytic part of the cycle often begins at clathrin-coated pits. These 
specialized regions typically occupy about 2% of the total plasma membrane 
area. The lifetime of a clathrin-coated pit is short: within a minute or so of being 
formed, it invaginates into the cell and pinches off to form a clathrin-coated 
vesicle (Figure 13-48). It has been estimated that about 2500 clathrin-coated 
vesicles leave the plasma membrane of a cultured fibroblast every minute. The 
coated vesicles are even more transient than the coated pits: within seconds of 
being formed, they shed their coat and are able to fuse with early endosomes. 
Since extracellular fluid is trapped in clathrin-coated pits as they invaginate to 



Figure 1 3-48 The formation of clathrin- 
coated vesicles from the plasma 
membrane. These electron micrographs 
illustrate the probable sequence of 
events in the formation of a clathrin- 
coated vesicle from a clathrin-coated pit. 
The clathrin-coated pits and vesicles 
shown are larger than those seen in 
normal-sized cells. They take up 
lipoprotein particles into a very large hen 
oocyte to form yolk. The lipoprotein 
particles bound to their membrane- 
bound receptors appear as a dense, fuzzy 
layer on the extracellular surface of the 
plasma membrane— which is the inside 
surface of the vesicle. (Courtesy of 
M.M. Perry and A.B. Gilbert, J. CeltSci. 
39:257-272, 1979. With permission from 
The Company of Biologists.) 
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form coated vesicles, any substance dissolved in the extracellular fluid is inter- 
nalized — a process called fluid-phase endocytosis. 



Figure 13-49 Caveolae in the plasma 
membrane of a fibroblast. (A) This 
electron micrograph shows a plasma 
membrane with a very high density of 
caveolae. Note that no cytosolic coat is 
visible. (B) This rapid-freeze deep-etch 
image demonstrates the characteristic 
"cauliflower" texture of the cytosolic face 
of the caveolae membrane. The regular 
texture is thought to result from 
aggregates of caveolin in the membrane. 
A clathrin-coated pit is also seen at the 
upper right. (Courtesy of 
R.G.W. Anderson, from K.G. Rothberg et 
al., Cell 68:673-682, 1 992. With 
permission from Elsevier.) 



Not All Pinocytic Vesicles Are Clathrin-Coated 

In addition to clathrin-coated pits and vesicles, there are other, less well-under- 
stood mechanisms by which cells can form pinocytic vesicles. One of these 
pathways initiates at caveolae (from the Latin for "little cavities"), originally rec- 
ognized by their ability to transport molecules across endothelial cells, which 
form the inner lining of blood vessels. Caveolae are present in the plasma mem- 
brane of most cell types, and in some of these they are seen in the electron 
microscope as deeply invaginated flasks (Figure 13-49). They are thought to 
form from membrane microdomains, or lipid rafts, which are patches of the 
plasma membrane that are especially rich in cholesterol, glycosphingolipids, 
and GPI-anchored membrane proteins (see Figure 10-14). The major structural 
proteins in caveolae are caveolins, which are a family of unusual integral mem- 
brane proteins that each insert a hydrophobic loop into the membrane from the 
cytosolic side but do not extend across the membrane. 

In contrast to clathrin-coated and COPI- or COPII-coated vesicles, caveolae 
are thought to invaginate and collect cargo proteins by virtue of the lipid com- 
position of the calveolar membrane, rather than by the assembly of a cytosolic 
protein coat. Caveolins may stabilize these raft domains, into which certain 
plasma membrane proteins partition. Caveolae pinch off from the plasma mem- 
brane using dynamin, and they deliver their contents either to an endosome- 
like compartment (called a caveosome) or to the plasma membrane on the 
opposite side of a polarized cell (in a process called transcytosis, which we dis- 
cuss later). Because caveolins are integral membrane proteins, they do not dis- 
sociate from the vesicles after endocytosis; instead they are delivered to the tar- 
get compartments, where they are maintained as discrete membrane domains. 
Some animal viruses such as SV40 and papilloma virus (which causes warts) 
enter cells in vesicles derived from caveolae. The viruses are first delivered to 
caveosomes, and they move from there in specialized transport vesicles to the 
ER. The viral genome exits from the ER across the ER membrane into the cytosol, 
from where it is imported into the nucleus to start the infection cycle. 

Endocytic vesicles can also bud from caveolin-free raft domains on the 
plasma membrane and deliver their cargo to caveosomes. Molecules that enter 
the cell through caveosomes avoid endosomes and lysosomes and are therefore 
shielded from exposure to low pH and lysosomal hydrolases; it is unknown how 
they move from caveosomes to other destinations in the cell. 
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Cells Use Receptor-Mediated Endocytosis to Import Selected 
Extracellular Macromolecules 

In most animal cells, clathrin-coated pits and vesicles provide an efficient path- 
way for taking up specific macromolecules from the extracellular fluid. In this 
process, called receptor-mediated endocytosis, the macromolecules bind to 
complementary transmembrane receptor proteins, accumulate in coated pits, 
and then enter the cell as receptor-macromolecule complexes in clathrin-coated 
vesicles (see Figure 13-48). Because ligands are selectively captured by receptors, 
receptor-mediated endocytosis provides a selective concentrating mechanism 
that increases the efficiency of internalization of particular ligands more than a 
hundredfold. In this way even minor components of the extracellular fluid can be 
specifically taken up in large amounts without taking in a large volume of extra- 
cellular fluid. A particularly well-understood and physiologically important 
example is the process that mammalian cells use to take up cholesterol. 

Many animals cells take up cholesterol through receptor-mediated endocy- 
tosis and, in this way, acquire most of the cholesterol they require to make new 
membrane. If the uptake is blocked, cholesterol accumulates in the blood and 
can contribute to the formation in blood vessel (artery) walls of atherosclerotic 
plaques, deposits of lipid and fibrous tissue that can cause strokes and heart 
attacks by blocking arterial blood flow. In fact, it was a study of humans with a 
strong genetic predisposition for atherosclerosis that first revealed the mecha- 
nism of receptor-mediated endocytosis. 

Most cholesterol is transported in the blood as cholesteryl esters in the 
form of lipid-protein particles known as low-density lipoproteins (LDLs) (Fig- 
ure 13-50). When a cell needs cholesterol for membrane synthesis, it makes 
transmembrane receptor proteins for LDL and inserts them into its plasma 
membrane. Once in the plasma membrane, the LDL receptors diffuse until they 
associate with clathrin-coated pits that are in the process of forming (Figure 
13-51A). Since coated pits constantly pinch off to form coated vesicles, any 
LDL particles bound to LDL receptors in the coated pits are rapidly internalized 
in coated vesicles. After shedding their clathrin coats, the vesicles deliver their 
contents to early endosomes, which are located near the cell periphery. Once the 
LDL and LDL receptors encounter the low pH in the endosomes, LDL is released 
from its receptor and is delivered via late endosomes to lysosomes. There, the 
cholesteryl esters in the LDL particles are hydrolyzed to free cholesterol, which is 
now available to the cell for new membrane synthesis. If too much free choles- 
terol accumulates in a cell, the cell shuts off both its own cholesterol synthesis 
and the synthesis of LDL receptors, so that it ceases either to make or to take up 
cholesterol. 

This regulated pathway for cholesterol uptake is disrupted in individuals 
who inherit defective genes encoding LDL receptors. The resulting high levels 
of blood cholesterol predispose these individuals to develop atherosclerosis 
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Figure 1 3-50 A low-density lipoprotein 
(LDL) particle. Each spherical particle has 
a mass of 3 x 1 6 daltons. It contains a 
core of about 1 500 cholesterol molecules 
esterified to long-chain fatty acids. A lipid 
monolayer composed of about 800 
phospholipid and 500 unesterified 
cholesterol molecules surrounds the core 
of cholesterol esters. A single molecule of 
a 500,000-dalton protein organizes the 
particle and mediates the specific binding 
of LDL to cell-surface LDL receptors. 
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Figure 13-51 Normal and mutant LDL 
receptors. (A) LDL receptors binding to a 
coated pit in the plasma membrane of a 
normal cell. The human LDL receptor is a 
single-pass transmembrane glycoprotein 
composed of about 840 amino acids, 
only 50 of which are on the cytoplasmic 
side of the membrane. (B) A mutant cell 
in which the LDL receptors are abnormal 
and lack the site in the cytoplasmic 
domain that enables them to bind to 
adaptor proteins in the clathrin-coated 
pits. Such cells bind LDL but cannot 
ingest it. In most human populations, 1 in 
500 individuals inherits one defective LDL 
receptor gene and, as a result, has an 
increased risk of a heart attack caused by 
atherosclerosis. 
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prematurely, and many would die at an early age of heart attacks resulting from 
coronary artery disease if they were not treated with drugs that lower the level of 
blood cholesterol. In some cases, the receptor is lacking altogether. In others, the 
receptors are defective — in either the extracellular binding site for LDL or the 
intracellular binding site that attaches the receptor to the coat of a clathrin- 
coated pit (see Figure 13-51B). In the latter case, normal numbers of LDL recep- 
tors are present, but they fail to become localized in clathrin-coated pits. 
Although LDL binds to the surface of these mutant cells, it is not internalized, 
directly demonstrating the importance of clathrin-coated pits for the receptor- 
mediated endocytosis of cholesterol. 

More than 25 distinct receptors are known to participate in receptor-medi- 
ated endocytosis of different types of molecules. They all apparently use 
clathrin- dependent internalization routes and are guided into clathrin-coated 
pits by signals in their cytoplasmic tails that bind to adaptor proteins in the 
clathrin coat. Many of these receptors, like the LDL receptor, enter coated pits 
irrespective of whether they have bound their specific ligands. Others enter 
preferentially when bound to a specific ligand, suggesting that a ligand- 
induced conformational change is required for them to activate the signal 
sequence that guides them into the pits. Since most plasma membrane pro- 
teins fail to become concentrated in clathrin-coated pits, the pits serve as 
molecular filters, preferentially collecting certain plasma membrane proteins 
(receptors) over others. 

Electron-microscope studies of cultured cells exposed simultaneously to 
different labeled ligands demonstrate that many kinds of receptors can cluster 
in the same coated pit, whereas some other receptors cluster in different 
clathrin-coated pits. The plasma membrane of one clathrin-coated pit can prob- 
ably accommodate up to 1000 receptors of assorted varieties. Although all of the 
receptor-ligand complexes that use this endocytic pathway are apparently 
delivered to the same endosomal compartment, the subsequent fates of the 
endocytosed molecules vary, as we discuss next. 



Endocytosed Materials That Are Not Retrieved from Endosomes 
End Up in Lysosomes 

The endosomal compartments of a cell can be complex. They can be made visi- 
ble in the electron microscope by adding a readily detectable tracer molecule, 
such as the enzyme peroxidase, to the extracellular medium and leaving the cells 
for various lengths of time to take it up by endocytosis. The distribution of the 
molecule after its uptake reveals the endosomal compartments as a set of het- 
erogeneous, membrane-enclosed tubes extending from the periphery of the cell 
to the perinuclear region, where it is often close to the Golgi apparatus. Two 
sequential sets of endosomes can be readily distinguished in such labeling 
experiments. The tracer molecule appears within a minute or so in early endo- 
somes, just beneath the plasma membrane. After 5-15 minutes, it has moved to 
late endosomes, close to the Golgi apparatus and near the nucleus. Early and 
late endosomes differ in their protein compositions. The transition from early to 
late endosomes is accompanied by the release of Rab5 and the binding of Rab7, 
for example. 

As mentioned earlier, a vacuolar H + ATPase in the endosomal membrane, 
which pumps H + into endosomes from the cytosol, keeps the lumen of the endo- 
somal compartments acidic (pH ~6). In general, later endosomes are more 
acidic than early endosomes. This gradient of acidic environments has a crucial 
role in the function of these organelles. 

We have already seen how endocytosed materials mix in early endosomes 
with newly synthesized acid hydrolases and eventually end up being degraded in 
lysosomes. Many molecules, however, are specifically diverted from this journey 
to destruction. They are instead recycled from the early endosomes back to the 
plasma membrane via transport vesicles. Only molecules that are not retrieved 
from endosomes in this way are delivered to lysosomes for degradation. 
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Although mild digestion may start in early endosomes, many hydrolases are 
synthesized and delivered there as proenzymes, called zymogens, which contain 
extra inhibitory domains at their N-terminus that keep the hydrolase inactive 
until these domains are proteolytically removed. The hydrolases are activated 
when late endosomes become endolysosomes as the result of fusion with pre- 
existing lysosomes, which contain a full complement of active hydrolases that 
digest off the inhibitory domains from the newly synthesized enzymes. More- 
over, the pH in early endosomes is not low enough to activate lysosomal hydro- 
lases optimally. By these means, cells can retrieve most membrane proteins 
from early endosomes and recycle them back to the plasma membrane. 



Specific Proteins Are Retrieved from Early Endosomes and 
Returned to the Plasma Membrane 

Early endosomes form a compartment that acts as the main sorting station in 
the endocytic pathway, just as the cis and trans Golgi networks serve this func- 
tion in the biosynthetic-secretory pathway. In the mildly acidic environment of 
the early endosome, many internalized receptor proteins change their confor- 
mation and release their ligand, as already discussed for the M6P receptors. 
Those endocytosed ligands that dissociate from their receptors in the early 
endosome are usually doomed to destruction in lysosomes, along with the other 
soluble contents of the endosome. Some other endocytosed ligands, however, 
remain bound to their receptors, and thereby share the fate of the receptors. 

The fates of receptors — and of any ligands remaining bound to them — vary 
according to the specific type of receptor. (1) Most receptors are recycled and 
return to the same plasma membrane domain from which they came; (2) some 
proceed to a different domain of the plasma membrane, thereby mediating 
transcytosis; and (3) some progress to lysosomes, where they are degraded (Fig- 
ure 13-52). 

The LDL receptor follows the first pathway. It dissociates from its ligand, 
LDL, in the early endosome and is recycled back to the plasma membrane for 
reuse, leaving the discharged LDL to be carried to lysosomes (Figure 13-53). The 
recycling transport vesicles bud from long, narrow tubules that extend from the 
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Figure 13-52 Possible fates for 
transmembrane receptor proteins that 
have been endocytosed. Three pathways 
from the endosomal compartment in an 
epithelial cell are shown. Retrieved 
receptors are returned (1) to the same 
plasma membrane domain from which 
they came (recycling) or (2) to a different 
domain of the plasma membrane 
(transcytosis). (3) Receptors that are not 
specifically retrieved from endosomes 
follow the pathway from the endosomal 
compartment to lysosomes, where they 
are degraded (degradation). The 
formation of oligomeric aggregates in 
the endosomal membrane may be one of 
the signals that guide receptors into the 
degradative pathway. If the ligand that is 
endocytosed with its receptor stays 
bound to the receptor in the acidic 
environment of the endosome, it follows 
the same pathway as the receptor; 
otherwise it is delivered to lysosomes. 
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Figure 13-53 The receptor-mediated 
endocytosis of LDL. <GCTA> Note that 
the LDL dissociates from its receptors in 
the acidic environment of the early 
endosome. After a number of steps 
(shown in Figure 13-55), the LDL ends up 
in lysosomes, where it is degraded to 
release free cholesterol. In contrast, the 
LDL receptors are returned to the plasma 
membrane via clathrin-coated transport 
vesicles that bud off from the tubular 
region of the early endosome, as shown. 
For simplicity, only one LDL receptor is 
shown entering the cell and returning to 
the plasma membrane. Whether it is 
occupied or not, an LDL receptor typically 
makes one round trip into the cell and 
back to the plasma membrane every 
10 minutes, making a total of several 
hundred trips in its 20-hour lifespan. 
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early endosomes. It is likely that the geometry of these tubules helps the sorting 
process: because tubules have a large membrane area enclosing a small volume, 
membrane proteins become enriched over soluble proteins. Transport vesicles 
that return material to the plasma membrane begin budding from the tubules, 
but tubular portions of the early endosome also pinch off and fuse with one 
another to form recycling endosomes, which serve as way-stations for the traffic 
between early endosomes and the plasma membrane. This recycling pathway 
operates continuously, compensating for the continuous endocytosis occurring 
at the plasma membrane. 

The transferrin receptor follows a similar recycling pathway as the LDL 
receptor, but unlike the LDL receptor it also recycles its ligand. Transferrin is a 
soluble protein that carries iron in the blood. Cell-surface transferrin receptors 
deliver transferrin with its bound iron to early endosomes by receptor- mediated 
endocytosis. The low pH in the endosome induces transferrin to release its 
bound iron, but the iron-free transferrin itself (called apotransferrin) remains 
bound to its receptor. The receptor-apotransferrin complex enters the tubular 
extensions of the early endosome and from there is recycled back to the plasma 
membrane (Figure 13-54). When the apotransferrin returns to the neutral pH of 
the extracellular fluid, it dissociates from the receptor and is thereby freed to 
pick up more iron and begin the cycle again. Thus, transferrin shuttles back and 
forth between the extracellular fluid and the endosomal compartment, avoiding 
lysosomes and delivering iron to the cell interior, as needed for cells to grow and 
proliferate. 

The second pathway that endocytosed receptors can follow from endo- 
somes is taken by many signaling receptors, including opioid receptors (see 
Figure 13-54) and the receptor that binds epidermal growth factor (EGF). EGF is 
a small, extracellular signal protein that stimulates epidermal and various other 
cells to divide. Unlike LDL receptors, EGF receptors accumulate in clathrin- 
coated pits only after binding EGF, and most of them do not recycle but are 
degraded in lysosomes, along with the ingested EGF. EGF binding therefore first 
activates intracellular signaling pathways and then leads to a decrease in the 
concentration of EGF receptors on the cell surface, a process called receptor 
down-regulation that reduces the cell's subsequent sensitivity to EGF (see Fig- 
ure 15-29). 



Figure 1 3-54 Sorting of membrane 
proteins in the endocytic pathway. 

Transferrin receptors mediate iron uptake 
and constitutively cycle between 
endosomes and the plasma membrane. 
By contrast, activated opioid receptors 
are down-regulated by endocytosis 
followed by degradation in lysosomes; 
they are activated by opiates such as 
morphine and heroin, as well as by 
endogenous peptides called enkephalins 
and endorphins. Endocytosis of both 
types of receptors starts in clathrin- 
coated pits. The receptors are then 
delivered to early endosomes, where 
their pathways part: transferrin receptors 
are sorted to the recycling endosomes, 
whereas opioid receptors are sorted to 
late endosomes. The micrograph shows 
both receptors — labeled with different 
fluorescent dyes — 30 min after 
endocytosis (transferrin receptors are 
labeled in red and opioid receptors in 
green). At this time, some early 
endosomes still contain both receptors 
and are seen as yellow, due to the overlap 
of red and green light emitted from the 
fluorescent dyes. By contrast, recycling 
endosomes and late endosomes are 
selectively enriched in either transferrin 
or opioid receptors, respectively, and 
therefore appear as distinct red and 
green structures. (Courtesy of Mark von 
Zastrow.) 
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Clathrin-dependent receptor-mediated endocytosis is highly regulated. 
The receptors are first covalently modified with the small protein ubiquitin. 
But, unlike polyubiquitylation, which adds a chain of ubiquitins that typically 
targets a protein for degradation in proteasomes (discussed in Chapter 6), 
ubiquitin tagging for sorting into the clathrin-dependent endocytic pathway 
adds one or more single ubiquitin molecules to the protein — a process called 
monoubiquitylation or multiubiquitylation, respectively. Ubiquitin-binding 
proteins recognize the attached ubiquitin and help direct the modified recep- 
tors into clathrin-coated pits. After delivery to the endosome, other ubiquitin- 
binding proteins recognize the ubiquitin and help mediate sorting steps. 



Multivesicular Bodies Form on the Pathway to Late Endosomes 

As previously stated, many of the endocytosed molecules move from the early to 
the late endosomal compartment. In this process, early endosomes migrate 
slowly along microtubules toward the cell interior, while shedding membrane 
tubules and vesicles that recycle material to the plasma membrane and TGN. At 
the same time, the membrane enclosing the migrating endosomes forms invagi- 
nating buds that pinch off and form internal vesicles; they are then called mul- 
tivesicular bodies (Figure 13-55). Multivesicular bodies eventually fuse with a 
late endosomal compartment or with each other to become late endosomes. At 
the end of this pathway, the late endosomes convert to endolysosomes and lyso- 
somes as a result of both their fusion with preexisting lysosomes and progressive 
acidification (Figure 13-56). 

The multivesicular bodies carry those endocytosed membrane proteins that 
are to be degraded. As part of the protein-sorting process, receptors destined for 
degradation, such as occupied EGF receptors described previously, selectively 
partition into the invaginating membrane of the multivesicular bodies. In this 
way, both the receptors and any signaling proteins strongly bound to them are 
made fully accessible to the digestive enzymes that will degrade them (Figure 
13-57). In addition to endocytosed membrane proteins, multivesicular bodies 
also contain the soluble content of early endosomes destined for late endo- 
somes and digestion in lysosomes. 
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Figure 13-55 Electron micrograph of a 
multivesicular body in a plant cell. The 

large amount of internal membrane will 
be delivered to the vacuole, the plant 
equivalent of the lysosome, for digestion. 
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Figure 1 3-56 Details of the endocytic 
pathway from the plasma membrane to 
lysosomes. Maturation of early 
endosomes to late endosomes occurs 
through the formation of multivesicular 
bodies, which contain large amounts of 
invaginated membrane and internal 
vesicles (hence their name). Multivesicular 
bodies move inward along microtubules, 
continually shedding transport vesicles 
that recycle components to the plasma 
membrane. They gradually convert into 
late endosomes, either by fusing with each 
other or by fusing with preexisting late 
endosomes. The late endosomes no longer 
send vesicles to the plasma membrane. 
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Figure 1 3-57 The sequestration of endocytosed proteins into internal 
membranes of multivesicular bodies. Eventually, proteases and lipases in 
lysosomes digest all of the internal membranes within multivesicular 
bodies produced by the invaginations. The invagination processes are 
essential to achieve complete digestion of endocytosed membrane 
proteins: because the outer membrane of the multivesicular body becomes 
continuous with the lysosomal membrane, for example, lysosomal 
hydrolases could not digest the cytosolic domains of endocytosed 
transmembrane proteins such as the EGF receptor shown here, if the 
protein were not localized in internal vesicles. 



Sorting into the internal vesicles of a multivesicular body requires one or 
multiple ubiquitin tags, which are added to the cytosolic domains of membrane 
proteins. These tags initially help guide the proteins into clathrin-coated vesi- 
cles. Once delivered to the endosomal membrane, the ubiquitin tags are recog- 
nized again, this time by a series of cytosolic protein complexes, called ESCRT-O, 
-I, -II, and -7//, which bind sequentially, handing the ubiquitylated cargo from 
one complex to the next, and ultimately mediate the sorting process into the 
internal vesicles of multivesicular bodies (Figure 13-58). Membrane invagina- 
tion into multivesicular bodies also depends on a lipid kinase that phosphory- 
lates phosphatidylinositol to produce PI(3)R which serves as an additional dock- 
ing site for the ESCRT complexes; these complexes require both PI(3)P and the 
presence of ubiquitylated cargo proteins to bind to the endosomal membrane. 
A second PI kinase adds another phosphate group to PI(3)P, producing PI(3,5)P 2 , 
which is required for ESCRT-III to form large multimeric assemblies on the 
membrane. It is not known how the assembly of ESCRT complexes ultimately 
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Figure 1 3-58 Sorting of endocytosed membrane proteins into the internal vesicles of a multivesicular body. A series of 
complex binding events passes the ubiquitylated cargo proteins sequentially from one ESCRT complex to the next 
enventually concentrating them in membrane areas that bud away from the cytosol into the lumen of the endosome to 
form the internal membrane vesicles of the multivesicular body. ESCRT complexes are soluble in the cytosol and are 
recruited to the membrane as needed. First, ESCRT-0 binds both the ubiquitin attached to the cargo protein and to PI(3)P 
head groups. ESCRT-0 dissociates from the membrane, handing the ubiquitylated cargo protein over to the ESCRT-I 
complex; next ESCRT-I dissociates, handing the cargo protein over to ESCRT-II complex; and finally ESCRT-II dissociates and 
ESCRT-III complexes assemble on the membrane. By contrast to ESCRT-0, -I, and -II, ESCRT-III does not bind to the 
ubiquitylated cargo directly. Instead its assembly into expansive multimeric structures is thought to confine the cargo 
molecules into specialized membrane areas that then invaginate, leaving the ESCRT components on the endosome surface. 
An AAA-ATPase {red cylinders) then disassembles the ESCRT-III complexes so that they can be reused. 
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drives the invagination and pinching-off processes required to form the internal 
vesicles but the ESCRT complexes themselves are not part of the invaginating 
membranes. 

Mutant cells compromised in ESCRT function display signaling defects. In 
such cells, activated receptors cannot be down-regulated by endocytosis and 
packaged into multivesicular bodies and therefore mediate prolonged signaling, 
which can lead to uncontrolled cell proliferation and cancer. 

The same ESCRT machinery that drives the internal budding from the endo- 
somal membrane to form multivesicular bodies is also used by HIV ebola, and 
other enveloped viruses to bud from the plasma membrane into the extracellu- 
lar space. The two processes are topologically equivalent, as they both involve 
budding away from the cytosolic surface of the membrane (Figure 13-59). 



Transcytosis Transfers Macromolecules Across Epithelial Cell 
Sheets 

Some receptors on the surface of polarized epithelial cells transfer specific 
macromolecules from one extracellular space to another by transcytosis (Fig- 
ure 13-60). These receptors are endocytosed and then follow a pathway from 
endosomes to a different plasma membrane domain (see Figure 13-52). A new- 
born rat, for example, obtains antibodies from its mother's milk (which help 
protect it against infection) by transporting them across the epithelium of its 
gut. The lumen of the gut is acidic, and, at this low pH, the antibodies in the 
milk bind to specific receptors on the apical (absorptive) surface of the gut 
epithelial cells. The receptor-antibody complexes are internalized via clathrin- 
coated pits and vesicles and are delivered to early endosomes. The complexes 
remain intact and are retrieved in transport vesicles that bud from the early 
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Figure 1 3-59 ESCRT complexes in 
multivesicular body formation and virus 
budding. In the two topologically 
equivalent processes indicated by the 
arrows, ESCRT complexes shape 
membranes into buds that bulge away 
from the cytosol. 
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Figure 1 3-60 The role of recycling 
endosomes in transcytosis. Recycling 
endosomes form a way-station on the 
transcytotic pathway. In the example 
shown here, an antibody receptor on a 
gut epithelial cell binds antibody and is 
endocytosed, eventually carrying the 
antibody to the basolateral plasma 
membrane. The receptor is called an Fc 
receptor because it binds the Fc part of 
the antibody (discussed in Chapter 25). 
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Figure 13-61 Storage of plasma 
membrane proteins in recycling 
endosomes. Recycling endosomes can 
serve as an intracellular pool of 
specialized plasma membrane proteins 
that can be mobilized when needed. In 
the example shown, insulin binding to 
the insulin receptor triggers an 
intracellular signaling pathway that 
causes the rapid insertion of glucose 
transporters into the plasma membrane 
of a fat or muscle cell, greatly increasing 
glucose intake. 



endosome and subsequently fuse with the basolateral domain of the plasma 
membrane. On exposure to the neutral pH of the extracellular fluid that bathes 
the basolateral surface of the cells, the antibodies dissociate from their receptors 
and eventually enter the newborn's bloodstream. 

The transcytotic pathway from the early endosome to the plasma mem- 
brane is not direct. The receptors first move from the early endosome to an 
intermediate endosomal compartment, the recycling endosome described pre- 
viously (see Figure 13-60). The variety of pathways that different receptors fol- 
low from endosomes implies that, in addition to binding sites for their ligands 
and binding sites for coated pits, many receptors also possess sorting signals 
that guide them into the appropriate type of transport vesicle leaving the endo- 
some and moving to the appropriate target membrane in the cell. 

A unique property of recycling endosomes is that cells can regulate the exit 
of membrane proteins from the compartment. Thus, cells can adjust the flux of 
proteins through the transcytotic pathway according to need. Although the 
mechanism is uncertain, this regulation allows recycling endosomes to play an 
important part in adjusting the concentration of specific plasma membrane 
proteins. Fat cells and muscle cells, for example, contain large intracellular pools 
of the glucose transporters that are responsible for the uptake of glucose across 
the plasma membrane. These membrane transport proteins are stored in spe- 
cialized recycling endosomes until the hormone insulin stimulates the cell to 
increase its rate of glucose uptake. In response to the insulin signal, transport 
vesicles rapidly bud from the recycling endosome and deliver large numbers of 
glucose transporters to the plasma membrane, thereby greatly increasing the 
rate of glucose uptake into the cell (Figure 13-61). 



Epithelial Cells Have Two Distinct Early Endosomal Compartments 
but a Common Late Endosomal Compartment 

In polarized epithelial cells, endocytosis occurs from both the basolateral 
domain and the apical domain of the plasma membrane. Material endocytosed 
from either domain first enters an early endosomal compartment that is unique 
to that domain. This arrangement allows endocytosed receptors to be recycled 
back to their original membrane domain, unless they contain sorting signals 
that mark them for transcytosis to the other domain. Molecules endocytosed 
from either plasma membrane domain that are not retrieved from the early 
endosomes end up in a common late endosomal compartment near the cell 
center and are eventually degraded in lysosomes (Figure 13-62). 

Whether cells contain a few connected or many unconnected endosomal 
compartments seems to depend on the cell type and the physiological state of 
the cell. Like many other membrane-enclosed organelles, endosomes of the 
same type can readily fuse with one another (an example of homotypic fusion, 
discussed earlier) to create large continuous endosomes. 



TRANSPORT FROM THE TRANS GOLGI NETWORK TO THE CELL EXTERIOR: EXOCYTOSIS 



799 




extracellular space 



Summary 



Cells ingest fluid, molecules, and particles by endocytosis, in which localized regions of 
the plasma membrane invaginate and pinch off to form endocytic vesicles. Many of the 
endocytosed molecules and particles eventually end up in lysosomes, where they are 
degraded. Endocytosis occurs both constitutively and as a triggered response to extra- 
cellular signals. Endocytosis is so extensive in many cells that a large fraction of the 
plasma membrane is internalized every hour. The cells remain the same size because 
most of the plasma membrane components (proteins and lipid) that are endocytosed 
are continually returned to the cell surface by exocytosis. This large-scale 
endocytic-exocytic cycle is mediated largely by clathrin- coated pits and vesicles. 

Many cell-surface receptors that bind specific extracellular macromolecules 
become tagged with ubiquitin, which guides them into clathrin-coated pits. As a 
result, these receptors and their ligands are efficiently internalized in clathrin-coated 
vesicles, a process called receptor-mediated endocytosis. The coated vesicles rapidly 
shed their clathrin coats and fuse with early endosomes. 

Most of the ligands dissociate from their receptors in the acidic environment of the 
endosome and eventually end up in lysosomes, while most of the receptors are recycled 
via transport vesicles back to the cell surface for reuse. But receptor-ligand complexes 
can follow other pathways from the endosomal compartment. In some cases, both the 
receptor and the ligand end up being degraded in lysosomes, resulting in receptor 
down- regulation; in these cases, the ubiquitin- tagged receptors recruit various ESCRT 
complexes, which drive the invagination and pinching- off of endosomal membrane 
vesicles to form multivesicular bodies. In other cases, both receptor and ligand are 
transferred to a different plasma membrane domain, causing the ligand to be released 
at a surface of the cell that differs from the membrane where it originated, a process 
called transcytosis. The transcytosis pathway involves recycling endosomes, where 
endocytosed plasma membrane proteins can be stored until they are needed. 



TRANSPORT FROM THE TRANS GOLGI NETWORK 
TO THE CELL EXTERIOR: EXOCYTOSIS 

Having considered the cell's internal digestive system and the various types of 
incoming membrane traffic that converge on lysosomes, we now return to the 
Golgi apparatus and examine the secretory pathways that lead out to the cell 
exterior. Transport vesicles destined for the plasma membrane normally leave 



Figure 13-62 The two distinct early 
endosomal compartments in an 
epithelial cell. The basolateral and the 
apical domains of the plasma membrane 
communicate with separate early 
endosomal compartments. However, 
endocytosed molecules from both 
domains that do not contain signals for 
recycling or transcytosis meet in a 
common late endosomal compartment 
before being digested in lysosomes. 
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Figure 1 3-63 The constitutive and 
regulated secretory pathways. <ACAG> 
The two pathways diverge in the trans 
Golgi network. The constitutive secretory 
pathway operates in all cells. Many soluble 
proteins are continually secreted from the 
cell by this pathway, which also supplies 
the plasma membrane with newly 
synthesized lipids and proteins. Specialized 
secretory cells also have a regulated 
secretory pathway, by which selected 
proteins in the trans Golgi network are 
diverted into secretory vesicles, where the 
proteins are concentrated and stored until 
an extracellular signal stimulates their 
secretion. The regulated secretion of small 
molecules, such as histamine and 
neurotransmitters, occurs by a similar 
pathway; these molecules are actively 
transported from the cytosol into 
preformed secretory vesicles. There they 
are often complexed to specific 
macromolecules (proteoglycans, for 
histamine), so that they can be stored at 
high concentration without generating an 
excessively high osmotic pressure. 



the TGN in a steady stream as irregularly shaped tubules. The membrane pro- 
teins and the lipids in these vesicles provide new components for the cell's 
plasma membrane, while the soluble proteins inside the vesicles are secreted to 
the extracellular space. The fusion of the vesicles with the plasma membrane is 
called exocytosis. In this way, for example, cells produce and secrete most of the 
proteoglycans and glycoproteins of the extracellular matrix, which is discussed 
in Chapter 19. 

All cells require this constitutive secretory pathway, which operates contin- 
uously. Specialized secretory cells, however, have a second secretory pathway in 
which soluble proteins and other substances are initially stored in secretory vesi- 
cles for later release. This is the regulated secretory pathway, found mainly in 
cells specialized for secreting products rapidly on demand — such as hormones, 
neurotransmitters, or digestive enzymes (Figure 13-63). In this section, we con- 
sider the role of the Golgi apparatus in both of these secretory pathways and 
compare the two mechanisms of secretion. 



Many Proteins and Lipids Seem to Be Carried Automatically from 
the Golgi Apparatus to the Cell Surface 

A cell capable of regulated secretion must separate at least three classes of pro- 
teins before they leave the trans Golgi network— those destined for lysosomes 
(via endosomes), those destined for secretory vesicles, and those destined for 
immediate delivery to the cell surface (Figure 13-64). We have already noted 
that proteins destined for lysosomes are tagged for packaging into specific 
departing vesicles (with mannose 6-phosphate in the case of lysosomal hydro- 
lases), and analogous signals are thought to direct secretory proteins into secre- 
tory vesicles. The non-selective constitutive secretory pathway transports most 
other proteins directly to the cell surface. Because entry into this pathway does 
not require a particular signal, it is also called the default pathway. Thus, in an 
unpolarized cell such as a white blood cell or a fibroblast, it seems that any pro- 
tein in the lumen of the Golgi apparatus is automatically carried by the consti- 
tutive pathway to the cell surface unless it is specifically returned to the ER, 
retained as a resident protein in the Golgi apparatus itself, or selected for the 
pathways that lead to regulated secretion or to lysosomes. In polarized cells, 
where different products have to be delivered to different domains of the cell 
surface, we shall see that the options are more complex. 
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Figure 13-64 The three best-understood 
pathways of protein sorting in the trans 
Golgi network. (1) Proteins with the 
mannose 6-phosphate (M6P) marker are 
diverted to lysosomes (via endosomes) in 
clathrin-coated transport vesicles (see 
Figure 1 3-44). (2) Proteins with signals 
directing them to secretory vesicles are 
concentrated in such vesicles as part of a 
regulated secretory pathway that is present 
only in specialized secretory cells. (3) In 
unpolarized cells, a constitutive secretory 
pathway delivers proteins with no special 
features to the cell surface. In polarized 
cells, such as epithelial cells, however, 
secreted and plasma membrane proteins 
are selectively directed to either the apical 
or the basolateral plasma membrane 
domain, so a specific signal must mediate 
at least one of these two pathways, as we 
discuss later. 



Secretory Vesicles Bud from the Trans Golgi Network 

Cells that are specialized for secreting some of their products rapidly on demand 
concentrate and store these products in secretory vesicles (often called secretory 
granules or dense -core vesicles because they have dense cores when viewed in 
the electron microscope). Secretory vesicles form from the trans Golgi network, 
and they release their contents to the cell exterior by exocytosis in response to 
specific signals. The secreted product can be either a small molecule (such as 
histamine) or a protein (such as a hormone or digestive enzyme). 

Proteins destined for secretory vesicles (called secretory proteins) are pack- 
aged into appropriate vesicles in the trans Golgi network by a mechanism that is 
thought to involve the selective aggregation of the secretory proteins. Clumps of 
aggregated, electron-dense material can be detected by electron microscopy in 
the lumen of the trans Golgi network. The signal that directs secretory proteins 
into such aggregates is not known, but it is thought to be composed of signal 
patches that are common to proteins of this class. When a gene encoding a 
secretory protein is artificially expressed in a secretory cell that normally does 
not make the protein, the foreign protein is appropriately packaged into secre- 
tory vesicles. This observation shows that, although the proteins that an indi- 
vidual cell expresses and packages in secretory vesicles differ, they all contain 
common sorting signals, which function properly even when the proteins are 
expressed in cells that do not normally make them. 

It is unclear how the aggregates of secretory proteins are segregated into 
secretory vesicles. Secretory vesicles have unique proteins in their membrane, 
some of which might serve as receptors for aggregated protein in the trans Golgi 
network. The aggregates are much too big, however, for each molecule of the 
secreted protein to be bound by its own cargo receptor, as proposed for transport 
of the lysosomal enzymes. The uptake of the aggregates into secretory vesicles 
may therefore more closely resemble the uptake of particles by phagocytosis at 
the cell surface, where the plasma membrane zippers up around large structures. 

Initially, most of the membrane of the secretory vesicles that leave the trans 
Golgi network is only loosely wrapped around the clusters of aggregated secre- 
tory proteins. Morphologically, these immature secretory vesicles resemble 
dilated trans Golgi cisternae that have pinched off from the Golgi stack. As the 
vesicles mature, they can fuse with one another and their contents become con- 
centrated (Figure 13-65A), probably as the result of both the continuous 
retrieval of membrane that is recycled back to late endosomes and the TGN and 
the progressive acidification of the vesicle lumen that results from the increas- 
ing concentration of ATP-driven H + pumps in the vesicle membrane. Recall that 
V-type ATPases acidify all endocytic and exocytic organelles (see Figure 13-36). 
The degree of concentration of proteins during the formation and maturation 
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Figure 13-65 The formation of secretory vesicles. (A) Secretory proteins become segregated and highly concentrated in 
secretory vesicles by two mechanisms. First, they aggregate in the ionic environment of the trans Golgi network; often the 
aggregates become more condensed as secretory vesicles mature and their lumen becomes more acidic. Second, clathrin- 
coated vesicles retrieve excess membrane and lumenal content present in immature secretory vesicles as the secretory 
vesicles mature. (B) This electron micrograph shows secretory vesicles forming from the trans Golgi network in an insulin- 
secreting p cell of the pancreas. An antibody conjugated to gold spheres (black dots) has been used to locate clathrin 
molecules. The immature secretory vesicles (open arrow), which contain insulin precursor protein (proinsulin), contain 
clathrin patches. Clathrin coats are no longer seen on the mature secretory vesicle, which has a highly condensed core 
(solid arrow). (Courtesy of Lelio Orci.) 



of secretory vesicles is only a small part of the total 200-400-fold concentration 
of these proteins that occurs after they leave the ER. Secretory and membrane 
proteins become concentrated as they move from the ER through the Golgi 
apparatus because of an extensive retrograde retrieval process mediated by 
COPI-coated transport vesicles that exclude them (see Figure 13-24). 

Membrane recycling is important for returning Golgi components to the 
Golgi apparatus, as well as for concentrating the contents of secretory vesicles. 
The vesicles that mediate this retrieval originate as clathrin-coated buds on the 
surface of immature secretory vesicles, often being seen even on budding secre- 
tory vesicles that have not yet been severed from the Golgi stack (see Figure 
13-65B). 

Because the final mature secretory vesicles are so densely filled with con- 
tents, the secretory cell can disgorge large amounts of material promptly by exo- 
cytosis when triggered to do so (Figure 13-66). 
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Figure 1 3-66 Exocytosis of secretory vesicles. The electron micrograph shows the release of insulin from a secretory 
vesicle of a pancreatic (5 cell. (Courtesy of Lelio Orci, from L. Orci, J.-D. Vassali and A. Perrelet, Sci. Am. 256:85-94, 1 988. With 
permission from Scientific American.) 
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Proteins Are Often Proteolytically Processed During the 
Formation of Secretory Vesicles 

Concentration is not the only process to which secretory proteins are subject as 
the secretory vesicles mature. Many polypeptide hormones and neuropeptides, 
as well as many secreted hydrolytic enzymes, are synthesized as inactive protein 
precursors. Proteolysis is necessary to liberate the active molecules from these 
precursors. The cleavages begin in the trans Golgi network, and they continue in 
the secretory vesicles and sometimes in the extracellular fluid after secretion has 
occurred. Many secreted polypeptides have, for example, an N-terminal pro- 
peptide that is cleaved off to yield the mature protein. These proteins are thus 
synthesized as pre-pro-proteins, the pre-peptide consisting of the ER signal pep- 
tide that is cleaved off earlier in the rough ER (see Figure 12-38). In other cases, 
peptide -signaling molecules are made as polyproteins that contain multiple 
copies of the same amino acid sequence. In still more complex cases, a variety 
of peptide-signaling molecules are synthesized as parts of a single polyprotein 
that acts as a precursor for multiple end products, which are individually 
cleaved from the initial polypeptide chain. The same polyprotein may be pro- 
cessed in various ways to produce different peptides in different cell types (Fig- 
ure 13-67). 

Why is proteolytic processing so common in the secretory pathway? Some of 
the peptides produced in this way, such as the enkephalins (five-amino-acid neu- 
ropeptides with morphine-like activity), are undoubtedly too short in their 
mature forms to be co-translationally transported into the ER lumen or to include 
the necessary signal for packaging into secretory vesicles. In addition, for secreted 
hydrolytic enzymes — or any other protein whose activity could be harmful inside 
the cell that makes it— delaying activation of the protein until it reaches a secre- 
tory vesicle or until after it has been secreted has a clear advantage: it prevents it 
from acting prematurely inside the cell in which it is synthesized. 



Figure 13-67 Alternative processing 
pathways for the prohormone pro- 
opiomelanocortin. The initial cleavages 
are made by proteases that cut next to 
pairs of positively charged amino acids 
(Lys-Arg, Lys-Lys, Arg-Lys, or Arg-Arg pairs). 
Trimming reactions then produce the final 
secreted products. Different cell types 
produce different concentrations of 
individual processing enzymes, so that the 
same prohormone precursor is cleaved to 
produce different peptide hormones. In 
the anterior lobe of the pituitary gland, for 
example, only corticotropin (ACTH) and 
p-lipotropin are produced from 
proopiomelanocortin, whereas in the 
intermediate lobe of the pituitary gland 
mainly a-melanocyte stimulating 
hormone (a-MSH), y-lipotropin, p-MSH, 
and f}-endorphin are produced. 



Secretory Vesicles Wait Near the Plasma Membrane Until Signaled 
to Release Their Contents 

Once loaded, a secretory vesicle has to reach the site of secretion, which in some 
cells is far away from the Golgi apparatus. Nerve cells are the most extreme 
example. Secretory proteins, such as peptide neurotransmitters (neuropeptides) 
that are to be released from nerve terminals at the end of the axon, are made and 
packaged into vesicles in the cell body, where the ribosomes, ER, and Golgi 
apparatus are located. They must then travel along the axon to the nerve termi- 
nals, which can be a meter or more away. As discussed in Chapter 16, motor pro- 
teins propel the vesicles along axonal microtubules, whose uniform orientation 
guides the vesicles in the proper direction. Microtubules also guide vesicles to 
the cell surface for constitutive exocytosis. 

Whereas vesicles containing materials for constitutive release fuse with the 
plasma membrane once they arrive there, secretory vesicles in the regulated 
pathway wait at the membrane until the cell receives a signal to secrete, and they 
then fuse. The signal is often a chemical messenger, such as a hormone, that 
binds to receptors on the cell surface. The resulting activation of the receptors 
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generates intracellular signals, often including a transient increase in the con- 
centration of free Ca 2+ in the cytosol In nerve terminals, the initial signal for 
exocytosis is usually an electrical excitation (an action potential) triggered by a 
chemical transmitter binding to receptors elsewhere on the same cell surface. 
When the action potential reaches the nerve terminals, it causes an influx of 
Ca 2+ through voltage-gated Ca 2+ channels. The binding of Ca 2+ ions to specific 
sensors then triggers the secretory vesicles (called synaptic vesicles) to fuse with 
the plasma membrane and release their contents to the extracellular space (see 
Figure 11-35). 

The speed of transmitter release (taking only milliseconds) indicates that the 
proteins mediating the fusion reaction do not undergo complex, multistep rear- 
rangements. After vesicles have been docked to the presynaptic plasma mem- 
brane, they undergo a priming step, which prepares them for rapid fusion. The 
SNAREs may be partly paired, but their helices are not fully wound into the final 
four-helix bundle required for fusion (see Figure 13-18). Other proteins are 
thought to keep the SNAREs from completing the fusion reaction until the Ca 2+ 
influx releases this brake. At a typical synapse, only few of the docked vesicles 
seem to be primed and ready for exocytosis. The use of only a few vesicles at a 
time allows each synapse to fire over and over again in quick succession. With 
each firing, new synaptic vesicles become primed to replace those that have 
fused and released their contents. 



Regulated Exocytosis Can Be a Localized Response of the Plasma 
Membrane and Its Underlying Cytoplasm 

Histamine is a small molecule secreted by mast cells. It is released by the regu- 
lated pathway in response to specific ligands that bind to receptors on the mast 
cell surface (see Figure 25-27). Histamine causes many of the unpleasant symp- 
toms that accompany allergic reactions, such as itching and sneezing. When 
mast cells are incubated in fluid containing a soluble stimulant, massive exocy- 
tosis occurs all over the cell surface (Figure 13-68). But if the stimulating ligand 
is artificially attached to a solid bead so that it can interact only with a localized 
region of the mast cell surface, exocytosis is now restricted to the region where 
the cell contacts the bead (Figure 13-69). 

This experiment shows that individual segments of the plasma membrane 
can function independently in regulated exocytosis. As a result, the mast cell, 
unlike a nerve cell, does not respond as a whole when it is triggered; the activa- 
tion of receptors, the resulting intracellular signals, and the subsequent exocy- 
tosis are all localized in the particular region of the cell that has been excited. 
Similarly, localized exocytosis enables a killer lymphocyte to deliver the proteins 
that induce the death of a single infected target cell precisely without endan- 
gering normal neighboring cells (see Figure 25-46). 



Figure 13-68 Electron micrographs of 
exocytosis in rat mast cells. (A) An 

unstimulated mast cell. (B) This cell has 
been activated to secrete its stored 
histamine by a soluble extracellular 
stimulant. Histamine-containing 
secretory vesicles are dark, while those 
that have released their histamine are 
light. The material remaining in the spent 
vesicles consists of a network of 
proteoglycans to which the stored 
histamine was bound. Once a secretory 
vesicle has fused with the plasma 
membrane, the secretory vesicle 
membrane often serves as a target to 
which other secretory vesicles fuse. Thus, 
the cell in (B) contains several large 
cavities lined by the fused membranes of 
many spent secretory vesicles, which are 
now in continuity with the plasma 
membrane. This continuity is not always 
apparent in one plane of section through 
the cell. (From D. Lawson, C. Fewtrell, 
B. Gomperts and M. Raff, J. Exp. Med. 
142:391-402, 1975. With permission from 
The Rockefeller University Press.) 
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Figure 13-69 Exocytosis as a localized 

response. This electron micrograph 
shows a mast cell that has been activated 
to secrete histamine by a stimulant 
coupled to a large solid bead. Exocytosis 
has occurred only in the region of the cell 
that is in contact with the bead. (From 
D. Lawson, C. Fewtrell and M. Raff, J. Cell 
Biol. 79:394-400, 1978. With permission 
from The Rockefeller University Press.) 
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Secretory Vesicle Membrane Components Are Quickly Removed 
from the Plasma Membrane 

When a secretory vesicle fuses with the plasma membrane, its contents are dis- 
charged from the cell by exocytosis, and its membrane becomes part of the 
plasma membrane. Although this should greatly increase the surface area of the 
plasma membrane, it does so only transiently, because membrane components 
are removed from the surface by endocytosis almost as fast as they are added by 
exocytosis, a process reminiscent of the exocytosis-endocytosis cycle discussed 
earlier. After their removal from the plasma membrane, the proteins of the 
secretory vesicle membrane are either recycled or shuttled to lysosomes for 
degradation. The amount of secretory vesicle membrane that is temporarily 
added to the plasma membrane can be enormous: in a pancreatic acinar cell 
discharging digestive enzymes for delivery to the gut lumen, about 900 jam 2 of 
vesicle membrane is inserted into the apical plasma membrane (whose area is 
only 30 jam 2 ) when the cell is stimulated to secrete. 

Control of membrane traffic thus has a major role in maintaining the com- 
position of the various membranes of the cell. To maintain each membrane- 
enclosed compartment in the secretory and endocytotic pathways at a constant 
size, the balance between the outward and inward flows of membrane needs to 
be precisely regulated. For cells to grow, the forward flow needs to be greater 
than the retrograde flow, so that the membrane can increase in area. For cells to 
maintain a constant size, the forward and retrograde flows must be equal. We 
still know very little about the mechanisms that coordinate these flows. 

Some Regulated Exocytosis Events Serve to Enlarge the Plasma 
Membrane 

An important task of regulated exocytosis is to deliver more membrane to enlarge 
the surface area of a cell's plasma membrane when such a need arises. A spec- 
tacular example is the plasma membrane expansion that occurs during the cel- 
lularization process of a fly embryo, which initially is a single cell containing 
about 6000 nuclei surrounded by a single plasma membrane. Within tens of min- 
utes, the embryo is converted into the same number of cells. This process of eel - 
lularization requires a vast amount of new plasma membrane, which is added by 
a carefully orchestrated fusion of vesicles, eventually forming the plasma mem- 
branes that enclose the separate cells. Similar vesicle fusion events are required 
to enlarge the plasma membrane at the cleavage furrows of other animal and 
plant cells during cytokinesis, the process by which the two daughter cells sepa- 
rate after mitosis (discussed in Chapter 17). 

Many cells, especially those subjected to mechanical stresses, frequently 
experience small ruptures to their plasma membranes during their life span. In 
a remarkable process thought to involve both homotypic vesicle-vesicle fusion 
and exocytosis, a temporary cell-surface patch is quickly fashioned from locally 
available internal-membrane sources, such as lysosomes. In addition to provid- 
ing an emergency barrier against leaks, the patch also helps to reduce mem- 
brane tension over the wounded area allowing the "normal" bilayer to flow back 
together to restore continuity and seal the puncture. In wound repair, the fusion 
and exocytosis of vesicles is triggered by the sudden increase of Ca 2 \ which is 
abundant in the extracellular space and rushes into the cells as soon as the 
plasma membrane is punctured. Figure 13-70 shows three examples in which 
regulated exocytosis leads to plasma membrane expansion. 

Polarized Cells Direct Proteins from the Trans Golgi Network to 
the Appropriate Domain of the Plasma Membrane 

Most cells in tissues are polarized and have two (and sometimes more) distinct 
plasma membrane domains that are the targets of different types of vesicles. 









806 Chapter 1 3: Intracellular Vesicular Traffic 



(A) CLEAVAGE FURROW (B) PHAGOCYTOSIS 



WOUND 




(C) WOUND REPAIR 



Figure 1 3-70 Three examples of 
regulated exocytosis leading to plasma 
membrane enlargement. (A, B) The 
vesicles fusing with the plasma 
membrane during cytokinesis and 
phagocytosis are thought to be derived 
from endosomes, whereas (C) those 
involved in wound repair may be derived 
from lysosomes. Plasma membrane 
wound repair is very important in cells 
subjected to mechanical stresses, such as 
muscle cells. 



This raises the general problem of how the delivery of membrane from the Golgi 
apparatus is organized so as to maintain the differences between one cell-sur- 
face domain and another. A typical epithelial cell has an apical domain, which 
faces an internal cavity, or the outside world, and often has specialized features 
such as cilia or a brush border of microvilli; it also has a basolateral domain, 
which covers the rest of the cell. The two domains are separated by a ring of tight 
junctions (see Figure 19-24), which prevent proteins and lipids (in the outer 
leaflet of the lipid bilayer) from diffusing between the two domains, so that the 
compositions of the two domains are different. 

A nerve cell is another example of a polarized cell. The plasma membrane of 
its axon and nerve terminals is specialized for signaling to other cells, whereas the 
plasma membrane of its cell body and dendrites is specialized for receiving sig- 
nals from other nerve cells. The two domains have distinct protein compositions. 
Studies of protein traffic in nerve cells in culture suggest that, with regard to 
vesicular transport from the trans Golgi network to the cell surface, the plasma 
membrane of the nerve cell body and dendrites resembles the basolateral mem- 
brane of a polarized epithelial cell, while the plasma membrane of the axon and 
its nerve terminals resembles the apical membrane of such a cell (Figure 13-71). 
Thus, some proteins that are targeted to a specific domain in the epithelial cell 
are also targeted to the corresponding domain in the nerve cell. 



Different Strategies Guide Membrane Proteins and Lipids 
Selectively to the Correct Plasma Membrane Domains 

In principle, differences between plasma membrane domains need not depend 
on the targeted delivery of the appropriate membrane components. Instead, 
membrane components could be delivered to all regions of the cell surface 
indiscriminately but then be selectively stabilized in some locations and selec- 
tively eliminated in others. Although this strategy of random delivery followed 
by selective retention or removal seems to be used in certain cases, deliveries are 
often specifically directed to the appropriate membrane domain. Epithelial 
cells, for example, frequently secrete one set of products— such as digestive 
enzymes or mucus in cells lining the gut— at their apical surface and another set 
of products— such as components of the basal lamina— at their basolateral sur- 



Figure 13-71 A comparison of two types of polarized cells. In terms of 

the mechanisms used to direct proteins to them, the plasma membrane of 
the nerve cell body and dendrites resembles the basolateral plasma 
membrane domain of a polarized epithelial cell, whereas the plasma 
membrane of the axon and its nerve terminals resembles the apical 
domain of an epithelial cell. The different membrane domains of both the 
epithelial cell and the nerve cell are separated by a molecular fence, 
consisting of a meshwork of membrane proteins tightly associated with 
the underlying actin cytoskeleton; this barrier— called a tight junction in 
the epithelial cell and an axonal hillock in neurons— keeps membrane 
proteins from diffusing between the two distinct domains. 
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(A) DIRECT SORTING OF 

MEMBRANE PROTEINS IN 
THE TRANS GOLGI NETWORK 



(B) INDIRECT SORTING VIA ENDOSOMES 



Figure 1 3-72 Two ways of sorting 
plasma membrane proteins in a 
polarized epithelial cell. Newly 
synthesized proteins can reach their 
proper plasma membrane domain by 
either (A) a direct specific pathway or 
(B) an indirect pathway. In the indirect 
pathway, a protein is retrieved from the 
inappropriate plasma membrane domain 
by endocytosis and then transported to 
the correct domain via early 
endosomes— that is, by transcytosis.The 
indirect pathway is used in liver 
hepatocytes to deliver proteins to the 
apica! domain that lines bile ducts. 
However, in other cases, the direct 
pathway is used, as in the case of the 
lipid-raft-dependent mechanism in 
epithelial cells described in the text 



face. Thus, cells must have ways of directing vesicles carrying different cargoes 
to different plasma membrane domains. 

By examining polarized epithelial cells in culture, it has been found that pro- 
teins from the ER destined for different domains travel together until they reach 
the TGN. Here they are separated and dispatched in secretory or transport vesi- 
cles to the appropriate plasma membrane domain (Figure 13-72). 

The apical plasma membrane of most epithelial cells is greatly enriched in 
glycosphingolipids, which help protect this exposed surface from damage— 
for example, from the digestive enzymes and low pH in sites such as the gut or 
stomach, respectively. Similarly, plasma membrane proteins that are linked to 
the lipid bilayer by a glycosylphosphatidylinositol (GPI) anchor (discussed in 
Chapter 12) are found predominantly in the apical plasma membrane. If 
recombinant DNA techniques are used to attach a GPI anchor to a protein that 
would normally be delivered to the basolateral surface, the protein is usually 
delivered to the apical surface instead. GPI-anchored proteins are thought to 
be directed to the apical membrane because they associate with glycosphin- 
golipids in lipid rafts that form in the membrane of the TGN. As discussed in 
Chapter 10, lipid rafts form in the TGN and plasma membrane when gly- 
cosphingolipids and cholesterol self-associate into microaggregates (see Fig- 
ure 10-14). Having selected a unique set of cargo molecules, the rafts then bud 
from the trans Golgi network into transport vesicles destined for the apical 
plasma membrane. Thus, similar to the selective partitioning of some mem- 
brane proteins into the specialized lipid domains in caveolae at the plasma 
membrane, lipid domains may also participate in protein sorting in the TGN. 

Membrane proteins destined for delivery to the basolateral membrane con- 
tain sorting signals in their cytosolic tail. When present in an appropriate struc- 
tural context, these signals are recognized by coat proteins that package them 
into appropriate transport vesicles in the TGN. The same basolateral signals that 
are recognized in the TGN also function in endosomes to redirect the proteins 
back to the basolateral plasma membrane after they have been endocytosed. 



Synaptic Vesicles Can Form Directly from Endocytic Vesicles 

Nerve cells (and some endocrine cells) contain two types of secretory vesicles. As 
for all secretory cells, these cells package proteins and peptides in dense-cored 
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Figure 1 3-73 The formation of synaptic vesicles. These tiny uniform 
vesicles are found only in nerve cells and in some endocrine cells, where 
they store and secrete small-molecule neurotransmitters. The import of 
neurotransmitter directly into the small endocytic vesicles that form from 
the plasma membrane is mediated by membrane carrier proteins that 
function as antiports, being driven by a H + gradient maintained by proton 
pumps in the vesicle membrane. 
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secretory vesicles in the standard way for release by the regulated secretory path- 
way. In addition, however, they use another specialized class of tiny (~50-nm 
diameter) secretory vesicles called synaptic vesicles, which are generated in a 
different way. In nerve cells, these vesicles store small neurotransmitter 
molecules, such as acetylcholine, glutamate, glycine, and y-aminobutyric acid 
(GAB A), which mediate rapid signaling from cell to cell at chemical synapses. As 
discussed earlier, when an action potential arrives at a nerve terminal, it triggers 
the vesicles to release their contents within a fraction of a millisecond. Some neu- 
rons fire more than 1000 times per second, releasing neurotransmitters each 
time. This rapid release is possible because some of the vesicles are docked and 
primed for fusion, which will occur only when an action potential causes an 
influx of Ca 2+ into the terminal. 

Only a small proportion of the synaptic vesicles in the nerve terminal fuse 
with die plasma membrane in response to each action potential. But for the 
nerve terminal to respond rapidly and repeatedly, the vesicles need to he replen- 
ished very quickly after they discharge. Thus, most synaptic vesicles are gener- 
ated not from the Golgi membrane in the nerve cell body but by local recycling 
from the plasma membrane in the nerve terminals. It is thought diat the mem- 
brane components of the synaptic vesicles are initially delivered to the plasma 
membrane by the constitutive secretory pathway and then retrieved by endocy- 
tosis. But instead of fusing with endoso tries, most of die endocytic vesicles 
immediately fill with transmitter to become synaptic vesicles. 

The membrane components of a synaptic vesicle include transporters spe- 
cialized for the uptake of nemo transmitter from the eytosol, where the small - 
molecule neurotransmitters diat mediate fast synaptic signaling are synthesized 
Once filled with neurotransmitter, the vesicles ret urn to the plasma membrane, 
where they wait until the cell is stimulated. After they have released their con- 
tents, their membrane components are retrieved in the same way and used again 
(Figure 13-73). 

Because synaptic vesicles are abundant and relatively uniform in size, they 
can be purified in large numbers and, consequently, are the best-characterized 
organelle of the cell. Careful quantitative proteomic analyses have identified all 
the components of a synaptic vesicle (Figure 13-74). 
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Figure 1 3-74 A scale model of a synaptic vesicle. The illustration shows a section through a 
synaptic vesicle in which proteins and lipids are drawn to scale based on their known 
stoichiometry and either their known or approximated structures. Only 70% of the membrane 
proteins estimated to be present in the membrane are shown; a complete model would 
therefore show a membrane that is even more crowded than this picture suggests. Each 
synaptic vesicle membrane contains 7000 phospholipid molecules and 5700 cholesterol 
molecules. Each also contains close to 50 different integral membrane protein molecules, which 
vary widely in their relative abundance and together contribute about 600 transmembrane 
a-helices.The v-SNARE synaptobrevin is the most abundant protein in the vesicle 
{-70 copies/vesicle). Because of its filamentous structure, it extends from the dense forest of the 
cytosolic protein domains, which almost completely cover the vesicle surface. By contrast, the 
V-ATPase, which uses ATP hydrolysis to pump H + into the vesicle lumen, is present in 1-2 copies 
per vesicle. The H + gradient provides the energy for neurotransmitter import by an 
^/neurotransmitter antiport, which loads each vesicle with 1800 neurotransmitter molecules, 
such as glutamate, one of which is shown to scale. (Adapted from S.Takamori et al., Cell, 
127:831-846, 2006. With permission from Elsevier.) <ACCA> 



Summary 

Cells can secrete molecules by exocytosis in either a constitutive or a regulated fashion. 
Whereas the regulated pathways operate only in specialized secretory cells, a constitu- 
tive secretory pathway operates in all eucaryotic cells, characterized by continual 
vesicular transport from the TGN to the plasma membrane. In the regulated pathways, 
the molecules are stored either in secretory vesicles or in synaptic vesicles, which do not 
fuse with the plasma membrane to release their contents until they receive an appro- 
priate signal Secretory vesicles containing proteins for secretion bud from the TGN. 
The secretory proteins become concentrated during the formation and maturation of 
the secretory vesicles. Synaptic vesicles, which are confined to nerve cells and some 
endocrine cells, form from both endocytic vesicles and from endosomes, and they 
mediate the regulated secretion of small-molecule neurotransmitters. 

Proteins are delivered from the TGN to the plasma membrane by the constitutive 
pathway unless they are diverted into other pathways or retained in the Golgi appara- 
tus. In polarized cells, the transport pathways from the TGN to the plasma membrane 
operate selectively to ensure that different sets of membrane proteins, secreted proteins, 
and lipids are delivered to the different domains of the plasma membrane. 
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PROBLEMS 

Which statements are true? Explain why or why not. 

13-1 In all events involving fusion of a vesicle to a target 
membrane, the cytosolic leaflets of the vesicle and target 
bilayers always fuse together, as do the leaflets that are not 
in contact with the cytosol. 

1 3-2 There is one strict requirement for the exit of a pro- 
tein from the ER: it must be correctly folded. 

1 3-3 All of the glycoproteins and glycolipids in intracellu- 
lar membranes have their oligosaccharide chains facing the 
lumenal side, and all those in the plasma membrane have 
their oligosaccharide chains facing the outside of the cell. 

1 3-4 During transcytosis, vesicles that form from coated 
pits on the apical surface fuse with the plasma membrane 
on the basolateral surface, and in that way transport 
molecules across the epithelium. 

Discuss the following problems. 

1 3-5 In a nondividing cell such as a liver cell, why must the 
flow of membrane between compartments be balanced, so 
that the retrieval pathways match the outward flow? Would 
you expect the same balanced flow in a gut epithelial cell, 
which is actively dividing? 

1 3-6 Yeast, and many other organisms, make a single type 
of clathrin heavy chain and a single type of clathrin light 
chain; thus, they make a single kind of clathrin coat. How is 
it, then, that a single clathrin coat can be used for three dif- 
ferent transport pathways— Golgi to late endosomes, 
plasma membrane to early endosomes, and immature 
secretory vesicles to Golgi— that each involves different spe- 
cialized cargo proteins? 

13-7 How can it possibly be true that complementary pairs 
of specific SNAREs uniquely mark vesicles and their target 
membranes? After vesicle fusion, the target membrane will 
contain a mixture of t-SNAREs and v-SNAREs. Initially, these 
SNAREs will be tightly bound to one another, but NSF can pry 
them apart, reactivating them. What do you suppose prevents 
target membranes from accumulating a population of v- 
SNAREs equal to or greater than their population of t-SNAREs? 

13-8 Viruses are the ultimate scavengers— a necessary 
consequence of their small genomes. Wherever possible, 
they make use of the cell's machinery to accomplish the 
steps involved in their own reproduction. Many different 
viruses have membrane coverings. These so-called 
enveloped viruses gain access to the cytosol by fusing with a 
cell membrane. Why do you suppose that each of these 
viruses encodes its own special fusion protein, rather than 
making use of a cell's SNAREs? 

13-9 For fusion of a vesicle with its target membrane to 
occur, the membranes have to be brought to within 1.5 nm 
so that the two bilayers can join (Figure Q13-1). Assuming 
that the relevant portions of the two membranes at the 
fusion site are circular regions 1.5 nm in diameter, calculate 
the number of water molecules that would remain between 
the membranes. (Water is 55.5 M and the volume of a cylin- 
der is nr 2 h.) Given that an average phospholipid occupies a 



membrane surface area of 0.2 nm 2 , how many phospho- 
lipids would be present in each of the opposing monolayers 
at the fusion site? Are there sufficient water molecules to 
bind to the hydrophilic head groups of this number of phos- 
pholipids? (It is estimated that 10-12 water molecules are 
normally associated with each phospholipid head group at 
the exposed surface of a membrane.) 

13-10 SNAREs exist as complementary partners that carry 
out membrane fusions between appropriate vesicles and 
their target membranes. In this way, a vesicle with a particu- 
lar variety of v-SNARE will fuse only with a membrane that 
carries the complementary t-SNARE. In some instances, how- 
ever, fusions of identical membranes (homotypic fusions) are 
known to occur. For example, when a yeast cell forms a bud, 
vesicles derived from the mother cell's vacuole move into the 
bud where they fuse with one another to form a new vacuole. 
These vesicles carry both v-SNAREs and t-SNAREs. Are both 
types of SNAREs essential for this homotypic fusion event? 

To test this point, you have developed an ingenious assay 
for fusion of vacuolar vesicles. You prepare vesicles from two 
different mutant strains of yeast: strain B has a defective 
gene for vacuolar alkaline phosphatase (Pase); strain A is 
defective for the protease that converts the precursor of 
alkaline phosphatase (pro-Pase) into its active form (Pase) 
(Figure Q13-2A). Neither strain has active alkaline phos- 
phatase, but when extracts of the strains are mixed, vesicle 
fusion generates active alkaline phosphatase, which can be 
easily measured (Figure Q13-2). 

Now you delete the genes for the vacuolar v-SNARE, t- 
SNARE, or both in each of the two yeast strains. You prepare 
vacuolar vesicles from each and test them for their ability to 
fuse, as measured by the alkaline phosphatase assay (Figure 
Q13-2B). 

What do these data say about the requirements for v- 
SNAREs and t-SNAREs in the fusion of vacuolar vesicles? 
Does it matter which kind of SNARE is on which vesicle? 

13-11 If you were to remove the ER retrieval signal from 
protein disulfide isomerase (PDI), which is normally a solu- 
ble resident of the ER lumen, where would you expect the 
modified PDI to be located? 

13-12 The KDEL receptor must shuttle back and forth 
between the ER and the Golgi apparatus to accomplish its 
task of ensuring that soluble ER proteins are retained in the 
ER lumen. In which compartment does the KDEL receptor 
bind its ligands more tightly? In which compartment does it 
bind its ligands more weakly? What is thought to be the 
basis for its different binding affinities in the two compart- 
ments? If you were designing the system, in which com- 
partment would you have the highest concentration of 
KDEL receptor? Would you predict that the KDEL receptor, 
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Figure Q13-1 Close approach of a vesicle and its target membrane 
in preparation for fusion (Problem 13-9). 
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Figure Q13-2 SNARE 
requirements for vesicle 
fusion (Problem 13-10). 
(A) Scheme for measuring 
the fusion of vacuolar 
vesicles. (B) Results of 
fusions of vesicles with 
different combinations of 
v-SNAREs and t-SNAREs. 
The SNAREs present on the 
vesicles of the two strains 
are indicated as v (v-SNARE) 
andt(t-SNARE). 
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which is a transmembrane protein, would itself possess an 
ER retrieval signal? 

1 3-1 3 How does the low pH of lysosomes protect the rest of 
the cell from lysosomal enzymes in case the lysosome breaks? 

13-14 Melanosomes are specialized lysosomes that store 
pigments for eventual release by exocytosis. Various cells 
such as skin and hair cells then take up the pigment, which 
accounts for their characteristic pigmentation. Mouse 
mutants that have defective melanosomes often have pale 
or unusual coat colors. One such light-colored mouse, the 
Mocha mouse (Figure Q13-3), has a defect in the gene for 
one of the subunits of the adaptor protein complex AP3, 
which is associated with coated vesicles budding from the 
trans Golgi network. How might the loss of AP3 cause a 
defect in melanosomes? 



Figure Ql 3-3 A normal 
mouse and the Mocha mouse 
(Problem 13-14). In addition 
to its light coat color, the 
Mocha mouse has a poor 
sense of balance. (Courtesy 
of Margit Burmeister.) 




normal mouse 



Mocha mouse 



13-15 Patients with Hunter's syndrome or with Hurler's 
syndrome rarely live beyond their teens. These patients 
accumulate glycosaminoglycans in lysosomes due to the 
lack of specific lysosomal enzymes necessary for their 
degradation. When cells from patients with the two syn- 
dromes are fused, glycosaminoglycans are degraded prop- 
erly, indicating that the cells are missing different degrada- 
tive enzymes. Even if the cells are just cultured together, 
they still correct each other's defects. Most surprising of all, 
the medium from a culture of Hurler's cells corrects the 
defect in Hunter's cells (and vice versa). The corrective fac- 
tors in the media are inactivated by treatment with pro- 
teases, by treatment with periodate, which destroys carbo- 
hydrate, and by treatment with alkaline phosphatase, 
which removes phosphates. 

A. What do you suppose the corrective factors are? Begin- 
ning with the donor patient's cells, describe the route by 
which the factors reach the medium and subsequently enter 
the recipient cells to correct the lysosomal defects. 

B, Why do you suppose the treatments with protease, 
periodate, and alkaline phosphatase inactivate the correc- 
tive factors? 



o 

strain A vt 

strain B vt 

experiment 1 
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C Would you expect a similar sort of correction scheme to 
work for mutant cytosolic enzymes? 

13-16 A macrophage ingests the equivalent of 100% of its 
plasma membrane each half hour by endocytosis. What is 
the rate at which membrane is returned by exocytosis? 

13-17 Cells take up extracellular molecules by receptor- 
mediated endocytosis and by fluid-phase endocytosis. A 
classic paper compared the efficiencies of these two path- 
ways by incubating human cells for various periods of rime 
in a range of concentrations of either l25 Habeled epidermal 
growth factor (EGF). to measure receptor-mediated endocy- 
tosis. or horseradish peroxidase (HRP), to measure fluid- 
phase endocytosis. Both EGF and HRP were found to be pre- 
sent in small vesicles with an internal radius of 20 nm. The 
uptake of HRP was linear (Figure Q13-4A), while that of EGF 
was initially linear but reached a plateau at higher concen- 
trations (Figure Q13-4B). 



(A} HRP UPTAKE 



(B) EGF UPTAKE 




10 20 30 
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20 40 60 80 
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Figure Q1 3-4 Uptake of HRP and EGF as a function of their 
concentration in the medium (Problem 1 3-1 7). 

A. Explain why the shapes of the curves in Figure Q13-4 
are different for HRP and EGF 

B. From the curves in Figure Q13-4, estimate the differ- 
ence in the uptake rates for HRP and EGF when both are 
present at 40 nM. What would the difference be if both were 
present at 40 uM? 

C. Calculate the average number of HRP molecules that 
get taken up by each endocytic vesicle (radius 20 nm) when 
the medium contains 40 jxM HRR [The volume of a sphere is 
(4/3)7ir 3 .] 

D. The scientists who did these experiments said at the 
time, "These calculations clearly illustrate how cells can 
internalize EGF by endocytosis while excluding all but 
insignificant quantities of extracellular fluid." What do you 
think they meant. 
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japter 14 



Energy Conversion: 
Mitochondria and Chloroplasts 




Through a set of reactions that occur in the cytosol, energy derived from the par- 
tial oxidation of energy-rich carbohydrate molecules is used to form ATR the 
chemical energy currency of cells (discussed in Chapter 2). But a much more 
efficient method of energy generation appeared very early in the history of life. 
This process requires a membrane, and it enables cells to acquire energy from a 
wide variety of sources. For example, it is central to the conversion of light 
energy into chemical bond energy in photosynthesis, as well as to the aerobic 
respiration that enables us to use oxygen to produce large amounts of ATP from 
food molecules. 

Procaryotes use their plasma membrane to produce ATP But the plasma 
membrane in eucaryotic cells is reserved for the transport processes described 
in Chapter 11. Eucaryotes instead use the specialized membranes inside energy- 
converting organelles to produce most of their ATP The membrane-enclosed 
organelles are mitochondria, which are present in the cells of virtually all 
eucaryotic organisms (including fungi, animals, plants, algae, and protozoa), 
and plastids — most notably chloroplasts— which occur only in plants and 
algae. In electron micrographs the most striking morphological feature of both 
mitochondria and chloroplasts is the large amount of internal membrane they 
contain. This internal membrane provides the framework for an elaborate set of 
electron-transport processes that produce most of the cell's ATP. 

The common pathway used by mitochondria, chloroplasts, and procaryotes 
to harness energy for biological purposes operates by a process known as 
chemiosmotic coupling— reflecting a link between the chemical bond-forming 
reactions that generate ATP ("chemi") and membrane-transport processes 
("osmotic"). The coupling process occurs in two linked stages, both of which are 
performed by protein complexes embedded in a membrane: 

Stage 1. High-energy electrons (derived from the oxidation of food molecules, 
from the action of sunlight, or from other sources discussed later) are 
transferred along a series of electron carriers embedded in the mem- 
brane. These electron transfers release energy that is used to pump 
protons (H + , derived from the water that is plentiful in cells) across the 
membrane and thus generate an electrochemical proton gradient. As 
discussed in Chapter 11, an ion gradient across a membrane is a form 
of stored energy, which can be harnessed to do useful work when the 
ions are allowed to flow back across the membrane down their elec- 
trochemical gradient. 
Stage 2. H + flows back down its electrochemical gradient through a protein 
machine called ATP synthase, which catalyzes the energy-requiring 
synthesis of ATP from ADP and inorganic phosphate (Pi). This ubiqui- 
tous enzyme plays the role of a turbine, permitting the proton gradi- 
ent to drive the production of ATP (Figure 14-1). 
The electrochemical proton gradient also drives other membrane-embedded 
protein machines (Figure 14-2). In eucaryotes, special proteins couple the "down- 
hill" H + flow to the transport of specific metabolites into and out of the organelles. 
In bacteria, the electrochemical proton gradient drives more than ATP synthesis 
and transport processes; as a store of directly usable energy, it also drives the rapid 
rotation of the bacterial flagellum, which enables the bacterium to swim. 
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Figure 14-1 Harnessing energy for life. 

(A) The essential requirements for 
chemiosmosis are a membrane — in which 
are embedded a pump protein and an ATP 
synthase, plus a source of high-energy 
electrons (e _ ).The protons (H + ) shown are 
freely available from water molecules. The 
pump harnesses the energy of electron 
transfer (details not shown here) to pump 
protons, creating an electrochemical 
proton gradient across the membrane. 
<B) This proton gradient serves as an 
energy store that can be used to drive ATP 
synthesis by the ATP synthase enzyme. 
The red arrow shows the direction of 
proton movement at each stage. 



(A) 



(B) 



The entire set of proteins in the membrane, together with the small 
molecules involved in the orderly sequence of electron transfers, is called an 
electron-transport chain. The mechanism of electron transport is analogous to 
an electric cell driving a current through a set of electric pumps. However, in bio- 
logical systems, electrons are carried between one site and another not by con- 
ducting wires, but by diffusible molecules that can pick up electrons at one loca- 
tion and deliver them to another. For mitochondria, the first of these electron 
carriers is NAD + , which takes up two electrons (plus an H + ) to become NADH, a 
water-soluble small molecule that ferries electrons from the sites where food 
molecules are degraded to the inner mitochondrial membrane. 

Figure 14-3 compares the electron-transport processes in mitochondria, 
which convert energy from chemical fuels, with those in chloroplasts, which 
convert energy from sunlight. In the mitochondrion, electrons — which have 
been released from a carbohydrate food molecule in the course of its degrada- 
tion to CO2 — are transferred through the membrane by a chain of electron car- 
riers, finally reducing oxygen gas (O2) to form water. The free energy released as 
the electrons flow down this path from a high-energy state to a low-energy state 
drives a series of three H + pumps in the inner mitochondrial membrane, and it 
is the third H + pump in the series that catalyzes the transfer of the electrons to 
2 (see Figure 14-3A). 

Although the chloroplast can be described in similar terms, and several of its 
main components are similar to those of the mitochondrion, the chloroplast 
membrane contains some crucial components not found in the mitochondrial 
membrane. Foremost among these are the photosystems, where the green pig- 
ment chlorophyll captures light energy and harnesses it to drive the transfer of 
electrons, much as photocells in solar panels absorb light energy and use it to 
drive an electric current. The electron-motive force generated by the chloroplast 
photosystems drives electron transfer in the direction opposite to that in mito- 
chondria: electrons are taken from water to produce O2, and they are donated to 
CO2 (via NADPH, a compound closely related to NADH) to synthesize carbohy- 
drate. Thus, the chloroplast generates O2 and carbohydrate, whereas the mito- 
chondrion consumes them (see Figure 14-3B). 

It is thought that the energy-converting organelles of eucaryotes evolved 
from procaryotes that were engulfed by primitive eucaryotic cells and developed 
a symbiotic relationship with them (discussed in Chapter 12). This hypothesis 
explains why mitochondria and chloroplasts contain their own DNA, which 
codes for some of their proteins. Since their initial uptake by a host cell, these 

Figure 1 4-2 Chemiosmotic coupling. Energy from sunlight or the 
oxidation of foodstuffs is first used to create an electrochemical proton 
gradient across a membrane. This gradient serves as a versatile energy 
store that drives energy-requiring reactions in mitochondria, chloroplasts, 
and bacteria. 
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Figure 14-3 Electron-transport processes. (A) The mitochondrion converts energy from chemical fuels. (B) The chloroplast 
converts energy from sunlight. Inputs are light green, products are blue, and the path of electron flow is indicated by red 
arrows. Each of the protein complexes (orange) is embedded in a membrane. Note that the electron-motive force 
generated by the two chloroplast photosystems enables the chloroplast to drive electron transfer from H2O to 
carbohydrate, and that this is opposite to the energetically favorable direction of electron transfer in a mitochondrion. Thus, 
whereas carbohydrate molecules and O2 are inputs for the mitochondrion, they are products of the chloroplast. 



organelles have lost much of their own genomes and become heavily dependent 
on proteins that are encoded by genes in the nucleus, synthesized in the cytosol, 
and then imported into the organelle. Conversely, the host cells have become 
dependent on these organelles for much of the ATP they need for biosyntheses, 
ion pumping, and movement; they have also become dependent on selected 
biosynthetic reactions that occur inside these organelles. 



THE MITOCHONDRION 

Mitochondria occupy a substantial portion of the cytoplasmic volume of 
eucaryotic cells, and they have been essential for the evolution of complex ani- 
mals. Without mitochondria, present-day animal cells would have to depend on 
anaerobic glycolysis for all of their ATR When glycolysis converts glucose to 
pyruvate, it releases only a small fraction of the total free energy that is poten- 
tially available from glucose oxidation. In mitochondria, the metabolism of sug- 
ars is completed: the pyruvate is imported into the mitochondrion and oxidized 
by O2 to CO2 and H2O. This allows 15 times more ATP to be made than is pro- 
duced by glycolysis alone. 

Mitochondria are usually depicted as stiff, elongated cylinders with a diam- 
eter of 0.5-1 jam, resembling bacteria. Time-lapse microcinematography of liv- 
ing cells, however, shows that mitochondria are remarkably mobile and plastic 
organelles, constantly changing their shape (Figure 14-4) and even fusing with 
one another and then separating again. As they move about in the cytoplasm, 
they often seem to be associated with microtubules (Figure 14-5), which can 
determine the unique orientation and distribution of mitochondria in different 
types of cells. Thus, the mitochondria in some cells form long moving filaments 
or chains. In others they remain fixed in one position where they provide ATP 
directly to a site of unusually high ATP consumption — packed between adjacent 
myofibrils in a cardiac muscle cell, for example, or wrapped tightly around the 
flagellum in a sperm (Figure 14-6). 

Mitochondria are large enough to be seen in the light microscope, and they 
were first identified during the nineteenth century. Real progress in understand- 
ing their function, however, depended on procedures developed in 1948 for iso- 
lating intact mitochondria. For technical reasons, many of these biochemical 
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Figure 14-4 Mitochondrial plasticity. 

Rapid changes of shape are often 
observed when an individual 
mitochondrion is followed in a living cell. 
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studies were performed with mitochondria purified from liver; each liver cell 
contains 1000-2000 mitochondria, which in total occupy about one-fifth of the 
cell volume. 



The Mitochondrion Contains an Outer Membrane, an Inner 
Membrane, and Two Internal Compartments 

Each mitochondrion is enclosed by two highly specialized membranes, which 
have very different functions. Together they create two separate mitochondrial 
compartments: the internal matrix and a much narrower intermembrane 
space. If purified mitochondria are gently disrupted and then fractionated into 
separate components (Figure 14-7), the biochemical composition of each of the 
two membranes and of the spaces enclosed by them can be determined. Each 
contains a unique collection of proteins. Most of these 1000 or so different mito- 
chondrial proteins are encoded in the nucleus and imported into the mitochon- 
drion from the cytoplasm by specialized protein translocases of the outer (TOM, 
translocase of the outer membrane) and inner (TIM) mitochondrial membrane 
(discussed in Chapter 12). 

The outer membrane contains many porin molecules, a type of transport 
protein that forms large aqueous channels through the lipid bilayer (discussed 
in Chapter 11). This membrane thus resembles a sieve that is permeable to all 
molecules of 5000 daltons or less, including small proteins. Such molecules can 
enter the intermembrane space, but most of them cannot pass the impermeable 
inner membrane. Thus, whereas the intermembrane space is chemically equiv- 
alent to the cytosol with respect to the small molecules it contains, the matrix 
contains a highly selected set of these molecules. 
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Figure 14-5 The relationship between 
mitochondria and microtubules. 

(A) A light micrograph of chains of 
elongated mitochondria in a living 
mammalian cell in culture. The cell was 
stained with a fluorescent dye 
(rhodamine 123) that specifically labels 
mitochondria in living cells. (B) An 
immunofluorescence micrograph of the 
same cell stained (after fixation) with 
fluorescent antibodies that bind to 
microtubules. Note that the mitochondria 
tend to be aligned along microtubules. 
(Courtesy of Lan Bo Chen.) 



(A) CARDIAC MUSCLE 



(B) SPERM TAIL 



Figure 14-6 Localization of 
mitochondria near sites of high ATP 
utilization in cardiac muscle and a 
sperm tail. Cardiac muscle (A) in the wall 
of the heart is the most heavily used 
muscle in the body, and its continual 
contractions require a reliable energy 
supply. It has limited built in energy 
stores and has to depend on a steady 
supply of ATP from the copious 
mitochondria aligned close to the 
contractile myofibrils (see p. 1031). 
During the development of the flagellum 
of the sperm tail (B), microtubules wind 
helically around the axoneme, where 
they are thought to help localize the 
mitochondria in the tail; these 
microtubules then disappear, and the 
mitochondria fuse with one another to 
create the structure shown. 
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Figure 14-7 Biochemical fractionation of purified mitochondria into 
separate components. These techniques have made it possible to study 
the different proteins in each mitochondrial compartment. The method 
shown allows the processing of large numbers of mitochondria at the 
same time. It takes advantage of the fact that, in a solution of low osmotic 
strength, water flows into mitochondria and greatly expands the matrix 
space (yellow). While the cristae of the inner membrane unfold to 
accommodate the expansion, the outer membrane — which has no folds — 
breaks, releasing a structure composed of only the inner membrane and 
the matrix. 



As we explain in detail later, the major working part of the mitochondrion is 
the matrix and the inner membrane that surrounds it. The inner membrane is 
highly specialized. Its lipid bilayer contains a high proportion of the "double" 
phospholipid cardiolipin, which has four fatty acids rather than two and may 
help to make the membrane especially impermeable to ions (see Figure 14-65). 
This membrane also contains a variety of transport proteins that make it selec- 
tively permeable to those small molecules that are metabolized or required by 
the many mitochondrial enzymes concentrated in the matrix. The matrix 
enzymes include those that metabolize pyruvate and fatty acids to produce 
acetyl CoA and those that oxidize acetyl CoA in the citric acid cycle. The princi- 
pal end products of this oxidation are CO2, which is released from the cell as 
waste, and NADH, which is the main source of electrons for transport along the 
respiratory chain — the name given to the electron-transport chain in mito- 
chondria. The enzymes of the respiratory chain are embedded in the inner mito- 
chondrial membrane, and they are essential to the process of oxidative phos- 
phorylation, which generates most of the animal cell's ATP. 

As illustrated in Figure 14-8, the inner membrane is usually highly convo- 
luted, forming a series of infoldings, known as cristae, that project into the 
matrix. These convolutions greatly increase the area of the inner membrane, so 
that in a liver cell, for example, it constitutes about one-third of the total cell 
membrane. The number of cristae is three times greater in the mitochondrion of 
a cardiac muscle cell than in the mitochondrion of a liver cell, presumably 
because of the greater demand for ATP in heart cells. There are also substantial 
differences in the mitochondrial enzymes of different cell types. In this chapter, 
we largely ignore these differences and focus instead on the enzymes and prop- 
erties that are common to all mitochondria. 



The Citric Acid Cycle Generates High-Energy Electrons 

Mitochondria can use both pyruvate and fatty acids as fuel. Pyruvate comes 
from glucose and other sugars, whereas fatty acids come from fats. Both of these 
fuel molecules are transported across the inner mitochondrial membrane and 
are then converted to the crucial metabolic intermediate acetyl CoA by enzymes 
located in the mitochondrial matrix. The acetyl groups in acetyl CoA are then 
oxidized in the matrix via the citric acid cycle, described in Chapter 2. The cycle 
converts the carbon atoms in acetyl CoA to CO2, which the cell releases as a 
waste product. Most importantly, this oxidation generates high-energy elec- 
trons, carried by the activated carrier molecules NADH and FADH2 (Figure 
14-9). These high-energy electrons are then transferred to the inner mitochon- 
drial membrane, where they enter the electron-transport chain; the loss of elec- 
trons from NADH and FADH 2 also regenerates the NAD + and FAD that is needed 
for continued oxidative metabolism. Figure 14-10 presents the entire sequence 
of reactions schematically. 
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A Chemiosmotic Process Converts Oxidation Energy into ATP 

Although the citric acid cycle is considered to be part of aerobic metabolism, it 
does not itself use the oxygen. Only in the final catabolic reactions that take 
place on the inner mitochondrial membrane is molecular oxygen (0 2 ) directly 
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Matrix. This large internal space contains a highly concentrated 
mixture of hundreds of enzymes, including those required for the 
oxidation of pyruvate and fatty acids and for the citric acid cycle. The 
matrix also contains several identical copies of the mitochondrial DNA 
genome, special mitochondrial ribosomes, tRNAs, and various enzymes 
required for expression of the mitochondrial genes. 
Inner membrane. The inner membrane is folded into numerous cristae, 
greatly increasing its total surface area. It contains proteins with three 
types of functions: (1) those that carry out the oxidation reactions of 
the electron-transport chain, (2) the ATP synthase that makes ATP in 
the matrix, and (3) transport proteins that a Now the passage of 
metabolites into and out of the matrix. An electrochemical gradient of 
H + P which drives the ATP synthase, is established across this membrane, 
so the membrane must be impermeable to ions and most small 
charged molecules. 

Outer membrane. Because it contains a large channel-forming protein 
(a porin, VDAC), the outer membrane is permeable to all molecules of 
5000 daltons or less. Other proteins in this membrane include enzymes 
involved in mitochondrial lipid synthesis and enzymes that convert 
lipid substrates into forms that are subsequently metabolized in the 
matrix, import receptors for mitochondrial proteins, and enzymatic 
machinery for division and fusion of the organelle. 

Intermembrane space. This space contains several enzymes that use 
the ATP passing out of the matrix to phosphorylate other nucleotides. 
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300 nm 



consumed. Nearly all the energy available from burning carbohydrates, fats, and 
other foodstuffs in the earlier stages of their oxidation is initially saved in the 
form of high-energy electrons removed from substrates by NAD + and FAD. These 
electrons, carried by NADH and FADH 2 , then combine with 2 by means of the 
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Figure 1 4-8 The structure of a 
mitochondrion. <CGAT> In the liver, an 
estimated 67% of the total 
mitochondrial protein is located in the 
matrix, 21% is located in the inner 
membrane, 6% in the outer membrane, 
and 6% in the intermembrane space. As 
indicated below, each of these four 
regions contains a special set of proteins 
that mediate distinct functions. (Large 
micrograph courtesy of Daniel S. Friend; 
small micrograph and three-dimensional 
reconstruction from T.G. Frey, 
C.W. Renken and G.A. Perkins, Biochim. 
Biophys.Acta 1555: 196-203, 2002. With 
permission from Elsevier.) 
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Figure 14-9 How NADH donates electrons. In this diagram, the high-energy electrons are shown as two red dots on a 
yellow hydrogen atom. A hydride ion (H~ a hydrogen atom with an extra electron) is removed from NADH and is converted 
into a proton and two high-energy electrons: H~ -> H + + 2e~ Only the ring that carries the electrons in a high-energy 
linkage is shown; for the complete structure and the conversion of NAD + back to NADH, see the structure of the closely 
related NADPH in Figure 2-60. Electrons are also carried in a similar way by FADH 2 , whose structure is shown in Figure 2-83. 
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respiratory chain embedded in the inner mitochondrial membrane. The inner 
membrane harnesses the large amount of energy released to drive the conver- 
sion of ADP + Pi to ATP. For this reason, the term oxidative phosphorylation is 
used to describe this last series of reactions (Figure 14-11). 

As previously mentioned, the generation of ATP by oxidative phosphoryla- 
tion via the respiratory chain depends on a chemiosmotic process. When it was 
first proposed in 1961, this mechanism explained a long-standing puzzle in cell 
biology. Nonetheless, the idea was so novel that it was some years before enough 
supporting evidence accumulated to make it generally accepted. 

In the remainder of this section we shall briefly describe the type of reac- 
tions that make oxidative phosphorylation possible, saving the details of the res- 
piratory chain for later. 



Figure 14-1 A summary of energy- 
generating metabolism in mitochondria. 

Pyruvate and fatty acids enter the 
mitochondrion (bottom) and are broken 
down to acetyl CoA. The acetyl CoA is then 
metabolized by the citric acid cycle, which 
reduces NAD + to NADH (and FAD to 
FADH 2 , not shown). In the process of 
oxidative phosphorylation, high-energy 
electrons from NADH (and FADH 2 ) are 
then passed along the electron-transport 
chain in the inner membrane to oxygen 
(0 2 ). This electron transport generates a 
proton gradient across the inner 
membrane, which drives the production 
of ATP by ATP synthase (see Figure 1 4-1 ). 
The NADH generated by glycolysis in the 
cytosol also passes electrons to the 
respiratory chain (not shown). Since NADH 
cannot pass across the inner 
mitochondrial membrane, the electron 
transfer from cytosolic NADH must be 
accomplished indirectly by means of one 
of several "shuttle" systems that transport 
another reduced compound into the 
mitochondrion; after being oxidized, this 
compound is returned to the cytosol, 
where it is reduced by NADH again (see 
also Figure 14-32). 



NADH Transfers its Electrons to Oxygen Through Three Large 
Respiratory Enzyme Complexes 

Although the respiratory chain harvests energy by a different mechanism than 
that used in other catabolic reactions, the principle is the same. The energeti- 
cally favorable reaction H2 + V2 O2 -» H 2 is made to occur in many small steps, 
so that most of the energy released can be stored instead of being lost to the 
environment as heat. The hydrogen atoms are first separated into protons and 
electrons. The electrons pass through a series of electron carriers in the inner 
mitochondrial membrane. At several steps along the way, protons and electrons 
are transiently recombined. But only at the end of the electron-transport chain 
are the protons returned permanently, when they are used to neutralize the neg- 
ative charges created by the final addition of the electrons to the oxygen 
molecule (Figure 14-12). 

The process of electron transport begins when the hydride ion is removed 
from NADH (to regenerate NAD + ) and is converted into a proton and two elec- 
trons (H - -> H + + 2e~). The two electrons are passed to the first of the more than 
15 different electron carriers in the respiratory chain. The electrons start with 
very high energy and gradually lose it as they pass along the chain. For the most 
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Figure 14-11 The major net energy 
conversion catalyzed by the 
mitochondrion. In this process of 
oxidative phosphorylation, the inner 
mitochondrial membrane serves as a 
device that changes one form of 
chemical bond energy to another, 
converting a major part of the energy of 
NADH (and FADH 2 ) oxidation into 
phosphate-bond energy in ATP. 



820 Chapter 14: Energy Conversion: Mitochondria and Chloroplasts 



(A) COMBUSTION 
Ha 




(B) BIOLOGICAL OXIDATION 



separate into H + 
and electrons 



2H + 



ViO, 



EXPLOSIVE 

RELEASE OF 

HEAT 

ENERGY 



much of the 
energy is 
harnessed and 
converted to 
a stored form 



H 2 




H 2 



part, the electrons pass from one metal ion to another, each of these ions being 
tightly bound to a protein molecule that alters the electron affinity of the metal 
ion (discussed in detail later). Most of the proteins involved are grouped into 
three large respiratory enzyme complexes, each containing transmembrane pro- 
teins that hold the complex firmly in the inner mitochondrial membrane. Each 
complex in the chain has a greater affinity for electrons than its predecessor, and 
electrons pass sequentially from one complex to another until they are finally 
transferred to oxygen, which has the greatest affinity of all for electrons. 



Figure 14-12 A comparison of biological 
oxidations with combustion. (A) Most of 
the energy would be released as heat if 
hydrogen were simply burned. (B) In 
biological oxidation by contrast, most of 
the released energy is stored in a form 
useful to the cell by means of the 
electron-transport chain in the inner 
mitochondrial membrane (the respiratory 
chain). The mitochondrion releases the 
rest of the oxidation energy as heat. In 
reality, the protons and electrons shown 
as being derived from H2 are removed 
from hydrogen atoms that are covalently 
linked to NADH or FADH 2 molecules. 



As Electrons Move Along the Respiratory Chain, Energy 
Is Stored as an Electrochemical Proton Gradient Across 
the Inner Membrane 

The close association of the electron carriers with protein molecules makes 
oxidative phosphorylation possible. The proteins guide the electrons along the 
respiratory chain so that the electrons move sequentially from one enzyme com- 
plex to another. The transfer of electrons is coupled to oriented H + uptake and 
release, as well as to allosteric changes in energy- converting protein pumps. The 
net result is the pumping of H + across the inner membrane— from the matrix to 
the intermembrane space, driven by the energetically favorable flow of elec- 
trons. This movement of H + has two major consequences: 

1. It generates a pH gradient across the inner mitochondrial membrane, with 
the pH higher in the matrix than in the cytosol, where the pH is generally 
close to 7. (Since small molecules equilibrate freely across the outer mem- 
brane of the mitochondrion, the pH in the intermembrane space is the 
same as in the cytosol.) 

2. It generates a voltage gradient (membrane potential) across the inner mito- 
chondrial membrane, with the inside negative and the outside positive (as 
a result of the net outflow of positive ions). 

The pH gradient (ApH) drives H + back into the matrix, thereby reinforcing 
the effect of the membrane potential (A V) } which acts to attract any positive ion 
into the matrix and to push any negative ion out. Together, the ApH and the Al^ 
are said to constitute an electrochemical proton gradient (Figure 14-13). The 
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electrochemical proton gradient exerts a proton-motive force, which can be 
measured in units of millivolts (mV). In a typical cell, the proton-motive force 
across the inner membrane of a respiring mitochondrion is about 180 to 190 mV 
(inside negative), and it is made up of a membrane potential of about 160 to 170 
mV and a pH gradient of about 0.3 to 0.5 pH units (each ApH of 1 pH unit has an 
effect equivalent to a membrane potential of about 60 mV). 



Figure 14-1 3 The two components of 
the electrochemical proton gradient. 

The total proton-motive force across the 
inner mitochondrial membrane consists 
of a large force due to the membrane 
potential (traditionally designated A\|/ by 
experts, but designated AV in this text) 
and a smaller force due to the H + 
gradient (ApH). Both forces act to drive 
H + into the matrix, 



The Proton Gradient Drives ATP Synthesis 

The electrochemical proton gradient across the inner mitochondrial membrane 
drives ATP synthesis in the critical process of oxidative phosphorylation (Figure 
14-14). This is made possible by the membrane-bound enzyme ATP synthase, 
mentioned previously. This enzyme creates a hydrophilic pathway across the 
inner mitochondrial membrane that allows protons to flow down their electro- 
chemical gradient. As these ions thread their way through the ATP synthase, they 
are used to drive the energetically unfavorable reaction between ADP and Pi that 
makes ATP (see Figure 2-27). The ATP synthase is of ancient origin; the same 
enzyme occurs in the mitochondria of animal cells, the chloroplasts of plants 
and algae, and in the plasma membrane of bacteria and archaea. 

Figure 14-15 shows the structure of ATP synthase. Also called the F Fi 
ATPase, it is a multisubunit protein with a mass of more than 500,000 daltons 
that works by rotary catalysis. A large enzymatic portion, shaped like a lollipop 
head and composed of a ring of 6 subunits, projects on the matrix side of the 
inner mitochondrial membrane. An elongated arm holds this head in place by 
tying it to a group of transmembrane proteins that form a "stator" in the mem- 
brane. This stator is in contact with a "rotor" composed of a ring of 10 to 14 iden- 
tical transmembrane protein subunits. As protons pass through a narrow chan- 
nel formed at the stator-rotor contact, their movement causes the rotor ring to 
spin. This spinning also turns a stalk attached to the rotor (blue in Figure 
14-15B), which is thereby made to turn rapidly inside the lollipop head. As a 
result, the energy of proton flow down a gradient has been converted into the 
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Figure 14-1 4 The general mechanism of 
oxidative phosphorylation. As a high- 
energy electron is passed along the 
electron-transport chain, some of the 
energy released drives the three 
respiratory enzyme complexes that pump 
H + out of the matrix. The resulting 
electrochemical proton gradient across 
the inner membrane drives H + back 
through the ATP synthase, a 
transmembrane protein complex that uses 
the energy of the H + flow to synthesize 
ATP from ADP and Pj in the matrix. 
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mechanical energy of two sets of proteins rubbing against each other: rotating 
stalk proteins pushing against a stationary ring of head proteins. 

Three of the six subunits in the head contain binding sites for ADP and inor- 
ganic phosphate. These are driven to form ATP as mechanical energy is con- 
verted into chemical bond energy through the repeated changes in protein con- 
formation that the rotating stalk creates. In this way, the ATP synthase is able to 
produce more than 100 molecules of ATP per second, generating 3 molecules of 
ATP per revolution. The number of proton-translocating subunits in the rotor is 
different in different ATP synthases, and it is this number that determines the 
number of protons that need to pass through this marvelous device to make 
each molecule of ATP (its "gear ratio", which is generally a non-integral number 
between 3 and 5). 



Figure 14-15 ATP synthase. <ATCG> 
<GAGA> (A) The enzyme is composed of 
a head portion, called the F 7 ATPase, and 
a transmembrane H + carrier, called F . 
Both Fi and F are formed from multiple 
subunits, as indicated. A rotating stalk is 
fixed to a rotor (red) formed by a ring of 
10-14 c subunits in the membrane. The 
stator (green) is formed from 
transmembrane a subunits, tied to other 
subunits that create an elongated arm. 
This arm fixes the stator to a ring of 
3a and 3|3 subunits that forms the head, 
which likewise cannot rotate. (B) The 
three-dimensional structure of the 
F] ATPase, determined by x-ray 
crystallography. This part of the ATP 
synthase derives its name from its ability 
to carry out the reverse of the ATP 
synthesis reaction — namely, the 
hydrolysis of ATP to ADP and Pj, when 
detached from the transmembrane 
portion. (B, courtesy of John Walker, from 
J.P. Abrahams et al., Nature 370:621-628, 
1994. With permission from Macmillan 
Publishers Ltd.) 



The Proton Gradient Drives Coupled Transport Across the Inner 
Membrane 

The electrochemical proton gradient drives other processes besides ATP synthe- 
sis. In mitochondria, many charged small molecules, such as pyruvate, ADP, and 
Pi, are pumped into the matrix from the cytosol, while others, such as ATFJ must 
be moved in the opposite direction. Transporters that bind these molecules can 
couple their transport to the energetically favorable flow of H + into the mito- 
chondrial matrix. Thus, for example, pyruvate and inorganic phosphate (PO are 
co-transported inward with H + as the H + moves into the matrix. 

ADP and ATP are co-transported in opposite directions by a single trans- 
porter protein. Since an ATP molecule has one more negative charge than ADP, 
each nucleotide exchange results in a total of one negative charge being moved 
out of the mitochondrion. Thus, the voltage difference across the membrane 
drives this ADP-ATP co-transporter (Figure 14-16). 

We have seen how, in eucaryotic cells, the electrochemical proton gradient 
across the inner mitochondrial membrane is used to drive both the formation of 
ATP and the transport of metabolites across the membrane. In bacteria, a similar 
gradient across the bacterial plasma membrane is harnessed to drive these two 
types of processes. In motile bacteria, this gradient also drives the rapid rotation 
of the bacterial flagellum, which propels the bacterium along (Figure 14-17). 



Proton Gradients Produce Most of the Cell's ATP 

As stated previously, glycolysis alone produces a net yield of 2 molecules of ATP 
for every molecule of glucose that is metabolized, and this is the total energy 
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pyruvate p. 

yield for the fermentation processes that occur in the absence of 2 (discussed 
in Chapter 2). During oxidative phosphorylation, each pair of electrons donated 
by the NADH produced in mitochondria is thought to provide energy for the for- 
mation of about 2.5 molecules of ATP, after subtracting the energy needed for 
transporting this ATP to the cytosol. Oxidative phosphorylation also produces 
1.5 ATP molecules per electron pair from FADH 2 , or from the NADH molecules 
produced by glycolysis in the cytosol From the product yields of glycolysis and 
the citric acid cycle summarized in Table 14-1A, we can calculate that the com- 
plete oxidation of one molecule of glucose — starting with glycolysis and ending 
with oxidative phosphorylation — gives a net yield of about 30 ATPs. 

In conclusion, the vast majority of the ATP produced from the oxidation of 
glucose in an animal cell is produced by chemiosmotic mechanisms in the mito- 
chondrial membrane. Oxidative phosphorylation in the mitochondrion also 
produces a large amount of ATP from the NADH and the FADH 2 that is derived 
from the oxidation of fats (Table 14-1B; see also Figure 2-81). 



Mitochondria Maintain a High ATP:ADP Ratio in Cells 

Because of the carrier protein in the inner mitochondrial membrane that 
exchanges ATP for ADP, the ADP molecules produced by ATP hydrolysis in the 



Figure 14-16 Some of the active 
transport processes driven by the 
electrochemical proton gradient across 
the inner mitochondrial membrane. 

Pyruvate, inorganic phosphate (P,), and 
ADP are moved into the matrix, while ATP 
is pumped out. The charge on each of the 
transported molecules is indicated for 
comparison with the membrane potential, 
which is negative inside, as shown. The 
outer membrane is freely permeable to all 
of these compounds. The active transport 
of molecules across membranes by 
transporter proteins is discussed in 
Chapter 11. 
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Figure 14-1 7 The rotation of the 
bacterial flagellum driven by H + flow. 

The flagellum is attached to a series of 
protein rings (orange), which are 
embedded in the outer and inner 
membranes and rotate with the flagellum. 
The rotation is driven by a flow of protons 
through an outer ring of proteins (the 
stator) by mechanisms that may resemble 
those used by the ATP synthase. 
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Table 1 4-1 Product Yields from the Oxidation of Sugars and Fats 

A, NET PRODUCTS FROM OXIDATION OF ONE MOLECULE OF GLUCOSE 

In cytosol (glycolysis) 

1 glucose -4 2 pyruvate + 2 NADH + 2 ATP 
In mitochondrion (pyruvate dehydrogenase and citric acid cycle) 

2 pyruvate -> 2 acetyl CoA + 2 NADH 
2 acetyl CoA -> 6 NADH + 2 FADH 2 + 2 GTP 

Net result in mitochondrion 

2 pyruvate -> 8 NADH + 2 FADH 2 + 2 GTP 




B. NET PRODUCTS FROM OXIDATION OF OWE MOLECULE OF PALMITOYL COA 
(ACTIVATED FORM OF PALMITATE, A FATTY ACID) 

In mitochondrion (fatty acid oxidation and citric acid cycle) 

1 palmitoyl CoA -> 8 acetyl CoA + 7 NADH + 7 FADH 2 
8 acetyl CoA -> 24 NADH + 8 FADH 2 + 8 GTP 

Net result in mitochondrion 

1 palmitoyl CoA -> 31 NADH + 15 FADH 2 + 8 GTP 



cytosol rapidly enter mitochondria for recharging, while the ATP molecules 
formed in the mitochondrial matrix by oxidative phosphorylation are rapidly 
pumped into the cytosol where they are needed. A typical ATP molecule in the 
human body shuttles out of a mitochondrion and back into it (as ADP) for 
recharging more than once per minute, and the cell maintains a concentration 
of ATP that is about 10 times higher than the concentration of ADP. 

As discussed in Chapter 2, biosynthetic enzymes often drive energetically 
unfavorable reactions by coupling them to the energetically favorable hydroly- 
sis of ATP (see Figure 2-59). The ATP pool is therefore used to drive cellular pro- 
cesses in much the same way that a battery can be used to drive electric engines. 
If the activity of the mitochondria is blocked, ATP levels fall and the cell's battery 
runs down; eventually, energetically unfavorable reactions are no longer driven, 
and the cell dies. The poison cyanide, which blocks electron transport in the 
inner mitochondrial membrane, causes death in exactly this way. 

It might seem that cellular processes would stop only when the concentra- 
tion of ATP reaches zero; but, in fact, life is more demanding: it depends on cells 
maintaining a concentration of ATP that is high compared with the concentra- 
tions of ADP and Pi. To explain why, we must consider some elementary princi- 
ples of thermodynamics. 



A Large Negative Value of AG for ATP Hydrolysis Makes ATP 
Useful to the Cell 

In Chapter 2, we introduced the concept of free energy (G). The free-energy 
change for a reaction, AG, determines whether this reaction will occur in a cell. 
We showed on p. 76 that the AG for a given reaction can be written as the sum of 
two parts: the first, called the standard free-energy change, AG°, depends on the 
intrinsic characters of the reacting molecules; the second depends on their con- 
centrations. For the simple reaction A -> B, 

AG=AG° + RT\n^\- 

[A] 

where [A] and [B] denote the concentrations of A and B, and In is the natural log- 
arithm. AG is therefore only a reference value that equals the value of AG when 
the molar concentrations of A and B are equal (In 1 = 0). 

In Chapter 2, ATP was described as the major "activated energy carrier 
molecule" in cells. The large, favorable free-energy change (large negative AG) 
for its hydrolysis is used, through coupled reactions, to drive other chemical 
reactions that would otherwise not occur (see pp. 79-87). The ATP hydrolysis 
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For the reaction 



— ADP + (Pi 



the following equation applies: 



AG = AG° + /?7"ln 



lADP||(p)l 



[ATP] 



where AG and AG are in kilocalories per mole, R is the gas 
constant (2 x 10" 3 kcal/mole K), T is the absolute temperature 
(K), and all the concentrations are in moles per liter. 

When the concentrations of all reactants are at 1 M, AG = AG 
(since RT In 1 = 0). AG° is thus a constant defined as the 
standard free-energy change for the reaction. 



At equilibrium the reaction has no net effect on the disorder of 
the universe, so AG = 0. Therefore, at equilibrium, 

IADPH®} 
[ATP] 

But the concentrations of reactants at equilibrium must satisfy 
the equilibrium equation: 

[ADPlj®! K 
[ATRI 

Therefore, at equilibrium, 

AG° = -RT\nK 



We thus see that whereas AG° indicates the equilibrium point for a 
reaction, AG reveals how far the reaction is from equilibrium. AG is 
a measure of the "driving force" for any chemical reaction, just as the 
proton-motive force is the driving force for the translocation of protons 



Figure 14-18The basic relationship between free-energy changes and equilibrium in the ATP hydrolysis reaction. The 

rate constants in boxes 1 and 2 are determined from experiments in which product accumulation is measured as a function 
of time. The equilibrium constant shown here, K, is in units of moles per liter. (See Panel 2-7, pp. 118-11 9, for a discussion 
of free energy and see Figure 3-43 for a discussion of the equilibrium constant) 



reaction produces two products, ADP and inorganic phosphate (Pi); it is there- 
fore of the type A -> B + C, where, as described in Figure 14-18, 



AG: 



:AG° + i?rinI5]JQ_ 

[A] 



When ATP is hydrolyzed to ADP and Pi under the conditions that normally 
exist in a cell, the free-energy change is roughly -11 to -13 kcal/mole (-46 to -54 
kj/mole). This extremely favorable AG depends on having a high concentration 
of ATP in the cell compared with the concentration of ADP and Pi. When ATR 
ADP, and Pj are all present at the same concentration of 1 mole/liter (so-called 
standard conditions), the AG for ATP hydrolysis is the standard free-energy 
change (AC?), which is only -7.3 kcal/mole (-30.5 kj/mol). At much lower con- 
centrations of ATP relative to ADP and Pi, AG becomes zero. At this point, the 
rate at which ADP and Pi will join to form ATP will be equal to the rate at which 
ATP hydrolyzes to form ADP and P*. In other words, when AG = 0, the reaction is 
at equilibrium (see Figure 14-18). 

It is AG, not AG , that indicates how far a reaction is from equilibrium and 
determines whether it can be used to drive other reactions. Because the efficient 
conversion of ADP to ATP in mitochondria maintains such a high concentration 
of ATP relative to ADP and Pi, the ATP-hydrolysis reaction in cells is kept very far 
from equilibrium and AG is correspondingly very negative. Without this large 
disequilibrium, ATP hydrolysis could not be used to direct the reactions of the 
cell; for example, many biosynthetic reactions would run backward rather than 
forward at low ATP concentrations. 
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ATP Synthase Can Function in Reverse to Hydrolyze ATP and 
Pump H + 

In addition to harnessing the flow of H + down an electrochemical proton gradi- 
ent to make ATP, the ATP synthase can work in reverse: it can use the energy of 
ATP hydrolysis to pump H + across the inner mitochondrial membrane (Figure 
14-19). It thus acts as a reversible coupling device, interconverting electrochem- 
ical proton gradient and chemical bond energies. The direction of action at any 
instant depends on the balance between the steepness of the electrochemical 
proton gradient and the local AG for ATP hydrolysis, as we now explain. 

The exact number of protons needed to make each ATP molecule depends on 
the number of subunits in the ring of transmembrane proteins that forms the 
base of the rotor (see Figure 14-15). However, to illustrate the principles involved, 
let us assume that one molecule of ATP is made by the ATP synthase for every 3 
protons driven through it. Whether the ATP synthase works in its ATP-synthesiz- 
ing or its ATP-hydrolyzing direction at any instant depends, in this case, on the 
exact balance between the favorable free-energy change for moving the three 
protons across the membrane into the matrix, AG3H+ (which is less than zero), 
and the unfavorable free-energy change for ATP synthesis in the matrix, AGatp syn- 
thesis (which is greater than zero). As just discussed, the value of AG A tp synthesis 
depends on the exact concentrations of the three reactants ATR ADR and Pi in the 
mitochondrial matrix (see Figure 14-18). The value of AQj H +, in contrast, is 
directly proportional to the value of the proton-motive force across the inner 
mitochondrial membrane. The following example will show how the balance 
between these two free-energy changes affects the ATP synthase. 

As explained in the legend to Figure 14-19, a single H + moving into the 
matrix down an electrochemical gradient of 200 mV liberates 4.6 kcal/mole (19.2 
kl/mol) of free energy, while the movement of three protons liberates three 
times this much free energy (AG3H+ = -13.8 kcal/mole; 57.7 kj/mol). Thus, if the 
proton-motive force remains constant at 200 mV, the ATP synthase synthesizes 
ATP until a ratio of ATP to ADP and Pi is reached where AG A tp synthesis is just equal 
to +13.8 kcal/mole (57.7 kj/mol; here AGatp synthesis + AGj H + = 0). At this point 
there is no further net ATP synthesis or hydrolysis by the ATP synthase. 

Suppose the energy- requiring reactions in the cytosol suddenly hydrolyze a 
large amount of ATR causing the ATPiADP ratio in the matrix to fall. Now the 
value of AGatp synthesis will decrease (see Figure 14-18). and ATP synthase will 
begin to synthesize ATP again to restore the original ATP:ADP ratio. Alterna- 
tively if die pro ton- motive force drops suddenly and is then maintained at a 
constant 160 mV, AG3H+ will change to -1 1.0 kcal/mole M6 kj/mol). As a result 
ATP synthase will start hydrolyzing some of the ATP in the matrix until a new 
balance of ATP to ADP and Pi is reached (where AGatp synthesis = +11.0 kcal/mole, 
or +46 kj/mol), and so on. 
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Figure 1 4-1 9 The ATP synthase is a 
reversible coupling device that can 
convert the energy of the 
electrochemical proton gradient into 
chemical-bond energy, or vice versa. 
<ATCG> The ATP synthase can either (A) 
synthesize ATP by harnessing the proton- 
motive force or (B) pump protons against 
their electrochemical gradient by 
hydrolyzing ATP. The direction of 
operation at any given instant depends 
on the net free-energy change (AG) for 
the coupled processes of H + translocation 
across the membrane and the synthesis 
of ATP from ADP and Pj. Measurement of 
the torque that the ATP synthase can 
produce when hydrolyzing ATP reveals 
that the synthase can pump 60 times 
more strongly than a diesel-engine of 
equal weight. 

The free-energy change (AG) for ATP 
hydrolysis depends on the 
concentrations of the three reactants ATR 
ADP, and Pj (see Figure 14-1 8); the AG for 
ATP synthesis is the negative of this 
value. The AG for proton translocation 
across the membrane is proportional to 
the proton-motive force. The conversion 
factor between them is the faraday.Thus, 
AGh+ = -0.023 (proton-motive force), 
where AGh+ is in kcal/mole and the 
proton-motive force is in mV. For an 
electrochemical proton gradient (proton- 
motive force) of 200 mV, 
AG H + = -4.6 kcal/mole (-19.2kJ/mole). 
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In many bacteria, the ATP synthase is routinely reversed in a transition 
between aerobic and anaerobic metabolism, as we shall see later. And the V- type 
ATPases that acidify organelles, which are architecturally similar to the ATP syn- 
thase, normally function in reverse (see Figure 13-36). Other membrane trans- 
port proteins that couple the transmembrane movement of an ion to ATP syn- 
thesis or hydrolysis share the same type of reversibility. Both the Na + -K + pump 
and the Ca 2+ pump described in Chapter 11, for example, normally hydrolyze 
ATP and use the energy released to move their specific ions across a membrane. 
If either of these pumps is exposed to an abnormally steep gradient of the ions 
it transports, however, it will act in reverse— synthesizing ATP from ADP and Pi 
instead of hydrolyzing it. Thus, the ATP synthase is by no means unique in its 
ability to convert the electrochemical energy stored in a transmembrane ion 
gradient directly into phosphate-bond energy in ATP 

Summary 

The mitochondrion performs most cellular oxidations and produces the bulk of the 
animal cell's ATP A mitochondrion is enclosed by two concentric membranes, and its 
major working part is the inner- most space (the matrix) and the inner membrane that 
surrounds it. The matrix contains a large variety of enzymes — including those that 
convert pyruvate and fatty acids to acetyl CoA and those that oxidize this acetyl CoA to 
C0 2 through the citric acid cycle. These oxidation reactions produce large amounts of 
NADH (and FADHz). The electron- transport chain (respiratory chain) located in the 
inner mitochondrial membrane then harnesses the energy available from combining 
molecular oxygen with the reactive electrons carried by the NADH and FADH2. 

The respiratory chain uses energy derived from electron transport to pump H + out 
of the matrix to create a transmembrane electrochemical proton (H + ) gradient, which 
includes contributions from both a membrane potential and a pH difference. The large 
amount of free energy released when H + flows back into the matrix (across the inner 
membrane) provides the basis for ATP production in the matrix by a remarkable pro- 
tein machine — the ATP synthase, a reversible coupling device between proton flows and 
ATP synthesis or hydrolysis. The transmembrane electrochemical gradient also drives 
the active transport of selected metabolites across the mitochondrial inner membrane, 
including an efficient ATP-ADP exchange between the mitochondrion and the cytosol 
that keeps the cell's ATP pool highly charged. The resulting high ratio of ATP to its 
hydrolysis products makes the free-energy change for ATP hydrolysis extremely favor- 
able, allowing this hydrolysis reaction to drive a large number of energy- requiring pro- 
cesses throughout the cell 

ELECTRON-TRANSPORT CHAINS AND THEIR 
PROTON PUMPS 

Having considered in general terms how a mitochondrion uses electron trans- 
port to create an electrochemical proton gradient, we now turn to the mecha- 
nisms that underlie this membrane-based energy-conversion process. In doing 
so, we also accomplish a larger purpose. As emphasized at the beginning of this 
chapter, mitochondria, chloroplasts, archaea, and bacteria use very similar 
chemiosmotic mechanisms. In fact, these mechanisms underlie the function of 
nearly all living organisms — including anaerobes that derive all their energy 
from electron transfers between two inorganic molecules. 

Our goal in this section is to explain how electron-transport process can 
pump protons across a membrane. We start with some of the basic principles on 
which this process depends. 

Protons Are Unusually Easy to Move 

Although protons resemble other positive ions such as Na + and K + in normally 
requiring proteins to move them across membranes, in some respects they are 
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Figure 14-20 How protons behave in water. (A) Protons move very rapidly 
along a chain of hydrogen- bonded water molecules. In this diagram, 
proton jumps are indicated by blue arrows, and hydronium ions are 
indicated by green shading. As discussed in Chapter 2, naked protons rarely 
exist as such; they are instead associated with a water molecule in the form 
of a hydronium ion, H 3 + . At a neutral pH (pH 7.0), the hydronium ions are 
present at a concentration of 10" 7 M. However, for simplicity, we usually 
refer to this as an H + concentration of 1 (T 7 M (see Panel 2-2, pp. 1 08-1 09). 
(B) Electron transfer can result in the transfer of entire hydrogen atoms, 
because protons are readily accepted from or donated to water inside cells. 
In this example, molecule A picks up an electron plus a proton when it is 
reduced, and B loses an electron plus a proton when it is oxidized. 

unique. Hydrogen atoms are by far the most abundant type of atom in living 
organisms; they are plentiful not only in all carbon-containing biological 
molecules, but also in the water molecules that surround them. The protons in 
water are highly mobile, flickering through the hydrogen-bonded network of 
water molecules by rapidly dissociating from one water molecule to associate 
with its neighbor, as illustrated in Figure 14-20A. Protons are thought to move 
across a protein pump embedded in a lipid bilayer in a similar way: they trans- 
fer from one amino acid side chain to another, following a special channel 
through the protein. 

Protons are also special with respect to electron transport. Whenever a 
molecule is reduced by acquiring an electron, the electron (e~) brings with it a 
negative charge. In many cases, the addition of a proton (H + ) from water rapidly 
neutralizes this charge, so that the net effect of the reduction is to transfer an 
entire hydrogen atom, H + + e~ (Figure 14-20B). Similarly, when a molecule is oxi- 
dized, a hydrogen atom removed from it can be readily dissociated into its con- 
stituent electron and proton — allowing the electron to be transferred separately 
to a molecule that accepts electrons, while the proton is passed to the water. 
Therefore, in a membrane in which electrons are being passed along an elec- 
tron-transport chain, pumping protons from one side of the membrane to 
another can be relatively simple. The electron carrier merely needs to be 
arranged in the membrane in a way that causes it to pick up a proton from one 
side of the membrane when it accepts an electron, and to release the proton on 
the other side of the membrane as the electron is passed to the next carrier 
molecule in the chain (Figure 14-21). 
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The Redox Potential Is a Measure of Electron Affinities 

In biochemical reactions, any electrons removed from one molecule are always 
passed to another, so that whenever one molecule is oxidized, another is 
reduced. Like any other chemical reaction, the tendency of such 
oxidation-reduction reactions, or redox reactions, to proceed spontaneously 
depends on the free-energy change (AG) for the electron transfer, which in turn 
depends on the relative affinities of the two molecules for electrons. 

Because electron transfers provide most of the energy for living things, it is 
worth taking the time to understand them. As discussed in Chapter 2, acids and 
bases donate and accept protons (see Panel 2-2, pp. 108-109). Acids and bases 
exist in conjugate acid-base pairs, in which the acid is readily converted into the 
base by the loss of a proton. For example, acetic acid (CH3COOH) is converted 
into its conjugate base (CH3COO") in the reaction: 

CH3COOH ^ CH3COO- + H + 

In exactly the same way, pairs of compounds such as NADH and NAD + are 
called redox pairs, since NADH is converted to NAD + by the loss of electrons in 
the reaction: 

NADH ^ NAD + + H + + 2e~ 
NADH is a strong electron donor: because its electrons are held in a high-energy 
linkage, the free-energy change for passing its electrons to many other molecules 
is favorable (see Figure ml4-9/14-9). It is difficult to form a high-energy linkage. 
Therefore its redox partner, NAD + , is of necessity a weak electron acceptor. 
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We can measure the tendency to transfer electrons from any redox pair 
experimentally. All that is required is the formation of an electrical circuit link- 
ing a 1:1 (equimolar) mixture of the redox pair to a second redox pair that has 
been arbitrarily selected as a reference standard, so that we can measure the 
voltage difference between them (Panel 14-1, p. 830). This voltage difference is 
defined as the redox potential; as defined, electrons move spontaneously from 
a redox pair like NADH/NAD+ with a low redox potential (a low affinity for elec- 
trons) to a redox pair like 2 /H 2 with a high redox potential (a high affinity for 
electrons). Thus, NADH is a good molecule for donating electrons to the respi- 
ratory chain, while 2 is well suited to act as the "sink" for electrons at the end of 
the pathway. As explained in Panel 14-1, the difference in redox potential, A£b', 
is a direct measure of the standard free-energy change (AG ) for the transfer of 
an electron from one molecule to another. 



Electron Transfers Release Large Amounts of Energy 

As just discussed, those pairs of compounds that have the most negative redox 
potentials have the weakest affinity for electrons and therefore contain carriers 
with the strongest tendency to donate electrons. Conversely those pairs that have 
the most positive redox potentials have the strongest affinity for electrons and 
therefore contain carriers with the strongest tendency to accept electrons. A 1:1 
mixture of NADH and NAD + has a redox potential of -320 mV, indicating that 
NADH has a strong tendency to donate electrons; a 1:1 mixture of H 2 and V2O2 
has a redox potential of +820 mY indicating that 2 has a strong tendency to 
accept electrons. The difference in redox potential is 1.14 volts (1140 mV), which 
means that the transfer of each electron from NADH to 2 under these standard 
conditions is enormously favorable, where AG = -26.2 kcal/mole (110 kj/mol), or 
twice this amount for the two electrons transferred per NADH molecule (see Panel 
14-1). If we compare this free-energy change with that for the formation of the 
phosphoanhydride bonds in ATR where AG° = -7.3 kcal/mole (-30.5 kj/mol; see 
Figure 2-75), we see that the oxidization of one NADH molecule releases more 
than enough energy to synthesize several molecules of ATP from ADP and Pj. 

Living systems could certainly have evolved enzymes that would allow 
NADH to donate electrons directly to 2 to make water in the reaction: 

2H + + 2<r + Vfe0 2 -» H 2 
But because of the huge free-energy drop, this reaction would proceed with 
almost explosive force and nearly all of the energy would be released as heat. 
Cells perform this reaction much more gradually by passing the high-energy 
electrons from NADH to 2 via the many electron carriers in the electron-trans- 
port chain. Each successive carrier in the chain holds its electrons more tightly, 
so that the highly energetically favorable reaction 2H + + 2e~ + V2O2 -> H 2 occurs 
in many small steps. This stepwise process allows the cell to store nearly half of 
the energy that is released. 

Spectroscopic Methods Identified Many Electron Carriers in the 
Respiratory Chain 

Many of the electron carriers in the respiratory chain absorb visible light and 
change color when they are oxidized or reduced. In general, each has an absorp- 
tion spectrum and reactivity that are distinct enough to allow its behavior to be 
traced spectroscopically even in crude mixtures. It was therefore possible to 
purify these components long before their exact functions were known. Thus, 
the cytochromes were discovered in 1925 as compounds that undergo rapid oxi- 
dation and reduction in living organisms as disparate as bacteria, yeasts, and 
insects. By observing cells and tissues with a spectroscope, researchers identi- 
fied three types of cytochromes by their distinctive absorption spectra and des- 
ignated them as cytochromes a, b } and c. This nomenclature has survived, even 
though cells are now known to contain several cytochromes of each type and the 
classification into types is not functionally important. 
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Figure 14-21 How protons can be 
pumped across membranes. As an 

electron passes along an electron- 
transport chain embedded in a lipid- 
bilayer membrane, it can bind and 
release a proton at each step. In this 
diagram, electron carrier B picks up a 
proton (H + ) from one side of the 
membrane when it accepts an electron 
(e~) from carrier A; it releases the proton 
to the other side of the membrane when 
it donates its electron to carrier C. 



830 PANEL 14-1: Redox Potentials 



HOW REDOX POTENTIALS ARE MEASURED 
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One beaker (left) contains substance A, with an equimolar mixture 
of the reduced (A reduced ) and oxidized (A oxidized ) members of its 
redox pair. The other beaker contains the hydrogen reference 
standard (2H + + 2e~ ^H 2 ), whose redox potential is arbitrarily 
assigned as zero by international agreement. (A salt bridge formed 
from a concentrated KCI solution allows the ions K + and CI" to move 
between the two beakers, as required to neutralize the charges in 
each beaker when electrons flow between them.) The metal wire 
(red) provides a resistance-free path for electrons, and a voltmeter 
then measures the redox potential of substance A. If electrons flow 
from A reduced to H + , as indicated here, the redox pair formed by 
substance A is said to have a negative redox potential. If they 
instead flow from H 2 to A oxidi2ed , the redox pair is said to have a 
positive redox potential. 



SOME STANDARD REDOX POTENTIALS AT pH7 



By convention, the redox potential for a redox pair is 
designated 5. For the standard state, with all reactants 
at a concentration of 1 M, including H + , one can determine 
a standard redox potential, designated E . Since biological 
reactions occur at pH 7, biologists use a different standard 
state in which A reduced = A oxidized and H + = 10~ 7 M. This 
standard redox potential is designated E ' . A few examples of 
special relevance to oxidative phosphorylation are given here. 
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AG°=-n(0.023) A£' , where n is the number of 
electrons transferred across a redox potential change 
of AE' millivolts (mV) 

Example: The transfer of one electron from NADH to 
ubiquinone has a favorable AG°of -8.0 kcal/mole 
(-1 .9 kJ/mole), whereas the transfer of one electron 
from ubiquinone to oxygen has an even more 
favorable AG°of -18.2 kcal/mole (-4.35 kJ/mole). The 
AG°value for the transfer of one electron from 
NADH to oxygen is the sum of these two values, 
-26.2 kcal/mole. 



The actual free-energy change for a reaction, AG, depends on the 
concentration of the reactants and generally is different from the 
standard free-energy change, AG°. The standard redox potentials 
are for a 1:1 mixture of the redox pair. For example, the standard 
redox potential of -320 mV is for a 1:1 mixture of NADH and 
NAD + . But when there is an excess of NADH over NAD + , electron 
transfer from NADH to an electron acceptor becomes more 
favorable. This is reflected by a more negative redox potential 
and a more negative AG for electron transfer. 
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The cytochromes constitute a family of colored proteins that are related by 
the presence of a bound heme group, whose iron atom changes from the ferric 
oxidation state (Fe 3+ ) to the ferrous oxidation state (Fe 2+ ) whenever it accepts an 
electron. The heme group consists of a porphyrin ring with a tightly bound iron 
atom held by four nitrogen atoms at the corners of a square (Figure 14-22). A 
similar porphyrin ring is responsible for the red color of blood and for the green 
color of leaves, being bound to iron in hemoglobin and to magnesium in chloro- 
phyll, respectively. 

In iron-sulfur proteins, a second major family of electron carriers, either two 
or four iron atoms are bound to an equal number of sulfur atoms and to cysteine 
side chains, forming an iron-sulfur center on the protein (Figure 14-23). There 
are more iron-sulfur centers than cytochromes in the respiratory chain. But 
their spectroscopic detection requires electron paramagnetic resonance (EPR) 
spectroscopy, and they are less completely characterized. Like the cytochromes, 
these centers carry one electron at a time. 

The simplest of the electron carriers in the respiratory chain — and the only 
one that is not part of a protein — is a quinone (called ubiquinone, or coenzyme 
Q). A quinone (Q) is a small hydrophobic molecule that is freely mobile in the 
lipid bilayer and can pick up or donate either one or two electrons; upon reduc- 
tion, it picks up a proton from the medium along with each electron it carries 
(Figure 14-24). 

In addition to six different hemes linked to cytochromes, more than seven 
iron-sulfur centers, and ubiquinone, there are also two copper atoms and a 
flavin serving as electron carriers tightly bound to respiratory-chain proteins in 
the pathway from NADH to oxygen. This pathway involves more than 60 differ- 
ent proteins in all. 

As we would expect, these electron carriers have higher and higher affinities 
for electrons (greater redox potentials) as the electrons move along the respira- 
tory chain. The redox potentials have been fine-tuned during evolution by the 
binding of each electron carrier in a particular protein context, which can alter 
its normal affinity for electrons. However, because iron-sulfur centers have a rel- 
atively low affinity for electrons, they predominate in the early part of the respi- 
ratory chain; in contrast, the cytochromes predominate further down the chain, 
where a higher affinity for electrons is required. 

The order of the individual electron carriers in the chain was determined by 
sophisticated spectroscopic measurements (Figure 14-25), and many of the pro- 
teins were initially isolated and characterized as individual polypeptides. A major 
advance in understanding the respiratory chain, however, was the later realiza- 
tion that most of the proteins are organized into three large enzyme complexes. 
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Figure 1 4-22 The structure of the heme 
group attached covalently to 
cytochrome c. The porphyrin ring is 
shown in blue. There are five different 
cytochromes in the respiratory chain. 
Because the hemes in different 
cytochromes have slightly different 
structures and are held by their 
respective proteins in different ways, 
each of the cytochromes has a different 
affinity for an electron. 



The Respiratory Chain Includes Three Large Enzyme Complexes 
Embedded in the Inner Membrane 

Membrane proteins are difficult to purify as intact complexes because they are 
insoluble in aqueous solutions, and some of the detergents required to solubi- 
lize them can destroy normal protein-protein interactions. In the early 1960s, 
however, researchers discovered that relatively mild ionic detergents, such as 
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Figure 1 4-23 The structures of two types 
of iron-sulfur centers. (A) A center of the 
2Fe2S type. (B) A center of the 4Fe4S type. 
Although they contain multiple iron atoms, 
each iron-sulfur center can carry only one 
electron at a time. There are more than 
seven different iron-sulfur centers in the 
respiratory chain. 
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deoxycholate, can solubilize selected components of the inner mitochondrial 
membrane in their native form. This permitted the identification and purifica- 
tion of the three major membrane-bound respiratory enzyme complexes in 
the pathway from NADH to oxygen. Each of these purified complexes can be 
inserted into lipid bilayer vesicles and shown to pump protons across the 
bilayer as electrons pass through it. In the mitochondrion, the three complexes 
are asymmetrically oriented in the inner membrane, and they are linked in 
series as electron-transport-driven H + pumps that pump protons out of the 
matrix (Figure 14-26): 

1. The NADH dehydrogenase complex (generally known as complex I) is the 
largest of the respiratory enzyme complexes, containing more than 40 
polypeptide chains. It accepts electrons from NADH and passes them 
through a flavin and at least seven iron-sulfur centers to ubiquinone. 
Ubiquinone then transfers its electrons to a second respiratory enzyme 
complex, the cytochrome b-C\ complex. 

2. The cytochrome b-c\ complex contains at least 11 different polypeptide 
chains and functions as a dimer. Each monomer contains three hemes 
bound to cytochromes and an iron-sulfur protein. The complex accepts 
electrons from ubiquinone and passes them on to cytochrome c, which 
carries its electron to the cytochrome oxidase complex. 

3. The cytochrome oxidase complex also functions as a dimer; each 
monomer contains 13 different polypeptide chains, including two 
cytochromes and two copper atoms. The complex accepts one electron at 
a time from cytochrome c and passes them four at a time to oxygen. 

The cytochromes, iron-sulfur centers, and copper atoms can carry only one 
electron at a time. Yet each NADH donates two electrons, and each 2 molecule 
must receive four electrons to produce water. There are several electron-collect- 
ing and electron-dispersing points along the electron-transport chain that coor- 
dinate these changes in electron number. The most obvious of these is 
cytochrome oxidase. 



An Iron-Copper Center in Cytochrome Oxidase Catalyzes Efficient 
2 Reduction 

Because oxygen has a high affinity for electrons, it releases a large amount of 
free energy when it is reduced to form water. Thus, the evolution of cellular 
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Figure 14-24 Quinone electron carriers. 

Ubiquinone in the respiratory chain picks 
up one H + from the aqueous 
environment for every electron it accepts, 
and it can carry either one or two 
electrons as part of a hydrogen atom 
(yellow). When reduced ubiquinone 
donates its electrons to the next carrier in 
the chain, these protons are released. 
A long hydrophobic tail confines 
ubiquinone to the membrane and 
consists of 6-10 five-carbon isoprene 
units, the number depending on the 
organism. The corresponding electron 
carrier in the photosynthetic membranes 
of chloroplasts is plastoquinone, which is 
almost identical in structure. For 
simplicity, we refer to both ubiquinone 
and plastoquinone in this chapter as 
quinone (abbreviated as Q). 



Figure 14-25 The general methods used 
to determine the path of electrons along 
an electron-transport chain. The extent of 
oxidation of electron carriers a, b, c, and d 
is continuously monitored by following 
their distinct spectra, which differ in their 
oxidized and reduced states. In this 
diagram, darker red indicates an increased 
degree of oxidation. (A) Under normal 
conditions, where oxygen is abundant, all 
carriers are in a partly oxidized state. The 
addition of a specific inhibitor causes the 
downstream carriers to become more 
oxidized (red) and the upstream carriers to 
become more reduced. (B) In the absence 
of oxygen, all carriers are in their fully 
reduced state (gray). The sudden addition 
of oxygen converts each carrier to its 
partly oxidized form with a delay that is 
greatest for the most upstream carriers. 
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Figure 14-26 The path of electrons 
through the three respiratory enzyme 
complexes. The approximate size and 
shape of each complex are shown. 
During the transfer of electrons from 
NADH to oxygen (red lines), ubiquinone 
and cytochrome c serve as mobile 
carriers that ferry electrons from one 
complex to the next. As indicated, 
protons are pumped across the 
membrane by each of the respiratory 
enzyme complexes. 
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respiration, in which O2 is converted to water, enabled organisms to harness 
much more energy than can be derived from anaerobic metabolism. This is pre- 
sumably why all higher organisms respire. The ability of biological systems to 
use O2 in this way however, requires a very sophisticated chemistry. We can tol- 
erate O2 in the air we breathe because it has trouble picking up its first electron; 
this fact has allowed cells to control its initial reaction through enzymatic catal- 
ysis. But once a molecule of O2 has picked up one electron to form a superoxide 
radical (O2"), it becomes dangerously reactive and rapidly takes up an additional 
three electrons wherever it can find them. The cell can use O2 for respiration 
only because cytochrome oxidase holds onto oxygen at a special bimetallic cen- 
ter, where it remains clamped between a heme-linked iron atom and a copper 
atom until it has picked up a total of four electrons. Only then can the two oxy- 
gen atoms of the oxygen molecule be safely released as two molecules of water 
(Figure 14-27). 
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Figure 14-27 The reaction of 2 with electrons in cytochrome oxidase. As indicated, the iron atom in heme a serves as an 
electron queuing point; this heme feeds four electrons into an 2 molecule held at the bimetallic center active site, which 
is formed by the other heme-linked iron (in heme o 3 ) and a closely opposed copper atom. Note that four protons are 
pumped out of the matrix for each 2 molecule that undergoes the reaction 4e~ + 4H + + 2 -» 2H 2 0. 
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The cytochrome oxidase reaction accounts for about 90% of the total oxygen 
uptake in most cells. This protein complex is therefore crucial for all aerobic life. 
Cyanide and azide are extremely toxic because they bind tightly to the cell's 
cytochrome oxidase complexes to stop electron transport, thereby greatly 
reducing ATP production. 

Although the cytochrome oxidase in mammals contains 13 different protein 
subunits, most of these seem to have a subsidiary role, helping to regulate either 
the activity or the assembly of the three subunits that form the core of the 
enzyme. The complete structure of this large enzyme complex has been deter- 
mined by x-ray crystallography, as illustrated in Figure 14-28. The atomic reso- 
lution structures, combined with mechanistic studies of the effect of precisely 
tailored mutations introduced into the enzyme by genetic engineering of the 
yeast and bacterial proteins, are revealing the detailed mechanisms of this finely 
tuned protein machine. 



Figure 14-28 The molecular structure of 
cytochrome oxidase. This protein is a 
dimer formed from a monomer with 
1 3 different protein subunits (monomer 
mass of 204,000 daltons).The three 
colored subunits are encoded by the 
mitochondrial genome, and they form 
the functional core of the enzyme. As 
electrons pass through this protein on 
the way to its bound O2 molecule, they 
cause the protein to pump protons 
across the membrane (see Figure 14-27). 
(A) The entire protein is shown, 
positioned in the inner mitochondrial 
membrane. (B) The electron carriers are 
located in subunits I and II, as indicated. 



Electron Transfers in the Inner Mitochondrial Membrane Are 
Mediated by Electron Tunneling during Random Collisions 

The two components that carry electrons between the three major enzyme com- 
plexes of the respiratory chain — ubiquinone and cytochrome c— diffuse rapidly 
in the plane of the inner mitochondrial membrane. The expected rate of random 
collisions between these mobile carriers and the more slowly diffusing enzyme 
complexes can account for the observed rates of electron transfer (each complex 
donates and receives an electron about once every 5-20 milliseconds). 

The ordered transfer of electrons along the respiratory chain is due entirely 
to the specificity of the functional interactions between the components of the 
chain: each electron carrier is able to interact only with the carrier adjacent to it 
in the sequence shown in Figure 14-26, with no short circuits. 

Electrons move between the molecules that carry them in biological systems 
not only by moving along covalent bonds within a molecule, but also by jump- 
ing across a gap as large as 2 nm. The jumps occur by electron "tunneling," a 
quantum-mechanical property that is critical for the processes we are dis- 
cussing. Insulation prevents short circuits that would otherwise occur when an 
electron carrier with a low redox potential collides with a carrier with a high 
redox potential. This insulation seems to be provided by carrying an electron 
deep enough inside a protein to prevent its tunneling interactions with an inap- 
propriate partner. 
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We discuss next how the changes in redox potential from one electron car- 
rier to the next are harnessed to pump protons out of the mitochondrial matrix. 



A Large Drop in Redox Potential Across Each of the Three 
Respiratory Enzyme Complexes Provides the Energy for 
H + Pumping 

We have previously discussed how the redox potential reflects electron affinities 
(see p. 76). Figure 14-29 presents an outline of the redox potentials measured 
along the respiratory chain. These potentials drop in three large steps, one 
across each major respiratory complex. The change in redox potential between 
any two electron carriers is directly proportional to the free energy released 
when an electron transfers between them. Each enzyme complex acts as an 
energy-conversion device by harnessing some of this free-energy change to 
pump H + across the inner membrane, thereby creating an electrochemical pro- 
ton gradient as electrons pass through that complex. This conversion can be 
demonstrated by purifying each respiratory enzyme complex and incorporating 
it separately into liposomes: when an appropriate electron donor and acceptor 
are added so that electrons can pass through the complex, protons are translo- 
cated across the liposome membrane. 



Figure 14-29 Redox potential changes 
along the mitochondrial electron- 
transport chain. The redox potential 
(designated F ) increases as electrons 
flow down the respiratory chain to 
oxygen. The standard free-energy 
change, AG°, for the transfer of each of 
the two electrons donated by an NADH 
molecule can be obtained from the 
left-hand ordinate (AG = -n(0.023) A£' , 
where n is the number of electrons 
transferred across a redox potential 
change of AF' mV). Electrons flow 
through a respiratory enzyme complex 
by passing in sequence through the 
multiple electron carriers in each 
complex. As indicated, part of the 
favorable free-energy change is 
harnessed by each enzyme complex to 
pump H + across the inner mitochondrial 
membrane. The NADH dehydrogenase 
and cytochrome b-c-\ complexes each 
pump two H + per electron, whereas the 
cytochrome oxidase complex pumps one. 
Note that NADH is not the only source 
of electrons for the respiratory chain. The 
flavin FADH 2 is also generated by fatty 
acid oxidation (see Figure 2-81) and by 
the citric acid cycle (see Figure 2-82). Its 
two electrons are passed directly to 
ubiquinone, bypassing NADH 
dehydrogenase; they therefore cause less 
H + pumping than the two electrons 
transported from NADH. 



The H + Pumping Occurs by Distinct Mechanisms in the Three 
Major Enzyme Complexes 

Some respiratory enzyme complexes pump one H + per electron across the inner 
mitochondrial membrane, whereas others pump two. The detailed mechanism 
by which electron transport is coupled to H + pumping is different for the three 
different enzyme complexes. In the cytochrome b-c\ complex, the quinones 
clearly have a role. As mentioned previously, a quinone picks up a H + from the 
aqueous medium along with each electron it carries and liberates it when it 
releases the electron (see Figure 14-24). Since ubiquinone is freely mobile in the 
lipid bilayer, it could accept electrons near the inside surface of the membrane 
and donate them to the cytochrome b-c\ complex near the outside surface, 
thereby transferring one H + across the bilayer for every electron transported. 
Two protons are pumped per electron in the cytochrome b-c\ complex, however. 
The complicated series of electron transfers that make this possible are still 
being worked out at the atomic level, aided by the complete structure of the 
cytochrome b-c\ complex determined by x-ray crystallography (Figure 14-30). 
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Figure 14-30 The atomic structure of cytochrome b-ci.This protein is a dimer.The 240,000-dalton 
monomer is composed of 1 1 different protein molecules in mammals. The three colored proteins form the 
functional core of the enzyme: cytochrome b (green), cytochrome C] (blue), and the Rieske protein 
containing an iron-sulfur center (purple). (A) The interaction of these three proteins across the two 
monomers. (B) Their electron carriers, along with the entrance and exit sites for electrons. The electrons 
initially donated by ubiquinone follow a complex path of electron and proton transfer reactions through 
the protein complex that enhances the storage of redox energy. This process, in which some of the 
electrons are recycled back into the quinone pool, is known as the Q cycle. 



Electron transport causes allosteric changes in protein conformations that 
can also pump H + , just as H + is pumped when ATP is hydrolyzed by the ATP syn- 
thase running in reverse. For both the NADH dehydrogenase complex and the 
cytochrome oxidase complex, it seems likely that electron transport drives 
sequential allosteric changes in protein conformation by altering the redox state 
of the components. These conformational changes in turn cause the protein to 
pump H + across the mitochondrial inner membrane. This type of H + pumping 
requires at least three distinct conformations for the pump protein; a general 
mechanism is presented in Figure 14-31. 

Now that we have discussed the mechanistic basis for electron transport and 
proton pumping, we are ready to consider how the respiratory chain is regulated 
to make it optimally useful to the cell. 



H + lonophores Uncouple Electron Transport from ATP Synthesis 

Since the 1940s, several substances — such as 2,4-dinitrophenol — have been 
known to act as uncoupling agents, uncoupling electron transport from ATP syn- 
thesis. The addition of these low-molecular-weight organic compounds to cells 
stops ATP synthesis by mitochondria without blocking their uptake of oxygen. In 
the presence of an uncoupling agent, electron transport and H + pumping con- 
tinue at a rapid rate, but no H + gradient is generated. The explanation for this 
effect is both simple and elegant: uncoupling agents are lipid-soluble weak acids 
that act as diffusible H + carriers in the lipid bilayer (H + ionophores), and they 
provide a pathway for the flow of H + across the inner mitochondrial membrane 
that bypasses the ATP synthase. As a result of this short-circuiting, the proton- 
motive force is dissipated completely, and ATP can no longer be made. 



ELECTRON-TRANSPORT CHAINS AND THEIR PROTON PUMPS 



837 



INCREASE m 

AFFINITY 

FORH + 

(low -* high} 



CONFORMATION A 

(high affinity for H + ) 






relaxation 



CONFORMATION C 

(low affinity for H + ) 

H + proton release 






energy input 




DROP IN 

AFFINITY 

FORH + 

(high-*- low) 



H + 
proton uptake 




CONFORMATION B 

(high affinity for H + ) 



Respiratory Control Normally Restrains Electron Flow Through 
the Chain 

The addition of an uncoupler such as dinitrophenol to cells causes mitochon- 
dria to increase their rate of electron transport substantially, resulting in an 
increase in oxygen uptake that reflects the existence of respiratory control. The 
control is thought to act via a direct inhibitory influence of the electrochemical 
proton gradient on the rate of electron transport. When an uncoupler collapses 
the gradient, electron transport is free to run unchecked at the maximal rate. As 
the gradient increases, electron transport becomes more difficult, and the pro- 
cess slows. Moreover, if an artificially large electrochemical proton gradient is 
experimentally created across the inner membrane, normal electron transport 
stops completely, and a reverse electron flow can be detected in some sections of 
the respiratory chain. This observation suggests that respiratory control reflects 
a simple balance between the free-energy change for electron-transport-linked 
proton pumping and the free-energy change for electron transport — that is, the 
magnitude of the electrochemical proton gradient affects both the rate and the 
direction of electron transport, just as it affects the directionality of the ATP syn- 
thase (see Figure 14-19). 

Respiratory control is just one part of an elaborate interlocking system of 
feedback controls that coordinate the rates of glycolysis, fatty acid breakdown, 
the citric acid cycle, and electron transport. The rates of all of these processes are 
adjusted to the ATP:ADP ratio, increasing whenever an increased utilization of 
ATP causes the ratio to fall. The ATP synthase in the inner mitochondrial mem- 
brane, for example, works faster as the concentrations of its substrates ADP and 
Pi increase. As it speeds up, the enzyme lets more H + flow into the matrix and 
thereby dissipates the electrochemical proton gradient more rapidly. The falling 
gradient, in turn, enhances the rate of electron transport. 

Similar controls, including feedback inhibition of several key enzymes by 
ATP, act to adjust the rates of NADH production to the rate of NADH utilization 



Figure 14-31 A general model for 
H + pumping. This model for H + pumping 
by a transmembrane protein is based on 
mechanisms that are thought to be used 
by NADH dehydrogenase and 
cytochrome oxidase, as well as by the 
light-driven procaryotic proton pump, 
bacteriorhodopsin.The protein is driven 
through a cycle of three conformations: 
A, B, and C. As indicated by their vertical 
spacing, these protein conformations 
have different energies. In conformation 
A, the protein has a high affinity for H + , 
causing it to pick up a H + on the inside of 
the membrane. In conformation C, the 
protein has a low affinity for H + , causing 
it to release a H + on the outside of the 
membrane. The transition from 
conformation B to conformation C that 
releases the H + is energetically 
unfavorable, and it occurs only because it 
is driven by being allosterically coupled 
to an energetically favorable reaction 
occurring elsewhere on the protein (blue 
arrow). The other two conformational 
changes, A — > B and C — » A, lead to states 
of lower energy, and they proceed 
spontaneously. Because the overall cycle 
A — > B -> C -> A releases free energy, 
H + is pumped from the inside (the matrix 
in mitochondria) to the outside (the 
intermembrane space in mitochondria). 
For NADH dehydrogenase and 
cytochrome oxidase, the energy required 
for the transition B — > C is provided by 
electron transport, whereas for 
bacteriorhodopsin this energy is 
provided by light (see Figure 10-33). For 
yet other proton pumps, the energy is 
derived from ATP hydrolysis. In all cases, 
at least three distinct conformations are 
required to create a vectorial pumping 
process, for the same reason that at 
least three conformations are required 
to create a protein that can walk in a 
single direction along a filament (see 
Figure 3-77). 
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by the respiratory chain, and so on. As a result of these multiple control mecha- 
nisms, the body oxidizes fats and sugars 5-10 times more rapidly during a period 
of strenuous exercise than during a period of rest. 



Natural Uncouplers Convert the Mitochondria in Brown Fat into 
Heat-Generating Machines 

In some specialized fat cells, mitochondrial respiration is normally uncoupled 
from ATP synthesis. In these cells, known as brown fat cells, most of the energy 
of oxidation is dissipated as heat rather than being converted into ATR The inner 
membranes of the large mitochondria in these cells contain a special transport 
protein, known as an uncoupling protein, that allows protons to move down 
their electrochemical gradient without passing through ATP synthase. This 
uncoupling protein is switched on when heat generation is required, causing the 
cells to oxidize their fat stores at a rapid rate and produce more heat than ATP 
Tissues containing brown fat serve as "heating pads," helping to revive hiber- 
nating animals and to protect sensitive areas of newborn human babies from 
the cold. 



The Mitochondrion Has Many Critical Roles in Cell Metabolism 

Cells are largely composed of macromolecules, which are constantly in need of 
repair or replacement as the cell ages. Even for cells and organisms that are not 
growing, those molecules that decay must be replaced by biosyntheses. 
Throughout this chapter, we emphasize the critical role of mitochondria in pro- 
ducing the ATP that cells need to maintain themselves as highly organized enti- 
ties in a universe that is always driving toward increasing disorder (discussed in 
Chapter 2). In addition to ATR however, the biosynthesis in the cytosol requires 
a constant supply of reducing power in the form of NADPH and of carbon skele- 
tons. Most descriptions of this type of biosynthesis state that the needed carbon 
skeletons come directly from the breakdown of sugars, whereas the NADPH is 
produced in the cytosol by a side pathway for the breakdown of sugars (the pen- 
tose phosphate pathway, an alternative to glycolysis). But under conditions 
where foodstuffs are abundant, and plenty of ATP is available, mitochondria also 
generate both carbon skeletons and NADPH needed for cell growth. For this pur- 
pose, excess citrate produced in the mitochondrial matrix by the citric acid cycle 
is transported down its electrochemical gradient to the cytosol, where it is metab- 
olized to produce both NADPH and carbon skeletons for biosyntheses. Thus, for 
example, as part of a cell's response to growth signals, large amounts of acetyl 
CoA are produced in the cytosol from citrate exported from mitochondria, accel- 
erating the production of the fatty acids and sterols that build new membranes. 

Mitochondria are also critical for buffering the redox potential in the cytosol. 
Cells need a constant supply of the electron acceptor NAD + for the central reac- 
tion in glycolysis that converts glyceraldehyde 3-phosphate to 1,3-bisphospho- 
glycerate (see Figure 2-72). This NAD + is converted to NADH in the process, and 
the NAD + needs to be regenerated by transferring the high-energy NADH elec- 
trons somewhere. 

The NADH electrons will eventually be used to help drive oxidative phos- 
phorylation inside the mitochondrion. But the mitochondrial inner membrane 
is impermeable to NADH. The electrons are therefore passed from the NADH to 
smaller molecules in the cytosol that can move through the inner mitochondrial 
membrane. Once in the matrix, these smaller molecules transfer their electrons 
to NAD + to form mitochondrial NADH, after which they are returned to the 
cytosol for recharging. This so-called shuttle system is bypassed in some spe- 
cialized cells, such as insect flight muscle, that produce especially large amounts 
of ATP by aerobic glycolysis. Here the high-energy electrons derived from glyc- 
eraldehyde 3-phosphate are passed directly to the outer surface of the mito- 
chondrial inner membrane, enabling them to enter the electron transport chain 
more rapidly and directly — but with loss of some of the usable energy. 
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Figure 1 4-32 Critical roles of 
mitochondria in cell metabolism 
besides ATP production. The many 
essential metabolic reactions carried out 
by mitochondria, such as those illustrated 
here, emphasize the inadequacy of our 
emphasis on mitochondria as the cell 
furnace that oxidizes pyruvate and fatty 
acids to feed oxidative phosphorylation. 



Under conditions of starvation, proteins in our bodies are broken down to 
amino acids, and the amino acids are imported into mitochondria and oxidized 
to produce NADH for ATP production. Thus, by carrying out different reactions 
under different conditions, the mitochondrion has many critical functions in 
maintaining cellular metabolism (Figure 14-32). 



Bacteria Also Exploit Chemiosmotic Mechanisms to Harness 
Energy 

Bacteria use enormously diverse energy sources. Some, like animal cells, are aer- 
obic; they synthesize ATP from sugars they oxidize to C0 2 and H 2 by glycolysis, 
the citric acid cycle, and a respiratory chain in their plasma membrane that is 
similar to the one in the inner mitochondrial membrane. Others are strict anaer- 
obes, deriving their energy either from glycolysis alone (by fermentation) or 
from an electron-transport chain that employs a molecule other than oxygen as 
the final electron acceptor. The alternative electron acceptor can be a nitrogen 
compound (nitrate or nitrite), a sulfur compound (sulfate or sulfite), or a carbon 
compound (fumarate or carbonate), for example. A series of electron carriers in 
the plasma membrane that are comparable to those in mitochondrial respira- 
tory chains transfer the electrons to these acceptors. 

Despite this diversity, the plasma membrane of the vast majority of bacteria 
contains an ATP synthase that is very similar to the one in mitochondria. In bac- 
teria that use an electron-transport chain to harvest energy, the electron-trans- 
port chain pumps H + out of the cell and thereby establishes a proton-motive 
force across the plasma membrane that drives the ATP synthase to make ATP. In 
other bacteria, the ATP synthase works in reverse, using the ATP produced by 
glycolysis to pump H + and establish a proton gradient across the plasma mem- 
brane. The ATP used for this process is generated by fermentation processes 
(discussed in Chapter 2). 

Thus, most bacteria, including the strict anaerobes, maintain a proton gra- 
dient across their plasma membrane. It can be harnessed to drive a flagellar 
motor, and it is used to pump Na + out of the bacterium via a Na + -H + antiporter 
that takes the place of the Na + -K + pump of eucaryotic cells. This gradient is also 
used for the active inward transport of nutrients, such as most amino acids and 
many sugars: each nutrient is dragged into the cell along with one or more pro- 
tons through a specific symporter (Figure 14-33). In animal cells, by contrast, 



Figure 14-33 The importance of H + -driven transport in bacteria. 

A proton-motive force generated across the plasma membrane pumps 
nutrients into the cell and expels Na + . (A) In an aerobic bacterium, a 
respiratory chain produces an electrochemical proton gradient across the 
plasma membrane; this gradient is then used both to transport some 
nutrients into the cell and to make ATP. (B) The same bacterium growing 
under anaerobic conditions derives its ATP from glycolysis. The ATP 
synthase then hydrolyzes some of this ATP to establish an electrochemical 
proton gradient that drives those transport processes that depend on the 
respiratory chain in (A). 
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most inward transport across the plasma membrane is driven by the Na + gradi- 
ent (high Na + outside, low Na + inside) that is established by the Na + -K + pump. 

Some unusual bacteria have adapted to live in a very alkaline environment 
and yet must maintain their cytoplasm at a physiological pH. For these cells, any 
attempt to generate an electrochemical H + gradient would be opposed by a large 
H + concentration gradient in the wrong direction (H + higher inside than out- 
side). Presumably for this reason, some of these bacteria substitute Na + for H + in 
all of their chemiosmotic mechanisms. The respiratory chain pumps Na + out of 
the cell, the transport systems and flagellar motor are driven by an inward flux 
of Na + , and a Na + -driven ATP synthase synthesizes ATP. The existence of such 
bacteria demonstrates that the principle of chemiosmosis is more fundamental 
than the proton-motive force on which it is normally based. 



Summary 

The respiratory chain embedded in the inner mitochondrial membrane contains three 
respiratory enzyme complexes through which electrons pass on their way from NADH 
to O2. Each complex can be purified, inserted into synthetic lipid vesicles, and then 
shown to pump H + when electrons are transported through it In these complexes, elec- 
trons are transferred along a series of protein-bound electron carriers, including hemes 
and iron-sulfur centers. Energy released as the electrons move to lower and lower 
energy levels is used to drive allosteric changes in each respiratory enzyme complex 
that help to pump protons. Electrons are carried between enzyme complexes by the 
mobile electron carriers ubiquinone and cytochrome c to complete the electron- trans- 
port chain. The path of electron flow is NADH -» NADH dehydrogenase complex -> 
ubiquinone -^ cytochrome b-ci complex -> cytochrome c -> cytochrome oxidase com- 
plex^ molecular oxygen (O2). 

The coupling of the energetically favorable transport of electrons to the pumping 
ofH + out of the matrix creates an electrochemical proton gradient. This gradient is 
harnessed to make ATP by the ATP synthase, through which H + flows back into the 
matrix. The universal presence of ATP synthase in mitochondria, chloroplasts, and 
procaryotes testifies to the central importance of chemiosmotic mechanisms in cells. 



CHLOROPLASTS AND PHOTOSYNTHESIS 

All animals and most microorganisms rely on the continual uptake of large 
amounts of organic compounds from their environment. These compounds 
provide both the carbon skeletons for biosynthesis and the metabolic energy 
that drives cellular processes. It is likely that the first organisms on the primitive 
Earth had access to an abundance of the organic compounds produced by geo- 
chemical processes, but it is clear that most of these original compounds were 
used up billions of years ago. Since that time, nearly all of the organic materials 
required by living cells have been produced by photosynthetic organisms, 
including many types of photosynthetic bacteria. 

The most advanced photosynthetic bacteria are the cyanobacteria, which 
have minimal nutrient requirements. They use electrons from water and the 
energy of sunlight to convert atmospheric CO2 into organic compounds — a 
process called carbon fixation. In the course of splitting water [in the overall 
reaction nH 2 + nC0 2 ^% (CH 2 0) w + n0 2 ] t they also liberate into the atmo- 
sphere the oxygen required for oxidative phosphorylation. As we see in this sec- 
tion, it is thought that the evolution of cyanobacteria from more primitive pho- 
tosynthetic bacteria eventually made possible the development of abundant 
aerobic life forms. 

In plants and algae, which developed much later, photosynthesis occurs in a 
specialized intracellular organelle — the chloroplast. Chloroplasts perform pho- 
tosynthesis during the daylight hours. Photosynthetic cells use the immediate 
products of photosynthesis, NADPH and ATP, to produce many organic 
molecules. In plants, the products include a low- molecular- weight sugar (usually 
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sucrose) that these cells export to meet the metabolic needs of the many non- 
photosynthetic cells of the organism. 

Biochemical and genetic evidence strongly suggests that chloroplasts are 
descended from oxygen-producing photosynthetic bacteria that were endocy- 
tosed and lived in symbiosis with primitive eucaryotic cells. Mitochondria are 
also generally believed to be descended from an endocytosed bacterium. The 
many differences between chloroplasts and mitochondria presumably reflect 
their different bacterial ancestors, as well as their subsequent evolutionary 
divergence. Nevertheless, the fundamental mechanisms involved in light-driven 
ATP synthesis in chloroplasts are similar to those that we have already discussed 
for respiration-driven ATP synthesis in mitochondria. 



The Chloroplast Is One Member of the Plastid Family of 
Organelles 

Chloroplasts are the most prominent members of the plastid family of 
organelles. Plastids are present in all living plant cells, each cell type having its 
own characteristic complement. All plastids share certain features. Most 
notably, all plastids in a particular plant species contain multiple copies of the 
same relatively small genome. In addition, an envelope composed of two con- 
centric membranes encloses each plastid. 

As discussed in Chapter 12 (see Figure 12-3), all plastids develop from pro- 
plastids, small organelles in the immature cells of plant meristems (Figure 
14-34A). Proplastids develop according to the requirements of each differenti- 
ated cell, and the type that is present is determined in large part by the nuclear 
genome. If a leaf is grown in darkness, its proplastids enlarge and develop into 
etioplasts, which have a semicrystalline array of internal membranes containing 
a yellow chlorophyll precursor instead of chlorophyll. When exposed to light, the 
etioplasts rapidly develop into chloroplasts by converting this precursor to 
chlorophyll and by synthesizing new membrane pigments, photosynthetic 
enzymes, and components of the electron-transport chain. 

Leucoplasts are plastids present in many epidermal and internal tissues that 
do not become green and photosynthetic. They are little more than enlarged 
proplastids. A common form of leucoplast is the amyloplast (Figure 14-34B), 
which accumulates the polysaccharide starch in storage tissues — a source of 
sugar for future use. In some plants, such as potatoes, the amyloplasts can grow 
to be as large as an average animal cell. 
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Figure 14-34 Plastid diversity. (A) A proplastid from a root tip cell of a bean plant. Note the double 
membrane; the inner membrane has also generated the relatively sparse internal membranes present. 
(B) Three amyloplasts (a form of leucoplast), or starch-storing plastids, in a root tip cell of soybean. (From 
B. Gunning and M. Steer, Plant Cell Biology: Structure and Function. Sudbury MA: Jones & Bartlett, 1996.) 
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Plastids are not just sites for photosynthesis and the deposition of storage 
materials. Plants have also used their plastids to compartmentalize their inter- 
mediary metabolism. Purine and pyrimidine synthesis, most amino acid syn- 
thesis, and all of the fatty acid synthesis of plants takes place in the plastids, 
whereas in animal cells these compounds are produced in the cytosol. 



0.5 jum 

Figure 14-35 Electron micrographs of 
chloroplasts. (A) In a wheat leaf cell, a 
thin rim of cytoplasm — containing 
chloroplasts, the nucleus, and 
mitochondria — surrounds a large 
vacuole. (B) A thin section of a single 
chloroplast, showing the chloroplast 
envelope, starch granules, and lipid (fat) 
droplets that have accumulated in the 
stroma as a result of the biosyntheses 
occurring there. (C) A high-magnification 
view of two grana. A granum is a stack of 
thylakoids. (Courtesy of K. Plaskitt.) 



Chloroplasts Resemble Mitochondria But Have an Extra 
Compartment 

Chloroplasts use chemiosmotic mechanisms to carry out their energy intercon- 
versions in much the same way that mitochondria do. Although much larger 
(Figure 14-35A), they are organized on the same principles. They have a highly 
permeable outer membrane; a much less permeable inner membrane, in which 
membrane transport proteins are embedded; and a narrow intermembrane 
space in between. Together, these membranes form the chloroplast envelope 
(Figure 14-35B,C). The inner membrane surrounds a large space called the 
stroma, which is analogous to the mitochondrial matrix and contains many 
metabolic enzymes. Like the mitochondrion, the chloroplast has its own 
genome and genetic system. The stroma therefore also contains a special set of 
ribosomes, RNAs, and the chloroplast DNA. 

There is, however, an important difference between the organization of 
mitochondria and that of chloroplasts. The inner membrane of the chloroplast 
is not folded into cristae and does not contain electron-transport chains. 
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Instead, the electron-transport chains, photosynthetic light-capturing systems, 
and ATP synthase are all contained in the thylakoid membrane, a third distinct 
membrane that forms a set of flattened disclike sacs, the thylakoids (Figure 
14-36). The lumen of each thylakoid is thought to be connected with the lumen 
of other thylakoids, thereby defining a third internal compartment called the 
thylakoid space, which is separated by the thylakoid membrane from the stroma 
that surrounds it. Thylakoid membranes interact with each other to form 
numerous local stacks called grana. 

Figure 14-37 highlights the structural similarities and differences between 
mitochondria and chloroplasts. An important difference is that the head of the 
ATP synthase, where the ATP is made, protrudes into the stroma from the thy- 
lakoid membrane in a chloroplast, whereas it protrudes into the matrix from the 
inner mitochondrial membrane in a mitochondrion. 



Chloroplasts Capture Energy from Sunlight and Use It to Fix 
Carbon 

We can group the many reactions that occur during photosynthesis in plants 

into two broad categories: 
1. In the photosynthetic electron-transfer reactions (also called the 'light 
reactions") , energy derived from sunlight energizes an electron in the 
green organic pigment chlorophyll enabling the electron to move along an 
electron-transport chain in the thylakoid membrane in much the same 
way that an electron moves along the respiratory chain in mitochondria. 



Figure 14-36 The chloroplast. This 

photosynthetic organelle contains three 
distinct membranes (the outer 
membrane, the inner membrane, and the 
thylakoid membrane) that define three 
separate internal compartments (the 
intermembrane space, the stroma, and 
the thylakoid space). The thylakoid 
membrane contains all the energy- 
generating systems of the chloroplast, 
including its chlorophyll. In electron 
micrographs, this membrane seems to be 
broken up into separate units that 
enclose individual flattened vesicles (see 
Figure 14-35), but these are probably 
joined into a single, highly folded 
membrane in each chloroplast. As 
indicated, the individual thylakoids are 
interconnected, and they tend to stack to 
form grana. 
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Figure 14-37 A mitochondrion and 
chloroplast compared. A chloroplast is 
generally much larger than a 
mitochondrion and contains, in addition 
to an outer and inner membrane, a 
thylakoid membrane enclosing a 
thylakoid space. Unlike the chloroplast 
inner membrane, the inner mitochondrial 
membrane is folded into cristae to 
increase its surface area. 
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The chlorophyll obtains its electrons from water (H 2 0), producing O2 as a 
by-product. During the electron-transport process, H + is pumped across 
the thylakoid membrane, and the resulting electrochemical proton gradi- 
ent drives the synthesis of ATP in the stroma. As the final step in this series 
of reactions, high-energy electrons are loaded (together with H + ) onto 
NADP+, converting it to NADPH. All of these reactions are confined to the 
chloroplast. 
2. In the carbon-fixation reactions (also called the "dark reactions"), the ATP 
and the NADPH produced by the photosynthetic electron-transfer reac- 
tions serve as the source of energy and reducing power, respectively, to 
drive the conversion of C0 2 to carbohydrate. The carbon-fixation reac- 
tions, which begin in the chloroplast stroma and continue in the cytosol, 
produce sucrose and many other organic molecules in the leaves of the 
plant. The sucrose is exported to other tissues as a source of both organic 
molecules and energy for growth. 

Thus, the formation of ATR NADPH, and 2 (which requires light energy 
directly) and the conversion of C0 2 to carbohydrate (which requires light energy 
only indirectly) are separate processes (Figure 14-38), although elaborate feed- 
back mechanisms interconnect the two. Several of the chloroplast enzymes 
required for carbon fixation, for example, are inactivated in the dark and reacti- 
vated by light-stimulated electron-transport processes. 



Carbon Fixation Is Catalyzed by Ribulose Bisphosphate 
Carboxylase 

We have seen earlier in this chapter how cells produce ATP by using the large 
amount of free energy released when carbohydrates are oxidized to C0 2 and 
H 2 0. Clearly, therefore, the reverse reaction, in which C0 2 and H 2 combine to 
make carbohydrate, must be a very unfavorable one that can occur only if it is 
coupled to other, very favorable reactions that drive it. 

Figure 14-39 illustrates the central reaction of carbon fixation, in which an 
atom of inorganic carbon is converted to organic carbon: C0 2 from the atmo- 
sphere combines with the five-carbon compound ribulose 1,5-bisphosphate 
plus water to yield two molecules of the three-carbon compound 3-phospho- 
glycerate. This "carbon-fixing" reaction, which was discovered in 1948, is cat- 
alyzed in the chloroplast stroma by a large enzyme called ribulose bisphosphate 
carboxylase. Since each molecule of the complex works sluggishly (processing 
only about 3 molecules of substrate per second compared to 1000 molecules per 
second for a typical enzyme), the reaction requires an unusually large number of 
enzyme molecules. Ribulose bisphosphate carboxylase often constitutes more 
than 50% of the total chloroplast protein, and it is thought to be the most abun- 
dant protein on Earth. 
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Figure 14-38 The reactions of 
photosynthesis in a chloroplast. Water is 
oxidized and oxygen is released in the 
photosynthetic electron-transfer 
reactions, while carbon dioxide is 
assimilated (fixed) to produce sugars and 
a variety of other organic molecules in 
the carbon-fixation reactions. 
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Figure 14-39 The initial reaction in 
carbon fixation. This reaction, in which 
carbon dioxide is converted into organic 
carbon, is catalyzed in the chloroplast 
stroma by the abundant enzyme ribulose 
bisphosphate carboxylase. The product is 
3-phosphoglycerate, which is also an 
intermediate in glycolysis. The two 
carbon atoms shaded in blue are used to 
produce phosphoglycolate when the 
same enzyme adds oxygen instead of 
C0 2 (see text). 
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Each CO2 Molecule That Is Fixed Consumes Three Molecules of 
ATP and Two Molecules of NADPH 

The actual reaction in which CO2 is fixed is energetically favorable because of 
the reactivity of the energy-rich compound ribulose 1,5-bisphosphate, to which 
each molecule of CO2 is added (see Figure 14-39). The elaborate metabolic path- 
way that produces ribulose 1,5-bisphosphate requires both NADPH and ATP; it 
was worked out in one of the first successful applications of radioisotopes as 
tracers in biochemistry. This carbon-fixation cycle (also called the Calvin cycle) 
is outlined in Figure 14-40. It starts when 3 molecules of CO2 are fixed by ribu- 
lose bisphosphate carboxylase to produce 6 molecules of 3-phosphoglycerate 
(containing 6x3 = 18 carbon atoms in all: 3 from the CO2 and 15 from ribulose 
1,5-bisphosphate). The 18 carbon atoms then undergo a cycle of reactions that 
regenerates the 3 molecules of ribulose 1,5-bisphosphate used in the initial car- 
bon-fixation step (containing 3x5=15 carbon atoms). This leaves 1 molecule of 
glyceraldehyde 3-phosphate (3 carbon atoms) as the net gain. 

Each CO2 molecule converted into carbohydrate consumes a total of 3 
molecules of ATP and 2 molecules of NADPH. The net equation is: 

3C0 2 + 9ATP + 6NADPH + water -> 
glyceraldehyde 3-phosphate + 8Pi + 9ADP + 6NADP + 
Thus, the formation of organic molecules from CO2 and H2O requires both 
phosphate-bond energy (as ATP) and reducing power (as NADPH). We return to 
this important point later. 

The glyceraldehyde 3-phosphate produced in chloroplasts by the carbon- 
fixation cycle is a three-carbon sugar that also serves as a central intermediate in 
glycolysis. Much of it is exported to the cytosol, where it can be converted into 
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Figure 14-40 The carbon-fixation cycle, 
which forms organic molecules from 
C0 2 and H 2 0. The number of carbon 
atoms in each type of molecule is 
indicated in the white box. There are 
many intermediates between 
glyceraldehyde 3-phosphate and ribulose 
5-phosphate, but they have been 
omitted here for clarity. The entry of 
water into the cycle is also not shown. 
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fructose 6-phosphate and glucose 1 -phosphate by the reversal of several reac- 
tions in glycolysis (see Panel 2-8, pp. 120-121). The glucose 1-phosphate is then 
converted to the sugar nucleotide UDP-glucose, and this combines with the 
fructose 6-phosphate to form sucrose phosphate, the immediate precursor of 
the disaccharide sucrose. Sucrose is the major form in which sugar is trans- 
ported between plant cells: just as glucose is transported in the blood of animals, 
sucrose is exported from the leaves via vascular bundles, providing the carbohy- 
drate required by the rest of the plant. 

Most of the glyceraldehyde 3-phosphate that remains in the chloroplast is 
converted to starch in the stroma. Like glycogen in animal cells, starch is a large 
polymer of glucose that serves as a carbohydrate reserve (see Figure 14-34B). 
The production of starch is regulated so that it is produced and stored as large 
grains in the chloroplast stroma during periods of excess photosynthetic capac- 
ity. This occurs through reactions in the stroma that are the reverse of those in 
glycolysis: they convert glyceraldehyde 3-phosphate to glucose 1-phosphate, 
which is then used to produce the sugar nucleotide ADP-glucose, the immediate 
precursor of starch. At night the plant breaks down the starch to help support the 
metabolic needs of the plant. Starch provides an important part of the diet of all 
animals that eat plants. 

Carbon Fixation in Some Plants Is Compartmentalized to 
Facilitate Growth at Low C0 2 Concentrations 

Although ribulose bisphosphate carboxylase preferentially adds C0 2 to ribulose 
1,5-bisphosphate, it can use 2 as a substrate in place of C0 2 , and if the con- 
centration of C0 2 is low, it will add 2 to ribulose 1,5-bisphosphate instead (see 
Figure 14-39). This is the first step in a pathway called photorespiration, whose 
ultimate effect is to use up 2 and liberate C0 2 without the production of useful 
energy stores. In many plants, about one-third of the C0 2 fixed is lost again as 
C0 2 because of photorespiration. 

Photorespiration can be a serious liability for plants in hot, dry conditions, 
which cause them to close their stomata (the gas exchange pores in their leaves, 
each of which is called a stoma) to avoid excessive water loss. This in turn causes 
the C0 2 levels in the leaf to fall precipitously, thereby favoring photorespiration. 
A special adaptation, however, occurs in the leaves of many plants, such as corn 
and sugar cane, that grow in hot, dry environments. In these plants, the carbon- 
fixation cycle occurs only in the chloroplasts of specialized bundle-sheath cells, 
which contain all of the plant's ribulose bisphosphate carboxylase. These cells 
are protected from the air and are surrounded by a specialized layer of meso- 
phyll cells that use the energy harvested by their chloroplasts to "pump" C0 2 into 
the bundle-sheath cells. This supplies the ribulose bisphosphate carboxylase 
with a high concentration of C0 2 , thereby greatly reducing photorespiration. 

The C0 2 pump is produced by a reaction cycle that begins in the cytosol of 
the mesophyll cells. A C0 2 -fixation step is catalyzed by an enzyme that binds 
carbon dioxide (as bicarbonate) and combines it with an activated three-car- 
bon molecule (phosphoenol-pyruvate) to produce a four-carbon molecule. 
The four-carbon molecule diffuses into the bundle-sheath cells, where it is 
broken down to release the C0 2 and generate a molecule with three carbons. 
The pumping cycle is completed when this three-carbon molecule is returned 
to the mesophyll cells and converted back to its original activated form. 
Because the C0 2 is initially captured by converting it into a compound con- 
taining four carbons, the C0 2 -pumping plants are called C 4 plants. All other 
plants are called C 3 plants because they capture C0 2 into the three-carbon 
compound 3-phosphoglycerate (Figure 14-41). 

As with any vectorial transport process, pumping C0 2 into the bundle- 
sheath cells in C 4 plants costs energy (ATP is hydrolyzed; see Figure 14-41B). In 
hot, dry environments, however, this cost can be much less than the energy lost 
by photorespiration in C 3 plants, so C 4 plants have a potential advantage. More- 
over, because C 4 plants can perform photosynthesis at a lower concentration of 
C0 2 inside the leaf, they need to open their stomata less often and therefore can 
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fix about twice as much net carbon as C 3 plants per unit of water lost. This type 
of carbon fixation has evolved independently in several different plant lineages. 
Although the vast majority of plant species are C 3 plants, C 4 plants such as corn 
and sugar cane are much more effective at converting sunlight energy into 
biomass than C 3 plants such as cereal grains. They are therefore of special 
importance in world agriculture. 



Figure 14-41 The C0 2 pumping in C 4 
plants. (A) Comparative leaf anatomy in a 
C 3 plant and a C 4 plant. The cells with 
green cytosol in the leaf interior contain 
chloroplasts that perform the normal 
carbon-fixation cycle. In C4 plants, the 
mesophyll cells are specialized for CO2 
pumping rather than for carbon fixation, 
and they thereby create a high ratio of 
CO2 to O2 in the bundle-sheath cells, 
which are the only cells in these plants 
where the carbon-fixation cycle occurs. 
The vascular bundles carry the sucrose 
made in the leaf to other tissues. (B) How 
carbon dioxide is concentrated in 
bundle-sheath cells by the harnessing of 
ATP energy in mesophyll cells. 



Photosynthesis Depends on the Photochemistry of Chlorophyll 
Molecules 

Having discussed the carbon-fixation reactions, we now return to the question 
of how the photosynthetic electron-transfer reactions in the chloroplast gener- 
ate the ATP and the NADPH needed to drive the production of carbohydrates 
from C0 2 and H 2 0. Sunlight absorbed by chlorophyll molecules supplies the 
required energy (Figure 14-42). The process of energy conversion begins when 
a quantum of light (a photon) excites a chlorophyll molecule, causing an elec- 
tron in the chlorophyll to move from one molecular orbital to another of higher 
energy. Such an excited molecule is unstable and tends to return quickly to its 
original, unexcited state. This can happen in one of three ways: 

1. By converting the extra energy into heat (molecular motions) or to some 
combination of heat and light of a longer wavelength (fluorescence) ; this is 
what happens when an isolated chlorophyll molecule in solution absorbs 
light energy. 

2. By transferring the energy— but not the electron— directly to a neighboring 
chlorophyll molecule by a process called resonance energy transfer. 

3. By transferring the negatively charged high-energy electron to another 
nearby molecule, an electron acceptor, after which the positively charged 
chlorophyll returns to its original state by taking up a low- energy electron 
from some other molecule, an electron donor. 
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In the process of photosynthesis, the last two mechanisms are greatly facili- 
tated by two different protein complexes: resonance energy transfer by an 
antenna complex and high-energy electron transfer by a photochemical reaction 
center. These two types of protein complexes, acting in concert, make most of 
the life on Earth possible. We shall now describe how they work. 



A Photochemical Reaction Center Plus an Antenna Complex Form 
a Photosystem 

Large multiprotein complexes called photosystems catalyze the conversion of 
the light energy captured in excited chlorophyll molecules to useful forms. A 
photosystem consists of two closely linked components: an antenna complex, 
consisting of proteins bound to a large set of pigment molecules that capture 
light energy and feed it to the reaction center; and a photochemical reaction 
center, consisting of a complex of proteins and chlorophyll molecules that 
enable light energy to be converted into chemical energy. 

The antenna complex is important for capturing light energy. In chloro- 
plasts it consists of a number of distinct membrane protein complexes (known 
as light-harvesting complexes); together, these proteins bind several hundred 
chlorophyll molecules per reaction center, orienting them precisely in the thy- 
lakoid membrane. The antenna complex also contains accessory pigments 
called carotenoids, which protect the chlorophylls from oxidation and can help 
collect light of other wavelengths. When light excites a chlorophyll molecule in 
the antenna complex, the energy is rapidly transferred from one molecule to 
another by resonance energy transfer until it reaches a special pair of chloro- 
phyll molecules in the photochemical reaction center. Each antenna complex 
thereby acts as a funnel, collecting light energy and directing it to a specific site 
where it can be used effectively (Figure 14-43). 

The photochemical reaction center is a transmembrane protein-pigment 
complex that lies at the heart of photosynthesis. It is thought to have evolved 
more than 3 billion years ago in primitive photosynthetic bacteria. The special 
pair of chlorophyll molecules in the reaction center acts as an irreversible trap for 
excitation quanta because the excited electron is immediately passed to a neigh- 
boring chain of electron acceptors in the protein complex (Figure 14-44). By 
moving the high-energy electron rapidly away from the chlorophylls, the photo- 
chemical reaction center transfers it to an environment where it is much more 
stable. The electron is thereby suitably positioned for subsequent reactions. 
These require more time to complete, and they result in the production of light- 
generated high-energy electrons that are fed into electron transport chains. 
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Figure 14-42 The structure of chlorophyll. 

A magnesium atom is held in a porphyrin 
ring, which is related to the porphyrin ring 
that binds iron in heme (see Figure 14-22). 
Electrons are delocalized over the bonds 
shown in blue. 
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Figure 14-43 The antenna complex and 
photochemical reaction center in a 
photosystem. <GCAC> The antenna 
complex is a collector of light energy in the 
form of excited electrons. The energy of the 
excited electrons is funneled, through a 
series of resonance energy transfers, to a 
special pair of chlorophyll molecules in the 
photochemical reaction center. The 
reaction center then produces a high- 
energy electron that can be passed rapidly 
to the electron-transport chain in the 
thylakoid membrane, via a quinone. 
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In a Reaction Center, Light Energy Captured by Chlorophyll 
Creates a Strong Electron Donor from a Weak One 

The electron transfers involved in the photochemical reactions just described 
have been analyzed extensively by rapid spectroscopic methods. Figure 14-45 
illustrates, in a general way, how light provides the energy needed to transfer 
an electron from a weak electron donor (a molecule with a strong affinity for 
electrons) to a molecule that is a strong electron donor in its reduced form (a 



Figure 1 4-44 The arrangement of the 
electron carriers in the photochemical 
reaction center of a purple bacterium. 

<ATCA> The pigment molecules shown 
are held in the interior of a 
transmembrane protein and are 
surrounded by the lipid bilayerof the 
bacterial plasma membrane. An electron 
in the special pair of chlorophyll 
molecules is excited by resonance from 
an antenna complex chlorophyll, and the 
excited electron is then transferred 
stepwise from the special pair to the 
quinone (see also Figure 14-45). A similar 
arrangement of electron carriers is 
present in the reaction centers of plants 
(see Figure 14-47). 
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Figure 14-45 How light energy is harvested by a reaction center chlorophyll molecule. (A) The initial events in a reaction 
center create a charge separation. A pigment-protein complex holds a chlorophyll molecule of the special pair (blue) 
precisely positioned so that both a potential low-energy electron donor (orange) and a potential high-energy electron 
acceptor (green) are immediately available. When light energizes an electron in the chlorophyll molecule (red electron), the 
excited electron is immediately passed to the electron acceptor and is thereby partially stabilized. The positively charged 
chlorophyll molecule then quickly attracts the low-energy electron from the electron donor and returns to its resting state, 
creating a larger charge separation that further stabilizes the high-energy electron. These reactions require less than 10" 6 
second to complete. (B) In the final stage of this process, which follows the steps in (A), the photosynthetic reaction center 
is restored to its original resting state by acquiring a new low-energy electron and then transferring the high-energy 
electron derived from chlorophyll to an electron transport chain in the membrane. As will be discussed subsequently, the 
ultimate source of low-energy electrons for photosystem II in the chloroplast is water; as a result, light produces 
high-energy electrons in the thylakoid membrane from low-energy electrons in water. 
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molecule with a weak affinity for electrons). The special pair of chlorophyll 
molecules in the reaction center is poised to pass each excited electron to a pre- 
cisely positioned neighboring molecule in the same protein complex (an elec- 
tron acceptor). The chlorophyll molecule that loses an electron becomes posi- 
tively charged, but it rapidly regains an electron from an adjacent electron donor 
to return to its unexcited, uncharged state (Figure 14-45A, orange electron). 
Then, in slower reactions, the electron donor has its missing electron replaced, 
and the high-energy electron that was generated by the excited chlorophyll is 
transferred to the electron-transport chain (Figure 14-45B). The excitation 
energy in chlorophyll that would normally be released as fluorescence or heat is 
thereby used instead to create a strong electron donor (a molecule carrying a 
high-energy electron) where none had been before. 

The photosystem of purple bacteria is somewhat simpler than the evolu- 
tionarily related photosystems in chloroplasts, and it has served as a good model 
for working out reaction details. The reaction center in this photosystem is a 
large protein-pigment complex that can be solubilized with detergent and puri- 
fied in active form. In a major triumph of structure analysis, its complete three- 
dimensional structure was determined by x-ray crystallography (see Figure 
10-34). This structure, combined with kinetic data, provides the best picture we 
have of the initial electron-transfer reactions that occur during photosynthesis. 
Figure 14-46 shows the actual sequence of electron transfers that take place, for 
comparison with Figure 14-45A. 

In the purple bacterium, the electron used to fill the electron-deficient hole 
created by the light-induced charge separation comes from a cyclic flow of elec- 
trons transferred through a cytochrome (see orange box in Figure 14-45); the 
strong electron donor produced is a quinone. One of the two photosytems in the 
chloroplasts of higher plants likewise produces a quinone carrying high-energy 
electrons. However, as we discuss next, because water provides the electrons for 
this photosystem, photosynthesis in plants— unlike that in purple bacteria- 
releases large quantities of oxygen gas. 



Noncyclic Photophosphorylation Produces Both NADPH and ATP 

Photosynthesis in plants and cyanobacteria produces both ATP and NADPH 
directly by a two-step process called noncyclic photophosphorylation. Because 
two photosystems — called photosystems I and II — work in series to energize an 
electron to a high-enough energy state, the electron can be transferred all the 
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Figure 1 4-46 The electron transfers that occur in the photochemical reaction center of 
a purple bacterium. <ATCA> A similar set of reactions occurs in the evolutionary related 
photosystem II in plants. At the top left is an orientating diagram showing the molecules 
that carry electrons, which are those in Figure 14-45, plus an exchangeable quinone (Q B ) 
and a freely mobile quinone (Q) dissolved in the lipid bilayer. Electron carriers 1-5 are each 
bound in a specific position on a 596-amino-acid transmembrane protein formed from 
two separate subunits (see Figure 1 0-34). After excitation by a photon of light, a high- 
energy electron passes from pigment molecule to pigment molecule, very rapidly creating 
a stable charge separation, as shown in the sequence of steps A-C in which the pigment 
molecule carrying a high-energy electron is colored red. Steps D and E then occur 
progressively. After a second photon has repeated this sequence with a second electron, 
the exchangeable quinone is released into the bilayer carrying two high-energy electrons. 
This quinone quickly loses its charge by picking up two protons (see Figure 14-24). 
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way from water to NADPH. As the high-energy electrons pass through the cou- 
pled photosystems to generate NADPH, some of their energy is siphoned off for 
ATP synthesis. 

The first of the two photosystems — paradoxically called photosystem II for 
historical reasons — has the unique ability to withdraw electrons from water. The 
oxygens of two water molecules bind to a cluster of four manganese atoms on the 
lumenal surface of the photosystem II reaction center complex (Figure 14-47). 
This cluster enables electrons to be removed one at a time from the water, as 
required to fill the electron-deficient holes created by light in chlorophyll 
molecules in the reaction center. As soon as four electrons have been removed 
from the two water molecules (requiring four quanta of light), O2 is released. Pho- 
tosystem II thus catalyzes the reaction 2H 2 + 4 photons -> 4H + + 4e~ + O2. As we 
discussed for the electron-transport chain in mitochondria, which uses O2 and 
produces water, the mechanism ensures that no partly oxidized water molecules 
are released as dangerous, highly reactive oxygen radicals. Essentially all the oxy- 
gen in the Earth's atmosphere has been produced in this way 

The core of the reaction center in photosystem II is homologous to the bac- 
terial reaction center just described, and it likewise produces strong electron 
donors in the form of reduced quinone molecules dissolved in the lipid bilayer 
of the membrane. The quinones pass their electrons to a H + pump called the 
cytochrome be-f complex, which resembles the cytochrome b-c\ complex in the 
respiratory chain of mitochondria. The cytochrome ^-/complex pumps H + into 
the thylakoid space across the thylakoid membrane (or out of the cytosol across 
the plasma membrane in cyanobacteria), and the resulting electrochemical gra- 
dient drives the synthesis of ATP by an ATP synthase (Figure 14-48). 

The final electron acceptor in this electron-transport chain is the second 
photosystem, photosystem 7, which accepts an electron into the electron- defi- 
cient hole created by light in the chlorophyll molecule in its reaction center. 
Each electron that enters photosystem I is finally boosted to a very high-energy 



Figure 14-47 The structure of 
photosystem II in plants and 
cyanobacteria. The structure shown is a 
dimer, organized around a two fold axis 
[red dotted arrows). Each monomer is 
composed of 16 integral membrane 
protein subunits plus three subunits in 
the lumen, with a total of 36 bound 
chlorophylls, 7 carotenoids, two 
pheophytins, two hemes, two 
plastoquinones, and one manganese 
cluster in an oxygen-evolving water- 
splitting center. (A) The complete three- 
dimensional structure of the dimer. 
(B) Schematic of the dimer with a few 
central features indicated. (C) A monomer 
drawn to show only the non-protein 
molecules in the structure, thereby 
highlighting the protein-bound pigments 
and electron carriers; green structures are 
chlorophylls. (Adapted from K.N. Ferreira 
et a\. f Science 303:1831-1838, 2004. With 
permission from AAAS.) 
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Figure 14-48 Electron flow during photosynthesis in the thylakoid membrane. The mobile electron carriers in the 
chain are plastoquinone (which closely resembles the ubiquinone of mitochondria), plastocyanin (a small copper- 
containing protein), and ferredoxin (a small protein containing an iron-sulfur center). The cytochrome fc> 6 -f complex 
resembles the b-cy complex of mitochondria and the b-c complex of bacteria (see Figure 14-73): all three complexes 
accept electrons from quinones and pump H + across the membrane. The H + released by water oxidation into the 
thylakoid space, and the H + consumed during NADPH formation in the stroma, also contribute to the generation of 
the electrochemical H + gradient. As illustrated, this gradient drives ATP synthesis by an ATP synthase present in this 
same membrane. 

level that allows it to be passed to the iron-sulfur center in ferredoxin and then 
to NADP+ to generate NADPH. 

The scheme for photosynthesis just discussed is known as the Z scheme. By 
means of its two electron-energizing steps, one catalyzed by each photosystem, 
an electron is passed from water, which normally holds on to its electrons very 
tightly (redox potential = +820 mV), to NADPH, which normally holds on to its 
electrons loosely (redox potential = -320 mV) (Figure 14-49). 
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Figure 14-49 Changes in redox 
potential during photosynthesis. The 

redox potential for each molecule is 
indicated by its position along the 
vertical axis. Note that photosystem II 
passes electrons derived from water to 
photosystem I. The net electron flow 
through the two photosystems in series 
is from water to NADP + , and it produces 
NADPH as well as ATP. 
The ATP is synthesized by an ATP 
synthase that harnesses the 
electrochemical proton gradient 
produced by the three sites of H + activity 
that are highlighted in Figure 14-48. This 
Z scheme for ATP production is called 
noncyclic photophosphorylation, to 
distinguish it from a cyclic scheme that 
utilizes only photosystem I (see the text). 
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A single quantum of visible light does not have enough energy to move an 
electron all the way from the bottom of photosystem II to the top of photosys- 
tem I, which is presumably the energy change required to pass an electron effi- 
ciently from water to NADP + . The use of two separate photosystems in series 
makes the energy from two quanta of light available for this purpose. In addi- 
tion, there is enough energy left over to enable the electron-transport chain that 
links the two photosystems to pump H + across the thylakoid membrane (or the 
plasma membrane of cyanobacteria), so that the ATP synthase can harness 
some of the light-derived energy for ATP production. 

Chloroplasts Can Make ATP by Cyclic Photophosphorylation 
Without Making NADPH 

In the noncyclic photophosphorylation scheme just discussed, high-energy 
electrons leaving photosystem II are harnessed to generate ATP and are passed 
on to photosystem I to drive the production of NADPH. This produces slightly 
more than 1 molecule of ATP for every pair of electrons that passes from H2O to 
NADP + to generate a molecule of NADPH. But carbon fixation requires 1.5 
molecules of ATP per NADPH molecule (see Figure 14-40). To produce extra ATP, 
chloroplasts can switch photosystem I into a cyclic mode so that it produces ATP 
instead of NADPH. In this process, called cyclic photophosphorylation, the high- 
energy electrons from photosystem I are transferred to the cytochrome b&-f 
complex rather than being passed on to NADP + . From the b 6 -/complex, the elec- 
trons are passed back to photosystem I at a low energy. The only net result, 
besides the conversion of some light energy to heat, is that H + is pumped across 
the thylakoid membrane by the ^-/complex as electrons pass through it, 
thereby increasing the electrochemical proton gradient that drives the ATP syn- 
thase. (This is analogous to the cyclic process that occurs in purple nonsulfur 
bacteria in Figure 14-73, below.) 

To summarize, cyclic photophosphorylation involves only photosystem I, 
and it produces ATP without the formation of either NADPH or 2 . The relative 
activities of cyclic and noncyclic electron flows are regulated by the cell to deter- 
mine how much light energy is converted into reducing power (NADPH) and 
how much into high-energy phosphate bonds (ATP). 

Photosystems I and II Have Related Structures, and Also 
Resemble Bacterial Photosystems 

The mechanisms of fundamental cell processes such as DNA replication or res- 
piration generally turn out to be the same in eucaryotic cells and in bacteria, 
even though the number of protein components involved is considerably 
greater in eucaryotes. Eucaryotes evolved from procaryotes, and the additional 
proteins were presumably selected for during evolution because they provided 
an extra degree of efficiency, regulation that was useful to the cell, or both. 

Photosystems provide a clear example of this type of evolution. The atomic 
structures of both eucaryotic photosystems have been revealed by a combina- 
tion of electron and x-ray crystallography, and the close relationship of photo- 
system I, photosystem II, and the photochemical reaction center of purple bac- 
teria has been clearly demonstrated from such analyses (Figure 14-50). 

The Proton-Motive Force Is the Same in Mitochondria and 
Chloroplasts 

The presence of the thylakoid space separates a chloroplast into three rather 
than the two internal compartments of a mitochondrion. The net effect of H + 
translocation in the two organelles is, however, similar. As illustrated in Figure 
14-51, in chloroplasts exposed to light, H + is pumped out of the stroma (pH 7.5) 
into the thylakoid space (pH -4.5), creating a gradient of 3 pH units. This repre- 
sents a proton-motive force of about 180 mV across the thylakoid membrane, 
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and it drives ATP synthesis by the ATP synthase embedded in this membrane. 
The force is nearly the same as that across the inner mitochondrial membrane, 
but nearly all of it is contributed by the pH gradient rather than by a membrane 
potential, unlike the case in mitochondria. 

For both mitochondria and chloroplasts, the catalytic site of the ATP syn- 
thase is at a pH of about 7.5 and is located in a large organelle compartment 
(matrix or stroma) that is packed full of soluble enzymes. It is here that all of the 
organelle's ATP is made (see Figure 14-51). 



Carrier Proteins in the Chloroplast Inner Membrane Control 
Metabolite Exchange with the Cytosol 

If chloroplasts are isolated in a way that leaves their inner membrane intact, this 
membrane can be shown to have a selective permeability, reflecting the pres- 
ence of specific transporters. Most notably, much of the glyceraldehyde 3-phos- 
phate produced by C0 2 fixation in the chloroplast stroma is transported out of 
the chloroplast by an efficient antiport system that exchanges three-carbon 
sugar phosphates for an inward flux of inorganic phosphate. 

Glyceraldehyde 3-phosphate normally provides the cytosol with an abundant 
source of carbohydrate, which the cell uses as the starting point for many other 
biosyntheses— including the production of sucrose for export. But this is not all 
that this molecule provides. Once the glyceraldehyde 3-phosphate reaches the 
cytosol, it is readily converted (by part of the glycolytic pathway) to 1,3-phospho- 
glycerate and then 3-phosphoglycerate (see p. 92), generating one molecule of 
ATP and one of NADH. (A similar two-step reaction, but working in reverse, forms 
glyceraldehyde 3-phosphate in the carbon-fixation cycle; see Figure 14-40.) As a 
result, the export of glyceraldehyde 3-phosphate from the chloroplast provides 
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Figure 14-50 Three types of 
photosynthetic reaction centers 
compared. Pigments involved in light 
harvesting are colored green; those 
involved in the central photochemical 
events are colored red. (A) The 
photochemical reaction center of purple 
bacteria, whose detailed structure is 
illustrated in Figure 10-34, contains two 
related protein subunits, L and M, that 
bind the pigments involved in the central 
process illustrated in Figure 14-46. Low- 
energy electrons are fed into the excited 
chlorophylls by a cytochrome. LH1 is a 
protein-pigment complex involved in light 
harvesting. (B) Photosystem II contains the 
Di and D2 proteins, which are homologous 
to the L and M subunits in (A). Low-energy 
electrons from water are fed into the 
excited chlorophylls by a manganese 
cluster. LHCII is a light-harvesting complex 
that feeds energy into the core antenna 
proteins (see Figure 14-47). 
(C) Photosystem I contains the Psa A and 
Psa B proteins, each of which is equivalent 
to a fusion of the Di or D 2 protein to a core 
antenna protein of photosystem II. 
Low-energy electrons are fed into the 
excited chlorophylls by loosely bound 
plastocyanin (pC). As indicated, in 
photosystem I, high-energy electrons are 
passed from a nonmobile quinone (Q) 
through a series of three iron-sulfur 
centers (red circles). (Modified from 
K. Rhee, E. Morris, J. Barber and 
W. Kuhlbrandt, Nature 396:283-286, 1998; 
W. Kuhlbrandt, Nature 41 1 :896-899, 2001 . 
With permission from Macmillan 
Publishers Ltd.) 
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Figure 14-51 A comparison of the flow of H + and the orientation of the 
ATP synthase in mitochondria and chloroplasts. Those compartments 
with similar pH values have been colored the same. The proton-motive 
force across the thylakoid membrane consists almost entirely of the pH 
gradient; a high permeability of this membrane to Mg 2+ and CI" ions allows 
the flow of these ions to dissipate most of the membrane potential. 
Mitochondria presumably need a large membrane potential because they 
could not tolerate having their matrix at pH 10, as would be required to 
generate their proton-motive force without one. 



not only the main source of fixed carbon to the rest of the cell, but also the 
reducing power and ATP needed for metabolism outside the chloroplast. 



Chloroplasts Also Perform Other Crucial Biosyntheses 

The chloroplast performs many biosyntheses in addition to photosynthesis. 
Enzymes in the chloroplast stroma make all of the cell's fatty acids and a set of 
amino acids, for example. Similarly, the reducing power of light-activated elec- 
trons drives the reduction of nitrite (N0 2 ") to ammonia (NH 3 ) in the chloroplast; 
this ammonia provides the plant with nitrogen for the synthesis of amino acids 
and nucleotides. The metabolic importance of the chloroplast for plants and 
algae therefore extends far beyond its role in photosynthesis. 
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Summary 

Chloroplasts and photosynthetic bacteria obtain high-energy electrons by means of 
photosystems that capture the electrons that are excited when sunlight is absorbed by 
chlorophyll molecules. Photosystems are composed of an antenna complex that fun- 
nels energy to a photochemical reaction center, where a precisely ordered complex of 
proteins and pigments allows electron carriers to capture the energy of an excited 
chlorophyll electron. The most completely understood photochemical reaction center 
is that of purple photosynthetic bacteria, which contain only a single photosystem. In 
contrast, there are two distinct photosystems in chloroplasts and cyanobacteria. The 
two photosystems are normally linked in series, and they transfer electrons from water 
to NADP+ to form NADPH, with the concomitant production of a transmembrane elec- 
trochemical proton gradient. In these linked photosystems, molecular oxygen (O2) is 
generated as a by-product of removing four low-energy electrons from two specifically 
positioned water molecules. The detailed three-dimensional structures of photosys- 
tems I and II show a striking degree of homology to the structure of the photosystems 
of purple photosynthetic bacteria, demonstrating a remarkable degree of conservation 
over billions of years of evolution. 

Compared with mitochondria, chloroplasts have an additional internal mem- 
brane (the thylakoid membrane) and a third internal space (the thylakoid space). All 
electron-transport processes occur in the thylakoid membrane: to make ATP, H + is 
pumped into the thylakoid space, and a backflow ofH + through an ATP synthase then 
produces the ATP in the chloroplast stroma. This ATP is used in conjunction with the 
NADPH made by photosynthesis to drive a large number of biosynthetic reactions in 
the chloroplast stroma, including the all-important carbon-fixation cycle, which cre- 
ates carbohydrate from C0 2 . Along with some other important chloroplast products, 
this carbohydrate is exported to the cell cytosol, where— as glyceraldehyde 3-phos- 
phate—it provides organic carbon, ATP, and reducing power to the rest of the cell 
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THE GENETIC SYSTEMS OF MITOCHONDRIA AND 
PLASTIDS 

The view that mitochondria and plastids evolved from bacteria that were 
engulfed by ancestral cells can explain why both types of organelles contain 
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their own genomes, as well as their own biosynthetic machinery for making 
RNA and organelle proteins. Mitochondria and plastids are never made from 
scratch, but instead arise by the growth and division of an existing mitochon- 
drion or plastid. On average, each organelle must double in mass in each cell 
generation and then be distributed into each daughter cell. Even nondividing 
cells must replenish organelles that are degraded as part of the continual pro- 
cess of organelle turnover, or produce additional organelles as the need arises. 
Organelle growth and proliferation is a complicated process because mito- 
chondrial and plastid proteins are encoded in two places: the nuclear genome 
and the separate genomes harbored in the organelles themselves (Figure 
14-52). In Chapter 12, we discuss how selected proteins and lipids are imported 
into mitochondria and chloroplasts from the cytosol. Here we describe how the 
organelle genomes are maintained and the contributions they make to 
organelle biogenesis. 



Mitochondria and Chloroplasts Contain Complete Genetic 
Systems 

The biosynthesis of mitochondria and plastids requires contributions from two 
separate genetic systems. Special genes in nuclear DNA encode most of the pro- 
teins in mitochondria and chloroplasts. The organelle imports these proteins 
from the cytosol after they have been synthesized on cytosolic ribosomes. The 
organelle DNA encodes other organelle proteins that are synthesized on ribo- 
somes within the organelle, using organelle-produced mRNA to specify their 
amino acid sequence (Figure 14-53). The protein traffic between the cytosol 
and these organelles seems to be unidirectional, as proteins are normally not 
exported from mitochondria or chloroplasts to the cytosol. An exception occurs 
under special conditions when a cell is about to undergo apoptosis. As will be 
discussed in detail in Chapter 19, the mitochondrion releases intermembrane 
space proteins (including cytochrome c) through its outer mitochondrial mem- 
brane as part of an elaborate signaling pathway that is triggered to cause cells to 
undergo programmed cell death. 

The processes of organelle DNA transcription, protein synthesis, and DNA 
replication (Figure 14-54) take place where the genome is located: in the matrix 
of mitochondria and the stroma of chloroplasts. Although the proteins that 
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Figure 14-52 Mitochondrial and nuclear 
DNA stained with a fluorescent dye. This 
micrograph shows the distribution of the 
nuclear genome (red) and the multiple 
small mitochondrial genomes (bright 
yellow spots) in a Euglena gracilis cell. The 
DNA is stained with ethidium bromide, a 
fluorescent dye that emits red light. In 
addition, the mitochondrial matrix space is 
stained with a green fluorescent dye that 
reveals the mitochondria as a branched 
network extending throughout the 
cytosol. The superposition of the green 
matrix and the red DNA gives the 
mitochondrial genomes their yellow color. 
(Courtesy of Y. Hayashi and K. Ueda, J. Cell 
ScL 93:565-570, 1989. With permission 
from The Company of Biologists.) 
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Figure 1 4-53 The production of 
mitochondrial and chloroplast proteins 
by two separate genetic systems. Most 
of the proteins in these organelles are 
encoded by the nucleus and must be 
imported from the cytosol. 
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mediate these genetic processes are unique to the organelle, the nuclear 
genome encodes most of them. This is all the more surprising because the pro- 
tein-synthesis machinery of the organelles resembles that of bacteria rather 
than that of eucaryotes. The resemblance is particularly close in chloroplasts. 
For example, chloroplast ribosomes are very similar to E. coli ribosomes, both in 
their structure and in their sensitivity to various antibiotics (such as chloram- 
phenicol, streptomycin, erythromycin, and tetracycline). In addition, protein 
synthesis in chloroplasts starts with N-formyl methionine, as in bacteria, and 
not with the methionine used for this purpose in the cytosol of eucaryotic cells. 
Although mitochondrial genetic systems are much less similar to those of pre- 
sent-day bacteria than are the genetic systems of chloroplasts, their ribosomes 
are also sensitive to antibacterial antibiotics, and protein synthesis in mito- 
chondria also starts with N-formyl methionine. 



Organelle Growth and Division Determine the Number of 
Mitochondria and Plastids in a Cell 

In mammalian cells, mitochondrial DNA makes up less than 1% of the total cel- 
lular DNA. In other cells, however, such as the leaves of higher plants or the very 
large egg cells of amphibians, a much larger fraction of the cellular DNA may be 
present in mitochondria or chloroplasts (Table 14-2), and a large fraction of 
RNA and protein synthesis takes place there. 

Mitochondria and plastids are large enough to be observed by light 
microscopy in living cells. For example, mitochondria can be visualized by 
expressing a genetically engineered fusion of a mitochondrial protein linked to 
the green fluorescent protein (GFP) in cells, or cells can be incubated with a flu- 
orescent dye that is specifically taken up by mitochondria because of the elec- 
trochemical gradient across their membranes. Such images demonstrate that 
the mitochondria in living cells are dynamic— frequently dividing, fusing, and 
changing shape (Figure 14-55), as mentioned previously. Division (fission) and 
fusion of these organelles are topologically complex processes, because the 
organelles are enclosed by a double membrane and the mitochondrion must 
maintain the integrity of its separate mitochondrial compartments during these 
processes (Figure 14-56). 

The number and shape of mitochondria vary dramatically in different cell 
types and can change in the same cell type under different physiological condi- 
tions, ranging from multiple spherical or cylindrically shaped organelles to a 
single organelle with a branched structure (a reticulum). The arrangement is 
controlled by the relative rates of mitochondrial division and fusion, which are 
regulated by dedicated GTPases that reside on mitochondrial membranes. In 
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Figure 14-54 An electron micrograph of 
an animal mitochondrial DNA molecule 
caught during the process of DNA 
replication. The circular DNA genome 
has replicated only between the two 
points marked by red arrows. The newly 
synthesized DNA is colored yellow. 
(Courtesy of David A. Clayton.) 



Table 1 4-2 Relative Amounts of Organelle DNA in Some Cells and Tissues 
ORGANISM 




ISSUE OR 
CELL TYPE 



MITOCHONDRIAL DNA 



DNA MOLECULES 
PER ORGANELLE 



Rat 


liver 


5-10 


Yeast* 


vegetative 


2-50 


Frog 


egg 


5-10 


CHLOROPLAST DNA 






Chlamydomonas 


vegetative 


80 


Maize 


leaves 


0-300** 



ORGANELLES PER 
CELL 



1000 
1-50 

10 7 



1 

20-40 



ORGANELLE DNA AS 

PERCENTAGE OF TOTAL 

CELLULAR DNA 



1 
15 
99 



7 
0-15* 



*The large variation in the number and size of mitochondria per cell in yeasts is due to mitochondrial fusion and fission. 

**ln maize, the amount of chloroplast DNA drops precipitously in mature leaves, after cell division ceases: the chloroplast DNA is degraded and 

stable mRNAs persist to provide for protein synthesis 
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Figure 14-55 A dynamic mitochondrial reticulum. (A) In yeast cells, mitochondria form a continuous reticulum 
underlying the plasma membrane (stereo pair). (B) A balance between fission and fusion determines the 
arrangement of the mitochondria in different cells. (C) Time-lapse fluorescent microscopy shows the dynamic 
behavior of the mitochondrial network in a yeast cell. In addition to shape changes, fission and fusion constantly 
remodel the network (red arrows). The pictures were taken at 3-minute intervals. (A and C, from J. Nunnari et al., Mol. 
Biol. Cell 8:1 233-1 242, 1997. With permission by the American Society for Cell Biology.) 



addition, the total organelle mass per cell can be regulated according to need. 
For example, a large increase in mitochondria (as much as 5-10-fold) occurs 
when a resting skeletal muscle is repeatedly stimulated to contract for a pro- 
longed period. 

There can be many copies of the mitochondrial and plastid genomes in the 
space enclosed by each organelles inner membrane The degree of organelle 
fragmentation determines the number of these genomes present in a single 
organelle; generally, a single compartment houses many genomes (see Table 
14-2). In most cells, the replication of the organelle DNA is not limited to the S 
phase of the cell cycle, when the nuclear DNA replicates, but occurs throughout 
the cell cycle— out of phase with cell division. Individual organelle DNA 
molecules seem to be selected at random for replication, so that in a given cell 
cycle, some may replicate more than once and others not at all. Nonetheless, 
under constant conditions, the process is regulated to ensure that the total num- 
ber of organelle DNA molecules doubles in every cell cycle, as required if each 
cell type is to maintain a constant amount of organelle DNA. 

In special circumstances, the cell can precisely control organelle division. In 
some algae that contain only one or a few chloroplasts, for example, the 
organelle divides just before the cell does, in a plane that is identical to the future 
plane of cell division. 
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Figure 14-56 Mitochondrial fission and 

fusion. These processes involve both 
outer and inner mitochondrial 
membranes. (A) During fusion and 
fission, the mitochondrion maintains 
both its matrix and its intermembrane 
space compartments. Different 
membrane fusion machines are thought 
to operate at the outer and inner 
membranes. The fission process 
resembles that of bacterial cell division 
(discussed in Chapter 16). The pathway 
shown has been postulated from static 
views such as that shown in (B). (B) An 
electron micrograph of a dividing 
mitochondrion in a liver cell. (B, courtesy 
of DanielS. Friend.) 
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Mitochondria and Chloroplasts Have Diverse Genomes 

The multiple copies of mitochondrial and chloroplast DNA contained within the 
matrix or stroma of these organelles are usually distributed in several clusters, 
called nucleoids. Nucleoids are thought to be attached to the inner mitochon- 
drial membrane. The DNA structure in nucleoids is likely to resemble that in 
bacteria rather than that in eucaryotic chromatin; as in bacteria, for example, 
there are no histones. 

The size range of organelle DNAs is similar to that of viral DNAs. Mitochon- 
drial DNA molecules range in size from less than 6000 nucleotide pairs in Plas- 
modium falciparum (the human malaria parasite) to more than 300,000 
nucleotide pairs in some land plants (Figure 14-57). The chloroplast genome of 
land plants ranges in size from 70,000 to 200,000 nucleotide pairs. Chloroplast 
chromosomes had long been thought to be circular DNA molecules that contain 
a single copy of the genome, but it is now recognized that such molecules are 
outnumbered in most chloroplasts by linear DNA molecules formed from mul- 
tiple copies of the genome, joined end to end. The mixture of structures 
observed for chloroplast chromosomes is best explained as originating from a 
DNA replication process that is closely linked to DNA recombination events, 
analogous to the replication processes observed for viruses such as Herpes. Sim- 
ilar types of mixed structures may also exist for many mitochondrial genomes. 
However, in mammals, the mitochondrial genome is a simple DNA circle of 
about 16,500 base pairs (less than 0.001% of the size of the nuclear genome), and 
it is nearly the same size in animals as diverse as Drosophila and sea urchins. 



Figure 14-57 Various sizes of 
mitochondrial genomes. The complete 
DNA sequences for more than 500 
mitochondrial genomes have been 
determined. The lengths of a few of these 
mitochondrial DNAs are shown to scale 
as circles for those genomes thought to 
be circular and lines for linear genomes. 
The largest circle represents the genome 
of Rickettsia prowazekii, a small 
pathogenic bacterium whose genome 
most closely resembles that of 
mitochondria. The size of mitochondrial 
genomes does not correlate well with the 
number of proteins encoded in them: 
while human mitochondrial DNA 
encodes 13 proteins, the 22-fold larger 
mitochondrial DNA of Arabidopsis 
encodes only 32 proteins— that is, about 
2.5-fold as many as human mitochondrial 
DNA. The extra DNA that is found in 
Arabidopsis, Marchantia, and other plant 
mitochondria may be "junk DNA". The 
mitochondrial DNA of the protozoan 
Reclinomonas amehcana has 98 genes, 
more than the mitochondrion of any 
other organism analyzed so far. (Adapted 
from M.W. Gray et al., Science 
283:1476-1481, 1999. With permission 
from AAAS.) 



Mitochondria and Chloroplasts Probably Both Evolved from 
Endosymbiotic Bacteria 

The procaryotic character of the organelle genetic systems, especially striking in 
chloroplasts, suggests that mitochondria and chloroplasts evolved from bacteria 
that were endocytosed more than 1 billion years ago. According to one version 
of this endosymbiont hypothesis, eucaryotic cells started out as anaerobic organ- 
isms without mitochondria or chloroplasts and then established a stable 
endosymbiotic relation with a bacterium, whose oxidative phosphorylation sys- 
tem they subverted for their own use (Figure 14-58). According to this hypothe- 
sis, the endocytic event that led to the development of mitochondria occurred 
when oxygen entered the atmosphere in substantial amounts, more than 1.5 x 
10 9 years ago, before animals and plants separated (see Figure 14-71). The 
chloroplast seems to have been similarly derived later, from endocytosis of an 
oxygen-producing photosynthetic organism that resembled a cyanobacterium. 
Most of the genes encoding present-day mitochondrial and chloroplast pro- 
teins are in the cell nucleus. Thus, an extensive transfer of genes from organelle 
to nuclear DNA must have occurred during eucaryote evolution. A successful 
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Figure 14-58 One possible evolutionary 
pathway for the origin of mitochondria. 

There are anaerobic single-celled 
eucaryotes present on the Earth today 
that lack a mitochondrial genome, 
including the protozoan Giardia. 
However, all those examined appear to 
contain some type of membrane- 
enclosed organelle remnant. In these 
remnants, mitochondrial-like reactions, 
such as the production of iron-sulfur 
centers, occur. Thus far, we know of no 
living example of the anaerobic cell 
postulated in this figure to have first 
engulfed the precursors of mitochondria. 
For this and other reasons, alternative 
pathways have been postulated in which 
the combination of a bacterium and an 
archaea led to both the first eukaryotic 
cells and the first mitochondria. 
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transfer of this type is expected to be rare, because a gene moved from organelle 
DNA needs to change to become a functional nuclear gene: it must adapt to the 
nuclear and cytoplasmic transcription and translation requirements, and also 
acquire a signal sequence so that the encoded protein can be delivered to the 
organelle after its synthesis in the cytosol. Nevertheless, there is evidence that 
such gene transfers to the nucleus continue to occur in some organisms today. 

Gene transfer explains why many of the nuclear genes encoding mitochon- 
drial and chloroplast proteins resemble bacterial genes. The amino acid 
sequence of the chicken mitochondrial enzyme superoxide dismutase, for exam- 
ple, resembles the corresponding bacterial enzyme much more than it resem- 
bles the superoxide dismutase found in the cytosol of the same eucaryotic cell. 

Gene transfer seems to have been a gradual process. When mitochondrial 
genomes encoding different numbers of proteins are compared, a pattern of 
sequential reduction of encoded mitochondrial functions emerges (Figure 
14-59). The smallest and presumably most highly evolved mitochondrial 
genomes, for example, encode only a few inner- membrane proteins involved in 
electron-transport reactions, plus ribosomal RNAs and some tRNAs. Mitochon- 
drial genomes that have remained more complex tend to contain this same sub- 
set of genes, plus others. The most complex genomes are characterized by the 
presence of many extra genes compared with animal and yeast mitochondrial 
genomes. Many of these genes encode components of the mitochondrial 
genetic system, such as RNA polymerase subunits and ribosomal proteins; these 
genes are instead found in the cell nucleus in organisms that have reduced their 
mitochondrial DNA content. 

What type of bacterium gave rise to the mitochondrion? From sequence 
comparisons, it seems that mitochondria have descended from a particular type 
of purple photosynthetic bacterium that had previously lost its ability to per- 
form photosynthesis and was left with only a respiratory chain. 
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Figure 14-59 Comparison of 
mitochondrial genomes. Less complex 
mitochondrial genomes encode subsets 
of the proteins and ribosomal RNAs that 
are encoded by larger mitochondrial 
genomes. There are only four genes 
present in all known mitochondrial 
genomes; these encode ribosomal RNAs 
(ms and ml), cytochrome b (cob), and a 
cytochrome oxidase subunit (coxl). 
(Adapted from M.W. Gray et al., Science 
283:1476-1481, 1999. With permission 
from A A AS.) 



Mitochondria Have a Relaxed Codon Usage and Can Have a 
Variant Genetic Code 

The relatively small size of the human mitochondrial genome made it a particu- 
larly attractive target for early DNA-sequencing projects, and in 1981, 
researchers published the complete sequence of its 16,569 nucleotides. By com- 
paring this sequence with known mitochondrial tRNA sequences and with the 
partial amino acid sequences available for proteins encoded by the mitochon- 
drial DNA, all of the human mitochondrial genes were mapped on the circular 
DNA molecule (Figure 14-60). 

Compared with nuclear, chloroplast, and bacterial genomes, the human 
mitochondrial genome has several surprising features: 

1. Dense gene packing. Unlike other genomes, nearly every nucleotide seems 
to be part of a coding sequence, either for a protein or for one of the rRNAs 
or tRNAs. Since these coding sequences run directly into each other, there 
is very little room left for regulatory DNA sequences. 

2. Relaxed codon wsage. Whereas 30 or more tRNAs specify amino acids in the 
cytosol and in chloroplasts, only 22 tRNAs are required for mitochondrial 
protein synthesis. The normal codon-anticodon pairing rules are relaxed 
in mitochondria, so that many tRNA molecules recognize any one of the 



Figure 1 4-60 The organization of the 
human mitochondrial genome. The 

genome contains 2 rRNA genes, 22 tRNA 
genes, and 13 protein-coding sequences. 
The DNAs of many other animal 
mitochondrial genomes have also been 
completely sequenced. Most of these 
animal mitochondrial DNAs encode 
precisely the same genes as humans, 
with the gene order being identical for 
animals that range from mammals to fish. 
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Table 14-3 Some Differences Between the "Universal" Code and Mitochondrial Genetic Codes* 

MITOCHONDRIAL CODES 



CODON 


"UNIVERSAL" CODE 


MAMMALS 


INVERTEBRATES 


YEASTS 


PLANTS 


UGA 


STOP 


Trp 


Trp 


Trp 


STOP 


AUA 


fie 


Met 


Met 


Met 


lie 


CUA 


Leu 


Uu 


Leu 


Thr 


Leu 


AGA I 
AGG 


Arg 


STOP 


Ser 


Arg 


Arg 



*Red italics indicate that the code differs from the "Universal" code. 



four nucleotides in the third (wobble) position. Such "2 out of 3" pairing 
allows one tRNA to pair with any one of four codons and permits protein 
synthesis with fewer tRNA molecules. 
3. Variant genetic code. Perhaps most surprising, comparisons of mitochon- 
drial gene sequences and the amino acid sequences of the corresponding 
proteins indicate that the genetic code is different: 4 of the 64 codons have 
different "meanings" from those of the same codons in other genomes 
(Table 14-3). 

The close similarity of the genetic code in all organisms provides strong evi- 
dence that all cells have evolved from a common ancestor. How, then, do we 
explain the few differences in the genetic code in many mitochondria? A hint 
comes from the finding that the mitochondrial genetic code is different in differ- 
ent organisms. In the mitochondrion with the largest number of genes in Figure 
14-59, that of the protozoan Reclinomonas, the genetic code is unchanged from 
the standard genetic code of the cell nucleus. Yet UGA, which is a stop codon else- 
where, is read as tryptophan in mitochondria of mammals, fungi, and inverte- 
brates. Similarly, the codon AGG normally codes for arginine, but it codes for stop 
in the mitochondria of mammals and codes for serine in the mitochondria of 
Drosophila (see Table 14-3). Such variation suggests that a random drift can 
occur in the genetic code in mitochondria. Presumably, the unusually small 
number of proteins encoded by the mitochondrial genome makes an occasional 
change in the meaning of a rare codon tolerable, whereas such a change in a large 
genome would alter the function of many proteins and thereby destroy the cell. 



Animal Mitochondria Contain the Simplest Genetic Systems 
Known 

Comparisons of DNA sequences in different organisms reveal that, in vertebrate 
animals (including ourselves), the rate of nucleotide substitution during evolu- 
tion has been 10 times greater in the mitochondrial genome than in the nuclear 
genome. This difference is likely to be due to a reduced fidelity of mitochondrial 
DNA replication, inefficient DNA repair, or both. Because only about 16,500 
DNA nucleotides need to be replicated and expressed as RNAs and proteins in 
animal cell mitochondria, the error rate per nucleotide copied by DNA replica- 
tion, maintained by DNA repair, transcribed by RNA polymerase, or translated 
into protein by mitochondrial ribosomes can be relatively high without damag- 
ing one of the relatively few gene products. This could explain why the mecha- 
nisms that perform these processes are relatively simple compared with those 
used for the same purpose elsewhere in cells. The presence of only 22 tRNAs and 
the unusually small size of the rRNAs (less than two-thirds the size of the E. coli 
rRNAs), for example, would be expected to reduce the fidelity of protein synthe- 
sis in mitochondria, although this has not yet been tested adequately. 

The relatively high rate of evolution of animal mitochondrial genes makes a 
comparison of mitochondrial DNA sequences especially useful for estimating 
the dates of relatively recent evolutionary events, such as the steps in primate 
evolution. 
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Some Organelle Genes Contain Introns 

The processing of precursor RNAs has an important role in the two mitochon- 
drial systems studied in most detail — human and yeast. In human cells, both 
strands of the mitochondrial DNA are transcribed at the same rate from a single 
promoter region on each strand, producing two different giant RNA molecules, 
each containing a full-length copy of one DNA strand. Transcription is therefore 
completely symmetric. The transcripts made on one strand are extensively pro- 
cessed by nuclease cleavage to yield the two rRNAs, most of the tRNAs, and 
about 10 poly-A-containing RNAs. In contrast, the transcript of the other strand 
is processed to produce only 8 tRNAs and 1 small poly-A-containing RNA; the 
remaining 90% of this transcript apparently contains no useful information 
(being complementary to coding sequences synthesized on the other strand) 
and is degraded. The poly-A-containing RNAs are the mitochondrial mRNAs: 
although they lack a cap structure at their 5' end, they carry a poly-A tail at their 
3' end that is added posttranscriptionally by a mitochondrial poly-A polymerase. 

Unlike human mitochondrial genes, some plant and fungal (including 
yeast) mitochondrial genes contain introns, which must be removed by RNA 
splicing. Introns also occur in some in plant chloroplast genes. Many of the 
introns in organelle genes consist of a family of related nucleotide sequences 
that are capable of splicing themselves out of the RNA transcripts by RNA-medi- 
ated catalysis (discussed in Chapter 6), although proteins generally aid these 
self-splicing reactions. The presence of introns in organelle genes is surprising, 
as introns are not common in the genes of the bacteria whose ancestors are 
thought to have given rise to mitochondria and plant chloroplasts. 

In yeasts, the same mitochondrial gene may have an intron in one strain but 
not in another. Such "optional introns" seem to be able to move in and out of 
genomes like transposable elements. In contrast, introns in other yeast mito- 
chondrial genes have also been found in a corresponding position in the mito- 
chondria of Aspergillus and Neurospora, implying that they were inherited from 
a common ancestor of these three fungi. It is possible that these intron 
sequences are of ancient origin — tracing back to a bacterial ancestor — and that, 
although they have been lost from many bacteria, they have been preferentially 
retained in some organelle genomes where RNA splicing is regulated to help 
control gene expression. 

The Chloroplast Genome of Higher Plants Contains About 
120 Genes 

More than 20 chloroplast genomes have now been sequenced. The genomes of 
even distantly related plants (such as tobacco and liverwort) are highly similar, 
and even those of green algae are closely related (Figure 14-61). Chloroplast 
genes are involved in four main types of processes: transcription, translation, 
photosynthesis, and the biosynthesis of small molecules such as amino acids, 
fatty acids, and pigments. Plant chloroplast genes also encode at least 40 pro- 
teins whose functions are as yet unknown. Paradoxically, all of the known pro- 
teins encoded in the chloroplast are part of larger protein complexes that also 
contain one or more subunits encoded in the nucleus. We discuss possible rea- 
sons for this paradox later. 

The genomes of chloroplasts and bacteria have striking similarities. The 
basic regulatory sequences, such as transcription promoters and terminators, 
are virtually identical in the two cases. The amino acid sequences of the proteins 
encoded in chloroplasts are clearly recognizable as bacterial, and several clus- 
ters of genes with related functions (such as those encoding ribosomal proteins) 
are organized in the same way in the genomes of chloroplasts, E. coli, and 
cyanobacteria. 

Further comparisons of large numbers of homologous nucleotide 
sequences should help clarify the exact evolutionary pathway from bacteria to 
chloroplasts, but some conclusions can already be drawn: 
1. Chloroplasts in higher plants arose from photosynthetic bacteria. 
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Figure 14-61 The organization of the 
liverwort chloroplast genome. The 

chloroplast genome organization is 
similar in all higher plants, although the 
size varies from species to species- 
depending on how much of the DNA 
surrounding the genes encoding the 
chloroplast's 16Sand 23S ribosomal RNAs 
is present in two copies. 
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2. Many of the genes of the original bacterium are now present in the nuclear 
genome, where they have been integrated and are stably maintained. In 
higher plants, for example, two-thirds of the 60 or so chloroplast ribosomal 
proteins are encoded in the cell nucleus; these genes have a clear bacterial 
ancestry, and the chloroplast ribosomes retain their original bacterial 
properties. 



Mitochondrial Genes Are Inherited by a Non-Mendelian 
Mechanism 

Many experiments on the mechanisms of mitochondrial biogenesis have been 
performed with Saccharomyces cerevisiae (baker's yeast). There are several rea- 
sons for this preference. First, when grown on glucose, this yeast has an ability 
to live by glycolysis alone and can therefore survive with defective mitochondria 
that cannot perform oxidative phosphorylation. This makes it possible to grow 
cells with mutations in mitochondrial or nuclear DNA that interfere with mito- 
chondrial function; such mutations are lethal in many other eucaryotes. Second, 
yeasts are simple unicellular eucaryotes that are easy to grow and characterize 
biochemically, Finally these yeast cells normally reproduce asexually by bud- 
ding, but they can also reproduce sexually. During sexual reproduction" two hap^ 
loid cells mate and fuse to form a diploid zygote, which can either grow mitoti- 
cally or divide by meiosis to produce new haploid cells. 

The ability to control the alternation between asexual and sexual reproduc- 
tion in the laboratory greatly facilitates genetic analyses. Mutations in mito- 
chondrial genes are not inherited in accordance with the Mendelian rules that 
govern the inheritance of nuclear genes. Therefore, long before the mitochon- 
drial genome could be sequenced, genetic studies revealed which of the genes 
involved in yeast mitochondrial function are located in the nucleus and which 
in the mitochondria. An example of non-Mendelian (cytoplasmic) inheritance 
of mitochondrial genes in a haploid yeast cell is shown in Figure 14-62. In this 
example, we trace the inheritance of a mutant gene that makes mitochondrial 
protein synthesis resistant to chloramphenicol. 

When a chloramphenicol-resistant haploid cell mates with a chlorampheni- 
col-sensitive wild-type haploid cell, the resulting diploid zygote contains a mix- 
ture of mutant and wild-type genomes. The two mitochondrial networks fuse in 
the zygote, creating one continuous reticulum that contains genomes of both 
parental cells. When the zygote undergoes mitosis, copies of both mutant and 
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wild-type mitochondrial DNA are segregated to the diploid daughter cell. In the 
case of nuclear DNA, each daughter cell receives exactly two copies of each 
chromosome, one from each parent. By contrast, in the case of mitochondrial 
DNA, the daughter cell may inherit either more copies of the mutant DNA or 
more copies of the wild-type DNA. Successive mitotic divisions can further 
enrich for either DNA, so that subsequently many cells will arise that contain 
mitochondrial DNA of only one genotype. This stochastic process is called 
mitotic segregation. 

When diploid cells that have segregated their mitochondrial genomes in this 
way undergo meiosis to form four haploid daughter cells, each of the four 
daughters receives the same mitochondrial genes. This type of inheritance is 
called non-Mendelian, or cytoplasmic inheritance, to contrast it with the 
Mendelian inheritance of nuclear genes (see Figure 14-62). When non- 
Mendelian inheritance occurs, it demonstrates that the gene in question is 
located outside the nuclear chromosomes. 

Although clusters of mitochondrial DNA molecules (nucleoids) are relatively 
immobile in the mitochondrial reticulum because of their anchorage to the 
inner membrane, individual nucleoids occasionally come together. This is most 
likely to occur at sites where the two parental mitochondrial networks fuse dur- 
ing zygote formation. When different DNAs are present in the same nucleoid, 
genetic recombination can occur. This recombination can result in mitochon- 
drial genomes that contain DNA from both parent cells, which are stably inher- 
ited after their mitotic segregation. 
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Figure 14-62 The difference in the 
patterns of inheritance between 
mitochondrial and nuclear genes of yeast 
cells. For nuclear genes (Mendelian 
inheritance), two of the four cells that result 
from meiosis inherit the gene from one of 
the original haploid parent cells {green 
chromosomes), and the remaining two cells 
inherit the gene from the other {black 
chromosomes). By contrast, for 
mitochondrial genes (non-Mendelian 
inheritance), it is possible for all four of the 
cells that result from meiosis to inherit their 
mitochondrial genes from only one of the 
two original haploid cells. In this example, 
the mitochondrial gene is one that, in its 
mutant form (mitochondrial DNA denoted 
by blue dots), makes protein synthesis in the 
mitochondrion resistant to 
chloramphenicol— a protein synthesis 
inhibitor that acts specifically on the 
procaryotic-like ribosomes in mitochondria 
and chloroplasts. We can detect yeast cells 
that contain the mutant gene by their 
ability to grow in the presence of 
chloramphenicol on a substrate, such as 
glycerol, that cannot be used for glycolysis. 
With glycolysis blocked, ATP must be 
provided by functional mitochondria, and 
therefore the cells that carry the normal 
(wild-type) mitochondrial DNA (red dots) 
cannot grow. 
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Organelle Genes Are Maternally Inherited in Many Organisms 

The consequences of cytoplasmic inheritance are more profound for some 
organisms, including ourselves, than they are for yeasts. In yeasts, when two 
haploid cells mate, they are equal in size and contribute equal amounts of mito- 
chondrial DNA to the zygote (see Figure 14-62). Mitochondrial inheritance in 
yeasts is therefore biparental: both parents contribute equally to the mitochon- 
drial gene pool of the progeny (although, as we have just seen, after several gen- 
erations of vegetative growth, the individual progeny often contain mitochon- 
dria from only one parent). In higher animals, by contrast, the egg cell always 
contributes much more cytoplasm to the zygote than does the sperm. One 
would therefore expect mitochondrial inheritance in higher animals to be nearly 
uniparental— or, more precisely, maternal. Such maternal inheritance has been 
demonstrated in laboratory animals. A cross between animals carrying type A 
mitochondrial DNA and animals carrying type B results in progeny that contain 
only the maternal type of mitochondrial DNA. Similarly, by following the distri- 
bution of variant mitochondrial DNA sequences in large families, we find that 
human mitochondrial DNA is maternally inherited. 

In about two-thirds of higher plants, the chloroplasts from the male parent 
(contained in pollen grains) do not enter the zygote, so that chloroplast as well 
as mitochondrial DNA is maternally inherited. In other plants, the pollen 
chloroplasts enter the zygote, making chloroplast inheritance biparental. In 
such plants, defective chloroplasts are a cause of variegation: a mixture of nor- 
mal and defective chloroplasts in a zygote may sort out by mitotic segregation 
during plant growth and development, thereby producing alternating green and 
white patches in leaves. The green patches contain normal chloroplasts, while 
the white patches contain defective chloroplasts (Figure 14-63). 

A fertilized human egg carries perhaps 2000 copies of the human mitochon- 
drial genome, all but one or two inherited from the mother. A human in which 
all of these genomes carried a deleterious mutation would generally not survive. 
But some mothers carry a mixed population of both mutant and normal mito- 
chondrial genomes. Their daughters and sons inherit this mixture of normal and 
mutant mitochondrial DNAs and are healthy unless the process of mitotic seg- 
regation by chance results in a majority of defective mitochondria in a particu- 
lar tissue. Muscle and nervous tissues are most at risk, because of their need for 
particularly large amounts of ATP. 

We can identify an inherited disease in humans caused by a mutation in 
mitochondrial DNA by its passage from affected mothers to both their daughters 
and their sons, with the daughters but not the sons producing grandchildren 
with the disease. As expected from the random nature of mitotic segregation, the 
symptoms of these diseases vary greatly between different family members- 
including not only the severity and age of onset, but also which tissue is affected. 

Consider, for example, the inherited disease myoclonic epilepsy and ragged 
red fiber disease (MERRF), which can be caused by a mutation in one of the mito- 
chondrial transfer RNA genes. This disease appears when, by chance, a particu- 
lar tissue inherits a threshold amount of defective mitochondrial DNA genomes. 
Above this threshold, the pool of defective tRNA decreases the synthesis of the 
mitochondrial proteins required for electron transport and production of ATP. 
The result may be muscle weakness or heart problems (from effects on heart 
muscle), forms of epilepsy or dementia (from effects on nerve cells), or other 
symptoms. Not surprisingly, a similar variability in phenotypes is found for 
many other mitochondrial diseases. 

Petite Mutants in Yeasts Demonstrate the Overwhelming 
Importance of the Cell Nucleus for Mitochondrial Biogenesis 

Genetic studies of yeasts have had a crucial role in the analysis of mitochon- 
drial biogenesis. A striking example is provided by studies of yeast mutants that 
contain large deletions in their mitochondrial DNA, so that all mitochondrial 
protein synthesis is abolished. Not surprisingly, these mutants cannot make 




Figure 14-63 A variegated leaf. In the 

white patches, the plant cells have 
inherited a defective chloroplast. (Courtesy 
of John Innes Foundation.) 
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respiring mitochondria. Some of these mutants lack mitochondrial DNA alto- 
gether. Because they form unusually small colonies when grown in medium 
containing a low concentration of glucose, all mutants with such defective mito- 
chondria are called cytoplasmic petite mutants. 

Although petite mutants cannot synthesize proteins in their mitochondria 
and therefore cannot make mitochondria that produce ATP, they nevertheless 
contain mitochondria. These mitochondria have a normal outer membrane and 
an inner membrane with poorly developed cristae (Figure 14-64). They contain 
virtually all the mitochondrial proteins that are specified by nuclear genes and 
imported from the cytosol— including DNA and RNA polymerases, all of the cit- 
ric acid cycle enzymes, and most inner membrane proteins — demonstrating the 
overwhelming importance of the nucleus in mitochondrial biogenesis. Petite 
mutants also show that an organelle that divides by fission can replicate indefi- 
nitely in the cytoplasm of proliferating eucaryotic cells, even in the complete 
absence of its own genome. It is possible that peroxisomes normally replicate in 
this way (see Figure 12-33). 

For chloroplasts, the nearest equivalent to yeast mitochondrial petite mutants 
are mutants of unicellular algae such as Euglena. Mutant algae in which no chloro- 
plast protein synthesis occurs still contain chloroplasts and are perfectly viable if 
they are provided with oxidizable substrates. If the development of mature chloro- 
plasts is blocked in higher plants, however, either by raising the plants in the dark 
or because chloroplast DNA is defective or absent, the plants die. 

Mitochondria and Plastids Contain Tissue-Specific Proteins That 
Are Encoded in the Cell Nucleus 

Mitochondria can have specialized functions in particular types of cells. The 
urea cycle, for example, is the central metabolic pathway in mammals for dis- 
posing of cellular breakdown products that contain nitrogen. These products are 
excreted in the urine as urea. Nucleus-encoded enzymes in the mitochondrial 
matrix perform several steps in the cycle. Urea synthesis occurs in only a few tis- 
sues, such as the liver, and the required enzymes are synthesized and imported 
into mitochondria only in these tissues. 

The respiratory enzyme complexes in the mitochondrial inner membrane of 
mammals also contain several tissue-specific, nucleus-encoded subunits that 
are thought to act as regulators of electron transport. Thus, some humans with 
a genetic muscle disease have a defective subunit of cytochrome oxidase; since 
the subunit is specific to skeletal muscle cells, their other cells, including their 
heart muscle cells, function normally, allowing the individuals to survive. As 
would be expected, tissue-specific differences are also found among the 
nucleus-encoded proteins in chloroplasts. 
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Figure 14-64 Electron micrographs of 
yeast cells. (A) The structure of normal 
mitochondria. (B) Mitochondria in a 
petite mutant. In petite mutants, all the 
mitochondrion-encoded gene products 
are missing, and the organelle is 
constructed entirely from nucleus- 
encoded proteins. (Courtesy of Barbara 
Stevens.) 



Mitochondria Import Most of Their Lipids; Chloroplasts Make 
Most of Theirs 

The biosynthesis of new mitochondria and chloroplasts requires lipids in addi- 
tion to nucleic acids and proteins. Chloroplasts tend to make the lipids they 
require. In spinach leaves, for example, all cellular fatty acid synthesis takes 
place in the chloroplast, although desaturation of the fatty acids occurs else- 
where. The major glycolipids of the chloroplast are also synthesized locally. 

Mitochondria, in contrast, import most of their lipids. Animal cells syn- 
thesize the phospholipids phosphatidylcholine and phosphatidylserine in the 
endoplasmic reticulum and then transfer them to the outer membrane of 
mitochondria. In addition to decarboxylating imported phosphatidylserine to 
phosphatidylethanolamine, the main reaction of lipid biosynthesis catalyzed 
by the mitochondria themselves is the conversion of imported lipids to cardi- 
olipin (bisphosphatidylglycerol). Cardiolipin is a "double" phospholipid that 
contains four fatty acid tails (Figure 14-65). It is found mainly in the mito- 
chondrial inner membrane, where it constitutes about 20% of the total lipid. 
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We discuss the important question of how specific cytosolic proteins are 
imported into mitochondria and chloroplasts in Chapter 12. 

Mitochondria May Contribute to the Aging of Cells and 
Organisms 

Oxygen gas, O2, which has a strong affinity for electrons, becomes especially reac- 
tive as soon as it acquires a single electron and becomes partially reduced to 
superoxide (O2"). This explains why, at the very end of the respiratory chain, the 
two oxygen atoms of O2" remain tightly held by cytochrome oxidase until they 
have taken up four electrons and can be safely released as two water molecules 
(see Figure 14-27). As we learn more about the details of biological electron trans- 
port processes, we find that evolution has selected for many clever mechanisms 
that prevent high-energy electrons from escaping from electron transport chains. 
But accidents nevertheless occur frequently, with a molecule of 2 ~ estimated to 
form in approximately 1 in every 2000 electron transfers from NADH to oxygen. 

Cells therefore have backup systems that minimize the damage from super- 
oxide and other reactive oxygen species once they form. Chief among these are 
superoxide dismutase (SOD) enzymes, which catalyze the reaction O2" + 2 " + 
2H + -» H2O2 + O2. The still toxic hydrogen peroxide product is then further 
metabolized to water, either by a catalase enzyme (which carries out the reac- 
tion H2O2 + H2O2 — » 2H2O + O2) or by a glutathione peroxidase (which carries out 
the reaction H 2 2 + 2 GSH -» 2 H 2 + GSSG, where GSH represents the tripep- 
tide glutathione with its cysteine sulfhydryl group). 

Approximately 90% of the O2" generated in cells is formed inside mitochon- 
dria. It is therefore not surprising that mitochondria contain their own superox- 
ide dismutase and glutathione peroxidase enzymes, or that mice deficient in 
mitochondrial superoxide dismutase (MnSOD) die early in life, unlike mice that 
lack either the cytosolic or the extracellular versions of this enzyme. Even in nor- 
mal animals, the mitochondrial DNA is observed to contain a ten-fold higher 
level of oxidized, abnormal nucleotides than the nuclear DNA. These types of 
observations, coupled with observations of poorly functioning mitochondria in 
older human beings, have led to a "vicious cycle" hypothesis that attempts to 
explain why organisms age. In this view, oxidation damage causes mutations to 
accumulate in the mitochondrial DNA of somatic tissues, which in turn causes 
an accelerating rate of oxidation errors in these tissues. This harmful feedback 
loop continues until the mitochondria of old mice or old humans are producing 
so many oxidation products that the entire organism decays — giving rise to a 
debilitating old age. 

Is there any truth to this idea? It is clear that aging is a very complex process. 
Moreover, for lower multicellular organisms such as worms and flies, lifespans 
can be dramatically elongated through any one of a number of genetic changes. 
This data suggests that these organisms have been programmed to age and die 
when they do, rather than wearing out from any accumulation of mutations or 
damage. As a direct test of the vicious cycle hypothesis, mice engineered to pro- 
duce half of the normal amount of MnSOD live as long as normal mice and do 
not appear to age prematurely, even though they accumulate an unusual 
amount of mitochondrial oxidative damage, show a premature decline in mito- 
chondrial function, and have an increased formation of tumors. 

Although the jury is still out with regard to humans, it seems unlikely at 
present that we shall be able to prolong our lives by simply popping pills of 
antioxidants. 
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Figure 14-65 The structure of 
cardiolipin. Cardiolipin is an unusual lipid 
in the inner mitochondrial membrane. 



Why Do Mitochondria and Chloroplasts Have Their Own Genetic 
Systems? 

Why do mitochondria and chloroplasts require their own separate genetic sys- 
tems, when other organelles that share the same cytoplasm, such as peroxi- 
somes and lysosomes, do not? The question is not trivial, because maintaining a 
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separate genetic system is costly: more than 90 proteins— including many ribo- 
somal proteins, aminoacyl-tRNA synthases, DNA and RNA polymerases, and 
RNA-processing and RNA-modifying enzymes— must be encoded by nuclear 
genes specifically for this purpose (Figure 14-66). The amino acid sequences of 
most of these proteins in mitochondria and chloroplasts differ from those of 
their counterparts in the nucleus and cytosol, and it appears that these 
organelles have relatively few proteins in common with the rest of the cell. This 
means that the nucleus must provide at least 90 genes just to maintain each 
organelle's genetic system. 

The reason for such a costly arrangement is not clear. We cannot think of 
compelling reasons why the proteins made in mitochondria and chloroplasts 
should be made there rather than in the cytosol. 

The individual protein subunits in the various mitochondrial enzyme com- 
plexes are highly conserved in evolution, but their site of synthesis is not (see 
Figure 14-59). Nevertheless, two protein-coding genes, Coxl and Cob, are pre- 
sent in all mitochondrial genomes. The proteins encoded by these genes are 
large and hydrophobic, with many membrane-spanning segments, and perhaps 
they need to be co-translationally inserted into the inner membrane by mito- 
chondrial ribosomes. 

Alternatively, the organelle genetic systems may simply be an evolutionary 
dead-end. In terms of the endosymbiont hypothesis, this would mean that the 
process whereby the endosymbionts transferred most of their genes to the 
nucleus stopped before it was complete. Further transfers may have been ruled 
out, for mitochondria, by alterations in the mitochondrial genetic code that 
made the remaining mitochondrial genes nonfunctional if they were transferred 
to the nucleus. 
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Figure 14-66 The origins of 
mitochondrial RNAs and proteins. The 

proteins encoded in the nucleus and 
imported from the cytosol have a major 
role in creating the genetic system of the 
mitochondrion, in addition to 
contributing most of the organelle's 
other proteins. Not indicated in this 
diagram are the additional nucleus- 
encoded proteins that regulate the 
expression of individual mitochondrial 
genes at posttranscriptional levels. The 
mitochondrion itself contributes only 
mRNAs, rRNAs, and tRNAs to its genetic 
system — and in the mitochondria of 
some organisms, even the tRNAs are 
imported. 



870 Chapter 14: Energy Conversion: Mitochondria and Chloroplasts 



Summary 

Mitochondria are additions to cells that allow most eucaryotes to carry out oxidative 
phosphorylation, while chloroplasts are additions to cell that allow selected eucaryotes 
(plants and certain algae) to carry out photosynthesis. Presumably as a result of their 
procaryotic origins, each organelle grows in a coordinated process that requires the 
contribution of two separate genetic systems — one in the organelle and one in the cell 
nucleus. Most of the proteins in these organelles are encoded by nuclear DNA, synthe- 
sized in the cytosol, and then imported individually into the organelle. Some organelle 
proteins and RNAs are encoded by the organelle DNA and are synthesized in the 
organelle itself The human mitochondrial genome contains about 16,500 nucleotides 
and encodes 2 ribosomal RNAs, 22 transfer RNAs, and 13 different polypeptide chains. 
Chloroplast genomes are about 10 times larger and contain about 120 genes. But 
partly functional organelles form in normal numbers even in mutants that lack a 
functional organelle genome, demonstrating the overwhelming importance of the 
nucleus for the biogenesis of both organelles. 

The ribosomes of chloroplasts closely resemble bacterial ribosomes, while mito- 
chondrial ribosomes show both similarities and differences that make their origin 
more difficult to trace. Protein similarities, however, suggest that both organelles orig- 
inated when a primitive eucaryotic cell entered into a stable endosymbiotic relation- 
ship with a bacterium. A purple bacterium is thought to have given rise to the mito- 
chondrion, and (later) a cyanobacterium is thought to have given rise to chloroplasts. 
Although many of the genes of these ancient bacteria still function to make organelle 
proteins, most of them have become integrated into the nuclear genome, where they 
encode bacterialike enzymes that are synthesized on cytosolic ribosomes and then 
imported into the organelle. 

The damage that accumulates in the genomes of mitochondia over time may 
make a unique contribution to the aging of cells and organisms, although much 
remains to be done to prove this a vicious cycle" hypothesis. 



THE EVOLUTION OF ELECTRON-TRANSPORT 
CHAINS 

Much of the structure, function, and evolution of cells and organisms can be 
related to their need for energy. We have seen that the fundamental mechanisms 
for harnessing energy from such disparate sources as light and the oxidation of 
glucose are the same. Apparently, an effective method for synthesizing ATP arose 
early in evolution and has since been conserved with only small variations. How 
did the crucial individual components — ATP synthase, redox-driven H + pumps, 
and photosystems — first arise? Hypotheses about events occurring on an evolu- 
tionary time scale are difficult to test. But clues abound, both in the many dif- 
ferent primitive electron-transport chains that survive in some present-day bac- 
teria, and in geological evidence about the environment of the Earth billions of 
years ago. 

The Earliest Cells Probably Used Fermentation to Produce ATP 

As explained in Chapter 1, the first living cells on Earth are thought to have 
arisen more than 3 x 10 9 years ago, when the Earth was not more than about 10 9 
years old. The environment lacked oxygen but was presumably rich in geo- 
chemically produced organic molecules, and some of the earliest metabolic 
pathways for producing ATP may have resembled present-day forms of fer- 
mentation. 

In the process of fermentation, ATP is made by a phosphorylation event that 
harnesses the energy released when a hydrogen-rich organic molecule, such as 
glucose, is partly oxidized (see Figure 2-71). The electrons lost from the oxidized 
organic molecules are transferred (via NADH or NADPH) to a different organic 
molecule (or to a different part of the same molecule), which thereby becomes 
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more reduced. At the end of the fermentation process, one or more of the organic 
molecules produced are excreted into the medium as metabolic waste products; 
others, such as pyruvate, are retained by the cell for biosynthesis. 

The excreted end products are different in different organisms, but they tend 
to be organic acids (carbon compounds that carry a COOH group). Among the 
most important of such products in bacterial cells are lactic acid (which also 
accumulates in the anaerobic glycolysis that occurs in muscle) and formic, 
acetic, propionic, butyric, and succinic acids. 



Electron-Transport Chains Enabled Anaerobic Bacteria to Use 
Nonfermentable Molecules as Their Major Source of Energy 

The early fermentation processes would have provided not only the ATP but also 
the reducing power (as NADH or NADPH) required for essential biosyntheses. 
Thus, many of the major metabolic pathways could have evolved while fermen- 
tation was the only mode of energy production. With time, however, the 
metabolic activities of these procaryotic organisms must have changed the local 
environment, forcing organisms to evolve new biochemical pathways. The accu- 
mulation of waste products of fermentation, for example, might have resulted in 
the following series of changes: 
Stage 1. The continuous excretion of organic acids lowered the pH of the envi- 
ronment, favoring the evolution of proteins that function as trans- 
membrane H + pumps that can pump H + out of the cell to protect it 
from the dangerous effects of intracellular acidification. One of these 
pumps may have used the energy available from ATP hydrolysis and 
could have been the ancestor of the present-day ATP synthase. 
Stage 2. At the same time as nonfermentable organic acids were accumulating 
in the environment and favoring the evolution of an ATP-consuming 
H + pump, the supply of geochemically generated fermentable nutri- 
ents, which provided the energy for the pumps and for all other cellu- 
lar processes, was dwindling. This favored bacteria that could excrete 
H + without hydrolyzing ATP, allowing the ATP to be conserved for other 
cellular activities. Selective pressures of this kind might have led to the 
first membrane-bound proteins that could use electron transport 
between molecules of different redox potentials as the energy source 
for transporting H + across the plasma membrane. Some of these pro- 
teins would have found their electron donors and electron acceptors 
among the nonfermentable organic acids that had accumulated. Pre- 
sent-day bacteria possess many such electron-transport proteins; 
some bacteria that grow on formic acid, for example, pump H + by using 
the small amount of redox energy derived from the transfer of electrons 
from formic acid to fumarate (Figure 14-67). Others have similar elec- 
tron-transport components devoted solely to the oxidation and reduc- 
tion of inorganic substrates (see Figure 14-69, for example). 
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Figure 14-67 The oxidation of formic acid 
in some present-day bacteria. In such 
anaerobic bacteria, including E, coil, the 
oxidation is mediated by an energy- 
conserving electron-transport chain in the 
plasma membrane. As indicated, the 
starting materials are formic acid and 
fumarate, and the products are succinate 
and C0 2 . Note that H + is consumed inside 
the cell and generated outside the cell, 
which is equivalent to pumping H + to the 
cell exterior. Thus, this membrane-bound 
electron-transport system can generate an 
electrochemical proton gradient across the 
plasma membrane. The redox potential of 
the formic acid-CC>2 pair is -420 mV, while 
that of the fumarate-succinate pair is 
+30 mV. (Compare to Figure 14-29.) 
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Figure 14-68 The evolution of oxidative phosphorylation mechanisms. 

One possible sequence is shown; the stages are described in the text. 



Stage 3. Eventually some bacteria developed H + -pumping electron-transport 
systems that were efficient enough to harness more redox energy than 
they needed just to maintain their internal pH. Now, bacteria that car- 
ried both types of H + pumps were at an advantage. In these cells, a 
large electrochemical proton gradient generated by excessive H + 
pumping allowed protons to leak back into the cell through the ATP- 
driven H + pumps, thereby running them in reverse, so that they func- 
tioned as ATP synthases to make ATP. Because such bacteria required 
much less of the increasingly scarce supply of fermentable nutrients, 
they proliferated at the expense of their neighbors. 

Figure 14-68 shows these three hypothetical stages in the evolution of 
oxidative phosphorylation mechanisms. 



By Providing an Inexhaustible Source of Reducing Power, 
Photosynthetic Bacteria Overcame a Major Evolutionary Obstacle 

The evolutionary steps just described would have solved the problem of main- 
taining both a neutral intracellular pH and an abundant store of energy, but 
these steps would not have solved another equally serious problem. The deple- 
tion of organic nutrients from the environment meant that organisms had to 
find some alternative source of carbon to make the sugars that served as the pre- 
cursors for so many other cellular molecules. Although the CO2 in the atmo- 
sphere provided an abundant potential carbon source, to convert it into an 
organic molecule such as a carbohydrate requires reducing the fixed C0 2 with a 
strong electron donor, such as NADH or NADPH, which can provide the high- 
energy electrons needed to generate each (CH 2 0) unit from CO2 (see Figure 
14-40). Early in cellular evolution, strong reducing agents (electron donors) 
would have been plentiful as products of fermentation. But as the supply of fer- 
mentable nutrients dwindled and a membrane-bound ATP synthase began to 
produce most of the ATR the plentiful supply of NADH and other reducing 
agents would have disappeared. It thus became imperative for cells to evolve a 
new way of generating strong reducing agents. 

Presumably, the main reducing agents still available were the organic acids 
produced by the anaerobic metabolism of carbohydrates, inorganic molecules 
such as hydrogen sulfide (H 2 S) generated geochemically and water. But the 
reducing power of these molecules is far too weak to be useful for C0 2 fixation. 
An early supply of strong electron donors could have been generated by using 
the electrochemical proton gradient across the plasma membrane to drive a 
reverse electron flow. This would have required the evolution of membrane - 
bound enzyme complexes resembling an NADH dehydrogenase, and mecha- 
nisms of this kind survive in the anaerobic metabolism of some present-day 
bacteria (Figure 14-69). 

The major evolutionary breakthrough in energy metabolism, however, was 
almost certainly the development of photochemical reaction centers that could 
use the energy of sunlight to produce molecules such as NADH. It is thought 
that this occurred early in the process of cellular evolution — more than 3 x 10 9 
years ago, in the ancestors of the green sulfur bacteria. Present-day green sul- 
fur bacteria use light energy to transfer hydrogen atoms (as an electron plus a 
proton) from H 2 S to NADPH, thereby creating the strong reducing power 
required for carbon fixation (Figure 14-70). Because the electrons removed 
from H 2 S are at a much more negative redox potential than those of H 2 (- 230 
mV for H 2 S compared + 820 mV for H 2 0) , one quantum of light absorbed by the 
single photosystem in these bacteria is sufficient to achieve a high enough 
redox potential to generate NADPH via a relatively simple photosynthetic elec- 
tron-transport chain. 
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The Photosynthetic Electron-Transport Chains of Cyanobacteria 
Produced Atmospheric Oxygen and Permitted New Life-Forms 

The next step, which is thought to have occurred with the development of the 
cyanobacteria perhaps 3 x 10 9 years ago, was the evolution of organisms capa- 
ble of using water as the electron source for C0 2 reduction. This entailed the 
evolution of a water- splitting enzyme and also required the addition of a second 
photosystem, acting in series with the first, to bridge the enormous gap in redox 
potential between H 2 and NADPH. Present-day structural homologies 
between photosystems suggest that this change involved the cooperation of a 
photosystem derived from green bacteria (photosystem I) with a photosystem 
derived from purple bacteria (photosystem II). The biological consequences of 
this evolutionary step were far-reaching. For the first time, there were organisms 
that made only very minimal chemical demands on their environment. These 
cells could spread and evolve in ways denied to the earlier photosynthetic bac- 
teria, which needed H 2 S or organic acids as a source of electrons. Consequently, 
large amounts of biologically synthesized, reduced organic materials accumu- 
lated. Moreover, oxygen entered the atmosphere for the first time. 

Oxygen is highly toxic because the oxidation reactions it brings about can 
randomly alter biological molecules. Many present-day anaerobic bacteria, for 
example, are rapidly killed when exposed to air. Thus, organisms on the primitive 
Earth would have had to evolve protective mechanisms against the rising 2 lev- 
els in the environment. Late evolutionary arrivals, such as ourselves, have 
numerous detoxifying mechanisms that protect our cells from the ill effects of 



Figure 14-69 Some of the electron- 
transport pathways in present-day 

bacteria. Some species can grow 
anaerobically by substituting nitrate for 
oxygen as the terminal electron acceptor. 
Most use the carbon-fixation cycle and 
synthesize their organic molecules 
entirely from carbon dioxide. The 
indicated pathways generate all the ATP 
and reducing power required from the 
oxidation of inorganic molecules, such as 
iron, ammonia, nitrite, and sulfur 
compounds. 

Note that both forward and reverse 
electron flows occur from the quinone 
(Q). As in the respiratory chain, the 
forward electron flows cause H + to be 
pumped out of the cell, and the resulting 
H + gradient drives the production of ATP 
by an ATP synthase (not shown). The 
NADPH required for carbon fixation is 
produced by an energy-requiring reverse 
electron flow; this energy is also derived 
from the H + gradient, as indicated. 
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Figure 14-70 The general flow of 
electrons in a relatively primitive form 
of photosynthesis observed in present- 
day green sulfur bacteria. The 

photosystem in green sulfur bacteria 
resembles photosystem I in plants and 
cyanobacteria. Both photosystems use a 
series of iron-sulfur centers as the 
electron acceptors that eventually donate 
their high-energy electrons to ferredoxin 
(Fd). An example of a bacterium of this 
type is Chlorobium tepidum, which can 
thrive at high temperatures and low light 
intensities in hot springs. 
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Figure 14-71 Some major events that are believed to have occurred during the evolution of living organisms on Earth. 

With the evolution of the membrane-based process of photosynthesis, organisms could make their own organic molecules 
from C0 2 gas. As explained in the text, the delay of more than 10 9 years between the appearance of bacteria that split 
water and released 2 during photosynthesis and the accumulation of high levels of 2 in the atmosphere is thought to be 
due to the initial reaction of the oxygen with the abundant ferrous iron (Fe 2+ ) that was dissolved in the early oceans. Only 
when the iron was used up would oxygen have started to accumulate. In response to the rising levels of oxygen in the 
atmosphere, nonphotosynthetic oxygen-using organisms appeared, and the concentration of oxygen in the atmosphere 
leveled out. 

oxygen. Even so, an accumulation of oxidative damage to our macromolecules 
has been postulated to be a major cause of human aging, as previously discussed. 

The increase in atmospheric 2 was very slow at first and would have 
allowed a gradual evolution of protective devices. For example, the early seas 
contained large amounts of iron in its ferrous oxidation state (Fe 2+ ), and nearly 
all the 2 produced by early photosynthetic bacteria could have been utilized in 
converting Fe 2+ to Fe 3+ . This conversion caused the precipitation of huge 
amounts of ferric oxides, and the extensive banded iron formations in sedimen- 
tary rocks beginning about 2.7 x 10 9 years ago help to date the spread of the 
cyanobacteria. By about 2 x 10 9 years ago, the supply of ferrous iron was 
exhausted, and the deposition of further iron precipitates ceased. Geological 
evidence suggests that 2 levels in the atmosphere then began to rise steeply, 
reaching current levels between 0.5 and 1.5 x 10 9 years ago (Figure 14-71). 

The availability of 2 made possible the development of bacteria that relied 
on aerobic metabolism to make their ATP. As explained previously, these organ- 
isms could harness the large amount of energy released by breaking down car- 
bohydrates and other reduced organic molecules all the way to C0 2 and H 2 0. 
Components of preexisting electron-transport complexes were modified to pro- 
duce a cytochrome oxidase, so that the electrons obtained from organic or inor- 
ganic substrates could be transported to 2 as the terminal electron acceptor. 
Depending on the availability of light and 2 , many present-day purple photo- 
synthetic bacteria can switch between photosynthesis and respiration, requiring 
only relatively minor reorganizations of their electron-transport chains. 

As organic materials accumulated on Earth as a result of photosynthesis, 
some photosynthetic bacteria (including the precursors of E. coli) lost their abil- 
ity to survive on light energy alone and came to rely entirely on respiration. As 
described previously (see Figure 14-58), it has been suggested that mitochon- 
dria first arose more than 1.5 x 10 9 years ago, perhaps when a primitive eucary- 
otic cell endocytosed such a respiration-dependent bacterium. Plants are 
believed to have evolved later, when a descendant of this early aerobic eucary- 
otic cell endocytosed a photosynthetic bacterium that became the precursor of 
chloroplasts. 

In Figure 14-72, we relate these postulated pathways to the various types of 
bacteria discussed in this chapter. Evolution is always conservative, taking 
parts of the old and building on them to create something new. Thus, parts of 
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Figure 14-72 A phylogenetic tree of the proposed evolution of mitochondria and chloroplasts and their bacterial 
ancestors. Oxygen respiration is thought to have begun developing about 2 x 1 9 years ago. As indicated, it seems to 
have evolved independently in the green, purple, and blue-green (cyanobacterial) lines of photosynthetic bacteria. It is 
thought that an aerobic purple bacterium that had lost its ability to photosynthesize gave rise to the mitochondrion, 
while a cyanobacterium gave rise to chloroplasts. Nucleotide sequence analyses suggest that mitochondria arose from 
purple bacteria that resembled the rhizobacteria, agrobacteria, and rickettsias— three closely related species known to 
form intimate associations with present-day eucaryotic cells. Archaea are not known to contain the types of 
photosystems described in this chapter, and they are not included here. 

the electron-transport chains that were derived to service anaerobic bacteria 
3-4 x 10 9 years ago probably survive, in altered form, in the mitochondria and 
chloroplasts of today's higher eucaryotes. Consider, for example, the homology 
in structure and function between the enzyme complex that pumps H + in the 
central segment of the mitochondrial respiratory chain (the cytochrome b-c\ 
complex) and its analogs in the electron-transport chains of both bacteria and 
chloroplasts (Figure 14-73). 



Summary 

Early cells are believed to have been bacteriumlike organisms living in an environ- 
ment rich in highly reduced organic molecules that had been formed by geochemical 
processes over the course of hundreds of millions of years. They may have derived 
most of their ATP by converting these reduced organic molecules to a variety of 
organic acids, which were then released as waste products. By acidifying the environ- 
ment, these fermentations may have led to the evolution of the first membrane-bound 
H + pumps, which could maintain a neutral pH in the cell interior by pumping out H + . 
The properties of present-day bacteria suggest that an electron-transport-driven H + 
pump and an ATP-driven H + pump first arose in this anaerobic environment. Rever- 
sal of the ATP-driven pump would have allowed it to function as an ATP synthase. As 
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Figure 14-73 A comparison of three 
electron-transport chains discussed in 
this chapter. Bacteria, chloroplasts, and 
mitochondria all contain a membrane- 
bound enzyme complex that resembles 
the cytochrome b<\ complex of 
mitochondria. These complexes all accept 
electrons from a quinone carrier (Q) and 
pump H + across their respective 
membranes. Moreover, in reconstituted in 
vitro systems, the different complexes can 
substitute for one another, and the 
structures of their protein components 
reveal that they are evolutionary related. 
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more effective electron-transport chains developed, the energy released by redox reac- 
tions between inorganic molecules and/or accumulated nonfermentable compounds 
produced a large electrochemical proton gradient, which could be harnessed by the 
ATP-driven pump for ATP production. 

Because preformed organic molecules were replenished only very slowly by geo- 
chemical processes, the proliferation of bacteria that used them as the source of both 
carbon and reducing power could not go on forever. The depletion of fermentable 
organic nutrients presumably led to the evolution of bacteria that could use C0 2 to 
make carbohydrates. By combining parts of the electron- transport chains that had 
developed earlier, light energy was harvested by a single photosystem in photosynthetic 
bacteria to generate the NADPH required for carbon fixation. The subsequent appear- 
ance of the more complex photosynthetic electron- transport chains of the cyanobacte- 
ria allowed H 2 to be used as the electron donor for NADPH formation, rather than the 
much less abundant electron donors required by other photosynthetic bacteria. Life 
could then proliferate over large areas of the Earth, so that reduced organic molecules 
accumulated again. 

About 2 x 10 9 years ago, the 2 released by photosynthesis in cyanobacteria began 
to accumulate in the atmosphere. Once both organic molecules and 2 had become 
abundant, electron-transport chains became adapted for the transport of electrons 
from NADH to 2 , and efficient aerobic metabolism developed in many bacteria. 
Exactly the same aerobic mechanisms operate today in the mitochondria of eucary- 
otes, and there is increasing evidence that both mitochondria and chloroplasts evolved 
from aerobic bacteria. 
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PROBLEMS 

Which statements are true? Explain why or why not. 

1 4-1 The three respiratory enzyme complexes in the mito- 
chondrial inner membrane exist in structurally ordered 
arrays that facilitate the correct transfer of electrons 
between appropriate complexes. 

14-2 Lipophilic weak acids short-circuit the normal flow 
of protons across the inner membrane, thereby eliminating 
the proton-motive force, stopping ATP synthesis, and block- 
ing the flow of electrons. 

1 4-3 Mutations that are inherited according to Mendelian 
rules affect nuclear genes; mutations whose inheritance vio- 
lates Mendelian rules are likely to affect organelle genes. 

Discuss the following problems, 

14-4 In the 1860s Louis Pasteur noticed that when he 
added O2 to a culture of yeast growing anaerobically on glu- 
cose, the rate of glucose consumption declined dramati- 
cally. Explain the basis for this result, which is known as the 
Pasteur effect. 

14-5 In actively respiring liver mitochondria, the pH 
inside the matrix is about half a pH unit higher than that in 
the cytosol. Assuming that the cytosol is at pH 7 and the 
matrix is a sphere with a diameter of 1 urn [V= (4/3)7ir 3 ], cal- 
culate the total number of protons in the matrix of a respir- 
ing liver mitochondrion. If the matrix began at pH 7 (equal 
to that in the cytosol), how many protons would have to be 
pumped out to establish a matrix pH of 7.5 (a difference of 
0.5 pH units)? 

1 4-6 Heart muscle gets most of the ATP needed to power its 
continual contractions through oxidative phosphorylation. 
When oxidizing glucose to C0 2 , heart muscle consumes 2 at 
a rate of 10 umol/min per g of tissue, in order to replace the 
ATP used in contraction and give a steady-state ATP concen- 
tration of 5 umol/g of tissue. At this rate, how many seconds 
would it take the heart to consume an amount of ATP equal to 
its steady-state levels? (Complete oxidation of one molecule 
of glucose to C0 2 yields 30 ATR 26 of which are derived by 
oxidative phosphorylation using the 12 pairs of electrons cap- 
tured in the electron carriers NADH and FADH 2 .) 

1 4-7 If isolated mitochondria are incubated with a source 
of electrons such as succinate, but without oxygen, electrons 
enter the respiratory chain, reducing each of the electron 
carriers almost completely. When oxygen is then intro- 
duced, the carriers become oxidized at different rates (Fig- 
ure Q14-1). How does this result allow you to order the elec- 
tron carriers in the respiratory chain? What is their order? 

1 4-8 The uncoupler dinitrophenol was once prescribed as 
a diet drug to aid in weight loss. How would an uncoupler of 
oxidative phosphorylation promote weight loss? Why do you 
suppose that it is no longer prescribed? 

1 4-9 How much energy is available in visible light? How 
much energy does sunlight deliver to the earth? How effi- 
cient are plants at converting light energy into chemical 
energy? The answers to these questions provide an impor- 
tant backdrop to the subject of photosynthesis. 
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Figure Q14-1 Rapid 
spectrophotometry 
analysis of the rates of 
oxidation of electron 
carriers in the respiratory 
chain (Problem 14 -7). 
Cytochromes a and 03 
cannot be distinguished 
and thus are listed as 
cytochrome {a + 03). 
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Each quantum or photon of light has energy hv, where h is 
Planck's constant (1.58 x 10~ 37 kcal sec/photon) and 1/ is the 
frequency in sec -1 . The frequency of light is equal to c/X, 
where c is the speed of light (3.0 x 10 17 nm/sec) and X is the 
wavelength in nm. Thus, the energy (£) of a photon is 

E-hv- hclX 

A. Calculate the energy of a mole of photons (6 x 10 23 pho- 
tons/mole) at 400 nm (violet light), at 680 nm (red light), and 
at 800 nm (near infrared light). 

B. Bright sunlight strikes Earth at the rate of about 0.3 kcal/ 
sec per square meter. Assuming for the sake of calculation 
that sunlight consists of monochromatic light of wavelength 
680 nm, how many seconds would it take for a mole of pho- 
tons to strike a square meter? 

C. Assuming that it takes eight photons to fix one molecule 
of CO2 as carbohydrate under optimal conditions (8-10 pho- 
tons is the currently accepted value), calculate how long it 
would take a tomato plant with a leaf area of 1 square meter 
to make a mole of glucose from CO2. Assume that photons 
strike the leaf at the rate calculated above and, furthermore, 
that all the photons are absorbed and used to fix C0 2 . 

D. If it takes 1 12 kcal/mole to fix a mole of C0 2 into carbo- 
hydrate, what is the efficiency of conversion of light energy 
into chemical energy after photon capture? Assume again 
that eight photons of red light (680 nm) are required to fix 
one molecule of C0 2 . 

14-10 Recalling Joseph Priestley's famous experiment in 
which a sprig of mint saved the life of a mouse in a sealed 
chamber, you decide to do an analogous experiment to see 
how a C3 and a C4 plant do when confined together in a 
sealed environment. You place a corn plant (C 4 ) and a gera- 
nium (C 3 ) in a sealed plastic chamber with normal air (300 
parts per million C0 2 ) on a windowsill in your laboratory. 
What will happen to the two plants? Will they compete or 
collaborate? If they compete, which one will win and why? 

1 4-1 1 Examine the variegated leaf shown in Figure Q14-2. 
Yellow patches surrounded by green are common, but there 
are no green patches surrounded by yellow. Propose an 
explanation for this phenomenon. 




Figure Q14-2 A variegated leaf of Aucuba japonico with green and 
yellow patches (Problem 14-1 1). 
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Mechanisms of Cell 
Communication 




To make a multicellular organism, cells must communicate, just as humans 
must communicate if they are to organize themselves into a complex society. 
And just as human communication involves more than the passage of noises 
from mouth to ear, so cell-cell communication involves more than the trans- 
mission of chemical signals across the space between one cell and another. 
Complex intracellular mechanisms are needed to control which signals are 
emitted at what time and to enable the signal-receiving cell to interpret those 
signals and use them to guide its behavior. According to the fossil record, sophis- 
ticated multicellular organisms did not appear on Earth until unicellular organ- 
isms resembling present-day procaryotes had already been in existence for 
about 2.5 billion years. The long delay may reflect the difficulty of evolving the 
language systems of animal, plant, and fungal cells — the machinery that would 
enable cells sharing the same genome to collaborate and coordinate their 
behavior, specializing in different ways and subordinating their individual 
chances of survival to the interests of the multicellular organism as a whole. 
These highly evolved mechanisms of cell-cell communication are the topic of 
this chapter. 

Communication between cells is mediated mainly by extracellular signal 
molecules. Some of these operate over long distances, signaling to cells far 
away; others signal only to immediate neighbors. Most cells in multicellular 
organisms both emit and receive signals. Reception of the signals depends on 
receptor proteins, usually (but not always) at the cell surface, which bind the sig- 
nal molecule. The binding activates the receptor, which in turn activates one or 
more intracellular signaling pathways. These relay chains of molecules — mainly 
intracellular signaling proteins — process the signal inside the receiving cell and 
distribute it to the appropriate intracellular targets. These targets are generally 
effector proteins, which are altered when the signaling pathway is activated and 
implement the appropriate change of cell behavior. Depending on the signal 
and the nature and state of the receiving cell, these effectors can be gene regu- 
latory proteins, ion channels, components of a metabolic pathway, or parts of 
the cytoskeleton — among other things (Figure 15-1). 

We begin this chapter by discussing the general principles of cell communi- 
cation. We then consider, in turn, the main families of cell- surface receptor pro- 
teins and the principal intracellular signaling pathways they activate. The main 
focus of the chapter is on animal cells, but we end by considering the special fea- 
tures of cell communication in plants. 
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GENERAL PRINCIPLES OF CELL COMMUNICATION 

Long before multicellular organisms appeared on Earth, unicellular organisms 
had developed mechanisms for responding to physical and chemical changes in 
their environment. These almost certainly included mechanisms for response to 
the presence of other cells. Evidence comes from studies of present-day unicel- 
lular organisms such as bacteria and yeasts. Although these cells largely lead 
independent lives, they can communicate and influence one another's behavior. 
Many bacteria, for example, respond to chemical signals that are secreted by their 
neighbors and increase in concentration with increasing population density. This 
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Figure 15-1 A simple intracellular 
signaling pathway activated by an 
extracellular signal molecule. The signal 
molecule usually binds to a receptor 
protein that is embedded in the plasma 
membrane of the target cell and activates 
one or more intracellular signaling 
pathways mediated by a series of 
signaling proteins. Finally, one or more of 
the intracellular signaling proteins alters 
the activity of effector proteins and 
thereby the behavior of the cell. 
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process, called quorum sensing, allows bacteria to coordinate their behavior, 
including their motility, antibiotic production, spore formation, and sexual con- 
jugation. 

Similarly, yeast cells communicate with one another in preparation for mat- 
ing. The budding yeast Saccharomyces cerevisiae provides a well- studied exam- 
ple: when a haploid individual is ready to mate, it secretes a peptide mating fac- 
tor that signals cells of the opposite mating type to stop proliferating and pre- 
pare to mate (Figure 15-2). The subsequent fusion of two haploid cells of oppo- 
site mating type produces a diploid cell, which can then undergo meiosis and 
sporulate, generating haploid cells with new assortments of genes (see Figure 
21-3B). The reshuffling of genes through sexual reproduction helps a species 
survive in an unpredictably variable environment (as discussed in Chapter 21). 

Studies of yeast mutants that are unable to mate have identified many pro- 
teins that are required in the signaling process. These proteins form a signaling 
network that includes cell-surface receptor proteins, GTP-binding proteins, and 
protein kinases, and each of these categories has close relatives among the recep- 
tors and intracellular signaling proteins in animal cells. Through gene duplication 
and divergence, however, the signaling systems in animals have become much 
more elaborate than those in yeasts; the human genome, for example, contains 
more than 1500 genes that encode receptor proteins, and the number of different 
receptor proteins is further increased by alternative RNA splicing and post-trans- 
lational modifications. 

The large numbers of signal proteins, receptors, and intracellular signaling 
proteins used by animals can be grouped into a much smaller number of protein 
families, most of which have been highly conserved in evolution. Flies, worms, 
and mammals all use essentially similar machinery for cell communication, and 
many of the key components and signaling pathways were first discovered 
through analysis of mutations in Drosophila and C. elegans. 



Extracellular Signal Molecules Bind to Specific Receptors 

Cells in multicellular animals communicate by means of hundreds of kinds of 
signal molecules. These include proteins, small peptides, amino acids, 
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Figure 15-2 Budding yeast cells 
responding to a mating factor. (A) The 

cells are normally spherical. (B) In 
response to the mating factor secreted 
by neighboring yeast cells, they put out a 
protrusion toward the source of the 
factor in preparation for mating. 
(Courtesy of Michael Snyder.) 
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nucleotides, steroids, retinoids, fatty acid derivatives, and even dissolved gases 
such as nitric oxide and carbon monoxide. Most of these signal molecules are 
released into the extracellular space by exocytosis from the signaling cell, as 
discussed in Chapter 13. Some, however, are emitted by diffusion through the 
signaling cell's plasma membrane, whereas others are displayed on the external 
surface of the cell and remain attached to it, providing a signal to other cells 
only when they make contact. Transmembrane proteins may be used for sig- 
naling in this way; or their extracellular domains may be released from the sig- 
naling cell's surface by proteolytic cleavage and then act at a distance. 

Regardless of the nature of the signal, the target cell responds by means of a 
receptor, which specifically binds the signal molecule and then initiates a 
response in the target cell. The extracellular signal molecules often act at very 
low concentrations (typically < 10~ 8 M), and the receptors that recognize them 
usually bind them with high affinity (affinity constant K& > 10 8 liters/mole; see 
Figure 3-43). 

In most cases, the receptors are transmembrane proteins on the target cell 
surface. When these proteins bind an extracellular signal molecule (a ligand), 
they become activated and generate various intracellular signals that alter the 
behavior of the cell. In other cases, the receptor proteins are inside the target 
cell, and the signal molecule has to enter the cell to bind to them: this requires 
that the signal molecule be sufficiently small and hydrophobic to diffuse across 
the target cell's plasma membrane (Figure 15-3). 

Extracellular Signal Molecules Can Act Over Either Short or Long 
Distances 

Many signal molecules remain bound to the surface of the signaling cell and 
influence only cells that contact it (Figure 15-4A). Such contact- dependent sig- 
naling is especially important during development and in immune responses. 
Contact-dependent signaling during development can sometimes operate over 
relatively large distances, where the communicating cells extend long processes 
to make contact with one another. 

In most cases, however, signaling cells secrete signal molecules into the 
extracellular fluid. The secreted molecules may be carried far afield to act on dis- 
tant target cells, or they may act as local mediators, affecting only cells in the 
local environment of the signaling cell. The latter process is called paracrine sig- 
naling (Figure 15-4B). Usually, the signaling and target cells in paracrine signal- 
ing are of different cell types, but cells may also produce signals that they them- 
selves respond to: this is referred to as autocrine signaling. Cancer cells, for 
example, often use this strategy to stimulate their own survival and proliferation. 

For paracrine signals to act only locally, the secreted molecules must not be 
allowed to diffuse too far; for this reason they are often rapidly taken up by 
neighboring target cells, destroyed by extracellular enzymes, or immobilized by 
the extracellular matrix. Heparan sulfate proteoglycans (discussed in Chapter 
19), either in the extracellular matrix or attached to cell surfaces, often play a 
part in localizing the action of secreted signal proteins. They contain long 
polysaccharide side chains that bind the signal proteins and immobilize them. 
They may also control the stability of these proteins, their transport through the 
extracellular space, or their interaction with cell-surface receptors. Secreted pro- 
tein antagonists also affect the distance over which a paracrine signal protein 
acts. These antagonists bind to either the signal molecule itself or its cell-surface 
receptor and block its activity, and they play an important part in restricting the 
effective range of secreted signal proteins that influence the developmental 
decisions that embryonic cells make (discussed in Chapter 22). 

Large, complex, multicellular organisms need long-range signaling mecha- 
nisms to coordinate the behavior of cells in remote parts of the body. Thus, they 
have evolved cell types specialized for intercellular communication over large 
distances. The most sophisticated of these are nerve cells, or neurons, which 
typically extend long branching processes (axons) that enable them to contact 
target cells far away, where the processes terminate at the specialized sites of 
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Figure 1 5-3 The binding of extracellular 
signal molecules to either cell-surface or 
intracellular receptors. (A) Most signal 
molecules are hydrophilic and are 
therefore unable to cross the target cell's 
plasma membrane directly; instead, they 
bind to cell-surface receptors, which in 
turn generate signals inside the target 
cell (see Figure 15-1). (B) Some small 
signal molecules, by contrast, diffuse 
across the plasma membrane and bind to 
receptor proteins inside the target cell — 
either in the cytosol or in the nucleus (as 
shown here). Many of these small signal 
molecules are hydrophobic and nearly 
insoluble in aqueous solutions; they are 
therefore transported in the bloodstream 
and other extracellular fluids bound to 
carrier proteins, from which they 
dissociate before entering the target cell. 
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Figure 1 5-4 Four forms of intercellular 
signaling. (A) Contact-dependent 
signaling requires cells to be in direct 
membrane-membrane contact. 
(B) Paracrine signaling depends on 
signals that are released into the 
extracellular space and act locally on 
neighboring cells. (C) Synaptic signaling 
is performed by neurons that transmit 
signals electrically along their axons and 
release neurotransmitters at synapses, 
which are often located far away from the 
neuronal cell body. (D) Endocrine 
signaling depends on endocrine cells, 
which secrete hormones into the 
bloodstream for distribution throughout 
the body. Many of the same types of 
signaling molecules are used in paracrine, 
synaptic, and endocrine signaling; the 
crucial differences lie in the speed and 
selectivity with which the signals are 
delivered to their targets. 



signal transmission known as chemical synapses. When activated by stimuli from 
the environment or from other nerve cells, the neuron sends electrical impulses 
(action potentials) rapidly along its axon; when such an impulse reaches the 
synapse at the end of the axon, it triggers secretion of a chemical signal that acts 
as a neurotransmitter. The tightly organized structure of the synapse ensures 
that the neurotransmitter is delivered specifically to receptors on the postsy- 
naptic target cell (Figure 15-4C). The details of this synaptic signaling process 
are discussed in Chapter 11. 

A quite different strategy for signaling over long distances makes use of 
endocrine cells. These secrete their signal molecules, called hormones, into the 
bloodstream, which carries the molecules far and wide, allowing them to act on 
target cells that may lie anywhere in the body (Figure 15-4D). 

Figure 15-5 compares the mechanisms that allow endocrine cells and nerve 
cells to coordinate cell behavior over long distances in animals. Because 
endocrine signaling relies on diffusion and blood flow, it is relatively slow. Synap- 
tic signaling, by contrast, is much faster, as well as more precise. Nerve cells can 
transmit information over long distances by electrical impulses that travel at 
speeds of up to 100 meters per second; once released from a nerve terminal, a 
neurotransmitter has to diffuse less than 100 nm to the target cell, a process that 
takes less than a millisecond. Another difference between endocrine and synap- 
tic signaling is that, whereas hormones are greatly diluted in the bloodstream and 
interstitial fluid and therefore must be able to act at very low concentrations (typ- 
ically < 10~ 8 M), neurotransmitters are diluted much less and can achieve high 
local concentrations. The concentration of acetylcholine in the synaptic cleft of 
an active neuromuscular junction, for example, is about 5 x 10" 4 M. Correspond- 
ingly, neurotransmitter receptors have a relatively low affinity for their ligand, 
which means that the neurotransmitter can dissociate rapidly from the receptor 
to help terminate a response. Moreover, after its release from a nerve terminal, a 
neurotransmitter is quickly removed from the synaptic cleft, either by specific 
hydrolytic enzymes that destroy it or by specific membrane transport proteins 
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Figure 15-5 The contrast between endocrine and neuronal strategies for 
long-range signaling. In complex animals, endocrine cells and nerve cells 
work together to coordinate the activities of cells in widely separated parts 
of the body. Whereas different endocrine cells must use different 
hormones to communicate specifically with their target cells, different 
nerve cells can use the same neurotransmitter and still communicate in a 
highly specific manner. (A) Endocrine cells secrete hormones into the 
blood, and these act only on those target cells that carry the appropriate 
receptors: the receptors bind the specific hormone, which the target cells 
thereby "pull" from the extracellular fluid. (B) In synaptic signaling, by 
contrast, specificity arises from the synaptic contacts between a nerve cell 
and the specific target cells it signals. <CTGA> Usually, only a target cell 
that is in synaptic communication with a nerve cell is exposed to the 
neurotransmitter released from the nerve terminal (although some 
neurotransmitters act in a paracrine mode, serving as local mediators that 
influence multiple target cells in the area). 



that pump it back into either the nerve terminal or neighboring glial cells. Thus, 
synaptic signaling is much more precise than endocrine signaling, both in time 
and in space. 

The speed of a response to an extracellular signal depends not only on the 
mechanism of signal delivery but also on the nature of the target cell's response. 
When the response requires only changes in proteins already present in the cell, 
it can occur very rapidly: an allosteric change in a neurotransmitter- gated ion 
channel (discussed in Chapter 11), for example, can alter the plasma membrane 
electrical potential in milliseconds, and responses that depend solely on protein 
phosphorylation can occur within seconds. When the response involves 
changes in gene expression and the synthesis of new proteins, however, it usu- 
ally requires many minutes or hours, regardless of the mode of signal delivery 
(Figure 15-6). 
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Figure 1 5-6 Extracellular signals can act slowly or rapidly to change the behavior of a target 
cell. Certain types of signaled responses, such as increased cell growth and division, involve 
changes in gene expression and the synthesis of new proteins; they therefore occur slowly, 
often starting after an hour or more. Other responses — such as changes in cell movement, 
secretion, or metabolism — need not involve changes in gene transcription and therefore occur 
much more quickly, often starting in seconds or minutes; they may involve the rapid 
phosphorylation of effector proteins in the cytoplasm, for example. Synaptic responses 
mediated by changes in membrane potential can occur in milliseconds (not shown). 
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Gap Junctions Allow Neighboring Cells to Share Signaling 
Information 

Gap junctions are narrow water-filled channels that directly connect the cyto- 
plasms of adjacent epithelial cells, as well as of some other cell types (see Figure 
19-34). The channels allow the exchange of inorganic ions and other small water- 
soluble molecules, but not of macromolecules such as proteins or nucleic acids. 
Thus, cells connected by gap junctions can communicate with each other directly, 
without having to surmount the barrier presented by the intervening plasma 
membranes (Figure 15-7) . In this way, gap junctions provide for the most intimate 
of all forms of cell communication, short of cytoplasmic bridges (see Figure 21-31) 
or cell fusion. 

In contrast with other modes of cell signaling, gap junctions generally allow 
communications to pass in both directions symmetrically, and their typical 
effect is to homogenize conditions in the communicating cells. They can also be 
important in spreading the effect of extracellular signals that act through small 
intracellular mediators such as Ca 2+ and cyclic AMP (discussed later), which 
pass readily through the gap -junctional channels. In the liver, for example, a fall 
in blood glucose levels releases noradrenaline (norepinephrine) from sympa- 
thetic nerve endings. The noradrenaline stimulates hepatocytes in the liver to 
increase glycogen breakdown and to release glucose into the blood, a response 
that depends on an increase in intracellular cyclic AMP. Not all of the hepato- 
cytes are innervated by sympathetic nerves, however. By means of the gap junc- 
tions that connect hepatocytes, the innervated hepatocytes transmit the signal 
to the noninnervated ones, at least in part by the movement of cyclic AMP 
through gap junctions. As expected, mice with a mutation in the major gap junc- 
tion gene expressed in the liver fail to mobilize glucose normally when blood 
glucose levels fall. Gap junctions are discussed in detail in Chapter 19. 
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Figure 15-7 Signaling via gap junctions. 

Cells connected by gap junctions share 
small molecules, including small 
intracellular signaling molecules such as 
cyclic AMP and Ca 2+ , and can therefore 
respond to extracellular signals in a 
coordinated way. 



Each Cell Is Programmed to Respond to Specific Combinations of 
Extracellular Signal Molecules 

A typical cell in a multicellular organism may be exposed to hundreds of different 
signal molecules in its environment. The molecules can be soluble, bound to the 
extracellular matrix, or bound to the surface of a neighboring cell; they can be 
stimulatory or inhibitory; they can act in innumerable different combinations; 
and they can influence almost any aspect of cell behavior. The cell must respond 
to this babel of signals selectively, according to its own specific character. This 
character is acquired through progressive cell specialization in the course of 
development. A cell may respond to one combination of signals by differentiat- 
ing, to another combination by growing and dividing, and to yet another by per- 
forming some specialized function such as contraction or secretion. One of the 
great challenges in cell biology is to understand how a cell integrates all of this 
signaling information in order to make its crucial decisions — to divide, to move, 
to differentiate, and so on. For most of the cells in animal tissues, even the deci- 
sion to continue living depends on correct interpretation of a specific combina- 
tion of signals required for survival. When deprived of these signals (in a culture 
dish, for example), the cell activates a suicide program and kills itself — usually 
by apoptosis, a form of programmed cell death (Figure 15-8), as discussed in 
Chapter 18. Because different types of cells require different combinations of 
survival signals, each cell type is restricted to a specific set of environments in 
the body. Many epithelial cells, for example, require survival signals from the 
basal lamina on which they sit (discussed in Chapter 19); they die by apoptosis 
if they lose contact with this sheet of matrix. 

In principle, the hundreds of signal molecules that an animal makes can be 
used in an almost unlimited number of signaling combinations so as to control 
the diverse behaviors of its cells in highly specific ways. Relatively small numbers 
of types of signal molecules and receptors are sufficient. The complexity lies in the 
ways in which cells respond to the combinations of signals that they receive. 
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Figure 15-8 An animal cell's dependence 
on multiple extracellular signal 
molecules. Each cell type displays a set of 
receptors that enables it to respond to a 
corresponding set of signal molecules 
produced by other cells. These signal 
molecules work in combinations to 
regulate the behavior of the cell. As shown 
here, an individual cell often requires 
multiple signals to survive (blue arrows) 
and additional signals to grow and divide 
(red arrow) or differentiate (green arrows). If 
deprived of appropriate survival signals, a 
cell will undergo a form of cell suicide 
known as apoptosis.The actual situation is 
even more complex. Although not shown, 
some extracellular signal molecules act to 
inhibit these and other cell behaviors, or 
even to induce apoptosis. 
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Different Types of Cells Usually Respond Differently to the Same 
Extracellular Signal Molecule 

A cell's response to extracellular signals depends not only on the receptor pro- 
teins it possesses but also on the intracellular machinery by which it integrates 
and interprets the signals it receives. Thus, a single signal molecule usually has 
different effects on different types of target cells. The neurotransmitter acetyl- 
choline (Figure 15-9A), for example, decreases the rate and force of contrac- 
tion in heart muscle cells (Figure 15-9B), but it stimulates skeletal muscle cells 
to contract (see Figure 15-9C). In this case, the acetylcholine receptor proteins 
on skeletal muscle cells differ from those on heart muscle cells. But receptor 
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Figure 15-9 Various responses induced by the neurotransmitter 
acetylcholine. (A) The chemical structure of acetylcholine. (B-D) Different 
cell types are specialized to respond to acetylcholine in different ways. In 
some cases (B and C), the receptors for acetylcholine differ. In other cases 
(B and D), acetylcholine binds to similar receptor proteins, but the 
intracellular signals produced are interpreted differently in cells specialized 
for different functions. 
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differences are usually not the explanation for the different effects. The same sig- 
nal molecule binding to identical receptor proteins usually produces very differ- 
ent responses in different types of target cells, as in the case of acetylcholine 
binding to heart muscle and to salivary gland cells (compare Figures 15-9B and 
D). In some cases, this reflects differences in the intracellular signaling proteins 
activated, whereas in others it reflects differences in the effector proteins or 
genes activated. Thus, an extracellular signal itself has little information content; 
it simply induces the cell to respond according to its predetermined state, which 
depends on the cell's developmental history and the specific genes it expresses. 
The challenge of understanding how a cell integrates, processes, and reacts 
to the various inputs it receives is analogous in many ways to the challenge of 
understanding how the brain integrates and processes information to control 
behavior. In both cases, we need more than just a list of the components and con- 
nections in the system to understand how the process works. As a first step, we 
need to consider some basic principles concerning the way a cell responds to a 
simple signal of a given type. 



The Fate of Some Developing Cells Depends on Their Position in 
Morphogen Gradients 

The same signal acting on the same cell type can have qualitatively different 
effects depending on the signal's concentration. As we discuss in Chapter 22, 
such different responses of a cell to different concentrations of the same signal 
molecule are crucial in animal development, when cells are becoming different 
from one another. 

The extracellular signal molecule in these cases during development is 
called a morphogen, and, in the simplest cases, it diffuses out from a localized 
cellular source (a signaling center) , generating a signal concentration gradient. 
The responding cells adopt different cell fates in accordance with their position 
in the gradient: those cells closest to the signaling center that encounter the 
highest concentration of the morphogen have the highest number of receptors 
activated and follow one pathway of development, whereas those slightly fur- 
ther away follow another, and so on (Figure 15-10). As we discuss later (and in 
Chapter 22), the different levels of receptor activation lead to differences in the 
concentration or activity of one or more gene regulatory proteins in the nucleus 
of each cell, which in turn results in different patterns of gene expression. Fur- 
ther, more local signaling interactions between the cells in the gradient often 
help determine and stabilize the different fate choices. 



A Cell Can Alter the Concentration of an Intracellular Molecule 
Quickly Only If the Lifetime of the Molecule Is Short 

It is natural to think of signaling systems in terms of the changes produced when 
an extracellular signal is delivered. But it is just as important to consider what 
happens when the signal is withdrawn. During development, transient extracel- 
lular signals often produce lasting effects: they can trigger a change in the cell's 
development that persists indefinitely through cell memory mechanisms, as we 
discuss later (and in Chapters 7 and 22). In most cases in adult tissues, however, 
the response fades when a signal ceases. Often the effect is transient because the 
signal exerts its effects by altering the concentrations of intracellular molecules 
that are short-lived (unstable), undergoing continual turnover. Thus, once the 
extracellular signal is gone, the replacement of the old molecules by new ones 
wipes out all traces of the signal's action. It follows that the speed with which a 
cell responds to signal removal depends on the rate of destruction, or turnover, 
of the intracellular molecules that the signal affects. 

It is also true, although much less obvious, that this turnover rate can deter- 
mine the promptness of the response when an extracellular signal arrives. Con- 
sider, for example, two intracellular signaling molecules X and Y, both of which 
are normally maintained at a concentration of 1000 molecules per cell. The cell 
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Figure 1 5-1 Cells adopt different fates 
depending on their position in a 
morphogen gradient. The different 
concentrations of morphogen induce the 
expression of different sets of genes, 
resulting in different cell fates (indicated 
by different letters and colors). 
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Figure 1 5-1 1 The importance of rapid 
turnover. The graphs show the predicted 
relative rates of change in the 
intracellular concentrations of molecules 
with differing turnover times when their 
synthesis rates are either (A) decreased or 
(B) increased suddenly by a factor of 10. 
In both cases, the concentrations of those 
molecules that are normally degraded 
rapidly in the cell (red lines) change 
quickly, whereas the concentrations of 
those that are normally degraded slowly 
(green lines) change proportionally more 
slowly. The numbers (in blue) on the right 
are the half-lives assumed for each of the 
different molecules. 



synthesizes and degrades molecule Y at a rate of 100 molecules per second, with 
each molecule having an average lifetime of 10 seconds. Molecule X has a 
turnover rate that is 10 times slower than that of Y: it is both synthesized and 
degraded at a rate of 10 molecules per second, so that each molecule has an aver- 
age lifetime in the cell of 100 seconds. If a signal acting on the cell causes a ten- 
fold increase in the synthesis rates of both X and Y with no change in the molec- 
ular lifetimes, at the end of 1 second the concentration of Y will have increased by 
nearly 900 molecules per cell (10 x 100 - 100), while the concentration of X will 
have increased by only 90 molecules per cell. In fact, after a molecule's synthesis 
rate has been either increased or decreased abruptly, the time required for the 
molecule to shift halfway from its old to its new equilibrium concentration is 
equal to its half-life — that is, equal to the time that would be required for its con- 
centration to fall by half if all synthesis were stopped (Figure 15-1 1) . 

The same principles apply to proteins and small molecules, whether the 
molecules are in the extracellular space or inside cells. Many intracellular pro- 
teins have short half-lives, some surviving for less than 10 minutes. In most 
cases, these are key regulatory proteins whose concentrations are rapidly con- 
trolled in the cell by changes in their rates of synthesis. 

We see later that many cell responses to extracellular signals depend on the 
conversion of intracellular signaling proteins from an inactive to an active form, 
rather than on their synthesis or degradation. Phosphorylation or the binding of 
GTP, for example, commonly activates signaling proteins. Even in these cases, 
however, the activation must be rapidly and continuously reversed (by dephos- 
phorylation or GTP hydrolysis to GDP, respectively, in these examples) to make 
rapid signaling possible. These inactivation processes play a crucial part in 
determining the magnitude, rapidity, and duration of the response. 



Nitric Oxide Gas Signals by Directly Regulating the Activity of 
Specific Proteins Inside the Target Cell 

Most of this chapter is concerned with signaling pathways activated by cell-sur- 
face receptors. Before discussing these receptors and pathways, however, we 
briefly consider some important signal molecules that activate intracellular 
receptors. These molecules include nitric oxide and steroid hormones, which we 
discuss in turn. Although most extracellular signal molecules are hydrophilic 
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Figure 1 5-1 2 The role of nitric oxide 
(NO) in smooth muscle relaxation in a 
blood vessel wall. (A) Simplified drawing 
of an autonomic nerve contacting a 
blood vessel. (B) Acetylcholine released 
by nerve terminals in the blood vessel 
wall activates NO synthase in endothelial 
cells lining the blood vessel, causing the 
endothelial cells to produce NO from 
arginine.The NO diffuses out of the 
endothelial cells and into the 
neighboring smooth muscle cells, where 
it binds to and activates guanylyl cyclase 
to produce cyclic GMP.The cyclic GMP 
triggers a response that causes the 
smooth muscle cells to relax, enhancing 
blood flow through the blood vessel. 
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and bind to receptors on the surface of the target cell, some are hydrophobic 
enough, small enough, or both, to pass readily across the target cell's plasma 
membrane. Once inside, they directly regulate the activity of specific intracellu- 
lar proteins. An important and remarkable example is the gas nitric oxide (NO), 
which acts as a signal molecule in both animals and plants. Even some bacteria 
can detect a very low concentration of NO and move away from it. 

In mammals, one of NO's many functions is to relax smooth muscle. It has 
this role in the walls of blood vessels, for example (Figure 15-12A). Autonomic 
nerves in the vessel wall release acetylcholine; the acetylcholine acts on the 
nearby endothelial cells that line the interior of the vessel; and the endothelial 
cells respond by releasing NO, which relaxes the smooth muscle cells in the wall, 
allowing the vessel to dilate. This effect of NO on blood vessels provides an 
explanation for the mechanism of action of nitroglycerine, which has been used 
for about 100 years to treat patients with angina (pain resulting from inadequate 
blood flow to the heart muscle). The nitroglycerine is converted to NO, which 
relaxes blood vessels. This reduces the workload on the heart and, as a conse- 
quence, reduces the oxygen requirement of the heart muscle. 

Many types of nerve cells use NO more directly to signal to their neighbors. 
NO released by autonomic nerves in the penis, for example, causes the local 
blood vessel dilation that is responsible for penile erection. NO is also produced 
by activated macrophages and neutrophils to help them to kill invading 
microorganisms. In plants, NO is involved in the defensive responses to injury or 
infection. 

NO is made by the deamination of the amino acid arginine, catalyzed by 
enzymes called NO synthases (NOS) (Figure 15-12B). The NOS in endothelial 
cells is called eNOS, while that in nerve and muscle cells is called nNOS. Nerve 
and muscle cells constitutively make nNOS, which is activated to produce NO by 
an influx of Ca 2+ when the cells are stimulated. Macrophages, by contrast, make 
yet another NOS, called inducible NOS (iNOS) because they make it only when 
they are activated, usually in response to an infection. 
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Because dissolved NO passes readily across membranes, it rapidly diffuses 
out of the cell where it is produced and into neighboring cells (see Figure 
15-12B). It acts only locally because it has a short half-life — about 5-10 sec- 
onds — in the extracellular space before oxygen and water convert it to nitrates 
and nitrites. 

In some target cells, including smooth muscle cells, NO reversibly binds to 
iron in the active site of the enzyme guanylyl cyclase, stimulating this enzyme to 
produce the small intracellular signaling molecule cyclic GMP, which we discuss 
later. Thus, guanylyl cyclase acts both as an intracellular receptor for NO and as 
an intracellular signaling protein (see Figure 15-12B). NO can increase cyclic 
GMP in the cytosol within seconds, because the normal rate of turnover of cyclic 
GMP is high: a rapid degradation to GMP by a phosphodiesterase constantly bal- 
ances the production of cyclic GMP from GTP by the guanylyl cyclase. The drug 
Viagra and its newer relatives inhibit the cyclic GMP phosphodiesterase in the 
penis, thereby increasing the amount of time that cyclic GMP levels remain ele- 
vated in the smooth muscle cells of penile blood vessels after NO production is 
induced by local nerve terminals. The cyclic GMP, in turn, keeps the blood ves- 
sels relaxed and thereby the penis erect. 

NO can also signal cells independently of cyclic GMP. It can, for example, 
alter the activity of an intracellular protein by covalently nitrosylating thiol (-SH) 
groups on specific cysteines in the protein. 

Carbon monoxide (CO) is another gas that is used as an extracellular signal 
molecule and like NO can act by stimulating guanylyl cyclase. But, these gases 
are not the only signal molecules that can pass directly across the target-cell 
plasma membrane, as we discuss next. 



Nuclear Receptors Are Ligand-Modulated Gene Regulatory 
Proteins 



Various small hydrophobic signal molecules diffuse directly across the plasma 
membrane of target cells and bind to intracellular receptors that are gene regu- 
latory proteins. These signal molecules include steroid hormones, thyroid hor- 
mones, retinoids, and vitamin D. Although they differ greatly from one another 
in both chemical structure (Figure 15-13) and function, they all act by a similar 
mechanism. They bind to their respective intracellular receptor proteins and 
alter the ability of these proteins to control the transcription of specific genes. 
Thus, these proteins serve both as intracellular receptors and as intracellular 
effectors for the signal. 

The receptors are all structurally related, being part of the very large nuclear 
receptor superfamily. Many family members have been identified by DNA 
sequencing only, and their ligand is not yet known; these proteins are therefore 
referred to as orphan nuclear receptors. Currently, more than half of the 48 
nuclear receptors encoded in the human genome are orphans, as are 17 out of 




Figure 15-13 Some nongaseous signal 
molecules that bind to intracellular 
receptors. Note that all of them are small 
and hydrophobic. The active, hydroxylated 
form of vitamin D 3 is shown. Estradiol and 
testosterone are steroid sex hormones. 
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18 in Drosophila and all 278 in the nematode C. elegans (see Figure 4-85). Some 
mammalian nuclear receptors are regulated by intracellular metabolites rather 
than by secreted signal molecules; the peroxisome proliferation-activated recep- 
tors (PPARs), for example, bind intracellular lipid metabolites and regulate the 
transcription of genes involved in lipid metabolism and fat cell differentiation 
(discussed in Chapter 23). It seems likely that the nuclear receptors for hor- 
mones evolved from such receptors for intracellular metabolites, which would 
help explain their intracellular location. 

Steroid hormones — which include Cortisol, the steroid sex hormones, vita- 
min D (in vertebrates), and the molting hormone ecdysone (in insects) — are all 
made from cholesterol. Cortisol is produced in the cortex of the adrenal glands 
and influences the metabolism of many types of cells. The steroid sex hormones 
are made in the testes and ovaries, and they are responsible for the secondary 
sex characteristics that distinguish males from females. Vitamin D is synthe- 
sized in the skin in response to sunlight; after it has been converted to its active 
form in the liver or kidneys, it regulates Ca 2+ metabolism, promoting Ca 2+ 
uptake in the gut and reducing its excretion in the kidneys. The thyroid hor- 
mones, which are made from the amino acid tyrosine, act to increase the 
metabolic rate of many cell types, while the retinoids, such as retinoic acid, are 
made from vitamin A and have important roles as local mediators in vertebrate 
development. Although all of these signal molecules are relatively insoluble in 
water, they are made soluble for transport in the bloodstream and other extra- 
cellular fluids by binding to specific carrier proteins, from which they dissociate 
before entering a target cell (see Figure 15-3B). 

The nuclear receptors bind to specific DNA sequences adjacent to the genes 
that the ligand regulates. Some of the receptors, such as those for Cortisol, are 
located primarily in the cytosol and enter the nucleus only after ligand binding; 
others, such as the thyroid and retinoid receptors, are bound to DNA in the 
nucleus even in the absence of ligand. In either case, the inactive receptors are 
usually bound to inhibitory protein complexes. Ligand binding alters the con- 
formation of the receptor protein, causing the inhibitory complex to dissociate, 
while also causing the receptor to bind coactivator proteins that stimulate gene 
transcription (Figure 15-14). In other cases, however, ligand binding to a 
nuclear receptor inhibits transcription: some thyroid hormone receptors, for 
example, act as transcriptional activators in the absence of their hormone and 
become transcriptional repressors when hormone binds. 

The transcriptional response usually takes place in multiple steps. In the 
cases in which ligand binding activates transcription, for example, the direct 
stimulation of a small number of specific genes occurs within about 30 minutes 
and constitutes the primary response; the protein products of these genes in turn 
activate other genes to produce a delayed, secondary response; and so on. In 
addition, some of the proteins produced in the primary response may act back 
to inhibit the transcription of primary response genes, thereby limiting the 
response — an example of negative feedback, which we discuss later. In this way, 
a simple hormonal trigger can cause a very complex change in the pattern of 
gene expression (Figure 15-15). 

The responses to steroid and thyroid hormones, vitamin D, and retinoids are 
determined as much by the nature of the target cell as by the nature of the sig- 
nal molecule. Many types of cells have the identical intracellular receptor, but 
the set of genes that the receptor regulates differs in each cell type. This is 
because more than one type of gene regulatory protein generally must bind to a 
eucaryotic gene to regulate its transcription. An intracellular receptor can there- 
fore regulate a gene only if there is the right combination of other gene regula- 
tory proteins, and many of these are cell-type specific. 

In summary, each of these hydrophobic signal molecules induces a charac- 
teristic set of responses in an animal for two reasons. First, only certain types of 
cells have receptors for it. Second, each of these cell types contains a different 
combination of other cell-type-specific gene regulatory proteins that collabo- 
rate with the activated receptor to influence the transcription of specific sets of 
genes. This principle applies to all signaled responses that depend on gene reg- 
ulatory proteins, including the many other examples we discuss in this chapter. 
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Figure 15-1 4 The nuclear receptor superfamily. All nuclear receptors bind to 
DNA as either homodimers or heterodimers, but for simplicity we show them as 
monomers here. (A) The receptors all have a related structure. Here, the short 
DNA-binding domain in each receptor is indicated in light green. (B) An inactive 
receptor protein is bound to inhibitory proteins. Domain-swap experiments 
suggest that many of the ligand-binding, transcription-activating, and DNA- 
binding domains in these receptors can function as interchangeable modules. 

(C) Receptor activation. Typically, the binding of ligand to the receptor causes 
the ligand-binding domain of the receptor to clamp shut around the ligand, the 
inhibitory proteins to dissociate, and coactivator proteins to bind to the 
receptor's transcription-activating domain, thereby increasing gene 
transcription. In other cases, ligand binding has the opposite effect, causing 
co-repressor proteins to bind to the receptor, thereby decreasing transcription 
(not shown). Though not shown here, activity can also be controlled through a 
change in the localization of the receptor: in its inactive form, it can be retained 
in the cytoplasm; ligand binding can then lead to the uncovering of nuclear 
localization signals that cause it to be imported into the nucleus to act on DNA. 

(D) The three-dimensional structure of a ligand-binding domain with (right) and 
without (left) ligand bound. Note that the blue a helix acts as a lid that snaps 
shut when the ligand (shown in red) binds, trapping the ligand in place. 
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The molecular details of how nuclear receptors and other gene regulatory pro- 
teins control specific gene transcription are discussed in Chapter 7. 

Nuclear receptor proteins are sometimes also present on the cell surface, 
where they function by mechanisms entirely different from that just described. 
In the remainder of the chapter, we consider various ways in which cell- surface 
receptors convert extracellular signals into intracellular ones, a process called 
signal transduction. 



The Three Largest Classes of Cell-Surface Receptor Proteins Are 
lon-Channel-Coupled, G-Protein-Coupled, and Enzyme-Coupled 
Receptors 

In contrast to the small hydrophobic signal molecules just discussed that bind 
to intracellular receptors, most extracellular signal molecules bind to specific 
receptor proteins on the surface of the target cells they influence and do not 
enter the cytosol or nucleus. These cell-surface receptors act as signal transduc- 
ers by converting an extracellular ligand-binding event into intracellular signals 
that alter the behavior of the target cell. 
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Figure 1 5-1 5 An example of primary and secondary responses induced by the activation of a nuclear hormone receptor. 

(A) Early primary response and (B) delayed secondary response. Some of the primary-response proteins turn on secondary- 
response genes, while others turn off the primary-response genes. The actual number of primary-response and secondary- 
response genes is greater than shown. Drugs that inhibit protein synthesis suppress the transcription of secondary-response 
genes but not primary-response genes, allowing these two classes of gene transcription responses to be readily 
distinguished. The figure shows the responses to a steroid hormone, but the same principles apply for many other ligands 
that activate nuclear receptor proteins. 



Most cell-surface receptor proteins belong to one of three classes, defined by 
their transduction mechanism. Ion- channel- coupled receptors, also known as 
transmitter- gated ion channels or ionotropic receptors, are involved in rapid 
synaptic signaling between nerve cells and other electrically excitable target 
cells such as nerve and muscle cells (Figure 15-16A). This type of signaling is 
mediated by a small number of neurotransmitters that transiently open or close 
an ion channel formed by the protein to which they bind, briefly changing the 
ion permeability of the plasma membrane and thereby the excitability of the 
postsynaptic target cell. Most ion-channel-coupled receptors belong to a large 
family of homologous, multipass transmembrane proteins. Because they are 
discussed in detail in Chapter 11, we shall not consider them further here. 

G-protein-coupled receptors act by indirectly regulating the activity of a 
separate plasma-membrane-bound target protein, which is generally either an 
enzyme or an ion channel. A trimeric GTP -binding protein (G protein) mediates 
the interaction between the activated receptor and this target protein (Figure 
15-16B). The activation of the target protein can change the concentration of 
one or more small intracellular mediators (if the target protein is an enzyme), or 
it can change the ion permeability of the plasma membrane (if the target protein 
is an ion channel). The small intracellular mediators act in turn to alter the 
behavior of yet other signaling proteins in the cell. All of the G-protein-coupled 
receptors belong to a large family of homologous, multipass transmembrane 
proteins. 

Enzyme-coupled receptors either function directly as enzymes or associate 
directly with enzymes that they activate (Figure 15-16C). They are usually single- 
pass transmembrane proteins that have their ligand-binding site outside the cell 
and their catalytic or enzyme -binding site inside. Enzyme -coupled receptors are 
heterogeneous in structure compared with the other two classes. The great 
majority, however, are either protein kinases or associate with protein kinases, 
which phosphorylate specific sets of proteins in the target cell when activated. 
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There are also some types of cell-surface receptors that do not fit easily into 
any of these classes but have important functions in controlling the specializa- 
tion of different cell types during development and in tissue renewal and repair. 
We discuss these in a later section, after we explain how G-protein-coupled 
receptors and enzyme-coupled receptors operate. First, however, we consider 
some general principles of signaling via cell-surface receptors, in order to pre- 
pare for the detailed discussion of the major classes of cell- surface receptors that 
follows. 



Most Activated Cell-Surface Receptors Relay Signals Via Small 
Molecules and a Network of Intracellular Signaling Proteins 

A combination of small and large intracellular signaling molecules relays signals 
received at the cell surface by either G-protein-coupled or enzyme-coupled 
receptors into the cell interior. The resulting chain of intracellular signaling 
events ultimately alters effector proteins that are responsible for modifying the 
behavior of the cell (see Figure 15-1). 

The small intracellular signaling molecules are called small intracellular 
mediators, or second messengers (the "first messengers" being the extracellular 
signals). They are generated in large numbers in response to receptor activation 
and often diffuse away from their source, spreading the signal to other parts of 
the cell. Some, such as cyclic AMP and Ca 2+ , are water-soluble and diffuse in the 
cytosol, while others, such as diacylglycerol, are lipid-soluble and diffuse in the 
plane of the plasma membrane. In either case, they pass the signal on by bind- 
ing to and altering the conformation and behavior of selected signaling proteins 
or effector proteins. 
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Figure 15-16 Three classes of cell- 
surface receptors. (A) lon-channel- 
coupled receptors (also called 
transmitter-gated ion channels), 

(B) G-protein-coupled receptors, and 

(C) enzyme-coupled receptors. Although 
many enzyme-coupled receptors have 
intrinsic enzymatic activity, as shown on 
the left in (C), many others rely on 
associated enzymes, as shown on the 
right in (C). 
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Figure 15-17 A hypothetical 
intracellular signaling pathway from a 
cell-surface receptor to the nucleus. In 

this example, a series of signaling 
proteins and small intracellular mediators 
relay the extracellular signal into the 
nucleus, causing a change in gene 
expression. The signal is altered 
(transduced), amplified, distributed, and 
modulated en route. Because many of the 
steps can be affected by other 
extracellular and intracellular signals, the 
final effect of one extracellular signal 
depends on multiple factors affecting the 
cell (see Figure 15-8). Ultimately, the 
signaling pathway activates (or 
inactivates) effector proteins that alter 
the cell's behavior. In this example, the 
effector is a gene regulatory protein that 
activates gene transcription. Although 
the figure shows individual signaling 
proteins performing a single function, in 
actuality they often have more than one 
function; scaffold proteins, for example, 
often also serve to anchor several 
signaling proteins to a particular 
intracellular structure. 

Most signaling pathways to the nucleus 
are more direct than this one, which is 
not based on a known pathway. 



GENE TRANSCRIPTION 



The large intracellular signaling molecules are intracellular signaling pro- 
teins, which help relay the signal into the cell by either generating small intra- 
cellular mediators or activating the next signaling or effector protein in the path- 
way These proteins form a functional network, in which each protein helps to 
process the signal in one or more of the following ways as it spreads the signal's 
influence through the cell (Figure 15-17): 

1 . The protein may simply relay the signal to the next signaling component in 
the chain. 

2. It may act as a scaffold to bring two or more signaling proteins together so 
that they can interact more quickly and efficiently 
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3. It may transform, or transduce, the signal into a different form, which is 
suitable for either passing the signal along or stimulating a cell response. 

4. It may amplify the signal it receives, either by producing large amounts of 
a small intracellular mediator or by activating many copies of a down- 
stream signaling protein. In this way, a small number of extracellular signal 
molecules can evoke a large intracellular response. When there are multi- 
ple amplification steps in a relay chain, the chain is often referred to as a 
signaling cascade. 

5. It may receive signals from two or more signaling pathways and integrate 
them before relaying a signal onward. A protein that requires input from 
two or more signaling pathways to become activated, is often referred to as 
a coincidence detector. 

6. It may spread the signal from one signaling pathway to another, creating 
branches in the signaling stream, thereby increasing the complexity of the 
response. 

7. It may anchor one or more signaling proteins in a pathway to a particular 
structure in the cell where the signaling proteins are needed. 

8. It may modulate the activity of other signaling proteins and thereby regu- 
late the strength of signaling along a pathway. 

We now consider in more detail some of the strategies that intracellular 
signaling proteins use in processing the signal as it is relayed along signaling 
pathways. We encounter these general strategies again in later sections of the 
chapter, when we discuss specific receptor classes and the signaling pathways 
they activate. 



Many Intracellular Signaling Proteins Function as Molecular 
Switches That Are Activated by Phosphorylation or GTP Binding 

Many intracellular signaling proteins behave like molecular switches. When they 
receive a signal, they switch from an inactive to an active conformation, until 
another process switches them off, returning them to their inactive conforma- 
tion. As we mentioned earlier, the switching off is just as important as the 
switching on. If a signaling pathway is to recover after transmitting a signal so 
that it can be ready to transmit another, every activated molecule in the pathway 
must return to its original, unactivated state. 

Two important classes of molecular switches that operate in intracellular 
signaling pathways depend on the gain or loss of phosphate groups for their 
activation or inactivation, although the way in which the phosphate is gained 
and lost is very different in the two classes. The largest class consists of pro- 
teins that are activated or inactivated by phosphorylation (discussed in Chap- 
ter 3). For these proteins, the switch is thrown in one direction by a protein 
kinase, which covalently adds one or more phosphate groups to the signaling 
protein, and in the other direction by a protein phosphatase, which removes 
the phosphate groups (Figure 15-18A). The activity of any protein regulated by 
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Figure 1 5-1 8 Two types of intracellular 
signaling proteins that act as molecular 
switches. Although one type is activated 
by phosphorylation and the other by GTP 
binding, in both cases the addition of a 
phosphate group switches the activation 
state of the protein and the removal of the 
phosphate switches it back again. 
(A) A protein kinase covalently adds a 
phosphate from ATP to the signaling 
protein, and a protein phosphatase 
removes the phosphate. Although not 
shown, some signaling proteins are 
activated by dephosphorylation rather 
than by phosphorylation. (B) A GTP- 
binding protein is induced to exchange its 
bound GDP for GTP, which activates the 
protein; the protein then inactivates itself 
by hydrolyzing its bound GTP to GDP. 
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phosphorylation depends on the balance at any instant between the activities of 
the kinases that phosphorylate it and of the phosphatases that dephosphorylate 
it. About 30% of human proteins contain covalently attached phosphate, and the 
human genome encodes about 520 protein kinases and about 150 protein phos- 
phatases. It is thought that a typical mammalian cell makes use of hundreds of 
distinct types of protein kinases at any one time. 

Many signaling proteins controlled by phosphorylation are themselves pro- 
tein kinases, and these are often organized into phosphorylation cascades. In 
such a cascade, one protein kinase, activated by phosphorylation, phosphory- 
lates the next protein kinase in the sequence, and so on, relaying the signal 
onward and, in the process, amplifying it and sometimes spreading it to other 
signaling pathways. Two main types of protein kinases operate as intracellular 
signaling proteins. The great majority are serine/threonine kinases, which 
phosphorylate proteins on serines and (less often) threonines. Others are tyro- 
sine kinases, which phosphorylate proteins on tyrosines. An occasional kinase 
can do both. 

The other important class of molecular switches that function by gaining 
and losing phosphate groups consists of GTP-binding proteins (discussed in 
Chapter 3). These proteins switch between an "on" (actively signaling) state 
when GTP is bound and an "off" state when GDP is bound. In the "on" state, they 
have intrinsic GTPase activity and shut themselves off by hydrolyzing their 
bound GTP to GDP (Figure 15-18B). There are two major types of GTP-binding 
proteins. Large trimeric GTP-binding proteins (also called G proteins) help relay 
signals from G-protein- coupled receptors that activate them (see Figure 
15-16B). Small monomeric GTPases (also called monomeric GTP-binding pro- 
teins) help relay signals from many classes of cell- surface receptors. 

Specific regulatory proteins control both types of GTP-binding proteins. 
GTPase-activating proteins (GAPs) drive the proteins into an "off" state by 
increasing the rate of hydrolysis of bound GTP; the GAPs that function in this way 
are also called regulators of G-protein signaling (RGS) proteins. Conversely, G-pro- 
tein- coupled receptors activate trimeric G proteins, and guanine nucleotide 
exchange factors (GEFs) activate monomeric GTPases, by promoting the release 
of bound GDP in exchange for binding of GTP. Figure 15-19 illustrates the regula- 
tion of monomeric GTPases. 

Both trimeric G proteins and monomeric GTPases also participate in many 
other processes in eucaryotic cells, including the regulation of vesicular traffic 
and aspects of cell division. 

As discussed earlier, specific combinations of extracellular signals, rather 
than one signal molecule acting alone, are generally required to stimulate com- 
plex cell behaviors, such as cell survival and cell growth and proliferation (see 
Figure 15-8). The cell therefore has to integrate information coming from multi- 
ple signals if it is to make an appropriate response; many mammalian cells, for 
example, require both soluble signals and signals from the extracellular matrix 
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Figure 15-1 9 The regulation of a 
monomeric GTPase. GTPase-activating 
proteins (GAPs) inactivate the protein by 
stimulating it to hydrolyze its bound GTP 
to GDP, which remains tightly bound to 
the inactivated GTPase. Guanine 
nucleotide exchange factors (GEFs) 
activate the inactive protein by 
stimulating it to release its GDP; because 
the concentration of GTP in the cytosol is 
1 times greater than the concentration 
of GDP, the protein rapidly binds GTP 
once it ejects GDP and is thereby 
activated. 
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(discussed in Chapter 19) to grow and proliferate. The integration depends in 
part on intracellular coincidence detectors, which are equivalent to AND gates in 
the microprocessor of a computer, in that they are only activated if they receive 
multiple converging signals (see Figure 15-17). Figure 15-20 illustrates a simple 
hypothetical example of such a protein. 

Not all molecular switches in signaling pathways depend on phosphoryla- 
tion or GTP binding, however. We see later that some signaling proteins are 
switched on or off by the binding of another signaling protein or a small intra- 
cellular mediator such as cyclic AMP or Ca 2+ , or by covalent modifications other 
than phosphorylation or dephosphorylation, such as ubiquitylation. Moreover, 
not all intracellular signaling proteins act as switches when they are phosphory- 
lated or otherwise reversibly modified. As we discuss later, in many cases the 
covalently added group simply marks the protein so that it can interact with 
other signaling proteins that recognize the modification. 



Intracellular Signaling Complexes Enhance the Speed, Efficiency, 
and Specificity of the Response 

Even a single type of extracellular signal acting through a single type of cell-sur- 
face receptor often activates multiple parallel signaling pathways and can 
thereby influence multiple aspects of cell behavior — such as shape, movement, 
metabolism, and gene expression. Given the complexity of signal-response sys- 
tems, which often involve multiple interacting relay chains of signaling proteins, 
how does an individual cell manage to make specific responses to so many dif- 
ferent combinations of extracellular signals? The question is especially puzzling 
because many of the signals are closely related to one another and bind to 
closely related types of receptors. The same type of intracellular relay protein 
may couple one receptor subtype to one set of effectors and another receptor 
subtype to another set of effectors. In such cases, how is it possible to achieve 
specificity and avoid cross-talk? One strategy makes use of scaffold proteins 
(see Figure 15-17), which bind together groups of interacting signaling proteins 
into signaling complexes, often before a signal has been received (Figure 
15-21A). Because the scaffold holds the signaling proteins in close proximity, 
the components can interact at high local concentrations and be sequentially 
activated speedily, efficiently, and selectively in response to an appropriate 
extracellular signal, avoiding unwanted cross-talk with other signaling path- 
ways. 

In other cases, signaling complexes form only transiently in response to an 
extracellular signal and rapidly disassemble when the signal is gone. Such tran- 
sient complexes often assemble around a receptor after an extracellular signal 
molecule has activated it. In many of these cases, the cytoplasmic tail of the acti- 
vated receptor is phosphorylated during the activation process, and the phos- 
phorylated amino acids then serve as docking sites for the assembly of other sig- 
naling proteins (Figure 15-2 IB). In yet other cases, receptor activation leads to 
the production of modified phospholipid molecules (called phosphoinositides) 
in the adjacent plasma membrane, which then recruit specific intracellular sig- 
naling proteins to this region of membrane, where they are activated (Figure 
15-21C). We discuss the roles of phosphoinositides in membrane trafficking 
events in Chapter 13. 



Modular Interaction Domains Mediate Interactions Between 
Intracellular Signaling Proteins 

Simply bringing intracellular signaling proteins together into close proximity is 
sometimes sufficient to activate them. Thus, induced proximity, where a signal 
triggers assembly of a signaling complex, is commonly used to relay signals from 
protein to protein along a signaling pathway. The assembly of such signaling 
complexes depends on various highly conserved, small interaction domains, 
which are found in many intracellular signaling proteins. Each of these compact 
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Figure 15-20 Signal integration. 

Extracellular signals A and B activate 
different intracellular signaling pathways, 
each of which leads to the 
phosphorylation of protein Y but at 
different sites on the protein. Protein Y is 
activated only when both of these sites 
are phosphorylated, and therefore it 
becomes active only when signals A and 
B are simultaneously present. Such 
proteins are often called coincidence 
detectors. 
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(B) ASSEMBLY OF SIGNALING COMPLEX ON AN ACTIVATED RECEPTOR 
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Figure 15-21 Three types of intracellular 
signaling complexes. (A) A receptor and 
some of the intracellular signaling 
proteins it activates in sequence are 
preassembled into a signaling complex 
on the inactive receptor by a large 
scaffold protein. In other cases, a 
preassembled complex binds to the 
receptor only after the receptor is 
activated (not shown). (B) A signaling 
complex assembles on a receptor only 
after the binding of an extracellular 
signal molecule has activated the 
receptor; here the activated receptor 
phosphorylates itself at multiple sites, 
which then act as docking sites for 
intracellular signaling proteins. 
(C) Activation of a receptor leads to the 
increased phosphorylation of specific 
phospholipids (phosphoinositides) in the 
adjacent plasma membrane, which then 
serve as docking sites for specific 
intracellular signaling proteins, which can 
now interact with each other. 



(C) ASSEMBLY OF SIGNALING COMPLEX ON PHOSPHOINOSITIDE DOCKING SITES 
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protein modules binds to a particular structural motif in another protein (or 
lipid) molecule with which the signaling protein interacts. The recognized motif 
in the interacting protein can be a short peptide sequence, a covalent modifica- 
tion (such as phosphorylated or ubiquitylated amino acids), or another protein 
domain. The use of modular interaction domains presumably facilitated the 
evolution of new signaling pathways; because it can be inserted almost any- 
where in a protein without disturbing the protein's folding or function, a new 
interaction domain added to an existing signaling protein could connect the 
protein to additional signaling pathways. 

There are many types of interaction domains in signaling proteins. Src 
homology 2 (SH2) domains and phosphotyrosine-binding (PTB) domains, for 
example, bind to phosphorylated tyrosines in a particular peptide sequence on 



GENERAL PRINCIPLES OF CELL COMMUNICATION 



899 



extracellular 
' signal molecule 



phosphoinositide 
docking sites 




plasma 
membrane 

/ 



CYTOSOL 



IRS1 (docking protein 



scaffold protein 



activated receptors or intracellular signaling proteins. Src homology 3 (SH3) 
domains bind to short proline-rich amino acid sequences. Some pleckstrin 
homology (PH) domains bind to the charged head groups of specific phospho- 
inositides that are produced in the plasma membrane in response to an extra- 
cellular signal; they enable the protein they are part of to dock on the membrane 
and interact with other similarly recruited signaling proteins (see Figure 
15-21C). Some signaling proteins consist solely of two or more interaction 
domains and function only as adaptors to link two other proteins together in a 
signaling pathway. 

Interaction domains enable signaling proteins to bind to one another in 
multiple specific combinations. Like Lego bricks, the proteins can form linear or 
branching chains or three-dimensional networks, which determine the route 
followed by the signaling pathway. As an example, Figure 15-22 illustrates how 
some interaction domains function in the case of the receptor for the hormone 
insulin. 

Some cell- surface receptors and intracellular signaling proteins may cluster 
together transiently in specific microdomains in the lipid bilayer of the plasma 
membrane that are enriched in cholesterol and glycolipids (see Figure 10-13). 
These lipid rafts may promote efficient signaling by serving as sites where sig- 
naling molecules assemble and interact, but their importance in signaling 
remains controversial. 

Another way of bringing receptors and intracellular signaling proteins 
together is to concentrate them in a specific region of the cell. An important 
example is the primary cilium that projects like an antenna from the surface of 
most vertebrate cells (discussed in Chapter 16) . It is usually short and nonmotile 
and has microtubules in its core, and a number of surface receptors and signal- 
ing proteins are concentrated there. We shall see later that light and smell recep- 
tors are also highly concentrated in specialized cilia. 



Figure 15-22 A specific signaling 
complex formed using modular 
interaction domains. This example is 
based on the insulin receptor, which is an 
enzyme-coupled receptor (a receptor 
tyrosine kinase, discussed later). First, the 
activated receptor phosphorylates itself 
on tyrosines, and one of the 
phosphotyrosines then recruits a docking 
protein called insulin receptor substrate- 
1 (IRS1 ) via a PTB domain of IRS1; the PH 
domain of IRS1 also binds to specific 
phosphoinositides on the inner surface of 
the plasma membrane. Then, the 
activated receptor phosphorylates IRS1 
on tyrosines, and one of these 
phosphotyrosines recruits the adaptor 
protein (Grb2) via an SH2 domain of 
Grb2. Next, Grb2 uses one of its two SH3 
domains to bind to a proline-rich region 
of the monomeric GTPase-activating 
protein called Sos (a Ras-GEF, discussed 
later), which also binds to 
phosphoinositides in the plasma 
membrane via its PH domain. Grb2 uses 
its other SH3 domain to bind to a proline- 
rich sequence in a scaffold protein. The 
scaffold protein binds several other 
signaling proteins, and the other 
phosphorylated tyrosines on IRS1 recruit 
additional signaling proteins that have 
SH2 domains (not shown). 



Cells Can Use Multiple Mechanisms to Respond Abruptly to a 
Gradually Increasing Concentration of an Extracellular Signal 

Some cell responses to extracellular signal molecules are smoothly graded accord- 
ing to the concentration of the signal molecule. In other cases, the relationship 
between signal and response can be discontinuous or all-or-none, with an abrupt 
switch from one type of outcome to another as the signal concentration increases 
beyond a certain value (Figure 15-23). Such effects are often the result of positive 

Figure 15-23 Smoothly graded versus switchlike signaling responses. 

Some cell responses increase gradually as the concentration of an 
extracellular signal molecule increases {blue line). In other cases, the 
responding cell is driven into an abruptly different state as the signal 
strength increases beyond a certain critical value (red line). 
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feedback in the response system, as we discuss below. Both types of responses are 
common, and it is not always easy to distinguish between them. When one mea- 
sures the effect of a signal on a whole population of cells, it may appear smoothly 
graded, even though the individual cells may be responding in an all-or-none 
manner but with variation from cell to cell in the signal concentration at which the 
switch occurs; Figure 15-24 shows an example. 

Moreover, smoothly graded responses can sometimes be very steeply depen- 
dent on the strength of the signal, giving the appearance of almost switchlike 
behavior. Cells use a variety of molecular mechanisms to acheive such effects. In 
one mechanism, more than one intracellular signaling molecule must bind to its 
downstream target protein to induce a response. As we discuss later, four 
molecules of the small intracellular mediator cyclic AMP, for example, must be 
bound simultaneously to each molecule of cyclic-AMP- dependent protein kinase 
(PKA) to activate the kinase. A similar sharpening of response is seen when the 
activation of an intracellular signaling protein requires phosphorylation at more 
than one site. Such cooperative responses become sharper as the number of 
required cooperating molecules or phosphate groups increases, and if the num- 
ber is large enough, responses become almost all-or-none (Figure 15-25). 

Responses are also sharpened when an intracellular signaling molecule acti- 
vates one enzyme and, at the same time, inhibits another enzyme that catalyzes 
the opposite reaction. A well- studied example of this common type of regulation 
is the stimulation of glycogen breakdown in skeletal muscle cells induced by the 
hormone adrenaline (epinephrine). Adrenaline's binding to a G-protein-cou- 
pled cell- surface receptor increases the intracellular concentration of cyclic 
AMP, which both activates an enzyme that promotes glycogen breakdown and 
inhibits an enzyme that promotes glycogen synthesis. 



Figure 15-24 The importance of 
examining individual cells to detect all- 
or-none responses to increasing 
concentrations of an extracellular 
signal. In these experiments, immature 
frog eggs (oocytes) were stimulated with 
increasing concentrations of the 
hormone progesterone. The response 
was assessed by analyzing the activation 
of MAP kinase (discussed later), which is 
one of the protein kinases activated by 
phosphorylation in the response. The 
amount of phosphorylated (activated) 
MAP kinase in extracts of the oocytes was 
assessed biochemically. In (A), extracts of 
populations of stimulated oocytes were 
analyzed, and the activation of MAP 
kinase is seen to increase progressively 
with increasing progesterone 
concentration. There are two possible 
ways of explaining this result: (B) MAP 
kinase could have increased gradually in 
each individual cell with increasing 
progesterone concentration; 
(C) alternatively, individual cells could 
have responded in an all-or-none way, 
with the gradual increase in total MAP 
kinase activation reflecting the increasing 
number of cells responding with 
increasing progesterone concentration. 
When extracts of individual oocytes were 
analyzed, it was found that cells had 
either very low amounts or very high 
amounts, but not intermediate amounts, 
of the activated kinase, indicating that 
the response was essentially all-or-none 
at the level of individual cells, as 
diagrammed in (C). (Adapted from 
J.E. Ferrell and E.M. Machleder, Science 
280:895-898, 1998. With permission from 
AAAS.) 
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Figure 1 5-25 Activation curves for an 
allosteric protein as a function of 
effector molecule concentration. The 

curves show how the sharpness of the 
activation response increases with an 
increase in the number of allosteric 
effector molecules that must be bound 
simultaneously to activate the target 
protein. The curves shown are those 
expected, under certain conditions, if the 
activation requires the simultaneous 
binding of 1, 2, 8, or 16 effector 
molecules. 
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All of these mechanisms can produce very steep responses, but the 
responses are still smoothly graded according to the concentration of the extra- 
cellular signal molecule. To produce true all-or-none responses, one needs 
another mechanism, positive feedback, as we now discuss. 
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Intracellular Signaling Networks Usually Make Use of Feedback 
Loops 

Like intracellular metabolic pathways (discussed in Chapter 2), most intracellu- 
lar signaling networks incorporate feedback loops, in which the output of a pro- 
cess acts back to regulate that same process. In positive feedback, the output 
stimulates its own production; in negative feedback, the output inhibits its own 
production (Figure 15-26). Feedback loops are of great general importance in 
biology, and they regulate many chemical and physical processes in cells. They 
operate over an enormous range of time scales in cells, from milliseconds (in the 
case of an action potential, for example — see Figure 11-29) to many hours (in 
the case of circadian oscillations, for example — see Figure 7-73). Those that reg- 
ulate cell signaling can either operate exclusively within the target cell or involve 
the secretion of extracellular signals. Here, we focus on those feedback loops 
that operate entirely within the target cell; even the simplest of these loops can 
produce complex and interesting effects. 

A positive feedback loop in a signaling pathway can transform the behavior 
of the responding cell. If the positive feedback is of only moderate strength, its 
effect will be simply to steepen and increase the response to the signal. But if the 
feedback is strong enough, it can produce a qualitatively different result: a run- 
away increase in the quantity of product when the signal increases above a crit- 
ical value, leading to a new steady level of production that is sharply different 
from that obtained when the signal was only slightly weaker (Figure 15-27). 

This type of all-or-none response goes hand in hand with a further property: 
once the responding system has switched to the high level of activation, this 
condition is self-sustaining and can persist even after the signal strength drops 
back below its critical value. In such a case, the system is said to be bistable: it 
can exist in either a "switched- off " or a "switched- on" state, and a transient stim- 
ulus can flip it from the one state to the other (Figure 15-28A and B). Cells use 
positive feedback loops of this sort to make stable all-or-none decisions, espe- 
cially during development, when cells in different positions must choose 
between alternative developmental pathways in response to smoothly graded 
positional signals (morphogens), as discussed earlier. Through positive feed- 
back, a transient extracellular signal can often induce long-term changes in cells 
and their progeny that can persist for the lifetime of the organism. The signals 
that trigger muscle cell specification, for example, turn on the transcription of a 
series of genes that encode muscle-specific gene regulatory proteins, which 
stimulate the transcription of their own genes, as well as genes encoding various 
other muscle cell proteins; in this way, the decision to become a muscle cell is 
made permanent (see Figure 7-75). This type of cell memory dependent on pos- 
itive feedback is one of the basic ways in which a cell can undergo a lasting 
change of character without any alteration in its DNA sequence; and this altered 
state can be passed on to daughter cells. Such mechanisms of inheritance are 
called epigenetic, in contrast to genetic mechanisms involving mutations of the 
DNA, and they are discussed more fully in Chapter 7 (see Figure 7-86). 

Figure 1 5-27 A positive feedback mechanism giving switch-like 
behavior. In this example, an intracellular signaling molecule (ligand) 
activates an enzyme located downstream in a signaling pathway. Two 
molecules of the product of the enzymatic reaction bind back to the same 
enzyme to activate it further. The consequence is a very low rate of 
synthesis of the product in the absence of the ligand. The rate increases 
slowly with the concentration of ligand until, at some threshold level of 
ligand, enough of the product has been synthesized to activate the 
enzyme in a self-accelerating, runaway fashion. The concentration of the 
product then rapidly increases to a much higher level. 
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Figure 15-26 Positive and negative 
feedback. In these simple examples, a 
stimulus activates protein A, which, in 
turn, activates protein B. Protein B then 
acts back to either increase or decrease 
the activity of A. 
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By contrast with positive feedback, a negative feedback loop counteracts 
the effect of a stimulus and thereby abbreviates and limits the level of the 
response, making the system less sensitive to perturbations. As with positive 
feedback, however, qualitatively different phenomena can be obtained when 
the feedback operates more powerfully A delayed negative feedback with a long 
enough delay can produce responses that oscillate. The oscillations may persist 
for as long as the stimulus is present (Figure 15-28C) or they may even be gen- 
erated spontaneously, without need of an external signal to drive them (see Fig- 
ure 22-82). Later in this chapter, we shall encounter a number of examples of 
such oscillatory behavior in the intracellular responses to extracellular signals; 
all of them depend on negative feedback. 

If negative feedback operates with a short delay, the system behaves like a 
change detector. It gives a strong response to a stimulus, but the response rapidly 
decays even while the stimulus persists; if the stimulus is suddenly increased, 
however, the system responds strongly again, but, again, the response rapidly 
decays. This is the phenomenon of adaptation, which we now discuss. 



Cells Can Adjust Their Sensitivity to a Signal 

In responding to many types of stimuli, cells and organisms are able to detect the 
same percentage of change in a signal over a very wide range of stimulus 
strengths. The target cells accomplish this through a reversible process of adap- 
tation, or desensitization, whereby a prolonged exposure to a stimulus decreases 
the cells' response to that level of stimulus. In chemical signaling, adaptation 
enables cells to respond to changes in the concentration of an extracellular signal 
molecule (rather than to the absolute concentration of the signal) over a very 
wide range of signal concentrations. The underlying mechanism is negative feed- 
back that operates with a short delay: a strong response modifies the signaling 
machinery involved, such that the machinery resets itself to become less respon- 
sive to the same level of signal (see Figure 15-28D, middle graph). Owing to the 
delay, however, a sudden increase in the signal is able to stimulate the cell again 
for a short period before the negative feedback has time to kick in. 



Figure 1 5-28 Some effects of simple 
feedback. The graphs show the 
computed effects of simple positive and 
negative feedback loops. In each case, 
the input signal is an activated protein 
kinase (S) that phosphorylates and 
thereby activates another protein kinase 
(E); a protein phosphatase (I) 
dephosphorylates and inactivates the 
activated E kinase. In the graphs, the red 
line indicates the activity of the E kinase 
over time; the underlying blue bar 
indicates the time for which the input 
signal (activated S kinase) is present. (A) 
Diagram of the positive feedback loop, in 
which the activated E kinase acts back to 
promote its own phosphorylation and 
activation; the basal activity of the I 
phosphatase dephosphorylates activated 
E at a steady low rate. (B) The top graph 
shows that, without feedback, the activity 
of the E kinase is simply proportional 
(with a short lag) to the level of 
stimulation by the S kinase. The bottom 
graph shows that, with the positive 
feedback loop, the system is bistable 
(i.e., it is capable of existing in either of 
two steady states): the transient 
stimulation by S kinase switches the 
system from an "off" state to an "on" state, 
which then persists after the stimulus has 
been removed. (C) Diagram of the 
negative feedback loop, in which the 
activated E kinase phosphorylates and 
activates the I phosphatase, thereby 
increasing the rate at which the 
phosphatase dephosphorylates and 
inactivates the phosphorylated E kinase. 
The top graph shows, again, the response 
in E kinase activity without feedback. The 
other graphs show the effects on E kinase 
activity of negative feedback operating 
after a short or long delay. With a short 
delay, the system shows a strong brief 
response when the signal is abruptly 
changed, and the feedback then drives 
the response back down to a lower level. 
With a long delay, the feedback produces 
sustained oscillations for as long as the 
stimulus is present. 

Small changes in the details of the 
feedback can radically alter the way the 
system responds, even for this very 
simple example; the figure shows only a 
small sample of possible behaviors. 
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Desensitization to a signal molecule can occur in various ways. It can result 
from an inactivation of the receptors themselves. The binding of signal 
molecules to cell-surface receptors, for example, may induce the endocytosis 
and temporary sequestration of the receptors in endosomes. In some cases, 
such signal-induced receptor endocytosis leads to the destruction of the recep- 
tors in lysosomes, a process referred to as receptor down-regulation (in other 
cases, however, activated receptors continue to signal after they have been 
endocytosed, as we discuss later). Receptors can also become desensitized 
(inactivated) on the cell surface — for example, by becoming phosphorylated or 
methylated — with a short delay following their activation. Desensitization can 
also occur at sites downstream of the receptors, either by a change in intracellu- 
lar signaling proteins involved in transducing the extracellular signal or by the 
production of an inhibitor protein that blocks the signal transduction process. 
These various desensitization mechanisms are compared in Figure 15-29. 

Having discussed some of the general principles of cell signaling, we next 
turn to the G-protein-coupled receptors, which are by far the largest class of cell- 
surface receptors, mediating responses to the most diverse types of extracellular 
signals. 



Figure 1 5-29 Some ways in which target 
cells can become desensitized (adapted) 
to an extracellular signal molecule. The 

mechanisms shown here that operate at 
the level of the receptor often involve 
phosphorylation or ubiquitylation of the 
receptor proteins. In bacterial 
chemotaxis, however, which we discuss 
later, adaptation depends on methylation 
of the receptor proteins. 



Summary 

Each cell in a multicellular animal has been programmed during development to 
respond to a specific set of extracellular signal molecules produced by other cells. The 
signal molecules act in various combinations to regulate the behavior of the cell. Most 
signal molecules act as local mediators, which are secreted and then rapidly taken up, 
destroyed, or immobilized, so that they act only on neighboring cells. Other signal 
molecules remain bound to the surface of the signaling cell and mediate contact- 
dependent signaling. There are also two distinct types of long-distance signaling. In 
endocrine signaling, hormones secreted by endocrine cells are carried in the blood to 
target cells throughout the body. In synaptic signaling, neurotransmitters secreted by 
nerve cell axons act locally on the postsynaptic cells that the axons contact. 

Cell signaling requires not only extracellular signal molecules but also a comple- 
mentary set of receptor proteins expressed by the target cells that specifically bind the 
signal molecules. Some small hydrophobic signal molecules, including steroid and 
thyroid hormones, diffuse across the plasma membrane of the target cell and activate 
intracellular receptor proteins that directly regulate the transcription of specific 
genes. The dissolved gases nitric oxide and carbon monoxide act as local mediators by 
diffusing across the plasma membrane of the target cell and activating an intracellu- 
lar protein such as the enzyme guanylyl cyclase, which produces cyclic GMP in the 
target cell. But most extracellular signal molecules are hydrophilic and cannot pass 
through the plasma membrane. These activate cell-surface receptor proteins, which 
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act as signal transducers, converting the extracellular signal into intracellular ones 
that alter the behavior of the target cell. 

The three largest families of cell- surface receptors transduce extracellular signals 
in different ways. Ion-channel-coupled receptors are transmitter- gated ion channels 
that open or close briefly in response to the binding of a neurotransmitter. G-protein- 
coupled receptors indirectly activate or inactivate plasma- membrane-bound enzymes 
or ion channels via tumeric GTP-binding proteins (G proteins). Enzyme-coupled 
receptors either act directly as enzymes or are associated with enzymes; these enzymes 
are usually protein kinases that phosphorylate the receptors and specific signaling 
proteins in the target cell. 

Once activated, G-protein- coupled receptors and enzyme- coupled receptors relay 
the signal into the cell interior by activating chains of intracellular signaling proteins. 
Some of these signaling proteins transduce, amplify, or spread the signal as they relay it, 
while others integrate signals from different signaling pathways. Some function as 
switches that are transiently activated by phosphorylation or GTP binding. Large sig- 
naling complexes form by means of modular interaction domains in the signaling pro- 
teins, which allow the proteins to form functional signaling networks. 

Target cells use various intracellular mechanisms, including feedback loops, to 
adjust the ways in which they respond to extracellular signals. Positive feedback loops 
can help cells to respond in an all- or- none fashion to a gradually increasing concen- 
tration of an extracellular signal or to convert a short-lasting signal into a long-lasting, 
or even irreversible, response. Delayed negative feedback allows cells to desensitize to a 
signal molecule, which enables them to respond to small changes in the concentration 
of the signal molecule over a large concentration range. 



SIGNALING THROUGH G-PROTEIN-COUPLED 
CELL-SURFACE RECEPTORS (GPCRs) AND SMALL 
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All eucaryotes use G-protein-coupled receptors (GPCRs). These form the 
largest family of cell-surface receptors, and they mediate most responses to sig- 
nals from the external world, as well as signals from other cells, including hor- 
mones, neurotransmitters, and local mediators. Our senses of sight, smell, and 
taste (with the possible exception of sour taste) depend on them. There are more 
than 700 GPCRs in humans, and in mice there are about 1000 concerned with 
the sense of smell alone. The signal molecules that act on GPCRs are as varied in 
structure as they are in function and include proteins and small peptides, as well 
as derivatives of amino acids and fatty acids, not to mention photons of light and 
all the molecules that we can smell or taste. The same signal molecule can acti- 
vate many different GPCR family members; for example, adrenaline activates at 
least 9 distinct GPCRs, acetylcholine another 5, and the neurotransmitter sero- 
tonin at least 14. The different receptors for the same signal are usually 
expressed in different cell types and elicit different responses. 

Despite the chemical and functional diversity of the signal molecules that 
activate them, all GPCRs have a similar structure. They consist of a single 
polypeptide chain that threads back and forth across the lipid bilayer seven times 
(Figure 15-30). In addition to their characteristic orientation in the plasma mem- 
brane, they all use G proteins to relay the signal into the cell interior. 

The GPCR superfamily includes rhodopsin, the light- activated protein in the 
vertebrate eye, as well as the large number of olfactory receptors in the vertebrate 
nose. Other family members are found in unicellular organisms: the receptors in 
yeasts that recognize secreted mating factors are an example. It is likely that the 
GPCRs that mediate cell-cell signaling in multicellular organisms evolved from 
the sensory receptors in their unicellular eucaryotic ancestors. 

It is remarkable that about half of all known drugs work through GPCRs or 
the signaling pathways GPCRs activate. Of the many hundreds of genes in the 
human genome that encode GPCRs, about 150 encode orphan receptors, for 
which the ligand is unknown. Many of them are likely targets for new drugs that 
remain to be discovered. 
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Figure 15-30 A G-protein-coupled 
receptor (GPCR). GPCRs that bind protein 
ligands have a large extracellular domain 
formed by the part of the polypeptide 
chain shown in light green. This domain, 
together with some of the 
transmembrane segments, binds the 
protein ligand. Receptors for small 
ligands such as adrenaline have small 
extracellular domains, and the ligand 
usually binds deep within the plane of 
the membrane to a site that is formed by 
amino acids from several transmembrane 
segments. 
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Trimeric G Proteins Relay Signals from GPCRs 

When an extracellular signal molecule binds to a GPCR, the receptor undergoes 
a conformational change that enables it to activate a trimeric GTP-binding pro- 
tein (G protein). The G protein is attached to the cytoplasmic face of the plasma 
membrane, where it functionally couples the receptor to either enzymes or ion 
channels in this membrane. In some cases, the G protein is physically associated 
with the receptor before the receptor is activated, whereas in others it binds only 
after receptor activation. There are various types of G proteins, each specific for 
a particular set of GPCRs and for a particular set of target proteins in the plasma 
membrane. They all have a similar structure, however, and operate similarly. 

G proteins are composed of three protein subunits — a, (3, and y. In the 
unstimulated state, the a subunit has GDP bound and the G protein is inactive 
(Figure 15-31). When a GPCR is activated, it acts like a guanine nucleotide 
exchange factor (GEF) and induces the a subunit to release its bound GDP, 
allowing GTP to bind in its place. This exchange causes a large conformational 
change in the G protein, which activates it. It was originally thought that the 
activation always causes the trimer to dissociate into two activated compo- 
nents — an a subunit and a (5y 'complex. However, there is now evidence that, in 
some cases at least, the conformational change exposes previously buried sur- 
faces between the a subunit and the (3y complex, so that the a subunit and (3y 
complex can each now interact with their targets without requiring the subunits 
to dissociate (Figure 15-32). These targets are either enzymes or ion channels in 
the plasma membrane that relay the signal onward. 

The a subunit is a GTPase, and once it hydrolyzes its bound GTP to GDP it 
becomes inactive. The time for which the G protein remains active depends on 
how quickly the a subunit hydrolyzes its bound GTP. This time is usually short 
because the GTPase activity is greatly enhanced by the binding of the a subunit 
to a second protein, which can be either the target protein or a specific regula- 
tor of G protein signaling (RGS). RGS proteins act as a-subunit-specific 
GTPase -activating proteins (GAPs) (see Figure 15-19), and they help shut off G- 
protein-mediated responses in all eucaryotes. There are about 25 RGS proteins 
encoded in the human genome, each of which interacts with a particular set of 
G proteins. 

GPCRs activate various intracellular signaling pathways, including some that 
are also activated by enzyme-coupled receptors. In this section, however, we 
focus on those GPCR- activated pathways that use small intracellular mediators. 



Figure 1 5-31 The structure of an 
inactive G protein. (A) Note that both the 
a and the y subunits have covalently 
attached lipid molecules (red) that help 
bind them to the plasma membrane, and 
the a subunit has GDP bound. (B) The 
three-dimensional structure of an 
inactive G protein, based on transducin, 
the G protein that operates in visual 
transduction (discussed later). The 
a subunit contains the GTPase domain 
and binds to one side of the p subunit, 
which locks the GTPase domain in an 
inactive conformation that binds GDP. 
The y subunit binds to the opposite side 
of the p subunit, and the p and y subunits 
together form a single functional unit. 
(B, based on D.G. Lombright et al., Nature 
379:311-319, 1996. With permission from 
Macmillan Publishers Ltd.) 



Some G Proteins Regulate the Production of Cyclic AMP 

Cyclic AMP (cAMP) acts as a small intracellular mediator in all procaryotic and 
animal cells that have been studied. Its normal concentration in the cytosol is 
about 10~ 7 M, but an extracellular signal can increase this concentration more 
than twentyfold in seconds (Figure 15-33). As explained earlier (see Figure 
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EXTRACELLULAR SPACE 




activated 
(3y complex 

15-11), such a rapid response requires balancing a rapid synthesis of the 
molecule with its rapid breakdown or removal. Cyclic AMP is synthesized from 
ATP by a plasma-membrane-bound enzyme adenylyl cyclase, and it is rapidly 
and continuously destroyed by cyclic AMP phosphodiesterases that hydrolyze 
cyclic AMP to adenosine 5'-monophosphate (5'-AMP) (Figure 15-34). 

Many extracellular signal molecules work by increasing cyclic AMP concen- 
tration, and they do so by increasing the activity of adenylyl cyclase against a 
steady background of phosphodiesterase activity. Adenylyl cyclase is a large 
multipass transmembrane protein with its catalytic domain on the cytosolic 
side of the plasma membrane. There are at least eight isoforms in mammals, 
most of which are regulated by both G proteins and Ca 2+ . GPCRs that act by 
increasing cyclic AMP are coupled to a stimulatory G protein (G s ), which acti- 
vates adenylyl cyclase and thereby increases cyclic AMP concentration. Another 
G protein, called inhibitory G protein (GO, inhibits adenylyl cyclase, but it acts 
mainly by directly regulating ion channels (as we discuss later). 

Both G s and Gi are targets for some medically important bacterial toxins. 
Cholera toxin, which is produced by the bacterium that causes cholera, is an 
enzyme that catalyzes the transfer of ADP ribose from intracellular NAD + to the 
a subunit of G s . This ADP ribosylation alters the a subunit so that it can no 
longer hydrolyze its bound GTP, causing it to remain in an active state that stim- 
ulates adenylyl cyclase indefinitely. The resulting prolonged elevation in cyclic 
AMP concentration within intestinal epithelial cells causes a large efflux of Cl~ 
and water into the gut, thereby causing the severe diarrhea that characterizes 
cholera. Pertussis toxin, which is made by the bacterium that causes pertussis 
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Figure 15-32 Activation of a G protein 
by an activated GPCR. Binding of an 
extracellular signal to a GPCR changes 
the conformation of the receptor, which 
in turn alters the conformation of the 
G protein. The alteration of the a subunit 
of the G protein allows it to exchange its 
GDP for GTP, activating both the 
a subunit and the Py complex, both of 
which can regulate the activity of target 
proteins in the plasma membrane. The 
receptor stays active while the external 
signal molecule is bound to it, and it can 
therefore catalyze the activation of many 
molecules of G protein, which dissociate 
from the receptor once activated (not 
shown). In some cases, the a subunit and 
the Py complex dissociate from each 
other when the G protein is activated. 
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Figure 15-33 An increase in cyclic AMP in 
response to an extracellular signal. This 
nerve cell in culture is responding to the 
neurotransmitter serotonin, which acts 
through a GPCR to cause a rapid rise in 
the intracellular concentration of cyclic 
AMP. To monitor the cyclic AMP level, the 
cell has been loaded with a fluorescent 
protein that changes its fluorescence 
when it binds cyclic AMP. Blue indicates a 
low level of cyclic AMP, yellow an 
intermediate level, and red a high level. 
(A) In the resting cell, the cyclic AMP level 
is about 5 x 1 0~ 8 M. (B) Twenty seconds 
after the addition of serotonin to the 
culture medium, the intracellular level of 
cyclic AMP has increased to more than 
10~ 6 M in the relevant parts of the cell, an 
increase of more than twentyfold. (From 
Brian J. Bacskai et al., Science 260:222-226, 
1993. With permission from AAAS.) 
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(whooping cough), catalyzes the ADP ribosylation of the a subunit of Gi, pre- 
venting the protein from interacting with receptors; as a result, the G protein 
retains its bound GDP and is unable to regulate its target proteins. These two 
toxins are widely used in experiments to determine whether a cell's GPCR- 
dependent response to a signal is mediated by G s or by Gi. 

Some of the responses mediated by a G s -stimulated increase in cyclic AMP 
concentration are listed in Table 15-1. As the table shows, different cell types 
respond differently to an increase in cyclic AMP concentration, and one cell type 
often responds in the same way to such an increase, regardless of the extracellu- 
lar signal that causes it. At least four hormones activate adenylyl cyclase in fat 
cells, for example, and all of them stimulate the breakdown of triglyceride (the 
storage form of fat) to fatty acids (see Table 15-1). 

Individuals who are genetically deficient in a particular G s a subunit show 
decreased responses to certain hormones. As a consequence, these people dis- 
play metabolic abnormalities, have abnormal bone development, and are men- 
tally retarded. 



Figure 1 5-34 The synthesis and 
degradation of cyclic AMP. In a reaction 
catalyzed by the enzyme adenylyl 
cyclase, cyclic AMP (cAMP) is synthesized 
from ATP through a cyclization reaction 
that removes two phosphate groups as 
pyrophosphate (® - ®); a 
pyrophosphatase drives this synthesis by 
hydrolyzing the released pyrophosphate 
to phosphate (not shown). Cyclic AMP is 
short-lived (unstable) in the cell because 
it is hydrolyzed by specific 
phosphodiesterases to form 5'-AMP, as 
indicated. 



Table 15-1 Some Hormone-induced Cell Responses Mediated by Cyclic AMP 


TARGET TISSUE 


HORMONE 


MAJOR RESPONSE 


Thyroid gland 


thyroid-stimulating 


thyroid hormone synthesis 




hormone (TSH) 


and secretion 


Adrenal cortex 


adrenocorticotrophic 
hormone (ACTH) 


Cortisol secretion 


Ovary 


luteinizing hormone (LH) 


progesterone secretion 


Muscle 


adrenaline 


glycogen breakdown 


Bone 


parathormone 


bone resorption 


Heart 


adrenaline 


increase in heart rate and 
force of contraction 


Liver 


glucagon 


glycogen breakdown 


Kidney 


vasopressin 


water resorption 


Fat 


adrenaline, ACTH, glucagon, 
TSH 


triglyceride breakdown 
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Cyclic-AMP-Dependent Protein Kinase (PKA) Mediates Most of the 
Effects of Cyclic AMP 

In most animal cells, cyclic AMP exerts its effects mainly by activating cyclic- 
AMP- dependent protein kinase (PKA). This kinase phosphorylates specific ser- 
ines or threonines on selected target proteins, including intracellular signaling 
proteins and effector proteins, thereby regulating their activity The target pro- 
teins differ from one cell type to another, which explains why the effects of cyclic 
AMP vary so markedly depending on the cell type (see Table 15-1). 

In the inactive state, PKA consists of a complex of two catalytic subunits and 
two regulatory subunits. The binding of cyclic AMP to the regulatory subunits 
alters their conformation, causing them to dissociate from the complex. The 
released catalytic subunits are thereby activated to phosphorylate specific target 
proteins (Figure 15-35). The regulatory subunits of PKA (also called A-kinase) 
are important for localizing the kinase inside the cell: special A-kinase anchor- 
ing proteins (AKAPs) bind both to the regulatory subunits and to a component of 
the cytoskeleton or a membrane of an organelle, thereby tethering the enzyme 
complex to a particular subcellular compartment. Some AKAPs also bind other 
signaling proteins, forming a complex that functions as a signaling module. An 
AKAP located around the nucleus of heart muscle cells, for example, binds both 
PKA and a phosphodiesterase that hydrolyzes cyclic AMP. In unstimulated cells, 
the phosphodiesterase keeps the local cyclic AMP concentration low, so that the 
bound PKA is inactive; in stimulated cells, cyclic AMP concentration rapidly 
rises, overwhelming the phosphodiesterase and activating the PKA. Among the 
target proteins that PKA phosphorylates and activates in these cells is the adja- 
cent phosphodiesterase, which rapidly lowers the cyclic AMP concentration 
again. This arrangement converts what might otherwise be a weak and pro- 
longed PKA response into a strong, brief, local pulse of PKA activity. 

Whereas some responses mediated by cyclic AMP occur within seconds and 
do not depend on changes in gene transcription (see Figure 15-33), others do 
depend on changes in the transcription of specific genes and take hours to 
develop fully. In cells that secrete the peptide hormone somatostatin, for exam- 
ple, cyclic AMP activates the gene that encodes this hormone. The regulatory 
region of the somatostatin gene contains a short DNA sequence, called the cyclic 
AMP response element (CRE), which is also found in the regulatory region of 
many other genes activated by cyclic AMP. A specific gene regulatory protein 
called CRE-binding (CREB) protein recognizes this sequence. When PKA is acti- 
vated by cAMP, it phosphorylates CREB on a single serine; phosphorylated CREB 
then recruits a transcriptional coactivator called CREB-binding protein (CBP), 
which stimulates the transcription of the target genes (Figure 15-36). Thus, 
CREB can transform a short cyclic AMP signal into a long-term change in a cell, 
a process that, in the brain, is thought to play an important part in some forms 
of learning and memory. 

PKA does not mediate all the effects of cyclic AMP in cells. As we discuss later, 
in olfactory neurons (responsible for the sense of smell), cyclic AMP also directly 
activates special ion channels in the plasma membrane. Moreover, in some other 
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Figure 1 5-35 The activation of cyclic- 
AMP-dependent protein kinase (PKA). 

The binding of cyclic AMP to the 
regulatory subunits of the PKA tetramer 
induces a conformational change, 
causing these subunits to dissociate from 
the catalytic subunits, thereby activating 
the kinase activity of the catalytic 
subunits. The release of the catalytic 
subunits requires the binding of more 
than two cyclic AMP molecules to the 
regulatory subunits in the tetramer. This 
requirement greatly sharpens the 
response of the kinase to changes in 
cyclic AMP concentration, as discussed 
earlier (see Figure 15-25). Mammalian 
cells have at least two types of PKAs: type 
I is mainly in the cytosol, whereas type II 
is bound via its regulatory subunits and 
special anchoring proteins to the plasma 
membrane, nuclear membrane, 
mitochondrial outer membrane, and 
microtubules. In both types, once the 
catalytic subunits are freed and active, 
they can migrate into the nucleus (where 
they can phosphorylate gene regulatory 
proteins), while the regulatory subunits 
remain in the cytoplasm. The three- 
dimensional structure of the protein 
kinase domain of the PKA catalytic 
subunit is shown in Figure 3-65. 
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Figure 15-36 How a rise in intracellular 
cyclic AMP concentration can alter gene 
transcription. <AGAT>The binding of an 
extracellular signal molecule to its GPCR 
activates adenylyl cyclase via G s and 
thereby increases cyclic AMP concentration 
in the cytosol.The rise in cyclic AMP 
concentration activates PKA, and the 
released catalytic subunits of PKA can then 
enter the nucleus, where they 
phosphorylate the gene regulatory protein 
CREB. Once phosphorylated, CREB recruits 
the coactivator CBP, which stimulates gene 
transcription. In some cases, at least, the 
inactive CREB protein is bound to the cyclic 
AMP response element (CRE) in DNA 
before it is phosphorylated (not shown). 
This signaling pathway controls many 
processes in cells, ranging from hormone 
synthesis in endocrine cells to the 
production of proteins required for the 
induction of long-term memory in the 
brain. We will see later that CREB can also 
be activated by some other signaling 
pathways, independent of cAMP. 



cells, it directly activates a guanine nucleotide exchange factor (GEF) that, in 
turn, activates a monomeric GTPase called Rapl, which often leads to increased 
cell adhesion through the activation of cell- surface integrins (discussed in Chap- 
ter 19). 

Having discussed how trimeric G proteins link activated GPCRs to adenylyl 
cyclase, we now consider how they couple activated GPCRs to another crucial 
enzyme, phospholipase C. The activation of this enzyme increases the concentra- 
tion of several small intracellular mediators, including Ca 2+ , which help relay the 
signal onward. Ca 2+ is even more widely used than cyclic AMP as a small intra- 
cellular mediator. 



Some G Proteins Activate An Inositol Phospholipid Signaling 
Pathway by Activating Phospholipase C-p 

Many GPCRs exert their effects mainly via G proteins that activate the plasma- 
membrane-bound enzyme phospholipase C-|J (PLCP). Table 15-2 lists several 
examples of responses activated in this way. The phospholipase acts on a phos- 
phorylated inositol phospholipid (a phosphoinositide) called phosphatidylinos- 
itol 4,5-bisphosphate [PI(4,5)P2, or PIP2L which is present in small amounts in 
the inner half of the plasma membrane lipid bilayer (Figure 15-37). Receptors 
that activate this inositol phospholipid signaling pathway mainly do so via a G 
protein called G q , which activates phospholipase C-(3 in much the same way that 
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Table 15-2 Some Cell Responses in Which GPCRs Activate PLCP 



TARGET TISSUE 


SIGNAL MOLECULE 


MAJOR RESPONSE 


Liver 
Pancreas 
Smooth muscle 
Blood platelets 


vasopressin 
acetylcholine 
acetylcholine 
thrombin 


glycogen breakdown 
amylase secretion 
muscle contraction 
platelet aggregation 



G s activates adenylyl cyclase. The activated phospholipase then cleaves the PIP2 
to generate two products: inositol 1,4,5-trisphosphate (IP3) and diacylglycerol 
(Figure 15-38). At this step, the signaling pathway splits into two branches. 

Inositol 1,4,5-trisphosphate (IP3) is a water-soluble molecule that acts as a 
small intracellular mediator. It leaves the plasma membrane and diffuses rapidly 
through the cytosol. When it reaches the endoplasmic reticulum (ER), it binds to 
and opens IP3-gated Ca 2+ -release channels (also called IP3 receptors) in the ER 
membrane. Ca 2+ stored in the ER is released through the open channels, quickly 
raising the concentration of Ca 2+ in the cytosol (Figure 15-39). Once the ER Ca 2+ 
stores have been depleted, they are refilled by the activation of store-operated 
Ca 2+ channels in the plasma membrane and a Ca 2+ sensor protein in the ER 
membrane, in regions where the two membranes are closely apposed. 

We discuss later how the increase in cytosolic Ca 2+ propagates the signal by 
influencing the activity of Ca 2+ - sensitive intracellular proteins. Several mecha- 
nisms operate to terminate the initial Ca 2+ response: (1) IP3 is rapidly dephos- 
phorylated by specific lipid phosphatases to form IP 2 ; (2) IP 3 is phosphorylated 
by specific lipid kinases to form IP 4 (which may function as another small intra- 
cellular mediator); and (3) Ca 2+ that enters the cytosol is rapidly pumped out, 
mainly to the exterior of the cell (see Figure 15-41). 

At the same time that the IP3 produced by the hydrolysis of PIP2 is increas- 
ing the concentration of Ca 2+ in the cytosol, the other cleavage product of the 
PIP2, diacylglycerol, is exerting different effects. It also acts as a small intracel- 
lular mediator, but it remains embedded in the plasma membrane, where it has 
several potential signaling roles. It can be further cleaved to release arachidonic 
acid, which can either act as a signal in its own right or be used in the synthesis 
of other small lipid signal molecules called eicosanoids. Most vertebrate cell 
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Figure 1 5-37 The synthesis of PI(4 / 5)P 2 . 

The phosphoinositides PI(4)P and 
Pl(4,5)p2 are produced by the 
phosphorylation of phosphatidylinositol 
(PI) and PI(4)P, respectively. Although all 
three inositol phospholipids may be 
broken down in a signaling response, it is 
the breakdown of PI(4,5)P 2 that 
predominates and is most critical 
because it generates two intracellular 
mediators, as shown in Figures 15-38 and 
1 5-39. Nevertheless, PI(4,5)P 2 is the least 
abundant, constituting less than 10% of 
the total inositol phospholipids and 
about 1% of the lipids in the plasma 
membrane. The conventional numbering 
of the carbon atoms in the inositol ring is 
shown in red numbers on the PI molecule. 
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Figure 15-38 The hydrolysis of PI(4,5)P 2 
by phospholipase C-p. Two small 
intracellular mediators are produced 
directly from the hydrolysis of the PIP2: 
inositol 1,4,5-trisphosphate (IP3), which 
diffuses through the cytosol and releases 
Ca 2+ from the endoplasmic reticulum, 
and diacylglycerol, which remains in the 
membrane and helps to activate the 
enzyme protein kinase C (PKC; see Figure 
1 5-39). There are several classes of 
phospholipase C; these include the 
P class, which is activated by GPCRs; as 
we see later, the y class is activated by a 
class of enzyme-coupled receptors called 
receptor tyrosine kinases (RTKs). 
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types make eicosanoids, including prostaglandins, which have many biological 
activities. They participate in pain and inflammatory responses, for example, 
and most anti- inflammatory drugs (such as aspirin, ibuprofen, and cortisone) 
act — in part, at least — by inhibiting their synthesis. 

A second function of diacylglycerol is to activate a crucial serine /threonine 
protein kinase called protein kinase C (PKC), so named because it is Ca 2+ - 
dependent. The initial rise in cytosolic Ca 2+ induced by IP3 alters the PKC so that 
it translocates from the cytosol to the cytoplasmic face of the plasma membrane. 
There it is activated by the combination of Ca 2+ , diacylglycerol, and the nega- 
tively charged membrane phospholipid phosphatidylserine (see Figure 15-39). 
Once activated, PKC phosphorylates target proteins that vary depending on the 
cell type. The principles are the same as discussed earlier for PKA, although most 
of the target proteins are different. 

There are various classes of PKCs, only some of which (called conventional 
PKCs) are activated by Ca 2+ and diacylglycerol; the others are called atypical 
PKCs. Different PKCs phosphorylate different substrates mainly because differ- 
ent anchoring or scaffold proteins tether them to different compartments in the 
cell. 
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Figure 15-39 How GPCRs increase 
cytosolic Ca 2+ and activate PKC. The 

activated GPCR stimulates the plasma- 
membrane-bound phospholipase PLCp 
via a G protein. Depending on the 
isoform of the PLCp, it may be activated 
by the a subunit of G q as shown, by the 
py subunits of another G protein, or by 
both. Two small intracellular messenger 
molecules are produced when PI(4,5)P2 is 
hydrolyzed by activated PLCp. Inositol 
1,4,5-trisphosphate (IP3) diffuses through 
the cytosol and releases Ca 2+ from the ER 
by binding to and opening IP 3 -gated 
Ca 2+ -release channels (IP3 receptors) in 
the ER membrane. The large 
electrochemical gradient for Ca 2+ across 
this membrane causes Ca 2+ to escape 
into the cytosol when the release 
channels are open. Diacylglycerol 
remains in the plasma membrane and, 
together with phosphatidylserine (not 
shown) and Ca 2+ , helps to activate 
protein kinase C (PKC), which is recruited 
from the cytosol to the cytosolic face of 
the plasma membrane. Of the 10 or more 
distinct isoforms of PKC in humans, at 
least 4 are activated by diacylglycerol. 
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Ca 2+ Functions as a Ubiquitous Intracellular Mediator 

Many extracellular signals trigger an increase in cytosolic Ca 2+ concentration, 
not just those that work via G proteins. In egg cells, for example, a sudden rise in 
cytosolic Ca 2+ concentration upon fertilization by a sperm triggers a Ca 2+ wave 
that initiates embryonic development (Figure 15-40). In muscle cells, Ca 2+ trig- 
gers contraction, and in many secretory cells, including nerve cells, it triggers 
secretion. Ca 2+ can act as a signal in this way because its concentration in the 
cytosol is normally very low (~10~ 7 M), whereas its concentration in the extra- 
cellular fluid (~10~ 3 M) and in the lumen of the ER [and sarcoplasmic reticulum 
(SR) in muscle] is high. Thus, there is a large gradient tending to drive Ca 2+ into 
the cytosol across both the plasma membrane and the ER or SR membrane. 
When a signal transiently opens Ca 2+ channels in these membranes, Ca 2+ rushes 
into the cytosol, increasing the local Ca 2+ concentration 10-20-fold and activat- 
ing Ca 2+ -responsive proteins in the cell. <CGTC> 

The Ca 2+ from outside the cell enters the cytosol through various Ca 2+ chan- 
nels in the plasma membrane, which open in response to ligand binding, 
stretch, or membrane depolarization. The Ca 2+ from the ER enters the cytosol 
through either IP 3 receptors (see Figure 15-39) or ryanodine receptors (so called 
because they are sensitive to the plant alkaloid ryanodine). Ryanodine receptors 
are normally activated by Ca 2+ binding and thereby amplify the Ca 2+ signal. Ca 2+ 
also activates IP3 receptors but only in the presence of IP3; and very high con- 
centrations of Ca 2+ inactivate them. We discuss how Ca 2+ is released from the 
sarcoplasmic reticulum to cause the contraction of muscle cells in Chapter 16. 

Several mechanisms keep the concentration of Ca 2+ in the cytosol low in 
resting cells (Figure 15-41). All eucaryotic cells have a Ca 2+ -pump in their 
plasma membrane that uses the energy of ATP hydrolysis to pump Ca 2+ out of 
the cytosol. Cells such as muscle and nerve cells, which make extensive use of 
Ca 2+ signaling, have an additional Ca 2+ transport protein (a Na + -driven Ca 2+ 
exchanger) in their plasma membrane that couples the efflux of Ca 2+ to the 
influx of Na + . A Ca 2+ pump in the ER membrane also has an important role in 
keeping the cytosolic Ca 2+ concentration low: this Ca 2+ -pump enables the ER to 
take up large amounts of Ca 2+ from the cytosol against a steep concentration 
gradient, even when Ca 2+ levels in the cytosol are low. In addition, a low- affinity, 
high-capacity Ca 2+ pump in the inner mitochondrial membrane has an impor- 
tant role in limiting the Ca 2+ signal and in terminating it; it uses the electro- 
chemical gradient generated across this membrane during the electron-transfer 
steps of oxidative phosphorylation to take up Ca 2+ from the cytosol. The result- 
ing increase in Ca 2+ concentration in the mitochondrion can activate some 
enzymes of the citric acid cycle, thereby increasing the synthesis of ATP and link- 
ing cell activation to energy production; an excessive increase in mitochondrial 
Ca 2+ , however, leads to cell death. 



Figure 15-40 The fertilization of an egg 
by a sperm triggers an increase in 
cytosolic Ca 2+ . <AGGA> This starfish egg 
was injected with a Ca 2+ -sensitive 
fluorescent dye before it was fertilized. A 
wave of cytosolic Ca 2+ (red), released 
from the ER, sweeps across the egg from 
the site of sperm entry (arrow). This Ca 2+ 
wave changes the egg cell surface, 
preventing the entry of other sperm, and 
it also initiates embryonic development 
(discussed in Chapter 21). The initial 
increase in Ca 2+ is thought to be caused 
by a sperm-specific form of PLC (PLCQ 
that the sperm brings into the egg 
cytoplasm when it fuses with the egg; 
the PLC?; cleaves PI(4,5)P 2 to produce IP 3 , 
which releases Ca 2+ from the egg ER. 
(Courtesy of Stephen A. Strieker.) 



The Frequency of Ca 2+ Oscillations Influences a Cell's Response 

Researchers often use Ca 2+ -sensitive fluorescent indicators, such as aequorin or 
fura-2 (discussed in Chapter 9), to monitor cytosolic Ca 2+ in individual cells after 
activation of the inositol phospholipid signaling pathway. When viewed in this 
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way, the initial Ca 2+ signal appears small and localized to one or more discrete 
regions of the cell. These Ca 2+ puffs or sparks reflect the local opening of indi- 
vidual or small groups of Ca 2+ -release channels in the ER. Because various Ca 2+ - 
binding proteins act as Ca 2+ buffers and restrict the diffusion of Ca 2+ , the signal 
often remains localized to the site where the Ca 2+ entered the cytosol. If the 
extracellular signal is sufficiently strong and persistent, however, this localized 
Ca 2+ signal can propagate as a regenerative Ca 2+ wave through the cytosol (see 
Figure 15-40), much like an action potential in an axon. Such a Ca 2+ "spike" is 
often followed by a series of further spikes, each usually lasting seconds (Figure 
15-42) . Such Ca 2+ oscillations can persist for as long as receptors are activated at 
the cell surface. Both the waves and the oscillations are thought to depend, at 
least in part, on a combination of positive and negative feedback by Ca 2+ on 



Figure 15-41 The main ways in which 
eucaryotic cells maintain a very low 
concentration of free Ca 2+ in their 
cytosol. (A) Ca 2+ is actively pumped out 
of the cytosol to the cell exterior. (B) Ca 2+ 
is pumped out of the cytosol into the ER 
and mitochondria, and various molecules 
in the cytosol bind free Ca 2+ tightly. 
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Figure 15-42 Vasopressin-induced 
Ca 2+ oscillations in a liver cell. The cell 
was loaded with the Ca 2+ -sensitive protein 
aequorin and then exposed to increasing 
concentrations of the peptide signal 
molecule vasopressin, which activates a 
GPCR and thereby PLCP (see Tabe 1 5-2). 
Note that the frequency of the Ca 2+ spikes 
increases with an increasing concentration 
of vasopressin but that the amplitude of 
the spikes is not affected. Each spike lasts 
about 7 seconds. (Adapted from 
N.M. Woods, K.S.R. Cuthbertson and 
P.H. Cobbold, Nature 319:600-602, 1986. 
With permission from Macmillan 
Publishers Ltd.) 
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both the IP3 receptors and ryanodine receptors: the released Ca 2+ initially stim- 
ulates more Ca 2+ release from both receptors, a process known as Ca 2 * -induced 
Ca 2+ release; but then, as its concentration gets high enough, the Ca 2+ inhibits 
further release; and this delayed negative feedback gives rise to oscillations (see 
Figure 15-28D). 

One can follow the effect of the Ca 2+ oscillations on specific Ca 2+ - sensitive 
proteins by using real-time imaging of individual cells expressing fluorescent 
reporter proteins. One can show, for example, that each Ca 2+ spike in some sig- 
nal-induced responses will recruit PKC transiently to the plasma membrane, 
where it will transiently phosphorylate a reporter protein. 

The frequency of the Ca 2+ oscillations reflects the strength of the extracellu- 
lar stimulus (see Figure 15-42), and this frequency can be translated into a fre- 
quency-dependent cell response. In some cases, the frequency-dependent 
response itself is also oscillatory: in hormone-secreting pituitary cells, for exam- 
ple, stimulation by an extracellular signal induces repeated Ca 2+ spikes, each of 
which is associated with a burst of hormone secretion. In other cases, the fre- 
quency-dependent response is non- oscillatory: in some types of cells, for 
instance, one frequency of Ca 2+ spikes activates the transcription of one set of 
genes, while a higher frequency activates the transcription of a different set. 
How do cells sense the frequency of Ca 2+ spikes and change their response 
accordingly? The mechanism presumably depends on Ca 2+ - sensitive proteins 
that change their activity as a function of Ca 2+ spike frequency. A protein kinase 
that acts as a molecular memory device seems to have this remarkable property, 
as we discuss next. 



Ca 2+ /Calmodulin-Dependent Protein Kinases (CaM-Kinases) 
Mediate Many of the Responses to Ca 2+ Signals in Animal Cells 

Various Ca 2+ -binding proteins help to relay the cytosolic Ca 2+ signal. The most 
important is calmodulin, which is found in all eucaryotic cells and can consti- 
tute as much as 1% of the total protein mass. Calmodulin functions as a multi- 
purpose intracellular Ca 2+ receptor, governing many Ca 2+ -regulated processes. 
It consists of a highly conserved, single polypeptide chain with four high-affin- 
ity Ca 2+ -binding sites (Figure 15-43A). When activated by Ca 2+ binding, it 
undergoes a conformational change. Because two or more Ca 2+ ions must bind 
before calmodulin adopts its active conformation, the protein responds in an 
almost switchlike manner to increasing concentrations of Ca 2+ (see Figure 
15-25): a tenfold increase in Ca 2+ concentration typically causes a fiftyfold 
increase in calmodulin activation. 

The allosteric activation of calmodulin by Ca 2+ is analogous to the allosteric 
activation of PKA by cyclic AMP, except that Ca 2+ / calmodulin has no enzymatic 
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Figure 1 5-43 The structure of 
Ca 2+ /calmodulin based on x-ray 
diffraction and NMR studies. <CTTC> 
(A) The molecule has a dumbbell shape, 
with two globular ends, which can bind to 
many target proteins. The globular ends are 
connected by a long, exposed a helix, 
which allows the protein to adopt a number 
of very different conformations, depending 
on the target protein it interacts with. Each 
globular head has two Ca 2+ -binding 
domains. (B) The major structural change in 
Ca 2+ /calmodulin that occurs when it binds 
to a target protein (in this example, a 
peptide that consists of the 
Ca 2+ /calmodulin-binding domain of a 
Ca 2+ /calmodulin-dependent protein 
kinase). Note that the Ca 2+ /calmodulin has 
"jack-knifed" to surround the peptide. 
When it binds to other targets, it can adopt 
different conformations. (A, based on x-ray 
crystallographic data from Y.S. Babu et al., 
Nature 3 15:37-40, 1985. With permission 
from Macmillan Publishers Ltd; B, based on 
x-ray crystallographic data from 
W.E. Meador, A.R. Means and F.A. Quiocho, 
Science 257:1 251-1 255, 1 992, and on NMR 
data from M. Ikura et al., Science 
256:632-638, 1992. With permission 
from AAAS.) 
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activity itself but instead acts by binding to and activating other proteins. In 
some cases, calmodulin serves as a permanent regulatory subunit of an enzyme 
complex, but usually the binding of Ca 2+ instead enables calmodulin to bind to 
various target proteins in the cell to alter their activity 

When an activated molecule of Ca 2+ /calmodulin binds to its target protein, 
the calmodulin further changes its conformation, the nature of which depends 
on the specific target protein (Figure 15-43B). Among the many targets calmod- 
ulin regulates are enzymes and membrane transport proteins. As one example, 
Ca 2+ /calmodulin binds to and activates the plasma membrane Ca 2+ -pump that 
uses ATP hydrolysis to pump Ca 2+ out of cells (see Figure 15-41). Thus, whenever 
the concentration of Ca 2+ in the cytosol rises, the pump is activated, which helps 
to return the cytosolic Ca 2+ level to resting levels. 

Many effects of Ca 2+ , however, are more indirect and are mediated by pro- 
tein phosphorylations catalyzed by a family of serine /threonine protein kinases 
called Ca 2+ /calmodulin-dependent kinases (CaM-kinases). Some CaM-kinases 
phosphorylate gene regulatory proteins, such as the CREB protein (see Figure 
15-36), and in this way activate or inhibit the transcription of specific genes. 

One of the best- studied CaM-kinases is CaM-kinase II, which is found in 
most animal cells but is especially enriched in the nervous system. It constitutes 
up to 2% of the total protein mass in some regions of the brain, and it is highly 
concentrated in synapses. CaM-kinase II has two remarkable properties that are 
related. First, it can function as a molecular memory device, switching to an 
active state when exposed to Ca 2+ /calmodulin and then remaining active even 
after the Ca 2+ signal has decayed. This is because the kinase phosphorylates 
itself (a process called autophosphorylation) , as well as other cell proteins, when 
Ca 2+ /calmodulin activates it. In its autophosphorylated state, the enzyme 
remains active even in the absence of Ca 2+ , thereby prolonging the duration of 
the kinase activity beyond that of the initial activating Ca 2+ signal. The enzyme 
maintains this activity until serine /threonine protein phosphatases overwhelm 
the autophosphorylation and shut the kinase off (Figure 15-44). CaM-kinase II 
activation can thereby serve as a memory trace of a prior Ca 2+ pulse, and it 
seems to have a role in some types of memory and learning in the vertebrate 
nervous system. Mutant mice that lack a brain- specific form of the enzyme have 
specific defects in their ability to remember where things are. 
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Figure 15-44 The stepwise activation of 
CaM-kinase II. The enzyme is a large 
protein complex of 12 subunits, 
although, for simplicity, only one subunit 
is shown (in gray). In the absence of 
Ca 2+ /calmodulin, the enzyme is inactive 
as the result of an interaction between an 
inhibitory domain and the catalytic 
domain. Ca 2+ /calmodulin binding alters 
the conformation of the protein, partially 
activating it. The catalytic domains in the 
complex phosphorylate the inhibitory 
domains of neighboring subunits, as well 
as other proteins in the cell (not shown). 
The autophosphorylation of the enzyme 
complex (by mutual phosphorylation of 
its subunits) fully activates the enzyme. It 
also prolongs the activity of the enzyme 
in two ways. First, it traps the bound 
Ca 2+ /calmodulin so that it does not 
dissociate from the enzyme complex 
until cytosolic Ca 2+ levels return to basal 
values for at least 10 seconds (not 
shown). Second, it converts the enzyme 
to a Ca 2+ -independent form so that the 
kinase remains active even after the 
Ca 2+ /calmodulin dissociates from it. This 
activity continues until the action of a 
protein phosphatase overrides the 
autophosphorylation activity of 
CaM-kinase II. 
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(A) low frequency Ca oscillations 
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The second remarkable property of CaM-kinase II follows from the first, in 
that the enzyme can use its intrinsic memory mechanism to act as a frequency 
decoder of Ca 2+ oscillations. This property is thought to be especially important 
at a nerve cell synapse, where changes in intracellular Ca 2+ levels in a postsy- 
naptic cell as a result of neural activity can lead to long-term changes in the sub- 
sequent effectiveness of that synapse (discussed in Chapter 11). When CaM- 
kinase II is immobilized on a solid surface and exposed to both a protein phos- 
phatase and repetitive pulses of Ca 2+ /calmodulin at different frequencies that 
mimic those observed in stimulated cells, the enzyme's activity increases steeply 
as a function of pulse frequency (Figure 15-45). Moreover, the frequency 
response of this multisubunit enzyme depends on its exact subunit composi- 
tion, so that a cell can tailor its response to Ca 2+ oscillations to particular needs 
by adjusting the composition of the CaM-kinase II enzyme that it makes. 



Figure 15-45 CaM-kinase II as a 
frequency decoder of Ca 2+ oscillations. 

(A) At low frequencies of Ca 2+ spikes, the 
enzyme becomes inactive after each 
spike, as the autophosphorylation 
induced by Ca 2+ /calmodulin binding 
does not maintain the enzyme's activity 
long enough for the enzyme to remain 
active until the next Ca 2+ spike arrives. 

(B) At higher spike frequencies, however, 
the enzyme fails to inactivate completely 
between Ca 2+ spikes, so its activity 
ratchets up with each spike. If the spike 
frequency is high enough, this 
progressive increase in enzyme activity 
will continue until the enzyme is 
autophosphorylated on all subunits and 
is therefore maximally activated. 
Although not shown, once enough of its 
subunits are autophosphorylated, the 
enzyme can be maintained in a highly 
active state even with a relatively low 
frequency of Ca 2+ spikes (a form of cell 
memory). 

The binding of Ca 2+ /calmodulin to the 
enzyme is enhanced by the CaM-kinase II 
autophosphorylation (an additional form 
of positive feedback), helping the 
response of the enzyme to repeated Ca 2+ 
spikes to exhibit a steep threshold in its 
frequency response, as discussed earlier. 
(From P.I. Hanson, T. Meyer, L Stryer and 
H. Schulman, Neuron 1 2:943-956, 1 994. 
With permission from Elsevier.) 



Some G Proteins Directly Regulate Ion Channels 

G proteins do not act exclusively by regulating the activity of membrane- 
bound enzymes that alter the concentration of cyclic AMP or Ca 2+ in the 
cytosol. The a subunit of one type of G protein (called G12), for example, acti- 
vates a guanine nucleotide exchange factor (GEF) that activates a monomeric 
GTPase of the Rho family (discussed later and in Chapter 16), which regulates 
the actin cytoskeleton. 

In some other cases, G proteins directly activate or inactivate ion channels in 
the plasma membrane of the target cell, thereby altering the ion permeability — 
and hence the electrical excitability — of the membrane. As an example, acetyl- 
choline released by the vagus nerve reduces both the rate and strength of heart 
muscle cell contraction (see Figure 15-9B). This effect is mediated by a special 
class of acetylcholine receptors that activate the Gi protein discussed earlier. 
Once activated, the a subunit of Gi inhibits adenylyl cyclase (as described previ- 
ously), while the (3y subunits bind to K + channels in the heart muscle cell plasma 
membrane and open them. The opening of these K + channels makes it harder to 
depolarize the cell and thereby contributes to the inhibitory effect of acetyl- 
choline on the heart. (These acetylcholine receptors, which can be activated by 
the fungal alkaloid muscarine, are called muscarinic acetylcholine receptors to 
distinguish them from the very different nicotinic acetylcholine receptors, which 
are ion-channel-coupled receptors on skeletal muscle and nerve cells that can be 
activated by the binding of nicotine, as well as by acetylcholine.) 

Other G proteins regulate the activity of ion channels less directly, either by 
stimulating channel phosphorylation (by PKA, PKC, or CaM-kinase, for example) 
or by causing the production or destruction of cyclic nucleotides that directly 
activate or inactivate ion channels. These cyclic- nucleotide- gated ion channels 
have a crucial role in both smell (olfaction) and vision, as we now discuss. 
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Smell and Vision Depend on GPCRs That Regulate Cyclic- 
Nucleotide-Gated Ion Channels 

Humans can distinguish more than 10,000 distinct smells, which they detect 
using specialized olfactory receptor neurons in the lining of the nose. These cells 
use specific GPCRs called olfactory receptors to recognize odors; the receptors 
are displayed on the surface of the modified cilia that extend from each cell (Fig- 
ure 15-46). The receptors act through cyclic AMP. When stimulated by odorant 
binding, they activate an olfactory- specific G protein (known as G if), which in 
turn activates adenylyl cyclase. The resulting increase in cyclic AMP opens 
cyclic- AMP -gated cation channels, thereby allowing an influx of Na + , which 
depolarizes the olfactory receptor neuron and initiates a nerve impulse that 
travels along its axon to the brain. 

There are about 1000 different olfactory receptors in a mouse and about 350 
in a human, each encoded by a different gene and each recognizing a different set 
of odorants. Each olfactory receptor neuron produces only one of these receptors 
(see p. 453); the neuron responds to a specific set of odorants by means of the 
specific receptor it displays, and each odorant activates its own characteristic set 
of olfactory receptor neurons. The same receptor also helps direct the elongating 
axon of each developing olfactory neuron to the specific target neurons that it 
will connect to in the brain. A different set of GPCRs acts in a similar way to medi- 
ate responses to pheromones, chemical signals detected in a different part of the 
nose that are used in communication between members of the same species. 
Humans, however, lack functional pheromone receptors. 

Vertebrate vision employs a similarly elaborate, highly sensitive, signal 
detection process. Cyclic-nucleotide-gated ion channels are also involved, but 
the crucial cyclic nucleotide is cyclic GMP (Figure 15-47) rather than cyclic 
AMP. As with cyclic AMP, a continuous rapid synthesis (by guanylyl cyclase) and 
rapid degradation (by cyclic GMP phosphodiesterase) controls the concentration 
of cyclic GMP in the cytosol. 

In visual transduction responses, which are the fastest G-protein-mediated 
responses known in vertebrates, the receptor activation stimulated by light causes 
a fall rather than a rise in the level of the cyclic nucleotide. The pathway has been 
especially well studied in rod photoreceptors (rods) in the vertebrate retina. Rods 
are responsible for noncolor vision in dim light, whereas cone photoreceptors 
(cones) are responsible for color vision in bright light. A rod photoreceptor is a 
highly specialized cell with outer and inner segments, a cell body, and a synaptic 
region where the rod passes a chemical signal to a retinal nerve cell (Figure 
15-48). This nerve cell relays the signal to another nerve cell in the retina, which 
in turn relays it to the brain (see Figure 23-16). 

The phototransduction apparatus is in the outer segment of the rod, which 
contains a stack of discs, each formed by a closed sac of membrane that has 
many embedded photosensitive rhodopsin molecules. The plasma membrane 
surrounding the outer segment contains cyclic- GMP '-gated cation channels. 
Cyclic GMP bound to these channels keeps them open in the dark. Paradoxically, 



Figure 15-46 Olfactory receptor neurons. 

(A) A section of olfactory epithelium in the 
nose. Olfactory receptor neurons possess 
modified cilia, which project from the 
surface of the epithelium and contain the 
olfactory receptors, as well as the signal 
transduction machinery. The axon, which 
extends from the opposite end of the 
receptor neuron, conveys electrical signals 
to the brain when an odorant activates the 
cell to produce an action potential. In 
rodents, at least, the basal cells act as stem 
cells, producing new receptor neurons 
throughout life, to replace the neurons that 
die. (B) A scanning electron micrograph of 
the cilia on the surface of an olfactory 
neuron. (B, from E.E. Morrison and 
R.M. Costanzo, J. Comp. Neurol. 297:1-13, 
1 990. With permission from Wiley-Liss.) 
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Figure 15-47 Cyclic GMP. 



91 8 Chapter 1 5: Mechanisms of Cell Communication 



Figure 1 5-48 A rod photoreceptor cell. There are about 1 000 discs in the 
outer segment. The disc membranes are not connected to the plasma 
membrane. The inner and outer segments are specialized parts of a 
primary cilium (discussed in Chapter 16); as mentioned earlier and 
discussed later, a primary cilium extends from the surface of most 
vertebrate cells, where it serves as a signaling organelle. 



light causes a hyperpolarization (which inhibits synaptic signaling) rather than 
a depolarization of the plasma membrane (which would stimulate synaptic sig- 
naling). Hyperpolarization (that is, the membrane potential moves to a more 
negative value — discussed in Chapter 11) results because the light-induced acti- 
vation of rhodopsin molecules in the disc membrane decreases the cyclic GMP 
concentration and closes the cation channels in the surrounding plasma mem- 
brane (Figure 15-49). 

Rhodopsin is a member of the GPCR family, but the activating extracellular 
signal is not a molecule but a photon of light. Each rhodopsin molecule contains 
a covalently attached chromophore, 11- cis retinal, which isomerizes almost 
instantaneously to all- trans retinal when it absorbs a single photon. The isomer- 
ization alters the shape of the retinal, forcing a conformational change in the 
protein (opsin). The activated rhodopsin molecule then alters the conformation 
of the G-protein transducin (G t ), causing the transducin a sub unit to activate 
cyclic GMP phosphodiesterase. The phosphodiesterase then hydrolyzes cyclic 
GMP, so that cyclic GMP levels in the cytosol fall. This drop in cyclic GMP con- 
centration decreases the amount of cyclic GMP bound to the plasma membrane 
cation channels, allowing more of these cyclic -GMP- sensitive channels to close. 
In this way, the signal quickly passes from the disc membrane to the plasma 
membrane, and a light signal is converted into an electrical one, through a 
hyperpolarization of the rod cell plasma membrane. 

Rods use several negative feedback loops to allow the cells to revert quickly to 
a resting, dark state in the aftermath of a flash of light — a requirement for per- 
ceiving the shortness of the flash. A rhodopsin- specific kinase called rhodopsin 
kinase (RK) phosphorylates the cytosolic tail of activated rhodopsin on multiple 
serines, partially inhibiting the ability of the rhodopsin to activate transducin. An 
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Figure 1 5-49 The response of a rod 
photoreceptor cell to light. Rhodopsin 
molecules in the outer-segment discs 
absorb photons. Photon absorption 
closes cation channels in the plasma 
membrane, which hyperpolarizes the 
membrane and reduces the rate of 
neurotransmitter release from the 
synaptic region. Because the 
neurotransmitter inhibits many of the 
postsynaptic retinal neurons, illumination 
serves to free the neurons from inhibition 
and thus, in effect, excites them. 
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Table 15-3 Four Major Families of Trimeric G Proteins* 




FAMILY 


SOME FAMILY 


SUBUNITSTHAT 


SOME FUNCTIONS 




MEMBERS 






1 


G s 


a 


activates adenylyl cyclase; activates Ca 2+ channels 




G if 


a 


activates adenylyl cyclase in olfactory sensory neurons 


II 


Gi 


a 


inhibits adenylyl cyclase 






Py 


activates K + channels 




Go 


Py 


activates K + channels; inactivates Ca 2+ channels 






a and py 


activates phospholipase C-p 




G t (transducin) 


a 


activates cyclic GMP phosphodiesterase in vertebrate rod 
photoreceptors 


III 


Gq 


a 


activates phospholipase C-p 


IV 


Gl2/13 


a 


activates Rho family monomeric GTPases (via Rho-GEF) to 
regulate the actin cytoskeleton 



^Families are determined by amino acid sequence relatedness of the a subunits. Only selected examples are included. About 20 a subunits 
and at least 6 P subunits and 1 1 ysubunits have been described in humans. 

inhibitory protein called arrestin then binds to the phosphorylated rhodopsin, 
further inhibiting rhodopsiris activity. Mice or humans with a mutation that inac- 
tivates the gene encoding RK have a prolonged light response, and their rods 
eventually die. 

At the same time as arrestin shuts off rhodopsin, an RGS protein (see p. 896) 
binds to activated transducin, stimulating the transducin to hydrolyze its bound 
GTP to GDP, which returns transducin to its inactive state. In addition, the cation 
channels that close in response to light are permeable to Ca 2+ , as well as to Na + , 
so that when they close, the normal influx of Ca 2+ is inhibited, causing the Ca 2+ 
concentration in the cytosol to fall. The decrease in Ca 2+ concentration stimu- 
lates guanylyl cyclase to replenish the cyclic GMP, rapidly returning its level to 
where it was before the light was switched on. A specific Ca 2+ - sensitive protein 
mediates the activation of guanylyl cyclase in response to a fall in Ca 2+ levels. In 
contrast to calmodulin, this protein is inactive when Ca 2+ is bound to it and 
active when it is Ca 2+ -free. It therefore stimulates the cyclase when Ca 2+ levels 
fall following a light response. 

Negative feedback mechanisms do more than just return the rod to its rest- 
ing state after a transient light flash; they also help the rod to adapt, stepping 
down the response when the rod is exposed to light continuously. Adaptation, as 
we discussed earlier, allows the receptor cell to function as a sensitive detector 
of changes in stimulus intensity over an enormously wide range of baseline lev- 
els of stimulation. It is why we can see a camera flash in broad daylight. 

The various trimeric G proteins we have discussed in this chapter fall into 
four major families, as summarized in Table 15-3. 



Intracellular Mediators and Enzymatic Cascades Amplify 
Extracellular Signals 

Despite the differences in molecular details, the different intracellular signaling 
pathways that GPCRs trigger share certain features and obey similar general 
principles. They depend on relay chains of intracellular signaling proteins and 
small intracellular mediators. In contrast to the more direct signaling pathways 
used by nuclear receptors discussed earlier, these relay chains provide numer- 
ous opportunities for amplifying the responses to extracellular signals. In the 
visual transduction cascade, for example, a single activated rhodopsin 
molecule catalyzes the activation of hundreds of molecules of transducin at a 
rate of about 1000 transducin molecules per second. Each activated transducin 
molecule activates a molecule of cyclic GMP phosphodiesterase, each of which 
hydrolyzes about 4000 molecules of cyclic GMP per second. This catalytic cas- 
cade lasts for about 1 second and results in the hydrolysis of more than 10 5 
cyclic GMP molecules for a single quantum of light absorbed, and the resulting 
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drop in the concentration of cyclic GMP in turn transiently closes hundreds of 
cation channels in the plasma membrane (Figure 15-50). As a result, a rod cell 
can respond to even a single photon of light in a way that is highly reproducible 
in its timing and magnitude. 

Likewise, when an extracellular signal molecule binds to a receptor that 
indirectly activates adenylyl cyclase via G s , each receptor protein may activate 
many molecules of G s protein, each of which can activate a cyclase molecule. 
Each cyclase molecule, in turn, can catalyze the conversion of a large number of 
ATP molecules to cyclic AMP molecules. A similar amplification operates in the 
inositol phospholipid signaling pathway. In these ways, a nanomolar (10~ 9 M) 
change in the concentration of an extracellular signal can induce micromolar 
(10~ 6 M) changes in the concentration of a small intracellular mediator such as 
cyclic AMP or Ca 2+ . Because these mediators function as allosteric effectors to 
activate specific enzymes or ion channels, a single extracellular signal molecule 
can alter many thousands of protein molecules within the target cell. 

Any such amplifying cascade of stimulatory signals requires counterbalanc- 
ing mechanisms at every step of the cascade to restore the system to its resting 
state when stimulation ceases. As emphasized earlier, the response to stimula- 
tion can be rapid only if the inactivating mechanisms are also rapid. Cells there- 
fore have efficient mechanisms for rapidly degrading (and resynthesizing) cyclic 
nucleotides and for buffering and removing cytosolic Ca 2+ , as well as for inacti- 
vating the responding enzymes and ion channels once they have been activated. 
This is not only essential for turning a response off, it is also important for defin- 
ing the resting state from which a response begins. 

Each protein in the signaling relay chain can be a separate target for regula- 
tion, including the receptor itself, as we discuss next. 



GPCR Desensitization Depends on Receptor Phosphorylation 

When target cells are exposed to a high concentration of a stimulating ligand for 
a prolonged period, they can become desensitized, or adapted, in several differ- 
ent ways. An important class of desensitization mechanisms depend on alter- 
ation of the quantity or condition of the receptor molecules themselves. 

For GPCRs, there are three general modes of desensitization (see Figure 
15-29): (1) In receptor inactivation, they become altered so that they can no 
longer interact with G proteins. (2) In receptor sequestration, they are temporar- 
ily moved to the interior of the cell (internalized) so that they no longer have 
access to their ligand. (3) In receptor down-regulation, they are destroyed in lyso- 
somes after internalization. 

In each case, the desensitization of the GPCRs depends on their phosphory- 
lation by PKA, PKC, or a member of the family of GPCR kinases (GRKs), which 
includes the rhodopsin-specific kinase RK involved in rod photoreceptor desen- 
sitization discussed earlier. The GRKs phosphorylate multiple serines and thre- 
onines on a GPCR, but they do so only after ligand binding has activated the 
receptor, because it is the activated receptor that allosterically activates the GRK. 
As with rhodopsin, once a receptor has been phosphorylated by a GRK, it binds 
with high affinity to a member of the arrestin family of proteins (Figure 15-51). 
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Figure 15-50 Amplification in the light- 
induced catalytic cascade in vertebrate 
rods. The red arrows indicate the steps 
where amplification occurs, with the 
thickness of the arrow roughly indicating 
the magnitude of the amplification. 



activated 
GPCR 



*= 



ACTIVATED 
GPCR STIMULATES 

GRK TO 

PHOSPHORYLATE 

THE GPCR ON 

MULTIPLE SITES 




ARRESTIN BINDS 

TO 

PHOSPHORYLATED 

GPCR 



ADP 



GPCR kinase (GRK) 




arrestin 



Figure 1 5-51 The roles of GPCR kinases 
(GRKs) and arrestins in GPCR 
desensitization. A GRK phosphorylates 
only activated receptors because it is the 
activated GPCR that activates the GRK. 
The binding of an arrestin to the 
phosphorylated receptor prevents the 
receptor from binding to its G protein 
and also directs its endocytosis (not 
shown). Mice that are deficient in one 
form of arrestin fail to desensitize in 
response to morphine, for example, 
attesting to the importance of arrestins 
for desensitization. 
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The bound arrestin can contribute to the desensitization process in at least 
two ways. First, it prevents the activated receptor from interacting with G pro- 
teins. Second, it serves as an adaptor protein to help couple the receptor to the 
clathrin-dependent endocytosis machinery (discussed in Chapter 13), inducing 
receptor-mediated endocytosis. The fate of the internalized GPCR-arrestin 
complex depends on other proteins in the complex. In some cases, the receptor 
is dephosphorylated and recycled back to the plasma membrane for reuse. In 
others, it is ubiquitylated and degraded in lysosomes (discussed later). 

Receptor endocytosis does not necessarily stop the receptor from signaling. 
In some cases, the bound arrestin recruits other signaling proteins to relay the 
signal onward along new pathways from the internalized GPCRs. 



Summary 

GPCRs can indirectly activate or inactivate either plasma-membrane-bound enzymes 
or ion channels via G proteins. When an activated receptor stimulates a G protein, the 
G protein undergoes a conformational change that activates both its a subunitand its 
py subunits, which can then directly regulate the activity of target proteins in the 
plasma membrane. Some GPCRs either activate or inactivate adenylyl cyclase, thereby 
altering the intracellular concentration of the small intracellular mediator cyclic AMP 
Others activate a phosphoinositide- specific phospholipase C (PLC/5), which hydrolyzes 
phosphatidylinositol 4,5-bisphosphate [PI(4,5)P2] to generate two small intracellular 
mediators. One is inositol 1,4,5-trisphosphate (IP3), which releases Ca 2+ from the ER 
and thereby increases the concentration ofCa 2+ in the cytosol. The other is diacylglyc- 
erol, which remains in the plasma membrane and helps activate protein kinase C 
(PKC). An increase in cytosolic cyclic AMP or Ca 2+ levels affects cells mainly by stimu- 
lating protein kinase A (PKA) and Ca 2+ / calmodulin- dependent protein kinases (CaM- 
kinases), respectively. 

PKC, PKA, and CaM-kinases phosphorylate specific target proteins on serines or 
threonines and thereby alter the activity of the proteins. Each type of cell has its own 
characteristic set of target proteins that is regulated in these ways, enabling the cell to 
make its own distinctive responses to the small intracellular mediators. The intracel- 
lular signaling cascades activated by GPCRs greatly amplify the responses, so that 
many thousands of target protein molecules are changed for each molecule of extra- 
cellular signaling ligand bound to its receptor. 

The responses mediated by GPCRs are rapidly turned off when the extracellular 
signal is removed. Thus, the G-protein a subunit is stimulated by its target protein or 
an RGS to inactivate itself by hydrolyzing its bound GTP to GDP, IP3 is rapidly dephos- 
phorylated by a lipid phosphatase or phosphorylated by a lipid kinase, cyclic 
nucleotides are hydrolyzed by phosphodiesterases, Ca 2+ is rapidly pumped out of the 
cytosol, and phosphorylated proteins are dephosphorylated by protein phosphatases. 
Activated GPCRs themselves are phosphorylated by GRKs, thereby trigging arrestin 
binding. The bound arrestin uncouples the receptors from G proteins and promotes 
their endocytosis, resulting in either desensitization or continued signaling via 
arrestin-recruited signaling proteins. 



SIGNALING THROUGH ENZYME-COUPLED 
CELL-SURFACE RECEPTORS 

Like GPCRs, enzyme- coupled receptors are transmembrane proteins with their 
ligand-binding domain on the outer surface of the plasma membrane. Instead 
of having a cytosolic domain that associates with a trimeric G protein, however, 
their cytosolic domain either has intrinsic enzyme activity or associates directly 
with an enzyme. Whereas a GPCR has seven transmembrane segments, each 
subunit of an enzyme -coupled receptor usually has only one. GPCRs and 
enzyme-coupled receptors often activate some of the same signaling pathways, 
and there is usually no obvious reason why a particular extracellular signal uti- 
lizes one class of receptors rather than the other. 
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There are six principal classes of enzyme -coupled receptors: 

1. Receptor tyrosine kinases directly phosphorylate specific tyrosines on 
themselves and on a small set of intracellular signaling proteins. 

2. Tyrosine-kinase- associated receptors have no intrinsic enzyme activity but 
directly recruit cytoplasmic tyrosine kinases to relay the signal. 

3. Receptor serine/threonine kinases directly phosphorylate specific serines or 
threonines on themselves and on latent gene regulatory proteins with 
which they are associated. 

4. Histidine-kinase-associated receptors activate a two-component signaling 
pathway in which the kinase phosphorylates itself on histidine and then 
immediately transfers the phosphoryl group to a second intracellular sig- 
naling protein. 

5. Receptor guanylyl cyclases directly catalyze the production of cyclic GMP in 
the cytosol, which acts as a small intracellular mediator in much the same 
way as cyclic AMP. 

6. Receptorlike tyrosine phosphatases remove phosphate groups from 
tyrosines of specific intracellular signaling proteins. (They are called 
"receptorlike" because their presumptive ligands have not yet been identi- 
fied, and so their receptor function is unproven.) 

We focus our discussion on the first four classes, beginning with the recep- 
tor tyrosine kinases, the most numerous of the enzyme -coupled receptors. 



Activated Receptor Tyrosine Kinases (RTKs) Phosphorylate 
Themselves 

Many extracellular signal proteins act through receptor tyrosine kinases 
(RTKs). Notable examples discussed elsewhere in this book include epidermal 
growth factor (EGF), platelet- derived growth factor (PDGF), fibroblast growth fac- 
tors (FGFs), hepatocyte growth factor (HGF), insulin, insulinlike growth factor- 1 
(IGF1), vascular endothelial growth factor (VEGF), macrophage-colony-stimulat- 
ing factor (MCSF), and the neurotrophins, including nerve growth factor (NGF). 

Many cell-surface-bound extracellular signal proteins also act through 
RTKs. Ephrins are the largest class of such membrane-bound ligands, with eight 
identified in humans. Among their many functions, they help guide the migra- 
tion of cells and axons along specific pathways during animal development by 
stimulating responses that result in attraction or repulsion (discussed later and 
in Chapter 22). The receptors for ephrins, called Eph receptors, are also among 
the most numerous RTKs, with thirteen genes encoding them in humans. The 
ephrins and Eph receptors are unusual in that they can simultaneously act as 
both ligand and receptor: on binding to an Eph receptor, some ephrins not only 
activate the Eph receptor but also become activated themselves to transmit sig- 
nals into the interior of the ephrin- expressing cell; in this way, the behavior of 
both the signaling cell and the target cell is altered. Such bidirectional signaling 
between ephrins and Eph receptors is required, for example, to keep some 
neighboring groups of cells from mixing with each other during development. 

There are about 60 genes encoding human RTKs. These receptors can be 
classified into more than 16 structural subfamilies, each dedicated to its com- 
plementary family of protein ligands. Figure 15-52 shows a number of the fam- 
ilies that operate in mammals, and Table 15-4 lists some of their ligands and 
functions. In all cases, the binding of the signal protein to the ligand-binding 
domain on the outside of the cell enables the intracellular tyrosine kinase 
domain to phosphorylate selected tyrosine side chains, both on the receptor 
proteins themselves and on intracellular signaling proteins that subsequently 
bind to the phosphorylated tyrosines on the receptors. 

How does the binding of an extracellular ligand activate the kinase domain 
on the other side of the plasma membrane? For a GPCR, ligand binding is 
thought to change the relative orientation of several of the transmembrane a 
helices, thereby shifting the position of the cytoplasmic loops relative to one 
another. It is difficult to imagine, however, how a conformational change could 
propagate across the lipid bilayer through a single transmembrane a helix. 
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Figure 15-52 Some subfamilies of RTKs. 

Only one or two members of each 
subfamily are indicated. Note that the 
tyrosine kinase domain is interrupted by 
a "kinase insert region" in some of the 
subfamilies. The functional roles of most 
of the cysteine-rich, immunoglobulin- 
like, and fibronectin-type Ill-like domains 
are not known. Some of the ligands for 
the receptors shown are listed in Table 
15-4, along with some representative 
responses that they mediate. 



Instead, for many RTKs, ligand binding causes the receptor chains to dimerize, 
bringing the kinase domains of two receptor chains together (an example of 
induced proximity, discussed earlier) so that they can become activated and 
cross-phosphorylate each other on multiple tyrosines, a process referred to as 
transautophosphorylation (Figure 15-53A). 

Because of the requirement for receptor dimerization, it is relatively easy to 
inactivate a specific RTK to determine its importance for a cell response. For this 
purpose, cells are transfected with DNA encoding a mutant form of the receptor 
that dimerizes normally but has an inactive kinase domain. When coexpressed 
at a high level with normal receptors, the mutant receptor acts in a dominant- 
negative way, disabling the normal receptors by forming inactive dimers with 
them (Figure 15-53B). 



Phosphorylated Tyrosines on RTKs Serve as Docking Sites for 
Intracellular Signaling Proteins 

The cross-phosphorylation of adjacent cytosolic tails of RTKs contributes to the 
receptor activation process in two ways. First, phosphorylation of tyrosines 
within the kinase domain increases the kinase activity of the enzyme. Second, 



Table 15-4 Some Signal Proteins That Act Via RTKs 




SIGNAL PROTEIN 


RECEPTORS 


SOME REPRESENTATIVE RESPONSES 


Epidermal growth factor (EGF) 


EGF receptors 


stimulates cell survival, growth, proliferation, or 
differentiation of various cell types; acts as inductive 
signal in development 


Insulin 


insulin receptor 


stimulates carbohydrate utilization and protein synthesis 


Insulin-like growth factors (IGF1 


IGF receptor-1 


stimulate cell growth and survival in many cell types 


and IGF2) 






Nerve growth factor (NGF) 


TrkA 


stimulates survival and growth of some neurons 


Platelet-derived growth factors 


PDGF receptors (a and p) 


stimulate survival, growth, proliferation, and migration of 


(PDGF AA, BB, AB) 




various cell types 


Macrophage-colony-stimulating 


MCSF receptor 


stimulates monocyte/macrophage proliferation and 


factor (MCSF) 




differentiation 


Fibroblast growth factors (FGF1 to 


FGF receptors (FGFR1-FGFR4, 


stimulate proliferation of various cell types; inhibit 


FGF24) 


plus multiple isoforms 


differentiation of some precursor cells; act as inductive 




of each) 


signals in development 


Vascular endothelial growth factor 


VEGF receptors 


stimulates angiogenesis 


(VEGF) 






Ephrins (A and B types) 


Eph receptors (A and B types) 


stimulate angiogenesis; guide cell and axon migration 
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phosphorylation of tyrosines outside the kinase domain creates high-affinity 
docking sites for the binding of specific intracellular signaling proteins. Each of 
the signaling proteins binds to specific phosphorylated sites on the activated 
receptors because it contains a specific phosphotyrosine-binding domain that 
recognizes surrounding features of the polypeptide chain in addition to the 
phosphotyrosine. 

Once bound to the activated RTK, a signaling protein may itself become 
phosphorylated on tyrosines and thereby be activated. In many cases, however, 
the binding alone may be sufficient to activate the docked signaling protein, by 
either inducing a conformational change in the protein or simply bringing it 
near the protein that is next in the signaling pathway. Thus, transautophospho- 
rylation serves as a switch to trigger the transient assembly of an intracellular 
signaling complex, which can then relay the signal onward, often along multiple 
routes, to various destinations in the cell (Figure 15-54). Because different RTKs 
bind different combinations of these signaling proteins, they activate different 
responses. 

The receptors for insulin and IGF1 act in a slightly different way. They are 
tetramers (see Figure 15-52), and ligand binding is thought to rearrange their 
transmembrane receptor chains, moving the two kinase domains close together. 
Moreover, most of the phosphotyrosine docking sites generated by ligand bind- 
ing are not on the receptor itself but on a specialized docking protein called 
insulin receptor substrate-1 (IRS1). The activated receptor first transautophos- 
phorylates its kinase domains, which then phosphorylate IRS1 on multiple 
tyrosines, thereby creating many more docking sites than could be accommo- 
dated on the receptor alone (see Figure 15-22). Some other RTKs use docking 
proteins in a similar way to enlarge the size of the signaling complex. 



Figure 15-53 Activation and 
inactivation of RTKs by dimerization. 

(A) The normal receptors dimerize in 
response to ligand binding. The two 
kinase domains cross-phosphorylate 
each other, increasing the activity of the 
kinase domains, which now 
phosphorylate other sites on the 
receptors. (B) The mutant receptor with 
an inactivated kinase domain can 
dimerize normally, but it cannot cross- 
phosphorylate a normal receptor in a 
dimer. For this reason, the mutant 
receptors, if present in excess, will block 
signaling by the normal receptors — a 
process called dominant-negative 
regulation. Cell biologists frequently use 
this strategy to inhibit a specific type of 
RTK in a cell to determine its normal 
function. A similar approach can be used 
to inhibit the function of various other 
types of receptors and intracellular 
signaling proteins that function as dimers 
or larger oligomers. 



Proteins with SH2 Domains Bind to Phosphorylated Tyrosines 

A whole menagerie of intracellular signaling proteins can bind to the phospho- 
tyrosines on activated RTKs (or on docking proteins such as IRS1). They help to 
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Figure 15-54 The docking of 
intracellular signaling proteins on 
phosphotyrosines on an activated RTK. 

The activated receptor and its bound 
signaling proteins form a signaling 
complex that can then broadcast signals 
along multiple signaling pathways. 
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relay the signal onward, mainly through chains of protein-protein interactions 
mediated by modular interaction domains, as discussed earlier. Some of the 
docked proteins are enzymes, such as phospholipase C-y (PLCy), which func- 
tions in the same way as phospholipase C-p — activating the inositol phospho- 
lipid signaling pathway discussed earlier in connection with GPCRs (see Figures 
15-38 and 15-39). Through this pathway, RTKs can increase cytosolic Ca 2+ levels 
and activate PKC. Another enzyme that docks on these receptors is the cyto- 
plasmic tyrosine kinase Src, which phosphorylates other signaling proteins on 
tyrosines. Yet another is phosphoinositide 3-kinase (PI 3-kinase), which phos- 
phorylates mainly lipids rather than proteins; as we discuss later, the phospho- 
rylated lipids then serve as docking sites to attract various signaling proteins to 
the plasma membrane. 

The intracellular signaling proteins that bind to phosphotyrosines, either on 
activated RTKs or on proteins docked on them, have varied structures and func- 
tions. However, they usually share highly conserved phosphotyrosine-binding 
domains. These can be either SH2 domains (for Src homology region) or, less 
commonly, PTB domains (for phosphotyrosine-binding). By recognizing specific 
phosphorylated tyrosines, these small interaction domains enable the proteins 
that contain them to bind to activated RTKs, as well as to many other intracellu- 
lar signaling proteins that have been transiently phosphorylated on tyrosines 
(Figure 15-55). As previously discussed, many signaling proteins also contain 
other interaction domains that allow them to interact specifically with other 
proteins as part of the signaling process. These domains include the SH3 
domain, which binds to proline-rich motifs in intracellular proteins (see Figure 
15-22). There are about 115 SH2 domains and about 295 SH3 domains encoded 
in the human genome. 

Not all proteins that bind to activated RTKs via SH2 domains help to relay the 
signal onward. Some act to decrease the signaling process, providing negative 
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Figure 1 5-55 The binding of SH2- 
containing intracellular signaling 
proteins to an activated PDGF receptor. 

(A) This drawing of a PDGF receptor 
shows five phosphotyrosine docking 
sites, three in the kinase insert region and 
two on the C-terminal tail, to which the 
three signaling proteins shown bind as 
indicated. The numbers on the right 
indicate the positions of the tyrosines in 
the polypeptide chain. These binding 
sites have been identified by using 
recombinant DNA technology to mutate 
specific tyrosines in the receptor. 
Mutation of tyrosines 1009 and 1021, for 
example, prevents the binding and 
activation of PLCy, so that receptor 
activation no longer stimulates the 
inositol phospholipid signaling pathway. 
The locations of the SH2 {red) and SH3 
(blue) domains in the three signaling 
proteins are indicated. (Additional 
phosphotyrosine docking sites on this 
receptor are not shown, including those 
that bind the cytoplasmic tyrosine kinase 
Src and two adaptor proteins.) It is 
unclear how many signaling proteins can 
bind simultaneously to a single RTK. 

(B) The three-dimensional structure of an 
SH2 domain, as determined by x-ray 
crystallography. The binding pocket for 
phosphotyrosine is shown in yellow on 
the right, and a pocket for binding a 
specific amino acid side chain (isoleucine, 
in this case) is shown in yellow on the left 
(see also Figure 3-39). (C) The SH2 
domain is a compact, "plug-in" module, 
which can be inserted almost anywhere 
in a protein without disturbing the 
protein's folding or function (discussed in 
Chapter 3). Because each domain has 
distinct sites for recognizing 
phosphotyrosine and for recognizing a 
particular amino acid side chain, different 
SH2 domains recognize phosphotyrosine 
in the context of different flanking amino 
acid sequences. (B, based on data from 
G. Waksman et al., Cell 72:779-790, 1 993. 
With permission from Elsevier.) 
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feedback. One example is the c-Cbl protein, which can dock on some activated 
receptors and catalyze their ubiquitylation, covalently adding a single ubiquitin 
molecule to one or more sites on the receptor (called monoubiquityl-ation to 
distinguish it from polyubiquitylation, in which one or more long ubiquitin 
chains are added to a protein). Monoubiquitylation promotes the endocytosis 
and degradation of the receptors in lysosomes — an example of receptor down- 
regulation (see Figure 15-29). Endocytic proteins that contain ubiquitin-interac- 
tion motifs (UIMs) recognize the monoubiquitylated RTKs and direct them into 
clathrin-coated vesicles and, ultimately, into lysosomes (discussed in Chapter 
13). Mutations that inactivate c-Cbl-dependent RTK down- regulation cause pro- 
longed RTK signaling and thereby promote the development of cancer. 

As is the case for GPCRs, ligand-induced endocytosis of RTKs does not 
always decrease signaling. In some cases, RTKs are endocytosed with their 
bound signaling proteins and continue to signal from endosomes or other 
intracellular compartments. This mechanism, for example, allows nerve growth 
factor (NGF) to bind to its specific RTK (called TrkA) at the end of a long nerve 
cell axon and signal to the cell body of the same cell a long distance away. Here, 
signaling endocytic vesicles containing TrkA, with NGF bound on the inside 
and signaling proteins docked on the cytosolic side, are transported along the 
axon to the cell body, where they signal the cell to survive. 

Some signaling proteins are composed almost entirely of SH2 and SH3 
domains and function as adaptors to couple tyrosine-phosphorylated proteins 
to other proteins that do not have their own SH2 domains (see Figure 15-22). 
Adaptor proteins of this type help to couple activated RTKs to the important sig- 
naling protein Ras, a monomeric GTPase that, in turn, can activate various 
downstream signaling pathways, as we now discuss. 



Ras Belongs to a Large Superfamily of Monomeric GTPases 

The Ras superfamily consists of various families of monomeric GTPases, but 
only the Ras and Rho families relay signals from cell- surface receptors (Table 
15-5). By interacting with different intracellular signaling proteins, a single Ras 
or Rho family member can coordinately spread the signal along several distinct 
downstream signaling pathways, thereby acting as a signaling hub. 

There are three major, closely related Ras proteins in humans (H-, K-, and 
N-Ras — see Table 15-5). Although they have subtly different functions, they are 
thought to work in the same way, and we will refer to them simply as Ras. Like 
many monomeric GTPases, Ras contains one or more covalently attached lipid 
groups that help anchor the protein to the cytoplasmic face of the membrane 
where the protein functions, which is mainly the plasma membrane, from 
where it relays signals to other parts of the cell. Ras is often required, for exam- 
ple, when RTKs signal to the nucleus to stimulate cell proliferation or differen- 
tiation, both of which require changes in gene expression. If one inhibits Ras 
function by the microinjection of either neutralizing anti-Ras antibodies or a 
dominant- negative mutant form of Ras, the cell proliferation or differentiation 



Table 1 5-5 The Ras Superfamily of Monomeric GTPases 



FAMILA 


r 


SOME FAMILY MEMBERS 


SOME FUNCTIONS 


Ras 




H-Ras, K-Ras, N-Ras 


relay signals from RTKs 






Rheb 


activates mTOR to stimulate cell growth 






Rep1 


activated by a cyclic-AMP-dependent GEF; influences cell adhesion by 
activating integrins 


Rho* 




Rho, Rac, Cdc42 


relay signals from surface receptors to the cytoskeleton and elsewhere 


ARF* 




ARF1-ARF6 


regulate assembly of protein coats on intracellular vesicles 


Rab* 




Rab1-60 


regulate intracellular vesicle traffic 


Ran* 




Ran 


regulates mitotic spindle assembly and nuclear transport of RNAs and proteins 



*The Rho family is discussed in Chapter 1 6, the ARF and Rab proteins in Chapter 1 3, and Ran in Chapters 1 2 and 1 7. The three-dimensional 
structure of Ras is shown in Figure 3-72. 
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responses normally induced by the activated RTKs do not occur. Conversely, 
30% of human tumors have hyperactive mutant forms of Ras, which contribute 
to the uncontrolled proliferation of the cancer cells. 

Like other GTP-binding proteins, Ras functions as a molecular switch, 
cycling between two distinct conformational states — active when GTP is bound 
and inactive when GDP is bound (see Figure 15-18B). <GAAC> As discussed ear- 
lier for monomeric GTPases in general, two classes of signaling proteins regulate 
Ras activity by influencing its transition between active and inactive states (see 
Figure 15-19). Ras guanine nucleotide exchange factors (Ras-GEFs) stimulate the 
dissociation of GDP and the subsequent uptake of GTP from the cytosol, thereby 
activating Ras. Ras GTPase- activating proteins (Ras-GAPs) increase the rate of 
hydrolysis of bound GTP by Ras, thereby inactivating Ras. Hyperactive mutant 
forms of Ras are resistant to GAP-mediated GTPase stimulation and are locked 
permanently in the GTP-bound active state, which is why they promote the 
development of cancer. 

But how do RTKs normally activate Ras? In principle, they could either acti- 
vate a Ras-GEF or inhibit a Ras-GAP. Even though some GAPs bind directly (via 
their SH2 domains) to activated RTKs (see Figure 15-55A), whereas GEFs usually 
bind only indirectly, it is the indirect coupling of the receptor to a Ras-GEF that 
drives Ras into its active state. In fact, the loss of function of a Ras-GEF has a sim- 
ilar effect to the loss of function of Ras itself. Activation of the other Ras super- 
family proteins, including those of the Rho family, also occurs through the acti- 
vation of GEFs. The particular GEF determines in which membrane the GTPase 
is activated and, by acting as a scaffold, which downstream proteins the GTPase 
activates. 

Ras proteins and the proteins that regulate them have been highly con- 
served in evolution, and genetic analyses in Drosophila and C. elegans provided 
the first clues to how RTKs activate Ras. Genetic studies of photoreceptor cell 
development in the Drosophila eye were particularly informative. 




200 |nm 

Figure 15-56 Scanning electron 
micrograph of a compound eye of 
Drosophila. The eye is composed of 
about 800 identical units (ommatidia), 
each having a separate lens that focuses 
light onto eight photoreceptor cells at its 
base. (Courtesy of Kevin Moses.) 



RTKs Activate Ras Via Adaptors and GEFs: Evidence from the 
Developing Drosophila Eye 

The Drosophila compound eye consists of about 800 identical units called 
ommatidia, each composed of 8 photoreceptor cells (R1-R8) and 12 accessory 
cells (Figure 15-56). The eye develops from a simple epithelial sheet, and the cells 
that make up each ommatidium are recruited from the sheet in a fixed sequence, 
by a series of cell-cell interactions. Beginning with the development of the R8 
photoreceptor, each differentiating cell induces its uncommitted immediate 
neighbors to adopt a specific fate and assemble into the developing ommatidium 
(Figure 15-57). 

The development of the R7 photoreceptor, which is required for the detection 
of ultraviolet light, has been studied most intensively, beginning with the descrip- 
tion of a mutant fly called Sevenless (Sev), in which a deficiency of R7 is the only 
observed defect. Such mutants are easy to select on the basis of their blindness to 
ultraviolet light. The normal Sev gene was shown to encode an RTK that is 
expressed on R7 precursor cells. Further genetic analysis of mutants in which R7 
development is blocked but the Sev protein itself is not affected led to the identi- 
fication of the gene Bride-of-sevenless (Boss), which encodes the ligand for the Sev 
RTK. Boss is a seven-pass transmembrane protein that is expressed exclusively on 
the surface of the adjacent R8 cell, and when it binds to and activates Sev, it 
induces the R7 precursor cell to differentiate into an R7 photoreceptor. The Sev 
protein is also expressed on several other precursor cells in the developing 
ommatidium, but none of these cells contact R8; therefore, the Sev protein is not 
activated, and these cells do not differentiate into R7 photoreceptors. 
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Figure 1 5-57 The assembly of 
photoreceptor cells in a developing 
Drosophila ommatidium. Cells are 
recruited sequentially to become 
photoreceptors, beginning with R8 and 
ending with R7, which is the last 
photoreceptor cell to develop. 
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Figure 1 5-58 How the Sev RTK activates 
Ras in the fly eye. The activation of Sev 
on the surface of the R7 precursor cell by 
the Boss protein on the surface of R8 
activates the Ras-GEF Sos via the adaptor 
protein Drk. Drk recognizes a specific 
phosphorylated tyrosine on the Sev 
protein by means of an SH2 domain and 
interacts with Sos by means of two SH3 
domains. Sos stimulates the inactive Ras 
protein to replace its bound GDP by GTP, 
which activates Ras to relay the signal 
downstream, inducing the R7 precursor 
cell to differentiate into a UV-sensing 
photoreceptor cell. 



adaptor protein (Drk) 



The components of the intracellular signaling pathway activated by Sev in 
the R7 precursor cell proved more difficult to identify than the receptor or its lig- 
and because mutations that inactivate them are lethal. The problem was solved 
by performing a genetic screen in flies with a partially inactive Sev protein. One 
of the genes identified encodes a Ras protein. Flies in which both copies of the 
Ras gene are inactivated by mutation die, whereas flies with only one inactivated 
copy survive. But when a fly carries a partially inactive Sev protein in its devel- 
oping eyes, an inactivating mutation in one copy of the Ras gene results in the 
loss of R7. Moreover, if one of the Ras genes is rendered overactive by mutation, 
R7 develops even in mutants in which both Sev and Boss are inactive. These find- 
ings indicate that Ras acts downstream of Sev and that its activation in R7 pre- 
cursor cells is necessary and sufficient to induce R7 differentiation. 

A second gene identified in these genetic screens is called Son-of-sevenless 
(Sos). It encodes a Ras-GEF, which is required for the Sev RTK to activate Ras. A 
third gene (called Drk) encodes an adaptor protein, which couples the Sev 
receptor to the Sos protein; the SH2 domain of the Drk adaptor binds to acti- 
vated Sev, while its two SH3 domains bind to Sos (Figure 15-58). This type of 
genetic screen, in which organisms with a partially crippled component of a 
genetic pathway are used to identify genes encoding other proteins in the path- 
way, has now been widely used elsewhere with great success. 

Biochemical and cell biological studies have shown that the coupling of 
RTKs to Ras occurs by a similar mechanism in mammalian cells, where the 
adaptor protein is called Grb2 and the Ras-GEF is also called Sos (see Figure 
15-22). Interestingly, when mammalian Sos activates Ras, Ras acts back to stim- 
ulate Sos further, forming a simple positive feedback loop. 

RTKs are not the only means of activating Ras. Ca 2+ and diacylglycerol, for 
example, activate a Ras-GEF that is found mainly in the brain; this Ras-GEF can 
couple GPCRs to Ras activation independently of Sos. 

Once activated, Ras activates various other signaling proteins to relay the 
signal downstream, as we discuss next. 



Ras Activates a MAP Kinase Signaling Module 

Both the tyrosine phosphorylations and the activation of Ras triggered by acti- 
vated RTKs are usually short-lived (Figure 15-59). Tyrosine- specific protein 
phosphatases quickly reverse the phosphorylations, and GAPs induce acti- 
vated Ras to inactivate itself by hydrolyzing its bound GTP to GDP. To stimu- 
late cells to proliferate or differentiate, these short-lived signaling events must 
be converted into longer-lasting ones that can sustain the signal and relay it 
downstream to the nucleus to alter the pattern of gene expression. One of the 
key mechanisms used for this purpose is a system of proteins called the mito- 
gen-activated protein kinase module (MAP kinase module) (Figure 15-60). 
The three components of this system together form a functional signaling 
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Figure 15-59 Transient activation of Ras revealed by single-molecule fluorescence resonance energy transfer (FRET). 

(A) Schematic drawing of the experimental strategy. Cells of a human cancer cell line are genetically engineered to express 
a Ras protein that is covalently linked to yellow fluorescent protein (YFP). GTP that is labeled with a red fluorescent dye is 
microinjected into some of the cells. The cells are then stimulated with the extracellular signal protein epidermal growth 
factor (EGF), and single fluorescent molecules of Ras-YFP at the inner surface of the plasma membrane are followed by 
video fluorescence microscopy in individual cells. When a fluorescent Ras-YFP molecule becomes activated, it exchanges 
unlabeled GDP for fluorescently labeled GTP; the yellow-green light emitted from the YFP now activates the fluorescent 
GTP to emit red light. Thus, the activation of single Ras molecules can be followed by the emission of red fluorescence from 
a previously yellow-green fluorescent spot at the plasma membrane. As shown in (B), activated Ras molecules can be 
detected after about 30 seconds of EGF stimulation. The red signal peaks at 3-4 minutes and then decreases to baseline by 
6 minutes. As Ras-GAP is found to be recruited to the same spots at the plasma membrane as Ras, it presumably plays a 
major part in rapidly shutting off the Ras signal. (Modified from H. Murakoshi et al., Proc. Natl Acad. Sci. U.S.A. 
101 :731 7-7322, 2004. With permission from National Academy of Sciences.) 

module that has been highly conserved, from yeasts to humans, and is used, 
with variations, in many different signaling contexts. 

The three components are all protein kinases. The final kinase in the series 
is called simply MAP kinase (MAPK). The next one upstream from this is MAP 
kinase kinase (MAPKK): it phosphorylates and thereby activates MAP kinase. 
And next above that, receiving an activating signal directly from Ras, is MAP 
kinase kinase kinase (MAPKKK): it phosphorylates and thereby activates 
MAPKK. In the mammalian Ras-MAP-kinase signaling pathway, these three 
kinases are known by shorter names: Raf (=MAPKKK), Mek (= MAPKK) and Erk 
(=MAPK). 

Once activated, the MAP kinase relays the signal downstream by phosphory- 
lating various proteins in the cell, including gene regulatory proteins and other 
protein kinases (see Figure 15-60). Erk MAP kinase, for example, enters the 
nucleus and phosphorylates one or more components of a gene regulatory com- 
plex. This activates the transcription of a set of immediate early genes, so named 
because they turn on within minutes after an RTK receives an extracellular signal, 
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Figure 15-60 The MAP kinase 
serine/threonine phosphorylation 
module activated by Ras. The three- 
component module begins with a MAP 
kinase kinase kinase called Raf. Ras 
recruits Raf to the plasma membrane and 
helps activate it. Raf then activates the 
MAP kinase kinase Mek, which then 
activates the MAP kinase Erk. Erk in turn 
phosphorylates a variety of downstream 
proteins, including other protein kinases, 
as well as gene regulatory proteins in the 
nucleus. The resulting changes in gene 
expression and protein activity cause 
complex changes in cell behavior. 
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even if protein synthesis is experimentally blocked with drugs. Some of these 
genes encode other gene regulatory proteins that turn on other genes, a process 
that requires both protein synthesis and more time. (This relationship between 
the immediate early and later genes is similar to the relationship between pri- 
mary and secondary response genes activated by the nuclear receptors dis- 
cussed earlier — see Figure 15-15.) 

In this way, the Ras-MAP-kinase signaling pathway conveys signals from the 
cell surface to the nucleus and alters the pattern of gene expression. Among the 
genes activated by this pathway are some that stimulate cell proliferation, such 
as the genes encoding Gi cyclins (discussed in Chapter 17). 

Extracellular signals usually activate MAP kinases only transiently, and the 
period during which the kinase remains active influences the response. When 
EGF activates its receptors in a neural precursor cell line, for example, Erk MAP 
kinase activity peaks at 5 minutes and rapidly declines, and the cells later go on 
to divide. By contrast, when NGF activates its receptors on the same cells, Erk 
activity remains high for many hours, and the cells stop proliferating and differ- 
entiate into neurons. Many factors can influence the duration of the signaling 
response, including positive and negative feedback loops (see Figure 15-28). 

MAP kinases participate in both positive and negative feedback loops, 
which can combine to give responses that are either graded or switchlike and 
either brief or long lasting. In an example illustrated earlier, in Figure 15-24, 
MAP kinase activates a complex positive feedback loop to produce an all-or- 
none, irreversible response when frog oocytes are stimulated to mature by a 
brief exposure to the extracellular signal molecule progesterone. In many cells, 
MAP kinases activate a negative feedback loop by increasing the concentration 
of a dual-specificity protein phosphatase that removes the phosphate from both 
a tyrosine and a threonine to inactivate the MAP kinase (which is phosphory- 
lated on the tyrosine and threonine by MAPKK) . The increase in the phosphatase 
results from both an increase in the transcription of the phosphatase gene and 
the stabilization of the enzyme against degradation. In the Ras-MAP-kinase 
pathway shown in Figure 15-61, Erk also phosphorylates and inactivates Raf, 
providing another negative feedback loop that helps shut off the MAP kinase 
module. 



Scaffold Proteins Help Prevent Cross-Talk Between Parallel MAP 
Kinase Modules 

Three-component MAP kinase signaling modules operate in all eucaryotic cells, 
with different modules mediating different responses in the same cell. In bud- 
ding yeast, for example, one such module mediates the response to mating 
pheromone, another the response to starvation, and yet another the response to 
osmotic shock. Some of these MAP kinase modules use one or more of the same 
kinases and yet manage to activate different effector proteins and hence differ- 
ent responses. As discussed earlier, one way in which cells avoid cross-talk 
between the different parallel signaling pathways and ensure that each response 
is specific is to use scaffold proteins (see Figure 15-2 1A). In yeast cells, such scaf- 
folds bind all or some of the kinases in each MAP kinase module to form a com- 
plex and thereby help to ensure response specificity (Figure 15-61). 

Mammalian cells also use this scaffold strategy to prevent cross-talk 
between different MAP kinase modules. At least 5 parallel MAP kinase modules 
can operate in a mammalian cell. These modules make use of at least 12 MAP 
kinases, 7 MAP kinase kinases, and 7 MAP kinase kinase kinases. Two of these 
modules (terminating in MAP kinases called JNK and p38) are activated by dif- 
ferent kinds of cell stresses, such as UV irradiation, heat shock, and osmotic 
stress, as well as by inflammatory cytokines; others mainly mediate responses to 
signals from other cells. 

Although the scaffold strategy provides precision and avoids cross-talk, it 
reduces the opportunities for amplification and spreading of the signal to differ- 
ent parts of the cell, which require at least some of the components to be dif- 
fusible (see Figure 15-17). It is unclear to what extent the individual components 
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of MAP kinase modules can dissociate from the scaffold during the activation 
process to permit amplification, which is why we refer to them as modules 
rather than cascades. 



Rho Family GTPases Functionally Couple Cell-Surface Receptors 
to the Cytoskeleton 

Besides the Ras proteins, the other class of Ras superfamily GTPases that relays 
signals from cell-surface receptors is the large Rho family (see Table 15-5). Rho 
family monomeric GTPases regulate both the actin and microtubule cytoskele- 
tons, controlling cell shape, polarity, motility, and adhesion (discussed in Chap- 
ter 16); they also regulate cell-cycle progression, gene transcription, and mem- 
brane transport. They play a key part in the guidance of cell migration and nerve 
axon outgrowth, mediating cytoskeletal responses to the activation of a special 
class of guidance receptors. We focus on this aspect of Rho family function here. 

The three best-characterized members are Rho itself, Rac, and Cdc42, each 
of which affects multiple downstream target proteins. In the same way as for 
Ras, GEFs activate and GAPs inactivate the Rho family GTPases; there are more 
than 60 Rho -GEFs and more than 70 Rho -GAPs in humans. Some of the GEFs 
and GAPs are specific for one particular family member, whereas others are less 
specific. Unlike Ras, which is membrane-associated even when inactive (with 
GDP bound), inactive Rho family GTPases are often bound to guanine 
nucleotide dissociation inhibitors (GDIs) in the cytosol, which prevent the 
GTPases from interacting with their Rho -GEFs at the plasma membrane. 

Although cell- surface receptors activate the Rho GTPases by activating Rho- 
GEFs, it is not known in most cases how a receptor activates its GEE One of the 
exceptions is an Eph RTK (see Figure 15-52) on the surface of motor neurons 
that helps guide the migrating tip of the axon (called a growth cone) to its mus- 
cle target. The binding of a cell-surface ephrin protein activates the Eph recep- 
tor, causing the growth cones to collapse, thereby repelling them from inappro- 
priate regions and keeping them on track. The response depends on a Rho-GEF 
called ephexin, which is stably associated with the cytosolic tail of the Eph recep- 
tor. When ephrin binding activates the Eph receptor, the receptor activates a 
cytoplasmic tyrosine kinase that phosphorylates ephexin on a tyrosine, enhanc- 
ing the ability of ephexin to activate the Rho protein RhoA. The activated RhoA 
(RhoA-GTP) then regulates various downstream target proteins, including some 
effector proteins that control the actin cytoskeleton, causing the growth cone to 
collapse (Figure 15-62). 

Ephrins are among the best-characterized extracellular guidance proteins, 
with many functions, both in the nervous system and outside it. One of their 
roles, for example, is to direct the way in which nerve axons grow from the eye 
to the optic tectum so as to create a neural 'map' of the visual field in the brain 



Figure 15-61 The organization of two 
MAP kinase modules by scaffold 
proteins in budding yeast. Budding 
yeast have at least six three-component 
MAP kinase modules involved in a variety 
of biological processes, including the two 
responses illustrated here — a mating 
response and the response to high 
osmolarity. (A) The mating response is 
triggered when a mating factor secreted 
by a yeast of opposite mating type binds 
to a GPCR.This activates a G protein, the 
Py complex of which indirectly activates 
the MAP kinase kinase kinase (kinase A), 
which then relays the response onward. 
Once activated, the MAP kinase (kinase C) 
phosphorylates and thereby activates 
several proteins that mediate the mating 
response, in which the yeast cell stops 
dividing and prepares for fusion. The 
three kinases in this module are bound to 
scaffold protein 1. (B) In a second 
response, a yeast cell exposed to a high- 
osmolarity environment is induced to 
synthesize glycerol to increase its internal 
osmolarity. This response is mediated by 
a transmembrane, osmolarity-sensing, 
receptor protein and a different MAP 
kinase module bound to a second 
scaffold protein. (Note that the kinase 
domain of scaffold 2 provides the MAP 
kinase kinase activity of this module.) 
Although both pathways use the same 
MAP kinase kinase kinase (kinase A, 
green), there is no cross-talk between 
them, because the kinases in each 
module are bound to different scaffold 
proteins, and the osmosensor is bound to 
the same scaffold protein as the 
particular kinase it activates. 

Remarkably, scaffold 1 is able to 
function even if it is stripped (by genetic 
engineering) of any ability to align or 
allosterically regulate its three bound 
kinases. Its role is apparently just to bring 
the kinases into close proximity, thereby 
increasing the rate at which they 
encounter each other, as expected for 
reaction rate accelerations caused by 
protein tethering (see Figure 3-80C). 
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Figure 15-62 Growth cone collapse 
mediated by Rho family GTPases. 

(A) The binding of transmembrane ephrin 
A1 proteins on an adjacent cell activates 
EphA4 RTKs on the growth cone of an 
axon, by the mechanism illustrated in 
Figure 15-53A. Phosphotyrosines on the 
activated Eph receptors recruit and 
activate a cytoplasmic tyrosine kinase to 
phosphorylate the receptor-associated 
Rho-GEF ephexin on a tyrosine. This 
enhances the ability of the ephexin to 
activate the Rho family GTPase RhoA. 
RhoA then induces the growth cone to 
collapse by stimulating the myosin- 
dependent contraction of the actin 
cytoskeleton. (B) When ephrin A1 is not 
bound to the EphA4 receptor, ephexin 
activates three different Rho family 
members (Cdc42, Rac, and RhoA) about 
equally, promoting the forward advance 
of the growth cone. (C) EphrinAI binding 
to EphA4 alters the activity of ephexin so 
that it now mainly activates RhoA, 
causing the growth cone to collapse. 



(discussed in Chapter 22) . But the guidance of axon outgrowth is a complex mat- 
ter, and other types of guidance receptors are also involved, as discussed in 
Chapter 22. All of these receptors, however, are thought to guide cell movement 
by influencing the cytoskeleton, and to do so via members of the Rho family 

Having considered how RTKs use GEFs and monomeric GTPases to relay 
signals into the cell, we now consider a second major strategy that RTKs use that 
depends on a quite different intracellular relay mechanism. 



PI 3-Kinase Produces Lipid Docking Sites in the Plasma 
Membrane 

As mentioned earlier, one of the proteins that binds to the intracellular tail of 
RTK molecules is the plasma-membrane-bound enzyme phosphoinositide 3- 
kinase (PI 3-kinase). This kinase principally phosphorylates inositol phospho- 
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lipids rather than proteins, and both RTKs and GPCRs can activate it. It plays a 
central part in promoting cell survival and growth. 

Phosphatidylinositol (PI) is unique among membrane lipids because it can 
undergo reversible phosphorylation at multiple sites on its inositol head group 
to generate a variety of phosphorylated PI lipids called phosphoinositides (see 
Figure 15-37). When activated, PI 3-kinase catalyzes phosphorylation at the 3 
position of the inositol ring to generate several phosphoinositides (Figure 
15-63). The production of PI(3,4,5)P3 matters most because it can serve as a 
docking site for various intracellular signaling proteins, which assemble into sig- 
naling complexes that relay the signal into the cell from the cytosolic face of the 
plasma membrane (see Figure 15-21C). 

Notice the difference between this use of phosphoinositides and their use 
described earlier, in which PI(4,5)P2 is cleaved by PLC(3 (in the case of GPCRs) 
or PLCy (in the case of RTKs) to generate soluble IP3 and membrane-bound dia- 
cylglycerol (see Figures 15-38 and 15-39). By contrast, PI (3,4,5) P3 is not cleaved 
by PLC. It is made from PI(4,5)P2 and then remains in the plasma membrane 
until specific phosphoinositide phosphatases dephosphorylate it. Prominent 
among these is the PTEN phosphatase, which dephosphorylates the 3 position 
of the inositol ring. Mutations in PTEN are found in many cancers: by prolong- 
ing signaling by PI 3-kinase, they promote uncontrolled cell growth. 

There are various types of PI 3-kinases. Those activated by RTKs and GPCRs 
belong to class I. These are heterodimers composed of a common catalytic sub- 
unit and different regulatory subunits. RTKs activate class la PI 3 -kinases, in 
which the regulatory subunit is an adaptor protein that binds to two phospho- 
tyrosines on activated RTKs through its two SH2 domains (see Figure 15-55A). 
GPCRs activate class lb PI 3-kinases, which have a regulatory subunit that binds 
to the (3y complex of an activated trimeric G protein when GPCRs are activated 
by their extracellular ligand. The direct binding of activated Ras can also activate 
the common class I catalytic subunit. 

Intracellular signaling proteins bind to PI(3,4,5)P3 produced by activated PI 
3-kinase via a specific interaction domain, such as a pleckstrin homology (PH) 
domain mentioned earlier, first identified in the platelet protein pleckstrin. PH 
domains function mainly as protein-protein interaction domains, and it is only 
a small subset of them that bind to PIP3; at least some of these also recognize a 
specific membrane-bound protein as well as the PIP3, which greatly increases 
the specificity of the binding and helps to explain why the signaling proteins 
with PIP 3 -binding PH domains do not all dock at all PI(3,4,5)P 3 sites. PH 
domains occur in about 200 human proteins, including the Ras-GEF Sos dis- 
cussed earlier (see Figure 15-22). 
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Figure 1 5-63 The generation of 
phosphoinositide docking sites by 
PI 3-kinase. PI 3-kinase phosphorylates the 
inositol ring on carbon atom 3 to generate 
the phosphoinositides shown at the bottom 
of the figure (diverting them away from the 
pathway leading to IP 3 and diacylglycerol). 
The most important phosphorylation 
(indicated in red) is of PI(4,5)P 2 to PI(3A5)P 3 , 
which can serve as a docking site for 
signaling proteins with PIP3-binding PH 
domains. Other inositol phospholipid 
kinases catalyze the phosphorylations 
indicated by the green arrows. 
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One especially important PH- domain- containing protein is the serine/thre- 
onine protein kinase Akt. The PI-3-kinase-Akt signaling pathway is the major 
pathway activated by the hormone insulin. It also plays a key part in promoting 
the survival and growth of many cell types in both invertebrates and vertebrates, 
as we now discuss. 



The PI-3-Kinase-Akt Signaling Pathway Stimulates Animal Cells to 
Survive and Grow 



For a multicellular organism to grow, its cells must grow (accumulate mass and 
enlarge). If the cells simply divided without growing, they would get progres- 
sively smaller and the organism as a whole would stay the same size. As dis- 
cussed earlier, extracellular signals are usually required for animal cells to grow 
and divide, as well as to survive (see Figure 15-8). Members of the insulin-like 
growth factor (IGF) family of signal proteins, for example, stimulate many types 
of animal cells to survive and grow. They bind to specific RTKs (see Figure 
15-52), which activate PI 3-kinase to produce PI(3,4,5)P3. The PIP3 recruits two 
protein kinases to the plasma membrane via their PH domains — Akt (also called 
protein kinase B, or PKB) and phosphoinositide- dependent protein kinase 1 
{PDK1), and this leads to the activation of Akt (Figure 15-64). Once activated, 
Akt phosphorylates various target proteins at the plasma membrane, as well as 
in the cytosol and nucleus. The effect on most of the known targets is to inacti- 
vate them; but the targets are such that these actions of Akt all conspire to 
enhance cell survival and growth. 

One effect of Akt, for example, is to phosphorylate a cytosolic protein called 
Bad, which, in its nonphosphorylated state, promotes cell death by apoptosis 
(discussed in Chapter 18). The phosphorylation of Bad by Akt creates phospho- 
serine-binding sites for a scaffold protein called 14-3-3, which sequesters phos- 
phorylated Bad and keeps it out of action, thereby promoting cell survival (see 
Figure 15-64). 

The PI-3-kinase-Akt pathway signals cells to grow through a more complex 
mechanism that depends on a large, serine /threonine protein kinase called TOR 
(named as the target of rapamycin, a bacterial toxin that inactivates the kinase 
and is used clinically as both an immunosuppressant and anti-cancer drug). TOR 
was originally identified in yeasts in genetic screens for rapamycin resistance; in 
mammalian cells, it is called mTOR. TOR exists in cells in two functionally dis- 
tinct multiprotein complexes. In mammalian cells, mTOR complex 1 contains the 
protein raptor, this complex is sensitive to rapamycin, and it stimulates cell 
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Figure 1 5-64 One way in which 
signaling through PI 3-kinase promotes 
cell survival. An extracellular survival 
signal activates an RTK, which recruits 
and activates PI 3-kinase. The PI 3-kinase 
produces PI(3,4,5)P 3 , which serves as a 
docking site for two serine/threonine 
kinases with PH domains — Akt and the 
phosphoinositol-dependent kinase 
PDKI — which are brought into proximity 
at the plasma membrane. The Akt is 
phosphorylated on a serine by a third 
kinase (usually mTOR), which alters the 
conformation of the Akt so that it can be 
phosphorylated on a threonine by PDK1, 
which activates the Akt. The activated Akt 
now dissociates from the plasma 
membrane and phosphorylates various 
target proteins, including the Bad 
protein. 

When unphosphorylated, Bad holds 
one or more apoptosis-inhibitory 
proteins (of the Bcl2 family — discussed in 
Chapter 18) in an inactive state. Once 
phosphorylated, Bad releases the 
inhibitory proteins, which now can block 
apoptosis and thereby promote cell 
survival. As shown, once phosphorylated, 
Bad binds to a ubiquitous cytosolic 
protein called 14-3-3, which keeps Bad 
out of action. 
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growth — both by promoting ribosome production and protein synthesis and by 
inhibiting protein degradation. Complex 1 also promotes both cell growth and 
cell survival by stimulating nutrient uptake and metabolism. mTOR complex 2 
contains the protein victor and is insensitive to rapamycin; it helps to activate 
Akt (see Figure 15-64), and it regulates the actin cytoskeleton via Rho family 
GTPases. 

The mTOR in complex 1 integrates inputs from various sources, including 
extracellular signal proteins referred to as growth factors and nutrients such as 
amino acids, both of which help activate mTOR and promote cell growth. The 
growth factors activate mTOR mainly via the PI-3-kinase-Akt pathway. Akt acti- 
vates mTOR in complex 1 indirectly by phosphorylating, and thereby inhibiting, 
a GAP called Tsc2. Tsc2 acts on a monomeric Ras-related GTPase called Rheb 
(see Table 15-5, p. 926). Rheb in its active form (Rheb-GTP) activates mTOR. The 
net result is that Akt activates mTOR and thereby promotes cell growth (Figure 
15-65). We discuss how mTOR stimulates ribosome production and protein syn- 
thesis in Chapter 17 (see Figure 17-65). 



Figure 15-65 Activation of mTOR by the 
PI-3-kinase-Akt signaling pathway. 

(A) In the absence of extracellular growth 
factors, Tsc2 (a Rheb-GAP) keeps Rheb 
inactive; mTOR in complex I is inactive, 
and there is no cell growth. (B) In the 
presence of growth factors, activated Akt 
phosphorylates and inhibits Tsc2, thereby 
promoting the activation of Rheb. 
Activated Rheb (Rheb-GTP) helps activate 
mTOR in complex 1, which in turn 
stimulates cell growth. Figure 15-64 
shows how growth factors (or survival 
signals) activate Akt. 

The Erk MAP kinase (see Figure 15-60) 
can also phosphorylate and inhibit Tsc2 
and thereby activate mTOR. Thus, both 
the PI-3-kinase-Akt and Ras-MAP- kinase 
signaling pathways converge on mTOR in 
complex 1 to stimulate cell growth. 

Tsc2 is short for tuberous sclerosis 
protein 2, and it is one component of a 
heterodimer composed of Tsd and Tsc2 
(not shown); these proteins are so called 
because mutations in either gene 
encoding them cause the genetic disease 
tuberous sclerosis, which is associated 
with benign tumors in the brain and 
elsewhere that contain abnormally 
large cells. 



The Downstream Signaling Pathways Activated By RTKs and 
GPCRs Overlap 

As mentioned earlier, RTKs and GPCRs activate some of the same intracellular 
signaling pathways. Both, for example, can activate the inositol phospholipid 
pathway triggered by phospholipase C. Moreover, even when they activate dif- 
ferent pathways, the different pathways can converge on the same target pro- 
teins. Figure 15-66 illustrates both of these types of signaling overlaps: it sum- 
marizes five parallel intracellular signaling pathways that we have discussed so 
far — one triggered by GPCRs, two triggered by RTKs, and two triggered by both 
kinds of receptors. 



Tyrosine-Kinase-Associated Receptors Depend on Cytoplasmic 
Tyrosine Kinases 

Many cell-surface receptors depend on tyrosine phosphorylation for their activ- 
ity and yet lack a tyrosine kinase domain. These receptors act through cytoplas- 
mic tyrosine kinases, which are associated with the receptors and phosphorylate 
various target proteins, often including the receptors themselves, when the 
receptors bind their ligand. These tyrosine-kinase- associated receptors thus 
function in much the same way as RTKs, except that their kinase domain is 
encoded by a separate gene and is noncovalently associated with the receptor 
polypeptide chain. A variety of receptor classes belong in this category, including 
the receptors for antigen and interleukins on lymphocytes (discussed in Chapter 



936 Chapter 1 5: Mechanisms of Cell Communication 



signal molecule 




Figure 15-66 Five parallel intracellular 
signaling pathways activated by GPCRs, 
RTKs, or both. In this hypothetical 
example, the five kinases (shaded yellow) 
at the end of each signaling pathway 
phosphorylate target proteins (shaded 
red), many of which are phosphorylated 
by more than one of the kinases. The 
phospholipase C activated by the two 
types of receptors is different: GPCRs 
activate PLCp, whereas RTKs activate 
PLCy (not shown). Although not shown, 
some GPCRs can also activate Ras, but 
they do so independently of Grb2, via a 
Ras-GEF that is activated by Ca 2+ and 
diacylglycerol. 
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25), integrins (discussed in Chapter 19), and receptors for various cytokines and 
some hormones. As with RTKs, many of these receptors are either preformed 
dimers (Figure 15-67) or are cross-linked into dimers by ligand binding. 

Some of these receptors depend on members of the largest family of mam- 
malian cytoplasmic tyrosine kinases, the Src family (see Figures 3-10 and 3-69), 
which includes Src, Yes, Fgr, Fyn, Lck, Lyn, Hck, and Blk. These protein kinases all 
contain SH2 and SH3 domains and are located on the cytoplasmic side of the 
plasma membrane, held there partly by their interaction with transmembrane 
receptor proteins and partly by covalently attached lipid chains. Different fam- 
ily members are associated with different receptors and phosphorylate overlap- 
ping but distinct sets of target proteins. Lyn, Fyn, and Lck, for example, are each 
associated with different sets of receptors in lymphocytes (discussed in Chapter 
25). In each case, the kinase is activated when an extracellular ligand binds to 
the appropriate receptor protein. Src itself, as well as several other family mem- 
bers, can also bind to activated RTKs; in these cases, the receptor and cytoplas- 
mic kinases mutually stimulate each other's catalytic activity, thereby strength- 
ening and prolonging the signal (see Figure 15-62). There are even some G pro- 
teins (G s and GO that can activate Src, which is one way that the activation of 
GPCRs can lead to tyrosine phosphorylation of intracellular signaling proteins 
and effector proteins. 

Another type of cytoplasmic tyrosine kinase associates with integrins, the 
main receptors that cells use to bind to the extracellular matrix (discussed in 
Chapter 19). The binding of matrix components to integrins activates intracellu- 

Figure 1 5-67 The three-dimensional structure of human growth hormone 
bound to its receptor. <CGCT> The receptor is a homodimer, and the 
hormone (red) binds to each of the two identical subunits (one shown in 
green and the other in blue), which recognize different parts of the 
monomeric hormone. Hormone binding is thought to cause a rearrangement 
of the subunits, which activates cytoplasmic tyrosine kinases that are tightly 
bound to the cytosolic tails of the receptors (not shown). The structure shown 
was determined by x-ray crystallographic studies of complexes formed 
between the hormone and extracellular receptor domains produced by 
recombinant DNA technology. (From A.M. deVos, M. Ultsch and 
A.A. Kossiakoff, Sc/ence 255:306-31 2, 1992. With permission from AAAS.) 
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lar signaling pathways that influence the behavior of the cell. When integrins 
cluster at sites of matrix contact, they help trigger the assembly of cell-matrix 
junctions called focal adhesions. Among the many proteins recruited into these 
junctions is the cytoplasmic tyrosine kinase called focal adhesion kinase (FAK), 
which binds to the cytosolic tail of one of the integrin subunits with the assis- 
tance of other proteins. The clustered FAK molecules cross-phosphorylate each 
other, creating phosphotyrosine- docking sites where the Src kinase can bind. Src 
and FAK now phosphorylate each other and other proteins that assemble in the 
junction, including many of the signaling proteins used by RTKs. In this way, the 
two tyrosine kinases signal to the cell that it has adhered to a suitable substra- 
tum, where the cell can now survive, grow, divide, migrate, and so on. 

The largest and most diverse class of receptors that rely on cytoplasmic tyro- 
sine kinases to relay signals into the cell is the class of cytokine receptors, which 
we consider next. 



Cytokine Receptors Activate the JAK-STAT Signaling Pathway, 
Providing a Fast Track to the Nucleus 

The large family of cytokine receptors includes receptors for many kinds of local 
mediators (collectively called cytokines), as well as receptors for some hor- 
mones, such as growth hormone (see Figure 15-67) and prolactin. These recep- 
tors are stably associated with cytoplasmic tyrosine kinases called Janus kinases 
(JAKs) (after the two-faced Roman god), which phosphorylate and activate gene 
regulatory proteins called STATs (signal transducers and activators of transcrip- 
tion). STAT proteins are located in the cytosol and are referred to as latent gene 
regulatory proteins because they only migrate into the nucleus and regulate gene 
transcription after they are activated. 

Although many intracellular signaling pathways lead from cell-surface 
receptors to the nucleus, where they alter gene transcription (see Figure 15-66), 
the JAK-STAT signaling pathway provides one of the more direct routes. 
Cytokine receptors are dimers or trimers and are stably associated with one or 
two of the four known JAKs (JAK1, JAK2, JAK3, and Tyk2). Cytokine binding alters 
the arrangement so as to bring two JAKs into close proximity so that they trans- 
phosphorylate each other, thereby increasing the activity of their tyrosine kinase 
domains. The JAKs then phosphorylate tyrosines on the cytokine receptors, cre- 
ating phosphotyrosine docking sites for STATs (Figure 15-68). Some adaptor 
proteins can also bind to some of these sites and couple cytokine receptors to 
the Ras-MAP-kinase signaling pathway discussed earlier, but these will not be 
discussed here. 

There are at least six STATs in mammals. Each has an SH2 domain that per- 
forms two functions. First, it mediates the binding of the STAT protein to a 
phosphotyrosine docking site on an activated cytokine receptor. Once bound, 
the JAKs phosphorylate the STAT on tyrosines, causing the STAT to dissociate 
from the receptor. Second, the SH2 domain on the released STAT now mediates 
its binding to a phosphotyrosine on another STAT molecule, forming either a 
STAT homodimer or a heterodimer. The STAT dimer then translocates to the 
nucleus, where, in combination with other gene regulatory proteins, it binds to 
a specific DNA response element in various genes and stimulates their tran- 
scription (see Figure 15-68). In response to the hormone prolactin, for example, 
which stimulates breast cells to produce milk, activated STAT5 stimulates the 
transcription of genes that encode milk proteins. Table 15-6 lists some of the 
more than 30 cytokines and hormones that activate the JAK-STAT pathway by 
binding to cytokine receptors; it also shows the specific JAKs and STATs involved. 

Some STATs also have an SH2 domain that enables them to dock onto spe- 
cific phosphotyrosines on some activated RTKs. These receptors can directly 
activate the bound STAT, independently of JAKs. The nematode C. elegans uses 
STATs for signaling but does not make any JAKs or cytokine receptors, suggest- 
ing that STATs evolved before JAKs and cytokine receptors. 

Negative feedback regulates the responses mediated by the JAK-STAT path- 
way. In addition to activating genes that encode proteins mediating the 
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cytokine-induced response, the STAT dimers can also activate genes that encode 
inhibitory proteins that help shut off the response. Some of these proteins bind 
to and inactivate phosphorylated JAKs and their associated phosphorylated 
receptors; others bind to phosphorylated STAT dimers and prevent them from 
binding to their DNA targets. Such negative feedback mechanisms, however, are 
not enough on their own to turn off the response. Inactivation of the activated 
JAKs and STATs requires dephosphorylation of their phosphotyrosines. 



Protein Tyrosine Phosphatases Reverse Tyrosine Phosphorylations 

In all signaling pathways that use tyrosine phosphorylation, the tyrosine phos- 
phorylations are reversed by dephosphorylation performed by protein tyrosine 
phosphatases. These phosphatases are as important in the signaling process as 
the protein tyrosine kinases that add the phosphates. Whereas only a few types 
of serine/threonine protein phosphatase catalytic subunits are responsible for 
removing phosphate groups from phosphorylated serines and threonines on 
proteins, there are about 100 protein tyrosine phosphatases encoded in the 
human genome, including some dual-specificity phosphatases that also dephos- 
phorylate serines and threonines. 

Like tyrosine kinases, the tyrosine phosphatases occur in both cytoplasmic 
and transmembrane forms, none of which are structurally related to serine/ 
threonine protein phosphatases. Some of the transmembrane forms are thought 
to act as cell-surface receptors, but as this has not been established, they are 
generally referred to as receptorlike tyrosine phosphatases. 



Figure 1 5-68 The JAK-STAT signaling 
pathway activated by cytokines. The 

binding of the cytokine either causes two 
separate receptor polypeptide chains to 
dimerize (as shown) or reorients the 
receptor chains in a preformed dimer. In 
either case, the associated JAKs are 
brought together so that they can cross- 
phosphorylate each other on tyrosines, 
starting the signaling process shown. In 
some cases, the active receptor is a trimer 
rather than a dimer. 
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Table 15-6 Some Extracellular Signal Proteins That Act Through Cytokine Receptors and the JAK-STAT Signaling Pathway 



SIGNAL PROTEIN 


RECEPTOR-ASSOCIATED 


STATS ACTIVATED 


SOME RESPONSES 




JAKs 






y-i interferon 


JAK1 and JAK2 


STAT1 


activates macrophages 


oc-i interferon 


Tyk2 and JAK2 


STAT1 and STAT2 


increases cell resistance to viral infection 


Erythropoietin 


JAK2 


STAT5 


stimulates production of erythrocytes 


Prolactin 


JAK1 and JAK2 


STAT5 


stimulates milk production 


Growth hormone 


JAK2 


STAT1 and STAT5 


stimulates growth by inducing IGF1 
production 


GMCSF 


JAK2 


STAT5 


stimulates production of granulocytes and 
macrophages 



Unlike serine /threonine protein phosphatases, which generally have broad 
specificity, most tyrosine phosphatases display exquisite specificity for their 
substrates, removing phosphate groups from only selected phosphotyrosines on 
a subset of proteins. Together, these phosphatases ensure that tyrosine phos- 
phorylations are short-lived and that the level of tyrosine phosphorylation in 
resting cells is very low. They do not, however, simply continuously reverse the 
effects of protein tyrosine kinases; they are often regulated to act only at the 
appropriate time and place in a signaling response or in the cell-division cycle 
(discussed in Chapter 17). 

Having discussed the crucial role of tyrosine phosphorylation and dephos- 
phorylation in the intracellular signaling pathways activated by many enzyme- 
coupled receptors, we now turn to a class of enzyme-coupled receptors that rely 
entirely on serine /threonine phosphorylation. These receptor serine/ threonine 
kinases activate an even more direct signaling pathway to the nucleus than does 
the JAK-STAT pathway. They directly phosphorylate latent gene regulatory pro- 
teins called Smads, which then translocate into the nucleus to activate gene 
transcription. 



Signal Proteins of the TGFp Superfamily Act Through Receptor 
Serine/Threonine Kinases and Smads 

The transforming growth factor- P (TGF|3) superfamily consists of a large num- 
ber (30-40 in humans) of structurally related, secreted, dimeric proteins. They 
act either as hormones or, more commonly, as local mediators to regulate a wide 
range of biological functions in all animals. During development, they regulate 
pattern formation and influence various cell behaviors, including proliferation, 
specification and differentiation, extracellular matrix production, and cell 
death. In adults, they are involved in tissue repair and in immune regulation, as 
well as in many other processes. The superfamily consists of the TGFfi/ activin 
family and the larger bone morphogenetic protein (BMP) family. 

All of these proteins act through enzyme -coupled receptors that are single- 
pass transmembrane proteins with a serine /threonine kinase domain on the 
cytosolic side of the plasma membrane. There are two classes of these receptor 
serine/threonine kinases — type I and type II — which are structurally similar 
homodimers. Each member of the TGF(3 superfamily binds to a characteristic 
combination of type-I and type-II receptor dimers, bringing the kinase domains 
together so that the type-II receptor can phosphorylate and activate the type-I 
receptor, forming an active tetrameric receptor complex. 

Once activated, the receptor complex uses a strategy for rapidly relaying the 
signal to the nucleus that is very similar to the JAK-STAT strategy used by 
cytokine receptors. The activated type-I receptor directly binds and phospho- 
rylates a latent gene regulatory protein of the Smad family (named after the 
first two identified, Sma in C. elegans and Mad in Drosophila). Activated 
TGFp/activin receptors phosphorylate Smad2 or Smad3, while activated BMP 
receptors phosphorylate Smadl, Smad5, or Smad8. Once one of these receptor- 
activated Smads (R-Smads) has been phosphorylated, it dissociates from the 
receptor and binds to Smad4 (called a co-Smad), which can form a complex 
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with any of the five R-Smads. The Smad complex then translocates into the 
nucleus, where it associates with other gene regulatory proteins and regulates 
the transcription of specific target genes (Figure 15-69). Because the partner 
proteins in the nucleus vary depending on the cell type and state of the cell, the 
genes affected vary 

Activated TGF(3 receptors and their bound ligand are endocytosed by two 
distinct routes, one leading to further activation, and the other leading to inac- 
tivation. The activation route depends on clathrin- coated vesicles and leads to 
early endosomes (discussed in Chapter 13), where most of the Smad activation 
occurs. An anchoring protein called SARA (for Smad anchor for receptor activa- 
tion) has an important role in this pathway; it is concentrated in early endo- 
somes and binds to both activated TGF(3 receptors and Smads, increasing the 
efficiency of receptor-mediated Smad phosphorylation. The inactivation route 
depends on caveolae (discussed in Chapter 13) and leads to receptor ubiquityla- 
tion and degradation in proteasomes. 

Some TGF(3 family members serve as graded morphogens during develop- 
ment, inducing different responses in a developing cell depending on the con- 
centration of the morphogen (see Figure 15-10, and discussed in Chapter 22). 
Their effective extracellular concentrations are often regulated by secreted 
inhibitory proteins that bind directly to the signal molecules and prevent them 
from activating their receptors on target cells. Noggin and chordin, for example, 
inhibit BMPs, and follistatin inhibits activins. Some of these inhibitors, as well as 
most TGF(3 family members themselves, are secreted as inactive precursors, 
which are activated by extracellular proteolytic cleavage. 

During the signaling response, the Smads continuously shuttle between the 
cytoplasm and the nucleus: they are dephosphorylated in the nucleus and 
exported to the cytoplasm, where they can be rephosphorylated by activated 
receptors. In this way, the effect exerted on the target genes reflects both the con- 
centration of the extracellular signal and the time for which it continues to act on 
the cell-surface receptors (often several hours). Cells exposed to a morphogen at 
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Figure 15-69 The Smad-dependent 
signaling pathway activated byTGF|3. 

Note thatTGFp is a dimer and that Smads 
open up to expose a dimerization surface 
when they are phosphorylated. Several 
features of the pathway have been 
omitted for simplicity, including the 
following. (1 ) The type-l and type-ll 
receptor proteins are both thought to be 
homodimers. (2) The type-l receptors are 
normally associated with an inhibitory 
protein, which dissociates when the type- 
I receptor is phosphorylated by a type-ll 
receptor. (3) The individual Smads are 
thought to be trimers. (4) An anchoring 
protein called SARA helps to recruit 
Smad2 or Smad3 to the activated type I 
receptor, mainly in endosomes, as 
discussed in the text. 
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high concentration, or for a long time, or both, will switch on one set of genes, 
while cells receiving a lower or more transient exposure will switch on another 
set. 

As with the JAK-STAT pathway, negative feedback regulates the Smad path- 
way. Among the target genes activated by Smad complexes are those that encode 
inhibitory Smads, either Smad6 or Smad7. Smad7 (and possibly Smad6) binds to 
the activated receptor and inhibits its signaling ability in at least three ways: (1) 
it competes with R- Smads for binding sites on the receptor, decreasing R-Smad 
phosphorylation; (2) it recruits a ubiquitin ligase called Smurf, which ubiquity- 
lates the receptor, leading to receptor internalization and degradation (it is 
because Smurfs also ubiquitylate and promote the degradation of Smads that 
they are called Smad ubiquitylation regulatory factors, or Smurfs); and (3) it 
recruits a protein phosphatase that dephosphorylates and inactivates the recep- 
tor. In addition, the inhibitory Smads bind to the co-Smad, Smad4, and inhibit 
it, either by preventing its binding to R- Smads or by promoting its ubiquitylation 
and degradation. 

Although receptor serine /threonine kinases operate mainly through the 
Smad pathway just described, they can also affect other intracellular signaling 
pathways. Conversely, signaling proteins in other pathways can phosphorylate 
Smads and thereby influence signaling along the Smad pathway. 

Serine/Threonine and Tyrosine Protein Kinases Are Structurally 
Related 

All the signaling pathways activated by GPCRs and enzyme -coupled receptors 
that we have discussed so far depend on serine/threonine-specific protein 
kinases, tyrosine-specific protein kinases, or both. These kinases are all struc- 
turally related, as summarized in Figure 15-70. 

The bewildering complexity of multiple cross -regulatory signaling pathways 
and feedback loops that we have discussed is not just a haphazard tangle, but a 
highly evolved system for processing and interpreting the mass of signals that 
impinge upon animal cells. The whole molecular control network, leading from 
the receptors at the cell surface to the genes in the nucleus, can be viewed as a 
computing device; and, like that other computing device, the brain, it presents 
one of the hardest problems in biology. We can identify the components and dis- 
cover how they work individually. We can understand how small subsets of com- 
ponents work together as regulatory modules, as we have seen. But, it is a much 
more difficult task to understand how the system works as a whole. This is not 
only because the system is complex; it is also because the way it behaves is 
strongly dependent on the quantitative details of the molecular interactions, 
and for most animal cells we have only rough qualitative information. 

In bacterial cells, the signaling pathways are simpler, and precise quantita- 
tive information is much easier to obtain. It is therefore possible to give a 
detailed account of how a complete signaling system works, at least for one par- 
ticular bacterial cell behavior and the signals that control it. We discuss here one 
such example, in which bacteria respond to environmental signals delivered via 
enzyme-coupled receptors that are again kinases, but of a type unrelated to 
those we have discussed so far. 

Bacterial Chemotaxis Depends on a Two-Component Signaling 
Pathway Activated by Histidine-Kinase-Associated Receptors 

As pointed out earlier, many of the mechanisms involved in chemical signal- 
ing between cells in multicellular animals are thought to have evolved from 
mechanisms used by unicellular organisms to respond to chemical changes in 
their environment. In fact, both types of organisms use some of the same 
intracellular mediators, such as cyclic nucleotides and Ca 2+ . Among the best- 
studied reactions of unicellular organisms to extracellular signals are their 
chemotactic responses, in which cell movement is oriented toward or away 
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from a source of some chemical in the environment. We conclude this section 
on enzyme-coupled receptors with a brief account of bacterial chemotaxis, 
which provides a particularly well-understood illustration of the crucial role of 
adaptation in the response to chemical signals. This chemotaxis response is 
mediated by histidine-kinase- associated receptors that activate a two-compo- 
nent signaling pathway, which is also used by yeasts and plants, although appar- 
ently not by animals. 

Motile bacteria such as E. coli will swim toward higher concentrations of 
nutrients (attractants), including sugars, amino acids, and small peptides, and 
away from higher concentrations of various noxious chemicals (repellents). They 
swim by means of four to six flagella, each of which is attached by a short, flexi- 
ble hook at its base to a small protein disc embedded in the bacterial membrane. 
This disc is part of a tiny motor that uses the energy stored in the transmem- 
brane H + gradient to rotate rapidly and turn the helical flagellum (Figure 15-71) . 
Because the flagella on the bacterial surface have an intrinsic "handedness," dif- 
ferent directions of rotation have different effects on movement. The flagella 
spend most of the time rotating counterclockwise, which draws all the flagella 
together into a coherent bundle, so that the bacterium swims uniformly in one 
direction. In the absence of any environmental stimulus, every second or so, 
one or more of the motors transiently reverses direction so that the attached 
flagellum breaks out of the bundle, causing the bacterium to tumble chaotically 
without moving forward (Figure 15-72). This sequence produces a characteris- 
tic pattern of movement in which smooth swimming in a straight line is inter- 
rupted by abrupt, random changes in direction caused by tumbling. 



Figure 15-70 Some of the protein 
kinases discussed in this chapter. The 

size and location of their catalytic 
domains (dark green) are shown. In each 
case the catalytic domain is about 
250 amino acids long. These domains are 
all similar in amino acid sequence, 
suggesting that they have all evolved 
from a common primordial kinase (see 
also Figure 3-66). Note that all of the 
tyrosine kinases shown are bound to the 
plasma membrane (JAKs are bound by 
their association with cytokine 
receptors — see Figure 15-68), whereas 
most of the serine/threonine kinases are 
in the cytosol. 
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Figure 15-71 The bacterial flagellar motor. (A) A schematic drawing of the structure. The flagellum is linked to a flexible 
hook. The hook is attached to a series of protein rings (shown in orange), which are embedded in the outer and inner 
(plasma) membranes. The rings form a rotor, which rotates with the flagellum at more than 1 00 revolutions per second. The 
rotation is driven by a flow of protons through an outer ring of proteins, the stator (blue), which is embedded in the inner 
membrane and tethered to the peptidoglycan layer. The stator also contains the proteins responsible for switching the 
direction of rotation. (B) A flagellar motor reconstructed from multiple electron microscopic images. (A, based on data from 
T Kubori et al., J. Mol. Biol. 226:433-446,1 992, with permission from Academic Press, and N.R. Francis et al., Proc Natl Acad. 
Sci. U.S.A. 89:6304-6308, 1992, with permission from National Academy of Sciences; B, from D.Thomas, D.G. Morgan and 
DJ. DeRosier, J. Bacteriol. 1 83:6404-641 2, 2001 . With permission from American Society for Microbiology.) 

The normal swimming behavior of bacteria is modified by chemotactic 
attractants or repellents, which bind to specific receptor proteins and affect the 
frequency of tumbling by increasing or decreasing the time that elapses between 
successive changes in direction of flagellar rotation. When bacteria are swim- 
ming in a favorable direction (toward a higher concentration of an attractant or 
away from a higher concentration of a repellent), they tumble less frequently 
than when they are swimming in an unfavorable direction (or when no gradient 
is present) . Since the periods of smooth swimming are longer when a bacterium 
is traveling in a favorable direction, it will gradually progress in that direction — 
toward an attractant or away from a repellent. 

Histidine-kinase-associated chemotaxis receptors mediate these 
responses. The receptors typically are dimeric transmembrane proteins that 
bind specific attractants and repellents on the outside of the plasma membrane. 
The cytoplasmic tail of the receptor is stably associated with a histidine kinase 
CheAvia an adaptor protein CheW (Figure 15-73). Repellent binding activates 
the receptors, whereas attractant binding inactivates them; a single receptor can 
bind either type of molecule, with opposite consequences. The binding of a 
repellent to the receptor activates CheA, which phosphorylates itself on a histi- 
dine and almost immediately transfers the phosphoryl group to an aspartic acid 
on a response regulator protein CheY. The phosphorylated CheY dissociates 
from the receptor, diffuses through the cytosol, binds to a flagellar motor, and 
causes the motor to rotate clockwise, so that the bacterium tumbles. CheY has 
intrinsic phosphatase activity and dephosphorylates itself in a process that is 
greatly accelerated by the CheZ protein (see Figure 15-73). 



(A) 




TUMBLING 



Receptor Methylation Is Responsible for Adaptation in Bacterial 
Chemotaxis 

The change in tumbling frequency in response to an increase in the concentra- 
tion of an attractant or repellent occurs within less than a second, but it is only 



(B) 

Figure 15-72 Positions of the flagella on 
E co// during swimming. (A) When the 
flagella rotate counterclockwise (as seen 
looking into the cell from the flagellum), 
they are drawn together into a single 
bundle, which acts as a propeller to 
produce smooth swimming. (B) When 
one or more motors reverses direction, 
the attached flagellum breaks out of the 
bundle, causing the bacterium to tumble. 
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transient. Even if the higher level of ligand is maintained, within minutes, the 
bacteria adapt (desensitize), to the increased stimulus. The adaptation is a cru- 
cial part of the response, as it enables a bacterium to compare its present envi- 
ronment with that in its recent past and thereby respond to changes in the con- 
centration of ligand rather than to steady-state levels. 

The adaptation is mediated by the covalent methylation (catalyzed by a 
methyl transferase) or demethylation (catalyzed by a methylase) of the chemo- 
taxis receptors, which change their responsiveness to ligand binding as a conse- 
quence of the covalent modification. When an attractant binds to a chemotaxis 
receptor, for example, it has two effects: (1) It decreases the ability of the recep- 
tor to activate CheA, resulting in a decreased rate of tumbling. (2) It slowly (over 
minutes) alters the receptor so that it can be methylated by the methyl trans- 
ferase, which returns the receptor's ability to activate CheA to its original level. 
Thus, the unmethylated receptor without a bound ligand has the same activity 
as the methylated receptor with a bound ligand, and the tumbling frequency of 
the bacterium is therefore the same in both cases. 

Each receptor dimer has eight methylation sites, and the number of sites 
methylated increases with increasing concentration of the attractant (as each 
receptor spends a longer time with a ligand bound at higher concentrations). 
When the attractant is removed, the methylase demethylates the receptor. 
Although the level of receptor methylation changes during a chemotactic 
response, it remains constant once a bacterium is adapted because an exact bal- 
ance is reached between the rates of methylation and demethylation. A simple 
model of how ligand binding and methylation may operate in bacterial chemo- 
taxis proposes that both receptor methylation and repellent binding tighten the 
structure of the multisubunit receptor and its associated signaling proteins, 
thereby increasing signaling and tumbling; by contrast, receptor demethylation 
and attractant binding loosen the structure of the complex, thereby decreasing 
signaling and tumbling. It is thought that the sensitivity of this response is 
greatly increased by cooperative effects that result from the clustering of the 
cytoplasmic tails of adjacent receptors in the membrane (Figure 15-74). 

All of the genes and proteins involved in bacterial chemotaxis have now 
been identified, and their interactions have been measured in some detail. It 
therefore seems likely that it will be the first signaling system to be completely 
understood in molecular terms. Even in this relatively simple signaling network, 
however, computer-based simulations are required to comprehend how the sys- 
tem works as an integrated network. Cell signaling pathways will provide an 
especially rich area of investigation for a new generation of computational biol- 
ogists, as the network properties of these pathways are not understandable 
without powerful computational tools. 

As mentioned earlier, there are some cell- surface receptor proteins that do 
not fit into the three major classes we have discussed thus far — ion-channel- 
coupled, G-protein-coupled, and enzyme-coupled. In the next section, we con- 
sider cell-surface receptors that activate signaling pathways that depend on pro- 
teolysis to regulate the activity of latent gene regulatory proteins. These path- 
ways have especially important roles in animal development and in tissue 
renewal and repair. 



Summary 

There are various classes of enzyme- coupled receptors, including receptor tyrosine 
kinases (RTKs), tyrosine-kinase- associated receptors, receptor serine /threonine 
kinases, and histidine-kinase- associated receptors. The first two classes are by far the 
most numerous in mammals. 

Ligand binding to RTKs induces the receptors to cross -phosphory late their cyto- 
plasmic domains on multiple tyrosines. This transautophosphorylation both stimu- 
lates the kinases and produces a set of phosphotyrosines that serve as docking sites for 
a set of intracellular signaling proteins, which bind via their SH2 (or PTB) domains. 
One such signaling protein serves as an adaptor to couple some activated receptors to 
a Ras-GEF (Sos), which activates the monomeric GTPase Ras; Ras, in turn, activates a 
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Figure 1 5-73 The two-component 
signaling pathway that enables 
chemotaxis receptors to control the 
flagellar motors during bacterial 
chemotaxis. The histidine kinase CheA is 
stably bound to the receptor via the 
adaptor protein CheW.The receptors and 
their associated proteins are all clustered 
at one end of the cell (see Figure 15-74). 
The binding of a repellent increases the 
activity of the receptor, which stimulates 
CheA to phosphorylate itself on histidine. 
CheA quickly transfers its covalently 
bound, high-energy phosphoryl group 
directly to the response regulator CheY to 
generate CheY-phosphate, which then 
diffuses away, binds to a flagellar motor, 
and causes the motor to rotate clockwise, 
which results in tumbling. The binding of 
an attractant has the opposite effect: it 
decreases the activity of the receptor and 
therefore decreases the phosphorylation 
of CheA and CheY, which results in 
counterclockwise flagellar rotation and 
smooth swimming. CheZ accelerates the 
autodephosphorylation of CheY- 
phosphate, thereby inactivating CheY. 
Each of the phosphorylated intermediates 
decays with a half life of less than a 
second, enabling the bacterium to 
respond very quickly to changes in its 
environment (see Figure 1 5-1 1 ). 
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three-component MAP kinase signaling module, which relays the signal to the nucleus 
by phosphorylating gene regulatory proteins there. Another important signaling pro- 
tein that can dock on activated RTKs is PI 3-kinase, which phosphorylates specific 
phosphoinositides to produce lipid docking sites in the plasma membrane for signal- 
ing proteins with phosphoinositide-binding PH domains, including the serine/threo- 
nine protein kinase Akt (PKB), which plays a key part in the control of cell survival and 
growth. Many receptor classes, including some RTKs, activate Rho family monomeric 
GTPases, which functionally couple the receptors to the cytoskeleton. 

Tyrosine-kinase- associated receptors depend on various cytoplasmic tyrosine 
kinases for their action. These kinases include members of the Src family, which associ- 
ate with many kinds of receptors, and the focal adhesion kinase (FAK), which associates 
with integrins at focal adhesions. The cytoplasmic tyrosine kinases then phosphorylate 
a variety of signaling proteins to relay the signal onward. The largest family of recep- 
tors in this class is the cytokine receptor family. When stimulated by ligand binding, 
these receptors activate JAK cytoplasmic tyrosine kinases, which phosphorylate STATs. 
The STATs then dimerize, translocate to the nucleus, and activate the transcription of 
specific genes. Receptor serine/ threonine kinases, which are activated by signal pro- 
teins of the TGFfi superfamily, act similarly: they directly phosphorylate and activate 
Smads, which then oligomerize with another Smad, translocate to the nucleus, and 
activate gene transcription. 

Bacterial chemotaxis is regulated by a signaling pathway that is understood excep- 
tionally well. It is governed by histidine-kinase- associated chemotaxis receptors, which 
activate a two -component signaling pathway. When activated by a repellent, a chemo- 
taxis receptor stimulates its associated histidine kinase to phosphorylate itself on histi- 
dine and then transfer the phosphoryl group to a response regulator protein, which 
relays the signal to a flagellar motor to alter the bacterium's swimming behavior. Attrac- 
tants have the opposite effect on this kinase and therefore on swimming. These 
responses depend crucially on receptor adaptation, which is mediated by reversible 
receptor methylation. 



Figure 1 5-74 A structural model for the 
clustering of chemotaxis receptors in the 
bacterial plasma membrane. Two 

clustered receptors are shown. Each is a 
homodimer, and the long a-helical 
cytoplasmic tail of each of the two 
subunits folds back on itself. Thus, each 
subunit contributes two helices to the 
four-helix bundle. Distance measurements 
between protein domains, determined by 
a technique known as pulsed electron-spin 
resonance, have been integrated with the 
known three-dimensional protein structure 
to produce the model shown. The 
clustering of receptor-CheA-CheW 
complexes into arrays of this type allows 
cooperative interactions between adjacent 
complexes, greatly increasing the 
sensitivity of the signaling process. 
(A) View of the structure just below the 
bacterial plasma membrane. The 
extracellular domain of the receptor is 
drawn schematically. (B) View of the same 
structure looking inward from the plasma 
membrane. (Adapted from S.Y. Park et al., 
Nat. Struct. Biol. 1 3:400-407, 2006. With 
permission from Macmillan Publishers Ltd.) 
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SIGNALING PATHWAYS DEPENDENT ON 
REGULATED PROTEOLYSIS OF LATENT GENE 
REGULATORY PROTEINS 

The need for intercellular signaling is never greater than during animal develop- 
ment. Each cell in the embryo has to be guided along one developmental path- 
way or another according to its history, its position, and the character of its 
neighbors. At each step in the pathway, it must exchange signals with its neigh- 
bors to coordinate its behavior with theirs, ensuring the correct number and 
pattern of different cell types in each tissue and organ. Most of the signaling 
pathways already discussed are widely used for these developmental purposes, 
controlling cell survival, growth, proliferation, adhesion, specification, differen- 
tiation, and migration. 

There are other signaling pathways, however, that are at least as important 
in controlling developmental processes but relay signals in other ways from cell- 
surface receptors to the interior of the cell. Several of these pathways depend on 
regulated proteolysis to control the activity and location of latent gene regulatory 
proteins, which enter the nucleus and activate the transcription of specific tar- 
get genes only after they have been signaled to do so. Although the STAT and 
Smad proteins discussed earlier are also latent gene regulatory proteins, they are 
activated by phosphorylation in response to extracellular signals rather than by 
highly selective protein degradation. Signaling pathways that use latent gene 
regulatory proteins provide, as their primary function, a relatively direct linear 
pathway by which extracellular signals can control gene expression, which is 
presumably why they are so commonly used during development, especially to 
control cell fate decisions. 

Although most of the pathways we discuss in this section were discovered 
through genetic studies in Drosophila, they have been highly conserved in evo- 
lution and are used over and over again during the development of different tis- 
sues and different animals. As we discuss in Chapter 23, they also have a crucial 
role in the many developmental processes that continue in adult tissues and 
organs, where new cells are constantly being produced. 

We discuss four of these signaling pathways in this section: that mediated by 
the receptor protein Notch, that activated by secreted Wnt proteins, that acti- 
vated by secreted Hedgehog proteins, and the pathway that activates the latent 
gene regulatory protein NFkB. All of these pathways have crucial roles in animal 
development, and we discuss the central roles of Notch, Wnt, and Hedgehog sig- 
naling in embryonic development in detail in Chapter 22. 



The Receptor Protein Notch Is a Latent Gene Regulatory Protein 

Signaling through the Notch receptor protein may be the most widely used sig- 
naling pathway in animal development. As discussed in Chapter 22, it has a gen- 
eral role in controlling cell fate choices and regulating pattern formation during 
the development of most tissues, as well as in the continual renewal of tissues 
such as the lining of the gut. It is best known, however, for its role in the pro- 
duction of nerve cells in Drosophila, which usually arise as isolated single cells 
within an epithelial sheet of precursor cells. During this process, when a precur- 
sor cell commits to becoming a nerve cell, it signals to its immediate neighbors 
not to do the same; the inhibited cells develop into epidermal cells instead. This 
process, called lateral inhibition, depends on a contact-dependent signaling 
mechanism that is activated by a single-pass transmembrane signal protein 
called Delta, displayed on the surface of the future neural cell. By binding to the 
Notch receptor protein on a neighboring cell, Delta signals to the neighbor not 
to become neural (Figure 15-75). When this signaling process is defective, the 
neighbors of neural cells also develop as neural cells, producing a huge excess of 
neurons at the expense of epidermal cells, which is lethal. 

Signaling between adjacent cells via Notch and Delta (or Deltalike ligands) 
regulates cell fate choices in many tissues and animals. Often, it mediates lateral 
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inhibition to control the formation of mixtures of differentiated cell types within 
a tissue, as in the fly nervous system. In some other cases, however, it works in 
the opposite way, promoting rather than inhibiting a particular cell fate and 
driving neighboring cells to behave similarly There is hardly any developmental 
cell behavior that is not regulated by Notch signaling in one tissue or another. 

Notch is a single-pass transmembrane protein that requires proteolytic pro- 
cessing to function. It acts as a latent gene regulatory protein and provides the 
simplest and most direct signaling pathway known from a cell-surface receptor 
to the nucleus. When activated by the binding of Delta on another cell, a plasma- 
membrane-bound protease cleaves off the cytoplasmic tail of Notch, and the 
released tail translocates into the nucleus to activate the transcription of a set of 
Notch-response genes. The Notch tail fragment acts by binding to a DNA-bind- 
ing protein, converting it from a transcriptional repressor into a transcriptional 
activator. We shall see that the Wnt and Hedgehog signaling pathways use this 
same strategy of switching a transcriptional repressor into a transcriptional acti- 
vator to regulate cell fate. The set of genes regulated by Notch signaling varies 
depending on the tissue and circumstances, although the primary targets in 
most cells are members of a gene family, known (in mammals) as Hes genes, 
which encode inhibitory gene regulatory proteins. In the nervous system, for 
example, the products of the Hes genes block the expression of genes required 
for neural differentiation. 

The Notch receptor undergoes three successive proteolytic cleavage steps, 
but only the last two depend on Delta binding. As part of its normal biosynthe- 
sis, it is cleaved in the Golgi apparatus to form a heterodimer, which is then 
transported to the cell surface as the mature receptor. The binding of Delta to 
Notch induces a second cleavage in the extracellular domain, mediated by an 
extracellular protease. A final cleavage quickly follows, cutting free the cytoplas- 
mic tail of the activated receptor (Figure 15-76). Note that, unlike most recep- 
tors, the activation of Notch is irreversible; once activated by ligand binding, the 
protein cannot be used again. 

This final cleavage of the Notch tail occurs just within the transmembrane 
segment, and it is mediated by a protease complex called y-secretase, which is also 
responsible for the intramembrane cleavage of various other proteins. One of its 
essential subunits is Presenilin, so-called because mutations in the gene encod- 
ing it are a frequent cause of early- onset, familial Alzheimer's disease, a form of 
presenile dementia. The protease complex is thought to contribute to this and 
other forms of Alzheimer's disease by generating extracellular peptide fragments 
from a transmembrane neuronal protein; the fragments accumulate in excessive 
amounts and form aggregates of misfolded protein called amyloid plaques, 
which may injure nerve cells and contribute to their degeneration and loss. 

Both Notch and Delta are glycoproteins, and their interaction is regulated by 
the glycosylation of Notch. The Fringe family of glycosyltransferases, in particu- 
lar, adds extra sugars to the O-linked oligosaccharide (discussed in Chapter 13) 
on Notch, which alters the specificity of Notch for its ligands. This has provided 
the first example of the modulation of ligand-receptor signaling by differential 
receptor glycosylation. 



Figure 15-75 Lateral inhibition mediated 
by Notch and Delta during nerve cell 
development in Drosophila. When 
individual cells in the epithelium begin to 
develop as neural cells, they signal to their 
neighbors not to do the same. This 
inhibitory, contact-dependent signaling is 
mediated by the ligand Delta, which 
appears on the surface of the future nerve 
cell and binds to Notch receptor proteins 
on the neighboring cells. In many tissues, 
all the cells in a cluster initially express 
both Delta and Notch, and a competition 
occurs, with one cell emerging as winner, 
expressing Delta strongly and inhibiting its 
neighbors from doing likewise (see Figure 
22-60). In other cases, additional factors 
interact with Delta or Notch to make some 
cells susceptible to the lateral inhibition 
signal and others unresponsive to it. 
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Wnt Proteins Bind to Frizzled Receptors and Inhibit the 
Degradation of p-Catenin 

Wnt proteins are secreted signal molecules that act as local mediators and mor- 
phogens to control many aspects of development in all animals that have been 
studied. They were discovered independently in flies and in mice: in Drosophila, 
the Wingless (Wg) gene originally came to light because of its role as a mor- 
phogen in wing development (discussed in Chapter 22), while in mice, the Intl 
gene was found because it promoted the development of breast tumors when 
activated by the integration of a virus next to it. Both of these genes encode Wnt 
proteins. Wnts are unusual as secreted proteins in that they have a fatty acid 
chain covalently attached to their N-terminus, which increases their binding to 
cell surfaces. There are 19 Wnts in humans, each having distinct, but often over- 
lapping, functions. 

Wnts can activate at least three types of intracellular signaling pathways: (1) 
The Wnt/fi-catenin pathway (also known as the canonical Wnt pathway) is cen- 
tered on the latent gene regulatory protein /5-catenin. (2) The planar polarity 
pathway coordinates the polarization of cells in the plane of a developing epithe- 
lium (discussed in Chapters 19 and 22) and depends on Rho family GTPases. (3) 
The Wnt/Ca 2+ pathway stimulates an increase of intracellular Ca 2+ , with conse- 
quences of the sort we described earlier for other pathways. All three pathways 
begin with the binding of Wnts to Frizzled family cell-surface receptors, which 



Figure 15-76 The processing and 
activation of Notch by proteolytic 
cleavage. The numbered red arrowheads 
indicate the sites of proteolytic cleavage. 
The first proteolytic processing step occurs 
within the trans Golgi network to generate 
the mature heterodimeric Notch receptor 
that is then displayed on the cell surface. 
The binding of Delta, which is displayed 
on a neighboring cell, triggers the next 
two proteolytic steps: the complex of 
Delta and the Notch subunit to which it is 
bound is endocytosed by the Delta- 
expressing cell, exposing the extracellular 
cleavage site in the transmembrane Notch 
subunit. Note that Notch and Delta 
interact through their repeated EGF-like 
domains. 

The released Notch tail migrates into the 
nucleus, where it binds to the Rbpsuh 
protein and converts it from a 
transcriptional repressor to a 
transcriptional activator. 
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are seven-pass transmembrane proteins that resemble GPCRs in structure. 
There are seven such receptors in humans. When activated by Wnt binding, Friz- 
zled proteins recruit the scaffold protein Dishevelled, which is required for 
relaying the signal down all three signaling pathways. We focus here on the first 
pathway. 

The Wnt/p-catenin pathway acts by regulating the proteolysis of the multi- 
functional protein P-catenin (or Armadillo in flies), which functions both in 
cell-cell adhesion (discussed in Chapter 19) and in gene regulation. In this path- 
way (but not in the other Wnt pathways), Wnts act by binding to both a Frizzled 
protein and a co-receptor protein that is related to the low-density lipoprotein 
(LDL) receptor protein (discussed in Chapter 13) and is therefore called an LDL- 
receptor-related protein (LRP). In epithelial cells, most of a cell's p-catenin is 
located at cell-cell adherens junctions, where it is associated with cadherins, 
which are transmembrane cell-cell adhesion proteins. As discussed in Chapter 
19, the p-catenin in these junctions helps link the cadherins to the actin 
cytoskeleton. In both epithelial and nonepithelial cells, the p-catenin that is not 
associated with cadherins is rapidly degraded in the cytoplasm. 

The degradation of cytoplasmic p-catenin depends on a large protein degra- 
dation complex, which binds p-catenin and keeps it out of the nucleus while 
promoting its degradation. The complex contains at least four other proteins: a 
serine/threonine kinase called casein kinase 1 (CK1) phosphorylates the |3- 
catenin on a serine, priming it for further phosphorylation by another 
serine/threonine kinase called glycogen synthase kinase 3 (GSK3); this final 
phosphorylation marks the protein for ubiquitylation and rapid degradation in 
proteasomes. Two scaffold proteins called axin and Adenomatous polyposis coli 
(APC) hold the protein complex together (Figure 15-77A). APC gets its name 
from the finding that the gene encoding it is often mutated in a type of benign 
tumor (adenoma) of the colon; the tumor projects into the lumen as a polyp and 
can eventually become malignant. (This APC is not to be confused with the 
Anaphase Promoting Complex, or APC, that plays a central part in selective pro- 
tein degradation during the cell cycle — see Figure 17-20A.) 
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Figure 15-77 The Wnt/|3-catenin 
signaling pathway. (A) In the absence of 
a Wnt signal, p-catenin that is not bound 
to the cytosolic tail of cadherin proteins 
(not shown) becomes bound by a 
degradation complex containing APC, 
axin, GSK3, and CK1 . In this complex, 
p-catenin is phosphorylated by CK1 and 
then by GSK3, triggering its 
ubiquitylation and degradation in 
proteasomes. Wnt-responsive genes are 
kept inactive by the Groucho 
co-repressor protein bound to the gene 
regulatory protein LEF1/TCF. (B) Wnt 
binding to Frizzled and LRP clusters the 
two types of receptors together, resulting 
in the recruitment of the degradation 
complex to the plasma membrane and 
the phosphorylation of the cytosolic tail 
of LRP by GSK3 and then by CKIy. Axin 
binds to the phosphorylated LRP and is 
inactivated and/or degraded. The loss of 
axin from the degradation complex 
inactivates the complex and thereby 
blocks p-catenin phosphorylation and 
ubiquitylation, which allows 
unphosphorylated p-catenin to 
accumulate and translocate to the 
nucleus. Dishevelled and probably a 
G protein are required for the signaling 
pathway to operate; both bind to Frizzled 
and Dishevelled becomes 
phosphorylated (not shown), but their 
functional roles are unknown. 

Once in In the nucleus, p-catenin binds 
to LEF1/TCF, displaces the co-repressor 
Groucho, and acts as a coactivator to 
stimulate the transcription of Wnt 
target genes. 
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The binding of a Wnt protein to both a Frizzled and LRP receptor brings the 
two receptors together to form a complex. In a poorly understood process, the 
two protein kinases, GSK3 and then CKly, phosphorylate the cytosolic tail of the 
LRP receptor, allowing the LRP tail to recruit and inactivate axin, thereby dis- 
rupting the degradation complex in the cytoplasm. In this way, the phosphory- 
lation and degradation of (3-catenin are inhibited, enabling unphosphorylated 
(3-catenin gradually to accumulate and translocate to the nucleus, where it alters 
the pattern of gene transcription (Figure 15-77B). 

In the absence of Wnt signaling, Wnt-responsive genes are kept silent by an 
inhibitory complex of gene regulatory proteins. The complex includes proteins of 
the LEF1/TCF family bound to a co-repressor protein of the Groucho family (see 
Figure 15-77A). In response to a Wnt signal, (3-catenin enters the nucleus and 
binds to the LEF1/TCF proteins, displacing Groucho. The (3-catenin now func- 
tions as a coactivator, inducing the transcription of the Wnt target genes (see Fig- 
ure 15-77B). Thus, as in the case of Notch signaling, Wnt/ p-catenin signaling trig- 
gers a switch from transcriptional repression to transcriptional activation. 

Among the genes activated by p-catenin is c-Myc, which encodes a protein 
(c-Myc) that is a powerful stimulator of cell growth and proliferation (discussed 
in Chapter 17). Mutations of the Ape gene occur in 80% of human colon cancers 
(discussed in Chapter 20) . These mutations inhibit the protein's ability to bind (3- 
catenin, so that p-catenin accumulates in the nucleus and stimulates the tran- 
scription of c-Myc and other Wnt target genes, even in the absence of Wnt sig- 
naling. The resulting uncontrolled cell growth and proliferation promote the 
development of cancer. 

Various secreted inhibitory proteins regulate Wnt signaling in development. 
Some bind to the LRP receptors and promote their down-regulation, whereas 
others compete with Frizzled receptors for secreted Wnts. In Drosophila at least, 
Wnts activate negative feedback loops, in which Wnt target genes encode pro- 
teins that help shut the response off; some of these proteins inhibit Dishevelled, 
and others are secreted inhibitors. 



Hedgehog Proteins Bind to Patched, Relieving Its Inhibition of 
Smoothened 

Hedgehog proteins and Wnt proteins act in similar ways. Both are secreted sig- 
nal molecules, which act as local mediators and morphogens in many develop- 
ing invertebrate and vertebrate tissues. Both proteins are modified by covalently 
attached lipids, depend on secreted or cell-surface-bound heparan sulfate pro- 
teoglycans (discussed in Chapter 19) for their action, and activate latent gene 
regulatory proteins by inhibiting their proteolysis. They both trigger a switch 
from transcriptional repression to transcriptional activation, and excessive sig- 
naling along either pathway in adult cells can lead to cancer. They even use 
some of the same intracellular signaling proteins and sometimes collaborate to 
mediate a response. 

The Hedgehog proteins were discovered in Drosophila, where this protein 
family has only one member. Mutation of the Hedgehog gene produces a larva 
covered with spiky processes (denticles), like a hedgehog. At least three genes 
encode Hedgehog proteins in vertebrates — Sonic, Desert, and Indian hedge- 
hog. The active forms of all Hedgehog proteins are covalently coupled to 
cholesterol, as well as to a fatty acid chain. The cholesterol is added during an 
unusual processing step, in which a precursor protein cleaves itself to produce 
a smaller, cholesterol- containing signal protein. Most of what we know about 
the downstream signaling pathway activated by Smoothened, however, came 
initially from genetic studies in flies, and it is the fly pathway that we summa- 
rize here. 

Three transmembrane proteins — Patched, Smoothened, and iHog — medi- 
ate the responses to Hedgehog proteins. Patched is predicted to cross the 
plasma membrane 12 times, and, although much of it is in intracellular vesicles, 
some is on the cell surface where it binds the Hedgehog protein. iHog proteins 
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have four or five immunoglobulin-like domains and two or three fibronectin- 
type-III-like domains; they are on the cell surface and are also thought to serve 
as receptors for Hedgehog proteins, probably acting as co-receptors with 
Patched. Smoothened is a seven-pass transmembrane protein with a structure 
very similar to a Frizzled protein. In the absence of a Hedgehog signal, Patched 
through some unknown mechanism, keeps Smoothened sequestered and inac- 
tive in intracellular vesicles. The binding of Hedgehog to iHog and Patched 
inhibits the activity of Patched and induces its endocytosis and degradation. The 
result is that Smoothened becomes phosphorylated, translocates to the cell sur- 
face, and relays the signal downstream. 

The downstream effects are mediated by a latent gene regulatory protein 
called Cubitus interruptus (Ci). In the absence of a Hedgehog signal, Ci is ubiq- 
uitylated and proteolytically cleaved in proteasomes. Instead of being com- 
pletely degraded, however, Ci is processed to form a smaller protein, which 
accumulates in the nucleus, where it acts as a transcriptional repressor, helping 
to keep Hedgehog-responsive genes silent. The proteolytic processing of the Ci 
protein depends on its phosphorylation by three serine /threonine protein 
kinases — PKA and two kinases also used in the Wnt pathway, namely GSK3 and 
CK1. As in the Wnt pathway, the proteolytic processing occurs in a multiprotein 
complex. The complex includes the serine /threonine kinase Fused and a scaf- 
fold protein Costal2, which stably associates with Ci, recruits the three other 
kinases, and binds the complex to microtubules, thereby keeping unprocessed 
Ci out of the nucleus (Figure 15-78A). 

When the Hedgehog pathway is activated and Smoothened is thereby let 
loose in the plasma membrane, it recruits the protein complexes containing Ci, 
Fused, and Costal2. Costal2 is no longer able to bind the other three kinases, and 
so Ci is no longer cleaved. Unprocessed Ci protein can now enter the nucleus 
and activate the transcription of Hedgehog target genes (Figure 15-78B). Among 
the genes activated by Ci is Patched itself; the resulting increase in Patched pro- 
tein on the cell surface inhibits further Hedgehog signaling — providing yet 
another example of a negative feedback loop. 

Many gaps in the understanding of the Hedgehog signaling pathway remain 
to be filled in. It is not known, for example, how Patched keeps Smoothened 
inactive and intracellular. As the structure of Patched resembles a transmem- 
brane transporter protein, it has been proposed that it may transport a small 
molecule into the cell that keeps Smoothened sequestered in vesicles. 

Even less is known about the more complex Hedgehog pathway in verte- 
brate cells. In addition to there being at least three types of vertebrate Hedgehog 
proteins, there are three Ci-like gene regulatory proteins {GUI, GU2, and GU3) 
downstream of Smoothened. Only Gli3 has been shown to undergo proteolytic 
processing like Ci and to act as either a transcriptional repressor or a transcrip- 
tional activator. Both Glil and Gli2 are thought to act exclusively as transcrip- 
tional activators. Moreover, in vertebrates, Smoothened, upon activation, 
becomes localized to a very specific site in the plasma membrane — the surface 
of the primary cilium, which projects from the surface of most types of verte- 
brate cells (as discussed in Chapter 16). The primary cilium thus acts as a Hedge- 
hog signaling center, and the Gli proteins are also concentrated here. This 
arrangement presumably increases the speed and efficiency of the signaling 
process. 

Hedgehog signaling can promote cell proliferation, and excessive Hedgehog 
signaling can lead to cancer. Inactivating mutations in one of the two human 
Patched genes, for example, which leads to excessive Hedgehog signaling, 
occurs frequently in basal cell carcinoma of the skin, the most common form of 
cancer in Caucasians. A small molecule called cyclopamine, made by a meadow 
lily, is being used to treat cancers associated with excessive Hedgehog signaling. 
It blocks Hedgehog signaling by binding tightly to Smoothened and inhibiting its 
activity. It was originally identified because it causes severe developmental 
defects in the progeny of sheep grazing on such lilies; these include the presence 
of a single central eye (a condition called cyclopia), which is also seen in mice 
that are deficient in Hedgehog signaling. 
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Figure 15-78 Hedgehog signaling in Drosophila. (A) In the absence of 
Hedgehog, Patched keeps Smoothened inactive and sequestered in 
intracellular vesicles. The Ci protein is bound in a protein degradation 
complex in the cytoplasm, which includes the serine/threonine kinase 
Fused and the scaffold protein Costal2. Costal2 recruits three other protein 
kinases (PKA, GSK3, and CK1), which phosphorylate Ci. Phosphorylated Ci is 
ubiquitylated and then cleaved in proteasomes (not shown) to form a 
transcriptional repressor, which accumulates in the nucleus to help keep 
Hedgehog target genes inactive. (B) Hedgehog binding to iHog and 
Patched removes the inhibition of Smoothened by Patched. Smoothened is 
phosphorylated by PKA and CK1 and translocates to the plasma 
membrane, where it recruits Fused and Costal2. Costal2 releases the three 
other kinases and unprocessed Ci.The full-length Ci protein accumulates in 
the nucleus and activates the transcription of Hedgehog target genes. The 
iHog proteins in mammals are called BOC and CDO. Many details in the 
pathway are poorly understood, including the role of Fused. 



Many Stressful and Inflammatory Stimuli Act Through an 
NFKB-Dependent Signaling Pathway 

The NFkB proteins are latent gene regulatory proteins that are present in most 
animal cells and are central to many stressful, inflammatory and innate 
immune responses. These responses occur as a reaction to infection or injury 
and help protect stressed multicellular organisms and their cells (discussed in 
Chapter 24). An excessive or inappropriate inflammatory response in animals 
can also damage tissue and cause severe pain, and chronic inflammation can 
lead to cancer; as in the case of Wnt and Hedgehog signaling, excessive NFkB 
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signaling is found in a number of human cancers. NFkB proteins also have 
important roles during normal animal development. The Drosophila NFkB 
family member Dorsal, for example, has a crucial role in specifying the dor- 
sal-ventral axis of the developing fly embryo (discussed in Chapter 22). 

Various cell-surface receptors activate the NFkB signaling pathway in animal 
cells. Toll receptors in Drosophila and Toll-like receptors in vertebrates, for exam- 
ple, recognize pathogens and activate this pathway in triggering innate immune 
responses (discussed in Chapter 24). The receptors for tumor necrosis factor a 
(TNFa) and interleukin-1 (IL1), which are vertebrate cytokines especially impor- 
tant in inducing inflammatory responses, also activate this pathway. The Toll, Toll- 
like, and IL1 receptors belong to the same family of proteins, whereas TNF recep- 
tors belong to a different family; all of them, however, act in similar ways to acti- 
vate NFkB. When activated, they trigger a multiprotein ubiquitylation and phos- 
phorylation cascade that releases NFkB from an inhibitory protein complex, so 
that it can translocate to the nucleus and turn on the transcription of hundreds of 
genes that participate in inflammatory and innate immune responses. 

There are five NFkB proteins in mammals {RelA, RelB, c-Rel, NFkBI, and 
NFkB2), and they form a variety of homodimers and heterodimers, each of 
which activates its own characteristic set of genes. Inhibitory proteins called IkB 
bind tightly to the dimers and hold them in an inactive state within the cyto- 
plasm of unstimulated cells. There are three major IkB proteins in mammals 
(IkB a, (3, and e), and the signals that release NFkB dimers do so by triggering a 
signaling pathway that leads to the phosphorylation, ubiquitylation, and conse- 
quent degradation of the IkB proteins. The phosphorylation of IkB is mediated 
by IkB kinase (IKK), which is a multiprotein complex containing two 
serine/threonine protein kinases (IKKa and IKK(3) and a regulatory protein 
called NEMO (for NFkB essential modifier), or IKKy (Figure 15-79). 

Among the genes activated by the released NFkB is the gene that encodes 
IkBoc, one of three IkB isoforms that holds NFkB inactive in the cytosol of resting 
cells. This activation leads to the resynthesis of IkBoc protein, which binds to 
NFkB and inactivates it, creating a negative feedback loop (Figure 15-80A). 
Experiments on TNFa- induced responses, as well as computer modeling stud- 
ies of the responses, indicate that the negative feedback produces two types of 
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Figure 15-79 The activation of the NFkB 
pathway by TNFa. Both TNFa and its 
receptors are trimers.The binding of 
TNFa causes a rearrangement of the 
clustered cytosolic tails of the receptors, 
which now recruit various signaling 
proteins, resulting in the activation of a 
serine/threonine protein kinase that 
phosphorylates and activates IkB kinase 
kinase (IKK). IKK is a heterotrimer 
composed of two kinase subunits (IKKa 
and IKKp) and a regulatory subunit called 
NEMO. IKKP then phosphorylates IkB on 
two serines, which marks the protein for 
ubiquitylation and degradation in 
proteasomes.The released NFkB 
translocates into the nucleus, where, in 
collaboration with coactivator proteins, it 
stimulates the transcription of its 
target genes. 
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NFkB responses, depending on the duration of the TNFa stimulus. Short expo- 
sure (less than an hour) to TNFa produces a short period of NFkB activation that 
is independent of the duration of the TNFa stimulus; the negative feedback 
through iKBa shuts off the response after about an hour. Prolonged exposure, by 
contrast, produces slow oscillations of NFkB activation, in which activation is 
followed by IKBa- mediated inactivation, which is followed by IKBa destruction 
and NFkB reactivation, and so on; the oscillations can persist for several hours 
before fading, even when the stimulus is maintained. Importantly, the two types 
of responses induce different patterns of gene expression, as some NFkB target 
genes turn on only in response to the prolonged oscillatory activation of NFkB 
(Figure 15-80B, C, and D). The negative feedback through IKBa is required for 
both types of responses: in cells deficient in IKBa, even a short exposure to TNFa 
induces a sustained activation of NFkB, without oscillations, and all of the 
NFkB -responsive genes are activated. 

Thus far, we have discussed cell signaling mainly in animals, with a few 
diversions into yeasts and bacteria. But intercellular signaling is just as impor- 
tant for plants as it is for animals, although the mechanisms and molecules used 
are mainly different, as we discuss next. 



Summary 

Some signaling pathways that are especially important in animal development depend 
on proteolysis to control the activity and location of latent gene regulatory proteins. 
Notch receptors are themselves such latent gene regulatory proteins, which are activated 
by cleavage when Delta (or a related ligand) on another cell binds to them; the cleaved 
cytosolic tail of Notch migrates into the nucleus, where it stimulates the transcription of 
Notch-responsive genes. In the Wnt/p-catenin signaling pathway, by contrast, the prote- 
olysis of the latent gene regulatory protein p-catenin is inhibited when a secreted Wnt 
protein binds to both a Frizzled and LRP receptor protein; as a result, p-catenin accu- 
mulates in the nucleus and activates the transcription of Wnt target genes. 

Hedgehog signaling in flies works much like Wnt signaling. In the absence of a sig- 
nal, a bifunctional, cytoplasmic gene regulatory protein, Ci, is proteolytically cleaved 
to form a transcriptional repressor that keeps Hedgehog target genes silenced. The 
binding of Hedgehog to its receptors (Patched and iHog) inhibits the proteolytic pro- 
cessing of Ci; as a result, the larger form ofCi accumulates in the nucleus and activates 
the transcription of Hedgehog-responsive genes. In Notch, Wnt, and Hedgehog signal- 
ing, the extracellular signal triggers a switch from transcriptional repression to tran- 
scriptional activation. 

Signaling through the latent gene regulatory protein NFkB also depends on prote- 
olysis. NFkB proteins are normally held in an inactive state by inhibitory IkB proteins 



Figure 15-80 Negative feedback in the 
NFkB signaling pathway induces 
oscillations in NFkB activation. 

(A) Drawing showing how activated NFkB 
stimulates the transcription of IkBoc, the 
protein product of which acts back in the 
cytoplasm to sequester NFkB there; if the 
stimulus is persistent, the newly made 
IkBcx protein will then be ubiquitylated 
and degraded, liberating active NFkB 
again so that it can return to the nucleus 
and activate transcription (see Figure 
1 5-79). (B) A short exposure to TNFa 
produces a single, short pulse of NFkB 
activation, beginning within minutes and 
ending by 1 hour. This response turns on 
the transcription of gene A but not gene B. 

(C) A sustained exposure to TNFa for the 
entire 6 hours of the experiment produces 
oscillations in NFkB activation that damp 
down over time. This response turns on 
the transcription of both genes; gene B 
turns on only after several hours, 
indicating that gene B transcription 
requires prolonged activation of NFkB, for 
reasons that are not understood. 

(D) These time-lapse confocal fluorescence 
micrographs from a different study of 
TNFa stimulation show the oscillations of 
NFkB in a cultured cell, as indicated by the 
periodic movement into the nucleus (N) of 
a fusion protein composed of NFkB fused 
to a red fluorescent protein. In the cell at 
the upper part of the micrographs, NFkB is 
active and in the nucleus at 6, 60, 210, 380, 
and 480 minutes, but it is exclusively in the 
cytoplasm at 0, 120, 300, 410, and 510 
minutes. (A-C, based on data from 

A. Hoffmann et al., Science 298:1241-1245, 
2002, and adapted from A.Y.Ting and 
D. Endy, Science 298:1 1 89-1 1 90, 2002; 
D, from D.E. Nelson et al., Science 
306:704-708, 2004. All with permission 
from AAAS.) 
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in the cytoplasm. A variety of extracellular stimuli, including proinflammatory 
cytokines, trigger the phosphorylation and ubiquitylation oflKB, marking it for degra- 
dation; this enables the NFkB to translocate to the nucleus and activate the transcrip- 
tion of its target genes. NFkB also activates the transcription of the gene that encodes 
IkBoc, creating a negative feedback loop, which can produce prolonged oscillations in 
NFkB activity with sustained extracellular signaling. 



SIGNALING IN PLANTS 

In plants, as in animals, cells are in constant communication with one another. 
Plant cells communicate to coordinate their activities in response to the chang- 
ing conditions of light, dark, and temperature, which guide the plant's cycle of 
growth, flowering, and fruiting. Plant cells also communicate to coordinate 
activities in their roots, stems, and leaves. In this final section, we consider how 
plant cells signal to one another and how they respond to light. Less is known 
about the receptors and intracellular signaling mechanisms involved in cell 
communication in plants than is known in animals, and we will concentrate 
mainly on how the receptors and mechanisms differ from those used by ani- 
mals. We discuss some of the details of plant development in Chapter 22. 



Multicellularity and Cell Communication Evolved Independently 
in Plants and Animals 



Although plants and animals are both eucaryotes, they have evolved separately 
for more than a billion years. Their last common ancestor is thought to have 
been a unicellular eucaryote that had mitochondria but no chloroplasts; the 
plant lineage acquired chloroplasts after plants and animals diverged. The earli- 
est fossils of multicellular animals and plants date from almost 600 million years 
ago. Thus, it seems that plants and animals evolved multicellularity indepen- 
dently, each starting from a different unicellular eucaryote, sometime between 
1.6 and 0.6 billion years ago (Figure 15-81). 

If multicellularity evolved independently in plants and animals, the 
molecules and mechanisms used for cell communication will have evolved sep- 
arately and would be expected to be different. There should be some degree of 
resemblance, however, because the genes in both plants and animals diverged 
from those contained by their last common unicellular ancestor. Thus, whereas 
both plants and animals use nitric oxide, cyclic GMP, Ca 2+ , and Rho family 
GTPases for signaling, there are no homologs of the nuclear receptor family, Ras, 
JAK, STAT, TGF(3, Notch, Wnt, or Hedgehog encoded by the completely 
sequenced genome of Arabidopsis thaliana, the small flowering plant. Similarly, 
plants do not seem to use cyclic AMP for intracellular signaling. 
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Figure 15-81 The proposed divergence 
of plant and animal lineages from a 
common unicellular eucaryotic 
ancestor. The plant lineage acquired 
chloroplasts after the two lineages 
diverged. Both lineages independently 
gave rise to multicellular organisms — 
plants and animals. (Paintings courtesy of 
John Innes Foundation.) 
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Much of what is known about the molecular mechanisms involved in sig- 
naling in plants has come from genetic studies on Arabidopsis. Although the 
specific molecules used in cell communication in plants often differ from those 
used in animals, the general strategies are frequently very similar. Both, for 
example, use enzyme-coupled cell-surface receptors, as we now discuss. 



Receptor Serine/Threonine Kinases Are the Largest Class of 
Cell-Surface Receptors in Plants 

Whereas most cell-surface receptors in animals are G-protein-coupled (GPCRs), 
most found so far in plants are enzyme-coupled. Moreover, whereas the largest 
class of enzyme-coupled receptors in animals is the receptor tyrosine kinase 
(RTK) class, this type of receptor is extremely rare in plants. Plants do, however, 
have many cytoplasmic tyrosine kinases, and tyrosine phosphorylation and 
dephosphorylation have important roles in plant cell signaling. Instead of RTKs, 
plants rely largely on a great diversity of transmembrane receptor serine/threo- 
nine kinases. Although very different from the corresponding animal receptors 
in most respects, they resemble them in having a typical serine /threonine 
kinase cytoplasmic domain and an extracellular ligand-binding domain. The 
most abundant types of these receptors have a tandem array of extracellular 
leucine-rich repeats (Figure 15-82) and are therefore called leucine-rich repeat 
(LRR) receptor kinases. 

There are about 175 LRR receptor kinases encoded by the Arabidopsis 
genome. One of the best characterized is the Clavatal/Clavata2 (Clvl/Clv2) 
receptor complex. Mutations that inactivate either of the two receptor subunits 
cause the production of flowers with extra floral organs and a progressive 
enlargement of both the shoot and floral meristems, the groups of self- renewing 
stem cells that produce the cells that give rise to stems, leaves, and flowers (dis- 
cussed in Chapter 22). The extracellular signal molecule that binds to the recep- 
tor is thought to be a small protein called Clv3, which is secreted by neighboring 
cells. The binding of Clv3 to the Clvl/Clv2 receptor suppresses meristem growth, 
either by inhibiting cell division there or, more probably, by stimulating cell dif- 
ferentiation (Figure 15-83A). 

The intracellular signaling pathway from the Clvl/Clv2 receptor to the cell 
response is largely unknown, but it includes a serine /threonine protein phos- 
phatase that inhibits the signaling. Other signaling proteins in the pathway 
include a Rho family GTPase and a nuclear gene regulatory protein that is dis- 
tantly related to animal homeodomain proteins. Mutations that inactivate this 
gene regulatory protein have the opposite effect from those that inactivate the 
Clvl/Clv2 receptor: cell division is greatly decreased in the shoot meristem, and 
the plant produces flowers with too few organs. Thus, the intracellular signaling 
pathway activated by the Clvl/Clv2 receptor seems to stimulate cell differentia- 
tion by inhibiting the gene regulatory protein that normally inhibits cell differ- 
entiation (Figure 15-83B). 

A different LRR receptor kinase in Arabidopsis, called Bril, is part of a cell- 
surface steroid hormone receptor. Plants synthesize a class of steroids that are 




Figure 15-82 The three-dimensional 
structure of leucine-rich repeats, similar 
to those found in the LRR 
serine/threonine receptor kinases. 

Multiple copies of such repeats are 
present in the extracellular domain of 
LRR receptor kinases, where they 
participate in binding the signal 
molecule. (Courtesy of David Lawson.) 
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called brassinosteroids because they were originally identified in the mustard 
family Brassicaceae, which includes Arabidopsis. These plant signal molecules 
regulate the growth and differentiation of plants throughout their life cycle. 
Binding of a brassinosteroid to a Bril cell-surface receptor kinase initiates a sig- 
naling cascade that uses a GSK3 protein kinase and a protein phosphatase to 
regulate the phosphorylation and degradation of specific gene regulatory pro- 
teins in the nucleus, and thereby specific gene transcription. Mutant plants that 
are deficient in the Bril receptor kinase are insensitive to brassinosteroids and 
are therefore dwarfs. 

The LRR receptor kinases are only one of many classes of transmembrane 
receptor serine/threonine kinases in plants. There are at least six additional fam- 
ilies, each with its own characteristic set of extracellular domains. The lectin 
receptor kinases, for example, have extracellular domains that bind carbohy- 
drate signal molecules. The Arabidopsis genome encodes over 300 receptor ser- 
ine/threonine kinases, which makes them the largest family of receptors known 
in plants. Many are involved in defense responses against pathogens. 



Ethylene Blocks the Degradation of Specific Gene Regulatory 
Proteins in the Nucleus 

Various growth regulators (also called plant hormones) help to coordinate 
plant development. They include ethylene, auxin, cytokinins, gibberellins, and 
abscisic acid, as well as brassinosteroids. Growth regulators are all small 
molecules made by most plant cells. They diffuse readily through cell walls and 
can either act locally or be transported to influence cells further away. Each 
growth regulator can have multiple effects. The specific effect depends on envi- 
ronmental conditions, the nutritional state of the plant, the responsiveness of 
the target cells, and which other growth regulators are acting. 

Ethylene is an important example. This small gas molecule (Figure 15-84A) 
can influence plant development in various ways; it can, for example, promote 
fruit ripening, leaf abscission, and plant senescence. It also functions as a stress 
signal in response to wounding, infection, flooding, and so on. When the shoot 
of a germinating seedling, for instance, encounters an obstacle, such as a piece 
of gravel underground in the soil, the seedling responds to the encounter in 
three ways. First, it thickens its stem, which can then exert more force on the 
obstacle. Second, it shields the tip of the shoot by increasing the curvature of a 
specialized hook structure. Third, it reduces the shoot's tendency to grow away 
from the direction of gravity, so as to avoid the obstacle. This triple response is 
controlled by ethylene (Figure 15-84B and C). 



Figure 15-83 A hypothetical model for 
how Clv3 and the Clv1/Clv2 receptor 
regulate cell proliferation and/or 
differentiation in the shoot meristem. 

(A) Cells in the outer layer of the 
meristem (light red) secrete Clv3 peptide, 
which binds to the Clv1/Clv2 receptor 
protein on target cells in an adjacent, 
more central region of the meristem 
(green), stimulating the differentiation of 
the target cells. (B) Some parts of the 
intracellular signaling pathway activated 
by Clv3 binding. When activated by Clv3 
binding, Clv1 phosphorylates the 
receptor proteins on serines and 
threonines, thereby activating the 
receptor complex and leading to the 
activation of a Rho family GTPase.The 
signaling pathway after this point is 
unclear, but it leads to the inhibition of a 
gene regulatory protein in the nucleus 
called Wuschel. Because Wuschel 
normally blocks the transcription of 
genes required for differentiation, its 
inhibition of Clv3 signaling allows 
differentiation to proceed. The protein 
phosphatase dephosphorylates the 
receptor proteins and thereby negatively 
regulates the signaling pathway. 
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Figure 15-84 The ethylene-mediated triple response that occurs when 
the growing shoot of a germinating seedling encounters an obstacle 
underground. (A) The structure of ethylene. (B) Before the encounter, the 
shoot is growing upward and is long and thin. (C) After the encounter, the 
shoot thickens, and the protective hook (at top) increases its curvature to 
protect the tip of the shoot. The shoot also alters its direction of growth to 
grow around the obstacle (not shown). (Courtesy of Melanie Webb.) 



Plants have various ethylene receptors, which are located in the endoplasmic 
reticulum and are all structurally related. They are dimeric, multipass transmem- 
brane proteins, with a copper-containing ethylene -binding domain and a 
domain that interacts with a protein called CTR1, which is closely related in 
sequence to the Raf MAP kinase kinase kinase discussed earlier (see Figure 
15-60). The function of CTR1 in ethylene signaling depends on its serine/threo- 
nine kinase activity and the association of its N -terminal domain with the ethy- 
lene receptors. Surprisingly, it is the empty receptors that are active and keep 
CTR1 active. By an unknown signaling mechanism, active CTR1 stimulates the 
ubiquitylation and degradation in proteasomes of a nuclear gene regulatory pro- 
tein called EIN3, which is required for the transcription of ethylene -responsive 
genes. In this way, the empty but active receptors and active CTR1 keep ethylene- 
response genes off. The EIN protein gets its name from the finding that plants 
with inactivating mutations in the gene that encodes it are ethylene insensitive. 

Ethylene binding inactivates the receptors, altering their conformation so 
that they no longer bind to CTRL As a result, CTR1 is inactivated, and the down- 
stream signaling pathway emanating from it is blocked; the EIN3 protein is no 
longer ubiquitylated and degraded and can now activate the transcription of the 
large number of ethylene-responsive genes (Figure 15-85). 
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Figure 1 5-85 A current view of the 
ethylene signaling pathway. (A) In the 
absence of ethylene, both the receptors 
and CTR1 are active, causing the 
ubiquitylation and destruction of the EIN3 
protein, the gene regulatory protein in the 
nucleus that is responsible for the 
transcription of ethylene-responsive genes. 
(B) The binding of ethylene inactivates the 
receptors and disrupts the interaction 
between the receptors and CTR1 . The EIN3 
protein is not degraded and can therefore 
activate the transcription of ethylene- 
responsive genes. 
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Figure 15-86 The auxin signaling 
pathway. (A) The structure of the 
auxin indole-3-acetic acid. 
(B) In the absence of auxin, a 
transcriptional repressor protein 
(called Aux/IAA) binds and 
suppresses a gene regulatory protein 
(called auxin-response factor, ARF), 
which is required for the 
transcription of auxin-responsive 
genes. (C) The auxin receptor 
proteins are mainly located in the 
nucleus and form part of ubiquitin 
ligase complexes (not shown). When 
activated by auxin binding, the 
receptor-auxin complexes recruit the 
ubiquitin ligase complexes, which 
ubiquitylate the Aux/IAA proteins, 
marking them for degradation in 
proteasomes. ARF is now free to 
activate the transcription of auxin- 
responsive genes. There are many 
ARFs, Aux/IAA proteins, and auxin 
receptors that work as illustrated. 



TRANSCRIPTION OF AUXIN TARGET GENES 



A somewhat different strategy is involved in the regulation of auxin-respon- 
sive genes. Moreover, the way that auxin controls the direction and pattern of 
plant growth is unlike any mechanism observed in animals, as we now discuss. 



Regulated Positioning of Auxin Transporters Patterns Plant 
Growth 

The plant hormone auxin, which is generally indole -3 -acetic acid (Figure 
15-86A), binds to receptor proteins in the nucleus. It helps plants grow toward 
light, grow upward rather than branch out, and grow their roots down. It also 
regulates organ initiation and positioning and helps plants flower and bear fruit. 
Like ethylene, it influences gene expression by controlling the degradation of 
gene regulatory proteins in the nucleus, but instead of blocking the ubiquityla- 
tion and degradation of gene regulatory proteins required for the expression of 
auxin-responsive genes, it stimulates the ubiquitylation and degradation of 
repressor proteins that block the transcription of these genes in unstimulated 
cells (Figure 15-86B and C). 

Auxin is unique in the way that it is transported. Unlike animal hormones, 
which are usually secreted by a specific endocrine organ and transported to tar- 
get cells via the circulatory system, auxin has its own transport system. Specific 
plasma-membrane-bound influx transporter proteins and efflux transporter 
proteins move auxin into and out of plant cells, respectively. Distinct gene fami- 
lies encode the influx transporters and efflux transporters, and the two families 
of proteins are regulated independently. The efflux transporters are composed of 
Pin proteins, and cells can distribute them asymmetrically in the plasma mem- 
brane to make the efflux of auxin directional. A row of cells with their auxin 
efflux transporters confined to the basal plasma membrane, for example, will 
transport auxin from the top of the plant to the bottom. 

In some regions of the plant, the localization of the auxin transporters, and 
therefore the direction of auxin flow, is highly dynamic and regulated. A cell can 
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rapidly redistribute transporters by controlling the traffic of vesicles containing 
them. The auxin efflux transporters, for example, normally recycle between 
intracellular vesicles and the plasma membrane. A cell can redistribute these 
transporters on its surface by inhibiting their endocytosis in one domain of the 
plasma membrane, causing the transporters to accumulate there. One example 
occurs in the root, where gravity influences the direction of growth. The auxin 
efflux transporters are normally distributed symmetrically in the cap cells of the 
root. Within minutes of a change in the direction of the gravity vector, however, 
the efflux transporters redistribute to one side of the cells, so that auxin is 
pumped out toward the side of the root pointing downward. Because auxin 
inhibits root cell elongation, this redirection of auxin transport causes the root 
tip to reorient, so that it grows downward again (Figure 15-87). 

In shoot apical meristems, the distribution of an auxin efflux transporter is 
also dynamic and regulated. Here, the directional transport of auxin helps deter- 
mine the regular arrangement of leaves and flowers (see Figure 22-122). 



Figure 15-87 Auxin transport and root 
gravitropism. (A-C) Roots respond to a 
90° change in the gravity vector and 
adjust their direction of growth so that 
they grow downward again. The cells that 
respond to gravity are in the center of the 
root cap, while it is the epidermal cells 
further back (on the lower side) that 
decrease their rate of elongation to 
restore downward growth. (D) The 
gravity-responsive cells in the root cap 
redistribute their auxin efflux 
transporters in response to the 
displacement of the root. This redirects 
the auxin flux mainly to the lower part of 
the displaced root, where it inhibits the 
elongation of the epidermal cells. The 
resulting asymmetrical distribution of 
auxin in the Arabidopsis root tip shown 
here is assessed indirectly, using an 
auxin-responsive reporter gene that 
encodes a protein fused to green 
fluorescent protein (GFP); the epidermal 
cells on the downward side of the root 
are green, whereas those on the upper 
side are not, reflecting the asymmetrical 
distribution of auxin. The distribution of 
auxin efflux transporters in the plasma 
membrane of cells in different regions of 
the root (shown as gray rectangles) is 
indicated in red, and the direction of 
auxin efflux is indicated by a green arrow. 
(The fluorescence photograph in D is 
from T. Paciorek et a I., Nature 
435:1 251-1 256, 2005. With permission 
from Macmillan Publishers Ltd.) 



Phytochromes Detect Red Light, and Cryptochromes Detect Blue 
Light 

Plant development is greatly influenced by environmental conditions. Unlike 
animals, plants cannot move on when conditions become unfavorable; they 
have to adapt or they die. The most important environmental influence on 
plants is light, which is their energy source and has a major role throughout their 
entire life cycle — from germination, through seedling development, to flowering 
and senescence. Plants have thus evolved a large set of light-sensitive proteins 
to monitor the quantity, quality, direction, and duration of light. These are usu- 
ally referred to as photoreceptors. However, because the term photoreceptor is 
also used for light-sensitive cells in the animal retina (see Figure 15-48), we shall 
use the term photoprotein instead. 

All photoproteins sense light by means of a covalently attached light- 
absorbing chromophore, which changes its shape in response to light and then 
induces a change in the protein's conformation. 

The best-known plant photoproteins are the phytochromes, which are pre- 
sent in all plants and in some algae but are absent in animals. These are dimeric, 
cytoplasmic serine/threonine kinases, which respond differentially and 
reversibly to red and far-red light: whereas red light usually activates the kinase 
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activity of the phytochrome, far-red light inactivates it. When activated by red 
light, the phytochrome is thought to phosphorylate itself and then to phospho- 
rylate one or more other proteins in the cell. In some light responses, the acti- 
vated phytochrome translocates into the nucleus, where it activates gene regu- 
latory proteins to alter gene transcription (Figure 15-88). In other cases, the 
activated phytochrome activates a latent gene regulatory protein in the cyto- 
plasm, which then translocates into the nucleus to regulate gene transcription. 
In still other cases, the photoprotein triggers signaling pathways in the cytosol 
that alter the cell's behavior without involving the nucleus. 

Although the phytochromes possess serine /threonine kinase activity, parts 
of their structure resemble the histidine kinases involved in bacterial chemo- 
taxis discussed earlier. This finding suggests that the plant phytochromes origi- 
nally evolved from bacterial histidine kinases and only later in evolution altered 
their substrate specificity from histidine to serine and threonine. 

Plants sense blue light using photoproteins of two other sorts, phototropin 
and cryptochromes. Phototropin is associated with the plasma membrane and 
is partly responsible for phototropism, the tendency of plants to grow toward 
light. Phototropism occurs by directional cell elongation, which is stimulated by 
auxin, but the links between phototropin and auxin are unknown. 

Cryptochromes are flavoproteins that are sensitive to blue light. They are 
structurally related to blue-light-sensitive enzymes called photolyases, which are 
involved in the repair of ultraviolet-induced DNA damage in all organisms, 
except most mammals. Unlike phytochromes, cryptochromes are also found in 
animals, where they have an important role in circadian clocks, which operate 
in most cells and cycle with a 24-hour rhythm (discussed in Chapter 7). 
Although cryptochromes are thought to have evolved from the photolyases, they 
do not have a role in DNA repair. 

In this chapter, we have discussed how extracellular signals influence cell 
behavior. One crucial intracellular target of these signals is the cytoskeleton, 
which determines cell shape and is responsible for cell movements, as we dis- 
cuss in the next chapter. 



Figure 1 5-88 A current view of one way 
in which phytochromes mediate a light 
response in plant cells. When activated 
by red light, the phytochrome, which is a 
dimeric protein kinase, phosphorylates 
itself and then moves into the nucleus, 
where it activates gene regulatory 
proteins to stimulate the transcription of 
red-light-responsive genes. 



Summary 

Plants and animals are thought to have evolved multicellularity and cell communica- 
tion mechanisms independently, each starting from a different unicellular eucaryote, 
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which in turn evolved from a common unicellular eucaryotic ancestor. Not surpris- 
ingly, therefore, the mechanisms used to signal between cells in animals and in plants 
have both similarities and differences. Whereas animals rely mainly on GPCRs, for 
example, plants rely mainly on enzyme-coupled receptors of the receptor serine/threo- 
nine kinase type, especially ones with extracellular leucine-rich repeats. Various plant 
hormones, or growth regulators, including ethylene and auxin, help coordinate plant 
development. Ethylene acts through intracellular receptors to stop the degradation of 
specific nuclear gene regulatory proteins, which can then activate the transcription of 
ethylene-responsive genes. The receptors for some other plant hormones, including 
auxin, also regulate the degradation of specific gene regulatory proteins, although the 
details vary. Auxin signaling is unusual in that it has its own highly regulated trans- 
port system, in which the dynamic positioning of plasma-membrane-bound auxin 
transporters controls the direction of auxin flow and thereby the direction of plant 
growth. Light has an important role in regulating plant development. These light 
responses are mediated by a variety of light-sensitive photoproteins, including phy- 
tochromes, which are responsive to red light, and cryptochromes and phototropin, 
which are sensitive to blue light. 



PROBLEMS 

Which statements are true? Explain why or why not. 

15-1 The receptors involved in paracrine, synaptic, and 
endocrine signaling all have very high affinity for their 
respective signaling molecules. 

15-2 All small intracellular mediators (second messen- 
gers) are water soluble and diffuse freely through the 
cytosol. 

15-3 In the regulation of molecular switches, protein 
kinases and guanine nucleotide exchange factors (GEFs) 
always turn proteins on, whereas protein phosphatases and 
GTPase- activating proteins (GAPs) always turn proteins off. 

15-4 In contrast to the more direct signaling pathways 
used by nuclear receptors, catalytic cascades of intracellular 
mediators provide numerous opportunities for amplifying 
the responses to extracellular signals. 

1 5-5 Binding of extracellular ligands to receptor tyrosine 
kinases activates the intracellular catalytic domain by prop- 
agating a conformational change across the lipid bilayer 
through a single transmembrane a helix. 

15-6 Protein tyrosine phosphatases display exquisite 
specificity for their substrates, unlike serine /threonine pro- 
tein phosphatases, which have rather broad specificity. 

15-7 Even though plants and animals independently 
evolved multicellularity, they use virtually all the same sig- 
naling proteins and second messengers for cell-cell com- 
munication. 

Discuss the following problems. 

1 5-8 Suppose that the circulating concentration of hor- 
mone is 10" 10 M and the K& for binding to its receptor is 
10 -8 M. What fraction of the receptors will have hormone 
bound? If a meaningful physiological response occurs when 
50% of the receptors have bound a hormone molecule, how 
much will the concentration of hormone have to rise to elicit 
a response? The fraction of receptors (R) bound to hormone 
(H) to form a receptor-hormone complex (R-H) is 
[R-H]/([R] + [R-H]) = [R-H]/[R]tot= [H]/([H] + Ad). 



15-9 Cells communicate in ways that resemble human 
communication. Decide which of the following forms of 
human communication are analogous to autocrine, 
paracrine, endocrine, and synaptic signaling by cells. 

A. A telephone conversation 

B. Talking to people at a cocktail party 

C. A radio announcement 

D. Talking to yourself 

15-10 Why do signaling responses that involve changes in 
proteins already present in the cell occur in milliseconds to 
seconds, whereas responses that require changes in gene 
expression require minutes to hours? 

15-11 How is it that different cells can respond in different 
ways to exactly the same signaling molecule even when they 
have identical receptors? 

15-12 Why do you suppose that phosphorylation/dephos- 
phorylation, as opposed to allosteric binding of small 
molecules, for example, has evolved to play such a prominent 
role in switching proteins on and off in signaling pathways? 

1 5-1 3 Consider a signaling pathway that proceeds through 
three protein kinases that are sequentially activated by 
phosphorylation. In one case, the kinases are held in a sig- 
naling complex by a scaffolding protein; in the other, the 
kinases are freely diffusible (Figure Q15-1). Discuss the 
properties of these two types of organization in terms of sig- 
nal amplification, speed, and potential for cross-talk 
between signaling pathways. 
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Figure Q15-1 A protein 
kinase cascade organized 
by a scaffolding protein or 
composed of freely 
diffusing components 
(Problem 15-13). 
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Figure Q15-2 Progesterone- 
induced MAP kinase 
activation, leading to oocyte 
maturation (Problem 15-15). 
(Courtesy of Helfrid 
Hochegger.) 
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15-14 Describe three ways in which a gradual increase in an 
extracellular signal can be sharpened to produce an abrupt 
or nearly all-or-none cellular response. 

15-15 Activation ('maturation') of frog oocytes is signaled 
through a MAP kinase signaling module. An increase in the 
hormone progesterone triggers the module by stimulating 
the translation of Mos mRNA, which is the frog's MAP kinase 
kinase kinase (Figure Q15-2). Maturation is easy to score 
visually by the presence of a white spot in the middle of the 
brown surface of the oocyte (see Figure Q15-2). To deter- 
mine the dose-response curve for progesterone-induced 
activation of MAP kinase, you place 16 oocytes in each of six 
plastic dishes and add various concentrations of proges- 
terone. After an overnight incubation, you crush the 
oocytes, prepare an extract, and determine the state of MAP 
kinase phosphorylation (hence, activation) by SDS poly- 
acrylamide-gel electrophoresis (Figure Q15-3A). This analy- 
sis shows a graded response of MAP kinase to increasing 
concentrations of progesterone. 

Before you crushed the oocytes you noticed that not all 
oocytes in individual dishes had white spots. Had some 
oocytes undergone partial activation and not yet reached 
the white-spot stage? To answer this question, you repeat 
the experiment, but this time you analyze MAP kinase acti- 
vation in individual oocytes. You are surprised to find that 
each oocyte has either a fully activated or a completely inac- 
tive MAP kinase (Figure Q15-3B). How can an all-or-none 
response in individual oocytes give rise to a graded response 
in the population? 
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Figure Q15-3 Activation of 
frog oocytes (Problem 
15-15). (A) Phosphorylation 
of MAP kinase in pooled 
oocytes. (B) Phosphorylation 
of MAP kinase in individual 
oocytes. MAP kinase was 
detected by immunoblotting 
using a MAP-kinase-specific 
antibody. The first two lanes 
in each gel contain 
nonphosphorylated, inactive 
MAP kinase (-) and 
phosphorylated, active MAP 
kinase (+). (From J.E. Ferrell, 
Jr., and E.M. Machleder, 
Science 280:895-898, 1998. 
With permission from AAAS.) 
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15-16 Propose specific types of mutations in the gene for 
the regulatory subunit of PKA that could lead to either a per- 
manently active PKA or a permanently inactive PKA. 

15-17 Phosphorylase kinase integrates signals from the 
cyclic- AMP- dependent and Ca 2+ - dependent signaling path- 
ways that control glycogen breakdown in liver and muscle 
cells (Figure Q15-4). Phosphorylase kinase is composed of 
four subunits. One is the protein kinase that catalyzes the 
addition of phosphate to glycogen phosphorylase to acti- 
vate it for glycogen breakdown. The other three subunits are 
regulatory proteins that control the activity of the catalytic 
subunit. Two contain sites for phosphorylation by PKA, 
which is activated by cyclic AMP. The remaining subunit is 
calmodulin, which binds Ca 2+ when the cytosolic concen- 
tration rises. The regulatory subunits control the equilib- 
rium between the active and inactive conformations of the 
catalytic subunit. How does this arrangement allow phos- 
phorylase kinase to serve its role as an integrator protein for 
the multiple pathways that stimulate glycogen breakdown? 
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Figure Q1 5-4 Integration of cyclic-AMP-dependent and Ca 2+ - 
dependent signaling pathways by phosphorylase kinase in liver and 
muscle cells (Problem 1 5-1 7). 

1 5-1 8 In principle, the activated, GTP-bound form of Ras 
could be increased by activating a guanine-nucleotide 
exchange factor (GEF) or by inactivating a GTPase- 
activating protein (GAP). Why do you suppose that Ras- 
mediated signaling pathways always increase Ras-GTP by 
activating a GEF rather than inactivating a GAP? 

15-19 The Wnt planar polarity signaling pathway normally 
ensures that each wing cell in Drosophila has a single hair. 
Overexpression of the Frizzled gene from a heat- shock pro- 
moter (hs-Fz) causes multiple hairs to grow from many cells 
(Figure Q15-5A). This phenotype is suppressed if hs-Fz is 
combined with a heterozygous deletion (Dsh A ) of the Dishev- 
elled gene (Figure Q15-5B). Do these results allow you to order 
the action of Frizzled and Dishevelled in the signaling path- 
way? If so, what is the order? Explain your reasoning. 
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Figure Q1 5-5 Pattern of hair growth on wing cells in genetically 
different Drosophila (Problem 1 5-1 9). (From C.G. Winter et al., Cell 
105:81-91, 2001. With permission from Elsevier.) 
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The Cytoskeleton 




For cells to function properly, they must organize themselves in space and inter- 
act mechanically with their environment. They have to be correctly shaped, 
physically robust, and properly structured internally. Many have to change their 
shape and move from place to place. All cells have to be able to rearrange their 
internal components as they grow, divide, and adapt to changing circumstances. 
Eucaryotic cells have developed all these spatial and mechanical functions to a 
very high degree, and they depend on a remarkable system of filaments called 
the cytoskeleton (Figure 16-1). 

The cytoskeleton pulls the chromosomes apart at mitosis and then splits the 
dividing cell into two. It drives and guides the intracellular traffic of organelles, 
ferrying materials from one part of the cell to another. It supports the fragile 
plasma membrane and provides the mechanical linkages that let the cell bear 
stresses and strains without being ripped apart as the environment shifts and 
changes. It enables cells such as sperm to swim and others, such as fibroblasts 
and white blood cells, to crawl across surfaces. It provides the machinery in the 
muscle cell for contraction and in the nerve cell to extend an axon and den- 
drites. It guides the growth of the plant cell wall and controls the amazing diver- 
sity of eucaryotic cell shapes. 

The cytoskeletoris varied functions depend on the behavior of three families 
of protein molecules, which assemble to form three main types of filaments. 
Each type of filament has distinct mechanical properties, dynamics, and biolog- 
ical roles, but all three share certain fundamental principles. These principles 
provide the basis for a general understanding of how the cytoskeleton works and 
how the different elements cooperate. Just as we require our ligaments, bones, 
and muscles to work together, so all three cytoskeletal filament systems must 
normally function collectively to give a cell its strength, its shape, and its ability 
to move. 

In this chapter, we begin by describing the three main types of filaments, the 
basic principles underlying their assembly and disassembly, and their individual 
peculiarities. We then describe how other proteins interact with the three main 
filament systems, enabling the cell to establish and maintain internal order, to 
shape and remodel its surface, to move organelles in a directed manner from 
one place to another, and — when appropriate— to move itself to new locations. 



THE SELF-ASSEMBLY AND DYNAMIC STRUCTURE 
OF CYTOSKELETAL FILAMENTS 

Most animal cells have three types of cytoskeletal filaments that are responsible 
for the cells' spatial organization and mechanical properties. Intermediate fila- 
ments provide mechanical strength. Microtubules determine the positions of 
membrane-enclosed organelles and direct intracellular transport. Actin fila- 
ments determine the shape of the cell's surface and are necessary for whole-cell 
locomotion. But these cytoskeletal filaments would be ineffective without the 
hundreds of accessory proteins that link the filaments to other cell components, 
as well as to each other. This large set of accessory proteins is essential for the 
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Figure 16-1 The cytoskeleton. A cell in 
culture has been fixed and labeled to 
show two of its major cytoskeletal 
systems, the microtubules {green) and the 
actin filaments (red). The DNA in the 
nucleus is labeled in blue. (Courtesy of 
Albert Tousson.) 
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controlled assembly of the cytoskeletal filaments in particular locations, and it 
includes the motor proteins, remarkable molecular machines that convert the 
energy of ATP hydrolysis into mechanical force that can either move organelles 
along the filaments or move the filaments themselves. 

In this section, we discuss the properties of the proteins that make up the fil- 
aments of the cytoskeleton. We focus on their ability to form intrinsically polar- 
ized and self-organized structures. We shall see that, because of the remarkable 
mechanisms that cause cytoskeletal filaments to be dynamic, the cell is able to 
respond rapidly to any eventuality that it may face. 

Cytoskeletal Filaments Are Dynamic and Adaptable 

Cytoskeletal systems are dynamic and adaptable, organized more like ant trails 
than interstate highways. A single trail of ants may persist for many hours, 
extending from the ant nest to a delectable picnic site, but the individual ants 
within the trail are anything but static. If the ant scouts find a new and better 
source of food, or if the picnickers clean up and leave, the dynamic structure rear- 
ranges itself with astonishing rapidity to deal with the new situation. In a similar 
way, large-scale cytoskeletal structures can change or persist, according to need, 
lasting for lengths of time ranging from less than a minute up to the cell's lifetime. 
But the individual macromolecular components that make up these structures 
are in a constant state of flux. Thus, like the alteration of an ant trail, a structural 
rearrangement in a cell requires little extra energy when conditions change. 

Regulation of the dynamic behavior and assembly of the cytoskeletal fila- 
ments allows eucaryotic cells to build an enormous range of structures from the 
three basic filament systems. The micrographs in Panel 16-1 reveal some of 
these structures. Microtubules, which are frequently found in a star-like cyto- 
plasmic array emanating from the center of an interphase cell, can quickly rear- 
range themselves to form a bipolar mitotic spindle during cell division. They can 
also form motile whips called cilia and flagella on the surface of the cell, or 
tightly aligned bundles that serve as tracks for the transport of materials down 
long neuronal axons. In plant cells, organized arrays of microtubules help to 
direct the pattern of cell wall synthesis. 

Actin filaments underlie the plasma membrane of animal cells, providing 
strength and shape to its thin lipid bilayer. They also form many types of cell- 
surface projections. Some of these are dynamic structures, such as the lamel- 
lipodia and filopodia that cells use to explore territory and pull themselves 
around. The actin-based contractile ring assembles transiently to divide cells in 
two; more stable arrays allow cells to brace themselves against an underlying 
substratum and enable muscle to contract. The regular bundles of stereocilia on 
the surface of hair cells in the inner ear contain stable bundles of actin filaments 
that tilt as rigid rods in response to sound, and similarly organized microvilli on 
the surface of intestinal epithelial cells vastly increase the apical cell surface area 
to enhance nutrient absorption. 

Intermediate filaments line the inner face of the nuclear envelope, forming 
a protective cage for the cell's DNA; in the cytosol, they are twisted into strong 
cables that can hold epithelial cell sheets together or help nerve cells to extend 
long and robust axons, and they allow us to form tough appendages such as hair 
and fingernails. 

An important and dramatic example of rapid reorganization of the 
cytoskeleton occurs during cell division, as shown in Figure 16-2 for a fibroblast 
growing in a tissue culture dish. After the chromosomes have replicated, the 
interphase microtubule array that spans throughout the cytoplasm is reconfig- 
ured into the bipolar mitotic spindle, which serves the critical function of accu- 
rately segregating the two copies of each replicated chromosome into two sepa- 
rate daughter nuclei. At the same time, the specialized actin structures that 
enable the fibroblast to crawl across the surface of the dish disassemble so that 
the cell stops moving, rounds up, and assumes a more spherical shape. Actin 
and its associated motor protein myosin then form a belt around the middle of 
the cell, the contractile ring, which constricts like a tiny muscle to pinch the cell 
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in two. When division is complete, the cytoskeletons of the two daughter fibrob- 
lasts reassemble into their interphase structures to convert the two rounded up 
daughter cells into smaller versions of the flattened, crawling mother cell. In a 
fibroblast, this sequence of events takes about an hour; in some cases, such as 
the early nuclear divisions in a Drosophila embryo, the actin and microtubule 
cytoskeletons can completely rearrange themselves within less than five min- 
utes (Figure 16-3). 
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Figure 16-2 Rapid changes in 
cytoskeletal organization associated 

with cell division. The crawling fibroblast 
drawn here has a polarized, dynamic 
actin cytoskeleton (shown in red) that 
assembles to push its leading edge 
toward the right. The polarization of the 
actin cytoskeleton is assisted by the 
microtubule cytoskeleton (shown in 
green), consisting of long microtubules 
that emanate from a single microtubule 
organizing center located in front of the 
nucleus. When the cell divides, the 
polarized microtubule array rearranges to 
form a bipolar mitotic spindle, which is 
responsible for aligning and then 
separating the duplicated chromosomes 
{brown). The actin filaments form a 
contractile ring at the center of the cell 
that pinches the cell in two after the 
chromosomes have separated. After cell 
division is complete, the two daughter 
cells reorganize both the microtubule 
and actin cytoskeletons into smaller 
versions of those that were present in the 
mother cell, enabling them to crawl their 
separate ways. 
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Figure 16-3 Rapid changes in 
cytoskeletal organization observed 
during the development of a Drosophila 
early embryo. <TTCT> In this giant 
multinuclear cell, the early nuclear 
divisions occur every 1 minutes or so in 
a common cytoplasm. The rapid 
rearrangements of the actin filaments 
(red) and microtubules (green), seen here 
in a living embryo, are required to 
separate the chromosomes at mitosis, 
while keeping each nucleus from 
colliding with its neighbors. (Courtesy of 
William Sullivan.) 
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I Actin filaments (also known as microfilaments) are two-stranded helica 
polymers of the protein actin. They appear as flexible structures, with a 
diameter of 5-9 nm, and they are organized into a variety of linear bundles, 
two-dimensional networks, and three-dimensional gels. Although actin 
filaments are dispersed throughout the cell, they are most highly 
concentrated in the cortex, just beneath the plasma membrane. 

Micrographs courtesy of Roger Craig (i and iv); P.T. Matsudaira and D R, Burgess (ii); Keith Burridge (iii). 




MICROTUBULES 
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Microtubules are long, hollow cylinders made of the protein tubulin, 
I With an outer diameter of 25 nm, they are much more rigid than actin 

filaments. Microtubules are long and straight and typically have one 

end attached to a single microtubule-organizing center (MTOC) 
| called a centrosome. 

Micrographs courtesy of Richard Wade (i); D T. Woodrow and R W Linck (ii); David Shima (iii); A Desai (iv). 
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Intermediate filaments are ropelike fibers with a diameter of around 
10 nm; they are made of intermediate filament proteins, which constitute a 
I large and heterogeneous family. One type of intermediate filament forms a 
meshwork called the nuclear lamina just beneath the inner nuclear 
membrane. Other types extend across the cytoplasm, giving cells mechanica 
strength. In an epithelial tissue, they span the cytoplasm from one cell-ce 
junction to another, thereby strengthening the entire epithelium. 
Micrographs courtesy of Roy Quinlan (i); Nancy L Kedersha (ii); Mary Osborn (iii); Ueli Aebi (iv), 
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Many cells require rapid cytoskeletal rearrangements for their normal func- 
tioning during interphase as well. For example, the neutrophil, a type of white 
blood cell, chases and engulfs bacterial and fungal cells that accidentally gain 
access to the normally sterile parts of the body, as through cuts in the skin. Like 
most crawling cells, neutrophils advance by extending a protrusive structure at 
the leading edge filled with newly polymerized actin filaments. When the elusive 
bacterial prey moves in a different direction, the neutrophil is poised to reorga- 
nize its polarized protrusive structures within seconds (Figure 16-4). Both of 
these kinds of rapid cytoskeletal rearrangements will be discussed in more detail 
in the final section of this chapter. 



Figure 16-4 A neutrophil in pursuit of 
bacteria. <TGTA> In this preparation of 
human blood, a clump of bacteria (white 
arrow) is about to be captured by a 
neutrophil. As the bacteria move, the 
neutrophil quickly reassembles the 
dense actin network at the leading edge 
{red) to push toward the location of the 
bacteria. Rapid disassembly and 
reassembly of the actin cytoskeleton in 
this cell enables it to change its 
orientation and direction of movement 
within a few seconds. (From a video 
recorded by David Rogers.) 



The Cytoskeleton Can Also Form Stable Structures 

In cells that have achieved a stable, differentiated morphology such as mature 
neurons or epithelial cells, the dynamic elements of the cytoskeleton must also 
provide stable, large-scale structures for cellular organization. On specialized 
epithelial cells that line tissues such as the intestine and the lung, cytoskeletal- 
based cell surface protrusions including microvilli and cilia are able to maintain 
a constant location, length, and diameter over the entire lifetime of the cell. For 
the actin bundles at the cores of microvilli on intestinal epithelial cells this is 
only a few days. But the actin bundles at the cores of stereocilia on the hair cells 
of the inner ear must maintain their stable organization for the entire lifetime of 
the animal, since these cells do not turn over. Nonetheless, the individual actin 
filaments remain strikingly dynamic and are continuously remodeled and 
replaced on average every 48 hours, even within these stable cell surface struc- 
tures that persist for decades. 

Besides forming stable specialized cell surface protrusions, the cytoskeleton 
is also responsible for large-scale cellular polarity, enabling cells to tell the dif- 
ference between top and bottom, or front and back. The large-scale polarity 
information encoded by the organization of the cytoskeleton must also often be 
maintained over the lifetime of the cell. Polarized epithelial cells such as those 
found in the lining of the intestine, for example, use organized arrays of micro- 
tubules, actin filaments, and intermediate filaments to maintain the critical 
functional differences between the apical surface that absorbs nutrients from 
the lumen of the intestine where food passes by to the basolateral surface where 
the cells transfer nutrients through the plasma membrane to the bloodstream. 
They also must maintain strong adhesive contacts with one another to enable 
this single layer of cells to serve as an effective physical barrier (Figure 16-5). 

Even small, morphologically simple cells such as the budding yeast Saccha- 
romyces cerevisiae need stable large-scale polarity. The most notable feature of 
the structure of these cells is the marked asymmetry, evident in the way they 
divide by budding to create a small daughter cell and a large mother cell. This 
asymmetry derives from the polar orientation of the cell's actin cytoskeleton. 
There are two types of actin filament assemblies in these cells: actin cables (long 
bundles of actin filaments) and actin patches (small assemblies of filaments 
associated with the cell cortex, marking sites of actin-driven endocytosis). Pro- 
liferating budding yeast cells must be highly polarized to allow the cell to grow a 
bud from a single site on the cell surface, as opposed to simply growing uni- 
formly larger. In this process, the actin patches become highly concentrated at 
the growing tip of the bud, with the actin cables aligned and pointing toward 
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Figure 16-5 Organization of the 
cytoskeleton in polarized epithelial 
cells. All the components of the 
cytoskeleton cooperate to produce the 
characteristic shapes of specialized cells 
including the epithelial cells that line the 
small intestine. At the apical (upper) 
surface, facing the intestinal lumen, 
bundled actin filaments {red) form 
microvilli that increase the cell surface 
area available for absorbing nutrients 
from food. Just below the microvilli, a 
circumferential band of actin filaments 
contributes to forming cell-cell junctions 
that prevent the contents of the intestinal 
lumen from leaking into the body. 
Intermediate filaments {blue) are 
anchored to other kinds of adhesive 
structures including desmosomes and 
hemidesmosomes that connect the 
epithelial cells into a sturdy sheet and 
attach them to the underlying 
extracellular matrix on the basal side of 
the cell; these important adhesive 
structures will be discussed in Chapter 
19. Microtubules {green) run vertically 
from the top of the cell to the bottom, 
and provide a global coordinate system 
that enables the cell to direct newly 
synthesized components to their proper 
locations. 



them. This actin organization directs the secretion of new cell wall and other 
materials to the site of budding (Figure 16-6). The polarized organization of the 
actin structures in turn influences the orientation of the mitotic spindle, so that 
a complete set of replicated chromosomes can be delivered into the daughter 
cell at the end of the cell division process. 



Each Type of Cytoskeletal Filament Is Constructed from Smaller 
Protein Subunits 

Cytoskeletal structures frequently reach all the way from one end of the cell to 
the other, spanning tens or even hundreds of micrometers. Yet the individual 
protein molecules of the cytoskeleton are generally only a few nanometers in 
size. The cell builds the large structures by the repetitive assembly of large num- 
bers of the small subunits, like building a skyscraper out of bricks. Because 
these subunits are small, they can diffuse rapidly within the cytoplasm, 
whereas the assembled filaments cannot. In this way, cells can undergo rapid 
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Figure 1 6-6 Polarity of actin patches 
and cables throughout the yeast cell 
cycle. Filamentous actin structures in the 
yeast cell, labeled here with fluorescent 
phalloidin, include actin patches (round 
bright spots) and actin cables (extended 
lines). (A) In a mother cell, before the 
formation of the bud, most of the 
patches become clustered at one end. 
The cables are lined up and point toward 
the cluster of patches, which is the site 
where the bud will emerge. (B) As the 
small bud grows, most patches remain 
within it. Cables in the mother cell 
continue to point toward this site of new 
cell wall growth. (C) Patches are almost 
uniformly distributed over the surface of 
a full-sized bud. Cables in the mother cell 
remain polarized. (D) Immediately after 
cell division, mother and daughter cells 
form new patches, which are 
concentrated near the division site, 
although both cells have randomly 
oriented cables. (From T.S. Karpova et al., 
I Cell Biol. 142:1501-1517, 1998. With 
permission from The Rockefeller 
University Press.) 
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Figure 1 6-7 The cytoskeleton during changes in cell shape. The 

formation of protein filaments from much smaller protein subunits allows 
regulated filament assembly and disassembly to reshape the cytoskeleton. 
(A) Filament formation from a small protein. (B) Rapid reorganization of the 
cytoskeleton in a cell in response to an external signal. 



structural reorganizations, disassembling filaments at one site and reassem- 
bling them at another site far away (Figure 16-7). 

Intermediate filaments are made up of smaller subunits that are themselves 
elongated and fibrous, whereas actin filaments and microtubules are made of 
subunits that are compact and globular — actin subunits for actin filaments, 
tubulin subunits for microtubules. All three types of cytoskeletal filaments form 
as helical assemblies of subunits (see Figure 3-26) that self-associate, using a 
combination of end-to-end and side-to-side protein contacts. Differences in the 
structures of the subunits and the strengths of the attractive forces between 
them produce critical differences in the stability and mechanical properties of 
each type of filament. 

Covalent linkages between their subunits hold together many biological 
polymers — including DNA, RNA, and proteins. In contrast, weak noncovalent 
interactions hold together the three types of cytoskeletal "polymers". Conse- 
quently, their assembly and disassembly can occur rapidly, without covalent 
bonds being formed or broken. 

Within the cell, hundreds of different cytoskeleton-associated accessory 
proteins regulate the spatial distribution and the dynamic behavior of the fila- 
ments, converting information received through signaling pathways into 
cytoskeletal action. These accessory proteins bind to the filaments or their sub- 
units to determine the sites of assembly of new filaments, to regulate the parti- 
tioning of polymer proteins between filament and subunit forms, to change the 
kinetics of filament assembly and disassembly, to harness energy to generate 
force, and to link filaments to one another or to other cell structures such as 
organelles and the plasma membrane. In these processes, the accessory pro- 
teins bring cytoskeletal structure under the control of extracellular and intracel- 
lular signals, including those that trigger the dramatic transformations of the 
cytoskeleton that occur during each cell cycle. Acting together, the accessory 
proteins enable a eucaryotic cell to maintain a highly organized but flexible 
internal structure and, in many cases, to move. 
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Filaments Formed from Multiple Protofilaments Have 
Advantageous Properties 

In general, we can view the linking of protein subunits together to form a fila- 
ment as a simple association reaction. A free subunit binds to the end of a fila- 
ment that contains n subunits to generate a filament of length n + 1. The addi- 
tion of each subunit to the end of the polymer creates a new end to which yet 
another subunit can bind. However, the robust cytoskeletal filaments in living 
cells are not built by simply stringing subunits together in a single straight file. A 
thousand tubulin monomers, for example, lined up end to end, would span the 
diameter of a small eucaryotic cell, but a filament formed in this way would lack 
the strength to avoid breakage by ambient thermal energy, unless each subunit 
in the filament was bound extremely tightly to its two neighbors. Such tight 
binding would limit the rate at which the filaments could disassemble, making 
the cytoskeleton a static and less useful structure. 

Cytoskeletal polymers combine strength with adaptability because they are 
built out of multiple protofilaments — long linear strings of subunits joined end 
to end — that associate with one another laterally. Typically, the protofilaments 
twist around one another in a helical lattice. The addition or loss of a subunit at 
the end of one protofilament makes or breaks one set of longitudinal bonds and 
either one or two sets of lateral bonds. In contrast, breakage of the composite fil- 
ament in the middle requires breaking sets of longitudinal bonds in several 
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Figure 1 6-8 The thermal stability of 
cytoskeletal filaments with dynamic 
ends. Formation of a cytoskeletal 
filament from more than one 
protofilament allows the ends to be 
dynamic, while the filaments themselves 
are resistant to thermal breakage. In this 
hypothetical example, the stable filament 
is formed from five protofilaments.The 
bonds holding the subunits together in 
the filaments are shown in red. 




5 protofilaments 




BREAKAGE IN 
MEDDLE BREAKS 
5 LONGITUDINAL 



BONDS 




J\ 



REMOVAL FROM 
ONE END BREAKS 
ONE LONGITUDINAL 
AND 2 LATERAL BONDS 




MULTIPLE PROTOFILAMENTS: THERMALLY STABLE 



protofilaments all at the same time (Figure 16-8). The large energy difference 
between these two processes allows most cytoskeletal filaments to resist thermal 
breakage, while leaving the filament ends as dynamic structures at which addi- 
tion and loss of subunits can occur rapidly. 

As with other specific protein-protein interactions, many hydrophobic 
interactions and weak noncovalent bonds hold the subunits in a cytoskeletal fil- 
ament together (see Figure 3-4). The locations and types of subunit-subunit 
contacts differ for the different cytoskeletal filaments. Intermediate filaments, 
for example, assemble by forming strong lateral contacts between a-helical 
coiled coils, which extend over most of the length of each elongated fibrous sub- 
unit. Because the individual subunits are staggered in the filament, intermedi- 
ate filaments tolerate stretching and bending, forming strong rope-like struc- 
tures (Figure 16-9). Microtubules, by contrast, are built from globular subunits 
held together primarily by longitudinal bonds, and the lateral bonds holding the 
13 protofilaments together are comparatively weak. For this reason, micro- 
tubules break much more easily when they are bent than do intermediate fila- 
ments. 



staggered long subunits: lateral contacts dominate 



i i i " i i » i * i " i - i 
♦ i ♦ i » i I \ i i » I 



I I I i 4 i 






*.'v- 






ROPE-LIKE 
PROPERTIES 



Figure 16-9 A strong filament formed from 
elongated fibrous subunits with strong 
lateral contacts. Intermediate filaments are 
formed in this way and are consequently 
especially resistant to stretching forces, 
although they are easily bent. 
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Nucleation Is the Rate-Limiting Step in the Formation of a 
Cytoskeletal Polymer 

There is an important additional consequence of the multiple-protofilament 
organization of cytoskeletal polymers. Short oligomers composed of a few sub- 
units can assemble spontaneously, but they are unstable and disassemble read- 
ily because each monomer is bonded only to a few other monomers. For a new 
large filament to form, subunits must assemble into an initial aggregate, or 
nucleus, that is stabilized by many subunit-subunit contacts and can then elon- 
gate rapidly by addition of more subunits. The initial process of nucleus assem- 
bly is called filament nucleation, and it can take quite a long time, depending on 
how many subunits must come together to form the nucleus. 

The instability of smaller aggregates creates a kinetic barrier to nucleation, 
which is easily observed in a solution of pure actin or tubulin — the subunits of 
actin filaments and microtubules, respectively When polymerization is initiated 
in a test tube containing a solution of pure individual subunits (by raising the 
temperature or raising the salt concentration), there is an initial lag phase, dur- 
ing which no filaments are observed. During this lag phase, however, a few of the 
small unstable aggregates succeed in making the transition to the more stable 
filament form, so that the lag phase is followed by a phase of rapid filament elon- 
gation, during which subunits add quickly onto the ends of the nucleated fila- 
ments (Figure 16-10A). Finally the system approaches a steady state at which 
the rate of addition of new subunits to the filament ends exactly balances the 
rate of subunit dissociation from the ends. The concentration of free subunits 
left in solution at this point is called the critical concentration, Q. As explained 
in Panel 16-2 (pp. 978-979), the value of the critical concentration is equal to the 
rate constant for subunit loss divided by the rate constant for subunit addition — 
that is, C c = kos /k on . 

The lag phase in filament growth is eliminated if preexisting seeds (such as 
filament fragments that have been chemically cross-linked) are added to the 
solution at the beginning of the polymerization reaction (Figure 16-10B). The 
cell takes great advantage of this nucleation requirement: it uses special proteins 
to catalyze filament nucleation at specific sites, thereby determining the loca- 
tion at which new cytoskeletal filaments are assembled. Indeed, the regulation 
of filament nucleation is a primary way for cells to control their shape and their 
movement. 



Figure 16-10 The time course of actin 
polymerization in a test tube. 

(A) Polymerization is begun by raising the 
salt concentration in a solution of pure 
actin subunits. (B) Polymerization is 
begun in the same way, but with 
preformed fragments of actin filaments 
present to act as nuclei for filament 
growth. As indicated, the % free subunits 
reflects the critical concentration (Cc), the 
point at which there is no net change in 
polymer. 



The Tubulin and Actin Subunits Assemble Head-to-Tail to Create 
Polar Filaments 

Microtubules are formed from protein subunits of tubulin. The tubulin subunit 
is itself a heterodimer formed from two closely related globular proteins called 
a- tubulin and ft- tubulin, tightly bound together by noncovalent bonds (Figure 
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16-1 1). These two tubulin proteins are found only in this heterodimer. Each a or 
P monomer has a binding site for one molecule of GTP. The GTP that is bound 
to the a-tubulin monomer is physically trapped at the dimer interface and is 
never hydrolyzed or exchanged; it can therefore be considered to be an integral 
part of the tubulin heterodimer structure. The nucleotide on the p-tubulin, in 
contrast, may be in either the GTP or the GDP form, and it is exchangeable. As 
we shall see, the hydrolysis of GTP at this site to produce GDP has an important 
effect on microtubule dynamics. 

A microtubule is a hollow cylindrical structure built from 13 parallel protofil- 
aments, each composed of alternating a-tubulin and P-tubulin molecules. 
When the tubulin heterodimers assemble to form the hollow cylindrical micro- 
tubule, they generate two new types of protein-protein contacts. Along the lon- 
gitudinal axis of the microtubule, the "top" of one p-tubulin molecule forms an 
interface with the "bottom" of the a-tubulin molecule in the adjacent het- 
erodimer. This interface is very similar to the interface holding the a and P 
monomers together in the dimer subunit, and the binding energy is strong. Per- 
pendicular to these interactions, neighboring protofilaments form lateral con- 
tacts. In this dimension, the main lateral contacts are between monomers of the 
same type (oc-cc and p-p). Together, the longitudinal and lateral contacts are 
repeated in the regular helical lattice of the microtubule. Because multiple con- 
tacts within the lattice hold most of the subunits in a microtubule in place, the 
addition and loss of subunits occurs almost exclusively at the microtubule ends 
(see Figure 16-8). These multiple contacts among subunits make microtubules 
stiff and difficult to bend. The stiffness of a filament can be characterized by its 
persistence length, a property of the filament describing how long it must be 
before random thermal fluctuations are likely to cause it to bend. The persis- 
tence length of a microtubule is several millimeters, making microtubules the 
stiffest and straightest structural elements found in most animal cells. 

The subunits in each protofilament in a microtubule all point in the same 
direction, and the protofilaments themselves are aligned in parallel (in Figure 
16-11, for example, the a-tubulin is down and the P-tubulin up in each hetero- 



Figure 16-11 The structure of a 
microtubule and its subunit. (A) The 
subunit of each protofilament is a tubulin 
heterodimer, formed from a very tightly 
linked pair of a- and p-tubulin monomers. 
The GTP molecule in the a-tubulin 
monomer is so tightly bound that it can 
be considered an integral part of the 
protein. The GTP molecule in the p-tubulin 
monomer, however, is less tightly bound 
and has an important role in filament 
dynamics. Both nucleotides are shown in 
red. (B) One tubulin subunit 
{cc-p heterodimer) and one protofilament 
are shown schematically. Each 
protofilament consists of many adjacent 
subunits with the same orientation. 
(C) The microtubule is a stiff hollow tube 
formed from 13 protofilaments aligned in 
parallel. (D) A short segment of a 
microtubule viewed in an electron 
microscope. (E) Electron micrograph of a 
cross section of a microtubule showing a 
ring of 13 distinct protofilaments. 
(D, courtesy of Richard Wade; E, courtesy 
of Richard Linck.) 
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dimer). Therefore, the microtubule itself has a distinct structural polarity, with 
a-tubulins exposed at one end and (3-tubulins exposed at the other end. 

The actin subunit is a single globular polypeptide chain and is thus a 
monomer rather than a dimer. Like tubulin, each actin subunit has a binding 
site for a nucleotide, but for actin the nucleotide is ATP (or ADP) rather than GTP 
(or GDP) (Figure 16-12). As for tubulin, the actin subunits assemble head-to-tail 
to generate filaments with a distinct structural polarity. The actin filament can 
be considered to consist of two parallel protofilaments that twist around each 
other in a right-handed helix. Actin filaments are relatively flexible and easily 
bent compared with the hollow cylindrical microtubules, with a persistence 
length of only a few tens of micrometers. But in a living cell, accessory proteins 
(see below) crosslink and bundle them together, making these large-scale actin 
structures much stronger than an individual actin filament. 



Microtubules and Actin Filaments Have Two Distinct Ends That 
Grow at Different Rates 

The regular, parallel orientation of their subunits gives actin filaments and 
microtubules structural polarity. This orientation makes the two ends of each 
polymer different in ways that have a profound effect on filament growth rates. 
Addition of a subunit to either end of a filament of n subunits results in a fila- 
ment of n + 1 subunits. In the absence of ATP or GTP hydrolysis, the free energy 
difference, and therefore the equilibrium constant (and the critical concentra- 
tion), must be the same for addition of subunits at either end of the polymer. In 
this case, the ratio of the forward and backward rate constants, k Jkoff> must be 
identical at the two ends, even though the absolute values of these rate con- 
stants may be very different at each end. 

In a structurally polar filament, the kinetic rate constants for association and 
dissociation— k on and fc ff, respectively— are often much greater at one end than 
at the other. Thus, if an excess of purified subunits is allowed to assemble onto 
marked fragments of preformed filaments, one end of each fragment elongates 
much faster than the other (Figure 16-13). If filaments are rapidly diluted so that 
the free subunit concentration drops below the critical concentration, the fast- 
growing end also depolymerizes fastest. The more dynamic of the two ends of a 
filament, where both growth and shrinkage are fast, is called the plus end, and 
the other end is called the minus end. 
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Figure 16-1 2 The structures of an actin 
monomer and actin filament. (A) The 

actin monomer has a nucleotide (either 
ATP or ADP) bound in a deep cleft in the 
center of the molecule. (B) Arrangement 
of monomers in a filament. Although the 
filament is often described as a single 
helix of monomers, it can also be thought 
of as consisting of two protofilaments, 
held together by lateral contacts, which 
wind around each other as two parallel 
strands of a helix, with a twist repeating 
every 37 nm. All the subunits within the 
filament have the same orientation. 
(C) Electron micrographs of negatively 
stained actin filaments. (C, courtesy of 
Roger Craig.) 
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Figure 16-1 3 The preferential growth of microtubules at the plus end. 

Both microtubules and actin filaments grow faster at one end than at the 
other. In this case, a stable bundle of microtubules obtained from the core 
of a cilium (discussed later) was incubated for a short time with tubulin 
subunits under polymerizing conditions. Microtubules grow fastest from 
the plus end of the microtubule bundle, the end at the top in this 
micrograph. (Courtesy of Gary Borisy.) 

On microtubules, a subunits are exposed at the minus end, and p subunits 
are exposed at the plus end. On actin filaments, the ATP-binding cleft on the 
monomer points toward the minus end. (For historical reasons, the plus ends of 
actin filaments are usually referred to as "barbed" ends, and minus ends as 
"pointed" ends, because of the arrowhead appearance of myosin heads when 
bound along the filament.) 

Filament elongation proceeds spontaneously when the free energy change 
(AG) for addition of the soluble subunit is less than zero. This is the case when 
the concentration of subunits in solution exceeds the critical concentration. 
Likewise, filament depolymerization proceeds spontaneously when this free 
energy change is greater than zero. A cell can couple an energetically unfavor- 
able process to these spontaneous processes; thus, the cell can use free energy 
released during spontaneous filament polymerization or depolymerization to 
do mechanical work — in particular, to push or pull an attached load. For exam- 
ple, elongating microtubules can help push out membranes, and shrinking 
microtubules can help pull mitotic chromosomes away from their sisters during 
anaphase. Similarly, elongating actin filaments help protrude the leading edge of 
motile cells, as we discuss later. 



newly formed microtubules 
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Filament Treadmilling and Dynamic Instability Are Consequences 
of Nucleotide Hydrolysis by Tubulin and Actin 

Thus far, our discussion of filament dynamics has ignored a critical fact that 
applies to both actin filaments and microtubules. In addition to their ability to 
form noncovalent polymers, the actin and tubulin subunits are both enzymes 
that can catalyze the hydrolysis of a nucleoside triphosphate, ATP or GTR 
respectively. For the free subunits, this hydrolysis proceeds very slowly; however, 
it is accelerated when the subunits are incorporated into filaments. Shortly after 
incorporation of an actin or tubulin subunit into a filament, nucleotide hydrol- 
ysis occurs; the free phosphate group is released from each subunit, but the 
nucleoside diphosphate remains trapped in the filament structure. (On tubulin, 
the nucleotide-binding site lies at the interface between two neighboring sub- 
units — see Figure 16-11, whereas in actin, the nucleotide is deep in a cleft near 
the center of the subunit — see Figure 16-12.) Thus, two different types of fila- 
ment structures can exist, one with the "T form" of the nucleotide bound (ATP 
for actin, GTP for tubulin), and one with the "D form" bound (ADP for actin, GDP 
for tubulin). 

When the nucleotide is hydrolyzed, much of the free energy released by 
cleavage of the high-energy phosphate-phosphate bond is stored in the polymer 
lattice. This makes the free energy change for dissociation of a subunit from the 
D-form polymer more negative than the free energy change for dissociation of a 
subunit from the T-form polymer. Consequently, the ratio of /c ff lk on for the D- 
form polymer, which is numerically equal to its critical concentration [Q(D)], is 
larger than the corresponding ratio for the T-form polymer. Thus, C C (D) is 
greater than Q(T). For certain concentrations of free subunits, D-form polymers 
will therefore shrink while T-form polymers grow. 

In living cells, most of the free subunits are in the T form, as the free con- 
centration of both ATP and GTP is about ten-fold higher than that of ADP and 
GDP. The longer the time that subunits have been in the polymer lattice, the 
more likely they are to have hydrolyzed their bound nucleotide. Whether the 
subunit at the very end of a filament is in the T or the D form depends on the rate 
of this hydrolysis compared with the rate of subunit addition. If the rate of sub- 
unit addition is high, that is if the filament is growing rapidly, then it is likely that 
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soluble subunits are in T form ( ■ ) 
polymers are a mixture of T form ( ■ ) and D form ( W ) 



*■* 



POLYMERIZATION FOLLOWED 
BY NUCLEOSIDE HYDROLYSIS 




minus-end addition is slow- 
hydrolysis catches up 



plus-end addition is fast — 
hydrolysis lags behind 
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For C C (T)<C<C C (D) 
treadmilling occurs 



a new subunit will add on to the polymer before the nucleotide in the previously 
added subunit has been hydrolyzed, so that the tip of the polymer remains in the 
T form, forming an ATP cap or GTP cap. However, if the rate of subunit addition 
is low, hydrolysis may occur before the next subunit is added, and the tip of the 
filament will then be in the D form. 

The rate of subunit addition at the end of a filament is the product of the free 
subunit concentration and the rate constant fcbn. The k on is much faster for the 
plus end of a filament than for the minus end because of a structural difference 
between the two ends (see Panel 16-2). At an intermediate concentration of free 
subunits, it is therefore possible for the rate of subunit addition to be faster than 
nucleotide hydrolysis at the plus end, but slower than nucleotide hydrolysis at 
the minus end. In this case, the plus end of the filament remains in the T con- 
formation, while the minus end adopts the D conformation. As just explained, 
the D form has a higher critical concentration than the T form. (In other words, 
the D form leans more readily toward disassembly, while the T form leans more 
readily toward assembly). If the concentration of free subunits in solution is in 
an intermediate range— higher than the critical concentration of the T form 
(that is, the plus end), but lower than the critical concentration of the D form 
(that is, the minus end)— the filament adds subunits at the plus end, and simul- 
taneously loses subunits from the minus end. This leads to the remarkable prop- 
erty of filament treadmilling (Figure 16-14 and Panel 16-2). 

During treadmilling, subunits are recruited at the plus end of the polymer in 
the T form and shed from the minus end in the D form. The ATP or GTP hydrol- 
ysis that occurs along the way gives rise to the difference in the free energy of the 
association/dissociation reactions at the plus and minus ends of the actin fila- 
ment or microtubule and thereby makes treadmilling possible. At a particular 
intermediate subunit concentration, the filament growth at the plus end exactly 
balances the filament shrinkage at the minus end. Now, the subunits cycle 
rapidly between the free and filamentous states, while the total length of the fil- 
ament remains unchanged. This "steady-state treadmilling" requires a constant 
consumption of energy in the form of nucleoside triphosphate hydrolysis. While 
the extent of treadmilling inside the cell is uncertain, the treadmilling of single 
filaments has been observed in vitro for actin, and a phenomenon that looks like 
treadmilling can be observed in live cells for individual microtubules (Figure 
16-15). 

Figure 1 6-1 5 Treadmilling behavior of a microtubule, as observed in a 
living cell. A cell was injected with tubulin that had been covalently linked 
to the fluorescent dye rhodamine, so that approximately 1 tubulin subunit 
in 20 was fluorescent. The fluorescence of individual microtubules was 
then observed with a sensitive electronic camera. The microtubule shown 
appears to be sliding from left to right, but, in fact the microtubule lattice 
remains stationary (as shown by a landmark within the microtubule lattice 
indicated by the red arrowhead), while the plus end (on the right) grows 
and the minus end (on the left) shrinks. The plus end also displays dynamic 
instability. (From CM. Waterman-Storer and E.D. Salmon, J. Cell Biol. 
1 39:41 7-434, 1 997. With permission from The Rockefeller University Press.) 



Figure 16-14The treadmilling of an 
actin filament or microtubule, made 
possible by the nucleoside triphosphate 
hydrolysis that follows subunit addition. 

(A) Explanation for the different critical 
concentrations (C c ) at the plus and minus 
ends. Subunits with bound nucleoside 
triphosphate (T-form subunits) 
polymerize at both ends of a growing 
filament, and then undergo nucleotide 
hydrolysis in the filament lattice. As the 
filament grows, elongation is faster than 
hydrolysis at the plus end in this example, 
and the terminal subunits at this end are 
therefore always in the T form. However, 
hydrolysis is faster than elongation at the 
minus end, and so terminal subunits at 
this end are in the D form. 

(B) Treadmilling occurs at intermediate 
concentrations of free subunits. The 
critical concentration for polymerization 
on a filament end in the T form is lower 
than for a filament end in the D form. If 
the actual subunit concentration is 
somewhere between these two values, 
the plus end grows while the minus end 
shrinks, resulting in treadmilling. 
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ON RATES AND OFF RATES 

A linear polymer of protein molecules, such 
as an actin filament or a microtubule, 
assembles (polymerizes) and disassembles 
(depolymerizes) by the addition and removal 
of subunits at the ends of the polymer. The 
rate of addition of these subunits (called 
monomers) is given by the rate constant k oni 
which has units of M" 1 sec" 1 . The rate of loss 
is given by k off (units of sec" 1 ). 



polymer (with n subunits) 



subunit 



polymer (with n +1 subunits) 



A helical polymer is stabilized by multiple contacts between 
adjacent subunits. In the case of actin, two actin molecules bind relatively weakly 
to each other, but addition of a third actin monomer to form a trimer makes the 
entire group more stable. 




\J 



Further monomer addition can take place onto this trimer, which therefore acts 
as a nucleus for polymerization. For tubulin, the nucleus is larger and has a more 
complicated structure (possibly a ring of 13 or more tubulin molecules)^but the 

principle is the same. 

The assembly of a nucleus is relatively slow, which explains the lag phase 
seen during polymerization. The lag phase can be reduced or abolished 
entirely by adding premade nuclei, such as fragments of already polymerized 
microtubules or actin filaments. 



THE CRITICAL CONCENTRATION 

The number of monomers that add to the 
polymer (actin filament or microtubule) per 
second will be proportional to the 
concentration of the free subunit (k on Q, but 
the subunits will leave the polymer end at a 
constant rate (/c off ) that does not depend on C 
As the polymer grows, subunits are used up, 
and C is observed to drop until it reaches a 
constant value, called the critical concentration 
(C c ). At this concentration the rate of subunit 
addition equals the rate of subunit loss. 
At this equilibrium, 



*nn C. - K„ 



so that 



Q = 



K off 



(where K is the equilibrium constant for 
subunit addition; see Figure 3-43). 



TIME COURSE OF POLYMERIZATION 

The assembly of a protein into a long helical polymer such as a cytoskeletal 
filament or a bacterial flagellum typically shows the following time course: 

EQUILIBRIUM 
I PHASE 



1 




I! 




c 
o 
1 r 


LAG 
PHASE J 


/ 1 GROWTH 
/ / PHASE 


—I J 


^ 



The lag phase corresponds to time taken for nucleation. time — ^ 

The growth phase occurs as monomers add to the exposed ends of the 
growing filament, causing filament elongation. 

The equilibrium phase, or steady state, is reached when the growth of the 
polymer due to monomer addition precisely balances the shrinkage of the 
polymer due to disassembly back to monomers. 



PLUS AND MINUS ENDS 

The two ends of an actin filament or microtubule polymerize 
at different rates. The fast-growing end is called the plus end, 
whereas the slow-growing end is called the minus end. The 
difference in the rates of growth at the two ends is made 
possible by changes in the conformation of each subunit as 
it enters the polymer. 



free 
subunit 



subunft in 

polymer 




This conformational change affects the rates at which subunits add to 
the two ends. 

Even though k on and Ar Q ff will have different values for the plus and 
minus ends of the polymer, their ratio /r off //c on — and hence C c — must be 
the same at both ends for a simple polymerization reaction (no ATP or 
GTP hydrolysis). This is because exactly the same subunit interactions 
are broken when a subunit is lost at either end, and the final state of 
the subunit after dissociation is identical. Therefore, the AG for subunit 



plu5 

end 
I SLOW FAST 

loss, which determines the equilibrium constant for its association 
with the end, is identical at both ends: if the plus end grows four 
times faster than the minus end, it must also shrink four times 
faster. Thus, for C > C c , both ends grow; for C < C c , both ends 
shrink. 

The nucleoside triphosphate hydrolysis that accompanies 
actin and tubulin polymerization removes this constraint. 



NUCLEOTIDE HYDROLYSIS 

Each actin molecule carries a tightly bound ATP molecule that is hydrolyzed to a 
tightly bound ADP molecule soon after its assembly into the polymer. Similarly, each 
tubulin molecule carries a tightly bound GTP that is converted to a tightly bound 
GDP molecule soon after the molecule assembles into the polymer. 

^^_ ---^^™. . . „ (T= monomer carrying ATP 

x't ^ vT ' 4 or GTp ) 

- - -^™ 1 - - A****. (D = monomer carrying ADP 

free monomer subunit in polymer or GDP) 

Hydrolysis of the bound nucleotide reduces the binding affinity of the subunit for 
neighboring subunits and makes it more likely to dissociate from each end of the 
filament (see Figure 16-16 for a possible mechanism). It is usually the i&fc form 
that adds to the filament and the |jr form that leaves. 
Considering events at the plus end only: 



T 



ATP CAPS AND GTP CAPS 

The rate of addition of subunits to a 
growing actin filament or microtubule 
can be faster than the rate at which their 
bound nucleotide is hydrolyzed. Under 
such conditions, the end has a "cap" 
of subunits containing the nucleoside 
triphosphate— an ATP cap on an actin 
filament or a GTP cap on a microtubule. 




ATP or GTP cap 



As before, the polymer will grow until C= C c . For illustrative purposes, we can ignore | 
/c D on and k T Qf f since they are usually very small, so that polymer growth ceases when 



k on C — k , 



off 



or 



Q = 



off 



k T 



This is a steady state and not a true equilibrium, because the ATP or GTP that is 
hydrolyzed must be replenished by a nucleotide exchange reaction of the 
free subunit — \T ) 






DYNAMIC INSTABILITY and 
I TREADMILLING are two behaviors 
observed in cytoskeletal polymers. Both 
are associated with nucleoside 
triphosphate hydrolysis. Dynamic instability 
is believed to predominate in microtubules, 
whereas treadmilling may predominate 
in actin filaments. 



TREADMILLING 

One consequence of the nucleotide hydrolysis that accompanies polymer 
formation is to change the critical concentration at the two ends of the 
polymer. Since /c D off and k T on refer to different reactions, their ratio 
k D fffk T on need not be the same at both ends of the polymer, so that: 

C c (minus end) > Q (plus end) 

Thus, if both ends of a polymer are exposed, polymerization proceeds 
until the concentration of free monomer reaches a value that 
is above C c for the plus end but below C c for the minus end. At this 
steady state, subunits undergo a net assembly at the plus end and a 
net disassembly at the minus end at an identical rate. The polymer 
maintains a constant length, even though there is a net flux of subunits 
through the polymer, known as treadmilling. 
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DYNAMIC INSTABILITY 

Microtubules depolymerize about 100 times faster from an end containing GDP 
tubulin than from one containing GTP tubulin. A GTP cap favors growth, 
but if it is lost, then depolymerization ensues. 

GTP cap 
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GROWING 



SHRINKING 
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Individual microtubules can therefore alternate between a period of slow growth and 
a period of rapid disassembly, a phenomenon called dynamic instability. 
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The kinetic differences between the behavior of the T form and the D form 
have another important consequence for the behaviors of filaments. If the rate 
of subunit addition at one end is similar in magnitude to the rate of hydrolysis, 
there is a finite probability that this end will start out in a T form, but that hydrol- 
ysis will eventually "catch up" with the addition and transform the end to a D 
form. This transformation is sudden and random, with a certain probability per 
unit time. 

Suppose that the concentration of free subunits is intermediate between the 
critical concentration for a T-form end and the critical concentration for a D- 
form end (that is, in the same range of concentrations where treadmilling is 
observed) . Now, any end that happens to be in the T form will grow, whereas any 
end that happens to be in the D form will shrink. On a single filament, an end 
might grow for a certain length of time in a T form, but then suddenly change to 
the D form and begin to shrink rapidly, even while the free subunit concentra- 
tion is held constant. At some later time, it might then regain a T-form end and 
begin to grow again. This rapid interconversion between a growing and shrink- 
ing state, at a uniform free subunit concentration, is called dynamic instability 
(Figure 16-16A). The change from growth to rapid shrinkage is called a catastro- 
phe, while the change to growth is called a rescue. <CCCA> 

In a population of microtubules, at any instant some of the ends are in the T 
form and some are in the D form, with the ratio depending on the hydrolysis rate 
and the free subunit concentration. The structural difference between a T-form 
end and a D-form end is dramatic. Tubulin subunits with GTP bound to the p- 
monomer produce straight protofilaments that make strong and regular lateral 
contacts with one another. But the hydrolysis of GTP to GDP is associated with 
a subtle conformational change in the protein, which makes the protofilaments 
curved (Figure 16-16B). On a rapidly growing microtubule, the GTP cap is 
thought to constrain the curvature of the protofilaments, and the ends appear 
straight. But when the terminal subunits have hydrolyzed their nucleotides, this 
constraint is removed, and the curved protofilaments spring apart. This cooper- 
ative release of the energy of hydrolysis stored in the microtubule lattice causes 
the curled protofilaments to peel off rapidly, and rings and curved oligomers of 
GDP- containing tubulin are seen near the ends of depolymerizing microtubules 
(Figure 16-16C). 

Actin filaments also undergo length fluctuations but on a much smaller 
scale, so that at steady state the length fluctuates only a micrometer or so over 
several minutes, as compared to tens of micrometers for microtubules undergo- 
ing dynamic instability. In most eucaryotic cells, dynamic instability is thought 
to predominate in microtubules, whereas treadmilling may predominate in 
actin filaments. 

Treadmilling and Dynamic Instability Aid Rapid Cytoskeletal 
Rearrangement <aaat> 

Both dynamic instability and treadmilling allow a cell to maintain the same 
overall filament content, while individual subunits constantly recycle between 
the filaments and the cytosol. How dynamic are the microtubules and actin fila- 
ments inside a living cell? Typically, a microtubule, with major structural differ- 
ences between its growing and shrinking ends, switches between growth and 
shrinkage every few minutes. The ends of individual microtubules can therefore 
be seen in real time to exhibit dynamic instability (Figure 16-17). Because of 
their smaller size and denser packing, it is much more difficult to resolve the 
ends of individual actin filaments within living cells. With appropriate tech- 
niques based on fluorescence microscopy, however, one can show that actin fil- 
ament turnover is typically rapid, with individual filaments persisting for only a 
few tens of seconds or minutes. 

At first glance, the dynamic behavior of filaments seems like a waste of 
energy. To maintain a constant concentration of actin filaments and micro- 
tubules, most of which are undergoing a process of either treadmilling or 
dynamic instability, the cell must hydrolyze large amounts of nucleoside 
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triphosphate. As we explained with our ant-trail analogy at the beginning of the 
chapter, the advantage to the cell seems to be the spatial and temporal flexibil- 
ity that is inherent in a structural system with constant turnover. Individual sub- 
units are small and can diffuse very rapidly; an actin or tubulin subunit can dif- 
fuse across the diameter of a typical eucaryotic cell in several seconds. As noted 
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Figure 1 6-16 Dynamic instability due to the structural differences between a growing and a shrinking microtubule end. (A) If the free 
tubulin concentration in solution is between the critical values indicated in Figure 16-14B, a single microtubule end may undergo transitions 
between a growing state and a shrinking state. A growing microtubule has GTP-containing subunits at its end, forming a GTP cap. If nucleotide 
hydrolysis proceeds more rapidly than subunit addition, the cap is lost and the microtubule begins to shrink, an event called a "catastrophe." 
But GTP-containing subunits may still add to the shrinking end, and if enough add to form a new cap, then microtubule growth resumes, an 
event called "rescue." (B) Model for the structural consequences of GTP hydrolysis in the microtubule lattice. The addition of GTP-containing 
tubulin subunits to the end of a protofilament causes the end to grow in a linear conformation that can readily pack into the cylindrical wall of 
the microtubule. Hydrolysis of GTP after assembly changes the conformation of the subunits and tends to force the protofilament into a curved 
shape that is less able to pack into the microtubule wall. (C) In an intact microtubule, protofilaments made from GDP-containing subunits are 
forced into a linear conformation by the many lateral bonds within the microtubule wall, given a stable cap of GTP-containing subunits. Loss of 
the GTP cap, however, allows the GDP-containing protofilaments to relax into their more curved conformation. This leads to a progressive 
disruption of the microtubule. Above the drawings of a growing and a shrinking microtubule, electron micrographs show actual microtubules 
in each of these two states, as observed in preparations in vitreous ice. Note particularly the curling, disintegrating GDP-containing 
protofilaments at the end of the shrinking microtubule. (C, courtesy of E.M. Mandelkow, E. Mandelkow and R.A. Milligan, 1 Ceil Biol. 
1 14:977-991, 1991. With permission from The Rockefeller University Press.) 
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Figure 16-17 Direct observation of the 
dynamic instability of microtubules in a 
living cell. <AAAT> Microtubules in a 
newt lung epithelial cell were observed 
after the cell was injected with a small 
amount of rhodamine labeled tubulin, as 
in Figure 16-1 5. Notice the dynamic 
instability of microtubules at the edge of 
the cell. Four individual microtubules are 
highlighted for clarity; each of these 
shows alternating shrinkage and growth. 
(Courtesy of Wendy C. Salmon and Clare 
Waterman-Storer.) 



previously, the rate-limiting step in the formation of a new filament is nucle- 
ation, so these rapidly diffusing subunits tend to assemble either on the ends of 
preexisting filaments or at particular sites where special proteins catalyze the 
nucleation step. The new filaments in either case are highly dynamic, and unless 
specifically stabilized, they have only a fleeting existence. By controlling where 
filaments are nucleated and selectively stabilized, a cell can control the location 
of its filament systems, and hence its structure. It seems that the cell is continu- 
ally testing a wide variety of internal structures and only preserving those that 
are useful. When external conditions change, or when internal signals arise (as 
during the transitions in the cell cycle), the cell is poised to change its structure 
rapidly (see Figures 16-2 to 16-4). 

Actin and tubulin have independently evolved their nucleoside triphos- 
phate hydrolysis to enable their filaments to depolymerize readily after they 
have polymerized. These two proteins are completely unrelated in amino acid 
sequence: actin is distantly related in structure to the glycolytic enzyme hexoki- 
nase, whereas tubulin is distantly related to a large family of GTPases that 
includes the heterotrimeric G proteins and monomeric GTPases such as Ras 
(discussed in Chapter 3), In both protein families, the coupling between 
nucleotide hydrolysis and a protein conformational change that alters protein 
function appears to be evolution arily very ancient; however, the purposes of that 
structural coupling have diverged over time to include signal transmission, 
catalysis, and regulation of the poh^ierization/depolymerization cycle, 

In certain specialized structures, parts of the cytoskeleton become less 
dynamic. In a terminally differentiated cell such as a neuron, for example, it is 
desirable to maintain a consistent structure over time, and many of the actio fil- 
aments and microtubules are stabilized by association with other proteins. 
However, when new connections are made in the brain, as when the informa- 
tion you are reading now is transferred into long-term memory even a cell as 
stable as a neuron can grow new elongated processes to make new synapses. To 
do this, a neuron requires the dynamic, exploratory activities of its cytoskeletal 
filaments. 



Tubulin and Actin Have Been Highly Conserved During Eucaryotic 
Evolution 

Tubulin is found in all eucaryotic ceils, and it exists in multiple isoforms. Yeast 
and human tubulins are 75% identical in amino acid sequence. In mammals, 
there are at least six forms of a- tubulin and a similar number of forms of p- tubu- 
lin, each encoded by a different gene, The different forms of tubulin are very 
similar, and they generally will copolymerize into mixed microtubules in the test 
tube. However, they can have distinct locations in a cell and perform subtly dif- 
ferent functions. As a striking example, a specific form of (3-tubuhn forms the 
microtubules in six specialized touch-sensitive neurons in the nematode 
Caenorhabditis elegans. Mutations that eliminate this protein result in the loss of 
touch-sensitivity with no apparent defect in other functions. 
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Like tubulin, actin is found in all eucaryotic cells. Most organisms have mul- 
tiple genes encoding actin; humans have six. Actin is extraordinarily well con- 
served among eucaryotes. The amino acid sequences of actins from different 
species are usually about 90% identical. But, again like tubulin, small variations 
in actin amino acid sequence can cause significant functional differences. In 
vertebrates, there are three subtly different isoforms of actin, termed a, (3, and y, 
that differ slightly in their amino acid sequences. The a-actin is expressed only 
in muscle cells, while (3 and y are found together in almost all nonmuscle cells. 
Yeast actin and Drosophila muscle actin are 89% identical, yet the expression of 
yeast actin in Drosophila results in a fly that looks normal but is unable to fly. 

Why are the amino acid sequences of actin and tubulin so strictly conserved 
in eucaryotic evolution, whereas the sequences of most other cytoskeletal pro- 
teins, including intermediate filament proteins and the large families of acces- 
sory proteins that bind to actin or tubulin, are not? The likely explanation is that 
the need for large numbers of other proteins to interact with the entire surface 
of an actin filament or microtubule limits the variability of their structures. 
Genetic and biochemical studies in the yeast Saccharomyces cerevisiae have 
demonstrated that actin interacts directly with dozens of other proteins, and 
indirectly with even more (Figure 16-18). Thus, any mutation in actin that could 
result in a desirable change in its interaction with one protein might cause unde- 
sirable changes in its interactions with other proteins that bind at or near the 
same site. Over time, evolving organisms have found it more profitable to leave 
actin and tubulin alone, and to alter their binding partners instead. 



Intermediate Filament Structure Depends on the Lateral Bundling 
and Twisting of Coiled Coils 

All eucaryotic cells contain actin and tubulin. But the third major type of 
cytoskeletal protein, the intermediate filament, forms a cytplasmic filament in 
only some metazoans — including vertebrates, nematodes, and mollusks. Even 
in these organisms, intermediate filaments are not required in the cytoplasm of 
every cell type. The specialized glial cells (called oligodendrocytes) that make 
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Figure 16-18 Actin at the crossroads. 

Actin binds to a very large variety of 
accessory proteins in all eucaryotic cells. 
This diagram shows most of the 
interactions that have been 
demonstrated, using either genetic or 
biochemical techniques, in the yeast 
Saccharomyces cerevisiae. Accessory 
proteins that operate in the same 
intracellular process are shown in the 
same color, as indicated in the key. 
(Adapted from D. Botstein et al., in The 
Molecular and Cellular Biology of the 
Yeast Saccharomyces [J.R. Broach, 
J.R. Pringle, E.W. Jones, eds.], Cold Spring 
Harbor, NY: Cold Spring Harbor 
Laboratory Press, 1991.) 
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myelin in the vertebrate central nervous system, for example, do not contain 
such intermediate filaments. Intermediate filaments are particularly prominent 
in the cytoplasm of cells that are subject to mechanical stress, and are generally 
not found in animals that have rigid exoskeletons such as arthropods and echin- 
oderms. It seems that intermediate filaments play an important role in impart- 
ing mechanical strength to tissues for the squishier animals. 

Cytoplasmic intermediate filaments are closely related to their ancestors, 
the much more prevalent nuclear lamins. The nuclear lamins are intermediate 
filament proteins that form a meshwork lining the inner membrane of the 
eucaryotic nuclear envelope, where they provide anchorage sites for chromo- 
somes and nuclear pores (their dynamic behavior during cell division is dis- 
cussed in Chapter 12). Several times during metazoan evolution, lamin genes 
have apparently duplicated, and the duplicates have evolved to produce rope- 
like, cytoplasmic intermediate filaments. 

The individual polypeptides of intermediate filaments are elongated 
molecules with an extended central a-helical domain that forms a parallel coiled 
coil with another monomer. A pair of parallel dimers then associates in an 
antiparallel fashion to form a staggered tetramer. This tetramer represents the 
soluble subunit that is analogous to the aP-tubulin dimer, or to the actin 
monomer (Figure 16-19). Unlike the actin or tubulin, the intermediate filament 
subunits do not contain a binding site for a nucleoside triphosphate. 



Figure 1 6-1 9 A model of intermediate 
filament construction. <GCCA> The 
monomer shown in (A) pairs with an 
identical monomer to form a dimer (B), in 
which the conserved central rod domains 
are aligned in parallel and wound 
together into a coiled coil. (C) Two dimers 
then line up side by side to form an 
antiparallel tetramer of four polypeptide 
chains. The tetramer is the soluble 
subunit of intermediate filaments. 
(D) Within each tetramer, the two dimers 
are offset with respect to one another, 
thereby allowing it to associate with 
another tetramer. (E) In the final 10-nm 
rope-like filament, tetramers are packed 
together in a helical array, which has 16 
dimers (32 coiled coils) in cross-section. 
Half of these dimers are pointing in each 
direction. An electron micrograph of 
intermediate filaments is shown on the 
upper left. (Electron micrograph courtesy 
of Roy Quinlan.) 
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Table 16-1 Major Types of Intermediate Filament Proteins in Vertebrate Cells 



TYPES OF IF 


COMPONENT POLYPEPTIDES 


LOCATION 


Nuclear 


lamins A, B, and C 


nuclear lamina (inner lining 
of nuclear envelope) 


Vimentin-like 


vimentin 


many cells of mesenchymal 
origin 




desmin 


muscle 




glial fibrillary acidic protein 


glial cells (astrocytes and 
some Schwann cells) 




peripherin 


some neurons 


Epithelial 


type 1 keratins (acidic) 


epithelial cells and their 




type II keratins (basic) 


derivatives (e.g., hair and nails) 


Axon a 1 


neurofilament proteins 
(NF-L, NF-M, and NF-H) 


neurons 



Since the tetrameric subunit is made up of two dimers pointing in opposite 
directions, its two ends are the same. The assembled intermediate filament 
therefore lacks the overall structural polarity that is critical for actin filaments 
and microtubules. The tetramers pack together laterally to form the filament, 
which includes eight parallel protofilaments made up of tetramers. Each indi- 
vidual intermediate filament therefore has a cross section of 32 individual a- 
helical coils. This large number of polypeptides all lined up together, with the 
strong lateral hydrophobic interactions typical of coiled-coil proteins, gives 
intermediate filaments a rope-like character. They can be easily bent, with a per- 
sistence length of less than one micrometer (compared to several millimeters for 
microtubules and about ten micrometers for actin), but they are extremely diffi- 
cult to break. 

Less is understood about the mechanism of assembly and disassembly of 
intermediate filaments than of actin filaments and microtubules, but some 
types of intermediate filaments including vimentin form highly dynamic struc- 
tures in cells such as fibroblasts. Under normal conditions, protein phosphory- 
lation probably regulates their disassembly in much the same way that phos- 
phorylation regulates the disassembly of nuclear lamins in mitosis (see Figure 
12-20). As evidence for rapid turnover, labeled subunits microinjected into tis- 
sue culture cells rapidly add themselves onto the existing intermediate filaments 
within a few minutes, while an injection of peptides derived from a conserved 
helical region of the subunit induces the rapid disassembly of the intermediate 
filament network. Interestingly the latter injection can also induce the disas- 
sembly of the microtubule and actin filament networks in some cases, demon- 
strating that there is a fundamental mechanical integration of the three 
cytoskeletal systems in these cells. 



Intermediate Filaments Impart Mechanical Stability to Animal 
Cells 



Intermediate filaments come in a wide variety of types, with substantially more 
sequence variation in the subunit isoforms than occurs in the isoforms of actin 
or tubulin. A central a-helical domain has 40 or so heptad repeat motifs that 
form an extended coiled-coil structure (see Figure 3-9). This domain is similar in 
the different isoforms, but the N- and C-terminal globular domains can vary 
greatly. 

Different families of intermediate filaments are expressed in different cell 
types (Table 16-1). Keratins are the most diverse intermediate filament family: 
there are about 20 found in different types of human epithelial cells, and about 
10 more that are specific to hair and nails; analysis of the human genome 
sequence has revealed that there may be about 50 distinct keratins. Every keratin 
filament is made up of an equal mixture of type I (acidic) and type II 
(neutral/basic) keratin chains; these form heterodimers, two of which then join 
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Figure 16-20 Keratin filaments in 
epithelial cells. Immunofluorescence 
micrograph of the network of keratin 
filaments (green) in a sheet of epithelial 
cells in culture. The filaments in each cell 
are indirectly connected to those of its 
neighbors by desmosomes {discussed in 
Chapter 19). A second protein (blue) has 
been stained to reveal the location of the 
cell boundaries. (Courtesy of Kathleen 
Green and Evangeline Amargo.) 



10 [am 



to form the fundamental tetrameric subunit (see Figure 16-19). Cross-linked 
keratin networks held together by disulfide bonds may survive even the death of 
their cells, forming tough coverings for animals, as in the outer layer of skin and 
in hair, nails, claws, and scales. The diversity in keratins is clinically useful in the 
diagnosis of epithelial cancers (carcinomas), as the particular set of keratins 
expressed gives an indication of the epithelial tissue in which the cancer origi- 
nated and thus can help to guide the choice of treatment. 

A single epithelial cell may produce multiple types of keratins, and these 
copolymerize into a single network (Figure 16-20). Keratin filaments impart 
mechanical strength to epithelial tissues in part by anchoring the intermediate 
filaments at sites of cell-cell contact, called desmosomes, or cell-matrix contact, 
called hemidesmosomes (see Figure 16-5). We discuss these important adhesive 
structures in more detail in Chapter 19. 

Mutations in keratin genes cause several human genetic diseases. For exam- 
ple, when defective keratins are expressed in the basal cell layer of the epider- 
mis, they produce a disorder called epidermolysis bullosa simplex, in which the 
skin blisters in response to even very slight mechanical stress, which ruptures 
the basal cells (Figure 16-21). Other types of blistering diseases, including dis- 
orders of the mouth, esophageal lining, and the cornea of the eye, are caused by 
mutations in the different keratins whose expression is specific to those tissues. 
All of these maladies are typified by cell rupture as a consequence of mechani- 
cal trauma and a disorganization or clumping of the keratin filament cytoskele- 
ton. Many of the specific mutations that cause these diseases alter the ends of 



Figure 16-21 Blistering of the skin 
caused by a mutant keratin gene. 

A mutant gene encoding a truncated 
keratin protein (lacking both the N- and 
C -terminal domains) was expressed in a 
transgenic mouse. The defective protein 
assembles with the normal keratins and 
thereby disrupts the keratin filament 
network in the basal cells of the skin. 
Light micrographs of cross sections of 
normal (A) and mutant (B) skin show that 
the blistering results from the rupturing 
of cells in the basal layer of the mutant 
epidermis (short red arrows). (C) A sketch 
of three cells in the basal layer of the 
mutant epidermis, as observed by 
electron microscopy. As indicated by the 
red arrow, the cells rupture between the 
nucleus and the hemidesmosomes 
(discussed in Chapter 19), which connect 
the keratin filaments to the underlying 
basal lamina. (From PA Coulombe et al., 
J. Cell Biol. 115:1661-1674, 1991. With 
permission from The Rockefeller 
University Press.) 
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the central rod domain, demonstrating the importance of this particular part of 
the protein for correct filament assembly. 

A second family of intermediate filaments, called neurofilaments, is found 
in high concentrations along the axons of vertebrate neurons (Figure 16-22). 
Three types of neurofilament proteins (NF-L, NF-M, NF-H) coassemble in vivo, 
forming heteropolymers that contain NF-L plus one of the others. The NF-H and 
NF-M proteins have lengthy C-terminal tail domains that bind to neighboring 
filaments, generating aligned arrays with a uniform interfilament spacing. Dur- 
ing axonal growth, new neurofilament subunits are incorporated all along the 
axon in a dynamic process that involves the addition of subunits along the fila- 
ment length, as well as the addition of subunits at the filament ends. After an 
axon has grown and connected with its target cell, the diameter of the axon may 
increase as much as fivefold. The level of neurofilament gene expression seems 
to directly control axonal diameter, which in turn influences how fast electrical 
signals travel down the axon. 

The neurodegenerative disease amyotrophic lateral sclerosis (ALS, or Lou 
Gehrig's Disease) is associated with an accumulation and abnormal assembly of 
neurofilaments in motor neuron cell bodies and in the axon, which may inter- 
fere with normal axonal transport. The degeneration of the axons leads to mus- 
cle weakness and atrophy, which is usually fatal. The over-expression of human 
NF-L or NF-H in mice results in mice that have an ALS-like disease. 

The vimentin-like filaments are a third family of intermediate filaments. 
Desmin, a member of this family, is expressed in skeletal, cardiac, and smooth 
muscle. Mice lacking desmin show normal initial muscle development, but adults 
have various muscle cell abnormalities, including misaligned muscle fibers. 
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Figure 1 6-22 Two types of intermediate 
filaments in cells of the nervous system. 

(A) Freeze-etch electron microscopic 
image of neurofilaments in a nerve cell 
axon, showing the extensive cross-linking 
through protein cross-bridges — an 
arrangement believed to give this long 
cell process great tensile strength. The 
cross-bridges are formed by the long, 
nonhelical extensions at the C-terminus 
of the largest neurofilament protein 
(NF-H). (B) Freeze-etch image of glial 
filaments in glial cells, showing that these 
intermediate filaments are smooth and 
have few cross-bridges. (C) Conventional 
electron micrograph of a cross section of 
an axon showing the regular side-to-side 
spacing of the neurofilaments, which 
greatly outnumber the microtubules. 
(A and B, courtesy of Nobutaka Hirokawa; 
C, courtesy of John Hopkins.) 



Drugs Can Alter Filament Polymerization 

Because the survival of eucaryotic cells depends on a balanced assembly and 
disassembly of the highly conserved cytoskeletal filaments formed from actin 
and tubulin, the two types of filaments are frequent targets for natural toxins. 
These toxins are produced in self-defense by plants, fungi, or sponges that do 
not wish to be eaten but cannot run away from predators, and they generally 
disrupt the filament polymerization reaction. The toxin binds tightly to either 
the filament form or the free subunit form of a polymer, driving the assembly 
reaction in the direction that favors the form to which the toxin binds. For exam- 
ple, the drug latrunculin, extracted from the sea sponge Latrunculia magnified, 
binds to actin monomers and prevents their assembly into filaments; it thereby 
causes a net depolymerization of actin filaments. In contrast, piudloidin, from 
the fungus Amanita phalloides (death cap), binds to and stabilizes actin fila- 
ments* causing a net increase in actin polymerization. (This attractive but ined- 
ible mushroom also expresses a second deadly toxin t the RNA polymerase II 
inhibitor a-ainanitin.) Either change in actin filaments is very toxic for cells. 
Similarly, colchicine, from the autumn crocus (or meadow saffron), binds to and 
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Table 1 6-2 Drugs That Affect Actin Filaments and Microtubules 

ACTIN-SPECIFIC DRUGS 

Phalloidin binds and stabilizes filaments 

Cytochalasin caps filament plus ends 

Swinholide severs filaments 

Latrunculin binds subunits and prevents their polymerization 

IJCROTUBULE-SPEOFIC DRUGS 

binds and stabilizes microtubules 
binds subunits and prevents their polymerization 
binds subunits and prevents their polymerization 
binds subunits and prevents their polymerization 



M 



Taxol 

Colchicine, colcemid 

Vinblastine, vincristine 

Nocodazole 



stabilizes free tubulin, causing microtubule depolymerization. In contrast, 
taxol, extracted from the bark of a rare species of yew tree, binds to and stabilizes 
microtubules, causing a net increase in tubulin polymerization. These and some 
other natural products that are commonly used by cell biologists to manipulate 
the cytoskeleton are listed in Table 16-2. 

Drugs like these have a rapid and profound effect on the organization of the 
cytoskeleton in living cells (Figure 16-23). They provided early evidence that the 
cyroskeleton is a dynamic structure, maintained by a rapid and continual 
exchange of subunits between the soluble and filamentous forms, and they 
revealed thai this subunit flux is necessary for normal cytoskeietal function. 

The drugs listed in Table 16-2 have been useful to cell biologists trying to 
probe the roles of actin and microtubules in various cell processes. Some of them 
are also used to treat cancer. Both mLcrotuhule-depolymerizing drugs (such as 
vinblastine) and micro tubule- polymerizing drugs (such as taxol) preferentially 
kill dividing cells, since both microtubule assembly and disassembly are crucial 
for correct function of the mitotic spindle [discussed later hi this chapter). These 
drugs efficiently kill certain types of tumor cells in a human patient, although not 
without toxicity to rapidly dividing normal cells, including those in the bone mar- 
row, intestine, and hair follicles. Taxol in particular has been widely used to treat 
cancers of the breast and lung, and it is frequently successful in treatment of 
tumors that are resistant to other chemotherapeulic agents. 



Figure 16-23 Effect of the drug taxol on 
microtubule organization. (A) Molecular 
structure of taxol. Recently organic 
chemists have succeeded in synthesizing 
this complex molecule, which is widely 
used for cancer treatment. 
(B) Immunofluorescence micrograph 
showing the microtubule organization in 
a liver epithelial cell before the addition 
of taxol. (C) Microtubule organization in 
the same type of cell after taxol 
treatment. Note the thick circumferential 
bundles of microtubules around the 
periphery of the cell. (D) A Pacific yew 
tree, the natural source of taxol. 
(B, C from N.A. Gloushankova et al., Proc 
Natl Acad. Sci. U.S.A. 91:8597-8601, 1994. 
With permission from National Academy 
of Sciences; D, courtesy of A.K. Mitchell 
2001 . © Her Majesty the Queen in Right 
of Canada, Canadian Forest Service.) 
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Figure 16-24 The bacterial FtsZ protein, a tubulin homolog in 
procaryotes. (A) A band of FtsZ protein forms a ring in a dividing bacterial 
cell. This ring has been labeled by fusing the FtsZ protein to the green 
fluorescent protein (GFP), which allows it to be observed in living E. coil 
cells with a fluorescence microscope. Top, side view shows the ring as a bar 
in the middle of the dividing cell. Bottom, rotated view showing the ring 
structure. <B) FtsZ filaments and rings, formed in vitro, as visualized using 
electron microscopy. Compare this image with that of the microtubule 
shown on the right in Figure 16-16C. (A, from X. Ma, D.W. Ehrhardt and 
W. Margolin, Proc. Natl Acad. Sci. U.S.A. 93:12998-13003, 1996; B, from 
H.A. Erickson et al., Proc Natl Acad. Sci. U.S.A. 93:519-523, 1996. All with 
permission from National Academy of Sciences.) 

Bacterial Cell Organization and Cell Division Depend on 
Homologs of the Eucaryotic Cytoskeleton 

While eucaryotic cells are typically large and morphologically complex, bacterial 
cells are usually only a few micrometers long and assume simple, modest shapes 
such as spheres or rods. Bacteria also lack the elaborate networks of intracellu- 
lar membrane-enclosed organelles such as the endoplasmic reticulum and 
Golgi apparatus. For many years, biologists assumed that the lack of a bacterial 
cytoskeleton was one reason for these striking differences between cell organi- 
zation in the eucaryotic and bacterial kingdoms. This assumption was chal- 
lenged with the discovery in the early 1990s that nearly all bacteria and many 
archaea contain a homolog of tubulin, FtsZ, that can polymerize into filaments 
and assemble into a ring (called the Z-ring) at the site where the septum forms 
during cell division (Figure 16-24). 

The three-dimensional folded protein structure of FtsZ is remarkably simi- 
lar to the structure of a or [3 tubulin and, like tubulin, hydrolysis of GTP is trig- 
gered by polymerization and causes a conformational change in the filament 
structure. Although the Z-ring itself persists for many minutes, the individual fil- 
aments within it are highly dynamic, with an average half-life of about thirty sec- 
onds. As the bacterium divides, the Z-ring becomes smaller until it has com- 
pletely disassembled, and it is thought that the shrinkage of the Z-ring may con- 
tribute to the membrane invagination necessary for the completion of cell divi- 
sion. The Z-ring may also serve as a site for localization of specialized cell wall 
synthesis enzymes required for building the septum between the two daughter 
cells. The disassembled FtsZ subunits later reassemble at the new sites of sep- 
tum formation in the daughter cells (Figure 16-25). 

More recently, it has been found that many bacteria also contain homologs 
of actin. Two of these, MreB and Mbl, are found primarily in rod-shaped or spi- 
ral-shaped cells, and mutations disrupting their expression cause extreme 
abnormalities in cell shape and defects in chromosome segregation (Figure 
16-26). MreB and Mbl filaments assemble in vivo to form large-scale spirals that 




100 nm 




(B) 



time 
(min) 




1 jam 



2 urn 



Figure 16-25 Rapid rearrangements of FtsZ through the bacterial cell cycle. (A) After chromosome segregation is complete, 

the ring formed by FtsZ at the middle of the cell becomes smaller as the cell pinches in two, much like the contractile ring 

formed by actin and myosin filaments in eucaryotic cells. The FtsZ filaments that have disassembled as the cells have separated 

then reassemble to form two new rings at the middle of the two daughter cells. (B) Dividing chloroplasts (red) from a red alga 

also make use of a protein ring made from FtsZ (yellow) for cleavage. (A, from Q. Sun and W. Margolin, J. Bacteriol. 

1 80:2050-2056, 1 998. With permission from American Society for Microbiology; B, from S. Miyagishima et al., Plant Cell 

1 3:2257-2268, 2001 . With permission from American Society of Plant Biologists.) 
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span the length of the cell and apparently contribute to cell shape determina- 
tion by serving as a scaffold to direct the synthesis of the peptidoglycan cell wall, 
in much the same way that microtubules help organize the synthesis of the cel- 
lulose cell wall in higher plant cells (see Figure 19-82). As with FtsZ, the fila- 
ments within the MreB and Mbl spirals are highly dynamic, with half-lives of a 
few minutes; as for actin, ATP hydrolysis accompanies the polymerization pro- 
cess. 

Diverse relatives of MreB and Mbl have more specialized roles. A particularly 
intriguing bacterial actin homolog is ParM, which is encoded on certain bacte- 
rial plasmids that also carry genes responsible for antibiotic resistance and fre- 
quently cause the spread of multi-drug resistance in epidemics. Bacterial plas- 
mids typically encode all the gene products that are necessary for their own seg- 
regation, presumably as a strategy to ensure their faithful inheritance and prop- 
agation in their bacterial hosts. In vivo, ParM assembles into a filamentous 
structure that associates at each end with a copy of the plasmid that encodes it, 
and growth of the ParM filament appears to push the replicated plasmid copies 
apart, rather like a mitotic spindle operating in reverse (Figure 16-27). Although 
ParM is a structural homolog of actin, its dynamic behavior differs significantly. 
ParM filaments undergo dramatic dynamic instability in vitro, more closely 
resembling microtubules than actin filaments in the way that they grow and 
shrink. The spindle-like structure is apparently built by the selective stabiliza- 
tion of spontaneously nucleated filaments that bind to specialized proteins 
recruited to the origins of replication on the plasmids. 

The various bacterial actin homologs share similar molecular structures but 
their amino acid sequence similarity to each other is quite low (-10-15% identi- 
cal residues). They assemble into filaments with distinct helical packing pat- 
terns, which may also have very different dynamic behaviors. Rather than using 
the same well-conserved actin for many different purposes, as eucaryotic cells 
do, bacteria have apparently opted to proliferate and specialize their actin 
homologs for distinct purposes. 

It is now clear that the general principle of organizing cell structure by the 
self- association of nucleotide-binding proteins into dynamic helical filaments is 
used in all cells, and that the two major families of actin and tubulin are very 
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Figure 16-26 Actin homologs in 
bacteria determine cell shape. (A) The 

common soil bacterium Bacillus subtilis 
normally forms cells with a regular rod- 
like shape. (B) B. subtilis cells lacking the 
actin homolog Mbl grow into irregular 
twisted tubes and eventually die. (C) The 
Mbl protein forms long helices made of 
up many short filaments that run the 
length of the bacterial cell and help to 
direct the sites of cell wall synthesis. 
(From L.J. Jones, R. Carbadillo-Lopez and 
J. Errington, Cell 104: 913-922, 2001. With 
permission from Elsevier.) 
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Figure 1 6-27 Role of the actin homolog 
ParM in plasmid segregation. (A) Some 
bacterial drug-resistance plasmids 
{yellow) encode an actin homolog, ParM, 
that will spontaneously nucleate to form 
small, dynamic filaments {green) 
throughout the bacterial cytoplasm. A 
second plasmid-encoded protein {blue) 
binds to specific DNA sequences in the 
plasmid, and also stabilizes the dynamic 
ends of the ParM filaments. When the 
plasmid has duplicated, so that the ParM 
filaments can be stabilized at both ends, 
the filaments grow and push the 
duplicated plasmids to opposite ends of 
the cell. (B) In these bacterial cells 
harboring a drug-resistance plasmid, the 
plasmids are labeled in red and the ParM 
protein in green. Left, a short ParM 
bundle connects the two daughter 
plasmids shortly after their duplication. 
Right, the fully assembled ParM filament 
has pushed the duplicated plasmids to 
the cell poles. (A, adapted from 
E.C. Garner, C.S. Campbell and 
R.D. Mullins, Science 306:1 02 1 -1 025, 
2004. With permission from AAAS; B, from 
J. Moller-Jensen et al., Mol. Cell 
1 2:1 477-1 487, 2003. With permission 
from Elsevier.) 
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ancient, probably predating the split between the eucaryotic and bacterial king- 
doms. However, the uses to which bacteria put their cytoskeletons appear some- 
what different from their eucaryotic homologs. For example, in bacteria it is the 
tubulin (FtsZ) that is involved in cytokinesis (the pinching apart of a dividing cell 
into two daughters), while actin drives this process in eucaryotic cells. Con- 
versely, eucaryotic microtubules are responsible for chromosome segregation, 
while bacterial actins (ParM and possibly MreB) help to segregate replicated 
DNA in bacteria. 

At least one bacterial species with an unusual crescent shape, Caulobacter 
crescentus, even appears to harbor a protein with significant structural similar- 
ity to the third major class of cytoskeletal filaments found in animal cells, the 
intermediate filaments. A protein called crescentin forms a filamentous struc- 
ture that seems to influence the cell shape; when the gene encoding crescentin 
is deleted, the Caulobacter cells grow as straight rods (Figure 16-28). 

Since we now know that bacteria do in fact have sophisticated dynamic 
cytoskeletons, why then do they remain so small and morphologically simple? 
As yet there have been no motor proteins identified that walk along the bacterial 
filaments; perhaps the evolution of motor proteins was a critical step allowing 
morphological elaboration in the eucaryotes. 



Figure 1 6-28 Caulobacter and 
crescentin. The sickle-shaped bacterium 
Caulobacter crescentus expresses a protein, 
crescentin, with a series of coiled-coil 
domains similar in size and organization to 
the domains of eucaryotic intermediate 
filaments. In cells, the crescentin protein 
forms a fiber that runs down the inner side 
of the curving bacterial cell wall. When the 
gene is disrupted, the bacteria are viable 
but grow in a straight rod-shaped form. 
(From N. Ausmees, J.R. Kuhn and 
C. Jacobs-Wagner, Cell 1 1 5:705-71 3, 2003. 
With permission from Elsevier.) 



Summary 

The cytoplasm of eucaryotic cells is spatially organized by a network of protein fila- 
ments known as the cytoskeleton. This network contains three principal types of fila- 
ments: microtubules, actin filaments, and intermediate filaments. All three types of fil- 
aments form as helical assemblies ofsubunits that self- associate using a combination 
of end-to-end and side-to-side protein contacts. Differences in the structure of the sub- 
units and the manner of their self-assembly give the filaments different mechanical 
properties. Intermediate filaments are rope-like and easy to bend but hard to break. 
Microtubules are strong, rigid hollow tubes. Actin filaments are the thinnest of the 
three and are easy to break. 

In living cells, the assembly and disassembly of their subunits constantly remodels 
all three types of cytoskeletal filaments. Microtubules and actin filaments add and lose 
subunits only at their ends, with one end (the plus end) growing faster than the other. 
Tubulin and actin (the subunits of microtubules and actin filaments, respectively) 
bind and hydrolyze nucleoside triphosphates (tubulin binds GTP and actin binds 
ATP). Nucleotide hydrolysis underlies the characteristic dynamic behavior of these two 
filaments. Actin filaments in cells seem to predominantly undergo treadmilling, where 
a filament assembles at one end while simultaneously disassembling at the other end. 
Microtubules in cells predominantly display dynamic instability, where a microtubule 
end undergoes alternating bouts of growth and shrinkage. 

Whereas tubulin and actin have been strongly conserved in evolution, the family 
of intermediate filaments is very diverse. There are many tissue- specific forms found in 
the cytoplasm of animal cells, including keratin filaments in epithelial cells, neurofil- 
aments in nerve cells, and desmin filaments in muscle cells. In all these cells, the pri- 
mary job of intermediate filaments is to provide mechanical strength. 

Bacterial cells also contain homologs of tubulin, actin and intermediate filaments 
that form dynamic filamentous structures involved in determining cell shape and in 
cell division. 
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HOW CELLS REGULATE THEIR CYTOSKELETAL 
FILAMENTS 

Microtubules, actin filaments, and intermediate filaments are much more 
dynamic in cells than they are in the test tube. The cell regulates the length and 
stability of its cytoskeletal filaments, as well as their number and geometry. It 
does so largely by regulating their attachments to one another and to other com- 
ponents of the cell, so that the filaments can form a wide variety of higher-order 
structures. Direct covalent modification of the filament subunits regulates some 
filament properties, but most of the regulation is performed by a large array of 
accessory proteins that bind to either the filaments or their free subunits. Some 
of the most important accessory proteins associated with microtubules and 
actin filaments are outlined in Panel 16-3 (pp. 994-995). This section describes 
how these accessory proteins modify the dynamics and structure of cytoskeletal 
filaments. We begin with a discussion of the way that microtubules and actin fil- 
aments are nucleated in cells, because this plays a major part in determining the 
overall organization of the cell's interior. 

A Protein Complex Containing y-Tubulin Nucleates Microtubules 

While a- and p-tubulins are the regular building blocks of microtubules, another 
type of tubulin, called y- tubulin, has a more specialized role. Present in much 
smaller amounts than a- and (3-tubulin, this protein is involved in the nucle- 
ation of microtubule growth in organisms ranging from yeasts to humans. 
Microtubules are generally nucleated from a specific intracellular location 
known as a microtubule-organizing center (MTOC). Antibodies against y- tubu- 
lin stain the MTOC in virtually all species and cell types thus far examined. 

Microtubules arc nucleated at their minus end, with the plus end growing out- 
ward from each MTOC to create various types of microtubule arrays. A y-tubulin 
ring complex ly-TuRCJ diat is capable of nucleating microtubule growth in a test 
tube has been isolated from both insect and vertebrate cells. Two proteins, con- 
served from yeasts to humans, bind directly to the y-tubulia along with several 
other proteins that help create a ring of y-tubulin molecules. This ring can be seen 
at die minus ends of die microtubules nucleated by y-TuRC, and it is therefore 
thought to serve as a template that creates a microtubule with 13 protolllaments 
[Figure 16-29). 



Microtubules Emanate from the Centrosome in Animal Cells 

Most animal cells have a single, well-defined MTOC called the centrosome, 
located near the nucleus. From this focal point, the cytoplasmic microtubules 
emanate in a star-like, "astral" conformation. Microtubules are nucleated at the 
centrosome at their minus ends, so the plus ends point outward and grow 
toward the cell periphery. Microtubules nucleated at the centrosome continu- 
ously grow and shrink by dynamic instability, probing the entire three-dimen- 
sional volume of the cell. A centrosome is composed of a fibrous centrosome 
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Figure 16-29 Polymerization of tubulin 
nucleated by y-tubulin ring complexes. 

(A) Structure of the y-tubulin ring 
complex, reconstructed from averaging 
electron micrographs of individual 
purified complexes. (B) Model for the 
nucleation of microtubule growth by the 
y-TuRC.The red outline indicates a pair of 
proteins bound to two molecules of 
y-tubulin; this group can be isolated as a 
separate subcomplex of the larger ring. 
Note the longitudinal discontinuity 
between two protofi laments. 
Microtubules generally have one such 
"seam" breaking the otherwise uniform 
helical packing of the protofilaments. 
(C) Electron micrograph of a single 
microtubule nucleated from the purified 
y-tubulin ring complex. (A and C, from 
M. Moritz et al., Nat Cell Biol. 2:365-370, 
2000. With permission from Macmillan 
Publishers Ltd.) 
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matrix that contains more than fifty copies of y-TuRC. Most of the proteins that 
form this matrix remain to be discovered, and it is not yet known how they 
recruit and activate the y-TuRC. 

Embedded in the centrosome is a pair of somewhat mysterious cylindrical 
structures arranged at right angles to each other in an L- shaped configuration 
(Figure 16-30). These are the centrioles, which become the basal bodies of cilia 
and flagella in motile cells (described later). The centrioles organize the centro- 
some matrix (also called the pericentriolar material), ensuring its duplication 
during each cell cycle as the centrioles themselves duplicate (Figure 16-31). As 
described in Chapter 17, the centrosome duplicates and splits into two equal 
parts during interphase, each half containing a duplicated centriole pair. These 
two daughter centrosomes move to opposite sides of the nucleus when mitosis 
begins, and they form the two poles of the mitotic spindle (see Panel 17-1, pp. 
1072-1073). A centriole consists of a short cylinder of modified microtubules, 
plus a large number of accessory proteins. The molecular basis for its duplica- 
tion is not well-understood. 

In fungi and diatoms, microtubules are nucleated at an MTOC that is 
embedded in the nuclear envelope as a small plaque called the spindle pole 
body. Higher-plant cells seem to nucleate microtubules at sites distributed all 
around the nuclear envelope. Neither fungi nor most plant cells contain centri- 
oles. Despite these differences, all these cells contain y-tubulin and seem to use 
it to nucleate their microtubules. 

In animal cells, the astral configuration of microtubules is very robust, with 
dynamic plus ends pointing outward toward the cell periphery and stable minus 
ends collected near the nucleus. The system of microtubules radiating from the 
centrosome acts as a device to survey the outlying regions of the cell and to posi- 
tion the centrosome at its center, and it does this even in artificial enclosures 










Figure 16-30 The centrosome. (A) The 

centrosome is the major MTOC of animal 
cells. Located in the cytoplasm next to 
the nucleus, it consists of an amorphous 
matrix of fibrous proteins to which the 
y-tubulin ring complexes that nucleate 
microtubule growth are attached. This 
matrix is organized by a pair of centrioles, 
as described in the text. (B) A centrosome 
with attached microtubules. The minus 
end of each microtubule is embedded in 
the centrosome, having grown from a 
y-tubulin ring complex, whereas the plus 
end of each microtubule is free in the 
cytoplasm. (C) In a reconstructed image 
of the MTOC from a C. elegans cell, a 
dense thicket of microtubules can be 
seen emanating from the centrosome. 
(C, from E.T. O'Toole et al., J. Cell Biol. 
163:451-456, 2003. With permission from 
The Rockefeller University Press.) 
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Figure 16-31 A centriole in the 
centrosome. (A) An electron 
micrograph of a thin section of a 
centrosome showing an end-on 
view of the mother centriole and a 
longitudinal section of the 
daughter centriole. Numerous 
microtubules are seen nearby. 
(B) Structure of the centriole pair. 
(A,fromGJ. Mack, Y.Ou and 
J.B. Rattner, Microsc Res. Tech. 
49:409-419, 2000. With permission 
from John Wiley & Sons. B, adapted 
from D. Chretien et al., J. Struct. Biol. 
120:117-133, 1997. With 
permission from Elsevier.) 
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Some of the major accessory proteins of the microtubule cytoskeleton. Except for two classes of motor proteins, to be 
discussed in a later section, an example of each major type is shown. Each of these is discussed in the text. However, 
most cells contain more than a hundred different microtubule-binding proteins, and — as for the actin-associated 
proteins — it is likely that there are important types of microtubule-associated proteins that are not yet recognized. 
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(Figure 16-32). Even in an isolated cell fragment lacking the centrosome, 
dynamic microtubules interacting with membraneous organelles arrange them- 
selves into a star-shaped array with the microtubule minus ends clustered at the 
center, although this process may involve more components than just the sim- 
ple pushing mechanism used by the isolated centrosome (Figure 16-33). This 
ability of the microtubule cytoskeleton to find the center of the cell establishes a 
general coordinate system, which is then used to position many organelles 
within the cell. Highly differentiated cells with complex morphologies such as 
neurons, muscles, and epithelial cells must use additional measuring mecha- 
nisms to establish their more elaborate internal coordinate systems. Thus, for 
example, when an epithelial cell forms cell-cell junctions and becomes highly 
polarized, the microtubule minus ends move to a region near the apical plasma 
membrane. From this asymmetric location, an array of nearly parallel micro- 
tubules forms along the long axis of the cell, with plus ends extending as far as 
the basal surface (see Figure 16-5). 



Figure 16-32 The center-seeking 
behavior of a centrosome. (A) Small 
square wells were micromachined into a 
plastic substrate. A single centrosome 
was placed into one of these wells, along 
with tubulin subunits in solution. As the 
microtubules polymerize, nucleated by 
the centrosome, they push against the 
walls of the well. The requirement for 
equal pushing in all directions to stabilize 
the position forces the centrosome to the 
center of the well. These pictures were 
taken at three-minute intervals. (B) A self- 
centered centrosome, fixed and stained 
to show the distribution of the 
microtubules pushing on all four walls of 
the enclosure. (From T.E. Holy et al., Proc 
Natl Acad. Sci. U.S.A. 94:6228-6231, 1997. 
With permission from National Academy 
of Sciences.) 



Actin Filaments Are Often Nucleated at the Plasma Membrane 

In contrast to microtubule nucleation, which occurs primarily deep within the 
cytoplasm near the nucleus, actin filament nucleation most frequently occurs at 
or near the plasma membrane. Consequently, the highest density of actin fila- 
ments in most cells is at the cell periphery. The layer just beneath the plasma 
membrane is called the cell cortex, and the actin filaments in it determine the 
shape and movement of the cell surface. For example, depending on their 
attachments to one another and to the plasma membrane, actin structures can 
form many strikingly different types of cell surface projections. These include 
spiky bundles such as microvilli or filopodia, flat protrusive veils called lamel- 
lipodia that help move cells over solid substrates, and the phagocytic cups in 
macrophages. 

External signals frequendy regulate the nucleation of actin filaments at the 
plasma membrane, allowing the cell to change its shape and stiffness rapidly in 
response to changes in its external environment. This nucleation can be catalyzed 



centrosome 
containing centriole 
pair 




cut 
here 
with 
needle 



melanophore cell 



1+ 

new microtubule- : 

organizing 

cell fragment 
w with 

reorganized 
microtubules 



Figure 16-33 A microtubule array can 
find the center of a cell. After the arm of 
a fish pigment cell is cut off with a 
needle, the microtubules in the detached 
cell fragment reorganize so that their 
minus ends end up near the center of the 
fragment, buried in a new microtubule- 
organizing center. 
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by two different types of regulated factors, the ARP complex and the formins 
(discussed below). The first of these is a complex of proteins that includes two 
actin-related proteins, or ARPs, each of which is about 45% identical to actin. 
Analogous to the function of the y-TuRC, the ARP complex (also known as the 
Arp 2/3 complex) nucleates actin filament growth from the minus end, allowing 
rapid elongation at the plus end (Figure 16-34A and B). The complex can also 
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Figure 1 6-34 Nucleation and actin web formation by the ARP complex. (A)The structures of Arp2 and ArpB, compared to the structure of 
actin. Although the face of the molecule equivalent to the plus end (top) in both Arp2 and Arp3 is very similar to the plus end of actin rtself, 
differences on the sides and minus end (bottom) prevent these actin-related proteins from forming filaments on their own or coassembling 
into filaments with actin. (B) A model for actin filament nucleation by the ARP complex. In the absence of an activating factor, Arp2 and Arp3 
are held by their accessory proteins in an orientation that prevents them from nucleating a new actin filament When an activating factor 
indicated by the blue triangle binds the complex, Arp2 and Arp3 are brought together into a new configuration that resembles the plus end of 
an actin filament. Actin subunits can then assemble onto this structure, bypassing the rate-limiting step of filament nucleation (see Figure 
1 6-10). (C) The ARP complex nucleates filaments most efficiently when it is bound to the side of a preexisting actin filament The result is a^ 
filament branch that grows at a 70° angle relative to the original filament. Repeated rounds of branching nucleation result in a treelike web of 
actin filaments. (D) Top, electron micrographs of branched actin filaments formed by mixing purified actin subunits with purified ARP 
complexes. Bottom, reconstructed image of a branch where the crystal structures of actin and the ARP complex have been fitted to the 
electron density. The mother filament runs from top to bottom, and the daughter filament branches off to the right where the ARP complex 
binds to three actin subunits in the mother filament (D, from R.D. Mullins et al., Proc Natl Acad ScL U.S.A. 95:6181-6186, 1998 With permission 
from National Academy of Sciences, and from N. Volkmann et al., Science 293:2456-2459, 2001 . With permission from Macm.llan Publishers Ltd.) 



998 Chapter 1 6: The Cytoskeleton 




20 \ur\ 



100 \im 



Figure 16-35 Function of the ARP 
complex in plant cells. (A) Cells in the 
maize leaf epidermis form small, actin- 
rich lobes that lock neighboring cells 
together like pieces of a jigsaw puzzle. 
(B) The regular pattern of interlocking 
cells covers the leaf surface. (C) Epidermal 
cells in a mutant plant lacking the ARP 
complex do not form the interlocking 
lobes. The brick-shaped cells are normal 
in size and spacing, but form leaves that 
appear too shiny to the naked eye. (From 
M.J. Frank, H.N. Cartwright and 
L.G. Smith, Development 1 30:753-762, 
2003. With permission from the Company 
of Biologists.) 



attach to the side of another actin filament while remaining bound to the minus 
end of the filament that it has nucleated, thereby building individual filaments 
into a treelike web (Figure 16-34C and D). 

In animals, the ARP complex is associated with structures at the leading 
edge of migrating cells. The complex is localized in regions of rapid actin fila- 
ment growth such as lamellipodia, and intracellular signaling molecules and 
components at the cytosolic Face of the plasma membrane regulate its nucleat- 
ing activity. This conserved complex is also involved in actin filament nucleation 
near the plasma membrane in yeast, where it is required to form cortical actin 
patches (see Figure 16-6), and in plant cells, where it directs the formation of 
actin bundles at the surface that are required for the growth of complex cell 
shapes in a variety of different tissues (Figure 16-35). 

Both y-tubulin and ARPs are evolutionarily ancient, and they are conserved 
among a wide variety or'eucaryotic species. Their genes seem to have arisen by 
early duplication of the gene for the microtubule or actin filament subunit, 
respectively followed by divergence and specialisation of the gene copies so that 
they encode proteins with a special nucleating function. Thus, a similar strategy 
has evolved for two separate cytoskeletal systems. This underlines the central 
importance of regulated filament nucleation as a general organizing principle in 
cells. 



The Mechanism of Nucleation Influences Large-Scale Filament 
Organization 

Because the ARP complex nucleates the growth of a new actin filament most 
efficiently when it is bound to die side of an old actin filament, regulated activa- 
tion of the ARP complex in animal cells tends to lead to the assembly of gel-like 
branched actin networks. However, many of the large-scale actin structures seen 
in cells are made tip of parallel bundles of unbranched actin filaments, includ- 
ing the cleavage furrow found at the equator of dividing cells (see Figure u>-2) 
and the acrin cables that point toward the site of bud growth in yeast (see Figure 
16-6}. The formation of many of these actin bundles is induced by a different set 
of nucleating proteins, the formim } which are able to nucleate the growth of 
straight, unbranched filaments that can be cross-linked by other proteins to 
form parallel bundles. 

Formiiis are a huge family of dimeric proteins (about 15 distinct form ins are 
encoded in the mouse genome). Each formin subunit has a binding site for 
monomeric actin, and the formin dimer appears to nucleate actin filament poly- 
merization by capturing two monomers. As the newly nucleated filament grows, 
the formin dimer remains associated with the rapidly growing plus end, while 
still allowing the binding of new subunits at that end to elongate the filament 
[Figure 16-36). This is very different from the behavior of the ARP complex or 
the Y-TuRC, which remain stably bound to die minus end of the actin filament 
or microtubule and prevent both subunit addition and subunit loss at this end. 
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Proteins That Bind to the Free Subunits Modify Filament 
Elongation 

Once nucleated, cytoskeletal filaments generally elongate by the addition of sol- 
uble subunits. In most nonmuscle vertebrate cells, approximately 50% of the 
actin is in filaments and 50% is soluble, although this ratio can change rapidly in 
response to external signals. The soluble monomer concentration is typically 
50-200 jaM (2-8 mg/ml); this is surprisingly high, given the critical concentration 
of less than 1 |iM observed for pure actin in a test tube. Why does so much of the 
actin remain soluble, rather than polymerizing into filaments? The reason is that 
the subunit pool contains special proteins that bind to the actin monomers, 
thereby making polymerization much less favorable (an action similar to that of 
the drug latrunculin) . A small protein called thymosin is the most abundant of 
these proteins. Actin monomers bound to thymosin are in a locked state, where 
they cannot associate with either the plus or minus ends of actin filament and 
can neither hydrolyze nor exchange their bound nucleotide. 

How do cells recruit actin monomers from this buffered storage pool and 
use them for polymerization? It might seem as if signal transduction pathways 
such as those discussed in Chapter 15 could regulate thymosin, but this has not 
been found to be the case. Instead, recruitment depends on another monomer- 
binding protein, profilin. Profilin binds to the face of the actin monomer oppo- 
site the ATP-binding cleft, blocking the side of the monomer that would nor- 
mally associate with the filament minus end, while leaving exposed the site on 
the monomer that binds to the plus end (Figure 16-37). The profilin-actin com- 
plex can readily add onto a free plus end. This addition induces a conforma- 
tional change in the actin that reduces its affinity for profilin, so the profilin falls 
off, leaving the actin filament one subunit longer. Because profilin competes 
with thymosin in binding to individual actin monomers, the net result of a local 
activation of profilin molecules is a movement of actin subunits from the 
sequestered thymosin-bound pool onto filament plus ends. Actin filament 
growth depends even more strongly on profilin activation for those filaments 
whose plus end is associated with certain formins (the family of actin-nucleat- 
ing proteins discussed above); in these cases, actin filament elongation can 
require that the monomeric actin be bound to profilin (Figure 16-38). 

Several intracellular mechanisms regulate profilin activity, including profilin 
phosphorylation and profilin binding to inositol phospholipids. These mecha- 
nisms can define the sites where profilin acts. For example, profiliris ability to 
move sequestered actin subunits onto the growing ends of filaments is critical 
for filament assembly at the plasma membrane. Profilin is localized at the 
cytosolic face of the plasma membrane because it binds to acidic membrane 
phospholipids there. At this location, extracellular signals can activate profilin to 
produce explosive local actin polymerization and the extension of actin-rich 

Figure 16-37 Profilin bound to an actin monomer. The profilin protein 
molecule is shown in blue, and the actin in red. ATP is shown in green. 
Profilin binds to the face of actin opposite the ATP-binding cleft. This 
profilin-actin heterodimer can therefore bind to and elongate the plus end 
of an actin filament, but it is sterically prevented from binding to the minus 
end. (Courtesy of Michael Rozycki and Clarence E. Schutt.) 



Figure 16-36 Actin elongation mediated 
by formins. Formin proteins {green) form a 
dimeric complex that can nucleate the 
formation of a new actin filament {red) 
and remain associated with the rapidly- 
growing plus end as it elongates. The 
formin protein maintains its binding to 
one of the two actin subunits exposed at 
the plus end as it allows each new subunit 
to assemble. Only part of the large formin 
molecule is shown here. Other regions 
regulate its activity and link it to particular 
structures in the cell. Many formins are 
indirectly connected to the cell plasma 
membrane, and aid the insertional 
polymerization of the actin filament 
directly beneath the membrane surface. 
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Figure 1 6-38 Profilin and formins. Some members of the formin protein 
family have unstructured domains or "whiskers" that contain several binding 
sites for profilin or the profilin-actin complex. These flexible domains serve as 
a staging area for addition of actin to the growing plus end of the actin 
filament when formin is bound. Under some conditions, this can enhance the 
rate of actin filament elongation so that filament growth is faster than that 
expected for a diffusion-controlled reaction, and faster in the presence of 
formin and profilin than the rate for pure actin alone (see also Figure 3-80C). 

motile structures such as filopodia and lamellipodia (see below). Besides bind- 
ing to actin and phospholipids, profilin also binds to various other intracellular 
proteins that have domains rich in proline; these proteins can also help to local- 
ize profilin to sites that require rapid actin assembly. 

As it does with actin monomers, the cell sequesters unpolymerized tubulin 
subunits to maintain the subunit pool at a level substantially higher than the 
critical concentration. One molecule of the small protein stathmin binds to two 
tubulin heterodimers and prevents their addition onto the ends of micro- 
tubules. Stathmin thus decreases the effective concentration of tubulin subunits 
that are available for polymerization (an action analogous to that of the drug 
colchicine). Furthermore, stathmin enhances the likelihood that a growing 
microtubule will undergo the catastrophic transition to the shrinking state. 
Phosphorylation of stathmin inhibits its binding to tubulin, and signals that 
cause stathmin phosphorylation can increase the rate of microtubule elonga- 
tion and suppress dynamic instability. Cancer cells frequently overexpress stath- 
min, and the increased rate of microtubule turnover that results is thought to 
contribute to the characteristic change in cell shape associated with malignant 
transformation. 



Severing Proteins Regulate the Length and Kinetic Behavior of 
Actin Filaments and Microtubules 

In some situations, a cell may break an existing long filament into many smaller 
filaments. This generates a large number of new filament ends: one long fila- 
ment with just one plus end and one minus end might be broken into dozens of 
short filaments, each with its own minus end and plus end. Under some intra- 
cellular conditions, these newly formed ends nucleate filament elongation, and 
in this case severing accelerates the assembly of new filament structures. Under 
other conditions, severing promotes the depolymerization of old filaments, 
speeding up the depolymerization rate by tenfold or more. In addition, severing 
filaments changes the physical and mechanical properties of the cytoplasm: 
stiff, large bundles and gels become more fluid when the filaments are severed. 
To sever a microtubule, thirteen longitudinal bonds must be broken, one for 
each protofilament. The protein katanin, named after the Japanese word for 
"sword," accomplishes this demanding task (Figure 16-39). Katanin is made up 
of two subunits, a smaller subunit that hydrolyzes ATP and performs the actual 
severing, and a larger one that directs katanin to the centrosome. Katanin 
releases microtubules from their attachment to a microtubule organizing center, 
and it is thought to have an important role in the rapid microtubule depolymer- 
ization observed at the poles of spindles during meiosis and mitosis. It may also 
be involved in microtubule release and depolymerization in proliferating cells in 
interphase and in postmitotic cells such as neurons. 

In contrast to microtubule severing by katanin, which requires ATI> the sev- 
ering of actin filaments does not require an extra energy input. Most actin-sev- 
ering proteins are members of the gelsolin superfamily, whose severing activity 
is activated by high levels of cytosolic Ca 2+ . Gelsolin has subdomains that bind 
to two different sites on the actin subunit, one exposed on the surface of the fil- 
ament and one that is normally hidden in the longitudinal bond to the next sub- 
unit in the protofilament. According to one model for gelsolin severing, gelsolin 
binds on the side of an actin filament and waits until a thermal fluctuation hap- 
pens to create a small gap between neighboring subunits in the protofilament; 
gelsolin then insinuates its subdomain into the gap, breaking the filament. 
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Proteins That Bind Along the Sides of Filaments Can Either 
Stabilize or Destabilize Them 

Once a cytoskeletal filament is formed by nucleation and elongated from the 
subunit pool, a set of proteins that bind along the sides of the polymer may alter 
the filament's stability and mechanical properties. Different filament-associated 
proteins use their binding energy to either lower or raise the free energy of the 
polymer state, and they thereby either stabilize or destabilize the polymer, 
respectively. 

Proteins that bind along the sides of microtubules are collectively called 
microtubule-associated proteins, or MAPs. Like the drug taxol, MAPs can sta- 
bilize microtubules against disassembly. A subset of MAPs can also mediate the 
interaction of microtubules with other cell components. This subset is promi- 
nent in neurons, where stabilized microtubule bundles form the core of the 
axons and dendrites that extend from the cell body (Figure 16-40). These MAPs 
have at least one domain that binds to the microtubule surface and another that 
projects outward. The length of the projecting domain can determine how 
closely MAP-coated microtubules pack together, as demonstrated in cells engi- 
neered to overproduce different MAPs. Cells overexpressing MAP2, which has a 
long projecting domain, form bundles of stable microtubules that are kept 
widely spaced, while cells overexpressing tail, a MAP with a much shorter pro- 
jecting domain, form bundles of more closely packed microtubules (Figure 
16-41). Tau binding to filaments can also regulate the transport of membrane- 
enclosed organelles driven by molecular motors, which we will discuss later. 

MAPs are the targets of several protein kinases, and the resulting phospho- 
rylation of a MAP can have a primary role in controlling both its activity and 
localization inside cells. Among the important protein kinases that can regulate 
MAPs are those that are turned on and off as cells progress through the cell cycle 
(discussed in Chapter 17). In particular, MAP activities regulate the changes in 
microtubule dynamics that occur as the cell rearranges its microtubule 
cytoskeleton to form the mitotic spindle in preparation for chromosome segre- 
gation (see Figure 16-2). 

In addition to binding along the sides of microtubules, tau protein forms its 
own helical filaments when present at sufficiently high concentrations. The 
nerve cell cytoplasm in the brains of people with Alzheimer's disease contains 
large aggregates of tau filaments, called neurofibrillary tangles. It is not yet clear 
whether these tangles of tau are a cause or a consequence of the neurodegener- 
ation associated with this disease. 

The binding of accessory proteins along their sides also affects actin fila- 
ments. Selected actin filaments in most cells are stabilized by the binding of 
tropomyosin, an elongated protein that binds simultaneously to seven adjacent 
actin subunits in one protofilament. The binding of tropomyosin along an actin 
filament can prevent the filament from interacting with other proteins; for this 
reason, the regulation of tropomyosin binding is an important step in muscle 
contraction, as we discuss later (see Figure 16-78). 

Another important actin-filament binding protein present in all eucaryotic 
cells is cofilin, which destabilizes actin filaments. Also called actin depolymeriz- 
ing factor, cofilin is unusual in that it binds to actin in both the filament and free 
subunit forms. Cofilin binds along the length of the actin filament, forcing the 
filament to twist a little more tightly (Figure 16-42). This mechanical stress 



Figure 16-40 Localization of MAPs in axon and dendrites of a neuron. 

This immunofluorescence micrograph shows the distribution of tau 
staining (green) and MAP2 staining (orange) in a hippocampal neuron in 
culture. Whereas tau staining is confined to the axon (long and branched in 
this neuron), MAP2 staining is confined to the cell body and its dendrites. 
The antibody used here to detect tau binds only to unphosphorylated tau; 
phosphorylated tau is also present in dendrites. (Courtesy of James W. 
Mandell and Gary A. Banker.) 
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Figure 16-39 Microtubule severing by 
katanin. Taxol-stabilized, rhodamine- 
labeled microtubules were adsorbed on 
the surface of a glass slide, and purified 
katanin was added along with ATP. 
(A) There are a few breaks in the 
microtubules 30 seconds after the 
addition of katanin. (B) The same field 
3 minutes after the addition of katanin. 
The filaments have been severed in many 
places, leaving a series of small fragments 
at the previous locations of the long 
microtubules. (From J.J. Hartman et al., 
Ceil 93:277-287, 1998. With permission 
from Elsevier.) 
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weakens the contacts between actin subunits in the filament, making the fila- 
ment brittle and more easily severed by thermal motions. In addition, it makes 
it much easier for an ADP-actin subunit to dissociate from the minus end of the 
filament. These activities greatly accelerate actin filament disassembly. As a 
result, most of the actin filaments inside cells are much shorter-lived than are fil- 
aments formed from pure actin in a test tube. Actin filaments can be protected 
from cofilin by tropomyosin binding. 

Cofilin binds preferentially to ADP-containing actin filaments rather than to 
ATP- containing filaments. Since ATP hydrolysis is usually slower than filament 
assembly the newest actin filaments in the cell still contain mostly ATP and are 
resistant to depolymerization by cofilin. Cofilin therefore efficiently dismantles 
the older filaments in the cell, ensuring that all actin filaments turn over rapidly. 
As we will discuss later, the cofilin-mediated disassembly of old but not new 
actin filaments is critical for the polarized, directed growth of the actin network 
responsible for unidirectional cell crawling. 



Figure 16-41 Organization of 
microtubule bundles by MAPs. 

(A) MAP2 binds along the microtubule 
lattice at one of its ends and extends a 
long projecting arm with a second 
microtubule-binding domain at the other 
end. (B) Tau binds to the microtubule 
lattice at both its N- and C-termini, with a 
short projecting loop. (C) Electron 
micrograph showing a cross section 
through a microtubule bundle in a cell 
overexpressing MAP2.The regular 
spacing of the microtubules (MTs) in this 
bundle result from the constant length of 
the projecting arms of the MAP2. 
(D) Similar cross section through a 
microtubule bundle in a cell 
overexpressing tau. Here the 
microtubules are spaced more closely 
together than they are in (C) because of 
tau's relatively short projecting arm. 
(C and D, courtesy of V. Chen et al., Nature 
360:674-647, 1992. With permission from 
Macmillan Publishers Ltd.) 



Proteins That Interact with Filament Ends Can Dramatically 
Change Filament Dynamics 

As we have just seen, proteins that bind along the side of a filament can change 
the filament's dynamic behavior. For maximum effect, however, these proteins 
often need to coat the filament completely, and this means they have to be pre- 
sent at fairly high stoichiometries (for example, about one tropomyosin for every 

(A) actin filament 




74 nm 



(B) actin filament + cofilin 




57 nm 



Figure 1 6-42 Twisting of an actin 
filament induced by cofilin. 

(A) Three-dimensional reconstruction 
from cryo-electron micrographs of 
filaments made of pure actin. The bracket 
shows the span of two twists of the actin 
helix. (B) Reconstruction of an actin 
filament coated with cofilin, which binds 
in a 1:1 stoichiometry to actin subunits all 
along the filament. Cofilin is a small 
protein (14 kilodaltons) compared to 
actin (43 kilodaltons), and so the filament 
appears only slightly thicker. The energy 
of cofilin binding serves to deform the 
actin filament lattice, twisting it more 
tightly and reducing the distance 
spanned by each twist of the helix. (From 
A. McGough et al., 1 Cell Biol. 
138:771-781, 1997. With permission from 
The Rockefeller University Press.) 
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seven actin subunits, one tau for every four tubulin subunits, or one cofilin for 
every actin subunit). In contrast, proteins that bind preferentially to the ends of 
filaments can have dramatic effects on filament dynamics even when they are 
present at very low levels. Since subunit addition and loss occur primarily at fil- 
ament ends, one molecule of such a protein per actin filament (typically one per 
about 200-500 actin monomers) can be enough to transform the architecture of 
an actin filament network. 

As previously discussed, an actin filament that ceases elongation and is not 
specifically stabilized by the cell can depolymerize rapidly: it can lose subunits 
from either its plus or its minus end, once the actin molecules at that end have 
hydrolyzed their ATP to convert to the D form. The most rapid changes, however, 
occur at the plus end. The binding of a plus end capping protein stabilizes an 
actin filament at its plus end, which greatly slows the rates of both filament 
growth and filament depolymerization by making the plus end inactive (Figure 
16-43). Indeed, most of the actin filaments in a cell are capped at their plus end 
by proteins such as CapZ (named for its location in the muscle Z band, see 
below; it is also called Capping Protein). At the minus end, an actin filament may 
be capped by remaining bound to the ARP complex that was responsible for its 
nucleation, although it is possible that many of the actin filament minus ends in 
typical cells are released from the ARP complex and are uncapped. 

In muscle cells, where actin filaments are exceptionally long-lived, the fila- 
ments are specially capped at both ends — by CapZ at the plus end and by tropo- 
modulin at the minus end. Tropomodulin binds only to the minus end of actin 
filaments that have been coated by tropomyosin and have thereby already been 
somewhat stabilized. 



Figure 16-43 Filament capping and its 
effects on filament dynamics. 

A population of uncapped filaments adds 
and loses subunits at both the plus and 
minus ends, resulting in rapid growth or 
shrinkage, depending on the 
concentration of available free 
monomers (green line). In the presence of 
a protein that caps the plus end (red line), 
only the minus end is able to add or lose 
subunits; consequently, filament growth 
will be slower at all monomer 
concentrations above the critical 
concentration, and filament shrinkage 
will be slower at all monomer 
concentrations below the critical 
concentration. In addition, the critical 
concentration for the population shifts to 
that of the filament minus end. 



Different Kinds of Proteins Alter the Properties of Rapidly 
Growing Microtubule Ends 

The end of a microtubule, with thirteen protofilaments in a hollow ring (see Fig- 
ure 16-11), is a much larger and more complex structure than the end of an actin 
filament, with many more possibilities for accessory protein action. We have 
already discussed an important microtubule capper: the y-tubulin ring complex 
(y-TuRC), which both nucleates the growth of microtubules at an organizing 
center and caps their minus ends. Another true capping protein for micro- 
tubules is the special protein complex found at the ends of the microtubules in 
cilia (discussed later), where microtubules are both stable and uniform in 
length. 

Some proteins that act at the ends of microtubules have crucial roles beyond 
those expected for a simple capping protein. In particular, they can have dra- 
matic effects on the dynamic instability of microtubules (see Figure 16-16). They 
can influence the rate at which a microtubule switches from a growing to a 
shrinking state (the frequency of catastrophes) or from a shrinking to a growing 
state (the frequency of rescues). For example, a family of kinesin-related pro- 
teins known as catastrophe factors significantly increases the catastrophe rate 
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Figure 16-44 The effects of proteins 
that bind to microtubule ends. The 

transition between microtubule growth 
and microtubule shrinking is controlled 
in cells by special proteins. A MAP such as 
XMAP21 5 stabilizes the end of a growing 
microtubule by its preferential binding 
there. Opposing its action are 
catastrophe factors such as kinesin-1 3, a 
member of the kinesin motor protein 
superfamily (discussed later). 



(these proteins are members of the kinesin- 13 family; see Figure 16-58). They 
bind specifically to microtubule ends and seem to pry protofilaments apart, 
lowering the normal activation energy barrier that prevents a microtubule from 
springing apart into the curved protofilament characteristic of the shrinking 
state (see Figure 16-16C). Opposing their actions are MAPs such as the ubiqui- 
tous XMAP215 that has close homologs in organisms that range from yeast to 
humans (XMAP stands for Xenopus microtubule-associated protein, and the 
number refers to its molecular mass in kilodaltons). This protein has a special 
ability to stabilize free microtubule ends and inhibit their switch from a growing 
to a shrinking state. The phosphorylation of XMAP215 during mitosis inhibits its 
activity and shifts the balance of its competition with catastrophe factors (Fig- 
ure 16-44). The shift results in a tenfold increase in the dynamic instability of 
microtubules observed during mitosis, a transition that is critical for the effi- 
cient construction of the mitotic spindle (see Figure 17-33). 

In many cells, the minus ends of microtubules are stabilized by association 
with the centrosome, or else serve as microtubule depolymerization sites. The 
plus ends, in contrast, efficiently explore and probe the entire cell space. Micro- 
tubule-associated proteins called plus-end tracking proteins (+TIPs) accumulate 
at these active ends, and appear to rocket around the cell as passengers at the 
ends of rapidly growing microtubules, dissociating from the ends when the 
microtubules begin to shrink (Figure 16-45). 




Figure 16-45 +TIP proteins found 
at the growing plus ends of 
microtubules. <TAAT> (A) In an 
epithelial cell grown in tissue 
culture, each microtubule {green) 
has a growing plus end which is 
associated with the +TIP protein 
EB1 {red). (B) In the rod-shaped 
fission yeast Schizosoccharomyces 
pombe, the plus ends of the 
microtubules {green) are associated 
with the homolog of EB1 {red) at 
the two poles of the rod-shaped 
cells. (A, from A. Akhmanova and 
C.C. Hoogenraad, Curr. Opin. Cell 
Biol. 17:47-54, 2005. With 
permission from Elsevier; 
B, courtesy of Ken Sawin.) 
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Some of the +TIPs, such as the kinesin-related catastrophe factors and 
XMAP215 mentioned above, modulate the growth and shrinkage of the micro- 
tubule end to which they are attached. Others control microtubule positioning 
by helping to capture and stabilize the growing microtubule end at the location 
of specific target proteins in the cell cortex. EB1, a +TIP present in both yeasts 
and humans, for example, is essential for yeast mitotic spindle positioning, 
directing the growing plus ends of yeast spindle microtubules to a specific dock- 
ing region in the yeast bud and then helping to anchor them there. 

Filaments Are Organized into Higher-Order Structures in Cells 

So far, we have described how cells use accessory proteins to regulate the loca- 
tion and dynamic behavior of cytoskeletal filaments. These proteins can nucle- 
ate filament assembly, bind to the ends or sides of the filaments, or bind to the 
free subunits of filaments. But in order for the cytoskeletal filaments to form a 
useful intracellular scaffold that gives the cell mechanical integrity and deter- 
mines its shape, the individual filaments must be organized and attached to one 
another in larger-scale structures. The centrosome is one example of such a 
cytoskeletal organizer; in addition to nucleating the growth of microtubules, it 
holds them together in a defined geometry, with all of the minus ends buried in 
the centrosome and the plus ends pointing outward. In this way, the centrosome 
creates the astral array of microtubules that is able to find the center of each cell 
(see Figure 16-32). 

Another mechanism that cells use to organize filaments into large structures 
is filament cross-linking. As described earlier, some MAPs can bundle micro- 
tubules together: they have two domains — one that binds along the microtubule 
side (and thereby stabilizes the filament) and another that projects outward to 
contact other MAP-coated microtubules. In the actin cytoskeleton, the stabiliz- 
ing and cross-linking functions are separated. Tropomyosin binds along the 
sides of actin filaments, but it does not have an outward projecting domain. As 
we shall see shortly filament cross-linking is instead mediated by a second 
group of actin-binding proteins that have only this function. Intermediate fila- 
ments are different yet again; they are organized both by a lateral self-associa- 
tion of the filaments themselves and by the cross-linking activity of accessory 
proteins, as we describe next. 

Intermediate Filaments Are Cross-Linked and Bundled Into 
Strong Arrays 

Each individual intermediate filament forms as a long bundle of tetrameric sub- 
units (see Figure 16-19). Many intermediate filaments further bundle them- 
selves by self-association; for example, the neurofilament proteins NF-M and 
NF-H (see Table 16-1, p. 985) contain a C-terminal domain that extends outward 
from the surface of the assembled intermediate filament and binds to a neigh- 
boring filament. Thus groups of neurofilaments form robust parallel arrays that 
are held together by multiple lateral contacts, giving strength and stability to the 
long cell processes of neurons (see Figure 16-22). 

Other types of intermediate filament bundles are held together by accessory 
proteins, such as filaggrin, which bundles keratin filaments in differentiating 
cells of the epidermis to give the outermost layers of the skin their special tough- 
ness. Plectin is a particularly interesting cross-linking protein. Besides bundling 
intermediate filaments, it also links the intermediate filaments to microtubules, 
actin filament bundles, and filaments of the motor protein myosin II (discussed 
below), as well as helping to attach intermediate filament bundles to adhesive 
structures at the plasma membrane (Figure 16-46). 

Mutations in the gene for plectin cause a devastating human disease that 
combines epidermolysis bullosa (caused by disruption of skin keratin fila- 
ments), muscular dystrophy (caused by disruption of desmin filaments), and 
neurodegeneration (caused by disruption of neurofilaments). Mice lacking a 
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Figure 16-46 Plectin cross-linking of 
diverse cytoskeletal elements. Plectin 
(green) is seen here making cross-links 
from intermediate filaments (blue) to 
microtubules (red). In this electron 
micrograph, the dots (yellow) are gold 
particles linked to anti-plectin antibodies. 
The entire actin filament network was 
removed to reveal these proteins. (From 
T.M. Svitkina and G.G.Borisy, J. Cell Biol. 
135:991-1007, 1996. With permission 
from The Rockefeller University Press.) 



functional plectin gene die within a few days of birth, with blistered skin and 
abnormal skeletal and heart muscles. Thus, although plectin may not be neces- 
sary for the initial formation and assembly of intermediate filaments, its cross- 
linking action is required to provide cells with the strength they need to with- 
stand the mechanical stresses inherent to vertebrate life. 



Cross-linking Proteins with Distinct Properties Organize Different 
Assemblies of Actin Filaments 

Actin filaments in animal cells are organized into two types of arrays: bundles 
and weblike (gel-like) networks (Figure 16-47). As described earlier, these dif- 
ferent structures are initiated by the action of distinct nucleating proteins: the 
long straight filaments produced by formins make bundles and the ARP com- 
plex makes webs. The actin filament cross-linking proteins that help to stabilize 
and maintain these distinct structures are divided into two classes: bundling 
proteins and gel-forming proteins. Bundling proteins cross-link actin filaments 
into a parallel array while gel-forming proteins hold two actin filaments 
together at a large angle to each other, thereby creating a looser meshwork. Both 
types of cross-linking protein generally have two similar actin-filament-binding 
sites, which can either be part of a single polypeptide chain or contributed by 
each of two polypeptide chains held together in a dinier [Figure 16-48 j. the 
spacing and arrangement of these two filament-binding domains determines 
the type of actin structure that a given cross-linking protein forms. 

Each type of bundling protein also determines which other molecules can 
interact with an actin filament. Myosin II (discussed later) is the motor protein 
in stress fibers and other contractile arrays that enables them to contract. The 
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Figure 16-47 Actin arrays in a cell. 

A fibroblast crawling in a tissue culture dish 
is shown with three areas enlarged to show 
the arrangement of actin filaments. The 
actin filaments are shown in red, with 
arrowheads pointing toward the minus 
end. Stress fibers are contractile and exert 
tension. Filopodia are spike-like projections 
of the plasma membrane that allow a cell 
to explore its environment. The cortex 
underlies the plasma membrane. 
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very close packing of actin filaments caused by the small monomeric bundling 
protein fimbrin apparently excludes myosin, and thus the parallel actin fila- 
ments held together by fimbrin are not contractile; on the other hand, the looser 
packing caused by the larger dimeric bundling protein a-actinin allows myosin 
molecules to enter, making stress fibers contractile (Figure 16-49). Because of 
the very different spacing between the actin filaments, bundling by fimbrin 
automatically discourages bundling by a-actinin, and vice-versa, so that the two 
types of bundling protein are themselves mutually exclusive. 

Villin is another bundling protein that, like fimbrin, has two actin-filament- 
binding sites very close together in a single polypeptide chain. Villin (together 
with fimbrin) helps cross-link the 20 to 30 tightly bundled actin filaments found 
in microvilli, the finger-like extensions of the plasma membrane on the surface 
of many epithelial cells (Figure 16-50). A single absorptive epithelial cell in the 
human small intestine, for example, has several thousand microvilli on its apical 
surface. Each is about 0.08 (im wide and 1 |im long, making the cell's absorptive 
surface area about 20 times greater than it would be without microvilli. When 
villin is introduced into cultured fibroblasts, which do not normally contain 
villin and have only a few small microvilli, the existing microvilli become greatly 
elongated and stabilized, and new ones are induced. The actin filament core of 
the microvillus is attached to the plasma membrane along its sides by lateral 
sidearms made of myosin I (discussed later), which has a binding site for fila- 
mentous actin on one end and a domain that binds lipids on the other end. 
These two types of cross-linkers, one binding actin filaments to each other and 
the other binding these filaments to the membrane, seem to be sufficient to 
form microvilli on cells. Interestingly, when the gene for villin is disrupted in a 
mouse, the intestinal microvilli form with apparently normal morphology, indi- 
cating that other bundling proteins provide sufficient redundant function for 
this purpose. However, the remodeling of intestinal microvilli in response to cer- 
tain kinds of stress or starvation is impaired. 



Figure 16-48 The modular structures of 
four actin-cross-linking proteins. Each of 
the proteins shown has two actin- 
binding sites (red) that are related in 
sequence. Fimbrin has two directly 
adjacent actin-binding sites, so that it 
holds its two actin filaments very close 
together (14 nm apart), aligned with the 
same polarity (see Figure 16-49A).The 
two actin-binding sites in a-actinin are 
separated by a spacer around 30 nm 
long, so that it forms more loosely 
packed actin bundles (see Figure 
16-49A). Filamin has two actin-binding 
sites with a V-shaped linkage between 
them, so that it cross-links actin filaments 
into a network with the filaments 
oriented almost at right angles to one 
another (see Figure 16-51). Spectrin is a 
tetramer of two a and two (3 subunits, 
and the tetramer has two actin-binding 
sites spaced about 200-nm apart (see 
Figure 10-41). 



Figure 1 6-49 The formation of two 
types of actin filament bundles. 

(A) a-actinin, which is a homodimer, 
cross-links actin filaments into loose 
bundles, which allow the motor protein 
myosin II (not shown) to participate in 
the assembly. Fimbrin cross-links actin 
filaments into tight bundles, which 
exclude myosin. Fimbrin and a-actinin 
tend to exclude one another because of 
the very different spacing of the actin 
filament bundles that they form. 

(B) Electron micrograph of purified 
a-actinin molecules. (B, courtesy of 
John Heuser.) 
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Figure 1 6-50 A microvillus. (A) A bundle of parallel actin filaments cross-linked by the actin-bundling proteins villin and 
fimbrin forms the core of a microvillus. Lateral sidearms (composed of myosin I and the Ca 2+ -binding protein calmodulin) 
connect the sides of the actin filament bundle to the overlying plasma membrane. All the plus ends of the actin filaments 
are at the tip of the microvillus, where they are embedded in an amorphous, densely staining substance of unknown 
composition. (B) Freeze-fracture electron micrograph of the apical surface of an intestinal epithelial cell, showing microvilli. 
Actin bundles from the microvilli extend down into the cell and are rooted in the terminal web, where they are linked 
together by a complex set of proteins that includes spectrin and myosin II. Below the terminal web is a layer of 
intermediate filaments. (C) Thin section electron micrograph of microvilli. (B, courtesy of John Heuser; C, from 
P.T. Matsudaira and D.R. Burgess, Cold Spring Harb. Symp. Quant Biol. 46:845-854, 1 985. With permission from Cold Spring 
Harbor Laboratory Press.) 

Filamin and Spectrin Form Actin Filament Webs 

The various bundling proteins that we have discussed so far have straight, stiff 
connections between their two actin-filament-binding domains, and they tend 
to align filaments in parallel bundles. In contrast, those actin cross-linking pro- 
teins that have either a flexible or a stiff, bent connection between their two 
binding domains form actin filament webs or gels, rather than actin bundles. 

Any cross-linking protein that has its two actin-binding domains joined by a 
long bent linkage can form three-dimensional actin gels. Filamin (see Figure 
16-48) promotes the formation of a loose and highly viscous gel by clamping 
together two actin filaments roughly at right angles (Figure 16-51). Cells require 
the actin gels formed by filamin in order to extend the thin sheet-like membrane 
projections called lamellipodia that help them to crawl across solid surfaces. Fil- 
amin is lacking in some types of cancer cells, especially some malignant 
melanomas (pigment-cell cancers). These cells cannot crawl properly and 
instead they protrude disorganized membrane blebs (Figure lfr-52). Losing Fil- 
amin is bad for the melanoma cells but good for die melanoma patiem; because 
of i he cells" inability to crawl melanoma cells that have lost filamin expression 
are less invasive than similar melanoma cells that still express filamin, and, as a 
result, the cancer is much less likely to metastasize. 

A very different well-studied web-forming protein is spectrin, which was first 
identified in red blood cells. Spectrin is a long, flexible protein made out of four 
elongated polypeptide chains (two a subunits and two (3 subunits), arranged so 
that the two actin-filament-binding sites are about 200 nm apart (compared 
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Figure 1 6-51 Filamin cross-links actin 
filaments into a three-dimensional 
network with the physical properties of 
a gel. (A) Each filamin homodimer is 
about 1 60 nm long when fully extended 
and forms a flexible, high-angle link 
between two adjacent actin filaments. 
(B) A set of actin filaments cross-linked by 
filamin forms a mechanically strong web 
or gel. 
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with 14 nm for fimbrin and about 30 nm for a-actinin, see Figure 16-48). In the 
red blood cell, spectrin is concentrated just beneath the plasma membrane, 
where it forms a two-dimensional web held together by short actin filaments; 
spectrin links this web to the plasma membrane because it has separate binding 
sites for peripheral membrane proteins, which are themselves positioned near 
the lipid bilayer by integral membrane proteins (see Figure 10-41) . The resulting 
network creates a stiff cell cortex that provides mechanical support for the over- 
lying plasma membrane, allowing the red blood cell to spring back to its original 
shape after squeezing through a capillary. Close relatives of spectrin are found in 
the cortex of most other vertebrate cell types, where they also help to shape and 
stiffen the surface membrane. 



Cytoskeletal Elements Make Many Attachments to Membranes 

Actin cytoskeletal structures both stiffen and change the shape of the plasma 
membrane. We have already discussed two examples: the spectrin-actin web 
that underlies the plasma membranes and the villin-actin bundles in microvilli 
that enlarge the absorptive surface area of epithelial cells. The effectiveness of 
these structures requires specific attachments between the actin filament net- 
work and proteins or lipids of the plasma membrane. 

The connections of the cortical actin cytoskeleton to the plasma membrane 
are only partially understood. A widespread family of closely related intracellu- 
lar proteins, the ERM family (named for its first three members, ezrin, radixin, 
and moesin), contains members that are required for the maintenance of cell 
polarity and involved in exocytosis and endocytosis. The C-terminal domain of 
an ERM protein binds directly to the sides of actin filaments. The N-terminal 
domain binds to the cytoplasmic face of one or more transmembrane glycopro- 
teins, such as CD44, the receptor for the extracellular matrix component 
hyaluronan. 

The attachments between actin and the plasma membrane mediated by 
ERM proteins are regulated by both intracellular and extracellular signals. ERM 
proteins can exist in two conformations, an active extended conformation that 
oligomerizes and binds to both actin and a transmembrane protein, and an 
inactive folded conformation, in which the N- and C-termini are held together 
by an intramolecular interaction. Switching to the active conformation can be 

Figure 1 6-52 Loss of filamin causes abnormal cell motility. (A) A group of 
melanoma cells that have an abnormally low level of filamin. These cells are 
not able to make normal lamellipodia and instead are covered with 
membrane "blebs." As a result, they crawl poorly and tend not to 
metastasize. (B) The same melanoma cells in which filamin expression has 
been artificially restored. The cells now make normal lamellipodia and are 
highly metastatic. This example is one of many demonstrating the profound 
effect that the presence or absence of a single structural protein can have 
on cell morphology and motility. (From C. Cunningham et al., J. Cell Biol. 
1 36:845-857, 1 997. With permission The Rockefeller University Press.) 
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triggered by protein phosphorylation or by binding to PIP 2 , either of which can 
occur, for example, in response to extracellular signals (the ERM proteins are 
direct targets of several receptor tyrosine kinases). In this way the properties of 
the cell cortex become sensitive to a variety of signals received by the cell (Fig- 
ure 16-53). 

The loss of one of the members of the ERM family, called merlin, results in a 
form of the human genetic disease called neurofibromatosis, in which multiple 
benign tumors develop in the auditory nerves and certain other parts of the ner- 
vous system. This is one of many indications of a feedback system that connects 
cell structural elements to the control of cell growth (see Chapter 17). 

The proteins, discussed in this section, that control the assembly and posi- 
tion of actin filaments and microtubules are reviewed in Panel 16-3 (pp. 
994-995). Some of these proteins have the additional function of helping to con- 
nect the internal structure of a cell to other cells or to an extracellular basement 
membrane. Both actin filaments and intermediate filaments are critical for 
these connections, which require the specialized cell-cell junctions and cell- 
matrix junctions that we will discuss in Chapter 19. 



Summary 

The varied forms and functions of cytoskeletal filament structures in eucaryotic cells 
depend on a versatile repertoire of accessory proteins. Each of the three major filament 
classes (microtubules, intermediate filaments, and actin filaments) has a large dedi- 
cated subset of such accessory proteins. 

A primary determinant of the sites of cytoskeletal structures is the regulation of the 
processes that initiate the nucleation of new filaments. In most animal cells, micro- 
tubules are nucleated at the centrosome, a complex assembly located near the center of 
the cell. In contrast, most actin filaments are nucleated near the plasma membrane. 

The kinetics of filament assembly and disassembly can be either slowed or acceler- 
ated by accessory proteins that bind to either the free subunits or the filaments them- 
selves. Some of these proteins alter filament dynamics by binding to the ends of fila- 
ments or by severing the filaments into smaller fragments. Another class of accessory 
proteins assembles the filaments into larger ordered structures by cross-linking them to 
one another in geometrically defined ways. Yet other accessory proteins determine the 
shape and adhesive properties of cells by attaching filaments to the plasma membrane. 



Figure 1 6-53 The role of ERM-family 
proteins in attaching actin filaments to 
the plasma membrane. Regulated 
unfolding of an ERM-family protein, 
caused by phosphorylation or by binding 
to PIP 2 , exposes two binding sites, one 
for an actin filament and one for a 
transmembrane protein. Activation of 
ERM-family proteins can thereby 
generate and stabilize cell-surface 
protrusions that form in response to 
extracellular signals. 



MOLECULAR MOTORS 

Among the most fascinating proteins that associate with the cytoskeleton are the 
molecular motors called motor proteins. These remarkable proteins bind to a 
polarized cytoskeletal filament and use the energy derived from repeated cycles of 
ATP hydrolysis to move steadily along it. Dozens of different motor proteins coex- 
ist in every eucaryotic cell. They differ in the type of filament they bind to (either 
actin or microtubules), the direction in which they move along the filament, and 
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the "cargo" they carry. Many motor proteins carry membrane-enclosed 
organelles — such as mitochondria, Golgi stacks, or secretory vesicles — to their 
appropriate locations in the cell. Other motor proteins cause cytoskeletal fila- 
ments to exert tension or to slide against each other, generating the force that 
drives such phenomena as muscle contraction, ciliary beating, and cell division. 

Cytoskeletal motor proteins that move unidirectionally along an oriented 
polymer track are reminiscent of some other proteins and protein complexes 
discussed elsewhere in this book, such as DNA and RNA polymerases, helicases, 
and ribosomes. All of these have the ability to use chemical energy to propel 
themselves along a linear track, with the direction of sliding dependent on the 
structural polarity of the track. All of them generate motion by coupling nucleo- 
side triphosphate hydrolysis to a large-scale conformational change in a protein, 
as explained in Chapter 3 (see Figure 3-77). 

The cytoskeletal motor proteins associate with their filament tracks through 
a "head" region, or motor domain, that binds and hydrolyzes ATR Driven by 
cycles of nucleotide hydrolysis that produce conformational changes, the pro- 
teins cycle between states in which they are bound strongly to their filament 
tracks and states in which they are unbound. Through a mechanochemical cycle 
of filament binding, conformational change, filament release, conformational 
relaxation, and filament rebinding, the motor protein and its associated cargo 
move one step at a time along the filament (typically a distance of a few 
nanometers). The motor domain (head) determines the identity of the track and 
the direction of movement along it, whereas the tail of the motor protein deter- 
mines the identity of the cargo (and therefore the biological function of the indi- 
vidual motor protein). 

In this section, we begin by describing the three groups of cytoskeletal motor 
proteins: myosins, kinesins, and dyneins. We then describe how they can work to 
transport membrane-enclosed organelles and mRNAs or to change the shape of 
structures built from cytoskeletal filaments. In the final section of this chapter, we 
will examine how a collaboration between motor proteins and the dynamic 
cytoskeletal filaments described previously generates complex cell behaviors. 



Actin-Based Motor Proteins Are Members of the Myosin 
Superfamily 

The first motor protein identified was skeletal muscle myosin, which generates 
the force for muscle contraction. This myosin, called myosin II (see below) is an 
elongated protein that is formed from two heavy chains and two copies of each 
of two light chains. Each heavy chain has a globular head domain at its N-termi- 
nus that contains the force-generating machinery, followed by a very long amino 
acid sequence that forms an extended coiled-coil that mediates heavy chain 
dimerization (Figure 16-54). The two light chains bind close to the N-terminal 
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Figure 16-54 Myosin II. (A) A myosin II molecule is composed of two heavy 
chains (each about 2000 amino acids long (green) and four light chains 
(blue). The light chains are of two distinct types, and one copy of each type 
is present on each myosin head. Dimerization occurs when the two a 
helices of the heavy chains wrap around each other to form a coiled-coil, 
driven by the association of regularly spaced hydrophobic amino acids (see 
Figure 3-9). The coiled-coil arrangement makes an extended rod in 
solution, and this part of the molecule is called the tail. (B) The two 
globular heads and the tail can be clearly seen in electron micrographs of 
myosin molecules shadowed with platinum. (B, courtesy of David Shotton.) 
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Figure 16-55 The myosin II bipolar thick filament in muscle. (A) Electron micrograph of a myosin II thick filament isolated 
from frog muscle. Note the central bare zone, which is free of head domains. (B) Schematic diagram, not drawn to scale The 
myosin II molecules aggregate by means of their tail regions, with their heads projecting to the outside of the filament. The 
bare zone in the center of the filament consists entirely of myosin II tails. (C) A small section of a myosin II filament as 
reconstructed from electron micrographs. An individual myosin molecule is highlighted in green. The cytoplasmic myosin II 
filaments in non-muscle cells are much smaller, although similarly organized (see Figure 1 6-72). (A, courtesy of Murray Stewart; 
C, based on R.A. Crowther, R. Padron and R. Craig, J. Moi. Biol. 1 84:429-439, 1 985. With permission from Academic Press.) 

head domain, while the long coiled-coil tail bundles itself with the tails of other 
myosin molecules. These tail-tail interactions form large bipolar "thick fila- 
ments" that have several hundred myosin heads, oriented in opposite directions 
at the two ends of the thick filament (Figure 16-55). 

Each myosin head binds and hydrolyzes ATP, using the energy of ATP 
hydrolysis to walk toward the plus end of an actin filament. The opposing orien- 
tation of the heads in the thick filament makes the filament efficient at sliding 
pairs of oppositely oriented actin filaments past each other. In skeletal muscle, 
in which carefully arranged actin filaments are aligned in "thin filament" arrays 
surrounding the myosin thick filaments, the ATP-driven sliding of actin fila- 
ments results in muscle contraction (discussed later). Cardiac and smooth mus- 
cle contain myosin II molecules that are similarly arranged, although different 
genes encode them. 

When a muscle myosin is digested by chymotrypsin and papain, the head 
domain is released as an intact fragment (called SI). The SI fragment alone can 
generate filament sliding in vitro, proving that the motor activity is contained 
completely within the head (Figure 16-56). 

It was initially thought that myosin was present only in muscle, but in the 
1970s, researchers found that a similar two-headed myosin protein was also 
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Figure 16-56 Direct evidence for the 
motor activity of the myosin head. 

<TTAT> In this experiment, purified SI 
myosin heads were attached to a glass 
slide, and then actin filaments labeled 
with fluorescent phalloidin were added 
and allowed to bind to the myosin heads. 

(A) When ATP was added, the actin 
filaments began to glide along the 
surface, owing to the many individual 
steps taken by each of the dozens of 
myosin heads bound to each filament. 
The video frames shown in this sequence 
were recorded about 0.6 second apart; 
the two actin filaments shown (one red 
and one green) were moving in opposite 
directions at a rate of about 4 u,m/sec. 

(B) Diagram of the experiment. The large 
red arrows indicate the direction of actin 
filament movement. (A, courtesy of 
James Spudich.) 
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present in nonmuscle cells, including protozoan cells. At about the same time, 
other researchers found a myosin in the freshwater amoeba Acanthamoeba 
castellanii that was unconventional in having a motor domain similar to the 
head of muscle myosin but a completely different tail. This molecule seemed to 
function as a monomer and was named myosin I (for one-headed); the conven- 
tional myosin was renamed myosin II (for two-headed). 

Subsequently, many other myosin types were discovered. The heavy chains 
generally start with a recognizable myosin motor domain at the N-terminus, 
and then diverge widely with a variety of C-terminal tail domains (Figure 
16-57). The newly identified types of myosins include a number of one-headed 
and two-headed varieties that are approximately equally related to myosin I and 
myosin II, and the nomenclature now reflects their approximate order of dis- 
covery (myosin III through at least myosin XVIII). Sequence comparisons 
among diverse eucaryotes indicate that there are at least 37 distinct myosin fam- 
ilies in the superfamily. The myosin tails (and the tails of motor proteins gener- 
ally) have apparently diversified during evolution to permit the proteins to bind 
other subunits and to interact with different cargoes. 

Some myosins (such as VIII and XI) have been found only in plants, and 
some have been found only in vertebrates (IX). Most, however, are found in all 
eucaryotes, suggesting that myosins arose early in eucaryotic evolution. The 
yeast Saccharomyces cerevisiae contains five myosins: two myosin Is, one 
myosin II, and two myosin Vs. It is tempting to speculate that these three types 
of myosins are necessary for a eucaryotic cell to survive and that other myosins 
perform more specialized functions, particularly in multicellular organisms. The 
nematode C. elegans, for example, has at least 15 myosin genes, representing at 
least seven structural classes; the human genome includes about 40 myosin 
genes. Nine of the human myosins are expressed primarily or exclusively in the 
hair cells of the inner ear, and mutations in five of them are known to cause 
hereditary deafness. These extremely specialized myosins are important for the 
construction and function of the complex and beautiful bundles of actin-rich 
stereocilia that are found on the apical surface of these cells (see Figure 9-50); 
these tilt in response to sound and convert sound waves into electrical signals 
(discussed in Chapter 23). 

All of the myosins except one move toward the plus end of an actin filament, 
although they do so at different speeds. The exception is myosin VI, which 
moves toward the minus end. 

The exact functions for most of the myosins remain to be determined. 
Myosin V is involved in vesicle and organelle transport. Myosin II is associated 
with contractile activity in both muscle and nonmuscle cells. It is generally 
required for cytokinesis (the pinching apart of a dividing cell into two daugh- 
ters), as well as for the forward translocation of the body of a cell during cell 
migration. The myosin I proteins often contain a second actin-binding site or a 
membrane -binding site in their tails, and they are generally involved in intracel- 
lular organization — including the protrusion of actin-rich structures at the cell 
surface, as discussed earlier for the construction of microvilli (see Figure 16-50). 



Figure 16-57 Myosin superfamily 
members. Comparison of the domain 
structure of the heavy chains of some 
myosin types. All myosins share similar 
motor domains (shown in dork green), 
but their C-terminal tails (light green) and 
N-terminal extensions (light blue) are very 
diverse. On the right are depictions of the 
molecular structure for these family 
members. Many myosins form dimers, 
with two motor domains per molecule, 
but a few (such as I, IX, and XIV) seem to 
function as monomers, with just one 
motor domain. Myosin VI, despite its 
overall structural similarity to other family 
members, is unique in moving toward 
the minus end (instead of the plus end) 
of an actin filament. The small insertion 
within its motor head domain, not found 
in other myosins, is probably responsible 
for this change in direction. 
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There Are Two Types of Microtubule Motor Proteins: Kinesins and 
Dyneins 

Kinesin is a motor protein that moves along microtubules. It was first identified 
in the giant axon of the squid, where it carries membrane-enclosed organelles 
away from the neuronal cell body toward the axon terminal by walking toward 
the plus end of microtubules. Kinesin is similar structurally to myosin II in hav- 
ing two heavy chains and two light chains per active motor; these form two glob- 
ular head motor domains and an elongated coiled-coil tail responsible for heavy 
chain dimerization. Like myosin, kinesin is a member of a large protein super- 
family for which the motor domain is the only common element (Figure 16-58) . 
The yeast Saccharomyces cerevisiae has six distinct kinesins. The nematode C. 
elegans has 16 kinesins, and humans have about 45. 

There are at least fourteen distinct families in the kinesin superfamily. Most 
of them have the motor domain at the N-terminus of the heavy chain and walk 
toward the plus end of the microtubule. A particularly interesting family has the 
motor domain at the C-terminus and walks in the opposite direction, toward the 
minus end of the microtubule. Some kinesin heavy chains lack a coiled-coil 
sequence and seem to function as monomers, analogous to myosin I. Some oth- 
ers are homodimers, and yet others are heterodimers. Members of the kinesin-5 
family can self-associate through the tail domain, forming a bipolar motor that 
slides oppositely oriented microtubules past one another, much as a myosin II 
thick filament does for actin filaments. Most kinesins carry a binding site in the 
tail for either a membrane-enclosed organelle or another microtubule. Many of 
the kinesin superfamily members have specific roles in mitotic and meiotic 
spindle formation, and in chromosome separation during cell division. 

The dyneins are a family of minus-end-directed microtubule motors unre- 
lated to the kinesin superfamily. They are composed of two or three heavy chains 
(that include the motor domain) and a large and variable number of associated 
intermediate chains and light chains. The dynein family has two major branches 
(Figure 16-59). The most ancient branch contains the cytoplasmic dyneins, 
which are typically heavy- chain homodimers, with two large motor domains as 
heads. Cytoplasmic dyneins are probably found in all eucaryotic cells, and they 
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Figure 16-58 Kinesin and kinesin- 
related proteins. (A) Structures of five 
kinesin superfamily members. As in the 
myosin superfamily, only the motor 
domains are conserved. Kinesin-1 has the 
motor domain at the N-terminus of the 
heavy chain. The middle domain forms a 
long coiled-coil, mediating dimerization. 
The C-terminal domain forms a tail that 
attaches to cargo, such as a membrane- 
enclosed organelle. Kinesin-3 represents 
an unusual class of kinesins that seem to 
function as monomers and move 
membrane-enclosed organelles along 
microtubules. Kinesin-5 forms tetramers 
where two dimers associate by their tails. 
The bipolar kinesin-5 tetramer is able to 
slide two microtubules past each other, 
analogous to the activity of the bipolar 
thick filaments formed by myosin II. 
Kinesin-1 3 has its motor domain located 
in the middle of the heavy chain. It is a 
member of a family of kinesins that have 
lost typical motor activity and instead 
bind to microtubule ends to increase 
dynamic instability of microtubules; they 
are therefore called catastrophe factors 
(see p. 1003). Kinesin-14 is a C-terminal 
kinesin that includes the Drosophila 
protein Ned and the yeast protein Kar3. 
These kinesins generally travel in the 
opposite direction from the majority of 
kinesins, toward the minus end instead of 
the plus end of a microtubule. (B) Freeze- 
etch electron micrograph of a kinesin 
molecule with the head domains on the 
left. (B, courtesy of John Heuser.) 
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are important for vesicle trafficking, as well as for localization of the Golgi appa- 
ratus near the center of the cell. Axonemal dyneins, the other large branch, 
include heterodimers and heterotrimers, with two or three motor-domain 
heads, respectively. They are highly specialized for the rapid and efficient sliding 
movements of microtubules that drive the beating of cilia and flagella (discussed 
later). A third, minor, branch shares greater sequence similarity with cytoplas- 
mic than with axonemal dyneins but seems to be involved in the beating of cilia. 
Dyneins are the largest of the known molecular motors, and they are also 
among the fastest: axonemal dyneins can move microtubules in a test tube at 
the remarkable rate of 14 |im/sec. In comparison, the fastest kinesins can move 
their microtubules at about 2-3 jam/sec. We shall discuss how they work below. 



The Structural Similarity of Myosin and Kinesin Indicates a 
Common Evolutionary Origin 

The motor domain of myosins is substantially larger than that of kinesins, about 
850 amino acids compared with about 350. The two classes of motor proteins 
track along different filaments and have different kinetic properties, and they 
have no identifiable amino acid sequence similarities. However, determination 
of the three-dimensional structure of the motor domains of myosin and kinesin 
has revealed that these two motor domains are built around nearly identical 
cores (Figure 16-60). The central force-generating element that the two types of 
motor proteins have in common includes the site of ATP binding and the 
machinery necessary to translate ATP hydrolysis into an allosteric conforma- 
tional change. Large loops extending outward from the central core cause the 



Figure 16-59 Dyneins. Freeze-etch 
electron micrographs of a molecule of 
cytoplasmic dynein and a molecule of 
ciliary (axonemal) dynein. Like myosin II 
and kinesin, cytoplasmic dynein is a two- 
headed molecule. The ciliary dynein 
shown is from a protozoan and has three 
heads; ciliary dynein from animals has 
two heads. Note that the dynein head is 
very large compared with the head of 
either myosin or kinesin. (Courtesy of 
John Heuser.) 
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Figure 16-60 X-ray crystal structures of 
myosin and kinesin heads. The central 
nucleotide-binding domains of myosin 
and kinesin (shaded in yellow) are 
structurally very similar. The very different 
sizes and functions of the two motors are 
due to major differences in the polymer- 
binding and force-transduction portions 
of the motor domain. (Adapted from 
L.A. Amos and R.A. Cross, Curr. Opin. 
Struct Biol. 7:239-246, 1997. With 
permission from Elsevier.) 
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difference in domain size and are responsible for the choice of track. These loops 
include the actin-binding and microtubule-binding sites in the myosin and 
kinesin, respectively. It is thought that both the kinesins and the myosins are 
descended from a common ancestral motor protein precursor, and that their 
various specialized functions arose via gene duplication and modification 
through evolution of the loops coming out from the central core. 

An important clue to how the central core is involved in force generation has 
come from the observation that the motor core also bears some structural 
resemblance to the nucleotide binding site of the small GTPases of the Ras 
superfamily. As discussed in Chapter 3 (see Figure 3-72), these proteins exhibit 
distinct conformations in their GTP-bound (active) and GDP-bound (inactive) 
forms: mobile "switch" loops in the nucleotide-binding site are in close contact 
with the y-phosphate in the GTP-bound state, but these loops swing out when 
the hydrolyzed y-phosphate (the terminal phosphate) is released. Although the 
details of the movement differ for the two motor proteins, and ATP rather than 
GTP is hydrolyzed, a relatively small structural change in the active site — the 
presence or absence of a terminal phosphate — is similarly amplified to cause a 
rotation of a different part of the protein. 

In kinesin and myosin, a switch loop interacts extensively with those 
regions of the protein involved in microtubule and actin binding, respectively, 
allowing the structural transitions caused by the ATP hydrolysis cycle to be 
relayed to the polymer-binding interface. The relay of structural changes 
between the polymer-binding site and the nucleotide hydrolysis site seems to 
work in both directions, since the ATPase activity of motor proteins is strongly 
activated by binding to their filament tracks. 



Motor Proteins Generate Force by Coupling ATP Hydrolysis to 
Conformational Changes 

Although the cytoskeletal motor proteins and GTP-binding proteins both use 
structural changes in their nucleoside-triphosphate-binding sites to produce 
cyclic interactions with a partner protein, the motor proteins have a further 
requirement: each cycle of binding and release must propel them forward in a 
single direction along a filament to a new binding site on the filament. For such 
unidirectional motion, a motor protein must use the energy derived from ATP 
binding and hydrolysis to force a large movement in part of the protein 
molecule. For myosin, each step of the movement along actin is generated by 
the swinging of an 8.5-nm-long a helix, or lever arm, which is structurally stabi- 
lized by the binding of light chains. At the base of this lever arm next to the head, 
there is a piston-like helix that connects movements at the ATP-binding cleft in 
the head to small rotations of the so-called converter domain (see Figure 16-60). 
A small change at this point can swing the helix like a long lever, causing the far 
end of the helix to move by about 5.0 nm. 

These changes in the conformation of the myosin are coupled to changes in 
its binding affinity for actin, allowing the myosin head to release its grip on the 
actin filament at one point and snatch hold of it again at another. The full 
mechanochemical cycle of nucleotide binding, nucleotide hydrolysis, and phos- 
phate release (which causes the "power stroke") produces a single step of move- 
ment (Figure 16-61). 

In kinesin, instead of the rocking of a lever arm, the small movements of 
switch loops at the nucleotide-binding site regulate the docking and undocking of 
the motor head domain to a long linker region that connects this motor head at 
one end to the coiled-coil dimerization domain at the other end (see Figure 
16-61). When the front (leading) kinesin head is bound to a microtubule before 
the power stroke, its linker region is relatively unstructured. On the binding of ATP 
to this bound head, its linker region docks along the side of the head; this throws 
the second head forward to a position where it will be able to bind a new attach- 
ment site on the protofilament, 8 nm closer to the microtubule plus end than the 
binding site for the first head. The nucleotide hydrolysis cycles in the two heads 
are closely coordinated, so that this cycle of linker docking and undocking allows 
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) At the start of the cycle shown in this figure, 
a myosin head lacking a bound nucleotide is locked 
tightly onto an actin filament in a rigor configuration (so 
named because it is responsible for rigor mortis, the 
rigidity of death). In an actively contracting muscle, this 
state is very short-lived, being rapidly terminated by the 
binding of a molecule of ATP 



RELEASED A molecule of ATP binds to the large cleft on 
the "back" of the head (that is, on the side furthest from 
the actin filament) and immediately causes a slight 
change in the conformation of the domains that make up 
the actin-binding site. This reduces the affinity of the 
head for actin and allows it to move along the filament. 
(The space drawn here between the head and actin 
emphasizes this change, although in reality the head 
probably remains very close to the actin.) 



COCKED The cleft closes like a clam shell around the 
ATP molecule, triggering a large shape change that 
causes the head to be displaced along the filament by a 
distance of about 5 nm Hydrolysis of ATP occurs, but the 
ADP and inorganic phosphate (Pi) produced remain 
tightly bound to the protein. 



FORCE-GENERATING A weak binding of the myosin 
head to a new site on the actin filament causes release of 
the inorganic phosphate produced by ATP hydrolysis, 
concomitantly with the tight binding of the head to actin. 
This release triggers the power stroke — the 
force-generating change in shape during which the head 
regains its original conformation In the course of the 
power stroke, the head loses its bound ADP, thereby 
returning to the start of a new cycle 



ATTACHED At the end of the cycle, the myosin head is 
again locked tightly to the actin filament in a rigor 
configuration, Note that the head has moved to a new 
position on the actin filament. 



the two-headed motor to move in a hand-over- hand (or head- over-head) stepwise 
manner (Figure 16-62), each time taking a discrete 8-nm step. 

The coiled-coil domain seems both to coordinate the mechanochemical 
cycles of the two heads (motor domains) of the kinesin dimer and to determine 
its directionality of movement. Recall that whereas most members of the kinesin 
superfamily with their motor domains at the N-terminus, move toward the plus 
end of the microtubule, a few superfamily members have their motor domains 
at the C-terminus and move toward the minus end. Since the motor domains of 
these two types of kinesins are essentially identical, how can they move in oppo- 
site directions? The answer seems to lie in the way in which the heads are con- 
nected. In high-resolution images of forward-walking and backward-walking 
members of the kinesin superfamily bound to microtubules, the heads that are 
attached to the microtubule are essentially indistinguishable, but the second, 
unattached heads are oriented differently. This difference in tilt apparently 



Figure 16-61 The cycle of structural 
changes used by myosin II to walk along 
an actin filament. <ATAT> In the myosin 
II cycle, the head remains bound to the 
actin filament for only about 5% of the 
entire cycle time, allowing many myosins 
to work together to move a single actin 
filament. (Based on I. Rayment et al., 
Science 261:50-58, 1993. With permission 
from A A AS.) 
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Figure 16-62 The mechanochemical cycle of kinesin. <GAAT> Kinesin-1 is 
a dimer of two nucleotide-binding motor domains (heads) that are 
connected through a long coiled-coil tail (see Figure 1 6-58). The two 
kinesin motor domains work in a coordinated manner; during a kinesin 
''step," the rear head detaches from its tubulin binding site, passes the 
partner motor domain, and then rebinds to the next available tubulin 
binding site. Using this "hand-over-hand" motion, the kinesin dimer can 
move for long distances on the microtubule without completely letting go 
of its track. 

At the start of each step, one of the two kinesin heads, the rear or lagging 
head {dark green), is tightly bound to the microtubule and to ATP, while the 
front or leading head is loosely bound to the microtubule with ADP in its 
binding site. The forward displacement of the rear motor domain is driven 
by an exchange of ATP for ADP in the front motor domain (between panels 
2 and 3 in this drawing). The binding of ATP to this motor domain causes a 
small peptide called the "neck linker" to shift from a rearward-pointing to a 
forward-pointing conformation (the neck linker is drawn here as a 
connecting line between the motor domain and the intertwined coiled 
coil). This shift pulls the rear motor domain forward, once it has detached 
from the microtubule with ADP bound (detachment requires ATP 
hydrolysis and phosphate (Pj) release). The kinesin molecule is now poised 
for the next step, which proceeds by an exact repeat of the process shown. 



biases the next binding site for the second head, and thereby determines the 
directionality of motor movement (Figure 16-63). 

The dynein motor is structurally unrelated to myosins and kinesins, but still 
follows the general rule of coupling the nucleotide hydrolysis to microtubule 
binding and unbinding as well as to a force -generating conformational change. 
A giant heavy chain of more than 500,000 daltons forms the basic structure that 
creates the movement. Its N-terminal portion forms a tail that binds a set of light 
chains and connects to the other heavy chains in the dynein molecule, while the 
major portion of the heavy chain is used to form an elaborate, ring-shaped head. 
The head consists of a planar ring formed from seven domains: six AAA domains 
plus the heavy-chain C-terminal domain; it is therefore a more complex relative 
of the hexameric ATPase discussed in Chapter 6 (see Figure 6-91). A hook- 
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Figure 16-63 Orientation of 
forward- and backward-walking 
kinesin superfamily proteins 
bound to microtubules. These 

images were generated by fitting 
the structures of the free motor- 
protein dimers (determined byx- 
ray crystallography) onto a lower 
resolution image of the dimers 
attached to microtubules 
(determined by cryoelectron 
microscopy). (A) Kinesin-1 
(conventional kinesin) has its 
motor domain at the protein's 
N-terminus and moves toward 
the plus end of the microtubule. 
When one head of the dimer is 
bound to the microtubule in a 
post-stroke state (with ATP in the 
nucleotide binding site), the 
second, unbound head is 
pointing toward the microtubule 
plus end, poised to take the next 
step. (B) Kinesin-1 3 (called Ned in 
Drosophila), a minus-end-directed 
motor with the motor domain at 
the C-terminus, forms dimers with 
the opposite orientation. (From 
E. Sablin etal., Nature 
395:813-816, 1998. With 
permission from Macmillan 
Publishers Ltd.) 
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Figure 1 6-64 The power stroke of dynein. 

(A) The organization of the domains in each 
dynein heavy chain. This is a huge 
molecule, containing nearly 5000 amino 
acids. The number of heavy chains in a 
dynein is equal to its number of motor 
heads. (B) Dynein c is a monomeric flagella 
dynein found in unicellular green alga 
Chlamydomonas reinhardtii. The large 
dynein motor head is a planar ring 
containing a C-terminal domain [gray) and 
six AAA domains, four of which retain ATP- 
binding sequences, but only one of which 
{dark red) has the major ATPase activity. 
Extending from the head are a long, coiled- 
coil stalk with the microtubule binding site 
at the tip, and a tail with a cargo- 
attachment site. In the ATP-bound state, 
the stalk is detached from the microtubule, 
but ATP hydrolysis causes stalk-microtubule 
attachment. Subsequent release of ADP 
and Pi then leads to a large conformational 
"power stroke" involving rotation of the 
head and stalk relative to the tail. Each 
cycle generates a step of about 8 nm along 
the microtubule towards its minus end. 
(C) Electron micrographs of purified 
dyneins in two different conformations 
representing different steps in the 
mechanochemical cycle. (B, from 
S.A. Burgess et al., Nature 421:71 5-71 8, 
2003. With permission from Macmillan 
Publishers Ltd.) 



shaped linker region connects the heavy-chain tail to the AAA domain that is 
most active as an ATPase. Between the fourth and fifth AAA domains is a heavy 
chain domain that forms a long anti-parallel coiled-coil stalk. This stalk extends 
from the top of the ring, with an ATP hydrolysis-regulated microtubule -binding 
site at its tip. Dynein's "power stroke" is driven by the release of ADP and inor- 
ganic phosphate, and it causes the ring to rotate relative to the tail (Figure 16-64). 

Although kinesin, myosin, and dynein all undergo analogous mechano- 
chemical cycles, the exact nature of the coupling between the mechanical and 
chemical cycles differs in the three cases. For example, myosin without any 
nucleotide is tightly bound to its actin track, in a so-called "rigor" state, and it is 
released from this track by the association of ATP. In contrast, kinesin forms a 
rigor-like tight association with a microtubule when ATP is bound to the kinesin, 
and it is hydrolysis of ATP that promotes release of the motor from its track. The 
mechanochemical cycle of dynein is more similar to myosin than to kinesin, in 
that nucleotide-free dynein is tightly bound to the microtubule and it is released 
by binding ATP. However, for dynein the inorganic phosphate and ADP appear 
to be released at the same time, causing the conformational change driving the 
power stroke, while for myosin the phosphate is released first and the power 
stroke does not occur until the ADP subsequently dissociates from the motor 
head. 

Thus, cytoskeletal motor proteins work in a manner highly analogous to 
GTP-binding proteins, except that in motor proteins the small protein confor- 
mational changes (a few tenths of a nanometer) associated with nucleotide 
hydrolysis are amplified by special protein domains — the lever arm in the case 
of myosin, the linker in the case of kinesin, and the ring and stalk in the case of 
dynein — to generate large-scale (several nanometers) conformational changes 
that move the motor proteins stepwise along their filament tracks. The analogy 
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between the GTPases and the cytoskeletal motor proteins has recently been 
extended by the observation that one of the GTP-binding proteins — the bacte- 
rial elongation factor G — translates the chemical energy of GTP hydrolysis into 
directional movement of the mRNA molecule on the ribosome. 



Motor Protein Kinetics Are Adapted to Cell Functions 

The motor proteins in the myosin and kinesin superfamilies exhibit a remark- 
able diversity of motile properties, well beyond their choice of different polymer 
tracks. Most strikingly a single dimer of kinesin- 1 moves in a highly processive 
fashion, traveling for hundreds of ATPase cycles along a microtubule without 
dissociating. Skeletal muscle myosin II, in contrast, cannot move processively 
and makes just one or a few steps along an actin filament before letting go. These 
differences are critical for the motors' various biological roles. A small number 
of kinesin- 1 molecules must be able to transport an organelle all the way down 
a nerve cell axon, and therefore require a high level of processivity Skeletal mus- 
cle myosin, in contrast, never operates as a single molecule but rather as part of 
a huge array of myosin II molecules in a thick filament. Here processivity would 
actually inhibit biological function, since efficient muscle contraction requires 
that each myosin head perform its power stroke and then quickly get out of the 
way — in order to avoid interfering with the actions of the other heads attached 
to the same actin filament. 

There are two reasons for the high degree of processivity of kinesin- 1 move- 
ment. The first is that the mechanochemical cycles of the two motor heads in a 
kinesin- 1 dimer are coordinated with each other, so that one kinesin head does 
not let go until the other is poised to bind. This coordination allows the motor 
protein to operate in a hand-over-hand fashion, never allowing the organelle 
cargo to diffuse away from the microtubule track. In contrast, there is no appar- 
ent coordination between the myosin heads in a myosin II dimer. The second 
reason for the high processivity of kinesin- 1 movement is that kinesin- 1 spends 
a relatively large fraction of its ATPase cycle tightly bound to the microtubule. 
For both kinesin- 1 and myosin II, the conformational change that produces the 
force-generating working stroke must occur while the motor protein is tightly 
bound to its polymer, and the recovery stroke in preparation for the next step 
must occur while the motor is unbound. But myosin II spends only about 5% of 
its ATPase cycle in the tightly bound state, and it is unbound the rest of the time. 

What myosin loses in processivity it gains in speed; in an array in which 
many motor heads are interacting with the same actin filament, a set of linked 
myosins can move its filament a total distance equivalent to 20 steps during a 
single cycle time, while kinesins can move only two. Thus, myosin II can typi- 
cally drive filament sliding much more rapidly than kinesin- 1, even though the 
two different motor proteins hydrolyze ATP at comparable rates and take molec- 
ular steps of comparable length. This property is particularly important in the 
rapid contraction of skeletal muscle, as we will discuss later. 

Within each motor protein class, movement speeds vary widely, from about 
0.2 to 60 jam/sec for myosins, and from about 0.02 to 2 jam/sec for kinesins. 
These differences arise from a fine-tuning of the mechanochemical cycle. The 
number of steps that an individual motor molecule can take in a given time, and 
thereby the velocity, can be decreased by either decreasing the motor protein's 
intrinsic ATPase rate or by increasing the proportion of cycle time spent bound 
to the filament track. For example, myosin V (which acts as a processive vesicle 
motor) spends up to 90% of its nucleotide cycle tightly bound to the actin fila- 
ment, in contrast to 5% for myosin II. Moreover, a motor protein can evolve to 
change the size of each step by either changing the length of the lever arm (for 
example, the lever arm of myosin V is about three times longer than the lever 
arm of myosin II) or the angle through which the helix swings (Figure 16-65). 
Each of these parameters varies slightly among different members of the myosin 
and kinesin families, corresponding to slightly different protein sequences and 
structures. 
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Figure 1 6-65 The effect of lever arm 
length on the step size for a motor 
protein. The lever arm of myosin II is 
much shorter than the lever arm of 
myosin V. The power stroke in the head 
swings their lever arms through the same 
angle, so myosin V is able to take a bigger 
step than myosin II. 
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It is assumed that evolution has fine-tuned the behavior of each motor pro- 
tein, whose function is determined by the identity of the cargo attached through 
its tail domain, for speed and processivity according to the specific needs of the 
cell for the function of that particular family member. Whereas there are many 
different myosin and kinesin family members found in a typical eucaryotic cell, 
there is usually only one form of cytoplasmic dynein. It is not yet clear how or 
whether the mechanical properties of cytoplasmic dynein can be modified in 
response to differing needs of the cell. 



Motor Proteins Mediate the Intracellular Transport of Membrane- 
Enclosed Organelles <caat> <aaat> 

A major function of cytoskeletal motors in interphase cells is the transport and 
positioning of membrane-enclosed organelles. Kinesin was originally identified 
as the protein responsible for fast axonal transport, the rapid movement of mito- 
chondria, secretory vesicle precursors, and various synapse components down 
the microtubule highways of the axon to the distant nerve terminals. Although 
organelles in most cells need not cover such long distances, their polarized 
transport is equally necessary. A typical microtubule array in an interphase cell 
is oriented with the minus ends near the center of the cell at the centrosome, 
and the plus ends extending to the cell periphery. Thus, centripetal movements 
of organelles or vesicles toward the cell center require the action of minus-end- 
directed motor proteins such as cytoplasmic dynein, whereas centrifugal move- 
ments toward the periphery require plus-end-directed motors such as kinesins. 

The clearest example of the effect of microtubules and microtubule motors 
on the behavior of intracellular membranes is their role in organizing the endo- 
plasmic reticulum (ER) and the Golgi apparatus. The network of ER membrane 
tubules aligns with microtubules and extends almost to the edge of the cell, 
whereas the Golgi apparatus is located near the centrosome. When cells are 
treated with a drug that depolymerizes microtubules, such as colchicine or 
nocodazole, the ER collapses to the center of the cell, while the Golgi apparatus 
fragments and disperses throughout the cytoplasm (Figure 16-66). In vitro, 
kinesins can tether ER-derived membranes to preformed microtubule tracks, 
and walk toward the microtubule plus ends, dragging the ER membranes out 
into tubular protrusions and forming a membranous web very much like the ER 
in cells. Likewise, in living cells the outward movement of ER tubules toward the 
cell periphery is associated with microtubule growth. Conversely, dyneins are 
required for positioning the Golgi apparatus near the cell center, moving Golgi 
vesicles along microtubule tracks toward minus ends at the centrosome. 

The different tails and their associated light chains on specific motor proteins 
allow the motors to attach to their appropriate organelle cargo. Membrane-asso- 
ciated motor receptors that are sorted to specific membrane-enclosed compart- 
ments interact directly or indirectly with the tails of the appropriate kinesin fam- 
ily members. One of these receptors seems to be the amyloid precursor protein, 
APR which binds directly to a light chain on the tail of kinesin- 1 and is proposed 
to be a transmembrane motor receptor in nerve-cell axons. The abnormal pro- 
cessing of this protein gives rise to Alzheimer's disease by producing large, stable 
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Figure 16-66 Effect of depolymerizing 
microtubules on the Golgi apparatus. 

(A) In this endothelial cell, the 
microtubules are labeled in red, and the 
Golgi apparatus is labeled in green (using 
an antibody against a Golgi protein). As 
long as the system of microtubules 
remains intact, the Golgi is localized near 
the centrosome, close to the nucleus at 
the center of the cell. The cell on the right 
is in interphase, with a single centrosome. 
The cell on the left is in prophase, and the 
duplicated centrosomes have moved to 
opposite sides of the nucleus. (B) After 
exposure to nocodazole, which causes 
microtubules to depolymerize (see Table 
1 6-2), the Golgi apparatus fragments and 
is dispersed throughout the cell 
cytoplasm. (Courtesy of David Shima.) 
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protein aggregates in nerve cells of the brain (see Figure 6-95). Other receptors 
for specific kinesins have been identified on the endoplasmic reticulum, as well 
as on various other membrane-bound organelles that rely on microtubule-based 
transport for their localization. The JIPs (JNK-interacting proteins), are scaffold 
proteins associated with cell signaling. These kinesin receptors may provide a 
link between transport and cell signaling. 

For dynein, a large macromolecular assembly often mediates attachment to 
membranes. Cytoplasmic dynein, itself a huge protein complex, requires associ- 
ation with a second large protein complex called dynactin to translocate 
organelles effectively. The dynactin complex includes a short actin-like filament 
that forms from the actin-related protein Arp 1 (distinct from Arp2 and Arp3, the 
components of the ARP complex involved in the nucleation of conventional 
actin filaments). Membranes of the Golgi apparatus are coated with the proteins 
ankyrin and spectrin, which have been proposed to associate with the Arpl fila- 
ment in the dynactin complex to form a planar cytoskeletal array reminiscent of 
the erythrocyte membrane cytoskeleton (see Figure 10-41). The spectrin array 
probably gives structural stability to the Golgi membrane, and— via the Arpl fil- 
ament—it may mediate the regulated attachment of dynein to the organelle (Fig- 
ure 16-67). In other cases, cytoplasmic dynein motors may interact directly with 
their cargo. The cytoplasmic tail of rhodopsin, the light-detecting protein found 
in the rod cells of the eye, binds directly to one of the dynein light chains, and this 
interaction is required for normal trafficking of rhodopsin in the rod cell. 

Motor proteins also have a significant role in organelle transport along actin 
filaments. The first myosin shown to mediate organelle motility was myosin Y a 
two-headed myosin with a large step size (see Figure 16-65). In mice and 
humans, membrane-enclosed pigment granules, called melanosomes, are syn- 
thesized in cells called melanocytes beneath the skin surface. These 
melanosomes move out to the ends of dendritic processes in the melanocytes, 
where they are delivered to the overlying keratinocytes that form the skin (and 
the fur in mice). Myosin Vis associated with the surface of melanosomes (Figure 
16-68) and is able to mediate their actin-based movement in a test tube. In 
mice, mutations in the myosin V gene result in a "dilute" phenotype, in which fur 
color looks faded because the melanosomes are not delivered to the ker- 
atinocytes efficiently Other myosins, including myosin I, are associated with 
endosomes and a variety of other organelles. 
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Figure 16-67 A model for the 
attachment of dynein to a membrane- 
enclosed organelle. Dynein requires the 
presence of a large number of accessory 
proteins to associate with membrane- 
enclosed organelles. Dynactin is a large 
complex (red) that includes components 
that bind weakly to microtubules, 
components that bind to dynein itself, 
and components that form a small actin- 
like filament made of the actin-related 
protein Arpl . It is thought that the Arpl 
filament may mediate attachment of this 
large complex to membrane-enclosed 
organelles through a network of spectrin 
and ankyrin, similar to the membrane- 
associated cytoskeleton of the red blood 
cell (see Figure 10-41). 



The Cytoskeleton Localizes Specific RNA Molecules 

In order to concentrate proteins at their site of function, cells often restrict the 
synthesis of a particular protein by localizing its mRNA molecules, a process that 
establishes cellular asymmetries. This is particularly important when a parent 
cell divides to generate two daughters with distinct fates. As another example, 
several mRNAs encoding proteins involved in synapse function are specifically 
localized close to synapses in many neurons, and there is evidence that mRNA 
localization and translation regulation at the synaptic sites play important roles 
in regulating long-term memory and synaptic plasticity. Not surprisingly, the 
cytoskeleton and cytoskeletal motor proteins transport and position mRNA 
molecules in these types of situations. 

The giant Drosophila oocyte localizes a large number of maternally encoded 
mRNAs to specific sites within the cell in anticipation of the rapid cell specifica- 
tion events in early embryogenesis (discussed in Chapter 22) . A group of mRNAs 
that encode proteins necessary for proper development of the posterior region 
of the embryo, including development of the germ cells, is localized posteriorly 
in the oocyte, and a distinct group of mRNAs encoding proteins necessary for 
specification of anterior structures in the embyro is localized in the anterior 
region of the oocyte. 

The oocyte takes advantage of its polarized microtubule cytoskeleton, 
where most microtubule minus ends are clustered in the anterior part of the cell 
and plus ends near the posterior, to establish these specialized mRNA distribu- 
tions. For example, the mRNA encoding Bicoid, a transcription factor critical for 
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Figure 16-68 Myosin V on melanosomes. 

(A) Phase-contrast image of a portion of a 
melanocyte isolated from a mouse. The 
black spots are melanosomes, which are 
membrane-enclosed organelles filled with 
the skin pigment melanin. (B) The same cell 
labeled with a fluorescent antibody against 
myosin V. Every melanosome is associated 
with a large number of copies of this motor 
protein. (From X. Wu et al., J. Cell Sci. 
1 1 0:847-859, 1 997. With permission from 
The Company of Biologists.) 
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anterior development, has a structure within the 3' UTR that binds a protein 
called Swallow, which in turn binds to a cytoplasmic dynein light chain, pre- 
sumably enabling its transport to the microtubule minus ends at the cell ante- 
rior. Conversely, the transport of mRNA encoding Oskar, a protein necessary for 
germ cell development in the posterior of the embryo, requires kinesin-1 for its 
transport to the microtubule plus ends. The anchoring of the mRNAs to their 
appropriate locations after delivery via microtubules appears to involve the cor- 
tical actin cytoskeleton. The mRNA encoding Oskar, for example, binds directly 
to an actin-binding protein called moesin, a member of the ERM family 

In some cells, mRNA transport as well as anchoring is actin-dependent. The 
yeast mother and daughter cells retain distinct identities, as revealed by major 
differences in their subsequent ability to undergo mating-type switching (dis- 
cussed in Chapter 7) and in the choice of their next bud site. Many of these dif- 
ferences are caused by a gene regulatory protein called Ashl. Both Ashl mRNA 
and protein are localized exclusively to the growing bud and therefore end up 
only in the daughter cell. One of the two type V myosins found in yeast, Myo4p, 
is required for this asymmetric distribution of Ashl mRNA. A genetic screen for 
other mutations that disrupt the mother/ daughter difference has revealed that 
at least six other gene products that are associated with the cytoskeleton are 
required for normal polarity; these include one of the formins, tropomyosin, 
profilin, and actin itself, as well as a complex of two proteins that form a direct 
link between a specific sequence in the Ashl mRNA and the myosin V protein 
(Figure 16-69). 




Figure 16-69 Polarized mRNA 
localization in the yeast bud tip. (A) The 

molecular mechanism of Ashl mRNA 
localization, as determined by genetics 
and biochemistry. (B) Fluorescent in situ 
hybridization (FISH) was used to localize 
the Ashl mRNA {red) in this dividing yeast 
cell. The mRNA is confined to the far tip 
of the daughter cell (here, still a large 
bud). Ashl protein, transcribed from this 
localized mRNA, is also confined to the 
daughter cell. (B, courtesy of Peter 
Takizawa and Ron Vale.) 



Cells Regulate Motor Protein Function 

The cell can regulate the activity of motor proteins and thereby cause either a 
change in the positioning of its membrane-enclosed organelles or whole-cell 
movements. Fish melanocytes provide one of the most dramatic examples. These 
giant cells, which are responsible for rapid changes in skin coloration in several 
species of fish, contain large pigment granules that can alter their location in 
response to neuronal or hormonal stimulation (Figure 16-70). The pigment gran- 
ules aggregate or disperse by moving along an extensive network of microtubules. 
The centrosome nucleates these microtubules, localizing their minus ends in the 
center of the cell, while the plus ends are distributed around the cell periphery. 

The tracking of individual pigment granules (Figure 16-71) reveals that the 
inward movement is rapid and smooth, while the outward movement is jerky, 
with frequent backward steps. Both the microtubule motors dynein and kinesin 
are associated with the pigment granules, as well as the actin motor myosin V 
The jerky outward movements apparently result from a tug-of-war between the 
two microtubule motor proteins, with the stronger kinesin winning out overall. 
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Figure 16-70 Regulated melanosome 
movements in fish pigment cells. These 
giant cells, which are responsible for 
changes in skin coloration in several 
species offish, contain large pigment 
granules, or melanosomes (brown). The 
melanosomes can change their location 
in the cell in response to a hormonal or 
neuronal stimulus. (A) Schematic view of 
a pigment cell, showing the dispersal and 
aggregation of melanosomes in response 
to an increase or decrease in intracellular 
cyclic AMP (cAMP), respectively. Both 
redistributions of melanosomes occur 
along microtubules. (B) Bright-field 
images of a single cell in a scale of an 
African cichlid fish, showing its 
melanosomes either dispersed 
throughout the cytoplasm (left) or 
aggregated in the center of the cell 
(right). (B, courtesy of Leah Haimo.) 
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When the kinesin light chains become phosphorylated after a hormonal stimu- 
lation that signals skin color change, kinesin is inactivated, leaving dynein free 
to drag the pigment granules rapidly toward the cell center, changing the fish's 
color. In a similar way the movement of other membrane organelles coated with 
particular motor proteins is controlled by a complex balance of competing sig- 
nals that regulate both motor protein attachment and activity 

The cell can also use phosphorylation to regulate myosin activity. In non- 
muscle cells, myosin II can be phosphorylated on a variety of sites on both heavy 
and light chains, affecting both motor activity and thick filament assembly The 
myosin II can exist in two different conformational states in such cells, an 
extended state that can form bipolar filaments, and a bent state in which the tail 
domain apparently interacts with the motor head. Phosphorylation of the regu- 
latory light chain by the calcium-dependent myosin light-chain kinase (MLCK) 
causes the myosin II to preferentially assume the extended state, which pro- 
motes its assembly into a bipolar filament and leads to cell contraction (Figure 
16-72). MLCK is also activated during mitosis, causing myosin II to assemble 
into the actin-based contractile ring that pinches the mitotic cell into two. As we 
will discuss below, myosin phosphorylation is also an important component of 
the control of contraction in smooth muscle cells. Regulation of other members 



Figure 16-71 Bidirectional movement of 
a melanosome on a microtubule. An 
isolated melanosome (yellow) moves 
along a microtubule on a glass slide, from 
the plus end toward the minus end. 
Halfway through the video sequence, it 
abruptly switches direction and moves 
from the minus end toward the plus end. 
(From S.L. Rogers et al., Proc. Natl Acad. 
Sci. U.S.A. 94:3720-3725, 1997. With 
permission from National Academy of 
Sciences.) 



plus end 



microtubule 




minus end 




10[im 



THE CYTOSKELETON AND CELL BEHAVIOR 















1025 



myosin light chains 
b \m 

X2. 




actin-binding site 



'\r 



PHOSPHORYLATION 
BY MUCK 



INACTIVE STATE: 

(light chains not phosphorylated) 



(A) 



SPONTANEOUS 
SELF-ASSEMBLY 



bipolar filament 
of 15-20 molecules 



myosin tail released 



ACTIVE STATE: 

(light chains phosphorylated) 




1 pm 



Figure 1 6-72 Light-chain phosphorylation and the regulation of the assembly of myosin II into thick filaments. (A) The 

controlled phosphorylation by the enzyme myosin light-chain kinase (MLCK) of one of the two light chains (the so-called 
regulatory light chain, shown in light blue) on nonmuscle myosin II in a test tube has at least two effects: it causes a change 
in the conformation of the myosin head, exposing its actin-binding site, and it releases the myosin tail from a "sticky patch" 
on the myosin head, thereby allowing the myosin molecules to assemble into short, bipolar, thick filaments. (B) Electron 
micrograph of negatively stained short filaments of myosin II that have been induced to assemble in a test tube by 
phosphorylation of their light chains. These myosin II filaments are much smaller than those found in skeletal muscle cells 
(see Figure 16-55). (B, courtesy of John Kendrick-Jones.) 

of the myosin superfamily is not as well understood, but the control of these 
myosins is likewise thought to involve site-specific phosphorylations. 



Summary 

Motor proteins use the energy of ATP hydrolysis to move along microtubules or actin 
filaments. They mediate the sliding of filaments relative to one another and the trans- 
port of cargo along filament tracks. All known motor proteins that move on actin fila- 
ments are members of the myosin superfamily. The motor proteins that move on 
microtubules are either members of the kinesin superfamily or the dynein family The 
myosin and kinesin superfamilies are diverse, with about 40 genes encoding each type of 
protein in humans. The only structural element shared among all members of each 
superfamily is the motor "head" domain. These heads are fused to a wide variety of dif- 
ferent "tails," which attach to different types of cargo and enable the various family mem- 
bers to perform different functions in the cell. These functions include the transportation 
and localization of specific proteins, membrane-enclosed organelles, and mRNAs. 

Although myosin and kinesin walk along different tracks and use different mech- 
anisms to produce force and movement by ATP hydrolysis, they share a common struc- 
tural core, suggesting that they are derived from a common ancestor. The dynein motor 
protein has independently evolved, and it has a distinct structure and mechanism of 
action. 



THE CYTOSKELETON AND CELL BEHAVIOR 

A central challenge in all areas of cell biology is to understand how the functions 
of many individual molecular components combine to produce complex cell 
behaviors. The cell behaviors that we describe in this final section all rely on a 
coordinated deployment of the components and processes that we have 
explored in the first three sections of the chapter: the dynamic assembly and dis- 
assembly of cytoskeletal polymers, the regulation and modification of their 
structure by polymer-associated proteins, and the actions of motor proteins 
moving along the polymers. How does the cell coordinate all these activities to 
define its shape, to enable it to crawl, or to divide it neatly into two at mitosis? 
These problems of cytoskeletal coordination will challenge scientists for many 
years to come. 
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To provide a sense of our present understanding, we first discuss examples 
where specialized cells build stable arrays of filaments and use highly ordered 
arrays of motor proteins sliding them relative to each other to generate the 
large-scale movements of muscle, cilia, and eucaryotic flagella. Next, we con- 
sider two important instances where filament dynamics collude with motor 
protein activity to generate complex, self- organized dynamic structures: the 
microtubule-based mitotic spindle and the actin arrays involved in cell crawl- 
ing. Finally, we consider the extraordinary organization and behavior of the 
neuronal cytoskeleton. 



Sliding of Myosin II and Actin Filaments Causes Muscles to 
Contract 

Muscle contraction is the most familiar and the best understood form of move- 
ment in animals. In vertebrates, running, walking, swimming, and flying all 
depend on the rapid contraction of skeletal muscle on its scaffolding of bone, 
while involuntary movements such as heart pumping and gut peristalsis depend 
on the contraction of cardiac muscle and smooth muscle, respectively All these 
forms of muscle contraction depend on the ATP-driven sliding of highly orga- 
nized arrays of actin filaments against arrays of myosin II filaments. 

Skeletal muscle was a relatively late evolutionary development, and muscle 
cells are highly specialized for rapid and efficient contraction. The long thin 
muscle fibers of skeletal muscle are actually huge single cells that form during 
development by the fusion of many separate cells, as discussed in Chapter 22. 
The large muscle cell retains the many nuclei of the contributing cells. These 
nuclei lie just beneath the plasma membrane (Figure 16-73). The bulk of the 
cytoplasm inside is made up of myofibrils, which is the name given to the basic 
contractile elements of the muscle cell. A myofibril is a cylindrical structure 1-2 
um in diameter that is often as long as the giant muscle cell itself. It consists of a 
long repeated chain of tiny contractile units — called sarcomeres, each about 2.2 
um long, which give the vertebrate myofibril its striated appearance (Figure 
16-74). 

Each sarcomere is formed from a miniature, precisely ordered array of paral- 
lel and partly overlapping thin and thick filaments. The thin filaments are com- 
posed of actin and associated proteins, and they are attached at their plus ends 
to a Z disc at each end of the sarcomere. The capped minus ends of the actin fil- 
aments extend in toward the middle of the sarcomere, where they overlap with 
thick filaments, the bipolar assemblies formed from specific muscle isoforms of 
myosin II (see Figure 16-55). When this region of overlap is examined in cross 
section by electron microscopy, the myosin filaments are seen to be arranged in 
a regular hexagonal lattice, with the actin filaments evenly spaced between them 
(Figure 16-75). Cardiac muscle and smooth muscle also contain sarcomeres, 
although the organization is not as regular as that in skeletal muscle. 

Sarcomere shortening is caused by the myosin filaments sliding past the 
actin thin filaments, with no change in the length of either type of filament (Fig- 
ure 16-74 C and D). Bipolar thick filaments walk toward the plus ends of two sets 
of thin filaments of opposite orientations, driven by dozens of independent 
myosin heads that are positioned to interact with each thin filament. Because 
there is no coordination among the movements of the myosin heads, it is critical 
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Figure 16-73 Skeletal muscle cells (also 
called muscle fibers). (A) These huge 
multinucleated cells form by the fusion of 
many muscle cell precursors, called 
myoblasts. In an adult human, a muscle 
cell is typically 50 (im in diameter and can 
be up to several centimeters long. 
(B) Fluorescence micrograph of rat muscle, 
showing the peripherally located nuclei 
(blue) in these giant cells. Myofibrils are 
stained red; see also Figure 23-46B. 
{B, courtesy of Nancy L. Kedersha.) 



myofibril 




50 um 



THE CYTOSKELETON AND CELL BEHAVIOR 




2 Lim 



Figure 1 6-74 Skeletal muscle myofibrils. (A) Low-magnification electron 
micrograph of a longitudinal section through a skeletal muscle cell of a 
rabbit, showing the regular pattern of cross-striations. The cell contains 
many myofibrils aligned in parallel (see Figure 16-73). (B) Detail of the 
skeletal muscle shown in (A), showing portions of two adjacent myofibrils 
and the definition of a sarcomere [black arrow). (C) Schematic diagram of a 
single sarcomere, showing the origin of the dark and light bands seen in 
the electron micrographs. The Z discs, at each end of the sarcomere, are 
attachment sites for the plus ends of actin filaments (thin filaments); the 
M line, or midline, is the location of proteins that link adjacent myosin II 
filaments (thick filaments) to one another. The dark bands, which mark the 
location of the thick filaments, are sometimes called A bands because they 
appear anisotropic in polarized light (that is, their refractive index changes 
with the plane of polarization). The light bands, which contain only thin 
filaments and therefore have a lower density of protein, are relatively 
isotropic in polarized light and are sometimes called I bands. (D) When the 
sarcomere contracts, the actin and myosin filaments slide past one another 
without shortening. (A and B, courtesy of Roger Craig.) 
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that they operate with a low processivity, remaining tightly bound to the actin 
filament for only a small fraction of each ATPase cycle so that they do not hold 
one another back. Each myosin thick filament has about 300 heads (294 in frog 
muscle), and each head cycles about five times per second in the course of a 
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Figure 16-75 Electron micrographs of an 
insect flight muscle viewed in cross 
section. The myosin and actin filaments 
are packed together with almost crystalline 
regularity. Unlike their vertebrate 
counterparts, these myosin filaments have 
a hollow center, as seen in the 
enlargement on the right. The geometry of 
the hexagonal lattice is slightly different in 
vertebrate muscle. (From J. Auber, J. de 
Microsc. 8:197-232, 1969. With permission 
from Societe francaise de microscopie 
electronique.) 
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rapid contraction — sliding the myosin and actin filaments past one another at 
rates of up to 15 jum/sec and enabling the sarcomere to shorten by 10% of its 
length in less than l/50th of a second. The rapid synchronized shortening of the 
thousands of sarcomeres lying end-to-end in each myofibril enables skeletal 
muscle to contract rapidly enough for running and flying, or for playing the 
piano. 

Accessory proteins produce the remarkable uniformity in filament organi- 
zation, length, and spacing in the sarcomere (Figure 16-76). The actin filament 
plus ends are anchored in the Z disc, which is built from CapZ and a-actinin; the 
Z disc caps the filaments (preventing depolymerization), while holding them 
together in a regularly spaced bundle. The precise length of each thin filament is 
determined by a template protein of enormous size, called nebulin, which con- 
sists almost entirely of a repeating 35-amino-acid actin-binding motif. Nebulin 
stretches from the Z disc to the minus end of each thin filament and acts as a 
"molecular ruler" to dictate the length of the filament. The minus ends of the 
thin filaments are capped and stabilized by tropomodulin. Although there is 
some slow exchange of actin subunits at both ends of the muscle thin filament, 
such that the components of the thin filament turn over with a half-life of sev- 
eral days, the actin filaments in sarcomeres are remarkably stable compared to 
the dynamic actin filaments characteristic of most other cell types that turn over 
with half-lives of a few minutes or less. 

Opposing pairs of an even longer template protein, called titin, position the 
thick filaments midway between the Z discs. Titin acts as a molecular spring, 
with a long series of immunoglobulin-like domains that can unfold one by one 
as stress is applied to the protein. A springlike unfolding and refolding of these 
domains keeps the thick filaments poised in the middle of the sarcomere and 
allows the muscle fiber to recover after being overstretched. In C. elegans, whose 
sarcomeres are longer than those in vertebrates, titin is also longer, suggesting 
that it too serves as a molecular ruler, determining in this case the overall length 
of each sarcomere (see Figure 3-33). 



Figure 16-76 Organization of accessory 
proteins in a sarcomere. <CTGC> Each 
giant titin molecule extends from the 
Z disc to the M line — a distance of over 
1 jum. Part of each titin molecule is 
closely associated with a myosin thick 
filament (which switches polarity at the 
M line); the rest of the titin molecule is 
elastic and changes length as the 
sarcomere contracts and relaxes. Each 
nebulin molecule is exactly the length of 
a thin filament. The actin filaments are 
also coated with tropomyosin and 
troponin (not shown; see Figure 16-78) 
and are capped at both ends. 
Tropomodulin caps the minus end of the 
actin filaments, and CapZ anchors the 
plus end at the Z disc, which also 
contains a-actinin. 



A Sudden Rise in Cytosolic Ca 2+ Concentration Initiates Muscle 
Contraction <ctgc> 

The force-generating molecular interaction between myosin thick filaments and 
actin thin filaments takes place only when a signal passes to the skeletal muscle 
from its motor nerve. Immediately upon arrival of the signal, the muscle cell 
needs to be able to contract very rapidly with all the sarcomeres shortening 
simultaneously Two major features or the muscle cell are required for extremely 
rapid comraction. First, as previously discussed, the individual myosin motor 
heads in each thick filament spend only a small fraction of the ATP cycle time 
bound to the filament and actively generating force, so many myosin heads can 
act in rapid succession on the same thin filament without interfering with one 
another. Second, a specialized membrane system relays the incoming signal 
rapidly throughout the entire cell. The signal from the nerve triggers an action 
potential in the muscle cell plasma membrane (discussed in Chapter 11), and 
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this electrical excitation spreads rapidly into a series of membraneous folds, the 
transverse tubules, or T tubules, that extend inward from the plasma membrane 
around each myofibril. The signal is then relayed across a small gap to the sar- 
coplasmic reticulum, an adjacent web-like sheath of modified endoplasmic retic- 
ulum that surrounds each myofibril like a net stocking (Figure 16-77A and B). 

When the incoming action potential activates a Ca 2+ channel in the T-tubule 
membrane, a Ca 2+ influx triggers the opening of Ca 2+ -release channels in the 
sarcoplasmic reticulum (Figure 16-77C). Ca 2+ flooding into the cytosol then ini- 
tiates the contraction of each myofibril. Because the signal from the muscle-cell 
plasma membrane is passed within milliseconds (via the T tubules and sar- 
coplasmic reticulum) to every sarcomere in the cell, all of the myofibrils in the 
cell contract at once. The increase in Ca 2+ concentration is transient because the 
Ca 2+ is rapidly pumped back into the sarcoplasmic reticulum by an abundant, 
ATP-dependent Ca 2+ -pump (also called a Ca 2+ -ATPase) in its membrane (see 
Figure 11-13). Typically the cytoplasmic Ca 2+ concentration is restored to rest- 
ing levels within 30 msec, allowing the myofibrils to relax. Thus, muscle con- 
traction depends on two processes that consume enormous amounts of ATP: fil- 
ament sliding, driven by the ATPase of the myosin motor domain, and Ca 2+ 
pumping, driven by the Ca 2+ -pump. 

The Ca 2+ dependence of vertebrate skeletal muscle contraction, and hence 
its dependence on motor commands transmitted via nerves, is due entirely to a 
set of specialized accessory proteins that are closely associated with the actin 
thin filaments. One of these accessory proteins is a muscle form of tropomyosin, 
an elongated molecule that binds along the groove of the actin helix. The other 
is troponin, a complex of three polypeptides, troponins T, I, and C (named for 
their tropomyosin-binding, inhibitory, and Ca 2+ -binding activities, respec- 
tively). Troponin I binds to actin as well as to troponin T In a resting muscle, the 
troponin I-T complex pulls the tropomyosin out of its normal binding groove 
into a position along the actin filament that interferes with the binding of 



(A) 



Figure 1 6-77 T tubules and the 
sarcoplasmic reticulum. (A) Drawing of 
the two membrane systems that relay the 
signal to contract from the muscle cell 
plasma membrane to all of the myofibrils 
in the cell. (B) Electron micrograph 
showing two T tubules. Note the position 
of the large Ca 2+ -release channels in the 
sarcoplasmic reticulum membrane; they 
look like square-shaped "feet" that 
connect to the adjacent T-tubule 
membrane. (C) Schematic diagram 
showing how a Ca 2+ -release channel in 
the sarcoplasmic reticulum membrane is 
thought to be opened by the activation 
of a voltage-gated Ca 2+ channel. 
(B, courtesy of Clara Franzini-Armstrong.) 
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Figure 16-78 The control of skeletal muscle contraction by troponin. (A) A skeletal muscle cell thin filament, showing the 
positions of tropomyosin and troponin along the actin filament. Each tropomyosin molecule has seven evenly spaced 
regions with similar amino acid sequences, each of which is thought to bind to an actin subunit in the filament. 
(B) A thin filament shown end-on, illustrating how Ca 2+ (binding to troponin) is thought to relieve the tropomyosin 
blockage of the interaction between actin and the myosin head. (A, adapted from G.N. Phillips, J.P. Fillers and C. Cohen, 
J. Mot. Biol. 192:1 1 1-131, 1986. With permission from Academic Press.) 



myosin heads, thereby preventing any force-generating interaction. When the 
level of Ca 2+ is raised, troponin C— which binds up to four molecules of Ca 2+ — 
causes troponin I to release its hold on actin. This allows the tropomyosin 
molecules to slip back into their normal position so that the myosin heads can 
walk along the actin filaments (Figure 16-78). Troponin C is closely related to 
the ubiquitous Ca 2+ -binding protein calmodulin (see Figure 15-44); it can be 
thought of as a specialized form of calmodulin that has acquired binding sites 
for troponin I and troponin T, thereby ensuring that the myofibril responds 
extremely rapidly to an increase in Ca 2+ concentration. 

In smooth muscle cells, so-called because they lack the regular striations of 
skeletal muscle, contraction is also triggered by an influx of calcium ions, but the 
regulatory mechanism is different. Smooth muscle forms the contractile portion 
of the stomach, intestine, and uterus, the walls of arteries, and many other struc- 
tures requiring slow and sustained contractions. Smooth muscle is composed of 
sheets of highly elongated spindle-shaped cells, each with a single nucleus. 
Smooth muscle cells do not express the troponins. Instead, Ca 2+ influx into the 
cell regulates contraction by two mechanisms that depend on the ubiquitous 
calcium binding protein calmodulin. 

First, Ca 2+ -bound calmodulin binds to an actin-binding protein, caldesmon, 
which blocks the actin sites where the myosin motor heads would normally 
bind. This causes the caldesmon to fall off of the actin filaments, preparing the 
filaments for contraction. Second, smooth muscle myosin is phosphorylated on 
one of its two light chains by myosin light chain kinase (MLCK), as described 
previously for regulation of nonmuscle myosin II (see Figure 16-72). When the 
light chain is phosphorylated, the myosin head can interact with actin filaments 
and cause contraction; when it is dephosphorylated, the myosin head tends to 
dissociate from actin and becomes inactive (in contrast to nonmuscle myosin II, 
light chain dephosphorylation does not cause thick filament disassembly in 
smooth muscle cells). MLCK requires bound Ca 2+ / calmodulin to be fully active. 

External signaling molecules such as adrenaline (epinephrine) can also reg- 
ulate the contractile activity of smooth muscle. Adrenaline binding to its G-pro- 
tein-coupled cell surface receptor causes an increase in the intracellular level of 
cyclic AMP, which in turn activates cyclic -AMP-dependent protein kinase (PKA) 
(see Figure 15-35). PKA phosphorylates and inactivates MLCK, thereby causing 
the smooth muscle cell to relax. 

The phosphorylation events that regulate contraction in smooth muscle 
cells occur realtively slowly, so that maximum contraction often requires nearly 
a second (compared with the few milliseconds required for contraction of a 
skeletal muscle cell). But rapid activation of contraction is not important in 
smooth muscle: its myosin II hydrolyzes ATP about 10 times more slowly than 
skeletal muscle myosin, producing a slow cycle of myosin conformational 
changes that results in slow contraction. 
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Heart Muscle Is a Precisely Engineered Machine <aggt> 

The heart is the most heavily worked muscle in the body, contracting about 3 bil- 
lion (3 x 10 9 ) times during the course of a human lifetime. This number is about 
the same as the average number of revolutions in the lifetime of an automobile's 
internal combustion engine. Heart cells express several specific isoforms of car- 
diac muscle myosin and cardiac muscle actin. Even subtle changes in these con- 
tractile proteins expressed in the heart — changes that would not cause any 
noticeable consequences in other tissues — can cause serious heart disease (Fig- 
ure 16-79). 

The normal cardiac contractile apparatus is such a highly tuned machine 
that a tiny abnormality anywhere in the works can be enough to gradually wear 
it down over years of repetitive motion. Familial hypertrophic cardiomyopathy is 
a frequent cause of sudden death in young athletes. It is a genetically dominant 
inherited condition that affects about two out of every thousand people, and it 
is associated with heart enlargement, abnormally small coronary vessels, and 
disturbances in heart rhythm (cardiac arrhythmias). The cause of this condition 
is either any one of over 40 subtle point mutations in the genes encoding cardiac 
P myosin heavy chain (almost all causing changes in or near the motor domain), 
or one of about a dozen mutations in other genes encoding contractile pro- 
teins — including myosin light chains, cardiac troponin, and tropomyosin. Minor 
missense mutations in the cardiac actin gene cause another type of heart condi- 
tion, called dilated cardiomyopathy, that also frequently results in early heart 
failure. 




Figure 16-79 Effect on the heart of a 
subtle mutation in cardiac myosin. Left, 
normal heart from a 6-day old mouse 
pup. Right, heart from a pup with a point 
mutation in both copies of its cardiac 
myosin gene, changing Arg 403 to Gin. 
The arrows indicate the atria. In the heart 
from the pup with the cardiac myosin 
mutation, both atria are greatly enlarged 
(hypertrophic), and the mice die within a 
few weeks of birth. (From D. Fatkin et al., 
J. Clin, invest 103:147, 1999. With 
permission from The Rockefeller 
University Press.) 



Cilia and Flagella Are Motile Structures Built from Microtubules 
and Dyneins 

Just as myofibrils are highly specialized and efficient motility machines built 
from actin and myosin filaments, cilia and flagella are highly specialized and 
efficient motility structures built from microtubules and dynein. Both cilia and 
flagella are hair-like cell appendages that have a bundle of microtubules at their 
core. Flagella are found on sperm and many protozoa. By their undulating 
motion, they enable the cells to which they are attached to swim through liquid 
media (Figure 16-80A). Cilia tend to be shorter than flagella and are organized 
in a similar fashion, but they beat with a whip-like motion that resembles the 
breast stroke in swimming (Figure 16-80B). The cycles of adjacent cilia are 
almost but not quite in synchrony, creating the wave-like patterns that can be 
seen in fields of beating cilia under the microscope. Ciliary beating can either 
propel single cells through a fluid (as in the swimming of the protozoan Parame- 
cium) or can move fluid over the surface of a group of cells in a tissue. In the 
human body, huge numbers of cilia (10 9 /cm 2 or more) line our respiratory tract, 
sweeping layers of mucus, trapped particles of dust, and bacteria up to the 
mouth where they are swallowed and ultimately eliminated. Likewise, cilia along 
the oviduct help to sweep eggs toward the uterus. 

The movement of a cilium or a flagellum is produced by the bending of its 
core, which is called the axoneme. The axoneme is composed of microtubules 
and their associated proteins, arranged in a distinctive and regular pattern. Nine 
special doublet microtubules (comprising one complete and one partial micro- 
tubule fused together so that they share a common tubule wall) are arranged in 



Figure 16-80 The contrasting motions of flagella and cilia. (A) The wave-like 
motion of the flagellum of a sperm cell from a tunicate. The cell was 
photographed with stroboscopic illumination at 400 flashes per second. Note that 
waves of constant amplitude move continuously from the base to the tip of the 
flagellum. (B) The beat of a cilium, which resembles the breast stroke in 
swimming. A fast power stroke (red arrows), in which fluid is driven over the 
surface of the cell, is followed by a slow recovery stroke. Each cycle typically 
requires 0.1-0.2 sec and generates a force perpendicular to the axis of the 
axoneme {the ciliary core). (A, courtesy of CJ. Brokaw.) 
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a ring around a pair of single microtubules (Figure 16-81). Almost all forms of 
eucaryotic flagella and cilia (from protozoans to humans) have this characteris- 
tic arrangement. The microtubules extend continuously for the length of the 
axoneme, which can be 10-200 ^m. At regular positions along the length of the 
microtubules, accessory proteins cross-link the microtubules together. 

Molecules of ciliary dynein form bridges between the neighboring doublet 
microtubules around the circumference of the axoneme (Figure 16-82). When 
the motor domain of this dynein is activated, the dynein molecules attached to 
one microtubule doublet (see Figure 16-64) attempt to walk along the adjacent 
microtubule doublet, tending to force the adjacent doublets to slide relative to 
one another, much as actin thin filaments slide during muscle contraction. 
However, the presence of other links between the microtubule doublets pre- 
vents this sliding, and the dynein force is instead converted into a bending 
motion (Figure 16-83). 

The length of flagella is carefully regulated. If one of the two flagella on a 
Chlamydomonas cell is amputated, the remaining one will transiently shrink as 
the stump regrows until they reach the same length, and then the two shortened 
flagella will continue to elongate until both are as long as they were on the 
unperturbed cell. New flagellar components including tubulin and dynein are 
incorporated into the growing flagella at the distal tips. Thus, even in these 



Figure 16-81 The arrangement of 
microtubules in a flagellum or cilium. 

(A) Electron micrograph of the flagellum 
of a green-alga cell (Chlamydomonas) 
shown in cross section, illustrating the 
distinctive "9 + 2" arrangement of 
microtubules. (B) Diagram of the parts of 
a flagellum or cilium. The various 
projections from the microtubules link 
the microtubules together and occur at 
regular intervals along the length of the 
axoneme. (A, courtesy of Lewis Tilney.) 
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Figure 16-82 Ciliary dynein. Ciliary (axonemal) dynein is a large protein assembly (nearly 2 million daltons) composed of 
9-1 2 polypeptide chains, the largest of which is the heavy chain of more than 500,000 daltons. (A) The heavy chains form 
the major portion of the globular head and stem domains, and many of the smaller chains are clustered around the base of 
the stem. There are two heads in the outer dynein in metazoans, but three heads in protozoa, each formed from their own 
heavy chain (see Figure 16-59B for a view of an isolated molecule). The tail of the molecule binds tightly to an A 
microtubule in an ATP-independent manner, while the large globular heads have an ATP-dependent binding site for a B 
microtubule (see Figure 1 6-81 ). When the heads hydrolyze their bound ATP, they move toward the minus end of the B 
microtubule, thereby producing a sliding force between the adjacent microtubule doublets in a cilium or flagellum. For 
details, see Figure 16-64. (B) Freeze-etch electron micrograph of a cilium showing the dynein arms projecting at regular 
intervals from the doublet microtubules. (B, courtesy of John Heuser.) 
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Figure 16-83 The bending of an 
axoneme. (A) When axonemes are 
exposed to the proteolytic enzyme 
trypsin, the linkages holding neighboring 
doublet microtubules together are 
broken. In this case, the addition of ATP 
allows the motor action of the dynein 
heads to slide one pair of doublet 
microtubules against the other pair. (B) In 
an intact axoneme (such as in a sperm), 
flexible protein links prevent the sliding 
of the doublet. The motor action 
therefore causes a bending motion, 
creating waves or beating motions, as 
seen in Figure 16-80. 



highly ordered and stable filament-motor arrays, cells use the intrinsic flexibil- 
ity and adaptability of the cytoskeleton to respond rapidly and dynamically to 
the changes they experience. 

In humans, hereditary defects in ciliary dynein cause Kartagener's syn- 
drome. The syndrome is characterized by male sterility due to immotile sperm, 
a high susceptibility to lung infections owing to the paralyzed cilia in the respi- 
ratory tract that fail to clear debris and bacteria, and defects in determination of 
the left-right axis of the body during early embryonic development (discussed in 
Chapter 22). 

Bacteria also swim using cell surface structures called flagella, but these do 
not contain microtubules or dynein and do not wave or beat. Instead, bacterial 
flagella are long, rigid helical filaments, made up of repeating subunits of the 
protein flagellin. The flagella rotate like propellers, driven by a special rotary 
motor embedded in the bacterial cell wall (see Figure 15-71). The use of the 
same name to denote these two very different types of swimming apparatus is 
an unfortunate historical accident. 

Structures called basal bodies firmly root eucaryotic cilia and flagella at the 
cell surface. The basal bodies have the same form as the centrioles that are found 
embedded at the center of animal centrosomes, with nine groups of fused triplet 
microtubules arranged in a cartwheel (Figure 16-84). Indeed, in some organ- 
isms, basal bodies and centrioles are functionally interconvertible: during each 
mitosis in the unicellular alga Chlamydomonas, for example, the flagella are 
resorbed, and the basal bodies move into the cell interior and become part of the 
spindle poles. New centrioles and basal bodies arise by a curious replication pro- 
cess, in which a smaller daughter is formed perpendicular to the original struc- 
ture by a still mysterious mechanism (see Figure 17-31). 




Figure 16-84 Basal bodies. (A) Electron 
micrograph of a cross section through 
three basal bodies in the cortex of a 
protozoan. (B) Diagram of a basal body 
viewed from the side. Each basal body 
forms the lower portion of a ciliary 
axoneme and is composed of nine sets of 
triplet microtubules, each triplet 
containing one complete microtubule 
(the A microtubule) fused to two 
incomplete microtubules (the B and C 
microtubules). Other proteins (shown in 
red in B) form links that hold the 
cylindrical array of microtubules together. 
The arrangement of microtubules in a 
centriole is essentially the same (see 
Figure 16-31). (A, courtesy of 
D.T. Woodrow and R.W. Linck.) 
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Even in animal cells that lack fully developed beating cilia or flagella, centri- 
oles frequently nucleate the growth of a non-motile, microtubule-rich surface 
projection called a primary cilium. Primary cilia are usually only a few microm- 
eters in length and lack dynein. They are found on the surface of many different 
cell types including fibroblasts, epithelial cells, neurons, bone cells, and chon- 
drocytes (cartilage cells). Many signaling proteins are concentrated in the pri- 
mary cilium, including proteins involved in the Hedgehog signaling pathway 
(see p. 950), and receptors for neurotransmitters on neurons in the central ner- 
vous system. On kidney epithelial cells, primary cilia act as flow sensors that 
detect the movement of fluid through the kidney tubules. Mechanosensitive cal- 
cium channels are opened when the fluid flow bends the primary cilia, regulat- 
ing kidney cell growth and proliferation. Loss of the calcium channel or other 
structural components of the primary cilium in the kidney cells causes polycys- 
tic kidney disease, a common genetic disorder that causes overproliferation of 
the kidney epithelial cells — resulting in the formation of large fluid-filled cysts 
throughout the organs and eventually in kidney failure. Another specialized kind 
of primary cilium that is unusual in being able to beat is required for establish- 
ing left-right asymmetry in the developing embryo (see Figure 22-87). 

Construction of the Mitotic Spindle Requires Microtubule 
Dynamics and the Interactions of Many Motor Proteins 

Myofibrils and cilia are relatively permanent structures specialized to produce 
repetitive movement. But most cell movements depend on labile structures that 
appear at specific stages of the cell cycle or in response to external signals and 
then disappear once they complete their jobs. The most familiar of these are the 
mitotic spindle and the contractile ring that form during cell division. In Chap- 
ter 17, we will describe in detail both the process of mitosis and the cell cycle 
control system that determines the timing of the events of cell division. Here, we 
briefly discuss a few of the cytoskeletal mechanisms that contribute to the con- 
struction and mechanical function of the mitotic spindle. 

The construction of the mitotic spindle is a particularly important and fas- 
cinating example of the power of self-organization by teams of motor proteins 
interacting with dynamic cytoskeletal filaments. It also features the active par- 
ticipation of the chromosomes. In a rapid sequence of events that typically takes 
less than an hour in animal cells, the interphase array of microtubules is com- 
pletely disassembled and reorganized to form the bipolar spindle structure that 
is responsible for segregating the replicated chromosomes with perfect fidelity 
to the two daughter cells. Because of the central importance of reliability in 
transmission of the genetic material, the construction and functioning of the 
mitotic spindle feature a tremendous degree of redundancy, so that if one set of 
mechanisms fails for any reason, there are backup mechanisms in place to 
ensure reliable chromosome partitioning. 

In early mitosis, there are dramatic changes in the dynamic behavior and 
average length of the microtubules. In the interphase array the microtubules are 
typically long and undergo rare catastrophes, but during mitosis the micro- 
tubules are shorter and much more dynamic. Microtubule nucleation and 
assembly are enhanced in the regions around the condensed chromosomes. As 
microtubules assemble on condensed chromatin pointing in random direc- 
tions, the coordinated actions of several motor proteins build a coherent bipo- 
lar spindle from the disorganized micro tuble mass. First, the bipolar kinesin-5 
(see Figure 16-58) bundles the microtubules into a parallel array and slides 
microtubules that are oriented in opposite directions away from each other. 
Next, another kinesin that is bound to chromosome arms, kinesin-4, walks 
toward the plus ends of chromosome-associated microtubules and pushes their 
minus ends away from the chromosome mass. Finally, the minus-end directed 
motors cytoplasmic dynein and kinesin- 14 form oligomeric complexes with 
scaffold proteins that gather the microtubule minus ends together to form the 
spindle poles. In most animal cells, these processes are guided by a pair of cen- 
trosomes that help to nucleate and organise these microtubule minus ends. The 
final result is the elegantly balanced bipolar mitotic spindle (Figure 16-85). 
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Figure 16-85 The mitotic spindle in 
animal cells. <GTCT> (A) There are three 
classes of dynamic microtubules in the 
mitotic spindle at metaphase: 
kinetochore microtubules {blue) that 
attach each chromosome to the spindle 
pole, interpolar microtubules {red) that 
hold the two halves of the spindle 
together, and astral microtubules {green) 
that can interact with the cell cortex. All 
of the microtubules are oriented with 
their minus ends at the spindle poles 
where the centrosomes reside, and their 
plus ends projecting away. As indicated 
by the arrows, the astral microtubules 
undergo dynamic instability, growing 
and shrinking at their plus ends, while 
the kinetochore microtubules and 
interpolar microtubules both undergo 
continuous flux toward the spindle poles. 
(B) A phase-contrast micrograph of an 
isolated mitotic spindle at metaphase, 
with the chromosomes aligned at the 
spindle equator. (C) This fluorescence 
micrograph shows the microtubules of 
the spindle in green and the 
chromosomes in blue. The red spots mark 
the kinetochores, specialized structures 
that connect the microtubules to the 
chromosomes. (B, from E.D. Salmon and 
R.R. SegallJ. Cell Biol. 86:355-365, 1980. 
With permission from The Rockefeller 
University Press; C, from A. Desai, 
Curr. Biol. 10:R508, 2000. With permission 
from Elsevier.) 
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After the bipolar mitotic spindle has assembled, it can appear stable and 
quiescent for long periods of time. In many animals, the unfertilized egg arrests 
its cell cycle in meiotic metaphase, and the spindle waits for days or months 
until fertilization triggers the progression of the cell cycle (see Chapter 21). This 
steady appearance is deceptive, because the spindle is actually an extremely 
dynamic structure, tensed for action that will begin when the chromosomes 
suddenly begin to separate in anaphase. For example, many of the spindle 
microtubules exhibit a behavior called poleward flux, with a net addition of 
tubulin subunits at their plus ends, balancing a net loss at their minus ends near 
the spindle pole. Poleward flux is driven by the action of minus end- directed 
motor proteins at the spindle pole that are constantly reeling in the micro- 
tubules, and the bipolar plus end-directed kinesin-5 motors on the interpolar 
microtubules that are constantly pushing them apart (see Figure 16-85). As will 
be discussed in Chapter 17, the delicate balance between these two types of 
motor protein activities in the spindle also determines its length. Overall the 
mitotic spindle represents a collaborative effort combining the dynamic proper- 
ties of microtubules with the individual actions of dozens of molecular motors 
and other organizing components. 
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Many Cells Can Crawl Across a Solid Substratum 

The process of cell crawling provides another instance where we can appreciate 
the dynamic integration of cytoskeletal filaments, filament regulators, and 
motor proteins. Many cells move by crawling over surfaces rather than by using 
cilia or flagella to swim. Predatory amoebae crawl continuously in search of 
food, and they can easily be observed to attack and devour smaller ciliates and 
flagellates in a drop of pond water. In animals, almost all cell locomotion occurs 
by crawling, with the notable exception of swimming sperm. During embryoge- 
nesis, the structure of an animal is created by the migrations of individual cells 
to specific target locations and by the coordinated movements of whole epithe- 
lial sheets (discussed in Chapter 23). In vertebrates, neural crest cells are remark- 
able for their long-distance migrations from their site of origin in the neural tube 
to a variety of sites throughout the embryo. These cells have diverse fates, 
becoming skin pigment cells, sensory and sympathetic neurons and glia, and 
various structures of the face. Long-distance crawling is fundamental to the con- 
struction of the entire nervous system: it is in this way that the actin-rich growth 
cones at the advancing tips of developing axons travel to their eventual synaptic 
targets, guided by combinations of soluble signals and signals bound to cell sur- 
faces and extracellular matrix along the way 

The adult animal also seethes with crawling cells. Macrophages and neu- 
trophils crawl to sites of infection and engulf foreign invaders as a critical part of 
the innate immune response. Osteoclasts tunnel into bone, forming channels 
that are filled in by the osteoblasts that follow after them, in a continuous pro- 
cess of bone remodeling and renewal. Similarly, fibroblasts can migrate through 
connective tissues, remodeling them where necessary and helping to rebuild 
damaged structures at sites of injury. In an ordered procession, the cells in the 
epithelial lining of the intestine travel up the sides of the intestinal villi, replac- 
ing absorptive cells lost at the tip of the villus. Unfortunately, cell crawling also 
has a role in many cancers, when cells in a primary tumor invade neighboring 
tissues and crawl into blood vessels or lymph vessels and then emerge at other 
sites in the body to form metastases. 

Cell crawling is a highly complex integrated process, dependent on the 
actin-rich cortex beneath the plasma membrane. Three distinct activities are 
involved: protrusion, in which actin-rich structures are pushed out at the front 
of the cell; attachment, in which the actin cytoskeleton connects across the 
plasma membrane to the substratum; and traction, in which the bulk of the trail- 
ing cytoplasm is drawn forward (Figure 16-86). In some crawling cells, such as 



actin cortex 



lamellipodium 



substratum 




cortex under tension 




actin polymerization at 
plus end protrudes 
lamellipodium 

I PROTRUSION 



movement of unpolymerized actin 



ATTACHMENT AND 
TRACTS ON 




focal contacts 

(contain integrm*;) 




Figure 16-86 A model of how forces 
generated in the actin-rich cortex move 
a cell forward. The actin-polymerization- 
dependent protrusion and firm 
attachment of a lamellipodium at the 
leading edge of the cell moves the edge 
forward (green arrows at front) and 
stretches the actin cortex. Contraction at 
the rear of the cell propels the body of 
the cell forward (green arrow at back) to 
relax some of the tension (traction). New 
focal contacts are made at the front, and 
old ones are disassembled at the back as 
the cell crawls forward. The same cycle 
can be repeated, moving the cell forward 
in a stepwise fashion. Alternatively, all 
steps can be tightly coordinated, moving 
the cell forward smoothly. The newly 
polymerized cortical actin is shown in red. 
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keratocytes from the fish epidermis, these activities are closely coordinated, and 
the cells seem to glide forward smoothly without changing shape. In other cells, 
such as fibroblasts, these activities are more independent, and the locomotion is 
jerky and irregular. 



Actin Polymerization Drives Plasma Membrane Protrusion 

The first step in locomotion, protrusion of a leading edge, seems to rely primar- 
ily on forces generated by actin polymerization pushing the plasma membrane 
outward. Different cell types generate different types of protrusive structures, 
including filopodia (also known as microspikes), lamellipodia, and pseudopo- 
dia. All are filled with a dense core of filamentous actin, which excludes mem- 
brane-enclosed organelles. The three structures differ primarily in the way in 
which the actin is organized — in one, two, or three dimensions, respectively — 
and we have already discussed how this results from the presence of different 
actin-associated proteins. 

Filopodia, formed by migrating growth cones and some types of fibroblasts, 
are essentially one- dimensional. They contain a core of long, bundled actin fila- 
ments, which are reminiscent of those in microvilli but longer and thinner, as 
well as more dynamic. Lamellipodia, formed by epithelial cells and fibroblasts, 
as well as by some neurons, are two-dimensional, sheet-like structures. They 
contain an orthogonally cross-linked mesh of actin filaments, most of which lie 
in a plane parallel to the solid substratum. Pseudopodia, formed by amoebae 
and neutrophils, are stubby three-dimensional projections filled with an actin- 
filament gel. <ATGG> Perhaps because their two-dimensional geometry is most 
convenient for examination with the light microscope, we have more informa- 
tion about the dynamic organization and protrusion mechanism of lamellipodia 
than we have for either filopodia or pseudopodia. 

Lamellipodia contain all of the machinery that is required for cell motility. 
They have been especially well studied in the epithelial cells of the epidermis of 
fish and frogs, which are known as keratocytes because of their abundant keratin 
filaments. These cells normally cover the animal by forming an epithelial sheet, 
and they are specialized to close wounds very rapidly, moving at rates up to 30 
Lim/min. When cultured as individual cells, keratocytes assume a distinctive 
shape with a very large lamellipodium and a small, trailing cell body that is not 
attached to the substratum (Figure 16-87). Fragments of this lamellipodium can 
be sliced off with a micropipette. Although the fragments generally lack micro- 
tubules and membrane-enclosed organelles, they continue to crawl normally, 
looking like tiny keratocytes. 

The dynamic behavior of actin filaments can be studied in keratocyte lamel- 
lipodia by marking a small patch of actin and examining its fate. This reveals 
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Figure 16-87 Migratory keratocytes from 
a fish epidermis. <GTTA> (A) Light 
micrographs of a keratocyte in culture, 
taken about 15 sec apart. This cell is 
moving at about 1 5 |im/sec. (B) Keratocyte 
seen by scanning electron microscopy, 
showing its broad, flat lamellipodium and 
small cell body, including the nucleus, 
carried up above the substratum at the 
rear. (C) Distribution of cytoskeletal 
filaments in this cell. Actin filaments (red) 
fill the large lamellipodium and are 
responsible for the cell's rapid movement. 
Microtubules (green) and intermediate 
filaments (blue) are restricted to the 
regions close to the nucleus. (A and B, 
courtesy of Juliet Lee.) 
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Figure 16-88 Actin filament nucleation and web formation by the ARP 
complex in lamellipodia. (A) A keratocyte with actin filaments labeled in 
red by fluorescent phalloidin, and the ARP complex labeled in green with 
an antibody raised against one of its component proteins. The regions 
where the two overlap appear yellow. The ARP complex is highly 
concentrated near the front of the lamellipodiiim, where actin nucleation is 
most active. (B) Electron micrograph of a platinum-shadowed replica of the 
leading edge of a keratocyte, showing the dense actin filament meshwork. 
The labels denote areas enlarged in C. (C) Close-up views of the marked 
regions of the actin web at the leading edge shown in B. Numerous 
branched filaments can be seen, with the characteristic 70° angle formed 
when the ARP complex nucleates a new actin filament off the side of a 
preexisting filament (see Figure 1 6-34). (From T. Svitkina and G. Borisy, 
J. Cell Biol. 145:1009-1026, 1999. With permission from The Rockefeller 
University Press.) 



that, while the lamellipodia crawl forward, the actin filaments remain stationary 
with respect to the substrate. The actin filaments in the meshwork are mostly 
oriented with their plus ends facing forward. The minus ends are frequently 
attached to the sides of other actin filaments by ARP complexes (see Figure 
16-34), helping to form the two-dimensional web (Figure 16-88). The web as a 
whole seems to be undergoing treadmilling, assembling at the front and disas- 
sembling at the back, reminiscent of the treadmilling that occurs in individual 
actin filaments and microtubules discussed previously (see Figure 16-14). 
Treadmilling of a dendritic web built by the ARP complex is only one of several 
ways that cells can use dynamic actin filaments to drive the protrusion of the 
leading edge. Some slowly-moving cells including fibroblasts appear to use a 
mechanism that does not depend on the ARP complex, but still requires coordi- 
nated actin filament assembly and disassembly, possibly coordinated by 
formins. 

Maintenance of unidirectional motion by lamellipodia is thought to require 
the cooperation and mechanical integration of several factors. Filament nucle- 
ation is localized at the leading edge, with new actin filament growth occurring 
primarily in that location to push the plasma membrane forward. Most fila- 
ment depolymerization occurs at sites located well behind the leading edge. 
Because cofilin (see Figure 16-42) binds cooperatively and preferentially to 
actin filaments containing ADP-actin (the D form), the new T-form filaments 
generated at the leading edge should be resistant to depolymerization by cofilin 
(Figure 16-89). As the filaments age and ATP hydrolysis proceeds, cofilin can 
efficiently disassemble the older filaments. Thus, the delayed ATP hydrolysis by 
filamentous actin is thought to provide the basis for a mechanism that main- 
tains an efficient, unidirectional treadmilling process in the lamellipodium 
(Figure 16-90). Finally, bipolar myosin II filaments seem to associate with the 
actin filaments in the web and pull them into a new orientation — from nearly 
perpendicular to the leading edge to an orientation almost parallel to the lead- 
ing edge. This contraction prevents protrusion and it pinches in the sides of the 
locomoting lamellipodium, helping to gather in the sides of the cell as it moves 
forward (Figure 16-91). 
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Figure 16-89 Cofilin in lamellipodia. 

(A) A keratocyte with actin filaments 
labeled in red by fluorescent phalloidin 
and cofilin labeled in green with a 
fluorescent antibody. The regions where 
the two overlap appear yellow. Although 
the dense actin meshwork reaches all the 
way through the lamellipodium, cofilin is 
not found at the very leading edge. 

(B) Close-up view of the region marked 
with the white rectangle in A. The actin 
filaments closest to the leading edge, 
which are also the ones that have formed 
most recently and that are most likely to 
contain ATP actin (rather than ADP actin) 
in the filament lattice are generally not 
associated with cofilin. (From T. Svitkina 
and G. Borisy, J. Cell Biol. 1 45:1 009-1 026, 

1 999. With permission from The 
Rockefeller University Press.) 
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Figure 16-90 A model for protrusion of 
the actin meshwork at the leading edge. 

Two time points during advance of the 
lamellipodium are illustrated, with newly 
assembled structures at the later time 
point shown in a lighter color. Nucleation 
is mediated by the ARP complex at the 
front. Newly nucleated actin filaments are 
attached to the sides of preexisting 
filaments, primarily at a 70° angle. 
Filaments elongate, pushing the plasma 
membrane forward because of some sort 
of anchorage of the array behind. At a 
steady rate, actin filament plus ends 
become capped. After newly polymerized 
actin subunits hydrolyze their bound ATP 
in the filament lattice, the filaments 
become susceptible to depolymerization 
by cofilin. This cycle causes a spatial 
separation between net filament 
assembly at the front and net filament 
disassembly at the rear, so that the actin 
filament network as a whole can move 
forward, even though the individual 
filaments within it remain stationary with 
respect to the substratum. 



The pushing force created by the polymerization of a branched web of actin 
filaments plays an important role in many cell processes. The polymerization at 
the plus end can push the plasma membrane outward, as in the example just 
discussed (see Figure 16-90), or it can propel vesicles or particles through the 
cell cytoplasm, as in the example of the bacterium Listeria monocytogenes dis- 
cussed in Chapter 24 (see Figure 24-37 <GTAT>). Moreover, when anchored in a 
more complex way to the membrane, the same type of force drives plasma 
membrane invaginations, as it does during the endocytotic and phagocytotic 
processes discussed in Chapter 13. 

It is interesting to compare the organization of the actin-rich lamellipodium 
to the organization of the microtubule-rich mitotic spindle. In both cases, the 
cell harnesses and amplifies the intrinsic dynamic behavior of the cytoskeletal 
filament systems to generate large-scale structures that determine the behavior 
of the whole cell. Both structures feature rapid turnover of their constituent 
cytoskeletal filaments, even though the structures themselves may remain intact 
at steady state for long periods of time. The leading edge plasma membrane in 
the lamellipodium fulfills an organizational role analogous to the condensed 
chromosomes in organizing and stimulating the dynamics of the mitotic spin- 
dle. In both cases, molecular motor proteins help to enhance cytoskeletal fila- 
ment flux and turnover in the large-scale arrays. 



Figure 16-91 Contribution of myosin II to polarized cell motility. 

(A) Myosin II bipolar filaments bind to actin filaments in the dendritic 
lamellipodial meshwork and cause network contraction. The myosin-driven 
reorientation of the actin filaments in the dendritic meshwork forms an 
actin bundle that recruits more myosin II and contributes to generating the 
contractile forces required for retraction of the trailing edge of the moving 
cell. (B) A fragment of the large lamellipodium of a keratocyte can be 
separated from the main cell body either by surgery with a micropipette or 
by treating the cell with certain drugs. Many of these fragments continue 
to move rapidly, with the same overall cytoskeletal organization as the 
intact keratocytes. Actin (blue) forms a protrusive meshwork at the front of 
the fragment. Myosin II (pink) is gathered into a band at the rear. (From 
A. Verkovsky et al., Curr. Biol. 9:1 1 -20, 1 999. With permission from Elsevier.) 
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Cell Adhesion and Traction Allow Cells to Pull Themselves 
Forward 

Lamellipodia of all cells seem to share a basic, simple type of dynamic organiza- 
tion where actin filament assembly occurs preferentially at the leading edge and 
actin filament disassembly occurs preferentially at the rear. However, the inter- 
actions between the cell and its normal physical environment usually make the 
situation considerably more complex than for fish keratocytes crawling on a cul- 
ture dish. Particularly important in locomotion is the intimate crosstalk between 
the cytoskeleton and the cell adhesion apparatus. Although some degree of 
adhesion to the substratum is necessary for any form of cell crawling, adhesion 
and locomotion rate seem generally to be inversely related, with highly adhesive 
cells moving more slowly Uian weakly adhesive ones. Keratocytes are so weakly 
adhesive to the substratum that the force of actin polymerization can push ihe 
leading edge forward very rapidly. In contrast, neurons from the sea slug Aplysia 
cultured on a sticky substratum form large lamellipodia that become stuck too 
tightly to move forward. In these lamellipodia, the same cycle of localized nucle- 
ation of new actin filaments, depolymerization of old filaments, and myosin- 
dependent contraction continues to operate. But because the leading edge is 
prevented physically from moving forward, the entire actin mesh moves back- 
ward toward the cell body instead, pulled by myosins (Figure 16-92). The adhe- 
sion of most cells lies somewhere between these two extremes, and most lamel- 
lipodia exhibit some combination of forward actin filament protrusion (like ker- 
atocytes) and rearward actin flux (like the Aplysia neurons). 

As a lamellipodium, filopodium, or pseudopodium extends forward over a 
substratum, it can form new attachment sites at the cell front that remain sta- 
tionary as the cell moves forward over them, persisting until the rear of the cell 
catches up with them. When an individual lamellipodium fails to adhere to the 
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Figure 16-92 Rearward movement of 
the actin network in a growth-cone 

lamellipodium. (A) A growth cone from a 
neuron of the sea slug Aplysia is cultured 
on a highly adhesive substratum and 
viewed by differential-interference- 
contrast microscopy. Microtubules and 
membrane-enclosed organelles are 
confined to the bright, rear area of the 
growth cone (to the left), while a 
meshwork of actin filaments fills the 
lamellipodium (on the right). (B) After 
brief treatment with the drug 
cytochalasin, which caps the plus ends of 
actin filaments (see Table 16-2, p. 988), 
the actin meshwork has detached from 
the front edge of the lamellipodium and 
has been pulled backward. (C) At the 
time point shown in B, the cell was fixed 
and labeled with fluorescent phalloidin 
to show the distribution of the actin 
filaments. Some actin filaments persist at 
the leading edge, but the region behind 
the leading edge is devoid of filaments. 
Note the sharp boundary of the 
rearward-moving actin meshwork. 
(D) The complex cyclic structure of 
cytochalasin B. (A-C, courtesy of Paul 
Forscher.) 
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Figure 16-93 Lamellipodia and ruffles at 
the leading edge of a human fibroblast 
migrating in culture. The arrow in this 
scanning electron micrograph shows the 
direction of cell movement. As the cell 
moves forward, lamellipodia that fail to 
attach to the substratum are swept 
backward over the dorsal surface of the 
cell, a movement known as ruffling. 
(Courtesy of Julian Heath.) 
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substratum, it is usually lifted up onto the dorsal surface of the cell and rapidly 
carried backward as a "ruffle" (Figure 16-93). 

The attachment sites established at the leading edge serve as anchorage 
points, which allow the cell to generate traction on the substratum and pull its 
body forward. Myosin motor proteins, especially myosin II, seem to generate 
traction forces. In many locomoting cells, myosin II is highly concentrated at the 
posterior of the cell where it may help to push the cell body forward like tooth- 
paste being squeezed out of a tube from the rear (Figure 16-94; see also Figure 
16-91). Dictyostelium amoebae that are deficient in myosin II are able to pro- 
trude pseudopodia at normal speeds, but the translocation of their cell body is 
much slower than that of wild-type amoebae, indicating the importance of 
myosin II contraction in this part of the cell locomotion cycle. In addition to 
helping to push the cell body forward, contraction of the actin-rich cortex at the 
rear of the cell may selectively weaken the older adhesive interactions that tend 
to hold the cell back. Myosin II may also transport cell body components for- 
ward over a polarized array of actin filaments. 

The traction forces generated by locomoting cells exert a significant pull on 
the substratum (Figure 16-95). In a living animal, most crawling cells move 
across a semiflexible substratum made of extracellular matrix, which can be 
deformed and rearranged by these cell forces. In culture, movement of fibrob- 
lasts through a gel of collagen fibrils aligns the collagen, generating an organized 
extracellular matrix that in turn affects the shape and direction of locomotion of 
the fibroblasts within it (Figure 16-96). Conversely, mechanical tension or 
stretching applied externally to a cell will cause it to assemble stress fibers and 
focal adhesions, and become more contractile. Although poorly understood, 
this two-way mechanical interaction between cells and their physical environ- 
ment is thought to be a primary way that vertebrate tissues organize themselves. 



Members of the Rho Protein Family Cause Major Rearrangements 
of the Actin Cytoskeleton 

Cell migration is one example of a process that requires long-distance commu- 
nication and coordination between one end of a cell and the other. During 
directed migration, it is important that the front end of the cell remain struc- 
turally and functionally distinct from the back end. In addition to driving local 
mechanical processes such as protrusion at the front and retraction at the rear, 
the cytoskeleton is responsible for coordinating cell shape, organization, and 
mechanical properties from one end of the cell to the other, a distance which is 
typically several tens of micrometers for animal cells. In many cases, including 
but not limited to cell migration, large-scale cytoskeletal coordination takes the 
form of the establishment of cell polarity, where a cell builds different structures 
with distinct molecular components at the front vs. the back, or at the top vs. the 
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Figure 16-94 The localization of myosin 
I and myosin II in a normal crawling 
Dictyostelium amoeba. This cell was 
crawling toward the upper right at the 
time that it was fixed and labeled with 
antibodies specific for two myosin 
isoforms. Myosin I (green) is mainly 
restricted to the leading edge of 
pseudopodia at the front of the cell. 
Myosin II (red) is highest in the posterior, 
actin-rich cortex. Contraction of the 
cortex at the posterior of the cell by 
myosin II may help to push the cell body 
forward. (Courtesy of Yoshio Fukui.) 
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Figure 16-95 Adhesive cells exert 
traction forces on the substratum. These 
fibroblasts have been cultured on a very 
thin sheet of silicon rubber. Attachment of 
the cells, followed by contraction of their 
cytoskeleton, has caused the rubber 
substratum to wrinkle. (From A.K. Harris, 
P. Wild and D. Stopak, Science 208:1 77-1 79, 
1 980. With permission from AAAS.) 
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Figure 1 6-96 Shaping of the extracellular matrix by cell pulling. This 
micrograph shows a region between two pieces of embryonic chick heart 
(tissue explants rich in fibroblasts and heart muscle cells) that were grown 
in culture on a collagen gel for 4 days. A dense tract of aligned collagen 
fibers has formed between the two explants, apparently as a result of 
fibroblasts tugging on the collagen. (From D. Stopak and A.K. Harris, Dev. 
Biol. 90:383-398, 1982. With permission from Academic Press.) 

bottom. Cell locomotion requires an initial polarization of the cell to set it off in 
a particular direction. Carefully controlled cell polarization processes are also 
required for oriented cell divisions in tissues and for formation of a coherent, 
organized multicellular structure. Genetic studies in yeast, flies, and worms have 
provided most of our current understanding of the molecular basis of cell polar- 
ity. The mechanisms that generate cell polarity in vertebrates are only beginning 
to be explored. In all known cases, however, the cytoskeleton has a central role, 
and many of the molecular components have been evolutionarily conserved. 

For the actin cytoskeleton, diverse cell-surface receptors trigger global 
structural rearrangements in response to external signals. But all of these signals 
seem to converge inside the cell on a group of closely related monomeric 
GTPases that are members of the Rho protein family— Cdc42 t Rac, and Rho. The 
same Rho family proteins are also involved in the establishment of many kinds 
of cell polarity. 

Like other members of the Ras superfamily these Rho proteins act as molec- 
ular switches to control cell processes by cycling between an active, GTP-bound 
state and an inactive, GDP-bound state (see Figure 3-71). Activation of Cdc42 on 
the plasma membrane triggers actin polymerization and bundling to form 
either filopodia or shorter cell protrusions called microspikes. Activation of Rac 
promotes actin polymerization at the cell periphery leading to the formation of 
sheet-like lamellipodial extensions and membrane ruffles, which are actin-rich 
protrusions on the cell's dorsal surface (see Figure 16-93). Activation of Rho pro- 
motes both the bundling of actin filaments with myosin II filaments into stress 
fibers and the clustering of integrins and associated proteins to form focal con- 
tacts (Figure 16-97). These dramatic and complex structural changes occur 
because each of these three molecular switches has numerous downstream tar- 
get proteins that affect actin organization and dynamics. 
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Figure 16-97 The dramatic effects of 
Rac, Rho, and Cdc42 on actin 
organization in fibroblasts. In each case, 
the actin filaments have been labeled 
with fluorescent phalloidin. 

(A) Serum-starved fibroblasts have actin 
filaments primarily in the cortex, and 
relatively few stress fibers. 

(B) Microinjection of a constitutively 
activated form of Rho causes the rapid 
assembly of many prominent stress 
fibers. (C) Microinjection of a 
constitutively activated form of Rac, a 
closely related monomeric GTPase, 
causes the formation of an enormous 
lamellipodium that extends from the 
entire circumference of the cell. 

(D) Microinjection of a constitutively 
activated form of Cdc42, another Rho 
family member, causes the protrusion of 
many long filopodia at the cell periphery. 
The distinct global effects of these three 
GTPases on the organization of the actin 
cytoskeleton are mediated by the actions 
of dozens of other protein molecules that 
are regulated by the GTPases. These 
target proteins include some of the 
various actin-associated proteins that we 
have discussed in this chapter. (From 
A. Hall, Science 279:509-514, 1998. With 
permission from AAA5.) 
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Some key targets of activated Cdc42 are members of the WASp protein fam- 
ily. Human patients deficient in WASp suffer from Wiskott-Aldrich Syndrome, a 
severe form of immunodeficiency where immune system cells have abnormal 
actin-based motility and platelets do not form normally. Although WASp itself is 
expressed only in blood cells and immune system cells, other family members 
are expressed ubiquitously that enable activated Cdc42 to enhance actin poly- 
merization. WASp proteins can exist in an inactive folded conformation and an 
activated open conformation. Association with Cdc42-GTP stabilizes the open 
form of WASp, enabling it to bind to the ARP complex and strongly enhancing 
this complex's actin-nucleating activity (see Figure 16-34) . In this way, activation 
of Cdc42 increases actin nucleation. 

Rac-GTP also activates WASp family members, as well as activating the 
crosslinking activity of the gel-forming protein filamin, and inhibiting the con- 
tractile activity of the motor protein myosin II, stabilizing the lamellipodia and 
inhibiting the formation of contractile stress fibers (Figure 16-98A). 

Rho-GTP has a very different set of targets. Instead of activating the ARP 
complex to build actin networks, Rho-GTP turns on formin proteins to construct 
parallel actin bundles. At the same time, Rho-GTP activates a protein kinase that 
indirectly inhibits the activity of cofilin, leading to actin filament stabilization. 
The same protein kinase inhibits a phosphatase acting on myosin light chains 
(see Figure 16-72). The consequent increase in the net amount of myosin light 
chain phosphorylation increases the amount of contractile myosin motor pro- 
tein activity in the cell, enhancing the formation of tension-dependent struc- 
tures such as stress fibers (Figure 16-98B). 

In some cell types, Rac-GTP activates Rho, usually with kinetics that are slow 
compared to Rac's activation of the ARP complex. This enables cells to use the 
Rac pathway to build a new actin structure while subsequently activating the 
Rho pathway to induce a contractility that builds up tension in this structure. 
This occurs, for example, during the formation and maturation of cell-cell con- 
tacts. As we will explore in more detail below, the communication between the 
Rac and Rho pathways also facilitates maintenance of the large-scale differences 
between the cell front and the cell rear during migration. 



Extracellular Signals Can Activate the Three Rho Protein Family 
Members 

The activation of the monomeric GTPases Rho, Rac, and Cdc42 occurs through 
an exchange of GTP for a tightly bound GDP molecule, catalyzed by guanine 
nucleotide exchange factors (GEFs). Of the 85 GEFs that have been identified in 
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Figure 16-98 The contrasting effects of 
Rac and Rho activation on actin 
organization. (A) Activation of the small 
GTPase Rac leads to actin nucleation by 
the ARP complex and other alterations in 
actin accessory proteins that tend to 
favor the formation of actin networks, as 
in lamellipodia. Several different 
pathways contribute independently. 
Rac-GTP activates members of the WASp 
protein family, which in turn activate 
actin nucleation and branched web 
formation by the ARP complex. In a 
parallel pathway, Rac-GTP activates a 
protein kinase, PAK, which has several 
targets including the web-forming 
crosslinker filamin, which is activated by 
phosphorylation, and the myosin light 
chain kinase (MLCK), which is inhibited 
by phosphorylation. The resulting 
decrease in phosphorylation of the 
myosin regulatory light chain leads to 
myosin II filament disassembly and a 
decrease in contractile activity. In some 
cells, PAK also directly inhibits myosin II 
activity by phosphorylation of the myosin 
heavy chain (MHC). Another set of 
pathways downstream of Rac activation 
is mediated by phosphoinositide lipid 
signals. Local creation of PIP 2 [PI<4,5)P 2 ] 
may help to reduce the activity of 
capping protein, to further aid actin 
polymerization. Activation of PI 3-kinase, 
which generates PIP 3 from PIP 2 , leads to 
further activation of Rac itself via a 
positive feedback loop. 

(B) Activation of the related GTPase Rho 
leads to nucleation of actin filaments by 
formins and increases contraction by 
myosin II, promoting the formation of 
contractile actin bundles such as stress 
fibers. Activation of myosin II by Rho 
requires a Rho-dependent protein kinase 
called Rock. This kinase inhibits the 
phosphatase that removes the activating 
phosphate groups from myosin II light 
chains (MLC); it may also directly 
phosphorylate the myosin light chains in 
some cell types. Rock also activates other 
protein kinases, such as LIM kinase, which 
in turn contributes to the formation of 
stable contractile actin filament bundles 
by inhibiting the actin depolymerizing 
factor cofilin. A similar signaling pathway 
is important for forming the contractile 
ring necessary for cytokinesis (see 
Figure 17-52). 
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the human genome, some are specific for an individual Rho family GTPase, 
whereas others seem to act on all three family members. The number of GEFs 
exceeds the number of Rho GTPases that they regulate because different GEFs 
are restricted to specific tissues and even specific subcellular locations, and they 
are sensitive to distinct kinds of regulatory inputs. Various cell-surface receptors 
activate GEFs. An example is the Eph receptor tyrosine kinase involved in neu- 
rite growth cone guidance, which is discussed in detail in Chapter 15. Interest- 
ingly, several of the Rho family GEFs associate with the growing ends of micro- 
tubules by binding to one of the +TIPs. This provides a connection between the 
dynamics of the microtubule cytoskeleton and the large-scale organization of 
the actin cytoskeleton, which is important for the overall integration of cell 
shape and movement. 

The Rho family GTPases are also primary determinants of cell polarity in 
budding yeast, where extensive genetic analyses have increased our under- 
standing of the general mechanisms involved. On starvation, yeasts, like many 
other unicellular organisms, sporulate. But sporulation can occur only in diploid 
budding yeast cells, whereas budding yeasts mainly proliferate as haploid cells. 
A starving haploid individual must therefore locate a partner of the opposite 
mating type, woo it, and mate with it before sporulating. Yeast cells are unable to 
swim and, instead, reach their mates by polarized growth. The haploid form of 
budding yeast comes in two mating types, a and a f which secrete mating factors 
known as a-factor and a- factor, respectively These secreted signal molecules act 
by binding to cell-surface receptors that belong ro I he G -protein-coupled recep- 
tor super family (discussed in Chapter 15). One consequence of the binding of ce- 
faclor to its receptor Is to cause the recipient cell to become polarized, adopting 
a shape known as a "shrnoo" (Figure 16-99], In the presence of an a-factor gra- 
dient, the a-cell shrnoo tip is directed toward the highest concentration of the 
signal molecule, which under normal circumstances would direct it toward an 
amorous a cell located nearby. 

This polarized cell growth requires alignment of the actin cytoskeleton in 
response to the mating factor signal. When the signal binds to its receptor, the 
receptor activates Cdc42, which in turn induces assembly of actin filaments at 
the location closest to the source of the signal. Local activation of Cdc42 is further 
enhanced by a positive feedback loop, requiring actin- dependent transport of 
Cdc42 itself as well as its GEF and other signaling components along the newly 
assembled actin structures toward the site of the signal. Subsequently, actin 
cables are assembled pointing toward the site of Cdc42 accumulation due to the 
activation of another Rho family GTPase that in turn stimulates a yeast formin. 
The actin cables serve as tracks for directed transport and exocytosis of new cell 
wall material, resulting in the polarized growth of the shrnoo tip (Figure 16-100}, 

Haploid budding yeast cells use this same polarization machinery during 
vegetative growth. To form die hud that will grow out to become a daughter cell, 
the yeast must direct new plasma membrane and cell wall material primarily to 
a single site. As with shrnoo formation, this requires an initial cytoskeletal polar- 
ity, with most actin patches in the growing bud and actin cables oriented along 
the hud axis, in haploid cells, a new bud site is always constructed immediately 
adjacent to the previous bud site. In this case, the spatial cues that set up 
cytoskeletal polarity are intrinsic to the cell, left behind from previous rounds of 
cell division. Cdc42 is once again involved in transducing the signal from the 
destined bud sire to the cytoskeleton, and most of the proteins involved in the 
upstream and downstream pathways have been identified through genetic 




Figure 16-99 Morphological polarization 
of yeast cells in response to mating 
factor. (A) Cells of Saccharomyces 
cerevisiae are usually spherical. (B) They 
become polarized when treated with 
mating factor from cells of the opposite 
mating type. The polarized cells are called 
/ 'shmoos."(C) Al Capp's famous cartoon 
character, the original Shrnoo. (A and B, 
courtesy of Michael Snyder; C, © 1 948 
Capp Enterprises, Inc. Used by permission.) 
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experiments. Subsequent to their identification in yeast, many of these proteins 
have been found to have homologs in other organisms, where they are often 
likewise involved in the establishment of cell polarity. 
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Figure 16-1 00 The signaling pathway in 
the yeast mating factor response. The 

extracellular mating factor binds to a 
G-protein-coupled receptor in the plasma 
membrane. Activation of the receptor 
triggers dissociation of the GTP-bound 
Get subunit from a heterotrimeric 
G-protein (discussed in Chapter 15). This 
in turn activates the Rho family GTP- 
binding protein, Cdc42. As in mammalian 
cells, Cdc42 activates a WASp family 
protein that activates the ARP complex, 
leading to local actin nucleation at the 
site of mating factor binding. The local 
actin nucleation and filament growth 
create a positive feedback loop whereby 
Cdc42 activity is further enhanced. This 
leads to extensive Rho and formin 
activation, and finally to actin cable 
formation, polarized growth, and 
acquisition of a shmoo shape. In addition, 
receptor activation triggers other 
responses through a MAP kinase cascade 
(discussed in Chapter 15), preparing the 
haploid cell for mating (not shown). 



External Signals Can Dictate the Direction of Cell Migration 

Chemotaxis is defined as cell movement in a direction controlled by a gradient 
of a diffusible chemical. This is a particularly interesting case where external sig- 
nals trigger the Rho family proteins to set up large-scale cell polarity by influ- 
encing the organization of the apparatus required for cell motility, described 
above. One well-studied example is the chemotactic movement of a class of 
white blood cells, called neutrophils, toward a source of bacterial infection. 
Receptor proteins on the surface of neutrophils enable them to detect the very 
low concentrations of the N-formylated peptides that are derived from bacterial 
proteins (only procaryotes begin protein synthesis with AT-formylmethionine). 
Using these receptors, neutrophils are guided to bacterial targets by their ability 
to detect a difference of only 1% in the concentration of these diffusible peptides 
on one side of the cell versus the other (Figure 16-101). 

Both in this case and in the similar chemotaxis of Dictyostelium amoebae 
toward a source of cyclic AMR a local polymerization of actin near the receptors 
is stimulated when the receptors bind their ligands. This actin polymerization 
response depends on the monomeric Rho-family GTPases discussed earlier. As 
in the shmooing yeast (see Figure 16-99), the responding cell extends a protru- 
sion toward the signal. For chemotactic cells, binding of the chemoattractant lig- 
and to its G-protein coupled receptor activates phosphoinositide 3' kinases 
(PI3Ks), which generates a lipid-based signaling molecule (PI(3,4,5)P3) that in 
turn activates the Rac GTPase. Rac then activates the ARP complex and lamel- 
lipodial protrusion results (see Figure 16-98). Through an unknown mechanism, 

Figure 16-101 Neutrophil polarization and chemotaxis. <GTCG> <TGTA> 
The pipette tip at the right is leaking a small amount of the peptide formyl- 
Met-Leu-Phe. Only bacterial proteins have formylated methionine residues, 
so the human neutrophil recognizes this peptide as the product of a foreign 
invader (discussed in Chapter 24). The neutrophil quickly extends a new 
lamellipodium toward the source of the chemoattractant peptide (top). It 
then extends this lamellipodium and polarizes its cytoskeleton so that 
contractile myosin II is located primarily at the rear, opposite the position of 
the lamellipodium (middle). Finally, the cell crawls toward the source of this 
peptide (bottom). If a real bacterium were the source of the peptide, rather 
than an investigator's pipette, the neutrophil would engulf the bacterium 
and destroy it (see also Figure 16-4). (From O.D.WeineretaL, Nat. Cell Biol. 
1:75-81, 1999. With permission from Macmillan Publishers Ltd.) 
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accumulation of the polarized actin web at the leading edge causes further local 
enhancement of PI3K activity in a positive feedback loop, strengthening the 
induction of protrusion. The PI(3,4,5)P 3 that activates Rac cannot diffuse far 
from its site of synthesis, since it is rapidly converted back into PIP2 by a consti- 
tutively active lipid phosphatase. At the same time, binding of the chemoattrac- 
tant ligand to its receptor activates another signaling pathway that turns on Rho 
and enhances myosin-based contractility. The two processes directly inhibit 
each other, and as a result, Rac activation dominates in the front of the cell and 
Rho activation dominates in the rear (Figure 16-102). This enables the cell to 
maintain its functional polarity with protrusion at the leading edge and con- 
traction at the back. 

Nondiffusible chemical cues attached to the extracellular matrix or to the 
surface of cells can also influence the direction of cell migration. When these sig- 
nals activate receptors, they can cause increased cell adhesion and directed 
actin polymerization. Most long-distance cell migrations in animals, including 
neural-crest-cell migration and the travels of neuronal growth cones, depend on 
a combination of diffusible and non- diffusible signals to steer the locomoting 
cells or growth cones to their proper destinations (see Figure 15-62). 



Figure 16-102 Signaling during 
neutrophil polarization. Bacteria that 
have invaded the human body secrete 
molecules that are recognized as foreign 
by the cells of the immune system, 
including neutrophils. Binding of the 
bacterial molecules to G-protein-coupled 
receptors on the neutrophils stimulate 
directed motility. These receptors are 
found all over the surface of the cell, but 
are more likely to be bound to the 
bacterial ligand at the front. Two distinct 
signaling pathways contribute to the 
cell's polarization. At the front of the cell, 
close to the source of the bacterial signal, 
stimulation of the Rac pathway leads, via 
the trimeric G protein Gj, to growth of 
protrusive actin networks. Second 
messengers within this pathway are 
short-lived, so protrusion is limited to the 
region of the cell closest to the stimulant. 
The same receptor also stimulates a 
second signaling pathway, via the 
trimeric G proteins G1 2 and G13 
(denoted G12/13), that triggers the 
activation of Rho. The two pathways are 
mutually antagonistic. Since Rac-based 
protrusion is active at the front of the cell, 
Rho is activated only at the rear of the 
cell, stimulating contraction of the cell 
rear and assisting directed movement. 
For a real-life example of the 
effectiveness of this signaling system, see 
Figure 16-4. 



Communication Between the Microtubule and Actin Cytoskeletons 
Coordinates Whole-Cell Polarization and Locomotion 

To help organize persistent movement in a particular direction cells use their 
microtubules along with their actin filaments. In many locomoting cells, the 
position of the centrosome is influenced by the location of protrusive actin poly- 
merization, being found on the forward side of the nucleus. The mechanism of 
centrosome reorientation is not clear, although there is evidence that the Rho 
family protein Cdc42 may be involved. It is thought that the activation of recep- 
tors on one edge of a cell might not only stimulate actin polymerization there 
(and therefore local protrusion) but also locally activate dynein-like motor pro- 
teins that move the centrosome by pulling on its microtubules. Several effector 
proteins downstream of Rac and Rho modulate microtubule dynamics directly. 
For example, a protein kinase activated by Rac can phosphorylate (and therefore 
inhibit) the tubulin binding protein stathmin (see Panel 16-3, pp. 994-995), 
destabilizing microtubules, and Rho activation appears to stabilize microtubules. 
In turn, microtubule dynamics influence actin rearrangements. The centro- 
some nucleates a large number of dynamic microtubules, and its repositioning 
means that many of these microtubules have their plus ends extending from the 
centrosome into the protrusive region of the cell. The dynamic microtubule plus 
ends may indirectly modulate local adhesion and also activate the Rac GTPase 
to further increase actin polymerization in the protrusive region by delivering 
Rac-GEFs that bind to the +TIPs traveling on growing microtubule ends. The 
increased concentration of microtubules would thereby encourage further pro- 
trusion, creating a positive feedback loop that enables protrusive motility to per- 
sist in the same direction for a prolonged period. Regardless of the exact mech- 
anism, the orientation of the centrosome seems to reinforce the polarity infor- 
mation that the actin cytoskeleton receives from the outside world, allowing a 
sensitive response to weak signals. 
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Figure 16-1 03 The polarization of a cytotoxic Tcell after target-cell 
recognition. (A) Changes in the cytoskeleton of a cytotoxic T cell after it 
has made contact with a target cell. The initial recognition event results in 
signals that cause actin polymerization in both cells at the site of contact. 
In the T cell, interactions between the actin-rich contact zone and 
microtubules emanating from the centrosome result in reorientation of the 
centrosome, so that the associated Golgi apparatus is directly apposed to 
the target cell. (B) Immuno-fluorescence micrograph in which both the 
T cell (top) and its target cell (bottom) have been stained with an antibody 
against microtubules. The centrosome and the microtubules radiating from 
it in the T cell are oriented toward the point of cell-cell contact. In contrast, 
the microtubule array in the target cell is not polarized. (B, from B. Geiger, 
D. Rosen and G. Berke, J. Cell Biol. 95:137-143, 1982. With permission from 
The Rockefeller University Press.) 

A similar cooperative feedback loop seems to operate in many other 
instances of cell polarization. A particularly interesting example is the killing of 
specific target cells by T lymphocytes. These cells are a critical component of the 
vertebrate's adaptive immune response to infection by viruses. T cells, like neu- 
trophils, use actin-based motility to crawl through the body's tissue and find 
infected target cells. When a T cell comes into contact with a virus-infected cell 
and its receptors recognize foreign viral antigens on the surface of the target cell, 
the same polarization machinery is engaged in a very different way to facilitate 
killing of the target cell. Rac is activated at the point of cell-cell contact and 
causes actin polymerization at this site, creating a specialized region of the cor- 
tex. This specialized site causes the centrosome to reorient, moving with its 
microtubules to the zone of T-cell-target contact (Figure 16-103). The micro- 
tubules, in turn, position the Golgi apparatus right under the contact zone, 
focusing the killing machinery onto the target cell. The mechanism of killing is 
discussed in Chapter 25 (see Figure 25-47). 
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The Complex Morphological Specialization of Neurons Depends 
on the Cytoskeleton 

For our final case study of the ways that the intrinsic properties of the eucaryotic 
cytoskeleton enable specific and enormously complicated large-scale cell behav- 
iors, we examine the neuron. Neurons begin life in the embryo as unremarkable 
cells, which use actin-based motility to migrate to specific locations. Once there, 
however, they send out a series of long specialized processes that will either 
receive electrical signals (dendrites) or transmit electrical signals (axons) to their 
target cells. <TACC> The beautiful and elaborate branching morphology of axons 
and dendrites enables neurons to form tremendously complex signaling net- 
works, interacting with many other cells simultaneously and making possible the 
complicated and often unpredictable behavior of the higher animals. Both axons 
and dendrites (collectively called neurites) are filled with bundles of microtubules 
that are critical to both their structure and their function. 

In axons, all the microtubules are oriented in the same direction, with their 
minus end pointing back toward the cell body and their plus end pointing for- 
ward toward the axon terminals (Figure 16-104). The microtubules do not reach 
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from the cell body all the way to the axon terminals; each is typically only a few 
micrometers in length, but large numbers are staggered in an overlapping array. 
This set of aligned microtubule tracks acts as a highway to transport many spe- 
cific proteins, protein-containing vesicles, and mRNAs to the axon terminals, 
where synapses must be constructed and maintained. The longest axon in the 
human body reaches from the base of the spinal cord to the foot, being up to a 
meter in length. 

Mitochondria, large numbers of specific proteins in transport vesicles, and 
synaptic vesicle precursors make the long journey in the forward (anterograde) 
direction. They are carried there by plus-end-directed kinesin-family motor pro- 
teins that can move them a meter in as little as two or three days, which is a great 
improvement over diffusion, which would take approximately several decades 
to move a mitochondrion this distance. Many members of the kinesin super- 
family contribute to this anterograde axonal transport, most carrying specific 
subsets of membrane-enclosed organelles along the microtubules. The great 
diversity of the kinesin family motor proteins used in axonal transport suggests 
that they are involved in targeting their cargo to specific structures near the ter- 
minus or along the way as well as in cargo movement. Old components from the 
axon terminals are carried back to the cell body for degradation and recycling by 
a retrograde axonal transport. This transport occurs along the same set of ori- 
ented microtubules, but it relies on cytoplasmic dynein, which is a minus-end- 
directed motor protein. Retrograde transport is also critical for communicating 
the presence of growth and survival signals received by the nerve terminus back 
to the nucleus, in order to influence gene expression. 

One form of human peripheral neuropathy, Charcot-Marie-Tooth disease, is 
caused by a point mutation in a particular kinesin family member that trans- 
ports synaptic vesicle precursors down the axon. Other kinds of neurodegener- 
ative diseases such as Alzheimer's disease may also be caused in part by disrup- 
tions in neuronal trafficking; as pointed out previously, the amyloid precursor 
protein APP is part of a protein complex that serves as a receptor for kinesin- 1 
binding to other axonal transport vesicles. 

Axonal structure depends on the axonal microtubules, as well as on the con- 
tributions of the other two major cytoskeletal systems— actin filaments and 
intermediate filaments. Actin filaments line the cortex of the axon, just beneath 
the plasma membrane, and actin-based motor proteins such as myosin V are 
also abundant in the axon, presumably to help move materials. Neurofilaments, 
the specialized intermediate filaments of nerve cells, provide the most impor- 
tant structural support in the axon. A disruption in neurofilament structure, or 
in the cross-linking proteins that attach the neurofilaments to the microtubules 
and actin filaments distributed along the axon, can result in axonal disorganiza- 
tion and eventually axonal degeneration. 

The construction of the elaborate branching architecture of the neuron dur- 
ing embryonic development requires actin-based motility. As mentioned earlier, 
the tips of growing axons and dendrites extend by means of a growth cone, a spe- 
cialized motile structure rich in actin (Figure 16-105). Most neuronal growth 
cones produce filopodia, and some make lamellipodia as well. The protrusion 



Figure 16-104 Microtubule organization 
in fibroblasts and neurons. (A) In a 
fibroblast, microtubules emanate 
outward from the centrosome in the 
middle of the cell. Vesicles with plus-end- 
directed kinesin attached move outward, 
and vesicles with minus-end-directed 
dynein attached move inward. (B) In a 
neuron, microtubule organization is more 
complex. In the axon, all microtubules 
share the same polarity, with the plus 
ends pointing outward toward the axon 
terminus. No one microtubule stretches 
the entire length of the axon; instead, 
short overlapping segments of parallel 
microtubules make the tracks for fast 
axonal transport. In dendrites, the 
microtubules are of mixed polarity, with 
some plus ends pointing outward and 
some pointing inward. 
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Figure 16-105 Neuronal growth cones. 

<AAGA> (A) Scanning electron micrograph 
of two growth cones at the end of a 
neurite, put out by a chick sympathetic 
neuron in culture. Here, a previously single 
growth cone has recently split into two. 
Note the many filopodia and the large 
lamellipodia. The taut appearance of the 
neurite is due to tension generated by the 
forward movement of the growth cones, 
which are often the only firm points of 
attachment of the axon to the substratum. 
(B) Scanning electron micrograph of the 
growth cone of a sensory neuron crawling 
over the inner surface of the epidermis of a 
Xenopus tadpole. (A, from D. Bray, in Cell 
Behaviour [R. Bellairs, A. Curtis and 
G. Dunn, edsj. Cambridge, UK: Cambridge 
University Press, 1982; B, from A. Roberts, 
Brain Res. 1 1 8:526-530, 1 976. With 
permission from Elsevier.) 



and stabilization of growth-cone filopodia are exquisitely sensitive to environ- 
mental cues. Some cells secrete soluble proteins such as netrin to attract or repel 
growth cones. These modulate the structure and motility of the growth cone 
cytoskeleton by altering the balance between Rac activity and Rho activity at the 
leading edge (see Figure 15-62). In addition, there are fixed guidance markers 
along the way, attached to the extracellular matrix or to the surfaces of cells. 
When a filopodium encounters such a "guidepost" in its exploration, it quickly 
forms adhesive contacts. It is thought that a myosin-dependent collapse of the 
actin meshwork in the unstabilized part of the growth cone then causes the 
developing axon to turn toward the guidepost. 

Thus, a complex combination of positive and negative signals, both soluble 
and insoluble, accurately guide the growth cone to its final destination. Micro- 
tubules then reinforce the directional decisions made by the actin-rich protru- 
sive structures at the leading edge of the growth cone. Microtubules from the 
axonal parallel array just behind the growth cone are constantly growing into the 
growth cone and shrinking back by dynamic instability. Adhesive guidance sig- 
nals are somehow relayed to the dynamic microtubule ends, so that micro- 
tubules growing in the correct direction are stabilized against disassembly. In 
this way, a microtubule-rich axon is left behind, marking the path that the 
growth cone has traveled. 

Dendrites are generally much shorter projections than axons, and they 
receive synaptic inputs rather than being specialized for sending signals like 
axons. The microtubules in dendrites all lie parallel to one another but their 
polarities are mixed, with some pointing their plus ends toward the dendrite tip, 
while others point back toward the cell body. Nevertheless, dendrites also form 
as the result of growth-cone activity. Therefore, it is the growth cones at the tips 
of axons and dendrites that create the intricate, highly individual morphology of 
each mature neuronal cell (Figure 16-106). 
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Figure 1 6-1 06 The complex architecture 
of a vertebrate neuron. The neuron 
shown is from the retina of a monkey. 
The arrows indicate the direction of travel 
of the electrical signal along the axon. 
The longest and largest neurons in the 
human body extend for a distance of 
about 1 m (1 million fim), from the base 
of the spinal cord to the tip of the big 
toe, and have an axon diameter of 15 jam. 
{Adapted from B.B. Boycott, in Essays on 
the Nervous System [R. Bellairs and 
E.G. Gray, eds.]. Oxford, UK: Clarendon 
Press, 1974.) 



1 050 Chapter 1 6: The Cytoskeleton 




20 40 

time in mins 



10 ^m 



Although the neurons of the central nervous system are long-lived cells, they 
are by no means static. Synapses are constantly being created, strengthened, 
weakened, and eliminated as the brain learns, evaluates, and forgets. High-res- 
olution imaging of the structure of neurons in the brains of adult mice has 
revealed that neuronal morphology is undergoing constant rearrangement as 
synapses are forged and broken (Figure 16-107). These actin-dependent rear- 
rangements are thought to be critical in learning and long-term memory. In this 
way, the cytoskeleton provides the engine for construction of the entire nervous 
system, as well as producing the supporting structures that strengthen, stabilize, 
and maintain its parts. 



Figure 16-107 Rapid changes in dendrite 
structure within a living mouse brain. 

(A) Image of cortical neurons in a transgenic 
mouse that has been engineered to express 
green fluorescent protein in a small fraction 
of its brain cells. Changes in these brain 
neurons and their projections can be 
followed for months using highly sensitive 
fluorescence microscopy. To make this 
possible, the mouse is subjected to an 
operation that introduces a small 
transparent window through its skull, and it 
is anesthetized each time that an image is 
recorded. (B) A single dendrite, imaged over 
the period of 80 minutes, demonstrates that 
dendrites are constantly sending out and 
retracting tiny actin-dependent protrusions 
to create the dendritic spines that receive 
the vast majority of excitatory synapses 
from axons in the brain. Those spines that 
become stabilized and persist for months 
are thought to be important for brain 
function, and may be involved in long-term 
memory. (Courtesy of Karel Svoboda.) 



Summary 

Two distinct types of specialized structures in eucaryotic cells are formed from highly 
ordered arrays of motor proteins that move on stabilized filament tracks. The 
myosln-actin system of the sarcomere powers the Contraction of various types of mus- 
cle, including skeletal smooth, and cardiac muscle. The dynvin- microtubule system of 
the axoneme powers the beating of alia and the undulations offlagella. 

Whole-cell movements and the large-scale shaping and structuring of cells require 
the coordinated activities of all three basic filament systems along with a large variety 
of cytoskeletal accessory protei?™, including motor proteins, During cell division, the 
functions of the micmtubule-based mitotic spindle require spatial and temporal coop- 
eration between dynamic cytoskeletal filaments, active molecular motor proteins, and 
a wide variety of accessory factors. Cell crawling— a widespread behavior important in 
embryonic development and also in wound healing, tissue maintenance, and immune 
system function in the adult animal— is another prime example of such complex, 
coordinated cytoskeletal action. For a cell to crawl, it must generate and maintain an 
overall structural polarity, which is influenced by external cues. In addition, the cell 
must coordinate protrusion at the leading edge (by assembly of new actin filaments), 
adhesion of the newly protruded part of the cell to the substratum, forces generated by 
molecular motors to bring the cell body forward. 

Complex cells, such as neurons, require the coordinated assembly of microtubules, 
neurofilaments (neuronal intermediate filaments), and actin filaments, as well as the 
actions of dozens of highly specialized molecular motors that transport subcellular 
components to their appropriate destinations. 



PROBLEMS 

Which statements are true? Explain why or why not. 

16-1 The role of ATP hydrolysis in actin polymerization is 
similar to the role of GTP hydrolysis in tubulin polymeriza- 
tion: both serve to weaken the bonds in the polymer and 
thereby promote depolymerization. 



16-2 In most animal cells, minus end-directed micro- 
tubule motors deliver their cargo to the periphery of the cell, 
whereas plus end-directed microtubule motors deliver their 
cargo to the interior of the cell. 

16-3 Motor neurons trigger action potentials in muscle 
cell membranes that open voltage- sensitive Ca 2+ channels 
in T- tubules, allowing extracellular Ca 2+ to enter the cytosol, 
bind to troponin C, and initiate rapid muscle contraction. 
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Discuss the following problems. 

16-4 At 1.4 mg/mL pure tubulin, microtubules grow at a 
rate of about 2 um/min. At this growth rate how many oc|3- 
tubulin dimers (8 nm in length) are added to the ends of a 
microtubule each second? 

16-5 A solution of pure a(3-tubulin dimers is thought to 
nucleate microtubules by forming a linear protofilament 
about seven dimers in length. At that point, the probabilities 
that the next a(3-dimer will bind laterally or to the end of the 
protofilament are about equal. The critical event for micro- 
tubule formation is thought to be the first lateral association 
(Figure Q16-1). How does lateral association promote the 
subsequent rapid formation of a microtubule? 

LINEAR GROWTH LATERAL ASSOCIATION 



(A) EXPERIMENTAL SETUP 
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Figure Q16-1 Model for microtubule nucleation by pure ap-tubulin 

dimers (Probiem 16-5). 

16-6 How does a centrosome "know" when it has found 
the center of the cell? 

16-7 The concentration of actin in cells is 50-100 times 
greater than the critical concentration observed for pure 
actin in a test tube. How is this possible? What prevents the 
actin subunits in cells from polymerizing into filaments? 
Why is it advantageous to the cell to maintain such a large 
pool of actin subunits? 

16-8 The movements of single motor-protein molecules 
can be analyzed directly. Using polarized laser light, it is pos- 
sible to create interference patterns that exert a centrally 
directed force, ranging from zero at the center to a few 
piconewtons at the periphery (about 200 nm from the cen- 
ter). Individual molecules that enter the interference pat- 
tern are rapidly pushed to the center, allowing them to be 
captured and moved at the experimenter's discretion. 

Using such "optical tweezers," single kinesin molecules 
can be positioned on a microtubule that is fixed to a cover- 
slip. Although a single kinesin molecule cannot be seen 
optically, it can be tagged with a silica bead and tracked indi- 
rectly by following the bead (Figure Q16-2A). In the absence 
of ATP, the kinesin molecule remains at the center of the 
interference pattern, but with ATP it moves toward the plus 
end of the microtubule. As kinesin moves along the micro- 
tubule, it encounters the force of the interference pattern, 
which simulates the load kinesin carries during its actual 
function in the cell. Moreover, the pressure against the silica 
bead counters the effects of Brownian (thermal) motion, so 
that the position of the bead more accurately reflects the 
position of the kinesin molecule on the microtubule. 

Traces of the movements of a kinesin molecule along a 
microtubule are shown in Figure Q16-2B. 
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Figure Ql 6-2 Movement of kinesin along a microtubule (Problem 
16-8). (A) Experimental setup with kinesin linked to a silica bead, 
moving along a microtubule. (B) Position of kinesin (as visualized by 
position of silica bead) relative to center of interference pattern, as a 
function of time of movement along the microtubule. The jagged 
nature of the trace results from Brownian motion of the bead. 



A. As shown in Figure Q16-2B, all movement of kinesin is 
in one direction (toward the plus end of the microtubule). 
What supplies the free energy needed to ensure a unidirec- 
tional movement along the microtubule? 

B. What is the average rate of movement of kinesin along 
the microtubule? 

C What is the length of each step that a kinesin takes as it 
moves along a microtubule? 

D. From other studies it is known that kinesin has two 
globular domains that each can bind to P-tubulin, and that 
kinesin moves along a single protofilament in a micro- 
tubule. In each protofilament the P-tubulin subunit repeats 
at 8-nm intervals. Given the step length and the interval 
between p-tubulin subunits, how do you suppose a kinesin 
molecule moves along a microtubule? 

E. Is there anything in the data in Figure Q16-2B that tells 
you how many ATP molecules are hydrolyzed per step? 

1 6-9 How is the unidirectional motion of a lamellipodium 
maintained? 

16 10 Detailed measurements of sarcomere length and ten- 
sion during isometric contraction in striated muscle pro- 
vided crucial early support for the sliding filament model of 
muscle contraction. Based on your understanding of the 
sliding filament model and the structure of a sarcomere, 
propose a molecular explanation for the relationship of ten- 
sion to sarcomere length in the portions of Figure Q16-3 
marked I, II, III, and IV (In this muscle, the length of the 
myosin filament is 1.6 jam and the lengths of the actin thin 
filaments that project from the Z discs are 1.0 urn.) 

Figure Ql 6-3 Tension as a 
function of sarcomere length 
during isometric contraction 
*100 ' (Problem 16-10). 
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The Cell Cycle 




The only way to make a new cell is to duplicate a cell that already exists. This 
simple fact, first established in the middle of the nineteenth century, carries with 
it a profound message for the continuity of life. All living organisms, from the 
unicellular bacterium to the multicellular mammal, are products of repeated 
rounds of cell growth and division extending back in time to the beginnings of 
life on Earth over three billion years ago. 

A cell reproduces by performing an orderly sequence of events in which it 
duplicates its contents and then divides in two. This cycle of duplication and 
division, known as the cell cycle, is the essential mechanism by which all living 
things reproduce. In unicellular species, such as bacteria and yeasts, each cell 
division produces a complete new organism. In multicellular species, long and 
complex sequences of cell divisions are required to produce a functioning 
organism. Even in the adult body, cell division is usually needed to replace cells 
that die. In fact, each of us must manufacture many millions of cells every sec- 
ond simply to survive: if all cell division were stopped — by exposure to a very 
large dose of x-rays, for example — we would die within a few days. 

The details of the cell cycle vary from organism to organism and at different 
times in an organism's life. Certain characteristics, however, are universal. The 
minimum set of processes that a cell has to perform are those that allow it to 
accomplish its most fundamental task: the passing on of its genetic information 
to the next generation of cells. To produce two genetically identical daughter 
cells, the DNA in each chromosome must first be faithfully replicated to produce 
two complete copies, and the replicated chromosomes must then be accurately 
distributed (segregated) to the two daughter cells, so that each receives a copy of 
the entire genome (Figure 17-1). 

Eucaryotic cells have evolved a complex network of regulatory proteins, 
known as the cell-cycle control system, that governs progression through the cell 
cycle. The core of this system is an ordered series of biochemical switches that 
initiate the main events of the cycle, including chromosome duplication and 
segregation. In most cells, additional layers of regulation enhance the fidelity of 
cell division and allow the control system to respond to various signals from 
both inside and outside the cell. Inside the cell, the control system monitors pro- 
gression through the cell cycle and delays later events until earlier events have 
been completed. It does not permit preparations for the segregation of dupli- 
cated chromosomes, for example, until DNA replication is complete. The con- 
trol system also monitors conditions outside the cell. In a multicellular animal, 
the system is highly responsive to signals from other cells, stimulating cell divi- 
sion when more cells are needed and blocking it when they are not. The cell- 
cycle control system therefore has a central role in regulating cell numbers in the 
tissues of the body. When the system malfunctions, excessive cell divisions can 
result in cancer. 

In addition to duplicating their genome, most cells also duplicate their other 
organelles and macromolecules; otherwise, daughter cells would get smaller 
with each division. To maintain their size, dividing cells must coordinate their 
growth (that is, their increase in cell mass) with their division. 

This chapter describes the various events of the cell cycle and how they are 
controlled and coordinated. We begin with a brief overview of the cell cycle. 
We then describe the cell-cycle control system and explain how it triggers the 
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Figure 1 7-1 The cell cycle. The division of a 
hypothetical eucaryotic cell with two 
chromosomes is shown to illustrate how 
two genetically identical daughter cells are 
produced in each cycle. Each of the 
daughter cells will often continue to divide 
by going through additional cell cycles. 



different events of the cycle. We next consider in detail the major stages of the 
cell cycle, in which the chromosomes are duplicated and then segregated into 
the two daughter cells. Finally, we consider how extracellular signals govern the 
rates of cell growth and division and how these two processes are coordinated. 



OVERVIEW OF THE CELL CYCLE 

We begin this section with a brief description of the four phases of the eucaryotic 
cell cycle. We then consider some of the methods and model cell systems used 
to study the cell cycle. 

The Eucaryotic Cell Cycle Is Divided into Four Phases 

The most basic function of the cell cycle is to duplicate accurately the vast 
amount of DNA in the chromosomes and then segregate the copies precisely 
into two genetically identical daughter cells. These processes define the two 
major phases of the cell cycle. Chromosome duplication occurs during S phase 
(S for DNA synthesis), which requires 10-12 hours and occupies about half of the 
cell-cycle time in a typical mammalian cell. After S phase, chromosome segre- 
gation and cell division occur in M phase (M for mitosis), which requires much 
less time (less than an hour in a mammalian cell). M phase comprises two major 
events: nuclear division, or mitosis, during which the copied chromosomes are 
distributed into a pair of daughter nuclei; and cytoplasmic division, or cytokine- 
sis, when the cell itself divides in two (Figure 17-2). 

At the end of S phase, the DNA molecules in each pair of duplicated chro- 
mosomes are intertwined and held tightly together by specialized protein link- 
ages. Early in mitosis at a stage called prophase, the two DNA molecules are grad- 
ually disentangled and condensed into pairs of rigid and compact rods called sis- 
ter chromatids, which remain linked together by sister-chromatid cohesion. 
When the nuclear envelope disassembles later in mitosis, the sister chromatid 
pairs become attached to the mitotic spindle, a giant bipolar array of micro- 
tubules (discussed in Chapter 16). Sister chromatids are attached to opposite 
poles of the spindle, and, eventually, all sisters align at the spindle equator in 
a stage called metaphase. The destruction of sister-chromatid cohesion at the 
start of anaphase separates the sister chromatids, which are pulled to opposite 
poles of the spindle. The spindle is then disassembled, and the segregated 
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Figure 1 7-2 The major events of the cell 
cycle. The major chromosomal events of 
the cell cycle occur in S phase, when the 
chromosomes are duplicated, and 
M phase, when the duplicated 
chromosomes are segregated into a pair 
of daughter nuclei (in mitosis), after 
which the cell itself divides into two 
(cytokinesis). 
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chromosomes are packaged into separate nuclei at telophase. Cytokinesis then 
cleaves the cell in two, so that each daughter cell inherits one of the two nuclei 
(Figure 17-3). <TACT> <TCAA> 

Most cells require much more time to grow and double their mass of pro- 
teins and organelles than they require to duplicate their chromosomes and 
divide. Partly to allow more time for growth, most cell cycles have extra gap 
phases— a Gi phase between M phase and S phase and a G2 phase between S 
phase and mitosis. Thus, the eucaryotic cell cycle is traditionally divided into 
four sequential phases: Gi, S, G2, and M. Gi, S, and G 2 together are called inter- 
phase (Figure 17-4, and see Figure 17-3). In a typical human cell proliferating in 
culture, interphase might occupy 23 hours of a 24-hour cycle, with 1 hour for M 
phase. Cell growth occurs throughout the cell cycle, except during mitosis. 

The two gap phases are more than simple time delays to allow cell growth. 
They also provide time for the cell to monitor the internal and external environ- 
ment to ensure that conditions are suitable and preparations are complete 
before the cell commits itself to the major upheavals of S phase and mitosis. The 
Gi phase is especially important in this respect. Its length can vary greatly 
depending on external conditions and extracellular signals from other cells. If 
extracellular conditions are unfavorable, for example, cells delay progress 
through Gi and may even enter a specialized resting state known as Go (G zero), 
in which they can remain for days, weeks, or even years before resuming prolif- 
eration. Indeed, many cells remain permanently in G until they or the organism 
dies. If extracellular conditions are favorable and signals to grow and divide are 
present, cells in early Gi or Go progress through a commitment point near the 
end of Gi known as Start (in yeasts) or the restriction point (in mammalian 
cells). We will use the term Start for both yeast and animal cells. After passing 
this point, cells are committed to DNA replication, even if the extracellular sig- 
nals that stimulate cell growth and division are removed. 



Figure 1 7-3 The events of eucaryotic 
cell division as seen under a 
microscope. The easily visible processes 
of nuclear division (mitosis) and cell 
division (cytokinesis), collectively called 
M phase, typically occupy only a small 
fraction of the cell cycle. The other, much 
longer, part of the cycle is known as 
interphase, which includes S phase and 
the gap phases (discussed in text). The 
five stages of mitosis are shown: an 
abrupt change in the biochemical state 
of the cell occurs at the transition from 
metaphase to anaphase. A cell can pause 
in metaphase before this transition point, 
but once it passes this point, the cell 
carries on to the end of mitosis and 
through cytokinesis into interphase. 
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Figure 1 7-4 The four phases of the cell 
cycle. In most cells, gap phases separate 
the major events of S phase and M phase. 
Gi is the gap between M phase and 
S phase, while G 2 is the gap between 
S phase and M phase. 
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Cell-Cycle Control Is Similar in All Eucaryotes 

Some features of the cell cycle, including the time required to complete certain 
events, vary greatly from one cell type to another, even in the same organism. 
The basic organization of the cycle, however, is essentially the same in all 
eucaryotic cells, and all eucaryotes appear to use similar machinery and control 
mechanisms to drive and regulate cell-cycle events. The proteins of the cell- 
cycle control system, for example, first appeared over a billion years ago. 
Remarkably, they have been so well conserved over the course of evolution that 
many of them function perfectly when transferred from a human cell to a yeast 
cell. We can therefore study the cell cycle and its regulation in a variety of organ- 
isms and use the findings from all of them to assemble a unified picture of how 
eucaryotic cells divide. In the rest of this section, we briefly review the three 
eucaryotic systems most commonly used to study cell-cycle organization and 
control: yeasts, animal embryos, and cultured mammalian cells. 



Cell-Cycle Control Can Be Dissected Genetically by Analysis of 
Yeast Mutants 

Yeasts are tiny, single-celled fungi, with a cell-cycle control system remarkably 
similar to our own. Two species are generally used in studies of the cell cycle. The 
fission yeast Schizosaccharomyces pombe is named after the African beer it is 
used to produce. It is a rod-shaped cell that grows by elongation at its ends. Divi- 
sion occurs when a septum, or cell plate, forms midway along the rod (Figure 
17-5A). The budding yeast Saccharomyces cerevisiae is used by both brewers 
and bakers. It is an oval cell that divides by forming a bud, which first appears 
during G x and grows steadily until it separates from the mother cell after mito- 
sis (Figure 17-5B). 

Despite their outward differences, the two yeast species share many features 
that are extremely useful for genetic studies, They reproduce almost as rapidly as 
bacteria and have a genome size less than 1% that of a mammal. They are 
amenable to rapid molecular genetic manipulation, in which genes can be 
deleted, replaced, or altered, Most importantly they have the ability to prolifer- 
ate in a hapioid state, with only a single copy of each gene present in the cell. 
When cells are hapioid it is easy to isolate and study mutations that inactivate a 
gene, because we avoid the complication of having a second copy of the gene in 
the cell. 

Many important discoveries about cell-cycle control have come from sys- 
tematic searches for mutations in yeasts that inactivate genes encoding essen- 
tial components of the cell-cycle control system. The genes affected by some of 
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Figure 1 7-5 A comparison of the cell 
cycles of fission yeasts and budding 
yeasts. (A) The fission yeast has a typical 
eucaryotic cell cycle with G 1# S, G2, and 
M phases. The nuclear envelope of the 
yeast cell, unlike that of a higher 
eucaryotic cell, does not break down 
during M phase. The microtubules of the 
mitotic spindle (light green) form inside 
the nucleus and are attached to spindle 
pole bodies (dark green) at its periphery. 
The cell divides by forming a partition 
(known as the cell plate) and splitting in 
two. (B)The budding yeast has normal Gt 
and S phases but does not have a normal 
G 2 phase. Instead, a microtubule-based 
spindle begins to form late in S phase; as 
in fission yeasts, the nuclear envelope 
remains intact during mitosis, and the 
spindle forms within the nucleus. In 
contrast with a fission yeast cell, the cell 
divides by budding. 
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these mutations are known as cell-division-cycle genes, or Cdc genes. Many of 
these mutations cause cells to arrest at a specific point in the cell cycle, suggest- 
ing that the normal gene product is required to get the cell past this point. 

A mutant that cannot complete the cell cycle, however, cannot be propa- 
gated. Thus, Cdc mutants can be selected and maintained only if their pheno- 
type is conditional— that is, if the mutant gene product fails to function only in 
certain specific conditions. Most conditional cell-cycle mutations are tempera- 
ture-sensitive mutations, in which the mutant protein fails to function at high 
temperatures but functions well enough to allow cell division at low tempera- 
tures. A temperature-sensitive Cdc mutant can be propagated at a low tempera- 
ture (the permissive condition) and then raised to a higher temperature (the 
restrictive condition) to switch off the function of the mutant gene. At the higher 
temperature, cells continue through the cell cycle until they reach the point 
where the function of the mutant gene is required for further progress, and at 
this point they halt (Figure 17-6). In budding yeasts, we can detect a uniform 
cell-cycle arrest of this type by just looking at the cells: the presence or absence 
of a bud, and bud size, indicate the point in the cycle at which the mutant is 
arrested (Figure 17-7). 



Figure 1 7-6 The behavior of a 
temperature-sensitive Cdc mutant. 

(A) At the permissive (low) temperature, the 
cells divide normally and are found in all 
phases of the cycle (the phase of the cell is 
indicated by its color). (B) On warming to 
the restrictive (high) temperature, at which 
the mutant gene product functions 
abnormally, the mutant cells continue to 
progress through the cycle until they come 
to the specific step that they are unable to 
complete (initiation of S phase, in this 
example). Because the Cdc mutants still 
continue to grow, they become abnormally 
large. By contrast, non-Cc/c mutants, if 
deficient in a process that is necessary 
throughout the cycle for biosynthesis and 
growth (such as ATP production), halt 
haphazardly at any stage of the cycle — 
depending on when their biochemical 
reserves run out (not shown). 



Cell-Cycle Control Can Be Analyzed Biochemically in Animal 
Embryos 

The biochemical features of the cell cycle are readily analyzed in the giant fertil- 
ized eggs of many animals, which carry large stockpiles of the proteins needed 
for cell division. The egg of the frog Xenopus, for example, is over 1 mm in diam- 
eter and contains 100,000 times more cytoplasm than an average cell in the 
human body (Figure 17-8). Fertilization of the Xenopus egg triggers an aston- 
ishingly rapid sequence of cell divisions, called cleavage divisions, in which the 
single giant cell divides, without growing, to generate an embryo containing 
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Figure 1 7-7 The morphology of budding yeast cells arrested by a Cdc mutation. (A) In a 

normal population of proliferating yeast cells, buds vary in size according to the cell-cycle stage. 
(B) In a CdclS mutant grown at the restrictive temperature, cells complete anaphase but cannot 
complete the exit from mitosis and cytokinesis. As a result, they arrest uniformly with large 
buds, which are characteristic of late M phase. (Courtesy of Jeff Ubersax.) 




0.5 mm 

Figure 1 7-8 A mature Xenopus egg, 
ready for fertilization. The pale spot near 
the top shows the site of the nucleus, 
which has displaced the brown pigment 
in the surface layer of the egg cytoplasm. 
Although this cannot be seen in the 
picture, the nuclear envelope has broken 
down during the process of egg 
maturation. (Courtesy of Tony Mills.) 
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Figure 1 7-9 Oocyte growth and egg cleavage in Xenopus. The oocyte grows without dividing for many months in the 
ovary of the mother frog and finally matures into an egg (discussed in Chapter 21). Upon fertilization, the egg cleaves very 
rapidly— initially at a rate of one division cycle every 30 minutes— forming a multicellular tadpole within a day or two. The 
cells get progressively smaller with each division, and the embryo remains the same size. Growth starts only when the 
tadpole begins feeding. The drawings in the top row are all on the same scale (but the frog below is not). 



thousands of smaller cells (Figure 17-9). <ATTT> After a first division that takes 
about 90 minutes, the next 1 1 divisions occur, more or less synchronously, at 30- 
minute intervals, producing about 4096 (2 12 ) cells within 7 hours. Each cycle is 
divided into S and M phases of about 15 minutes each, without detectable Gi or 
G2 phases. 

The early embryonic cells of Xenopus, as well as those of the clam Spisula and 
the fruit fly Drosophila, are thus capable of exceedingly rapid division in the 
absence of either growth or many of the control mechanisms (discussed later) 
that operate in more complex cell cycles. These early embryonic cell cycles there- 
fore reveal the workings of the cell-cycle control system stripped down and sim- 
plified to the minimum needed to achieve the most fundamental requirements — 
the duplication of the genome and its segregation into two daughter cells. 
Another advantage of these early embryos for cell-cycle analysis is their large size. 
It is relatively easy to inject test substances into an egg to determine their effect 
on cell-cycle progression. It is also possible to prepare almost pure cytoplasm 
from Xenopus eggs and reconstitute many events of the cell cycle in a test tube 
(Figure 17-10). In such cell extracts, we can observe and manipulate cell-cycle 
events under highly simplified and controllable conditions. 





j 
cell-free mitotic cycle: 40-60 min 




Figure 17-10 Studying the cell cycle in a 
cell-free system. Gentle centrifugation is 
used to break open a large batch of frog 
eggs and separate the cytoplasm from 
other cell components. The undiluted 
cytoplasm is collected, and sperm nuclei 
are added to it, together with ATP. The 
sperm nuclei decondense and then go 
through repeated cycles of DNA 
replication and mitosis, indicating that 
the cell-cycle control system is operating 
in this cell-free cytoplasmic extract. 
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Cell-Cycle Control Can Be Studied in Cultured Mammalian Cells 

It is not easy to observe individual cells in an intact mammal. Most studies on 
mammalian cell-cycle control therefore use cells that have been isolated from 
normal tissues or tumors and grown in plastic culture dishes in the presence of 
essential nutrients and other factors (Figure 17-11). There is a complication, 
however. When cells from normal mammalian tissues are cultured in standard 
conditions, they often stop dividing after a limited number of division cycles. 
Human fibroblasts, for example, permanently cease dividing after 25-40 divi- 
sions, a process called replicative cell senescence, as we discuss later. 

Mammalian cells occasionally undergo mutations that help them proliferate 
indefinitely in culture as "immortalized" cell lines. Although they are not normal, 
such cell lines are widely used for cell-cycle studies — and for cell biology gener- 
ally—because they provide an unlimited source of genetically homogeneous 
cells. In addition, these cells are sufficiently large to allow detailed cytological 
observations of cell-cycle events, and they are amenable to biochemical analy- 
sis of the proteins involved in cell-cycle control. 

Studies of cultured mammalian cells have been especially useful for exam- 
ining the molecular mechanisms governing the control of cell proliferation in 
multicellular organisms. Such studies are important not only for understanding 
the normal controls of cell numbers in tissues but also for understanding the 
loss of these controls in cancer (discussed in Chapter 20). 
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Figure 1 7-1 1 Mammalian cells 
proliferating in culture. The cells in this 
scanning electron micrograph are rat 
fibroblasts. (Courtesy of Guenter 
Albrecht-Buehler.) 



Cell-Cycle Progression Can Be Studied in Various Ways 

How can we tell what stage an animal cell has reached in the cell cycle? One way 
is simply to look at living cells with a microscope. A glance at a population of 
mammalian cells proliferating in culture reveals that a fraction of the cells have 
rounded up and are in mitosis (see Figure 17-11). Others can be observed in the 
process of cytokinesis. We can gain additional clues about cell-cycle position by 
staining cells with DNA-binding fluorescent dyes (which reveal the condensa- 
tion of chromosomes in mitosis) or with antibodies that recognize specific cel- 
lular components such as the microtubules (revealing the mitotic spindle). Sim- 
ilarly S-phase cells can be identified in the microscope by supplying them with 
visualizable molecules that are incorporated into newly synthesized DNA, such 
as the artificial thymidine analog bromo-deoxyuridine (BrdU). Cell nuclei that 
have incorporated BrdU are then visualized by staining with anti-BrdU antibod- 
ies (Figure 17-12). 

Typically in a population of cells that are all proliferating rapidly but asyn- 
chronously, about 30-40% will be in S phase at any instant and become labeled 
by a brief pulse of BrdU. From the proportion of cells in such a population that 
are labeled (the labeling index), we can estimate the duration of S phase as a 
fraction of the whole cell-cycle duration. Similarly from the proportion of cells 
in mitosis (the mitotic index), we can estimate the duration of M phase. In addi- 
tion, by giving a pulse of BrdU and allowing the cells to continue around the 
cycle for measured lengths of time, we can determine how long it takes for an S- 
phase cell to progress through G2 into M phase, through M phase into Gi, and 
finally through Gi back into S phase. 

Another way to assess the stage that a cell has reached in the cell cycle is by 
measuring its DNA content, which doubles during S phase. This approach is 
greatly facilitated by the use of fluorescent DNA-binding dyes and a flow cytome- 
ter, which allows the rapid and automatic analysis of large numbers of cells (Fig- 
ure 17-13). We can also use flow cytometry to determine the lengths of Gi, S, 



Figure 17-12 Labeling S-phase cells. An immunofluorescence micrograph 
of BrdU-labeled epithelial cells of the zebrafish gut. The fish was exposed to 
BrdU, after which the tissue was fixed and prepared for labeling with 
fluorescent anti-BrdU antibodies (green). All the cells are stained with a red 
fluorescent dye. (Courtesy of Cecile Crosnier.) 
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Figure 1 7-1 3 Analysis of DNA content with a flow cytometer. This graph 

shows typical results obtained for a proliferating cell population when the 

DNA content of its individual cells is determined in a flow cytometer. ce \\^ j n g 1 

(A flow cytometer, also called a fluorescence-activated cell sorter, or FACS, phase 

can also be used to sort cells according to their fluorescence — see Figure 

8-2). The cells analyzed here were stained with a dye that becomes 

fluorescent when it binds to DNA, so that the amount of fluorescence is 

directly proportional to the amount of DNA in each celL The cells fall into 

three categories: those that have an unreplicated complement of DNA and 

are therefore in Gi, those that have a fully replicated complement of DNA 

(twice the Gi DNA content) and are in G 2 or M phase, and those that have 

an intermediate amount of DNA and are in S phase. The distribution of cells 

in the case illustrated indicates that there are greater numbers of cells in 



Gi than in G 2 + M phase, showing that d is longer than G 2 + M in this | M phases* *" 

population. f — , 

I cells \n 
[ S phase 

and G2 + M phases, by following over time a population of DNA-labeled cells 

that have been preselected to be in one particular phase of the cell cycle: DNA 

content measurements on such a synchronized population of cells reveal how 1 2 

the cells progress through the cycle. re i ative amount of dna per cell 




Summary 

Cell division usually begins with duplication of the cell's contents, followed by distribu- 
tion of those contents into two daughter cells. Chromosome duplication occurs during 
S phase of the cell cycle, whereas most other cell components are duplicated continu- 
ously throughout the cycle. During M phase, the replicated chromosomes are segregated 
into individual nuclei (mitosis), and the cell then splits in two (cytokinesis). S phase and 
M phase are usually separated by gap phases called Gj and G2, when various intracel- 
lular and extracellular signals regulate cell-cycle progression. Cell-cycle organization 
and control have been highly conserved during evolution, and studies in a wide range 
of systems — including yeasts, animal embryos, and mammalian cells in culture — have 
led to a unified view ofeucaryotic cell-cycle control. 



THE CELL-CYCLE CONTROL SYSTEM 

For many years cell biologists watched the puppet show of DNA synthesis, mito- 
sis, and cytokinesis but had no idea of what lay behind the curtain controlling 
these events. The cell-cycle control system was simply a black box inside the cell. 
It was not even clear whether there was a separate control system, or whether 
the processes of DNA synthesis, mitosis, and cytokinesis somehow controlled 
themselves. A major breakthrough came in the late 1980s with the identification 
of the key proteins of the control system, along with the realization that they are 
distinct from the proteins that perform the processes of DNA replication, chro- 
mosome segregation, and so on. 

In this section, we first consider the basic principles upon which the cell- 
cycle control system operates. We then discuss the protein components of the 
control system and how they work together to time and coordinate the events of 
the cell cycle. 



The Cell-Cycle Control System Triggers the Major Events of the 
Cell Cycle 

The cell-cycle control system operates much like a timer or oscillator that trig- 
gers the events of the cell cycle in a set sequence (Figure 17-14). In its simplest 
form — as seen in the stripped-down embryonic cell cycles discussed earlier — 
the control system is like a rigidly programmed timer that provides a fixed 
amount of time for the completion of each cell-cycle event. The control system 
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Is all DNA replicated? 
Is environment favorable? 
G 2 /M CHECKPOINT 



I ENTER MITOSIS 



Are all chromosomes 
attached to the spindle? 

METAPHASE-TO-ANAPHASE 
TRANSITION 

TRIGGER ANAPHASE AND 
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Figure 17-1 4 The control of the cell 
cycle. A cell-cycle control system triggers 
the essential processes of the cell cycle- 
such as DNA replication, mitosis, and 
cytokinesis. The control system is 
represented here as a central arm — the 
controller— that rotates clockwise, 
triggering essential processes when it 
reaches specific checkpoints on the outer 
dial. Information about the completion of 
cell-cycle events, as well as signals from 
the environment, can cause the control 
system to arrest the cycle at these 
checkpoints. The most prominent 
checkpoints occur at locations marked 
with yellow boxes. 



in these cells is independent of the events it controls, so that its timing mecha- 
nisms continue to operate even if those events fail. In most cells, however, the 
control system does respond to information received back from the processes it 
controls. Sensors, for example, detect the completion of DNA synthesis, and if 
some malfunction prevents the successful completion of this process, signals 
are sent to the control system to delay progression to M phase. Such delays pro- 
vide time for the machinery to be repaired and also prevent the disaster that 
might result if the cycle progressed prematurely to the next stage — and segre- 
gated incompletely replicated chromosomes, for example. 

The cell-cycle control system is based on a connected series of biochemical 
switches, each of which initiates a specific cell-cycle event. This system of 
switches possesses many important engineering features that increase the accu- 
racy and reliability of cell-cycle progression. First, the switches are generally 
binary (on/ off) and launch events in a complete, irreversible fashion. It would 
clearly be disastrous, for example, if events like chromosome condensation or 
nuclear envelope breakdown were only partially initiated or started but not 
completed. Second, the cell-cycle control system is remarkably robust and reli- 
able, partly because backup mechanisms and other features allow the system to 
operate effectively under a variety of conditions and even if some components 
fail. Finally, the control system is highly adaptable and can be modified to suit 
specific cell types or to respond to specific intracellular or extracellular signals. 

In most eucaryotic cells, the cell-cycle control system triggers cell-cycle pro- 
gression at three major regulatory transitions, or checkpoints (see Figure 
17-14). The first checkpoint is Start (or the restriction point) in late Gi, where the 
cell commits to cell-cycle entry and chromosome duplication, as mentioned 
earlier. The second is the G 2 /M checkpoint, where the control system triggers 
the early mitotic events that lead to chromosome alignment on the spindle in 
metaphase. The third is the metaphase-to-anaphase transition, where the con- 
trol system stimulates sister- chromatid separation, leading to the completion of 
mitosis and cytokinesis. The control system blocks progression through each of 
these checkpoints if it detects problems inside or outside the cell. If the control 
system senses problems in the completion of DNA replication, for example, it 
will hold the cell at the G2/M checkpoint until those problems are solved. Simi- 
larly, if extracellular conditions are not appropriate for cell proliferation, the 
control system blocks progression through Start, thereby preventing cell divi- 
sion until conditions become favorable. 
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The Cell-Cycle Control System Depends on Cyclically Activated 
Cyclin-Dependent Protein Kinases (Cdks) 

Central components of the cell-cycle control system are members of a family of 
protein kinases known as cyclin-dependent kinases (Cdks). The activities of 
these kinases rise and fall as the cell progresses through the cycle, leading to 
cyclical changes in the phosphorylation of intracellular proteins that initiate or 
regulate the major events of the cell cycle. An increase in Cdk activity at the 
G2/M checkpoint, for example, increases the phosphorylation of proteins that 
control chromosome condensation, nuclear envelope breakdown, spindle 
assembly, and other events that occur at the onset of mitosis. 

Cyclical changes in Cdk activity are controlled by a complex array of 
enzymes and other proteins that regulate these kinases. The most important of 
these Cdk regulators are proteins known as cyclins. Cdks, as their name implies, 
are dependent on cyclins for their activity: unless they are tightly bound to a 
cyclin, they have no protein kinase activity (Figure 17-15). Cyclins were origi- 
nally named because they undergo a cycle of synthesis and degradation in each 
cell cycle. The levels of the Cdk proteins, by contrast, are constant, at least in the 
simplest cell cycles. Cyclical changes in cyclin protein levels result in the cyclic 
assembly and activation of the cyclin-Cdk complexes; this activation in turn 
triggers cell-cycle events. 

There are four classes of cyclins, each defined by the stage of the cell cycle at 

which they bind Cdks and function. All eucaryotic cells require three of these 

classes (Figure 17-16): 

1. Gi/S-cyclins activate Cdks in late Gi and thereby help trigger progression 

through Start, resulting in a commitment to cell-cycle entry. Their levels 

fall in S phase. 

S-cyclins bind Cdks soon after progression through Start and help stimu- 
late chromosome duplication. S-cyclin levels remain elevated until mito- 
sis, and these cyclins also contribute to the control of some early mitotic 
events. 

M-cyclins activate Cdks that stimulate entry into mitosis at the G 2 /M 
checkpoint. Mechanisms that we discuss later destroy M-cyclins in mid- 
mitosis. 
In most cells, a fourth class of cyclins, the Gi -cyclins, helps govern the activities 
of the Gi/S cyclins, which control progression through Start in late Gi. 

In yeast cells, a single Cdk protein binds all classes of cyclins and triggers dif- 
ferent cell-cycle events by changing cyclin partners at different stages of the 
cycle. In vertebrate cells, by contrast, there are four Cdks. Two interact with G r 
cyclins, one with Gi/S- and S-cyclins, and one with M-cyclins. In this chapter, we 
simply refer to the different cyclin-Cdk complexes as G^Cdk, Gi/S-Cdk, S-Cdk, 
and M-Cdk. Table 17-1 lists the names of the individual Cdks and cyclins. 

How do different cyclin-Cdk complexes trigger different cell-cycle events? 
The answer, at least in part, seems to be that the cyclin protein does not simply 
activate its Cdk partner but also directs it to specific target proteins. As a result, 
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Figure 1 7-1 5 Two key components of 
the cell-cycle control system. When 
cyclin forms a complex with Cdk, the 
protein kinase is activated to trigger 
specific cell-cycle events. Without cyclin, 
Cdk is inactive. 
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Figure 1 7-1 6 Cyclin-Cdk complexes of 
the cell-cycle control system. The 

concentrations of the three major cyclin 
types oscillate during the cell cycle, while 
the concentrations of Cdks {not shown) 
do not change and exceed the amounts 
of cyclins. In late G h rising GT/S-cyclin 
levels lead to the formation of Gi/S-Cdk 
complexes that trigger progression 
through the Start checkpoint. S-Cdk 
complexes form at the start of S phase 
and trigger DNA replication, as well as 
some early mitotic events. M-Cdk 
complexes form during G 2 but are held in 
an inactive state by mechanisms we 
describe later. These complexes are 
activated at the end of G 2 and trigger the 
early events of mitosis. A separate 
regulatory protein, the APC/C, which we 
discuss later, initiates the metaphase-to- 
anaphase transition. 
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each cyclin-Cdk complex phosphorylates a different set of substrate proteins. 
The same cyclin-Cdk complex can also induce different effects at different times 
in the cycle, probably because the accessibility of some Cdk substrates changes 
during the cell cycle. Certain proteins that function in mitosis, for example, may 
become available for phosphorylation only in G2. 

Studies of the three-dimensional structures of Cdk and cyclin proteins have 
revealed that, in the absence of cyclin, the active site in the Cdk protein is partly 
obscured by a slab of protein, like a stone blocking the entrance to a cave (Figure 
17-17A). Cyclin binding causes the slab to move away from the active site, result- 
ing in partial activation of the Cdk enzyme (Figure 17-17B). Full activation of the 
cyclin-Cdk complex then occurs when a separate kinase, the Cdk-activating 
kinase (CAK), phosphorylates an amino acid near the entrance of the Cdk active 
site. This causes a small conformational change that further increases the activ- 
ity of the Cdk, allowing the kinase to phosphorylate its target proteins effectively 
and thereby induce specific cell-cycle events (Figure 17-17C). <TAGA> 



Inhibitory Phosphorylation and Cdk Inhibitory Proteins (CKIs) Can 
Suppress Cdk Activity 

The rise and fall of cyclin levels is the primary determinant of Cdk activity dur- 
ing the cell cycle. Several additional mechanisms, however, fine-tune Cdk activ- 
ity at specific stages of the cycle. 

Phosphorylation at a pair of amino acids in the roof of the kinase active site 
inhibits the activity of a cyclin-Cdk complex. Phosphorylation of these sites by a 
protein kinase known as Weel inhibits Cdk activity, while dephosphorylation of 
these sites by a phosphatase known as Cdc25 increases Cdk activity (Figure 
17-18). We will see later that this regulatory mechanism is particularly impor- 
tant in the control of M-Cdk activity at the onset of mitosis. 

Binding of Cdk inhibitor proteins (CKIs) also regulates cyclin-Cdk com- 
plexes. The three-dimensional structure of a cyclin-Cdk-CKI complex reveals 
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Figure 1 7-1 7 The structural basis of Cdk 
activation. These drawings are based on 
three-dimensional structures of human 
Cdk2, as determined by x-ray 
crystallography. The location of the bound 
ATP is indicated. The enzyme is shown in 
three states. (A) In the inactive state, without 
cyclin bound, the active site is blocked by a 
region of the protein called the T-loop (red). 
(B) The binding of cyclin causes the T-loop to 
move out of the active site, resulting in 
partial activation of the Cdk2. (C) 
Phosphorylation of Cdk2 (by CAK) at a 
threonine residue in the T-loop further 
activates the enzyme by changing the shape 
of the T-loop, improving the ability of the 
enzyme to bind its protein substrates. 
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that CKI binding stimulates a large rearrangement in the structure of the Cdk 
active site, rendering it inactive (Figure 17-19). Cells use CKIs primarily to help 
govern the activities of Gi/S- and S-Cdks early in the cell cycle. 



The Cell-Cycle Control System Depends on Cyclical Proteolysis 

Whereas activation of specific cyclin-Cdk complexes drives progression through 
the Start and G2/M checkpoints (see Figure 17-16), progression through the 
metaphase-to-anaphase transition is triggered not by protein phosphorylation 
but by protein destruction, leading to the final stages of cell division. 

The key regulator of the metaphase-to-anaphase transition is the anaphase- 
promoting complex, or cyclosome (APC/C), a member of the ubiquitin ligase 
family of enzymes. As discussed in Chapter 3, many of these enzymes are used 
in numerous cell processes to stimulate the proteolytic destruction of specific 
regulatory proteins. They transfer multiple copies of the small protein ubiquitin 
to specific target proteins, resulting in their proteolytic destruction by the pro- 
teasomes. Other ubiquitin ligases mark proteins for purposes other than 
destruction. 

The APC/C catalyzes the ubiquitylation and destruction of two major pro- 
teins. The first is securin, which normally protects the protein linkages that hold 
sister chromatid pairs together in early mitosis. Destruction of securin at the 
metaphase-to-anaphase transition activates a protease that separates the sisters 
and unleashes anaphase. The S- and M-cyclins are the second major targets of 
the APC/C. Destroying these cyclins inactivates most Cdks in the cell (see Figure 
17-16). As a result, the many proteins phosphorylated by Cdks from S phase to 
early mitosis are dephosphorylated by various phosphatases that are present in 
the anaphase cell. This dephosphorylation of Cdk targets is required for the 
completion of M phase, including the final steps in mitosis and the process of 
cytokinesis. Following its activation in mid-mitosis, the APC/C remains active in 
Gi, thereby providing a stable period of Cdk inactivity. When Gi/S-Cdks are acti- 
vated in late d, the APC/C is turned off, thereby allowing cyclin accumulation 
to start the next cell cycle. 

The cell-cycle control system also uses another ubiquitin ligase called SCF 
(after the names of its three subunits). It ubiquitylates certain CKI proteins in 
late Gi and thereby helps control the activation of S-Cdks and DNA replication. 

The APC/C and SCF are both large, multisubunit complexes with some 
related components, but they are regulated differently. APC/C activity changes 
during the cell cycle, primarily as a result of changes in its association with an 
activating subunit— either Cdc20 during anaphase or Cdhl from late mitosis 
through early Gi, These subunits help the APC/C recognize its target proteins 
(Figure 17-20A). SCF activity also depends on subunits called F-box proteins, 
which help the complex recognize its target proteins. Unlike APC/C activity, 
however, SCF activity is constant during the cell cycle. Ubiquitylation by SCF is 
controlled instead by changes in the phosphorylation state of its target proteins, 
as F-box subunits recognize only specifically phosphorylated proteins (Figure 
17-20B). 
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Figure 1 7-1 8 The regulation of Cdk 
activity by inhibitory phosphorylation. 

The active cyclin-Cdk complex is turned 
off when the kinase Weel phosphorylates 
two closely spaced sites above the active 
site. Removal of these phosphates by the 
phosphatase Cdc25 activates the 
cyclin-Cdk complex. For simplicity, only 
one inhibitory phosphate is shown. CAK 
adds the activating phosphate, as shown 
in Figure 17-17. 
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Figure 17-1 9 The inhibition of a 
cyclin-Cdk complex by a CKI. This 
drawing is based on the three- 
dimensional structure of the human 
cyclin A-Cdk2 complex bound to the 
CKI p27, as determined by x-ray 
crystallography. The p27 binds to both 
the cyclin and Cdk in the complex, 
distorting the active site of the Cdk. It 
also inserts into the ATP-binding site, 
further inhibiting the enzyme activity. 
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Cell-Cycle Control Also Depends on Transcriptional Regulation 

In the frog embryonic cell cycle discussed earlier, gene transcription does not 
occur. Cell-cycle control depends exclusively on post-transcriptional mecha- 
nisms that involve the regulation of Cdks and ubiquitin ligases and their target 
proteins. In the more complex cell cycles of most cell types, however, transcrip- 
tional control provides an additional level of regulation. Changes in cyclin gene 
transcription, for example, help control cyclin levels in most cells. 

We can use DNA microarrays (discussed in Chapter 8) to analyze changes in 
the expression of all of the genes in the genome as the cell progresses through 
the cell cycle. The results of these studies are surprising. In budding yeast, for 
example, about 10% of the genes encode mRNAs whose levels oscillate during 
the cell cycle. Some of these genes encode proteins with known cell-cycle func- 
tions, but the functions of many others are unknown. 



Figure 1 7-20 The control of proteolysis 
by APC/C and SCF during the cell cycle. 

(A) The APC/C is activated in mitosis by 
association with the activating subunit 
Cdc20, which recognizes specific amino 
acid sequences on M-cyclin and other 
target proteins. With the help of two 
additional proteins called E1 and E2, the 
APC/C transfers multiple ubiquitin 
molecules onto the target protein. The 
polyubiquitylated target is then 
recognized and degraded in a 
proteasome. <B) The activity of the 
ubiquitin ligase SCF depends on 
substrate-binding subunits called F-box 
proteins, of which there are many 
different types. The phosphorylation of a 
target protein, such as the CKI shown, 
allows the target to be recognized by a 
specific F-box subunit. 



The Cell-Cycle Control System Functions as a Network of 
Biochemical Switches 

Table 17-2 summarizes some of the major components of the cell-cycle control 
system. These proteins are functionally linked together to form a robust net- 
work, which operates essentially autonomously to activate a series of biochem- 
ical switches, each of which triggers a specific cell-cycle event. 
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Table 1 7-2 Summary of the Major Cell-Cycle Regulatory Proteins 
GENERAL NAME FUNCTIONS AND COMMENTS 

Protein kinases and protein phosphatases that modify Cdks 




Cdk-activating kinase (CAK) 
Wee1 kinase 

Cdc25 phosphatase 

Cdk inhibitor proteins (CKIs) 

Sid (budding yeast) 
p27 (mammals) 



p21 (mammals) 
p16 (mammals) 

Ubiquitin ligases and their activators 

APC/C 



phosphorylates an activating site in Cdks 

phosphorylates inhibitory sites in Cdks; primarily involved in suppressing Cdk1 activity before 

mitosis 

removes inhibitory phosphates from Cdks; three family members (Cdc25A, B, C) in mammals; 

primarily involved in controlling Cdkl activation at the onset of mitosis 



suppresses Cdk1 activity in d; phosphorylation by Cdk1 at the end of Gi triggers its destruction 
suppresses Gi/S-Cdk and S-Cdk activities in G 1( * helps cells withdraw from cell cycle when they 
terminally differentiate; phosphorylation by Cdk2 triggers its ubiquitylation by SCF 
suppresses Gi/S-Cdk and S-Cdk activities following DNA damage 
suppresses G r Cdk activity in Gy, frequently inactivated in cancer 



Cdc20 



Cdhl 



SCF 



catalyzes ubiquitylation of regulatory proteins involved primarily in exit from mitosis, including 

securin and S- and M-cyclins; regulated by association with activating subunits 

APC/C-activating subunit in all cells; triggers initial activation of APC/C at metaphase-to-anaphase 

transition; stimulated by M-Cdk activity 

APC/C-activating subunit that maintains APC/C activity after anaphase and throughout Gi; 

inhibited by Cdk activity 

catalyzes ubiquitylation of regulatory proteins involved in Gi control, including some CKIs (Sicl in 

budding yeast, p27 in mammals); phosphorylation of target protein usually required for this 

activity 



When conditions for cell proliferation are right, various external and inter- 
nal signals stimulate the activation of Gi-Cdk, which in turn stimulates the 
expression of genes encoding Gi/S- and S-cyclins. The resulting activation of 
Gi/S-Cdk then drives progression through the Start checkpoint. By mechanisms 
we discuss later, Gi/S-Cdks unleash a wave of S-Cdk activity, which initiates 
chromosome duplication in S phase and also contributes to some early events 
of mitosis. M-Cdk activation then triggers progression through the G 2 /M check- 
point and the events of early mitosis, leading to the alignment of sister chro- 
matids at the equator of the mitotic spindle. Finally, the APC/C, together with its 
activator Cdc20, triggers the destruction of securin and cyclins at the 
metaphase-to-anaphase transition, thereby unleashing sister-chromatid segre- 
gation and the completion of mitosis (Figure 17-21). When mitosis is complete, 
multiple mechanisms collaborate to suppress Cdk activity after mitosis, result- 
ing in a stable Gi period, as we discuss later. We are now ready to discuss these 
cell-cycle stages in more detail, starting with S phase. 
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Figure 17-21 An overview of the 
cell-cycle control system. The core of the 
cell-cycle control system consists of a 
series of cyclin-Cdk complexes (yellow). 
As discussed in more detail later, the 
activity of each complex is also 
influenced by various inhibitory 
mechanisms, which provide information 
about the extracellular environment, cell 
damage, and incomplete cell-cycle 
events (top). These mechanisms are not 
present in all cell types; many are missing 
in early embryonic cell cycles, for 
example. 
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Summary 

The cell-cycle control system triggers the events of the cell cycle and ensures that these 
events are properly timed and occur in the correct order. The control system responds 
to various intracellular and extracellular signals and arrests the cycle when the cell 
either fails to complete an essential cell-cycle process or encounters unfavorable envi- 
ronmental or intracellular conditions. 

Central components of the cell-cycle control system are cyclin- dependent protein 
kinases (Cdks), which depend on cyclin subunits for their activity. Oscillations in the 
activities of various cyclin-Cdk complexes control various cell-cycle events. Thus, acti- 
vation ofS-phase cyclin-Cdk complexes (S-Cdk) initiates S phase, while activation of 
M-phase cyclin-Cdk complexes (M-Cdk) triggers mitosis. The mechanisms that con- 
trol the activities of cyclin-Cdk complexes include phosphorylation of the Cdk sub- 
unit, binding of Cdk inhibitor proteins (CKIs), proteolysis of cyclins, and changes in the 
transcription of genes encoding Cdk regulators. The cell-cycle control system also 
depends crucially on two additional enzyme complexes, theAPC/C and SCF ubiquitin 
ligases, which catalyze the ubiquitylation and consequent destruction of specific regu- 
latory proteins that control critical events in the cycle. 



S PHASE 

The linear chromosomes of eucaryotic cells are vast and dynamic assemblies of 
DNA and protein, and their duplication is a complex process that takes up a 
major fraction of the cell cycle. Not only must the long DNA molecule of each 
chromosome be duplicated accurately — a remarkable feat in itself — but the pro- 
tein packaging surrounding each region of that DNA must also be reproduced, 
ensuring that the daughter cells inherit all features of chromosome structure. 

The central event of chromosome duplication is replication of the DNA. A 
cell must solve two problems when initiating and completing DNA replication. 
First, replication must occur with extreme accuracy to minimize the risk of 
mutations in the next cell generation. Second, every nucleotide in the genome 
must be copied once, and only once, to prevent the damaging effects of gene 
amplification. In Chapter 5, we discuss the sophisticated protein machinery that 
performs DNA replication with astonishing speed and accuracy. In this section, 
we consider the elegant mechanisms by which the cell- cycle control system ini- 
tiates the replication process and, at the same time, prevents it from happening 
more than once per cycle. 

S-Cdk Initiates DNA Replication Once Per Cycle 

DNA replication begins at origins of replication, which are scattered at numer- 
ous locations in every chromosome. During S phase, the initiation of DNA repli- 
cation occurs at these origins when specialized protein machines (sometimes 
called initiator proteins) unwind the double helix at the origin and load DNA 
replication enzymes onto the two single-stranded templates. This leads to the 
elongation phase of replication, when the replication machinery moves outward 
from the origin at two replication forks (discussed in Chapter 5). 

To ensure that chromosome duplication occurs only once per cell cycle, 
the initiation phase of DNA replication is divided into two distinct steps that 
occur at different times in the cell cycle. The first step occurs in late mitosis 
and early Gi, when a large complex of initiator proteins, called the prereplica- 
tive complex, or pre-RC, assembles at origins of replication. This step is some- 
times called licensing of replication origins because initiation of DNA synthe- 
sis is permitted only at origins containing a pre-RC. The second step occurs at 
the onset of S phase, when components of the pre-RC nucleate the formation 
of a larger protein complex called the preinitiation complex. This complex 
then unwinds the DNA helix and loads DNA polymerases and other replication 
enzymes onto the DNA strands, thereby initiating DNA synthesis, as described 
in Chapter 5. Once the replication origin has been activated in this way, the 
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Figure 17-22 Control of chromosome 
duplication. Preparations for DNA 
replication begin in Gi with the assembly 
of prereplicative complexes (pre-RCs) at 
replication origins. S-Cdk activation leads 
to the formation of multiprotein 
preinitiation complexes that unwind the 
DNA at origins and begin the process of 
DNA replication. Two replication forks 
move out from each origin until the 
entire chromosome is duplicated. 
Duplicated chromosomes are then 
segregated in M phase. The activation of 
replication origins in S phase also causes 
disassembly of the prereplicative 
complex, which does not reform at the 
origin until the following Gi — thereby 
ensuring that each origin is activated 
only once in each cell cycle. 



pre-RC is dismantled and cannot be reassembled at that origin until the follow- 
ing Gi. As a result, origins can be activated only once per cell cycle. 

The cell-cycle control system governs both assembly of the pre-RC and 
assembly of the pre-initiation complex (Figure 17-22). Assembly of the pre-RC 
is inhibited by Cdk activity, and, in most cells, is stimulated by the APC/C. Pre- 
RC assembly therefore occurs only in late mitosis and early Gi, when Cdk activ- 
ity is low and APC/C activity is high. At the onset of S phase, activation of S-Cdk 
then triggers the formation of a preinitiation complex, which initiates DNA syn- 
thesis. In addition, the pre-RC is partly dismantled. Because S-Cdk and M-Cdk 
activities remain high (and APC/C activity remains low) until late mitosis, new 
pre-RCs cannot be assembled at fired origins until the cell cycle is complete. 

Figure 17-23 illustrates some of the proteins involved in the initiation of 
DNA replication. A key player is a large, multiprotein complex called the origin 
recognition complex (ORC), which binds to replication origins throughout the 
cell cycle. In late mitosis and early Gi, the proteins Cdc6 and Cdtl bind to the 
ORC at origins and help load a group of six related proteins called the Mem pro- 
teins. The resulting large complex is the pre-RC, and the origin is now licensed 
for replication. 

The six Mem proteins of the pre-RC form a ring around the DNA that is 
thought to serve as the major DNA helicase that unwinds the origin DNA when 
DNA synthesis begins and as the replication forks move out from the origin. 
Thus, the central purpose of the pre-RC is to load the helicase that will play a 
central part in the subsequent DNA replication process. 

Once the pre-RC has assembled in Gi, the replication origin is ready to fire. 
The activation of S-Cdk in late Gi triggers the assembly of several additional pro- 
tein complexes at the origin, leading to the formation of a giant preinitiation 
complex that unwinds the helix and begins DNA synthesis. 

At the same time as it initiates DNA replication, S-Cdk triggers the disas- 
sembly of some pre-RC components at the origin. Cdks phosphorylate both the 
ORC and Cdc6, resulting in their inhibition by various mechanisms. Further- 
more, inactivation of the APC/C in late Gi also helps turn off pre-RC assembly. 
In late mitosis and early Gi, the APC/C triggers the destruction of a protein, 
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Figure 17-23 Control of the initiation of 
DNA replication. The ORC remains 
associated with a replication origin 
throughout the cell cycle. In early Gi, 
Cdc6 and Cdtl associate with the ORC. 
The resulting protein complex then 
assembles Mem ring complexes on the 
adjacent DNA, resulting in the formation 
of the prereplicative complex (pre-RC). 
S-Cdk (with assistance from another 
protein kinase, not shown) then 
stimulates the assembly of several 
additional proteins at the origin to form 
the preinitiation complex. DNA 
polymerase and other replication 
proteins are recruited to the origin, the 
Mem protein rings are activated as DNA 
helicases, and DNA unwinding allows 
DNA replication to begin. S-Cdk also 
blocks rereplication by triggering the 
destruction of Cdc6 and the inactivation 
of the ORC. Cdtl is inactivated by the 
protein geminin. Geminin is an APC/C 
target and its levels therefore increase in 
S and M phases, when APC/C is inactive. 
Thus, the components of the pre-RC 
(Cdc6, Cdtl , Mem) cannot form a new 
pre-RC at the origins until M-Cdk is 
inactivated and the APC/C is activated at 
the end of mitosis (see text). 



geminin, that binds and inhibits the pre-RC component Cdtl. Thus, when the 
APC/C is turned off in late Gi, geminin accumulates and inhibits Cdtl. In these 
various ways, S- and M-Cdk activities, combined with low APC/C activity, block 
pre-RC formation during S phase and thereafter. How, then, is the cell-cycle con- 
trol system reset to allow replication to occur in the next cell cycle? The answer 
is simple. At the end of mitosis, APC/C activation leads to the inactivation of 
Cdks and the destruction of geminin. Pre-RC components are dephosphory- 
lated and Cdtl is activated, allowing pre-RC assembly to prepare the cell for the 
next S phase. 



Chromosome Duplication Requires Duplication of Chromatin 
Structure 

The DNA of the chromosomes is extensively packaged in a variety of protein 
components, including histones and various regulatory proteins involved in 
the control of gene expression (discussed in Chapter 4). Thus, duplication of a 
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chromosome is not simply a matter of duplicating the DNA at its core but also 
requires the duplication of these chromatin proteins and their proper assembly 
on the DNA. 

The production of chromatin proteins increases during S phase to provide 
the raw materials needed to package the newly synthesized DNA. Most impor- 
tantly, S-Cdks stimulate a large increase in the synthesis of the four histone sub- 
units that form the histone octamers at the core of each nucleosome. These sub- 
units are assembled into nucleosomes on the DNA by nucleosome assembly fac- 
tors, which typically associate with the replication fork and distribute nucleo- 
somes on both strands of the DNA as they emerge from the DNA synthesis 
machinery. 

Chromatin packaging helps to control gene expression. In some parts of the 
chromosome, the chromatin is highly condensed and is called heterochromatin, 
whereas in other regions it has a more open structure and is called euchromatin. 
These differences in chromatin structure depend on a variety of mechanisms, 
including modification of histone tails and the presence of non-histone proteins 
(discussed in Chapter 4). Because these differences are important in gene regu- 
lation, it is crucial that chromatin structure, like the DNA within, is reproduced 
accurately during S phase. How chromatin structure is duplicated is not well 
understood, however. During DNA synthesis, histone-modifying enzymes and 
various non-histone proteins are probably deposited onto the two new DNA 
strands as they emerge from the replication fork, and these proteins are thought 
to help reproduce the local chromatin structure of the parent chromosome. 



Cohesins Help Hold Sister Chromatids Together 

At the end of S phase, each replicated chromosome consists of a pair of identi- 
cal sister chromatids glued together along their length. This sister- chromatid 
cohesion sets the stage for a successful mitosis because it greatly facilitates the 
attachment of the two sister chromatids in a pair to opposite poles of the mitotic 
spindle. Imagine how difficult it would be to achieve this bipolar attachment if 
sister chromatids were allowed to drift apart after S phase. Indeed, defects in sis- 
ter-chromatid cohesion — in yeast mutants, for example — lead inevitably to 
major errors in chromosome segregation. 

Sister-chromatid cohesion depends on a large protein complex called 
cohesin, which is deposited at many locations along the length of each sister 
chromatid as the DNA is replicated in S phase. Two of the subunits of cohesin are 
members of a large family of proteins called SMC proteins (for Structural Main- 
tenance of Chromosomes). Cohesin forms giant ring-like structures, and it has 
been proposed that these might form rings that surround the two sister chro- 
matids (Figure 17-24). 
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Figure 17-24 Cohesin. Cohesin is a 
protein complex with four subunits. Two 
subunits, Smcl and Smc3, are coiled-coil 
proteins with an ATPase domain at one 
end; together, they form a large 
V-shaped structure as shown. Two 
additional subunits, Scd and Scc3, 
connect the ATPase head domains 
forming a ring structure that may encircle 
the sister chromatids as shown. 
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Sister- chromatid cohesion also results, at least in part, from DNA catenation, 
the intertwining of sister DNA molecules that occurs when two replication forks 
meet during DNA synthesis. The enzyme topoisomerase II gradually disentangles 
the catenated sister DNAs between S phase and early mitosis by cutting one DNA 
molecule, passing the other through the break, and then resealing the cut DNA 
(see Figure 5-23). Once the catenation has been removed, sister- chromatid cohe- 
sion depends primarily on cohesin complexes. The loss of sister cohesion at the 
metaphase-to-anaphase transition therefore depends primarily on disruption of 
these complexes, as we describe later. 

Summary 

Duplication of the chromosomes in S phase involves the accurate copying of the entire 
DNA molecule in each chromosome, as well as the duplication of the chromatin pro- 
teins that associate with the DNA and govern various aspects of chromosome function. 
Chromosome duplication is triggered by the activation ofS-Cdk, which activates pro- 
teins that unwind the DNA and initiate its replication at sites in the DNA called repli- 
cation origins. Once a replication origin is activated during S phase, S-Cdk also 
inhibits proteins that are required to allow that origin to initiate DNA replication 
again. Thus, each origin is fired once and only once in each S phase and cannot be 
reused until the next cell cycle. 

MITOSIS 

Following the completion of S phase and transition through G2, the cell under- 
goes the dramatic upheaval of M phase. This begins with mitosis, during which 
the sister chromatids are separated and distributed {segregated) to a pair of iden- 
tical daughter nuclei, each with its own copy of the genome. Mitosis is tradition- 
ally divided into five stages — prophase, prometaphase, metaphase, anaphase, and 
telophase — defined primarily on the basis of chromosome behavior as seen in a 
microscope. <TACT> <TCAA> As mitosis is completed, the second major event of 
M phase — cytokinesis — divides the cell into two halves, each with an identical 
nucleus. Panel 17-1 summarizes the major events of M phase. 

From a regulatory point of view, mitosis can be divided into two major parts, 
each governed by distinct components of the cell-cycle control system. First, an 
abrupt increase in M-Cdk activity at the G2/M checkpoint triggers the events of 
early mitosis (prophase, prometaphase, and metaphase). During this period, M- 
Cdk and several other mitotic protein kinases phosphorylate a variety of proteins, 
leading to the assembly of the mitotic spindle and its attachment to the sister 
chromatid pairs. The second major part of mitosis begins at the metaphase-to- 
anaphase transition, when the APC/C triggers the destruction of securin, liberat- 
ing a protease that cleaves cohesin and thereby initiates separation of the sister 
chromatids. The APC/C also triggers the destruction of cyclins, which leads to 
Cdk inactivation and the dephosphorylation of Cdk targets, which is required for 
all events of late M phase, including the completion of anaphase, the disassem- 
bly of the mitotic spindle, and the division of the cell by cytokinesis. 

In this section, we describe the key mechanical events of mitosis and how 
M-Cdk and the APC/C orchestrate them. 

M-Cdk Drives Entry Into Mitosis 

One of the most remarkable features of cell-cycle control is that a single pro- 
tein kinase, M-Cdk, brings about all of the diverse and complex cell rearrange- 
ments that occur in the early stages of mitosis. At a minimum, M-Cdk must 
induce the assembly of the mitotic spindle and ensure that each sister chro- 
matid in a pair is attached to the opposite pole of the spindle. It also triggers 
chromosome condensation, the large-scale reorganization of the intertwined 
sister chromatids into compact, rod-like structures. In animal cells, M-Cdk 
also promotes the breakdown of the nuclear envelope and rearrangements of 
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(Micrographs courtesy of Julie Canman and Ted Salmon.) 
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the actin cytoskeleton and the Golgi apparatus. Each of these processes is 
thought to be triggered when M-Cdk phosphorylates specific proteins involved 
in the process, although most of these proteins have not yet been identified. 

M-Cdk does not act alone to phosphorylate key proteins involved in early 
mitosis. Two additional families of protein kinases, the Polo-like kinases and the 
Aurora kinases, also make important contributions to the control of early 
mitotic events. The Polo-like kinase Plk, for example, is required for the normal 
assembly of a bipolar mitotic spindle, in part because it phosphorylates proteins 
involved in separation of the spindle poles early in mitosis. The Aurora kinase 
Aurora-A also helps control proteins that govern the assembly and stability of 
the spindle, whereas Aurora-B controls attachment of sister chromatids to the 
spindle, as we discuss later. Activation of Polo-like kinases and Aurora kinases 
depends on M-Cdk activity, but the precise activation mechanisms are not clear. 



Dephosphorylation Activates M-Cdk at the Onset of Mitosis 

M-Cdk activation begins with the accumulation of M-cyclin (cyclin B in verte- 
brate cells; see Table 17-1). In embryonic cell cycles, the synthesis of M-cyclin is 
constant throughout the cell cycle, and M-cyclin accumulation results from the 
high stability of the protein in interphase. In most cell types, however, M-cyclin 
synthesis increases during G2 and M, owing primarily to an increase in M-cyclin 
gene transcription. The increase in M-cyclin protein leads to a corresponding 
accumulation of M-Cdk (the complex of Cdkl and M-cyclin) as the cell 
approaches mitosis. Although the Cdk in these complexes is phosphorylated at 
an activating site by the Cdk-activating kinase (CAK), as discussed earlier, the 
protein kinase Weel holds it in an inactive state by inhibitory phosphorylation 
at two neighboring sites (see Figure 17-18) . Thus, by the time the cell reaches the 
end of G2, it contains an abundant stockpile of M-Cdk that is primed and ready 
to act but is suppressed by phosphates that block the active site of the kinase. 

What, then, triggers the activation of the M-Cdk stockpile? The crucial event 
is the activation of the protein phosphatase Cdc25, which removes the 
inhibitory phosphates that restrain M-Cdk (Figure 17-25). At the same time, the 
inhibitory activity of the kinase Weel is suppressed, further ensuring that M-Cdk 
activity increases. The mechanisms that unleash Cdc25 activity (and suppress 
Weel) in early mitosis are not well understood. One possibility is that the S-Cdks 
that are active in G2 and early prophase stimulate Cdc25. 

Interestingly, Cdc25 can also be activated, at least in part, by its target, M- 
Cdk. M-Cdk may also inhibit the inhibitory kinase Weel. The ability of M-Cdk 
to activate its own activator (Cdc25) and inhibit its own inhibitor (Weel) sug- 
gests that M-Cdk activation in mitosis involves positive feedback loops (see Fig- 
ure 17-25). According to this attractive model, the partial activation of Cdc25 
(perhaps by S-Cdk) leads to the partial activation of a subpopulation of M-Cdk 
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Figure 1 7-25 The activation of M-Cdk. 

Cdk1 associates with M-cyclin as the 
levels of M-cyclin gradually rise. The 
resulting M-Cdk complex is 
phosphorylated on an activating site by 
the Cdk-activating kinase (CAK) and on a 
pair of inhibitory sites by the Weel 
kinase. The resulting inactive M-Cdk 
complex is then activated at the end of 
G 2 by the phosphatase Cdc25. Cdc25 is 
further stimulated by active M-Cdk, 
resulting in positive feedback. This 
feedback is enhanced by the ability of 
M-Cdk to inhibit Weel. 
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Figure 17-26 The mitotic chromosome. Scanning electron micrograph of 
a human mitotic chromosome, consisting of two sister chromatids joined 
along their length. The constricted regions are the centromeres. (Courtesy 
of Terry D.Allen.) 

complexes, which then phosphorylate more Cdc25 and Weel molecules. This 
leads to more M-Cdk activation, and so on. Such a mechanism would quickly 
promote the complete activation of all the M-Cdk complexes in the cell. As 
mentioned earlier, similar molecular switches operate at various points in the 
cell cycle to promote the abrupt and complete transition from one cell-cycle 
state to the next. 



Condensin Helps Configure Duplicated Chromosomes for 
Separation 

At the end of S phase, the immensely long DNA molecules of the sister chro- 
matids are tangled in a mass of partially catenated DNA and proteins. Any 
attempt to pull the sisters apart in this state would undoubtedly lead to breaks 
in the chromosomes. To avoid this disaster, the cell devotes a great deal of energy 
in early mitosis to gradually reorganizing the sister chromatids into relatively 
short, distinct structures that can be pulled apart more easily in anaphase. These 
chromosomal changes involve two processes: chromosome condensation, in 
which the chromatids are dramatically compacted; and sister-chromatid resolu- 
tion, whereby the two sisters are resolved into distinct, separable units (Figure 
17-26). Resolution results from the decatenation of the sister DNAs, accompa- 
nied by the partial removal of cohesin molecules along the chromosome arms. 
As a result, when the cell reaches metaphase, the sister chromatids appear in the 
microscope as compact, rod-like structures that are joined tightly at their cen- 
tromeric regions and only loosely along their arms. 

The condensation and resolution of sister chromatids depends, at least in 
part, on a five-subunit protein complex called condensin. Condensin structure 
is related to that of the cohesin complex that holds sister chromatids together 
(see Figure 17-24). It contains two SMC subunits like those of cohesin, plus three 
non-SMC subunits (Figure 17-27). Condensin may form a ring-like structure 
that somehow uses the energy provided by ATP hydrolysis to promote the com- 
paction and resolution of sister chromatids. Condensin is able to change the 
coiling of DNA molecules in a test tube, and this coiling activity is thought to be 
important for chromosome condensation during mitosis. Interestingly, phos- 
phorylation of condensin subunits by M-Cdk stimulates this coiling activity, 
providing one mechanism by which M-Cdk may promote chromosome restruc- 
turing in early mitosis. 



The Mitotic Spindle Is a Microtubule-Based Machine 

The central event of mitosis — chromosome segregation — depends in all eucary- 
otes on a complex and beautiful machine called the mitotic spindle. The spin- 
dle is a bipolar array of microtubules, which pulls sister chromatids apart in 
anaphase, thereby segregating the two sets of chromosomes to opposite ends of 
the cell, where they are packaged into daughter nuclei. M-Cdk triggers the 
assembly of the spindle early in mitosis, in parallel with the chromosome 
restructuring just described. Before we consider how the spindle assembles and 
how its microtubules attach to sister chromatids, we briefly review the basic fea- 
tures of spindle structure. 

As discussed in Chapter 16, the core of the mitotic spindle is a bipolar array 
of microtubules, the minus ends of which are focused at the two spindle poles, 
and the plus ends of which radiate outward from the poles (Figure 17-28). 
<GTCT> The plus ends of some microtubules— called the interpolar micro- 
tubules— interact with the plus ends of microtubules from the other pole, 
resulting in an antiparallel array in the spindle midzone. The plus ends of other 
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Figure 17-27 Condensin. Condensin is a 
five-subunit protein complex that 
resembles cohesin (see Figure 1 7-24). 
The head domains of its two major 
subunits, Smc2 and Smc4, are held 
together by three additional subunits. It 
is not clear how condensin catalyzes the 
restructuring and compaction of 
chromosome DNA, but it may form a ring 
structure that encircles loops of DNA as 
shown; it can hydrolyze ATP and coil DNA 
molecules in a test tube. 
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Figure 1 7-28 The three classes of 
microtubules of the mitotic spindle in 
an animal cell. The plus ends of the 
microtubules project away from the 
centrosomes, while the minus ends are 
anchored at the spindle poles, which in 
this example are organized by 
centrosomes. Kinetochore microtubules 
connect the spindle poles with the 
kinetochores of sister chromatids, while 
interpolar microtubules from the two 
poles interdigitate at the spindle equator. 
Astral microtubules radiate out from the 
poles into the cytoplasm and usually 
interact with the cell cortex, helping to 
position the spindle in the cell. 



microtubules— the kinetochore microtubules — are attached to sister chro- 
matid pairs at large protein structures called kinetochores, which are located at 
the centromere of each sister chromatid. Finally, many spindles also contain 
astral microtubules that radiate outward from the poles and contact the cell 
cortex, helping to position the spindle in the cell. 

In most somatic animal cells, each spindle pole is focused at a protein 
organelle called the centrosome (discussed in Chapter 16). Each centrosome 
consists of a cloud of amorphous material (called the pericentriolar matrix) that 
surrounds a pair of centrioles (Figure 17-29). The pericentriolar matrix nucle- 
ates a radial array of microtubules, with their fast-growing plus ends projecting 
outward and their minus ends associated with the centrosome. The matrix con- 
tains a variety of proteins, including microtubule-dependent motor proteins, 
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Figure 1 7-29 The centrosome. (A) Electron micrograph of an S-phase mammalian cell in culture, 
showing a duplicated centrosome. Each centrosome contains a pair of centrioles; although the 
centrioles have duplicated, they remain together in a single complex, as shown in the drawing of the 
micrograph in (B). One centriole of each centriole pair has been cut in cross section, while the other 
is cut in longitudinal section, indicating that the two members of each pair are aligned at right 
angles to each other. The two halves of the replicated centrosome, each consisting of a centriole pair 
surrounded by pericentriolar matrix, will split and migrate apart to initiate the formation of the two 
poles of the mitotic spindle when the cell enters M phase. (C) Electron micrograph of a centriole pair 
that has been isolated from a cell. The two centrioles have partly separated during the isolation 
procedure but remain tethered together by fine fibers, which keep the centriole pair together until it 
is time for them to separate. Both centrioles are cut longitudinally, and it can now be seen that the 
two have different structures: the mother centriole is larger and more complex than the daughter 
centriole, and only the mother centriole is associated with pericentriolar matrix that nucleates 
microtubules. Each daughter centriole will mature during the next cell cycle, when it will replicate to 
give rise to its own daughter centriole. (A, from M. McGill, D.P. Highfield, T.M. Monahan and 
B.R. Brinkley, J. Ultrastruct. Res. 57:43-53, 1976. With permission from Academic Press; C, from 
M. Paintrand et al.,J. Struct. Biol. 108:107-128, 1992. With permission from Elsevier.) 
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coiled-coil proteins that link the motors to the centrosome, structural proteins, 
and components of the cell-cycle control system. Most important, it contains 
the y-tubulin ring complex, which is the component mainly responsible for 
nucleating microtubules (discussed in Chapter 16). 

Some cells — notably the cells of higher plants and the oocytes of many ver- 
tebrates — do not have centrosomes, and microtubule-dependent motor pro- 
teins and other proteins associated with microtubule minus ends organize and 
focus the spindle poles. 



Microtubule-Dependent Motor Proteins Govern Spindle 
Assembly and Function 

The assembly and function of the mitotic spindle depend on numerous micro- 
tubule-dependent motor proteins. As discussed in Chapter 16, these proteins 
belong to two families — the kinesin-related proteins, which usually move 
toward the plus end of microtubules, and dyneins, which move toward the 
minus end. In the mitotic spindle, these motor proteins generally operate at or 
near the ends of the microtubules. Four major types of motor proteins — kinesin- 
5, kinesin-14 } kinesins-4 and 10, and dynein — are particularly important in spin- 
dle assembly and function (Figure 17-30). 

Kinesin-5 proteins contain two motor domains that interact with the plus 
ends of antiparallel microtubules in the spindle midzone. Because the two 
motor domains move toward the plus ends of the microtubules, they slide the 
two antiparallel microtubules past each other toward the spindle poles, forcing 
the poles apart. Kinesin-14 proteins, by contrast, are minus-end directed motors 
with a single motor domain and other domains that can interact with a different 
microtubule. They can cross-link antiparallel interpolar microtubules at the 
spindle midzone and tend to pull the poles together. Kinesin-4 and kinesin-10 
proteins, also called chromokinesins, are plus-end directed motors that associ- 
ate with chromosome arms and push the attached chromosome away from the 
pole (or the pole away from the chromosome). Finally, dyneins are minus-end 
directed motors that, together with associated proteins, organize microtubules 
at various cellular locations. They link the plus ends of astral microtubules to 
components of the actin cytoskeleton at the cell cortex, for example; by moving 
toward the minus end of the microtubules, the dynein motors pull the spindle 
poles toward the cell cortex and away from each other. 



Two Mechanisms Collaborate in the Assembly of a Bipolar Mitotic 
Spindle 

The mitotic spindle must have two poles if it is to pull the two sets of sister chro- 
matids to opposite ends of the cell in anaphase. In animal cells, the primary 
focus of this chapter, two mechanisms collaborate to ensure the bipolarity of the 
spindle. One depends on the ability of mitotic chromosomes to nucleate and 
stabilize microtubules and on the ability of the various motor proteins just 
described to organize microtubules into a bipolar array, with minus ends 




Figure 17-30 Major motor proteins of 
the spindle. Four major classes of 
microtubule-dependent motor proteins 
{yellow boxes) contribute to spindle 
assembly and function (see text). The 
colored arrows indicate the direction of 
motor movement along a microtubule— 
blue toward the minus end, and red 
toward the plus end. 
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focused at two spindle poles and plus ends interacting with each other in the 
spindle midzone. The other depends on the ability of centrosomes to help form 
the spindle poles. A typical animal cell enters mitosis with a pair of centrosomes, 
each of which nucleates a radial array of microtubules. The two centrosomes 
facilitate bipolar spindle assembly by providing a pair of prefabricated spindle 
poles. Centrosomes are not essential for the assembly of a bipolar spindle, how- 
ever, since a functional spindle forms in cells that normally lack centrosomes 
and in cultured cells in which a laser beam has destroyed the centrosome. 

We now describe the steps of spindle assembly, beginning with centrosome- 
dependent assembly in early mitosis. We then consider the self-organization 
mechanism that does not require centrosomes and becomes particularly impor- 
tant after nuclear envelope breakdown. 



Centrosome Duplication Occurs Early in the Cell Cycle 

Most animal cells contain a single centrosome that nucleates most of the cell's 
cytoplasmic microtubules. The centrosome duplicates when the cell enters the 
cell cycle, so that by the time the cell reaches mitosis there are two centrosomes. 
Centrosome duplication begins at about the same time as the cell enters S 
phase. The d/S-Cdk (a complex of cyclin E and Cdk2 in animal cells; see Table 
17-1) that triggers cell cycle entry also initiates centrosome duplication. The two 
centrioles in the centrosome separate, and each nucleates the formation of a 
single new centriole, resulting in two centriole pairs within an enlarged pericen- 
triolar matrix (Figure 17-31). This centrosome pair remains together on one side 
of the nucleus until the cell enters mitosis. 

There are interesting parallels between centrosome duplication and chro- 
mosome duplication. Both use a semi-conservative mechanism of duplication, 
in which the two halves separate and serve as templates for construction of a 
new half. Centrosomes, like chromosomes, must replicate once and only once 
per cell cycle, to ensure that the cell enters mitosis with only two copies: an 
incorrect number of centrosomes could lead to defects in spindle assembly and 
thus errors in chromosome segregation. 

The mechanisms that limit centrosome duplication to once per cell cycle are 
uncertain. In many cell types, experimental inhibition of DNA synthesis blocks 
centrosome duplication, providing one mechanism by which centrosome num- 
ber is kept in check. Other cell types, however, including those in the early 
embryos of flies, sea urchins, and frogs, do not contain such a mechanism and 
centrosome duplication continues if chromosome duplication is blocked. It is 
not known how such cells limit centrosome duplication to once per cell cycle. 



M-Cdk Initiates Spindle Assembly in Prophase 

At the beginning of mitosis, the sudden rise in M-Cdk activity initiates spindle 
assembly. In animal cells, the two centrosomes move apart along the nuclear 
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Figure 1 7-31 Centriole replication. The 

centrosome consists of a centriole pair 
and associated pericentriolar matrix 
(green). At a certain point in G lf the two 
centrioles of the pair separate by a few 
micrometers. During S phase, a daughter 
centriole begins to grow near the base of 
each mother centriole and at a right 
angle to it. The elongation of the 
daughter centriole is usually completed 
by G 2 . The two centriole pairs remain 
close together in a single centrosomal 
complex until the beginning of M phase, 
when the complex splits in two and the 
two halves begin to separate. Each 
centrosome now nucleates its own radial 
array of microtubules called an aster. 
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Figure 1 7-32 The influence of opposing motor proteins on spindle length in budding yeast. (A) A differential- 
interference-contrast micrograph of a mitotic yeast cell. The spindle is highlighted in green, and the position of the spindle 
poles is indicated by red arrows. The nuclear envelope does not break down during mitosis in yeasts, and the spindle forms 
inside the nucleus. In (B-D), the mitotic spindles have been stained with fluorescent anti-tubulin antibodies. <B) Normal 
yeast cells. (C) Overexpression of the minus-end-directed motor protein Kar3 (a kinesin-14 protein) leads to abnormally 
short spindles. (D) Overexpression of the plus-end-directed motor protein Cin8 (a kinesin-5 protein) leads to abnormally 
long spindles. Thus, it seems that a balance between opposing motor proteins determines spindle length in these cells. 
(A, courtesy of Kerry Bloom; B-D, from W. Saunders, V. Lengyel and M.A. Hoyt, Mol. Biol. Cell 8:1025-1033, 1997. With 
permission from American Society for Cell Biology.) 

envelope, and the plus ends of the microtubules between them interdigitate to 
form the interpolar microtubules of the developing spindle. At the same time, 
the amount of y-tubulin ring complexes in each centrosome increases greatly, 
increasing the ability of the centrosomes to nucleate new microtubules, a pro- 
cess called centrosome maturation. 

Multiple motor proteins drive the separation of centrosomes in early mito- 
sis. In prophase, minus-end directed dynein motor proteins at the plus ends of 
astral microtubules provide the major force. These motors are anchored at the 
cell cortex or on the nuclear envelope, and their movement toward the micro- 
tubule minus end pulls the centrosomes apart (see Figure 17-30). Following 
nuclear envelope breakdown at the end of prophase, interactions between the 
centrosomal microtubules and the cell cortex allow actin-myosin bundles in the 
cortex to pull the centrosomes further apart. Finally, kinesin-5 motors cross-link 
the overlapping, antiparallel ends of interpolar microtubules and push the poles 
apart (see Figure 17-30). 

The balance of opposing forces generated by different types of motor pro- 
teins determines the final length of the spindle. Dynein and kinesin-5 motors 
generally promote centrosome separation and increase spindle length. Kinesin- 
14 proteins do the opposite: they are minus-end directed motors and interact 
with a microtubule from one pole while traveling toward the minus end of an 
antiparallel microtubule from the other pole; as a result, they tend to pull the 
poles together. It is not clear how the cell regulates the balance of opposing 
forces to generate the appropriate spindle length (Figure 17-32). 

M-Cdk and other mitotic protein kinases are required for centrosome sepa- 
ration and maturation. M-Cdk and aurora-A phosphorylate kinesin-5 motors 
and stimulate them to drive centrosome separation. Aurora-A and Plk also phos- 
phorylate components of the centrosome and thereby promote its maturation. 



The Completion of Spindle Assembly in Animal Cells Requires 
Nuclear Envelope Breakdown 

The centrosomes and microtubules of animal cells are located in the cytoplasm, 
separated from the chromosomes by the double membrane barrier of the 
nuclear envelope (discussed in Chapter 12). Clearly, the attachment of sister 
chromatids to the spindle requires the removal of this barrier. In addition, many 
of the motor proteins and microtubule regulators that promote spindle assem- 
bly are associated with the chromosomes inside the nucleus. Nuclear envelope 
breakdown allows these proteins to carry out their important functions in spin- 
dle assembly. 
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Nuclear envelope breakdown is a complex, multi-step process that is 
thought to begin when M-Cdk phosphorylates several subunits of the giant 
nuclear pore complexes in the nuclear envelope. This initiates the disassembly 
of nuclear pore complexes and their dissociation from the envelope. M-Cdk 
also phosphorylates components of the nuclear lamina, the structural frame- 
work that lies beneath the envelope. The phosphorylation of these lamina com- 
ponents and of several inner nuclear envelope proteins leads to disassembly of 
the nuclear lamina and the breakdown of the envelope membranes into small 
vesicles. 



Microtubule Instability Increases Greatly in Mitosis 

Most animal cells in interphase contain a cytoplasmic array of microtubules 
radiating out from the single centrosome. As discussed in Chapter 16, the micro- 
tubules of this interphase array are in a state of dynamic instability, in which 
individual microtubules are either growing or shrinking and stochastically 
switch between the two states. The switch from growth to shrinkage is called a 
catastrophe, and the switch from shrinkage to growth is called a rescue (see Fig- 
ure 16-16). New microtubules are continually being created to balance the loss 
of those that disappear completely by depolymerization. 

Entry into mitosis signals an abrupt change in the cell's microtubules. The 
interphase array of few, long microtubules is converted to a larger number of 
shorter and more dynamic microtubules surrounding each centrosome. During 
prophase, and particularly in prometaphase and metaphase (see Panel 17-1), the 
half-life of microtubules decreases dramatically. This increase in microtubule 
instability, coupled with the increased ability of centrosomes to nucleate micro- 
tubules as mentioned earlier, results in remarkably dense and dynamic arrays of 
spindle microtubules that are ideally suited for capturing sister chromatids. 

M-Cdk initiates these changes in microtubule behavior, at least in part, by 
phosphorylating two classes of proteins that control microtubule dynamics (dis- 
cussed in Chapter 16). These include microtubule-dependent motor proteins 
and microtubule-associated proteins (MAPs). Experiments using cell-free 
Xenopus egg extracts, which reproduce many of the changes that occur in intact 
cells during the cell cycle, have revealed the roles of these regulators in control- 
ling microtubule dynamics. If centrosomes and fluorescent tubulin are added to 
these extracts, fluorescent microtubules nucleate from the centrosomes, and we 
can observe the behavior of individual microtubules by time-lapse fluorescence 
video microscopy. The microtubules in mitotic extracts differ from those in 
interphase extracts primarily by the increased rate of catastrophes, in which the 
microtubules switch abruptly from slow growth to rapid shortening. 

Two classes of proteins govern microtubule dynamics in mitosis. Proteins 
called catastrophe factors destabilize microtubule arrays by increasing the fre- 
quency of catastrophes (see Figure 16-16). One of these proteins is a kinesin- 
related protein that does not function as a motor. MAPs, by contrast, have the 
opposite effect, stabilizing microtubules in various ways: they can increase the fre- 
quency of rescues, in which microtubules switch from shrinkage to growth, or they 
can either increase the growth rate or decrease the shrinkage rate of microtubules. 
Thus, in principle, changes in catastrophe factors and MAPs can make micro- 
tubules much more dynamic in M phase by increasing total microtubule depoly- 
merization rates, decreasing total microtubule polymerization rates, or both. 

In Xenopus egg extracts, the balance between a single type of catastrophe 
factor and a single type of MAP determines the catastrophe rate and the steady- 
state length of microtubules (Figure 17-33). This balance, in turn, governs the 
assembly of the mitotic spindle, as microtubules that are either too long or too 
short cannot form a normal spindle. One way in which M-Cdk may control 
microtubule length is by phosphorylating this MAP and reducing its ability to 
stabilize microtubules. Even tf the activity of the catastrophe factor remained 
constant throughout the cell cycle, the balance between the two opposing activ- 
ities of the MAP and catastrophe factor would shift, increasing the dynamic 
instability of the microtubules. 
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Mitotic Chromosomes Promote Bipolar Spindle Assembly 

Chromosomes are not just passive passengers in the process of spindle assem- 
bly. By creating a local environment that favors both microtubule nucleation 
and microtubule stabilization, they play an active part in spindle formation. The 
influence of the chromosomes can be demonstrated by using a fine glass needle 
to reposition them after the spindle has formed. For some cells in metaphase, if 
a single chromosome is tugged out of alignment, a mass of new spindle micro- 
tubules rapidly appears around the newly positioned chromosome, while the 
spindle microtubules at the chromosome's former position depolymerize. This 
property of the chromosomes seems to depend, at least in part, on a guanine- 
nucleotide exchange factor (GEF) that is bound to chromatin; the GEF stimu- 
lates a small GTPase in the cytosol called Ran to bind GTP in place of GDP. The 
activated Ran-GTP, which is also involved in nuclear transport (discussed in 
Chapter 12), releases microtubule -stabilizing proteins from protein complexes 
in the cytosol, thereby stimulating the local nucleation and stabilization of 
microtubules around chromosomes. 

It is this ability of chromosomes to stabilize and organize microtubules that 
enables cells to form bipolar spindles in the absence of centrosomes, as dis- 
cussed earlier. Acentrosomal spindle assembly is thought to begin with the 
nucleation and stabilization of microtubules around the chromosomes. Motor 
proteins, particularly members of the kinesin-5 family (see Figure 17-30), then 
cross-link microtubules in an antiparallel orientation and push their minus ends 
apart. Kinesins-4 and 10 on the chromosome arms also help push the minus 
ends away from the chromosomes. Dynein, kinesin-14, and various minus-end 
binding proteins then cross-link and focus the minus ends of the microtubules 
to form the two spindle poles (Figure 17-34). 

Cells that normally lack centrosomes, such as those of higher plants and 
many animal oocytes, use this chromosome-based self-organization process to 
form spindles. It is also the process used to assemble spindles in certain insect 
embryos that have been induced to develop from eggs without fertilization (that 
is, parthenogenetically); as the sperm normally provides the centrosome when it 
fertilizes an egg (discussed in Chapter 21), the mitotic spindles in these 
parthenogenic embryos develop without centrosomes (Figure 17-35). Even in 
cells that normally contain centrosomes, the chromosomes help organize the 



Figure 17-33 Experimental evidence that 
the balance between catastrophe factors 
and MAPs influences the frequency of 
microtubule catastrophes and 
microtubule length. Interphase or mitotic 
Xenopus egg extracts were incubated with 
centrosomes and fluorescent tubulin, and 
the behavior of individual microtubules 
nucleated from the centrosomes was 
followed by fluorescence video 
microscopy. As expected, the catastrophe 
rate is higher in mitotic than in interphase 
extracts. The depletion of a specific MAP 
(called Xmap21 5) from the mitotic extracts 
increases the catastrophe rate, indicating 
that this MAP inhibits catastrophes in 
mitotic extracts. Inhibition of a specific 
catastrophe factor (the kinesin-related 
protein Mcak) greatly reduces the 
catastrophe rate in the MAP-depleted 
mitotic extracts, indicating that this factor 
is responsible for stimulating catastrophes 
in mitotic extracts. Thus, the catastrophe 
rate depends on the balance between the 
MAP and the catastrophe factor. 
Fluorescence micrographs of the asters 
formed in the different experimental 
conditions are shown in the top panels; 
note that the higher the catastrophe rates, 
the shorter the microtubules. (From 
R. Tournebize et al, Nat. Cell Biol. 2:1 3-1 9, 
2000. With permission from Macmillan 
Publishers Ltd.) 
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Figure 17-34 Spindle self-organization by motor proteins. Mitotic chromosomes stimulate the local production of 
RanGTP (not shown), which activates proteins that nucleate and promote the formation of microtubules in the vicinity of 
the chromosomes. Kinesin-5 motor proteins (see Figure 1 7-30) organize these microtubules into antiparallel bundles, while 
plus-end directed kinesins-4 and 10 link the microtubules to chromosome arms and push minus ends away from the' 
chromosomes. Dynein and kinesin-14 motors, together with numerous other proteins, focus these minus ends into a pair 
of spindle poles. 

spindle microtubules and, with the help of various motor proteins, can promote 
the assembly of a bipolar mitotic spindle if the centrosomes are removed. 
Although the resulting acentrosomal spindle can segregate chromosomes nor- 
mally, it lacks astral microtubules, which are responsible for positioning the 
spindle in animal cells; as a result, the spindle is often mispositioned, resulting 
in abnormalities in cytokinesis. 



Kinetochores Attach Sister Chromatids to the Spindle 

Following the assembly of a bipolar microtubule array, the second major step in 
spindle formation is the attachment of the array to the chromosomes. Spindle 
microtubules are attached to each sister chromatid at the kinetochore, a giant, 
multilayered protein structure that is built on the heterochromatin that forms 
at the centromeric region of the chromosome (Figure 17-36). The plus ends of 
kinetochore microtubules are embedded head-on in specialized microtubule- 
attachment sites within the kinetochore. Animal cell kinetochores contain 
10-40 of these attachment sites, whereas yeast kinetochores contain just one. 
Each attachment site contains a protein collar that surrounds the microtubule 
near its end, thereby holding the microtubule tightly to the kinetochore while 
still allowing the addition or removal of tubulin subunits at this end (Figure 
17-37). Regulation of plus end polymerization and depolymerization at the 
kinetochore is critical for the control of chromosome movement on the spindle, 
as we discuss later. 

Cells containing centrosomes employ a "search and capture" mechanism to 
attach their mitotic chromosomes to the spindle. The dynamic plus ends of 
microtubules radiate outward from the centrosomes and eventually capture 
the kinetochore of one sister chromatid. In newt lung cells, where we can 
observe the initial capture event in a microscope, the kinetochore is seen first 
to bind to the side of the microtubule and then to slide rapidly along it toward 
the centrosome. The lateral attachment to the chromosome is rapidly con- 
verted to an end-on attachment. At the same time, microtubules growing from 

Figure 1 7-35 Bipolar spindle assembly without centrosomes in 
parthenogenetic embryos of the insect Sciara (or fungus gnat). The 

microtubules are stained green, the chromosomes red. The top fluorescence 
micrograph shows a normal spindle formed with centrosomes in a normally 
fertilized Sciara embryo. The bottom micrograph shows a spindle formed 
without centrosomes in an embryo that initiated development without 
fertilization. Note that the spindle with centrosomes has an aster at each 
pole of the spindle, whereas the spindle formed without centrosomes does 
not. Both types of spindles are able to segregate the replicated 
chromosomes. (From B. de Saint Phalle and W. Sullivan, J. Cell Biol. 
141:1383-1391, 1998. With permission from The Rockefeller University Press.) 
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Figure 17-36 The kinetochore. (A) A fluorescence micrograph of a metaphase chromosome stained with a DNA-binding 
fluorescent dye and with human autoantibodies that react with specific kinetochore proteins. The two kinetochores, one 
associated with each chromatid, are stained red. (B) A drawing of a metaphase chromosome showing its two sister 
chromatids attached to the plus ends of kinetochore microtubules. Each kinetochore forms a plaque on the surface of the 
centromere. The number of microtubules bound to a metaphase kinetochore varies from 1 in budding yeast to 40 in some 
mammalian cells. (C) Electron micrograph of an anaphase chromatid with microtubules attached to its kinetochore. While 
most kinetochores have a trilaminar structure, the one shown here (from a green alga) has an unusually complex structure 
with additional layers. (A, courtesy of B.R. Brinkley; C, from J.D. Pickett-Heaps and L.C. Fowke, Aust. J. Biol. Sci. 23:71-92, 
1970. With permission from CSIRO.) 

the opposite spindle pole attach to the kinetochore on the opposite side of the 
chromosome, forming a bipolar attachment (Figure 17-38). 

How does chromosome attachment occur in the absence of centrosomes? 
One possibility is that short microtubules in the vicinity of the chromosomes 
interact with kinetochores and become embedded in the plus-end-binding col- 
lars of the kinetochore. Polymerization at these plus ends would then result in 
growth of the microtubules away from the kinetochore. The minus ends of these 
kinetochore microtubules, like other minus ends in centrosome-free spindles, 
would eventually become cross-linked to other minus ends and focused by 
motor proteins at the spindle pole (see Figure 17-34). 

Bi-Orientation Is Achieved by Trial and Error 

The success of mitosis demands that sister chromatids in a pair attach to oppo- 
site poles of the mitotic spindle, so that they move to opposite ends of the cell 
when they separate in anaphase. How is this mode of attachment, called bi-ori- 
entation, achieved? What prevents the attachment of both kinetochores to the 
same spindle pole or the attachment of one kinetochore to both spindle poles? 
Part of the answer is that sister kinetochores are constructed in a back-to-back 
orientation that reduces the likelihood that both kinetochores can face the same 
spindle pole. Nevertheless, incorrect attachments do occur, and elegant regula- 
tory mechanisms have evolved to correct them. 
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Figure 17-37 A microtubule attachment 
site in a kinetochore. Each site is thought 
to contain a collar structure (yellow) that 
surrounds the microtubule plus end, 
allowing polymerization and 
depolymerization to occur at the 
exposed plus end while the microtubule 
remains attached to the kinetochore. 
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Incorrect attachments are corrected by a system of trial and error that is 
based on a simple principle: incorrect attachments are highly unstable and do 
not last, while correct attachments are locked in place. But, how does the kine- 
tochore sense a correct attach men tV The answer appears to be tension (Figure 
17-39). When a sister chromatid pair is properly bi-oriented on the spindle, the 
two kinetochores are pulled in opposite directions by strong poleward forces. 
Sister-dim mat id cohesion resists these poleward forces, creating high levels of 
tension within the kinetochores. When chromosomes are incorrectly attached— 
when both sister chromatids are attached to the same spindle pole, for exam- 
ple— tension is low and the kinetochore sends an inhibitory signal that loosens 
the grip of its microtubule attachment site, allowing detachment to occur. When 
bi-orientation occurs, the high tension at the kinetochore shuts off the 
inhibitory signal, strengthening microtubule attachment. In animal cells, ten- 
sion not only increases the affinity of the attachment site but also leads to the 
attachment of additional microtubules to the kinetochore. This results in the 
formation of a thick kinetochore fiber composed of multiple microtubules. 

The tension-sensing mechanism depends on the protein kinase aurora-B, 
which is associated with the kinetochore. Aurora-B is thought to generate the 
inhibitory signal that reduces the strength of microtubule attachment in the 
absence of tension. It phosphorylates several components of the microtubule 
attachment site, decreasing the site's affinity for a microtubule plus end. Aurora - 
B is inactivated when bi-orientation occurs, thereby reducing kinetochore phos- 
phorylation and increasing the affinity of the attachment site. 



Figure 1 7-38 The capture of centrosome 
microtubules by kinetochores. The red 

arrow in (A) indicates the direction of 
microtubule growth, while the gray arrow 
in (C) indicates the direction of 
chromosome sliding. 
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Figure 1 7-39 Alternative forms of 
chromosome attachment. Initially, a 
single microtubule from a spindle pole 
binds to one kinetochore in a sister 
chromatid pair. Additional microtubules 
can then bind to the chromosome in 
various ways. A microtubule from the 
same spindle pole can attach to the other 
sister kinetochore, or microtubules from 
both spindle poles can attach to one 
kinetochore. These incorrect attachments 
are unstable, however, so that one of the 
two microtubules tends to dissociate. 
When a second microtubule from the 
opposite pole binds to the second 
kinetochore, the sister kinetochores are 
thought to sense tension across their 
microtubule-binding sites, which triggers 
an increase in microtubule binding 
affinity. This correct attachment is 
thereby locked in place. 
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Following their attachment to the two spindle poles, the chromosomes are 
tugged back and forth, eventually assuming a position equidistant between the 
two spindle poles, a position called the metaphase plate. In vertebrate cells, the 
chromosomes then oscillate gently at the metaphase plate, awaiting the signal 
for the sister chromatids to separate. The signal is produced, with a predictable 
lag time, after the bipolar attachment of the last of the chromosomes, as we dis- 
cuss later. 



Multiple Forces Move Chromosomes on the Spindle 

Motor proteins and other mechanisms generate the forces that move chromo- 
somes on the microtubules of the mitotic spindle. Three major forces are 
thought to be particularly important. 

The first major force pulls the kinetochore and its associated chromosome 
along the kinetochore microtubule toward the spindle pole. It is produced by 
proteins at the kinetochore itself. By an uncertain mechanism, depolymeriza- 
tion at the plus end of the microtubule somehow generates a force that pulls the 
kinetochore poleward (Figure 17-40). This force pulls on chromosomes during 
prometaphase and metaphase and is particularly important for moving sister 
chromatids toward the poles after they separate in anaphase, as we discuss later. 
Interestingly, this kinetochore-generated poleward force does not require ATR 
This might seem implausible at first, but it has been shown that purified kineto- 
chores in a test tube, with no ATP present, can remain attached to depolymeriz- 
ing microtubules and thereby move. The energy that drives the movement is 
stored in the microtubule and is released when the microtubule depolymerizes; 
it ultimately comes from the hydrolysis of GTP that occurs after a tubulin sub- 
unit adds to the end of a microtubule (discussed in Chapter 16). 

A second poleward force is provided in some cell types by microtubule flux, 
whereby the microtubules themselves are moved toward the spindle poles and 
dismantled at their minus ends. Until the onset of anaphase, the addition of new 
tubulin at the plus end of a microtubule compensates for the loss of tubulin at 
the minus end, so that microtubule length remains constant despite the move- 
ment of microtubules toward the spindle pole. Microtubule flux in metaphase 
spindles can be seen by an ingenious method in which very small amounts of 
fluorescent tubulin are injected into living cells (Figure 17-41). This results in 
the appearance of tiny fluorescent speckles that travel toward the poles on both 
kinetochore and interpolar microtubules. Any kinetochore that is attached to a 
microtubule undergoing such flux experiences a poleward force, which con- 
tributes to the generation of tension at the kinetochore and the poleward move- 
ment of sister chromatids after they separate in anaphase. 

A third force acting on chromosomes is the polar ejection force. Plus-end- 
directed kinesin-4 and 10 motors on chromosome arms interact with interpolar 
microtubules and transport the chromosomes away from the spindle poles. This 
force is particularly important in prometaphase and metaphase, when it helps 
align the bi-oriented sister chromatid pairs at the metaphase plate (Figure 
17-42). 
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Figure 17-40 How depolymerization 
may pull the kinetochore toward the 
spindle pole. When depolymerization 
occurs, the protofilaments of the 
microtubule curl outward (see Figure 
16-16) and push against the collar 
structure that surrounds the microtubule 
plus end. In principle, this will move the 
kinetochore toward the microtubule 
minus end at the spindle pole. 
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Figure 1 7-41 Microtubule flux in the metaphase spindle. (A) To observe 
microtubule flux, a very small amount of fluorescent tubulin is injected into 
living cells so that individual microtubules form with a very small 
proportion of fluorescent tubulin. Such microtubules have a speckled 
appearance when viewed by fluorescence microscopy. (B) Fluorescence 
micrographs of a mitotic spindle in a living newt lung epithelial cell. The 
chromosomes are colored brown, and the tubulin speckles are red. (C) The 
movement of individual speckles can be followed by time-lapse video 
microscopy. Images of the long, thin, rectangular, boxed region (arrow) in 
(B) were taken at sequential times and pasted side by side to make a 
montage of the region over time. Individual speckles can be seen to move 
toward the poles at a rate of about 0.75 (im/min, indicating that the 
microtubules are moving poleward. (D) Microtubule length in the 
metaphase spindle does not change significantly because new tubulin 
subunits are added at the microtubule plus end at the same rate as tubulin 
subunits are removed from the minus end. (B and C, from T.J. Mitchison 
and E.D. Salmon, Nat Cell Biol. 3:E1 7-21, 2001 . With permission from 
Macmillan Publishers Ltd.) 
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One of the most striking aspects of mitosis in vertebrate cells is the continu- 
ous oscillatory movement of the chromosomes in prometaphase and 
metaphase. When studied by video microscopy in newt lung cells, the move- 
ments are seen to switch between two states — a poleward (P) state, when the 
chromosomes are pulled toward the pole, and an away- from- the-pole (AP), or 
neutral, state, when the poleward forces are turned off and the polar ejection 
force pushes the chromosomes away from the pole. The switch between the two 
states may depend on the degree of tension in the kinetochore. It has been pro- 
posed, for example, that as chromosomes move toward the spindle pole, an 
increasing polar ejection force generates tension in the kinetochore nearest the 
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Figure 17-42 How opposing forces may 
drive chromosomes to the metaphase 
plate. (A) Evidence for a polar ejection 
force that pushes chromosomes away 
from the spindle poles toward the 
spindle equator. In this experiment, a 
laser beam severs a prometaphase 
chromosome that is attached to a single 
pole by kinetochore microtubules. The 
part of the severed chromosome without 
a kinetochore is pushed rapidly away 
from the pole, whereas the part with the 
kinetochore moves toward the pole, 
reflecting a decreased repulsion. 
(B) A model of how two opposing forces 
may cooperate to move chromosomes to 
the metaphase plate. Plus-end-directed 
motor proteins (kinesin-4 and kinesin-1 0) 
on the chromosome arms are thought to 
interact with microtubules to generate 
the polar ejection force, which pushes 
chromosomes toward the spindle 
equator {see Figure 17-30). Poleward 
forces generated by depolymerization at 
the kinetochore, together with 
microtubule flux, are thought to pull 
chromosomes toward the pole. 
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Figure 17-43 Sister-chromatid separation at anaphase. In the transition from metaphase (A) to anaphase (B), 
sister chromatids suddenly separate and move toward opposite poles of the mitotic spindle — as shown in these light 
micrographs of Haemanthus (lily) endosperm cells that were stained with gold-labeled antibodies against tubulin. 
(Courtesy of Andrew Bajer.) 

pole, triggering a switch to the away-from-the-pole state and gradually resulting 
in the accumulation of chromosomes at the equator of the spindle. 

The APC/C Triggers Sister-Chromatid Separation and the 
Completion of Mitosis 

After M-Cdk has triggered the complex rearrangements that occur in early mito- 
sis, the cell cycle reaches its climax with the separation of the sister chromatids 
at the metaphase -to -anaphase transition (Figure 17-43). Although M-Cdk activ- 
ity sets the stage for this event, the anaphase-promoting complex (APC/C) dis- 
cussed earlier throws the switch that initiates sister-chromatid separation by 
ubiquitylating several mitotic regulatory proteins and thereby triggering their 
destruction (see Figure 17-20A). 

During metaphase, cohesins holding the sister chromatids together resist 
the poleward forces that pull the sister chromatids apart. Anaphase begins with 
a sudden disruption of sister-chromatid cohesion, which allows the sisters to 
separate and move to opposite poles of the spindle. The APC/C initiates the pro- 
cess by targeting the inhibitory protein securin for destruction. Before 
anaphase, securin binds to and inhibits the activity of a protease called sepa- 
rase. The destruction of securin at the end of metaphase releases separase, 
which is then free to cleave one of the subunits of cohesin. The cohesins fall 
away, and the sister chromatids abruptly and synchronously separate (Figure 
17-44). 

In addition to securin, the APC/C also targets the S- and M-cyclins for 
destruction, leading to the loss of most Cdk activity in anaphase. Cdk inactiva- 
tion allows phosphatases to dephosphorylate the many Cdk target substrates in 
the cell, as required for the completion of mitosis and cytokinesis (discussed 
later). 

If the APC/C triggers anaphase, what activates the APC/C? The answer is 
only partly known. As mentioned earlier, APC/C activation requires the protein 
Cdc20, which binds to and activates the APC/C in mitosis (see Figure 17-20A). At 
least two processes regulate Cdc20 and its association with the APC/C. First, 
Cdc20 synthesis increases as the cell approaches mitosis, owing to an increase in 
the transcription of its gene. Second, phosphorylation of the APC/C helps Cdc20 
bind to the APC/C, thereby helping to create an active complex. Among the 
kinases that phosphorylate and thus activate the APC/C is M-Cdk. Thus, M-Cdk 
not only triggers the early mitotic events leading up to metaphase, but it also 
sets the stage for progression into anaphase. The ability of M-Cdk to promote 
Cdc20-APC/C activity creates a negative feedback loop: M-Cdk sets in motion a 
regulatory process that leads to cyclin destruction and thus its own inactivation. 
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Figure 17-44 The initiation of sister- 
chromatid separation by the APC/C. The 

activation of APC/C by Cdc20 leads to the 
ubiquitylation and destruction of securin, 
which normally holds separase in an 
inactive state. The destruction of securin 
allows separase to cleave Scci, a subunit 
of the cohesin complex holding the sister 
chromatids together (see Figure 17-24). 
The pulling forces of the mitotic spindle 
then pull the sister chromatids apart. In 
animal cells, phosphorylation by Cdks 
also inhibits separase {not shown). Thus, 
Cdk inactivation in anaphase (resulting 
from cyclin destruction) also promotes 
separase activation by allowing its 
dephosphorylation. 
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Unattached Chromosomes Block Sister-Chromatid Separation: 
The Spindle Assembly Checkpoint 

Cells usually spend about half of mitosis in metaphase, with the chromosomes 
aligned on the metaphase plate, jostling about, awaiting the APC/C signal that 
induces sister chromatids to separate. Drugs that destabilize microtubules, such 
as colchicine or vinblastine (discussed in Chapter 16), arrest cells in mitosis for 
hours or even days. This observation led to the identification of a spindle assem- 
bly checkpoint mechanism that is activated by the drug treatment and blocks 
progression through the metaphase-to-anaphase transition. The checkpoint 
mechanism ensures that cells do not enter anaphase until all chromosomes are 
correctly bi-oriented on the mitotic spindle. 

The spindle assembly checkpoint depends on a sensor mechanism that 
monitors the strength of microtubule attachment, and possibly tension, at the 
kinetochore. Any kinetochore that is not properly attached to the spindle sends 
out a negative signal that blocks Cdc20-APC/C activation and thus blocks the 
metaphase-to-anaphase transition. Only when the last kinetochore is properly 
attached is this block removed, allowing sister-chromatid separation to occur. 

It is thought that inappropriately attached kinetochores somehow generate 
a diffusible signal that inhibits Cdc20-APC/C activity throughout the cell. The 
molecular basis of this signal is not clear, although several proteins, including 
Mad2, are recruited to unattached kinetochores and are required for the spindle 
assembly checkpoint to function (Figure 17-45). One appealing possibility, 
based primarily on detailed structural analyses of Mad2, is that the unattached 
kinetochore acts like an enzyme that catalyzes a change in the conformation of 
Mad2, so that Mad2 can bind and inhibit Cdc20-APC/C. 

In mammalian somatic cells, the spindle assembly checkpoint determines 
the normal timing of anaphase. The destruction of securin in these cells begins 
moments after the last sister chromatid pair becomes bi-oriented on the spin- 
dle, and anaphase begins about 20 minutes later. Experimental inhibition of the 
checkpoint mechanism causes premature sister-chromatid separation and 
anaphase. Surprisingly, the normal timing of anaphase does not depend on the 
spindle assembly checkpoint in some cells, such as yeasts and the cells of early 
frog and fly embryos. Some other mechanism, as yet unknown, must determine 
the timing of anaphase in these cells. 




Figure 17-45 Mad2 protein on 
unattached kinetochores. This 

fluorescence micrograph shows a 
mammalian cell in prometaphase, with 
the mitotic spindle in green and the sister 
chromatids in blue. One sister chromatid 
pair is attached to only one pole of the 
spindle. Staining with anti-Mad2 
antibodies indicates that Mad2 is bound 
to the kinetochore of the unattached 
sister chromatid {red dot, indicated by red 
arrow). A small amount of Mad2 is 
associated with the kinetochore of the 
sister chromatid that is attached to the 
spindle pole {pale dot, indicated by white 
arrow). (From J.C. Waters et al., J. Cell Biol. 
141:1181-1191, 1998. With permission 
from The Rockefeller University Press.) 
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Chromosomes Segregate in Anaphase A and B 

The sudden loss of sister-chromatid cohesion at the onset of anaphase leads to 
sister-chromatid separation, which allows the forces of the mitotic spindle to 
pull the sisters to opposite poles of the cell— called chromosome segregation. The 
chromosomes move by two independent and overlapping processes. The first, 
referred to as anaphase A, is the initial poleward movement of the chromo- 
somes, which is accompanied by shortening of the kinetochore microtubules. 
The second, referred to as anaphase B, is the separation of the spindle poles 
themselves, which begins after the sister chromatids have separated and the 
daughter chromosomes have moved some distance apart (Figure 17-46). 

Chromosome movement in anaphase A depends on a combination of the 
two major poleward forces described earlier. The first is the force generated by 
microtubule depolymerization at the kinetochore, which results in the loss of 
tubulin subunits at the plus end as the kinetochore moves toward the pole. The 
second is provided by microtubule flux, which is the poleward movement of the 
microtubules toward the spindle pole, where minus end depolymerization 
occurs. The relative importance of these two forces during anaphase varies in 
different cell types: in embryonic cells, chromosome movement depends mainly 
on microtubule flux, for example, whereas movement in yeast and vertebrate 
somatic cells results primarily from forces generated at the kinetochore. 

Spindle pole separation during anaphase B depends on motor-driven mech- 
anisms similar to those that separate the two centrosomes in early mitosis (see 
Figure 17-30). Plus-end directed kinesin-5 motor proteins, which cross-link the 
overlapping plus ends of the interpolar microtubules, push the poles apart. In 
addition, dynein motors that anchor astral microtubule plus ends to the cell cor- 
tex pull the poles apart. 

Although sister-chromatid separation initiates the chromosome movements 
of anaphase A, other mechanisms also ensure correct chromosome movements 
in anaphase A and spindle elongation in anaphase B. Most importantly, the 
completion of a normal anaphase depends on the dephosphorylation of Cdk 
substrates, which in most cells results from the APC/C-dependent destruction of 
cyclins. If M-cyclin destruction is prevented— by the production of a mutant 
form that is not recognized by the APC/C, for example— sister-chromatid sepa- 
ration generally occurs but the chromosome movements and microtubule 
behavior of anaphase are abnormal. 
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Figure 1 7-46 The major forces that 
separate sister chromatids at anaphase 
in mammalian cells. Chromosome 
movement toward the poles in anaphase 
A depends on the depolymerization of 
kinetochore microtubules and poleward 
microtubule flux. In anaphase B, the two 
spindle poles move apart. Two separate 
forces are thought to be responsible for 
anaphase B: the elongation and sliding of 
the interpolar microtubules past one 
another in the central spindle push the 
two poles apart, and motor proteins 
attached to the plasma membrane near 
each spindle pole act on astral 
microtubules to pull the poles away from 
each other, toward the cell surface. 
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The relative contributions of anaphase A and anaphase B to chromosome 
segregation vary greatly, depending on the cell type. In mammalian cells, 
anaphase B begins shortly after anaphase A and stops when the spindle is about 
twice its metaphase length; in contrast, the spindles of yeasts and certain proto- 
zoa primarily use anaphase B to separate the chromosomes at anaphase, and 
their spindles elongate to up to 15 times the metaphase length in the process. 

Segregated Chromosomes Are Packaged in Daughter Nuclei at 
Telophase 

By the end of anaphase, the daughter chromosomes have segregated into two 
equal groups at opposite ends of the cell. In telophase, the final stage of mitosis, 
the two sets of chromosomes are packaged into a pair of daughter nuclei. The 
first major event of telophase is the disassembly of the mitotic spindle, followed 
by the re-formation of the nuclear envelope. Initially, nuclear membrane frag- 
ments associate with the surface of individual chromosomes. These membrane 
fragments fuse to partly enclose clusters of chromosomes and then coalesce to 
re-form the complete nuclear envelope. Nuclear pore complexes are incorpo- 
rated into the envelope, the nuclear lamina re-forms, and the envelope once 
again becomes continuous with the endoplasmic reticulum. Once the nuclear 
envelope has re-formed, the pore complexes pump in nuclear proteins, the 
nucleus expands, and the condensed mitotic chromosomes are reorganized into 
their interphase state, allowing gene transcription to resume. A new nucleus has 
been created, and mitosis is complete. All that remains is for the cell to complete 
its division into two. 

We saw earlier that phosphorylation of various proteins by M-Cdk promotes 
spindle assembly, chromosome condensation, and nuclear envelope break- 
down in early mitosis. It is thus not surprising that the dephosphorylation of 
these same proteins is required for spindle disassembly and the re-formation of 
daughter nuclei in telophase. In principle, these dephosphorylations and the 
completion of mitosis could be triggered by the inactivation of Cdks, the activa- 
tion of phosphatases, or both. Although Cdk inactivation — resulting primarily 
from cyclin destruction — is mainly responsible in most cells, some cells also rely 
on activation of phosphatases. In budding yeast, for example, the completion of 
mitosis depends on the activation of a phosphatase called Cdcl4, which dephos- 
phorylates a subset of Cdk substrates involved in anaphase and telophase. 

Meiosis Is a Special Form of Nuclear Division Involved in Sexual 
Reproduction 

Most eucaryotic organisms reproduce sexually: the genomes of two parents mix 
to generate offspring that are genetically distinct from either parent (discussed 
in Chapter 21). The cells of these organisms are generally diploid: that is, they 
contain two slightly different copies, or homologs, of each chromosome, one 
from each parent. Sexual reproduction depends on a specialized nuclear divi- 
sion process called meiosis, which produces haploid cells carrying only a single 
copy of each chromosome. In many organisms, the haploid cells differentiate 
into specialized reproductive cells called gametes — eggs and sperm in most 
species. In these species, the reproductive cycle ends when a sperm and egg fuse 
to form a diploid zygote with the potential to form a new individual. Here, we 
consider the basic mechanisms and regulation of meiosis, with an emphasis on 
how they compare with those of mitosis. Meiosis is discussed in more detail in 
Chapter 21. 

Meiosis begins with a round of chromosome duplication, called meiotic S 
phase, followed by two rounds of chromosome segregation, called meiosis /and 
II. Meiosis I segregates the homologs (each composed of a tightly linked pair of 
sister chromatids). Meiosis II, like conventional mitosis, segregates the sister 
chromatids of each homolog (Figure 17-47). 

The first meiotic division solves the central problem of meiosis: how to seg- 
regate the homologous chromosomes. Like sister- chromatid segregation in 
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Figure 1 7-47 Comparison of 
meiosis and the mitotic cell cycle. 

(A) Meiosis is a form of nuclear 
division in which a single round of 
chromosome duplication (meiotic 
S phase) is followed by two rounds 
of chromosome segregation. The 
first round, meiosis I, segregates the 
homologs, whereas the second 
round, meiosis II, segregates the 
sister chromatids. (B) In a mitotic cell 
cycle, by contrast homologs do not 
pair up, and the sister chromatids 
segregate in a single division. 
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mitosis, homolog segregation in meiosis I depends on the formation of linkages 
between homologs. These linkages allow the homolog pairs to be bi-oriented on 
the first meiotic spindle, with the homologs in a pair attached to opposite poles. 
Homolog linkage is removed at the onset of anaphase I, allowing the spindle to 
pull the homologs to opposite ends of the cell. 

Linkages form between homologs by a remarkably complex and lengthy 
process that occurs after meiotic S phase, in a period called meiotic prophase or 
prophase I. This process begins with homolog pairing, whereby the homologs 
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gradually move close to each other in the nucleus, primarily as a result of inter- 
actions between complementary DNA sequences in the two homologs. 
Homolog linkages are then locked in place by homologous recombination 
between nonsister chromatids in each homolog pair: DNA double-strand breaks 
are formed at several locations in each sister chromatid, resulting in large num- 
bers of DNA recombination events between the homologs. Some of these events 
lead to reciprocal DNA exchanges called crossovers, where the DNA of a chro- 
matid crosses over to become continuous with the DNA of a homologous chro- 
matid (Figure 17-48). At least one of these crossovers occurs in each homolog 
pair, ensuring that the homologs in every pair are physically connected when 
the cell enters the first meiotic division. 

Another uniquely meiotic problem must be solved when the homolog pairs 
are attached to the first meiotic spindle. Each homolog contains two tightly 
linked sister chromatids, and thus the attachment of a homolog to a spindle pole 
requires that both sister kinetochores in a homolog attach to the same pole. This 
type of attachment is normally avoided during mitosis (see Figure 17-39). In 
meiosis I, however, the two sister kinetochores are somehow fused into a single 
microtubule-binding unit that attaches to just one pole (see Figure 21-12A). 
These mechanisms are reversed after meiosis I, so that in meiosis II the sister 
chromatid pairs can be bi-oriented on the spindle as they are in mitosis. 

Crossovers hold homolog pairs together only because the arms of the sister 
chromatids are connected by sister- chromatid cohesion (see Figure 17-48). The 
loss of cohesion from sister- chromatid arms therefore triggers homolog separa- 
tion at the onset of anaphase I, In most species, the loss of arm cohesion in 
meiosis I depends on APC/C activation, which leads to securin destruction, sep- 
arase activation, and cohesin cleavage along the arms (see Figure 17-44). In con- 
trast to mitosis, however, cohesin complexes near the centromeres remain 
uncleaved in meiosis I because cohesin in that region is protected from separase 
(discussed in Chapter 21). Sister- chromatid pairs therefore remain linked at 
their centromeres throughout meiosis I, allowing their correct bi-orientation on 
the spindle in meiosis II. The mechanisms that block cohesin cleavage at the 
centromere in meiosis I are removed in meiosis II. At the onset of anaphase II, 
APC/C activation therefore triggers centromeric cohesin cleavage and sister- 
chromatid separation — much as it does in mitosis. 



centromere 
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Figure 17-48 A crossover between 
homologs. As in mitosis, the sister 
chromatids in each homolog are tightly 
connected along their entire lengths. 
A single crossover has occurred between 
two nonsister chromatids in this example, 
but either of the two chromatids of a 
homolog can form a crossover with either 
chromatid of the other homolog, and it is 
usual for multiple crossovers to be 
formed. 



Summary 

M-Cdk triggers the events of early mitosis, including chromosome condensation, 
assembly of the mitotic spindle, and bipolar attachment of the sister chromatid pairs 
to microtubules of the spindle. Spindle formation in animal cells depends largely on 
the ability of mitotic chromosomes to stimulate local microtubule nucleation and sta- 
bility, as well as on the ability of motor proteins to organize microtubules into a bipo- 
lar array. Many cells also use centrosomes to facilitate spindle assembly. Anaphase is 
triggered by the APC/C, which stimulates the destruction of the proteins that hold the 
sister chromatids together. APC/C also promotes cyclin destruction and thus the inac- 
tivation ofM-Cdk. The resulting dephosphorylation ofCdk targets is required for the 
events that complete mitosis, including the disassembly of the spindle and the re-for- 
mation of the nuclear envelope. Meiosis is a specialized form of nuclear division in 
which a single round of chromosome duplication is followed by two rounds of chro- 
mosome segregation, resulting in the formation ofhaploid nuclei. 



CYTOKINESIS 



The final step in the cell cycle is cytokinesis, the division of the cytoplasm. In a 
typical cell, cytokinesis accompanies every mitosis, although some cells, such as 
early Drosophila embryos (discussed later) and some mammalian hepatocytes 
and heart muscle cells, undergo mitosis without cytokinesis and thereby acquire 
multiple nuclei. In most animal cells, cytokinesis begins in anaphase and ends 
shortly after the completion of mitosis in telophase. 
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The first visible change of cytokinesis in an animal cell is the sudden appear- 
ance of a pucker, or cleavage furrow, on the cell surface. The furrow rapidly 
deepens and spreads around the cell until it completely divides the cell in two. 
In animal cells and many unicellular eucaryotes, the structure underlying this 
process is the contractile ring— a dynamic assembly composed of actin fila- 
ments, myosin II filaments, and many structural and regulatory proteins. Dur- 
ing anaphase, the ring assembles just beneath the plasma membrane (Figure 
17-49; see also Panel 17-1). The ring gradually contracts, and, at the same time, 
fusion of intracellular vesicles with the plasma membrane inserts new mem- 
brane adjacent to the ring. This addition of membrane compensates for the 
increase in surface area that accompanies cytoplasmic division. When ring con- 
traction is completed, membrane insertion and fusion seal the gap between the 
daughter cells. Thus, cytokinesis can be considered to occur in four stages — ini- 
tiation, contraction, membrane insertion, and completion. 
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Figure 1 7-49 Cytokinesis. (A) The actin- 
myosin bundles of the contractile ring are 
oriented as shown, so that their 
contraction pulls the membrane inward. 
(B) In this low-magnification scanning 
electron micrograph of a cleaving frog 
egg, the cleavage furrow is especially 
obvious and well defined, as the cell is 
unusually large. The furrowing of the cell 
membrane is caused by the activity of the 
contractile ring underneath it. (C)The 
surface of a furrow at higher 
magnification. {B and C, from H.W. Beams 
and R.G. Kessel, Am. Sci. 64:279-290, 1 976. 
With permission from Sigma Xi.) 



Actin and Myosin II in the Contractile Ring Generate the Force for 
Cytokinesis 

In interphase cells, actin and myosin filaments form a cortical network underly- 
ing the plasma membrane. In some cells, they also form large cytoplasmic bun- 
dles called stress fibers (discussed in Chapter 16). As cells enter mitosis, these 
arrays of actin and myosin disassemble; much of the actin reorganizes, and 
myosin II filaments are released. As the sister chromatids separate in anaphase, 
actin and myosin II begin to accumulate in the rapidly assembling contractile 
ring (Figure 17-50), which also contains numerous other proteins that provide 

remaining interpolar microtubules 
from central spindle 
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myosin filaments in cleavage furrow 
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Figure 1 7-50 The contractile ring. (A) A drawing of the cleavage furrow in a dividing cell. (B) An electron micrograph of the ingrowing edge of 
a cleavage furrow of a dividing animal cell. (C) Fluorescence micrographs of a dividing slime mold amoeba stained for actin (red) and myosin II 
(green). Whereas all of the visible myosin II has redistributed to the contractile ring, only some of the actin has done so; the rest remains in the 
cortex of the nascent daughter cells. (B, from H.W. Beams and R.G. Kessel, Am. Sci. 64:279-290, 1 976. With permission from Sigma Xi; C, courtesy 
ofYoshio Fukui.) 
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Figure 17-51 The midbody. (A) A scanning 
electron micrograph of a cultured animal 
cell in the process of dividing; the midbody 
still joins the two daughter cells. (B) A 
conventional electron micrograph of the 
midbody of a dividing animal cell. 
Cleavage is almost complete, but the 
daughter cells remain attached by this thin 
strand of cytoplasm containing the 
remains of the central spindle. (A, courtesy 
of Guenter Albrecht-Buehler; B, courtesy of 
J.M.Mullins.) 
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structural support or assist in ring assembly. Assembly of the contractile ring 
results in part from the local formation of new actin filaments, which depends 
on formin proteins that nucleate the assembly of parallel arrays of linear, 
unbranched actin filaments (discussed in Chapter 16). After anaphase, the over- 
lapping arrays of actin and myosin II filaments contract to generate the force 
that divides the cytoplasm in two. Once contraction begins, the ring exerts a 
force large enough to bend a fine glass needle that is inserted in its path. As the 
ring constricts, it maintains the same thickness, suggesting that its total volume 
and the number of filaments it contains decrease steadily. Moreover, unlike actin 
in muscle, the actin filaments in the ring are highly dynamic, and their arrange- 
ment changes continually during cytokinesis. 

The contractile ring is finally dispensed with altogether when cleavage ends, 
as the plasma membrane of the cleavage furrow narrows to form the midbody. 
The midbody persists as a tether between the two daughter cells and contains 
the remains of the central spindle, a large protein structure derived from the 
antiparallel interpolar microtubules of the spindle midzone, packed tightly 
together within a dense matrix material (Figure 17-51). After the daughter cells 
separate completely, some of the components of the residual midbody often 
remain on the inside of the plasma membrane of each cell, where they may 
serve as a mark on the cortex that helps to orient the spindle in the subsequent 
cell division. 



Local Activation of RhoA Triggers Assembly and Contraction of 
the Contractile Ring 

RhoA, a small GTPase of the Ras superfamily (see Table 15-5), controls the assem- 
bly and function of the contractile ring at the site of cleavage. RhoA is activated at 
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the cell cortex at the future division site, where it promotes actin filament forma- 
tion, myosin II assembly, and ring contraction. It promotes actin filament forma- 
tion by activating formins, and it promotes myosin II assembly and contractions 
by activating multiple protein kinases, including the Rho- activated kinase Rock 
(Figure 17-52). These kinases phosphorylate the regulatory myosin light chain 
(RMLC), which is one of the subunits of myosin II. Phosphorylation of the RMLC 
stimulates bipolar myosin II filament formation and motor activity, thereby pro- 
moting the assembly and contraction of the actin-myosin ring. 

Like other GTPases, RhoA is inactive when bound to GDP and active when 
bound to GTP (discussed in Chapter 15). The local activation of RhoA at the 
cleavage furrow is thought to depend on a Rho guanine nucleotide exchange 
factor (RhoGEF), which is found at the cell cortex at the future division site and 
stimulates the release of GDP and binding of GTP to RhoA. We know little about 
how the RhoGEF is localized or activated at the division site, although the micro- 
tubules of the anaphase spindle seem to be involved, as we discuss next. 



The Microtubules of the Mitotic Spindle Determine the Plane of 
Animal Cell Division 

The central problem in cytokinesis is how to ensure that division occurs at the 
right time and in the right place. Cytokinesis must occur only after the two sets 
of chromosomes are fully segregated from each other, and the site of division 
must be placed between the two sets of daughter chromosomes, thereby ensur- 
ing that each daughter cell receives a complete set. The correct timing and posi- 
tioning of cytokinesis in animal cells are achieved by elegant mechanisms that 
depend on the mitotic spindle. During anaphase, the spindle generates signals 
that initiate furrow formation at a position midway between the spindle poles, 
thereby ensuring that division occurs between the two sets of separated chro- 
mosomes. Because these signals originate in the anaphase spindle, this mech- 
anism also contributes to the correct timing of cytokinesis in late mitosis. 
Cytokinesis also occurs at the correct time because dephosphorylation of Cdk 
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Figure 1 7-52 Regulation of the 
contractile ring by the GTPase RhoA. 

Like other Rho family GTPases, RhoA is 
activated by a RhoGEF protein and 
inactivated by a RhoGAP protein. The 
active GTP-bound form of RhoA is 
focused at the future cleavage site. By 
binding formins, activated RhoA 
promotes the assembly of actin filaments 
in the contractile ring. By activating Rho- 
activated protein kinases, such as Rock, it 
stimulates myosin II filament formation 
and activity, thereby promoting 
contraction of the ring. 
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(A) astral stimulation model (B) central spindle stimulation model (C) astral relaxation model 



substrates, which depends on cyclin destruction in metaphase and anaphase, 
initiates cytokinesis. We now describe these regulatory mechanisms in more 
detail, with an emphasis on cytokinesis in animal cells. 

Studies of the fertilized eggs of marine invertebrates first revealed the 
importance of spindle microtubules in determining the placement of the con- 
tractile ring. After fertilization, these embryos cleave rapidly without intervening 
periods of growth. In this way, the original egg is progressively divided up into 
smaller and smaller cells. Because the cytoplasm is clear, the spindle can be 
observed in real time with a microscope. If the spindle is tugged into a new posi- 
tion with a fine glass needle in early anaphase, the incipient cleavage furrow dis- 
appears, and a new one develops in accord with the new spindle site — support- 
ing the idea that signals generated by the spindle induce local furrow formation. 

How does the mitotic spindle specify the site of division? Three general 
mechanisms have been proposed, and most cells appear to employ a combina- 
tion of these (Figure 17-53). The first is termed the astral stimulation model, 
which postulates that the astral microtubules carry furrow-inducing signals to 
the cell cortex, where they are somehow focused into a ring halfway between the 
spindle poles. Evidence for this model comes from ingenious experiments in 
large embryonic cells, which demonstrate that a cleavage furrow forms midway 
between two asters, even when the two centrosomes nucleating the asters are 
not connected to each other by a mitotic spindle (Figure 17-54). 

A second possibility, called the central spindle stimulation model is that the 
spindle midzone, or central spindle, generates a furrow- inducing signal that spec- 
ifies the site of furrow formation at the cell cortex. The overlapping interpolar 
microtubules of the central spindle associate with numerous signaling proteins, 
including proteins that may stimulate RhoA (Figure 17-55). Defects in the func- 
tion of these proteins (in Drosophila mutants, for example) result in failure of 
cytokinesis. 

A third model proposes that, in some cell types, the astral microtubules pro- 
mote the local relaxation of actin-myosin bundles at the cell cortex. According 
to this astral relaxation model, the cortical relaxation is minimal at the spindle 
equator, thus promoting cortical contraction at that site. In the early embryos of 



chromosomes 



\ 



centrosome 
/ 



glass bead 



i ■ 

mm" 

-J! W f 

dividing egg cell 





a glass bead pushed 
into the cell displaces 
the spindle 



furrow forms only 
on one side of cell, 
producing a binucleate 
egg 








both nuclei 
enter mitosis 



cleavage occurs both between the centrosomes 
linked by mitotic spindles and between the two 
centrosomes that are simply adjacent, and 
four daughter cells are formed 



Figure 1 7-53 Three current models of 
how the microtubules of the anaphase 
spindle generate signals that influence 
the positioning of the contractile ring. 

No single model explains all the 
observations, and it is likely that furrow 
positioning is determined by a 
combination of these mechanisms, with 
the importance of the different 
mechanisms varying in different 
organisms. 



Figure 1 7-54 An experiment 
demonstrating the influence of the 
position of microtubule asters on the 
subsequent plane of cleavage in a large 
egg cell. If the mitotic spindle is 
mechanically pushed to one side of the 
cell with a glass bead, the membrane 
furrowing is incomplete, failing to occur 
on the opposite side of the cell. 
Subsequent cleavages occur not only at 
the midzone of each of the two 
subsequent mitotic spindles (yellow 
arrowheads), but also between the two 
adjacent asters that are not linked by a 
mitotic spindle— but in this abnormal cell 
share the same cytoplasm (red 
arrowhead). Apparently, the contractile 
ring that produces the cleavage furrow in 
these cells always forms in the region 
midway between two asters, suggesting 
that the asters somehow alter the 
adjacent region of cell cortex to induce 
furrow formation between them. 
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C. elegans, for example, treatments that result in the loss of astral microtubules 
lead to increased contractile activity throughout the cell cortex, consistent with 
this model. 

In some cell types, the site of ring assembly is chosen before mitosis. In bud- 
ding yeasts, for example, a ring of proteins called septins assembles in late Gi at 
the future division site. The septins are thought to form a scaffold onto which 
other components of the contractile ring, including myosin II, assemble. In 
plant cells, an organized band of microtubules and actin filaments, called the 
preprophase band, assembles just before mitosis and marks the site where the 
cell wall will assemble and divide the cell in two, as we now discuss. 



Figure 17-55 Localization of cytokinesis 
regulators at the central spindle of the 
human cell. (A) Fluorescence micrograph 
of a cultured human cell at the beginning 
of cytokinesis reveals the locations of the 
GTPase RhoA (red) and a protein called 
Cyk4 (green), which is one of several 
regulatory proteins that form complexes 
at the overlapping plus ends of interpolar 
microtubules. These proteins are thought 
to generate signals that help control 
RhoA activity at the cell cortex (see 
Figures 1 7-52 and 1 7-53B). (B) When the 
cell is cross-sectioned in the plane of the 
contractile ring as shown here, RhoA (red) 
forms a ring beneath the cell surface, 
while the central spindle protein Cyk4 
(green) is associated with microtubule 
bundles scattered throughout the 
equatorial plane of the cell. (Courtesy of 
Alisa Piekny and Michael Glotzer.) 



The Phragmoplast Guides Cytokinesis in Higher Plants 

In most animal cells, the inward movement of the cleavage furrow depends on 
an increase in the surface area of the plasma membrane. New membrane is 
added at the inner edge of the cleavage furrow and is generally provided by small 
membrane vesicles that are transported on microtubules from the Golgi appa- 
ratus to the furrow 

Membrane deposition is particularly important for cytokinesis in higher- 
plant cells. These cells are enclosed by a semirigid cell wall Rather than a con- 
tractile ring dividing the cytoplasm from the outside in, the cytoplasm of the 
plant cell is partitioned from the inside out by the construction of a new cell 
wall, called the cell plate, between the two daughter nuclei (Figure 17-56). The 
assembly of the cell plate begins in late anaphase and is guided by a structure 
called the phragmoplast, which contains microtubules derived from the mitotic 
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Figure 1 7-56 Cytokinesis in a plant cell 
in telophase. In this light micrograph, the 
early cell plate (between the two 
arrowheads) has formed in a plane 
perpendicular to the plane of the page. 
The microtubules of the spindle are 
stained with gold-labeled antibodies 
against tubulin, and the DNA in the two 
sets of daughter chromosomes is stained 
with a fluorescent dye. Note that there 
are no astral microtubules, because there 
are no centrosomes in higher-plant cells. 
(Courtesy of Andrew Bajer.) 
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Figure 1 7-57 The special features of cytokinesis in a higher plant cell. The division plane is established before M phase 
by a band of microtubules and actin filaments {the preprophase band) at the cell cortex. At the beginning of telophase, 
after the chromosomes have segregated, a new cell wall starts to assemble inside the cell at the equator of the old spindle. 
The interpolar microtubules of the mitotic spindle remaining at telophase form the phragmoplast. The plus ends of these 
microtubules no longer overlap but end at the cell equator. Golgi-derived vesicles, filled with cell-wall material, are 
transported along these microtubules and fuse to form the new cell wall, which grows outward to reach the plasma 
membrane and original cell wall. The plasma membrane and the membrane surrounding the new cell wall fuse, completely 
separating the two daughter cells. 

spindle. Motor proteins transport small vesicles along these microtubules from 
the Golgi apparatus to the cell center. These vesicles, filled with polysaccharide 
and glycoproteins required for the synthesis of the new cell wall, fuse to form a 
disclike, membrane-enclosed structure called the early cell plate. The plate 
expands outward by further vesicle fusion until it reaches the plasma membrane 
and the original cell wall and divides the cell in two. Later, cellulose microfibrils 
are laid down within the matrix of the cell plate to complete the construction of 
the new cell wall (Figure 17-57). 



Membrane-Enclosed Organelles Must Be Distributed to Daughter 
Cells During Cytokinesis 

The process of mitosis ensures that each daughter cell receives a full comple- 
ment of chromosomes. When a eucaryotic cell divides, however, each daughter 
cell must also inherit all of the other essential cell components, including the 
membrane-enclosed organelles. As discussed in Chapter 12, organelles such as 
mitochondria and chloroplasts cannot be assembled de novo from their indi- 
vidual components; they can arise only by the growth and division of the preex- 
isting organelles. Similarly, cells cannot make a new endoplasmic reticulum (ER) 
unless some part of it is already present. 

How, then, do the various membrane-enclosed organelles segregate when a 
cell divides? Organelles such as mitochondria and chloroplasts are usually pre- 
sent in large enough numbers to be safely inherited if, on average, their numbers 
roughly double once each cycle. The ER in interphase cells is continuous with 
the nuclear membrane and is organized by the microtubule cytoskeleton. Upon 
entry into M phase, the reorganization of the microtubules and breakdown of 
the nuclear envelope releases the ER. In most cells, the ER remains largely intact 
and is cut in two during cytokinesis. The Golgi apparatus is reorganized and 
fragmented during mitosis. Golgi fragments associate with the spindle poles and 
are thereby distributed to opposite ends of the spindle, ensuring that each 
daughter cell inherits the materials needed to reconstruct the Golgi in telophase. 
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Some Cells Reposition Their Spindle to Divide Asymmetrically 

Most animal cells divide symmetrically: the contractile ring forms around the 
equator of the parent cell, producing two daughter cells of equal size and with 
the same components. This symmetry results from the placement of the mitotic 
spindle, which in most cases tends to center itself in the cytoplasm. Astral 
microtubules and motor proteins that either push or pull on these microtubules 
contribute to the centering process. 

There are many instances in development, however, when cells divide asym- 
metrically to produce two cells that differ in size, in the cytoplasmic contents they 
inherit, or in both. Usually, the two different daughter cells are destined to 
develop along different pathways. To create daughter cells with different fates in 
this way, the mother cell must first segregate certain components (called cell fate 
determinants) to one side of the cell and then position the plane of division so 
that the appropriate daughter cell inherits these components (Figure 17-58). To 
position the plane of division asymmetrically, the spindle has to be moved in a 
controlled manner within the dividing cell. It seems likely that changes in local 
regions of the cell cortex direct such spindle movements and that motor proteins 
localized there pull one of the spindle poles, via its astral microtubules, to the 
appropriate region. Genetic analyses in C. elegans and Drosophilahave identified 
some of the proteins required for such asymmetric divisions (discussed in Chap- 
ter 22), and some of these proteins seem to have a similar role in vertebrates. 



Figure 17-58 An asymmetric cell 
division segregating cytoplasmic 
components to only one daughter cell. 

These light micrographs illustrate the 
controlled asymmetric segregation of 
specific cytoplasmic components to one 
daughter cell during the first division of a 
fertilized egg of the nematode C. elegans. 
The cells above have been stained with a 
blue, DNA-binding, fluorescent dye to 
show the nucleus (and polar bodies); 
they are viewed by both differential- 
interference-contrast and fluorescence 
microscopy. The cells below are the same 
cells stained with an antibody against 
P-granules and viewed by fluorescence 
microscopy. These small granules are 
made of RNA and proteins and determine 
which cells become the germ cells. They 
are distributed randomly throughout the 
cytoplasm of the unfertilized egg (not 
shown) but become segregated to the 
posterior pole of the fertilized egg, as 
shown on the left. The cleavage plane is 
oriented to ensure that only the posterior 
daughter cell receives the P-granules 
when the egg divides, as shown on the 
right. The same segregation process is 
repeated in several subsequent cell 
divisions, so that the P-granules end up 
only in the cells that give rise to eggs and 
sperm. (Courtesy of Susan Strome.) 



Mitosis Can Occur Without Cytokinesis 

Although nuclear division is usually followed by cytoplasmic division, there are 
exceptions. Some cells undergo multiple rounds of nuclear division without 
intervening cytoplasmic division. In the early Drosophila embryo, for example, 
the first 13 rounds of nuclear division occur without cytoplasmic division, 
resulting in the formation of a single large cell containing several thousand 
nuclei, arranged in a monolayer near the surface. A cell in which multiple nuclei 
share the same cytoplasm is called a syncytium. This arrangement greatly 
speeds up early development, as the cells do not have to take the time to go 
through all the steps of cytokinesis for each division. After these rapid nuclear 
divisions, membranes are created around each nucleus in one round of coordi- 
nated cytokinesis called cellularization. The plasma membrane extends inward 
and, with the help of an actin-myosin ring, pinches off to enclose each nucleus 
(Figure 17-59). <TTCT> 

Nuclear division without cytokinesis also occurs in some types of mam- 
malian cells. Megakaryocytes, which produce blood platelets, and some hepato- 
cytes and heart muscle cells, for example, become multinucleated in this way. 
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Figure 17-59 Mitosis without cytokinesis in the early Drosophila embryo. 

(A) The first 1 3 nuclear divisions occur synchronously and without 
cytoplasmic division to create a large syncytium. Most of the nuclei then 
migrate to the cortex, and the plasma membrane extends inward and 
pinches off to surround each nucleus to form individual cells in a process 
called cellularization. (B) Fluorescence micrograph of multiple mitotic 
spindles in a Drosophila embryo before cellularization. The microtubules 
are stained green and the centrosomes red. Note that all the nuclei go 
through the cycle synchronously; here, they are all in metaphase. 
(B, courtesy of Kristina Yu and William Sullivan.) 

After cytokinesis, most cells enter Gi, in which Cdks are mostly inactive. We 
end this section by discussing how this state is achieved at the end of M phase. 



The Gi Phase Is a Stable State of Cdk Inactivity 

A key regulatory event in late M phase is the inactivation of Cdks, which is driven 
primarily by APC/C-dependent cyclin destruction. As described earlier in this 
chapter, the inactivation of Cdks in late M phase has many functions: it triggers 
the events of late mitosis, promotes cytokinesis, and enables the synthesis of 
prereplicative complexes at DNA replication origins. It also provides a mecha- 
nism for resetting the cell-cycle control system to a state of Cdk inactivity as the 
cell prepares to enter a new cell cycle. In most cells, this state of Cdk inactivity 
generates a Gi gap phase, during which the cell grows and monitors its environ- 
ment before committing to a new division. 

In early animal embryos, the inactivation of M-Cdk in late mitosis is due 
almost entirely to the action of Cdc20-APC/C, discussed earlier. Recall, however, 
that M-Cdk stimulates Cdc20-APC/C activity. Thus, the destruction of M-cyclin 
in late mitosis soon leads to the inactivation of all APC/C activity in an embry- 
onic cell. This APC/C inactivation immediately after mitosis is especially useful 
in rapid embryonic cell cycles, as it allows the cell to quickly begin accumulat- 
ing new M-cyclin for the next cycle (Figure 17-60A). 
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Figure 1 7-60 The creation of a G, phase 
by stable Cdk inhibition after mitosis. 

(A) In early embryonic cell cycles, 
Cdc20-APC/C activity rises at the end of 
metaphase, triggering M-cyclin 
destruction. Because M-Cdk activity 
stimulates Cdc20-APC/C activity, the loss 
of M-cyclin leads to APC/C inactivation 
after mitosis, which allows M-cyclins to 
begin accumulating again. (B) In cells 
containing a Gi phase, the drop in M-Cdk 
activity in late mitosis leads to the 
activation of Cdhl -APC/C (as well as to 
the accumulation of Cdk inhibitor 
proteins; not shown). This ensures a 
continued suppression of Cdk activity 
after mitosis, as required for a Gi phase. 
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Rapid cyclin accumulation immediately after mitosis is not useful, however, 
for cells with cell cycles containing a Gi phase. These cells employ several mech- 
anisms to prevent Cdk reactivation after mitosis. One mechanism uses another 
APC/C-activating protein called Cdhl, a close relative of Cdc20. Although both 
Cdhl and Cdc20 bind to and activate the APC/C, they differ in one important 
respect. Whereas M-Cdk activates the Cdc20-APC/C complex, it inhibits the 
Cdhl-APC/C complex by directly phosphorylating Cdhl. As a result of this rela- 
tionship, Cdhl-APC/C activity increases in late mitosis after the Cdc20-APC/C 
complex has initiated the destruction of M-cyclin. M-cyclin destruction there- 
fore continues after mitosis: although Cdc20-APC/C activity has declined, 
Cdhl-APC/C activity is high (Figure 17-60B). 

A second mechanism that suppresses Cdk activity in Gi depends on the 
increased production of CKIs, the Cdk inhibitory proteins discussed earlier. 
Budding yeast cells, in which this mechanism is best understood, contain a CKI 
protein called Sicl, which binds to and inactivates M-Cdk in late mitosis and Gi. 
Like Cdhl, Sicl is inhibited by M-Cdk, which phosphorylates Sicl during mito- 
sis and thereby promotes its ubiquitylation by SCF. Thus, Sicl and M-Cdk, like 
Cdhl and M-Cdk, inhibit each other. As a result, the decline in M-Cdk activity 
that occurs in late mitosis causes the Sicl protein to accumulate, and this CKI 
helps keep M-Cdk activity low after mitosis. A CKI protein called p27 (see Figure 
17-19/17-19) may serve similar functions in animal cells. 

In most cells, decreased transcription of M-cyclin genes also inactivates M- 
Cdks in late mitosis. In budding yeast, for example, M-Cdk promotes the expres- 
sion of these genes, resulting in a positive feedback loop. This loop is turned off 
as cells exit from mitosis: the inactivation of M-Cdk by Cdhl and Sicl leads to 
decreased M-cyclin gene transcription and thus decreased M-cyclin synthesis. 
Gene regulatory proteins that promote the expression of Gi/S- and S-cyclins are 
also inhibited during Gi. 

Thus, Cdhl-APC/C activation, CKI accumulation, and decreased cyclin gene 
expression act together to ensure that the early Gi phase is a time when essentially 
all Cdk activity is suppressed. As in many other aspects of cell-cycle control, the 
use of multiple regulatory mechanisms makes the suppression system robust, so 
that it still operates with reasonable efficiency even if one mechanism fails. So how 
does the cell escape from this stable Gi state to initiate a new cell cycle? The 
answer is that Gi/S-Cdk activity, which rises in late Gi, releases all the braking 
mechanisms that suppress Cdk activity, as we describe in the next section. 

Summary 

After mitosis completes the formation of a pair of daughter nuclei, cytokinesis finishes 
the cell cycle by dividing the cell itself Cytokinesis depends on a ring of actin and 
myosin that contracts in late mitosis at a site midway between the segregated chromo- 
somes. In animal cells, the positioning of the contractile ring is determined by signals 
emanating from the microtubules of the anaphase spindle. Dephosphorylation of Cdk 
targets, which results from Cdk inactivation in anaphase, triggers cytokinesis at the 
correct time after anaphase. After cytokinesis, the cell enters a stable Gi state of low Cdk 
activity, where it awaits signals to enter a new cell cycle. 

CONTROL OF CELL DIVISION AND CELL GROWTH 

A fertilized mouse egg and a fertilized human egg are similar in size, yet they 
produce animals of very different sizes. What factors in the control of cell behav- 
ior in humans and mice are responsible for these size differences? The same fun- 
damental question can be asked for each organ and tissue in an animal's body. 
What factors in the control of cell behavior explain the length of an elephant's 
trunk or the size of its brain or its liver? These questions are largely unanswered, 
at least in part because they have received relatively little attention compared 
with other questions in cell and developmental biology. It is nevertheless possi- 
ble to say what the ingredients of an answer must be. 
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The size of an organ or organism depends mainly on its total cell mass, 
which depends on both the total number of cells and the size of the cells. Cell 
number, in turn, depends on the amounts of cell division and cell death. Organ 
and body size are therefore determined by three fundamental processes: cell 
growth, cell division, and cell death. Each is tightly regulated— both by intracel- 
lular programs and by extracellular signal molecules that control these pro- 
grams. 

The extracellular signal molecules that regulate cell size and cell number are 
generally soluble secreted proteins, proteins bound to the surface of cells, or 
components of the extracellular matrix. They can be divided operationally into 
three major classes: 
1. Mitogens, which stimulate cell division, primarily by triggering a wave of 
Gi/S-Cdk activity that relieves intracellular negative controls that other- 
wise block progress through the cell cycle. 

Growth factors, which stimulate cell growth (an increase in cell mass) by 
promoting the synthesis of proteins and other macromolecules and by 
inhibiting their degradation. 

Survival factors, which promote cell survival by suppressing the form of 
programmed cell death known as apoptosis. 
Many extracellular signal molecules promote all of these processes, while 
others promote one or two of them. Indeed, the term growth factor is often used 
inappropriately to describe a factor that has any of these activities. Even worse, 
the term cell growth is often used to mean an increase in cell number, or cell pro- 
liferation. 

In addition to these three classes of stimulating signals, there are extracellu- 
lar signal molecules that suppress cell proliferation, cell growth, or both; in gen- 
eral, less is known about them. There are also extracellular signal molecules that 
activate apoptosis. 

In this section, we focus primarily on how mitogens and other factors, such 
as DNA damage, control the rate of cell division. We then turn to the important 
but poorly understood problem of how a proliferating cell coordinates its growth 
with cell division so as to maintain its appropriate size. We discuss the control of 
cell survival and cell death by apoptosis in Chapter 18. 



Mitogens Stimulate Cell Division 

Unicellular organisms tend to grow and divide as fast as they can, and their rate 
of proliferation depends largely on the availability of nutrients in the environ- 
ment. The cells of a multicellular organism, however, divide only when the 
organism needs more cells, Thus, for an animal cell to proliferate, it must receive 
stimulatory extracellular signals, in the form of mitogens, from other cells, usu- 
ally its neighbors, Mitogens overcome intracellular braking mechanisms that 
block progress through the cell cycle 

One ol : the first mitogens to be identified was platelet -derived growth factor 
(PDGF), and it is typical of many others discovered since. The path to its isola- 
tion began with the observation that fibroblasts in a culture dish proliferate 
when provided with serum but not when provided with plasma. Plasma is pre- 
pared by removing the cells from blood without allowing clotting to occur; 
serum is prepared by allowing blood to clot and taking the cell-free liquid that 
remains. When blood clots, platelets incorporated in the clot are stimulated to 
release the contents of their secretory vesicles (Figure 17-01). The superior abil- 
ity of serum to support cell proliferation suggested that platelets contain one or 
more mitogens. This hypothesis was confirmed by showing that extracts of 
platelets could serve instead of serum to stimulate fibroblast proliferation. The 
crucial factor in the extracts was shown to be a protein, which was subsequently 
purified and named PDGE In the body PDGF liberated from blood clots helps 
stimulate cell division during wound healing. 

PDGF is only one of over 50 proteins that are known to act as mitogens. Most 
of these proteins have a broad specificity. PDGF, for example, can stimulate 
many types of cells to divide, including fibroblasts, smooth muscle cells, and 
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Figure 17-61 A platelet. Platelets are 
miniature cells without a nucleus. They 
circulate in the blood and help stimulate 
blood clotting at sites of tissue damage, 
thereby preventing excessive bleeding. 
They also release various factors that 
stimulate healing. The platelet shown 
here has been cut in half to show its 
secretory vesicles, some of which contain 
platelet-derived growth factor (PDGF). 
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neuroglial cells. Similarly, epidermal growth factor {EGF} acts not only on epi- 
dermal cells but also on many other cell types, including both epithelial and 
nonepithelial cells. Some mitogens, however, have a narrow specificity; erythro- 
poietin, for example, only induces the proliferation of red blood cell precursors. 
Many mitogens, including PDGF, also have other actions beside the stimulation 
of cell division: they can stimulate cell growth, survival, differentiation, or 
migration, depending on the circumstances and the cell type. 

In some tissues, inhibitory extracellular signal proteins oppose the positive 
regulators and thereby inhibit organ growth. The best-understood inhibitory 
signal proteins are TGF(i and its relatives. TGF(3 inhibits the proliferation of sev- 
eral cell types, either by blocking cell-cycle progression in Gi or by stimulating 
apoptosis. 

Cells Can Delay Division by Entering a Specialized Nondividing 
State 

In the absence of a mitogenic signal to proliferate, Cdk inhibition in Gi is main- 
tained by the multiple mechanisms discussed earlier, and progression into a new 
cell cycle is blocked. In some cases, cells partly disassemble their cell-cycle con- 
trol system and exit from the cycle to a specialized, nondividing state called Go. 

Most cells in our body are in Go, but the molecular basis and reversibility of 
this state vary in different cell types. Most of our neurons and skeletal muscle cells, 
for example, are in a terminally differentiated Go state, in which their cell-cycle 
control system is completely dismantled: the expression of the genes encoding 
various Cdks and cyclins are permanently turned off, and cell division rarely 
occurs. Other cell types withdraw from the cell cycle only transiently and retain 
the ability to reassemble the cell-cycle control system quickly and reenter the 
cycle. Most liver cells, for example, are in Go, but they can be stimulated to divide 
if the liver is damaged. Still other types of cells, including fibroblasts and some 
lymphocytes, withdraw from and re-enter the cell cycle repeatedly throughout 
their lifetime. 

Almost all the variation in cell-cycle length in the adult body occurs during 
the time the cell spends in Gi or G . By contrast, the time a cell takes to progress 
from the beginning of S phase through mitosis is usually brief (typically 12-24 
hours in mammals) and relatively constant, regardless of the interval from one 
division to the next. 

Mitogens Stimulate G r Cdk and G^S-Cdk Activities 

For the vast majority of animal cells, mitogens control the rate of cell division by 
acting in the Gi phase of the cell cycle. As discussed earlier, multiple mechanisms 
act during Gi to suppress Cdk activity and thereby block entry into S phase. Mito- 
gens release these brakes on Cdk activity, thereby allowing S phase to begin. 

As we discuss in Chapter 15, mitogens interact with cell-surface receptors to 
trigger multiple intracellular signaling pathways. One major pathway acts 
through the small GTPase Ras, which leads to the activation of a MAP kinase cas- 
cade. This leads to an increase in the production of gene regulatory proteins, 
including Myc. Myc is thought to promote cell-cycle entry by several mecha- 
nisms, one of which is to increase the expression of genes encoding Gi cyclins 
(D cyclins), thereby increasing Gi-Cdk (cyclin D-Cdk4) activity. As we discuss 
later, Myc also has a major role in stimulating the transcription of genes that 
increase cell growth. 

The key function of Gi-Cdk complexes in animal cells is to activate a group 
of gene regulatory factors called the E2F proteins, which bind to specific DNA 
sequences in the promoters of a wide variety of genes that encode proteins 
required for S-phase entry, including Gi/S-cyclins, S-cyclins, and proteins 
involved in DNA synthesis and chromosome duplication. In the absence of 
mitogenic stimulation, E2F-dependent gene expression is inhibited by an inter- 
action between E2F and members of the retinoblastoma protein (Rb) family. 
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Figure 17-62 Mechanisms controlling 
cell-cycle entry and S-phase initiation in 
animal cells. As discussed in Chapter 15, 
mitogens bind to cell-surface receptors 
to initiate intracellular signaling 
pathways. One of the major pathways 
involves activation of the small GTPase 
Ras, which activates a MAP kinase 
cascade, leading to increased expression 
of numerous immediate early genes, 
including the gene encoding the gene 
regulatory protein Myc. Myc increases the 
expression of many delayed-response 
genes, including some that lead to 
increased Gi-Cdk activity (cyclin D-Cdk4), 
which triggers the phosphorylation of 
members of the Rb family of proteins. 
This inactivates the Rb proteins, freeing 
the gene regulatory protein E2F to 
activate the transcription of Gi/S genes, 
including the genes for a Gi/S-cyclin 
(cyclin E) and S-cyclin (cyclin A). The 
resulting Gi/S-Cdk and S-Cdk activities 
further enhance Rb protein 
phosphorylation, forming a positive 
feedback loop. E2F proteins also 
stimulate the transcription of their own 
genes, forming another positive 
feedback loop. 



When cells are stimulated to divide by mitogens, active Gi-Cdk accumulates and 
phosphorylates Rb family members, reducing their binding to E2K The liberated 
K2F proteins then activate expression of their target genes (Figure 17-62). 

This transcriptional control system, like so many other control systems that 
regulate the cell cycle, includes feedback loops that sharpen the d /S transition. 
The liberated E2F proteins, for example, increase the transcription of their own 
genes. In addition, E2F-dependent transcription of G] /S-cyclin (cyclin EJ and S- 
cyclin (cyclin A) genes leads to increased G 3 /S-Cdk and S-Cdk activities, which 
in turn increase Rb protein phosphorylation and promote further E2F release 
(see Figure 17-62). 

The central member of the Rb family, the Rb protein itself, was identified 
originally through studies of an inherited form of eye cancer in children, known 
as retinoblastoma (discussed in Chapter 20). The loss of both copies of the Rb 
gene leads to excessive cell proliferation in the developing retina, suggesting that 
the Rb protein is particularly important for restraining cell division in this tissue. 
The complete loss of Rb does not immediately cause increased proliferation of 
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retinal or other cell types, in part because Cdhl and CKIs also help inhibit pro- 
gression through Gi and in part because other cell types contain Rb-related pro- 
teins that provide backup support in the absence of Rb. It is also likely that other 
proteins, unrelated to Rb, help to regulate the activity of E2F. 

Additional layers of control promote an overwhelming increase in S-Cdk 
activity at the beginning of S phase. We mentioned earlier that the APC/C acti- 
vator Cdhl suppresses cyclin levels after mitosis. In animal cells, however, Gl- 
and Gi/S-cyclins are resistant to Cdhl and can therefore act unopposed by the 
APC/C to promote Rb protein phosphorylation and E2F-dependent gene 
expression. S-cyclin, by contrast, is not resistant to Cdhl, and its level is initially 
restrained by Cdhl-APC/C activity. However, Gi/S-Cdk also phosphorylates and 
inactivates Cdhl-APC/C, thereby allowing the accumulation of S-cyclin, further 
promoting S-Cdk activation. Gi/S-Cdk also inactivates CKI proteins that sup- 
press S-Cdk activity. The overall effect of all these interactions is the rapid and 
complete activation of the S-Cdk complexes required for S-phase initiation. 

DNA Damage Blocks Cell Division: The DNA Damage Response 

Progression through the cell cycle, and thus the rate of cell proliferation, is con- 
trolled not only by extracellular mitogens but also by other extracellular and 
intracellular mechanisms. One of the most important influences is DNA dam- 
age, which can occur as a result of spontaneous chemical reactions in DNA, 
errors in DNA replication, or exposure to radiation or certain chemicals. It is 
essential that the cell repair damaged chromosomes before attempting to dupli- 
cate or segregate them. The cell-cycle control system can readily detect DNA 
damage and arrest the cycle at either of two checkpoints — one at Start in late Gi, 
which prevents entry into the cell cycle and into S phase, and one at the G2/M 
checkpoint, which prevents entry into mitosis (see Figure 17-21). 

DNA damage initiates a signaling pathway by activating one of a pair of 
related protein kinases called ATM and ATR, which associate with the site of 
damage and phosphorylate various target proteins, including two other protein 
kinases called Chkl and Chk2. Together these various kinases phosphorylate 
other target proteins that lead to cell-cycle arrest. A major target is the gene reg- 
ulatory protein p53, which stimulates transcription of the gene encoding a CKI 
protein called p21; this protein binds to Gi/S-Cdk and S-Cdk complexes and 
inhibits their activities, thereby helping to block entry into the cell cycle (Figure 
17-63). <TGAA> 

DNA damage activates p53 by an indirect mechanism. In undamaged cells, 
p53 is highly unstable and is present at very low concentrations. This is largely 
because it interacts with another protein, Mdm2, which acts as a ubiquitin lig- 
ase that targets p53 for destruction by proteasomes. Phosphorylation of p53 
after DNA damage reduces its binding to Mdm2. This decreases p53 degrada- 
tion, which results in a marked increase in p53 concentration in the cell. In addi- 
tion, the decreased binding to Mdm2 enhances the ability of p53 to stimulate 
gene transcription (see Figure 17-63). 

The protein kinases Chkl and Chk2 also block cell cycle progression by 
phosphorylating members of the Cdc25 family of protein phosphatases, thereby 
inhibiting their function. As described earlier, these kinases are particularly 
important in the activation of M-Cdk at the beginning of mitosis (see Figure 
17-25). Thus, the inhibition of Cdc25 activity by DNA damage helps block entry 
into mitosis (see Figure 17-21). 

The DNA-damage response also detects problems that arise when a replica- 
tion fork fails during DNA replication. When nucleotides are depleted, for exam- 
ple, replication forks stall during the elongation phase of DNA synthesis. To pre- 
vent the cell from attempting to segregate partially replicated chromosomes, the 
same mechanisms that respond to DNA damage detect the stalled replication 
forks and block entry into mitosis until the problems at the replication fork are 
resolved. 

The DNA damage response is not essential for normal cell division if envi- 
ronmental conditions are ideal. Conditions are rarely ideal, however: a low level 
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of DNA damage occurs in the normal life of any ceD, and this damage accumu- 
lates in the cells progeny if the damage response is not functioning. Over the 
long term, the accumulation of genetic damage in cells lacking the DNA damage 
response leads to an increased frequency of cancer-promoting mutations. 
Indeed, mutations in the p53 gene occur in at leasi half of all human cancers 
(discussed in Chapter 20). This loss of p53 function allows the cancer cell to 
accumulate mutations more readily. Similarly, a rare genetic disease known as 
ataxia telangiectasia is caused by a defect in ATM, one of the protein kinases that 
is activated in response to x-ray-induced DNA damage; patients with this dis- 
ease are very sensitive to x-rays and suffer from increased rates of cancer. 

What happens if DNA damage is so severe that repair is not possible? The 
answer differs in different organisms. Unicellular organisms such as budding 
yeast transiently arrest their ceil cycle to try to repair the damage, but the cycle 
resumes even if the repair cannot be completed. For a single-celled organism, 
life with mutations is apparently better than no life at aJU In multicellular organ- 
isms, however, the health of the organism takes precedence over the life of an 
individual cell. Cells that divide with severe DNA damage threaten the life of the 
organism, since genetic damage can often lead to cancer and other diseases. 
Thus, animal cells with severe DNA damage do not attempt to continue division, 
but instead commit suicide by undergoing apoptosis. Thus, unless the DNA 
damage is repaired, the DNA damage response can lead to either cell-cycle 
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Figure 1 7-63 How DNA damage arrests 
the cell cycle in Gi. When DNA is 
damaged, various protein kinases are 
recruited to the site of damage and 
initiate a signaling pathway that causes 
cell-cycle arrest. The first kinase at the 
damage site is either ATM or ATR, 
depending on the type of damage. 
Additional protein kinases, called Chkl 
and Chk2, are then recruited and 
activated, resulting in the 
phosphorylation of the gene regulatory 
protein p53. Mdm2 normally binds to 
p53 and promotes its ubiquitylation and 
destruction in proteasomes. 
Phosphorylation of p53 blocks its binding 
to Mdm2; as a result, p53 accumulates to 
high levels and stimulates transcription 
of the gene that encodes the CKI protein 
p21.Thep21 binds and inactivates 
Gi/S-Cdk and S-Cdk complexes, arresting 
the cell in G-|. In some cases, DNA 
damage also induces either the 
phosphorylation of Mdm2 or a decrease 
in Mdm2 production, which causes a 
further increase in p53 (not shown). 
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arrest or cell death. As we discuss in the next chapter, DNA damage-induced 
apoptosis often depends on the activation of p53. Indeed, it is this apoptosis- 
promoting function of p53 that is apparently most important in protecting us 
against cancer. 

Many Human Cells Have a Built-in Limitation on the Number of 
Times They Can Divide 

Many human cells divide a limited number of times before they stop and 
undergo a permanent cell-cycle arrest. Fibroblasts taken from normal human tis- 
sue, for example, go through only about 25-50 population doublings when cul- 
tured in a standard mitogenic medium. Toward the end of this time, proliferation 
slows down and finally halts, and the cells enter a nondividing state from which 
they never recover. This phenomenon is called replicative cell senescence, 
although it is unlikely to be responsible for the senescence (aging) of the organ- 
ism. Organism senescence is thought to depend, in part, on progressive oxidative 
damage to long-lived macromolecules, as strategies that reduce metabolism 
(such as reduced food intake), and thereby reduce the production of reactive oxy- 
gen species, can extend the lifespan of experimental animals. 

Replicative cell senescence in human fibroblasts seems to be caused by 
changes in the structure of the telomeres, the repetitive DNA sequences and 
associated proteins at the ends of chromosomes. As discussed in Chapter 5, 
when a cell divides, telomeric DNA sequences are not replicated in the same 
manner as the rest of the genome but instead are synthesized by the enzyme 
telomerase. Telomerase also promotes the formation of protein cap structures 
that protect the chromosome ends. Because human fibroblasts, and many other 
human somatic cells, are deficient in telomerase, their telomeres become 
shorter with every cell division, and their protective protein caps progressively 
deteriorate. Eventually, the exposed chromosome ends are sensed as DNA dam- 
age, which activates a p53-dependent cell-cycle arrest that resembles the arrest 
caused by other types of DNA damage (see Figure 17-63). Rodent cells, by con- 
trast, maintain telomerase activity when they proliferate in culture and therefore 
do not have such a telomere-dependent mechanism for limiting proliferation. 
The forced expression of telomerase in normal human fibroblasts, using genetic 
engineering techniques, blocks this form of senescence. Unfortunately, most 
cancer cells have regained the ability to produce telomerase and therefore main- 
tain telomere function as they proliferate; as a result, they do not undergo 
replicative cell senescence. 

Abnormal Proliferation Signals Cause Cell-Cycle Arrest or 
Apoptosis, Except in Cancer Cells 

Many of the components of mitogenic signaling pathways are encoded by genes 
that were originally identified as cancer-promoting genes, or oncogenes, because 
mutations in them contribute to the development of cancer. The mutation of a 
single amino acid in the small GTPase Ras, for example, causes the protein to 
become permanently overactive, leading to constant stimulation of Ras-depen- 
dent signaling pathways, even in the absence of mitogenic stimulation. Simi- 
larly, mutations that cause an overexpression of Myc stimulate excessive cell 
growth and proliferation and thereby promote the development of cancer. 

Surprisingly, however, when a hyperactivated form of Ras or Myc is experi- 
mentally overproduced in most normal cells, the result is not excessive prolifer- 
ation but the opposite: the cells undergo either cell-cycle arrest or apoptosis. 
The normal cell seems able to detect abnormal mitogenic stimulation, and it 
responds by preventing further division. Such responses help prevent the sur- 
vival and proliferation of cells with various cancer-promoting mutations. 

Although it is not known how a cell detects excessive mitogenic stimulation, 
such stimulation often leads to the production of a cell-cycle inhibitor protein 
called Arf, which binds and inhibits Mdm2. As discussed earlier, Mdm2 normally 
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Figure 1 7-64 Cell-cycle arrest or 
apoptosis induced by excessive 
stimulation of mitogenic pathways. 

Abnormally high levels of Myc cause the 
activation of Arf, which binds and inhibits 
Mdm2 and thereby increases p53 levels 
(see Figure 17-60). Depending on the cell 
type and extracellular conditions, p53 
then causes either cell-cycle arrest or 
apoptosis. 



promotes p53 degradation. Activation of Arf therefore causes p53 levels to 
increase, inducing either cell-cycle arrest or apoptosis (Figure 17-64). 

How do cancer cells ever arise if these mechanisms block the division or 
survival of mutant cells with overactive proliferation signals? The answer is that 
the protective system is often inactivated in cancer cells by mutations in the 
genes that encode essential components of the checkpoint responses, such as 
Arf or p53 or the proteins that help activate them. 



Organism and Organ Growth Depend on Cell Growth 

For an organism or organ to grow, cell division is not enough. If cells proliferated 
without growing, they would get progressively smaller and there would be no net 
increase in total cell mass. In most proliferating cell populations, therefore, cell 
growth accompanies cell division. In single-celled organisms such as yeasts, both 
cell growth and cell division require only nutrients. In animals, by contrast, both 
cell growth and cell proliferation depend on extracellular signal molecules, pro- 
duced by other cells, which we call growth factors and mitogens, respectively. 

Like mitogens, the extracellular growth factors that stimulate animal cell 
growth bind to receptors on the cell surface and activate intracellular signaling 
pathways. These pathways stimulate the accumulation of proteins and other 
macromolecules, and they do so by both increasing their rate of synthesis and 
decreasing their rate of degradation. They also trigger increased uptake of nutri- 
ents and production of the ATP required to fuel increased protein synthesis. One 
of the most important intracellular signaling pathways activated by growth factor 
receptors involves the enzyme PI 3 -kinase, which adds a phosphate from ATP to 
the 3 position of inositol phospholipids in the plasma membrane. As discussed in 
Chapter 15, the activation of PI 3-kinase leads to the activation of a kinase called 
TOR, which lies at the heart of growth regulatory pathways in all eucaryotes. TOR 
activates many targets in the cell that stimulate metabolic processes and increase 
protein synthesis. One target is a protein kinase called S6 kinase (S6K), which 
phosphorylates ribosomal protein S6, increasing the ability of ribosomes to trans- 
late a subset of mRNAs that mostly encode ribosomal components. TOR also indi- 
rectly activates a translation initiation factor called eIF4E and directly activates 
gene regulatory proteins that promote the increased expression of genes encoding 
ribosomal subunits (Figure 17-65). 



Proliferating Cells Usually Coordinate Their Growth and Division 

For proliferating cells to maintain a constant size, they must coordinate their 
growth with cell division to ensure that cell size doubles with each division: if 
cells grow too slowly, they will get smaller with each division, and if they grow 
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Figure 17-65 Stimulation of cell growth 
by extracellular growth factors and 
nutrients. As discussed in Chapter 15, the 
occupation of cell-surface receptors by 
growth factors leads to the activation of 
PI 3-kinase, which promotes protein 
synthesis through a complex signaling 
pathway that leads to the activation of 
the protein kinase TOR; extracellular 
nutrients such as amino acids also help 
activate TOR by an unknown pathway. 
TOR employs multiple mechanisms to 
stimulate protein synthesis, as shown; it 
also inhibits protein degradation (not 
shown). Growth factors also stimulate 
increased production of the gene 
regulatory protein Myc (not shown), 
which activates the transcription of 
various genes that promote cell 
metabolism and growth. 4E-BP is an 
inhibitor of the translation initiation 
factor elF4E. 



PROTEIN SYNTHESIS AND CELL GROWTH 



too fast, they will get larger with each division. It is not clear how cells achieve 
this coordination, but it is likely to involve multiple mechanisms that vary in dif- 
ferent organisms and even in different cell types of the same organism (Figure 
17-66). 

Animal cell growth and division are not always coordinated, however. In 
many cases, they are completely uncoupled to allow growth without division or 
division without growth. Muscle cells and nerve cells, for example, can grow dra- 
matically after they have permanently withdrawn from the cell cycle. Similarly, 
the eggs of many animals grow to an extremely large size without dividing; after 
fertilization, however, this relationship is reversed, and many rounds of division 
occur without growth (see Figure 17-9). 

Compared to cell division, there has been surprisingly little study of how cell 
size is controlled in animals. As a result, it remains a mystery how cell size is 
determined and why different cell types in the same animal grow to be so differ- 
ent in size (Figure 17-67). One of the best-understood cases in mammals is the 
adult sympathetic neuron, which has permanently withdrawn from the cell 
cycle. Its size depends on the amount of nerve growth factor (NGF) secreted by 
the target cells it innervates; the greater the amount of NGF the neuron has 
access to, the larger it becomes. It seems likely that the genes a cell expresses set 
limits on the size it can be, while extracellular signal molecules and nutrients 
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Figure 1 7-66 Potential mechanisms for coordinating cell growth and division. In proliferating cells, cell size is 
maintained by mechanisms that coordinate rates of cell division and cell growth. Numerous alternative coupling 
mechanisms are thought to exist, and different cell types appear to employ different combinations of these 
mechanisms. (A) In many cell types— particularly yeast— the rate of cell division is governed by the rate of cell 
growth, so that division occurs only when growth rate achieves some minimal threshold; in yeasts, it is mainly the 
levels of extracellular nutrients that regulate the rate of cell growth and thereby the rate of cell division. (B) In 
some animal cell types, growth and division can each be controlled by separate extracellular factors (growth 
factors and mitogens, respectively), and cell size depends on the relative levels of the two types of factors. 
(C) Some extracellular factors can stimulate both cell growth and cell division by simultaneously activating 
signaling pathways that promote growth and other pathways that promote cell-cycle progression. 
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Figure 1 7-67 The size difference between a neuron (from the retina) and 
a lymphocyte in a mammal. Both cells contain the same amount of DNA. 
A neuron grows progressively larger after it has permanently withdrawn 
from the cell cycle. During this time, the ratio of cytoplasm to DNA 
increases enormously (by a factor of more than 1 5 for some neurons). 
(Neuron from B.B. Boycott, in Essays on the Nervous System [R. Bellairs and 
E.G. Gray, eds]. Oxford, UK: Clarendon Press, 1974.) 



regulate the size within these limits. The challenge is to identify the relevant 
genes and signal molecules for each cell type. 



Neighboring Cells Compete for Extracellular Signal Proteins 

When most types of mammalian cells are cultured in a dish in the presence of 
serum, they adhere to the bottom of the dish, spread out, and divide until they 
form a confluent monolayer. Each cell is attached to the dish and contacts its 
neighbors on all sides. At this point, normal cells, unlike cancer cells, stop prolif- 
erating—a phenomenon known as density- dependent inhibition of cell division. 
This phenomenon was originally described in terms of "contact inhibition" of cell 
division, but it is unlikely that cell-cell contact interactions are solely responsible. 
The cell population density at which cell proliferation ceases in the confluent 
monolayer increases as the concentration of serum in the medium increases. 
Moreover, if a stream of fresh culture medium is passed over a confluent layer of 
fibroblasts to increase the supply of mitogens, the cells under the stream are 
induced to divide (Figure 17-68). Thus, density-dependent inhibition of cell pro- 
liferation seems to reflect, in part at least, the ability of a cell to deplete the 
medium around itself of extracellular mitogens, thereby depriving its neighbors. 

This type of competition could be important for cells in tissues as well as in 
culture, because it prevents them from proliferating beyond a certain population 
density, determined by the available amounts of mitogens, growth factors, and 
survival factors. The amounts of these factors in tissues are usually limiting, in 
that increasing their amounts results in an increase in cell number, cell size, or 
both. Thus, the amounts of these factors in tissues have important roles in deter- 
mining cell size and number, and possibly the final size of the organ or tissue. 

The overall size of a tissue may also be governed in some cases by extracel- 
lular inhibitory factors. Myostatin, for example, is a TGF(3 family member that 
normally inhibits the proliferation of myoblasts that fuse to form skeletal mus- 
cle cells. When the gene that encodes myostatin is deleted in mice, muscles grow 
to be several times larger than normal. Remarkably, two breeds of cattle that 
were bred for large muscles have both turned out to have mutations in the gene 
encoding myostatin (Figure 17-69). 
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Figure 1 7-68 The effect of fresh medium on a confluent cell monolayer. Cells in a confluent monolayer 
do not divide (gray). The cells resume dividing (green) when exposed directly to fresh culture medium. 
Apparently, in the undisturbed confluent monolayer, proliferation has halted because the medium close to 
the cells is depleted of mitogens, for which the cells compete. 
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Figure 1 7-69 The effects of a myostatin 
mutation on muscle size. The mutation 
leads to a great increase in the mass of 
muscle tissue, as illustrated in this Belgian 
Blue bull. The Belgian Blue was produced 
by cattle breeders and was only recently 
found to have a mutation in the 
Myostatin gene. (From HI. Sweeney, 
Sci. Am. 291 :62, 2004. With permission 
from Scientific American.) 



Animals Control Total Cell Mass by Unknown Mechanisms 

The size of an animal or one of its organs depends largely on the number and 
size of the cells it contains — that is, on total cell mass. Remarkably, animals can 
somehow assess the total cell mass in a tissue or organ and regulate it: in many 
circumstances, for example, if cell size is experimentally increased or decreased 
in an organ, cell numbers adjust to maintain a normal organ size. This has been 
most dramatically illustrated by experiments in salamanders, in which cell size 
was manipulated by altering cell ploidy (in all organisms, the size of a cell is pro- 
portional to its ploidy, or genome content). Salamanders of different ploidies are 
the same size but have different numbers of cells. Individual cells in a penta- 
ploid salamander are about five times the volume of those in a haploid sala- 
mander, and in each organ the pentaploids have only one-fifth as many cells as 
their haploid cousins, so that the organs are about the same size in the two ani- 
mals (Figure 17-70 and Figure 17-71). Evidently, in this case (and in many oth- 
ers) the size of organs and organisms depends on mechanisms that can some- 
how measure total cell mass. How animals measure and adjust total mass 
remains a mystery, however. 

The development of limbs and organs of specific size and shape depends on 
complex positional controls, as well as on local concentrations of extracellular 
signal proteins that stimulate or inhibit cell growth, division, and survival. As we 
discuss in Chapter 22, we now know many of the genes that help pattern these 
processes in the embryo. A great deal remains to be learned, however, about how 
these genes regulate cell growth, division, survival, and differentiation to gener- 
ate a complex organism. 

The controls that govern these processes in an adult body are also poorly 
understood. When a skin wound heals in a vertebrate, for example, about a 
dozen cell types, ranging from fibroblasts to Schwann cells, must be regenerated 
in appropriate numbers, sizes, and positions to reconstruct the lost tissue. The 
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Figure 1 7-70 Sections of kidney tubules 
from salamander larvae of different 
ploidies. In all organisms, from bacteria to 
humans, cell size is proportional to ploidy. 
Pentaploid salamanders, for example, have 
cells that are much larger than those of 
haploid salamanders. The animals and 
their individual organs, however, are the 
same size because each tissue in the 
pentaploid animal contains fewer cells. 
This indicates that the size of an organism 
or organ is not controlled simply by 
counting cell divisions or cell numbers; 
total cell mass must somehow be 
regulated. (Adapted from G. Fankhauser, in 
Analysis of Development [B.H. Willier, 
P.A. Weiss, and V. Hamburger, eds.], 
pp. 126-150. Philadelphia: Saunders, 1955.) 
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Figure 17-71 The hindbrain in a haploid and in a tetraploid salamander. 

(A) This light micrograph shows a cross section of the hindbrain of a 
haploid salamander. (B) A corresponding cross section of the hindbrain of a 
tetraploid salamander, revealing how reduced cell numbers compensate 
for increased cell size, so that the overall size of the hindbrain is the same 
in the two animals. (From G. Fankhauser, /nf. flev. Cyfo/. 1:165-193, 1952. 
With permission from Elsevier.) 



mechanisms that control cell growth and proliferation in tissues are likewise 
central to understanding cancer, a disease in which the controls go wrong, as 
discussed in Chapter 20. 



Summary 

In multicellular animals, cell size, cell division, and cell death are carefully controlled 
to ensure that the organism and its organs achieve and maintain an appropriate size. 
Mitogens stimulate the rate of cell division by removing intracellular molecular 
brakes that restrain cell-cycle progression in G } . Growth factors promote cell growth 
(an increase in cell mass) by stimulating the synthesis and inhibiting the degradation 
of macromolecules. For proliferating cells to maintain a constant cell size, they 
employ multiple mechanisms to ensure that cell growth is coordinated with cell divi- 
sion. Animals maintain the normal size of their tissues and organs by adjusting cell 
size to compensate for changes in cell number, or vice versa. The mechanisms that 
make this possible are not known. 
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PROBLEMS 

Which statements are true? Explain why or why not. 

17-1 Since there are about 10 13 cells in an adult human, 
and about 10 10 cells die and are replaced each day, we 
become new people every three years. 

17-2 The regulation of cyclin-Cdk complexes depends 
entirely on phosphorylation and dephosphorylation. 

1 7-3 In order for proliferating cells to maintain a relatively 
constant size, the length of the cell cycle must match the 
time it takes for the cell to double in size. 

17-4 While other proteins come and go during the cell 
cycle, the proteins of the origin recognition complex remain 
bound to the DNA throughout. 

17-5 Chromosomes are positioned on the metaphase 
plate by equal and opposite forces that pull them toward the 
two poles of the spindle. 

1 7-6 If we could turn on telomerase activity in all our cells, 
we could prevent aging. 

Discuss the following problems. 

17-7 Many cell-cycle genes from human cells function 
perfectly well when expressed in yeast cells. Why do you 
suppose that is considered remarkable? After all, many 
human genes encoding enzymes for metabolic reactions 
also function in yeast, and no one thinks that is remarkable. 

1 7-8 You have isolated a new Cdc mutant of budding yeast 
that forms colonies at 25°C but not at 37°C. You would now 



like to isolate the wild-type gene that corresponds to the 
defective gene in your Cdc mutant. How might you isolate 
the wild-type gene using a plasmid-based DNA library pre- 
pared from wild-type yeast cells? 

1 7-9 You have isolated a temperature-sensitive mutant of 
budding yeast. It proliferates well at 25°C, but at 35°C all the 
cells develop a large bud and then halt their progression 
through the cell cycle. The characteristic morphology of the 
cells at the time they stop cycling is known as the landmark 
morphology. 

It is very difficult to obtain synchronous cultures of this 
yeast, but you would like to know exactly where in the cell 
cycle the temperature-sensitive gene product must func- 
tion — its execution point, in the terminology of the field — in 
order for the cell to complete the cycle. A clever friend, who 
has a good microscope with a heated stage and a video cam- 
era, suggests that you take movies of a field of cells as they 
experience the temperature increase, and follow the mor- 
phology of the cells as they stop cycling. Since the cells do 
not move much, it is relatively simple to study individual 
cells. To make sense of what you see, you arrange a circle of 
pictures of cells at the start of the experiment in order of the 
size of their daughter buds. You then find the corresponding 
pictures of those same cells 6 hours later, when growth and 
division has completely stopped. The results with your 
mutant are shown in Figure Q17-1. 

A. Indicate on the diagram in Figure Q17-1 where the exe- 
cution point for your mutant lies. 

B. Does the execution point correspond to the time at which 
the cell cycle is arrested in your mutant? How can you tell? 

17-10 The yeast cohesin subunit Sec 1 , which is essential for 
sister- chromatid pairing, can be artificiallv reeulated for 
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Figure Q1 7-1 Time-lapse 
photography of a temperature- 
sensitive mutant of yeast 
(Problem 1 7-9). Cells on the 
inner ring are arranged in order 
of their bud size, which 
corresponds to their position in 
the cell cycle. After 6 hours at 
37°C, they have given rise to 
the cells shown on the outer 
ring. No further growth or 
division occurs. 



expression at any point in the cell cycle. If expression is 
turned on at the beginning of S phase, all the cells divide sat- 
isfactorily and survive. By contrast, if Sccl expression is 
turned on only after S phase is completed, the cells fail to 
divide and they die, even though Sccl accumulates in the 
nucleus and interacts efficiently with chromosomes. Why do 
you suppose that cohesin must be present during S phase for 
cells to divide normally? 

17-11 If cohesins join sister chromatids all along their 
length, how is it possible for condensins to generate mitotic 
chromosomes such as that shown in Figure Q17-2, which 
clearly shows the two sister chromatids as separate domains? 



Figure Q1 7-2 A 

scanning electron 
micrograph of a 
fully condensed 
mitotic 

chromosome from 
vertebrate cells 
(Problem 17-11). 
(Courtesy of Terry 
D. Allen.) 
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17-12 High doses of caffeine interfere with the DNA repli- 
cation checkpoint mechanism in mammalian cells. Why 
then do you suppose the Surgeon General has not yet issued 
an appropriate warning to heavy coffee and cola drinkers? A 
typical cup of coffee (150 mL) contains 100 mg of caffeine 
(196 g/mole). How many cups of coffee would you have to 
drink to reach the dose (10 mM) required to interfere with 
the DNA replication checkpoint mechanism? (A typical 
adult contains about 40 liters of water.) 

17-13 A living cell from the lung epithelium of a newt is 
shown at different stages in M phase in Figure Q17-3. Order 
these light micrographs into the correct sequence and iden- 
tify the stage in M phase that each represents. 

1 7-1 4 How many kinetochores are there in a human cell at 
mitosis? 

17-15 A classic paper clearly distinguished the properties of 
astral microtubules from those of kinetochore microtubules. 
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Figure Q1 7-3 Light micrographs of a single cell at different stages of 
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Centrosomes were used to initiate microtubule growth, and 
then chromosomes were added. The chromosomes bound 
to the free ends of the microtubules, as illustrated in Figure 
Q17-4. The complexes were then diluted to very low tubulin 
concentration (well below the critical concentration for 
microtubule assembly) and examined again (Figure Q17-4). 
As is evident, only the kinetochore microtubules were stable 
to dilution. 

A. Why do you think kinetochore microtubules are stable? 

B. How would you explain the disappearance of the astral 
microtubules after dilution? Do they detach from the cen- 
trosome, depolymerize from an end, or disintegrate along 
their length at random? 

C. How would a time course after dilution help to distin- 
guish among these possible mechanisms for disappearance 
of the astral microtubules? 

17-16 What are the two distinct cytoskeletal machines that 
are assembled to carry out the mechanical processes of 
mitosis and cytokinesis in animal cells? 

17-17 How do mitogens, growth factors, and survival fac- 
tors differ from one another? 





before dilution 



after dilution 



Figure Q17-4 Arrangements of centrosomes, chromosomes, and 
microtubules before and after dilution to low tubulin concentration 

(Problem 17-15). 
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Apoptosis 




Cell death plays a crucially important part in animal and plant development, and 
it usually continues into adulthood. In a healthy adult human, billions of cells die 
in the bone marrow and intestine every hour. Our tissues do not shrink because, 
by unknown regulatory mechanisms, cell division exactly balances the cell death. 
We now know that these "normal" cell deaths are suicides, in which the cells acti- 
vate an intracellular death program and kill themselves in a controlled way— a 
process known as programmed cell death. The idea that animal cells have a built- 
in death program was proposed in the 1970s, but its general acceptance took 
another 20 years and depended on genetic studies in the nematode C. elegans that 
identified the first genes dedicated to programmed cell death and its control. 

Programmed cell death in animals usually, but not exclusively, occurs by 
apoptosis (from the Greek word meaning "falling off," as leaves from a tree). 
Although apoptosis is only one form of programmed cell death, it is by far the 
most common and best understood, and, confusingly, biologists often use the 
terms programmed cell death and apoptosis interchangeably. Cells dying by 
apoptosis undergo characteristic morphological changes. <GCCC> They shrink 
and condense, the cytoskeleton collapses, the nuclear envelope disassembles, 
and the nuclear chromatin condenses and breaks up into fragments (Figure 
18-1 A). The cell surface often blebs and, if the cell is large, often breaks up into 
membrane-enclosed fragments called apoptotic bodies. Most importantly, the 
surface of the cell or apoptotic bodies becomes chemically altered, so that a 
neighboring cell or a macrophage (a specialized phagocytic cell, discussed in 
Chapter 23) rapidly engulfs them, before they can spill their contents (Figure 
18-1B). In this way, the cell dies neatly and is rapidly cleared away, without caus- 
ing a damaging inflammatory response. Because the cells are eaten and digested 
so quickly, there are usually few dead cells to be seen, even when large numbers 
of cells have died by apoptosis. This is probably why biologists overlooked apop- 
tosis for many years and still probably underestimate its extent. 

By contrast to apoptosis and other less well characterized forms of pro- 
grammed cell death (which implies the operation of an intracellular death pro- 
gram), animal cells that die accidentally in response to an acute insult, such as 
trauma or a lack of blood supply, usually do so by a process called cell necrosis. 
Necrotic cells swell and burst, spilling their contents over their neighbors and 
eliciting an inflammatory response (Figure 18-1C). 

Programmed cell death is not confined to animals. In plants, it occurs dur- 
ing development and in the senescence of flowers and leaves, as well as in the 
response to injury and infection. Programmed cell death even occurs in unicel- 
lular organisms, including yeasts and bacteria. The molecular mechanisms 
involved in these cases are distinct from those that mediate apoptosis in animal 
cells, and we shall not consider them. In this chapter, we discuss the functions 
of programmed cell death in animals, the molecular mechanism of apoptosis 
and its regulation, and how excessive or insufficient apoptosis can contribute 
to human disease. 



Programmed Cell Death Eliminates Unwanted Cells 

The amount of programmed cell death that occurs in developing and adult ani- 
mal tissues can be astonishing. In the developing vertebrate nervous system, for 
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example, more than half of many types of nerve cells normally die soon after 
they are formed. It seems remarkably wasteful for so many cells to die, especially 
as the vast majority are perfectly healthy at the time they kill themselves. What 
purposes does this massive cell death serve? 

In some cases, the answer is clear. In animal development, programmed cell 
death eliminates unwanted cells, usually by apoptosis. Cell death, for example, 
helps sculpt hands and feet during embryonic development: they start out as 
spade-like structures, and the individual digits separate only as the cells 
between them die, as illustrated for a mouse paw in Figure 18-2. In other cases, 
cells die when the structure they form is no longer needed. When a tadpole 
changes into a frog at metamorphosis, the cells in the tail die, and the tail, which 
is not needed in the frog, disappears (Figure 18-3). In many other cases, cell 
death helps regulate cell numbers. In the developing nervous system, for exam- 
ple, cell death adjusts the number of nerve cells to match the number of target 
cells that the nerve cells connect to, as we discuss later. 

Programmed cell death also functions as a quality-control process in devel- 
opment, eliminating cells that are abnormal, misplaced, nonfunctional, or 
potentially dangerous to the animal. Striking examples occur in the vertebrate 
adaptive immune system, where apoptosis eliminates developing T and B lym- 
phocytes that either fail to produce potentially useful antigen- specific receptors 
or produce self-reactive receptors that make the cells potentially dangerous; it 
also eliminates most of the lymphocytes activated by an infection, after they 
have helped destroy the responsible microbes (discussed in Chapter 25). 

In adult tissues that are neither growing nor shrinking, cell death and cell 
division must be tightly regulated to ensure that they are exactly in balance. If 




Figure 1 8-1 Two distinct forms of cell 

death. These electron micrographs show 
cells that have died by apoptosis (A and 
B) or by a type of accidental cell death 
called necrosis (C).The cells in (A) and (C) 
died in a culture dish, whereas the cell in 
(B) died in a developing tissue and has 
been engulfed by a phagocytic cell. Note 
that the cells in (A) and (B) have 
condensed but seem relatively intact, 
whereas the cell in (C) seems to have 
exploded. The large vacuoles visible in 
the cytoplasm of the cell in (A) are a 
variable feature of apoptosis. (Courtesy 
of Julia Burne.) 
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Figure 18-2 Sculpting the digits in the 
developing mouse paw by apoptosis. 

(A) The paw in this mouse fetus has been 
stained with a dye that specifically labels 
cells that have undergone apoptosis. The 
apoptotic cells appear as bright green 
dots between the developing digits. 
(B)The interdigital cell death has 
eliminated the tissue between the 
developing digits, as seen one day later, 
when there are very few apoptotic cells. 
{From W. Wood et a I., Development 
1 27:5245-5252, 2000. With permission 
from The Company of Biologists.) 
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part of the liver is removed in an adult rat, for example, liver cell proliferation 
increases to make up the loss. Conversely, if a rat is treated with the drug phe- 
nobarbital — which stimulates liver cell division (and thereby liver enlarge- 
ment) — and then the phenobarbital treatment is stopped, apoptosis in the liver 
greatly increases until the liver has returned to its original size, usually within a 
week or so. Thus, the liver is kept at a constant size through the regulation of 
both the cell death rate and the cell birth rate, although the control mechanisms 
responsible for such regulation are largely unknown. 

Apoptosis occurs at a staggeringly high rate in the adult human bone mar- 
row, where most blood cells are produced. Here, for example, neutrophils (a type 
of white blood cell discussed in Chapter 23) are produced continuously in very 
large numbers, but the vast majority die by apoptosis in the bone marrow within 
a few days without ever functioning. This apparently futile cycle of production 
and destruction serves to maintain a ready supply of short-lived neutrophils 
that can be rapidly mobilized to fight infection wherever it occurs in the body. 
Compared with the life of the organism, cells are evidently cheap. 

Animal cells can recognize damage in their various organelles and, if the 
damage is great enough, they can kill themselves by undergoing apoptosis. An 
important example is DNA damage, which can produce cancer-promoting 
mutations if not repaired. Cells have various ways of detecting DNA damage, 
and, if they cannot repair it, they often kill themselves by undergoing apoptosis. 



Figure 18-3 Apoptosis during the 
metamorphosis of a tadpole into a frog. 

As a tadpole changes into a frog, the cells 
in the tadpole tail are induced to undergo 
apoptosis; as a consequence, the tail is 
lost. An increase in thyroid hormone in 
the blood stimulates all the changes that 
occur during metamorphosis, including 
apoptosis in the tail. 



Apoptotic Cells Are Biochemically Recognizable 

Cells undergoing apoptosis not only have a characteristic morphology but also 
display characteristic biochemical changes, which can be used to identify apop- 
totic cells. During apoptosis, for example, an endonuclease cleaves the chromo- 
somal DNA into fragments of distinctive sizes; because the cleavages occur in 
the linker regions between nucleosomes, the fragments separate into a charac- 
teristic ladder pattern when analyzed by gel electrophoresis (Figure 18-4A). 
Moreover, the cleavage of DNA generates many new DNA ends, which can be 
marked in apoptotic nuclei by using a labeled nucleotide in the so-called TUNEL 
technique (Figure 18-4B). 

An especially important change occurs in the plasma membrane of apop- 
totic cells. The negatively charged phospholipid phosphatidylserine is normally 
exclusively located in the inner leaflet of the lipid bilayer of the plasma mem- 
brane (see Figures 10-3 and 10-16), but it flips to the outer leaflet in apoptotic 
cells, where it can serve as a marker of these cells. The phosphatidylserine on the 
surface of apoptotic cells can be visualized with a labeled form of the Annexin V 
protein, which specifically binds to this phospholipid. The cell-surface phos- 
phatidylserine is more than a convenient marker of apoptosis for biologists; it 
helps signal to neighboring cells and macrophages to phagocytose the dying 
cell. In addition to serving as an "eat me" signal, it also blocks the inflammation 
often associated with phagocytosis: the phosphatidylserine-dependent engulf- 
ment of apoptotic cells inhibits the production of inflammation-inducing signal 
proteins (cytokines) by the phagocytic cell. 

Macrophages will phagocytose most types of small particles, including oil 
droplets and glass beads, but they do not phagocytose any healthy cells in the 
animal, presumably because healthy cells express "don't eat me" signal molecules 
on their surface. Thus, in addition to expressing cell-surface "eat me" signals such 
as phosphatidylserine that stimulate phagocytosis, apoptotic cells must lose or 
inactivate their "don't eat me" signals in order for macrophages to ingest them. 
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Figure 18-4 Markers of apoptosis. (A) Cleavage of nuclear DNA into a 
characteristic ladder pattern of fragments. Mouse thymus lymphocytes 
were treated with an antibody against the cell-surface death receptor Fas 
{discussed later), inducing the cells to undergo apoptosis. After various 
times (indicated in hours at the top of the figure), DNA was extracted, and 
the fragments were separated by size by electrophoresis in an agarose gel 
and stained with ethidium bromide. (B) The TUNEL technique was used to 
label the cut ends of DNA fragments in the nuclei of apoptotic cells in a 
tissue section of a developing chick leg bud; this cross section through the 
skin and underlying tissue is from a region between two developing digits, 
as indicated in the underlying drawing. The procedure is called the TUNEL 
(IdT-mediated dUTP nick end labeling) technique because the enzyme 
terminal deoxynucleotidyl transferase (TdT) adds chains of labeled 
deoxynucleotide (dUTP) to the 3'-OH ends of DNA fragments. (A, from 
D. Mcllroy et al., Genes Dev. 14:549-558, 2000. With permisison from Cold 
Spring Harbor Laboratory Press; B, from V. Zuzarte-Luis and J.M. Hurle, 
Int. J. Dev. Biol. 46:871-876, 2002. With permission from UBC Press.) 



Cells undergoing apoptosis often lose the electrical potential that normally 
exists across the inner membrane of their mitochondria (discussed in Chapter 
14). This membrane potential can be measured by the use of positively charged 
fluorescent dyes that accumulate in mitochondria, driven by the negative charge 
on the inside of the inner membrane. A decrease in the labeling of mitochondria 
with these dyes helps to identify cells that are undergoing apoptosis. As we dis- 
cuss later, proteins such as cytochrome c are usually released from the space 
between the inner and outer membrane (the intermembrane space) of mito- 
chondria during apoptosis, and the relocation of cytochrome c from mitochon- 
dria to the cytosol can be used as another marker of apoptosis (see Figure 18-7). 



Apoptosis Depends on an Intracellular Proteolytic Cascade That Is 
Mediated by Caspases 

The intracellular machinery responsible for apoptosis is similar in all animal 
cells. It depends on a family of proteases that have a cysteine at their active site 
and cleave their target proteins at specific aspartic acids. They are therefore 
called caspases (c for cysteine and asp for aspartic acid). Caspases are synthe- 
sized in the cell as inactive precursors, or pro caspases, which are typically acti- 
vated by proteolytic cleavage. Procaspase cleavage occurs at one or two specific 
aspartic acids and is catalyzed by other (already active) caspases; the procaspase 
is split into a large and a small subunit that form a heterodimer, and two such 
dimers assemble to form the active tetramer (Figure 18-5A). Once activated, 
caspases cleave, and thereby activate, other procaspases, resulting in an ampli- 
fying proteolytic cascade (Figure 18-5B). 

Not all caspases mediate apoptosis. Indeed, the first caspase identified was 
a human protein called interleukin-1 -converting enzyme (ICE), which is con- 
cerned with inflammatory responses rather than with cell death; ICE cuts out 
the inflammation-inducing cytokine interleukin-1 (III) from a larger precursor 
protein. Subsequent to the discovery of ICE, a gene required for apoptosis in C. 
eleganswzs. shown to encode a protein that is structurally and functionally sim- 
ilar to ICE, providing the first evidence that proteolysis and caspases are 
involved in apoptosis. It is now clear diat, whereas several human caspases are 
involved in inflammatory and immune responses, most are involved in apopto- 
sis (Table 18-1). 

As shown in Figure 18-5B and Table 18-1, some of the procaspases that 
operate in apoptosis act at the start of the proteolytic cascade and are called ini- 
tiator procaspases; when activated, they cleave and activate downstream exe- 
cutioner procaspases, which, then cleave and activate other executioner pro- 
caspases, as well as specific target proteins in the cell. Among the many target 
proteins cleaved by executioner caspases are the nuclear lamins (see Figure 
18-5B), the cleavage of which causes the irreversible breakdown of the nuclear 
lamina (discussed in Chapter 16) . Another target is a protein that normally holds 
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the DNA- degrading enzyme mentioned earlier (an endonuclease) in an inactive 
form; its cleavage frees the endonuclease to cut up the DNA in the cell nucleus. 
Other target proteins include components of the cytoskeleton and cell-cell 
adhesion proteins that attach cells to their neighbors; the cleavage of these pro- 
teins helps the apoptotic cell to round up and detach from its neighbors, mak- 
ing it easier for a healthy neighboring cell to engulf it, or, in the case of an epithe- 
lial cell, for the neighbors to extrude the apoptotic cell from the cell sheet. The 
caspase cascade is not only destructive and self- amplifying but also irreversible, 
so that once a cell reaches a critical point along the path to destruction, it can- 
not turn back. 

The caspases required for apoptosis vary depending on the cell type and 
stimulus. Inactivation of the mouse gene encoding caspase-3, an executioner 
caspase, for example, reduces normal apoptosis in the developing brain. As a 
result, the mouse often dies around birth with a deformed brain that contains 
too many cells. Apoptosis occurs normally, however, in many other organs of 
such mice. 

From the earliest stages of an animal's development, healthy cells continu- 
ously make the procaspases and other proteins required for apoptosis. Thus, the 
apoptosis machinery is always in place; all that is needed is a trigger to activate 
it. How, then, is a caspase cascade initiated? In particular, how is the first pro- 
caspase in the cascade activated? Initiator procaspases have a long prodomain, 
which contains a caspase recruitment domain (CARD) that enables them to 
assemble with adaptor proteins into activation complexes when the cell receives 
a signal to undergo apoptosis. Once incorporated into such a complex, the ini- 
tiator procaspases are brought into close proximity, which is sufficient to acti- 
vate them; they then cleave each other to make the process irreversible. The acti- 
vated initiator caspases then cleave and activate executioner procaspases, 
thereby initiating a proteolytic caspase cascade, which amplifies the death sig- 
nal and spreads it throughout the cell. 

The two best understood signaling pathways that can activate a caspase cas- 
cade leading to apoptosis in mammalian cells are called the extrinsic pathway 
and the intrinsic pathway. Each uses its own initiator procaspases and activa- 
tion complex, as we now discuss. 



Figure 18-5 Procaspase activation 
during apoptosis. (A) Each caspase is 
initially made as an inactive proenzyme 
(procaspase). Some procaspases are 
activated by proteolytic cleavage by an 
activated caspase: two cleaved fragments 
from each of two procaspase molecules 
associate to form an active caspase, 
which is a tetramer of two small and two 
large subunits; the prodomains are 
usually discarded, as indicated. (B) The 
first procaspases activated are called 
initiator procaspases, which then cleave 
and activate many executioner procaspase 
molecules, producing an amplifying 
chain reaction (a proteolytic caspase 
cascade). The executioner caspases then 
cleave a variety of key proteins in the cell, 
including specific cytosolic proteins and 
nuclear lamins, as shown here, leading to 
the controlled death of the cell. Although 
not shown, the initiator procaspases are 
activated by adaptor proteins that bring 
the procaspases together in close 
proximity within an activation complex; 
although the initiator procaspases cleave 
each other within the complex, the 
cleavage serves only to stabilize the 
active protease. 



Table 18-1 Some Human Caspases 

Caspases involved in inflammation 
Caspases involved in apoptosis 
Initiator caspases 
Executioner caspases 



caspases 1 (ICE), 4, 5 

caspases 2, 8, 9, 10 
caspases 3, 6, 7 
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Cell-Surface Death Receptors Activate the Extrinsic Pathway of 
Apoptosis 

Extracellular signal proteins binding to cell- surface death receptors trigger the 
extrinsic pathway of apoptosis. Death receptors are transmembrane proteins 
containing an extracellular ligand-binding domain, a single transmembrane 
domain, and an intracellular death domain, which is required for the receptors 
to activate the apoptotic program. The receptors are homotrimers and belong to 
the tumor necrosis factor (TNF) receptor family, which includes a receptor for 
TNF itself (discussed in Chapter 15) and the Fas death receptor. The ligands that 
activate the death receptors are also homotrimers; they are structurally related 
to one another and belong to the TNF family of signal proteins. 

A well-understood example of how death receptors trigger the extrinsic 
pathway of apoptosis is the activation of Fas on the surface of a target cell by Fas 
ligand on the surface of a killer (cytotoxic) lymphocyte (discussed in Chapter 
25). When activated by the binding of Fas ligand, the death domains on the 
cytosolic tails of the Fas death receptors recruit intracellular adaptor proteins, 
which in turn recruit initiator procaspases {procaspase-8 } procaspase-10 t or 
both), forming a death-inducing signaling complex (DISC). Once activated in 
the DISC, the initiator caspases activate downstream executioner procaspases to 
induce apoptosis (Figure 18-6). As we discuss later, in some cells the extrinsic 
pathway must recruit the intrinsic apoptotic pathway to amplify the caspase 
cascade in order to kill the cell. 

Many cells produce inhibitory proteins that act either extracellularly or 
intracellularly to restrain the extrinsic pathway For example, some produce cell- 
surface decoy receptors, which have a ligand-binding domain but not a death 
domain; because they can bind a death ligand but cannot activate apoptosis, the 
decoys competitively inhibit the death receptors. Cells can also produce intra- 
cellular blocking proteins such as FLIP, which resembles an initiator procaspase 
but lacks the proteolytic domain; it competes with procaspase-8 and procas- 
pase- 10 for binding sites in the DISC and thereby inhibits the activation of these 
initiator procaspases. Such inhibitory mechanisms help prevent the inappropri- 
ate activation of the extrinsic pathway of apoptosis. 

In some circumstances, death receptors activate other intracellular signal- 
ing pathways that do not lead to apoptosis. TNF receptors, for example, can also 
activate the NFkB pathway (discussed in Chapter 15), which can promote cell 



Figure 1 8-6 The extrinsic pathway of 
apoptosis activated through Fas death 
receptors. Fas ligand on the surface of a 
killer lymphocyte activates Fas death 
receptors on the surface of the target 
cell. Both the ligand and receptor are 
homotrimers. The cytosolic tail of Fas 
then recruits the adaptor protein FADD 
via the death domain on each protein 
(FADD stands for Fas-associated death 
domain). Each FADD protein then 
recruits an initiator procaspase 
(procaspase-8, procaspase-10, or both) 
via a death effector domain on both 
FADD and the procaspase, forming a 
death-inducing signaling complex 
(DISC). Within the DISC, the initiator 
procaspase molecules are brought into 
close proximity, which activates them; 
the activated procaspases then cleave 
one another to stabilize the activated 
protease, which is now a caspase. 
Activated caspase-8 and caspase-10 
then cleave and activate executioner 
procaspases, producing a caspase 
cascade, which leads to apoptosis. 
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survival and activate genes involved in inflammatory responses. Which responses 
dominate depends on the type of cell and the other signals acting on it. 

The Intrinsic Pathway of Apoptosis Depends on Mitochondria 

Cells can also activate their apoptosis program from inside the cell, usually in 
response to injury or other stresses, such as DNA damage or lack of oxygen, 
nutrients, or extracellular survival signals (discussed later). In vertebrate cells, 
such intracellular activation of the apoptotic death program occurs via the 
intrinsic pathway of apoptosis, which depends on the release into the cytosol of 
mitochondrial proteins that normally reside in the intermembrane space of 
these organelles (see Figure 12-21A). Some of the released proteins activate a 
caspase proteolytic cascade in the cytoplasm, leading to apoptosis. 

A crucial protein released from mitochondria in the intrinsic pathway is 
cytochrome c, a water-soluble component of the mitochondrial electron-trans- 
port chain. When released into the cytosol (Figure 18-7), it has an entirely dif- 
ferent function: it binds to a procaspase-activating adaptor protein called Apaf 1 
(apoptotic protease activating factor- 1), causing the Apafl to oligomerize into a 
wheel-like heptamer called an apoptosome. The Apafl proteins in the apopto- 
some then recruit initiator procaspase proteins (procaspase-9), which are acti- 
vated by proximity in the apoptosome, just as procaspase-8 and -10 proteins are 
activated in the DISC. The activated caspase-9 molecules then activate down- 
stream executioner procaspases to induce apoptosis (Figure 18-8). 

As mentioned earlier, in some cells, the extrinsic pathway must recruit the 
intrinsic pathway to amplify the apoptotic signal to kill the cell It does so by 
activating a member of the Bcl2 family of proteins, which we now discuss. 



Bcl2 Proteins Regulate the Intrinsic Pathway of Apoptosis 

The intrinsic pathway of apoptosis is tightly regulated to ensure that cells kill 
themselves only when it is appropriate. A major class of intracellular regulators 
of apoptosis is the Bcl2 family of proteins, which, like the caspase family, has 
been conserved in evolution from worms to humans; a human Bcl2 protein, for 
example, can suppress apoptosis when expressed in C. elegans. 
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Figure 18-7 Release of cytochrome c 
from mitochondria during apoptosis. 

Fluorescence micrographs of human 
cancer cells in culture. (A) The control 
cells were transfected with a gene 
encoding a fusion protein consisting of 
cytochrome c linked to green fluorescent 
protein (cytochrome-c-GFP); they were 
also treated with a positively charged red 
dye that accumulates in mitochondria. 
The overlapping distribution of the green 
and red indicate that the cytochrome-c- 
GFP is located in mitochondria. (B) Cells 
expressing cytochrome-c-GFP were 
irradiated with ultraviolet light to induce 
apoptosis, and after 5 hours they were 
stained with antibodies (in red) against 
cytochrome c; the cytochrome-c-GFP is 
also shown (in green). The six cells in the 
bottom half of the micrographs in B have 
released their cytochrome c from 
mitochondria into the cytosol, whereas 
the cells in the upper half of the 
micrographs have not yet done so. (From 
J.C. Goldstein et al., Nat Cell Biol. 
2:156-162, 2000. With permission from 
Macmillan Publishers Ltd.) 
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Mammalian Bcl2 proteins regulate the intrinsic pathway of apoptosis 
mainly by controlling the release of cytochrome c and other intermembrane 
mitochondrial proteins into the cytosol. Some Bcl2 proteins are pro-apoptotic 
and promote apoptosis by enhancing the release, whereas others are anti-apop- 
totic and inhibit apoptosis by blocking the release. The pro-apoptotic and anti- 
apoptotic Bcl2 proteins can bind to each other in various combinations to form 
heterodimers, in which the two proteins inhibit each other's function. The bal- 
ance between the activities of these two functional classes of Bcl2 proteins 
largely determines whether a mammalian cell lives or dies by the intrinsic path- 
way of apoptosis. 

As illustrated in Figure 18-9, the anti-apoptotic Bcl2 proteins, including 
Bcl2 itself (the founding member of the Bcl2 family) and BcI-Xl> share four dis- 
tinctive Bcl2 homology (BH) domains (BH1-4). The pro-apoptotic Bcl2 proteins 
consist of two subfamilies — the BH123 proteins and the BH3~only proteins. The 
main BH123 proteins are Bax and Bak, which are structurally similar to Bcl2 but 
lack the BH4 domain. The BH3-only proteins share sequence homology with 
Bcl2 in only the BH3 domain (see Figure 18-9). 

When an apoptotic stimulus triggers the intrinsic pathway, the pro-apop- 
totic BH123 proteins become activated and aggregate to form oligomers in the 
mitochondrial outer membrane, inducing the release of cytochrome c and other 
intermembrane proteins by an unknown mechanism (Figure 18-10). In mam- 
malian cells, Bax and Bak are the main BH123 proteins, and at least one of them 
is required for the intrinsic pathway of apoptosis to operate: mutant mouse cells 
that lack both proteins are resistant to all pro-apoptotic signals that normally 
activate this pathway. Whereas Bak is tightly bound to the mitochondrial outer 
membrane even in the absence of an apoptotic signal, Bax is mainly located in 
the cytosol and translocates to the mitochondria only after an apoptotic signal 
activates it. As we discuss below, the activation of Bax and Bak usually depends 
on activated pro-apoptotic BH3-only proteins. Both Bax and Bak also operate on 
the surface of the endoplasmic reticulum (ER) and nuclear membranes; when 
activated in response to ER stress, they are thought to release Ca 2+ into the 
cytosol, which helps activate the mitochondrial-dependent intrinsic pathway of 
apoptosis by a poorly understood mechanism. 

The anti-apoptotic Bcl2 proteins such as Bcl2 itself and Bcl-X L are also 
mainly located on the cytosolic surface of the outer mitochondrial membrane, 
the ER, and the nuclear envelope, where they help preserve the integrity of the 



Figure 18-8 The intrinsic pathway of 
apoptosis. (A) A schematic drawing of how 
cytochrome c released from mitochondria 
activates Apafl . The binding of 
cytochrome c causes the Apafl to 
hydrolyze its bound dATP to dADP (not 
shown). The replacement of the dADP with 
dATP or ATP (not shown) then induces the 
complex of Apafl and cytochrome c to 
aggregate to form a large, heptameric 
apoptosome, which then recruits 
procaspase-9 through a caspase 
recruitment domain (CARD) in each 
protein. The procaspase-9 molecules are 
activated within the apoptosome and are 
now able to cleave and activate 
downstream executioner procaspases, 
which leads to the cleavage and activation 
of these molecules in a caspase cascade. 
Other proteins released from the 
mitochondrial intermembrane space are 
not shown. (B) A model of the three- 
dimensional structure of an apoptosome. 
Note that some scientists use the term 
"apoptosome" to refer to the complex 
containing procaspase-9. (B, from 
D. Aceham et al., Moi Cell 9:423-432, 2002. 
With permission from Elsevier.) 
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Figure 1 8-9 The three classes of Bcl2 
proteins. Note that the BH3 domain is 
the only BH domain shared by all Bcl2 
family members; it mediates the direct 
interactions between pro-apoptotic and 
anti-apoptotic family members. 
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membrane— preventing, for example, inappropriate release of intermembrane 
proteins from mitochondria and of Ca 2+ from the ER. These proteins inhibit 
apoptosis mainly by binding to and inhibiting pro-apoptotic Bcl2 proteins— 
either on these membranes or in the cytosol. On the outer mitochondrial mem- 
brane, for example, they bind to Bak and prevent it from oligomerizing, thereby 
inhibiting the release of cytochrome c and other intermembrane proteins. There 
are at least five mammalian anti-apoptotic Bcl2 proteins, and every mammalian 
cell requires at least one to survive. Moreover, a number of these proteins must 
be inhibited for the intrinsic pathway to induce apoptosis; the BH3-only pro- 
teins mediate the inhibition. 

The BH3-only proteins are the largest subclass of Bcl2 family proteins. The 
cell either produces or activates them in response to an apoptotic stimulus, and 
they are thought to promote apoptosis mainly by inhibiting anti-apoptotic Bcl2 
proteins. Their BH3 domain binds to a long hydrophobic groove on anti-apop- 
totic Bcl2 proteins, neutralizing their activity. By a poorly understood mecha- 
nism, this binding and inhibition enables the aggregation of Bax and Bak on the 
surface of mitochondria, which triggers the release of the intermembrane mito- 
chondrial proteins that induce apoptosis (Figure 18-11). Some BH3-only pro- 
teins may bind directly to Bax and Bak to help trigger the activation and aggre- 
gation of these BH123 pro-apoptotic proteins on mitochondria and thereby help 
release the intermembrane proteins. 

BH3-only proteins provide the crucial link between apoptotic stimuli and 
the intrinsic pathway of apoptosis, with different stimuli activating different 
BH3-only proteins. When some cells are deprived of extracellular survival sig- 
nals, for example, an intracellular signaling pathway that depends on the MAP 
kinase JNK activates the transcription of the gene encoding the BH3-only pro- 
tein Bim, which then triggers the intrinsic pathway. Similarly, in response to 
DNA damage that cannot be repaired, the tumor suppressor protein p53 accu- 
mulates (discussed in Chapters 17 and 20) and activates the transcription of 
genes that encode the BH3-only proteins Puma and Noxa; these BH3-only pro- 
teins then trigger the intrinsic pathway, thereby eliminating a potentially dan- 
gerous cell that could otherwise become cancerous. 
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Figure 18-10 The role of BH123 
pro-apoptotic Bcl2 proteins (mainly Bax 
and Bak) in the release of mitochondrial 
intermembrane proteins in the intrinsic 
pathway of apoptosis. When activated 
by an apoptotic stimulus, the BH1 23 
proteins aggregate on the outer 
mitochondrial membrane and release 
cytochrome c and other proteins from 
the intermembrane space into the 
cytosol by an unknown mechanism. 
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Figure 18-1 1 How pro-apoptotic 
BH3-onlyand anti-apoptotic Bcl2 
proteins regulate the intrinsic pathway 
of apoptosis. (A) In the absence of an 
apoptotic stimulus, anti-apoptotic Bcl2 
proteins bind to and inhibit the BH1 23 
proteins on the mitochondrial outer 
membrane (and in the cytosol — not 
shown). (B) In the presence of an 
apoptotic stimulus, BH3-only proteins are 
activated and bind to the anti-apoptotic 
Bcl2 proteins so that they can no longer 
inhibit the BH123 proteins, which now 
become activated and aggregate in the 
outer mitochondrial membrane and 
promote the release of intermembrane 
mitochondrial proteins into the cytosol. 
Some activated BH3-only proteins may 
stimulate mitochondrial protein release 
more directly by binding to and activating 
the BH123 proteins. Although not shown, 
the anti-apoptotic Bcl2 proteins are bound 
to the mitochondrial surface. 



As mentioned earlier, in some cells the extrinsic apoptotic pathway recruits 
the intrinsic pathway to amplify the caspase cascade to kill the cell. The BH3- 
only protein Bid is the link between the two pathways. When death receptors 
activate the extrinsic pathway in these cells, the initiator caspase, caspase-8, 
cleaves Bid, producing a truncated form of Bid called tBid. tBid translocates to 
mitochondria, where it inhibits anti-apoptotic Bcl2 proteins and triggers the 
aggregation of pro-apoptotic BH123 proteins to release cytochrome c and other 
intermembrane proteins, thereby amplifying the death signal. 

The BH3-only proteins Bid, Bim, and Puma (see Figure 18-9) can inhibit all 
of the anti-apoptotic Bcl2 proteins, whereas the other BH3-only proteins can 
inhibit only a small subset of the anti-apoptotic proteins. Thus, Bid, Bim, and 
Puma are the most potent activators of apoptosis in the BH3-only subfamily of 
Bcl2 proteins. 

Bcl2 proteins are not the only intracellular regulators of apoptosis. The IAP 
(inhibitor of apoptosis) proteins also play an important part in suppressing 
apoptosis, especially in Drosophila. 



lAPs Inhibit Caspases 

Inhibitors of apoptosis (IAPs) were first identified in certain insect viruses (bac- 
uloviruses), which encode IAP proteins to prevent a host cell that is infected by 
the virus from killing itself by apoptosis. (Virus-infected animal cells frequently 
kill themselves to prevent the virus from replicating and infecting other cells.) It 
is now known that most animal cells also make IAP proteins. 
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All IAPs have one or more BIR (baculovirus IAP repeat) domains, which 
enable them to bind to and inhibit activated caspases. Some IAPs also polyubiq- 
uitylate caspases, marking the caspases for destruction by proteasomes. In this 
way, the IAPs set an inhibitory threshold that activated caspases must overcome 
to trigger apoptosis. 

In Drosophila at least, this inhibitory barrier provided by IAPs can be neu- 
tralized by anti-IAP proteins, which are produced in response to various apop- 
totic stimuli. There are five anti-IAPs in flies, including Reaper, Grim, and Hid, 
and their only structural similarity is their short, N-terminal, IAP-binding motif, 
which binds to the BIR domain of IAPs, preventing the domain from binding to a 
caspase. Deletion of the three genes encoding Reaper, Grim, and Hid blocks 
apoptosis in flies. Conversely, inactivation of one of the two genes that encode 
IAPs in Drosophila causes all of the cells in the developing fly embryo to undergo 
apoptosis. Clearly, the balance between IAPs and anti-IAPs is tightly regulated 
and is crucial for controlling apoptosis in the fly. 

The role of mammalian anti-IAP proteins in apoptosis is more controversial. 
As illustrated in Figure 18-12, anti-IAPs are released from the mitochondrial 
intermembrane space when the intrinsic pathway of apoptosis is activated, 
blocking IAPs in the cytosol and thereby promoting apoptosis. When, however, 
the genes encoding two known mammalian anti-IAPs called Smac (also called 
DIABLO) and Omi are inactivated in mouse cells, apoptosis is apparently unaf- 
fected — hence the controversy about their normal roles in regulating apoptosis. 

In summary, the combined activities of the Bcl2 proteins, IAPs, and anti-IAPs 
determine the sensitivity of an animal cell to apoptosis-inducing stimuli, with 
IAPs and anti-IAPs dominant in flies and Bcl2 proteins dominant in mammals. 
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Figure 18-12 A proposed model for the 
roles of IAPs and anti-IAPs in the control 
of apoptosis in mammalian cells. (A) In 

the absence of an apoptotic stimulus, 
IAPs prevent accidental apoptosis caused 
by the spontaneous activation of 
procaspases. The IAPs are located in the 
cytosol and bind to and inhibit any 
spontaneously activated caspases. Some 
IAPs are also ubiquitin ligases that 
ubiquitylate the caspases they bind to, 
marking them for degradation in 
proteasomes (not shown). (B) When an 
apoptotic stimulus activates the intrinsic 
pathway, among the proteins released 
from the mitochondrial intermembrane 
space are anti-IAP proteins, which bind to 
and block the inhibitory activity of the 
IAPs. At the same time, the released 
cytochrome c triggers the assembly of 
apoptosomes, which can now activate a 
caspase cascade, leading to apoptosis. 
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Extracellular Survival Factors Inhibit Apoptosis in Various Ways 

As discussed in Chapter 15, intercellular signals regulate most activities of ani- 
mal cells, including apoptosis. These extracellular signals are part of the normal 
"social" controls that ensure that individual cells behave for the good of the 
organism as a whole— in this case, by surviving when they are needed and killing 
themselves when they are not. Some extracellular signal molecules stimulate 
apoptosis, whereas others inhibit it. We have discussed signal proteins such as 
Fas ligand that activate death receptors and thereby trigger the extrinsic path- 
way of apoptosis. Other extracellular signal molecules that stimulate apoptosis 
are especially important during animal development: a surge of thyroid hor- 
mone in the bloodstream, for example, signals cells in the tadpole tail to 
undergo apoptosis at metamorphosis (see Figure 18-3), while locally produced 
bone morphogenic proteins (BMPs, discussed in Chapters 15 and 22) stimulate 
cells between developing fingers and toes to kill themselves (see Figure 18-2). 
Here, however, we focus on extracellular signal molecules that inhibit apoptosis, 
which are collectively called survival factors. 

Most animal cells require continuous signaling from other cells to avoid 
apoptosis. This surprising arrangement apparently helps ensure that cells sur- 
vive only when and where they are needed. Nerve cells, for example, are pro- 
duced in excess in the developing nervous system and then compete for limited 
amounts of survival factors that are secreted by the target cells that they nor- 
mally connect to. Nerve cells that receive enough of the appropriate type of sur- 
vival signal live, while the others die. In this way, the number of surviving neu- 
rons is automatically adjusted so that it is appropriate for the number of target 
cells they connect with (Figure 18-13). A similar competition for limited 
amounts of survival factors produced by neighboring cells is thought to control 
cell numbers in other tissues, both during development and in adulthood. 

Survival factors usually bind to cell-surface receptors, which activate intra- 
cellular signaling pathways that suppress the apoptotic program, often by regu- 
lating members of the Bcl2 family of proteins. Some survival factors, for exam- 
ple, stimulate an increased production of anti-apoptotic Bcl2 proteins such as 
Bcl2 itself or Bcl-X L (Figure 18-14A). Others act by inhibiting the function of 
BH3-only pro-apoptotic Bcl2 proteins such as Bad (Figure 18-14B). In 
Drosophila, some survival factors act by phosphorylating and inactivating anti- 
IAP proteins, thereby enabling IAP proteins to suppress apoptosis (Figure 
18-14C). 

When mammalian cells are deprived of survival factors, they kill themselves 
by producing and activating pro-apoptotic BH3-only proteins, which activate 
the intrinsic pathway of apoptosis by overriding the anti-apoptotic Bcl2 proteins 
that are required to keep the cells alive. Mouse cells that lack both Bax and Bak 
are unable to activate the intrinsic pathway and can therefore live for weeks in 
culture in the absence of survival factors; without survival signals, however, the 
cells cannot efficiently import nutrients. Such cells fuel their metabolic needs 
through autophagy, in which the cell sequesters organelles and bits of its cyto- 
plasm within autophagosomes, which then fuse with lysosomes (discussed in 
Chapter 13). The cells eventually die from starvation, but not by apoptosis. 
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Figure 1 8-1 3 The role of survival factors 
and cell death in adjusting the number 
of developing nerve cells to the amount 
of target tissue. More nerve cells are 
produced than can be supported by the 
limited amount of survival factors 
released by the target cells. Therefore, 
some cells receive an insufficient amount 
of survival factors to avoid apoptosis. This 
strategy of overproduction followed by 
culling ensures that all target cells are 
contacted by nerve cells and that the 
extra nerve cells are automatically 
eliminated. 
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Either Excessive or Insufficient Apoptosis Can Contribute to 
Disease 

There are many human disorders in which excessive numbers of cells undergo 
apoptosis and thereby contribute to tissue damage. Among the most dramatic 
examples are heart attacks and strokes. In these acute conditions, many cells die 
by necrosis as a result of ischemia (inadequate blood supply), but some of the 
less affected cells die by apoptosis. It is hoped that, in the future, drugs such as 
caspase inhibitors that block apoptosis will prove useful in saving cells in these 
conditions. 

There are other conditions where too few cells die by apoptosis. Mutations 
in mice and humans, for example, that inactivate the genes that encode the Fas 
death receptor or the Fas ligand prevent the normal death of some lymphocytes, 
causing these cells to accumulate in excessive numbers in the spleen and lymph 
glands. In many cases, this leads to autoimmune disease, in which the lympho- 
cytes react against the individual's own tissues. 

Decreased apoptosis also makes an important contribution to many 
tumors, as cancer cells often regulate the apoptotic program abnormally 
<TGAA> The Bcl2 gene, for example, was first identified in a common form of 
lymphocyte cancer in humans, where a chromosome translocation causes 
excessive production of the Bcl2 protein; indeed, Bcl2 gets its name from this B 
cell lymphoma. The high level of Bcl2 protein in the lymphocytes that carry the 
translocation promotes the development of cancer by inhibiting apoptosis, 
thereby prolonging cell survival and increasing cell numbers; it also decreases 
the cells' sensitivity to anticancer drugs, which commonly work by causing can- 
cer cells to undergo apoptosis. 

Similarly, the gene encoding the tumor suppressor protein p53 is mutated in 
50% of human cancers so that it no longer promotes apoptosis or cell-cycle 
arrest in response to DNA damage. The lack of p53 function therefore enables 
the cancer cells to survive and proliferate even when their DNA is damaged; in 
this way, the cells accumulate more mutations, some of which make the cancer 
more malignant (discussed in Chapter 20). As many anticancer drugs induce 
apoptosis (and cell-cycle arrest) by a p53-dependent mechanism (discussed in 
Chapters 17 and 20), the loss of p53 function also makes cancer cells less sensi- 
tive to these drugs. 

Most human cancers arise in epithelial tissues such as those in the lung, 
intestinal tract, breast, and prostate. Such cancer cells display many abnormal- 
ities in their behavior, including a decreased ability to adhere to the extracellu- 
lar matrix and to adhere to one another at specialized cell-cell junctions. In the 
next chapter, we discuss the remarkable structures and functions of the extra- 
cellular matrix and cell junctions. 



Figure 18-1 4 Three ways that 
extracellular survival factors can inhibit 
apoptosis. (A) Some survival factors 
suppress apoptosis by stimulating the 
transcription of genes that encode 
anti-apoptotic Bcl2 proteins such as Bcl2 
itself or Bcl-X L . (B) Many others activate 
the serine/threonine protein kinase Akt, 
which, among many other targets, 
phosphorylates and inactivates the 
BH3-only pro-apoptotic Bcl2 protein Bad 
(see Figure 15-64). When not 
phosphorylated, Bad promotes apoptosis 
by binding to and inhibiting Bcl2; once 
phosphorylated, Bad dissociates, freeing 
Bcl2 to suppress apoptosis. Akt also 
suppresses apoptosis by phosphorylating 
and inactivating gene regulatory proteins 
of the Forkhead family that stimulate the 
transcription of genes encoding proteins 
that promote apoptosis (not shown). 
(C) In Drosophila, some survival factors 
inhibit apoptosis by stimulating the 
phosphorylation of the anti-lAP protein 
Hid. When not phosphorylated, Hid 
promotes cell death by inhibiting lAPs. 
Once phosphorylated, Hid no longer 
inhibits lAPs, which become active and 
block apoptosis. 



1 1 28 Chapter 1 8: Apoptosis 



Summary 

Cells can activate an intracellular death program and kill themselves in a controlled 
way — a process called programmed cell death. In this way, animal cells that are irre- 
versibly damaged, no longer needed, or are a threat to the organism can be eliminated 
quickly and neatly In most cases, these deaths occur by apoptosis: the cells shrink, con- 
dense, and frequently fragment, and neighboring cells or macrophages rapidly phago- 
cytose the cells or fragments before there is any leakage of cytoplasmic contents. Apop- 
tosis depends on proteolytic enzymes called caspases, which cleave specific intracellu- 
lar proteins to help kill the cell Caspases are present in all nucleated animal cells as 
inactive precursors called procaspases. Initiator procaspases are activated when 
brought into proximity in activation complexes: once activated, they cleave and acti- 
vate downstream executioner procaspases, which activate other executioner procas- 
pases (and various other target proteins in the cell), producing an amplifying, irre- 
versible proteolytic cascade. 

Cells use at least two distinct pathways to activate initiator procaspases and trigger 
a caspase cascade leading to apoptosis: the extrinsic pathway is activated by extracellu- 
lar ligands binding to cell-surface death receptors; the intrinsic pathway is activated by 
intracellular signals generated when cells are stressed. Each pathway uses its own ini- 
tiator procaspases, which are activated in distinct activation complexes, called the DISC 
and the apoptosome, respectively. In the extrinsic pathway, the death receptors recruit 
procaspases-8 and 10 via adaptor proteins to form the DISC; in the intrinsic pathway, 
cytochrome c released from the intermembrane space of mitochondria activates Apafl, 
which assembles into an apoptosome and recruits and activates procaspase-9. 

Both extracellular signal proteins and intracellular Bcl2 proteins and IAP proteins 
tightly regulate the apoptotic program to ensure that cells normally kill themselves only 
when it benefits the animal. Both anti-apoptotic and pro-apoptotic Bcl2 proteins regu- 
late the intrinsic pathway by controlling the release of mitochondrial intermembrane 
proteins, while IAP proteins inhibit activated caspases and promote their degradation. 



PROBLEMS 



Which statements are true? Explain why or why not. 

18-1 In normal adult tissues, cell death usually balances 
cell division. 

18-2 Mammalian cells that do not have cytochrome c 
should be resistant to apoptosis induced by UV light. 

Discuss the following problems. 

18-3 One important role of Fas and Fas ligand is to medi- 
ate elimination of tumor cells by killer lymphocytes. In a 
study of 35 primary lung and colon tumors, half the tumors 
were found to have amplified and overexpressed a gene for 
a secreted protein that binds to Fas ligand. How do you 
suppose that overexpression of this protein might con- 
tribute to the survival of these tumor cells? Explain your 
reasoning. 

18-4 Development of the nematode Caenorhabditis ele- 
gans generates exactly 959 somatic cells; it also produces an 
additional 131 cells that are later eliminated by programmed 
cell death. Classical genetic experiments in C. elegans iso- 
lated mutants that identified the first genes involved in 
apoptosis. Of the many mutant genes affecting apoptosis in 
the nematode, none have ever been found in the gene for 



cytochrome c. Why do you suppose that such a central effec- 
tor molecule in apoptosis was not found in the many genetic 
screens for "death" genes that have been carried out in C. 
elegans 7 . 

18-5 Imagine that you could microinject cytochrome c 
into the cytosol of wild-type cells and of cells that were dou- 
bly defective for Bax and Bak. Would you expect one, both, 
or neither type of cell to undergo apoptosis? Explain your 
reasoning. 

1 8-6 In contrast to their similar brain abnormalities, new- 
born mice deficient in Apafl or caspase- 9 have distinctive 
abnormalities in their paws. Apafl -deficient mice fail to 
eliminate the webs between their developing digits, whereas 
caspase-9-deficient mice have normally formed digits (Fig- 
ure Q18-1). If Apafl and caspase-9 function in the same 
apoptotic pathway, how is it possible for these deficient 
mice to differ in web-cell apoptosis? 
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Figure Q18-1 Appearance of paws in ApafV 1 ' and Casp9~ f ~ newborn 
mice relative to normal newborn mice (Problem 1 8-6). (From 
H.Yoshida et al., Cell 94:739-750, 1998. With permission from Elsevier.) 
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18-7 When human cancer (HeLa) cells are exposed to 
UV light at 90 mj/cm 2 , most of the cells undergo apopto- 
sis within 24 hours. Release of cytochrome c from mito- 
chondria can be detected as early as 6 hours after expo- 
sure of a population of cells to UV light, and it continues 
to increase for more than 10 hours thereafter. Does this 
mean that individual cells slowly release their 
cytochrome c over this time period? Or, alternatively, do 
individual cells release their cytochrome c rapidly but 
with different cells being triggered over the longer time 
period? 

To answer this fundamental question, you have fused the 
gene for green fluorescent protein (GFP) to the gene for 
cytochrome c, so that you can observe the behavior of indi- 
vidual cells by confocal fluorescence microscopy. In cells 
that are expressing the cytochrome c-GFP fusion, fluores- 
cence shows the punctate pattern typical of mitochondrial 
proteins. You then irradiate these cells with UV light and 
observe individual cells for changes in the punctate pattern. 
Two such cells (outlined in white) are shown in Figure 
Q18-2A and B. Release of cytochrome c-GFP is detected as 
a change from a punctate to a diffuse pattern of fluores- 
cence. Times after UV exposure are indicated as hours:min- 
utes below the individual panels. 

Which model for cytochrome c release do these observa- 
tions support? Explain your reasoning. 
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Figure Q18-2 Time-lapse video, fluorescence microscopic analysis 
of cytochrome c-GFP release from mitochondria of individual cells 
(Problem 1 8-7). (A) Cells observed for 8 minutes, 1 hours after 
UV irradiation. (B) Cells observed for 6 minutes, 1 7 hours after 
UV irradiation. One cell in (A) and one in (B), each outlined in white, 
have released their cytochrome c-GFP during the time frame of the 
observation, which is shown as hours:minutes below each panel. 
(From J.C. Goldstein et al., Nat. Cell Biol. 2:156-162, 2000. 
With permission from Macmillan Publishers Ltd.) 



REFERENCES 

Adams JM, Huang DC, Strasser A et al (2005) Subversion of the Bcl-2 

life/death switch in cancer development and therapy. Cold Spring 

Harb Symp Quant Biol 70:469-77. 
Boatright KM & Salvesen GS (2003) Mechanisms of caspase activation. 

CurrOpin Cell Biol] 5:725-731 
Danial NN & Korsmeyer SJ (2004) Cell death: critical control points Cell 

116:205-219 
Ellis RE, Yuan JY & Horvitz RA (1 991 ) Mechanisms and functions of cell 

death Annu Rev Ceil Biol 7:663-698. 
Fadok VA & Henson PM (2003) Apoptosis: giving phosphatidylserine 

recognition an assist — with a twist CurrBiol 13:R655-R657. 
Galonek HL & Hardwick JM (2006) Upgrading the BCL-2 network 

Nature Cell fi/o/8:131 7-1319 
Green DR (2005) Apoptotic pathways: ten minutes to dead Cell 

121:671-674 
Horvitz HR (2003) Worms, life, and death (Nobel lecture). Chembiochem 

4:697-711 
Hyun-Eui K, Fenghe D, Fang M & Wang X (2005) Formation of 

apoptosome is initiated by cytochrome c-induced dATP hydrolysis 

and subseguent nucleotide exchange on Apaf-1. Proc Natl Acad Sci 

USA 102:17545-17550. 
Jacobson MD, Weil M & Raff MC (1 997) Programmed cell death in 

animal development Cell 88:347-354, 
Jiang X & Wang X (2004) Cytochrome C-mediated apoptosis. Annu Rev 

Biochem 73:87-106. 
Kerr JF, Wyllie AH & Currie AR (1 972) Apoptosis: a basic biological 

phenomenon with wide-ranging implications in tissue kinetics, 

BJ Cancer 26:239-257 



Kumar S (2007) Caspase function in programmed cell death Cell Death 

Differ] 4:32-43 
Lavrik I, Golks A & Krammer PH (2005) Death receptor signaling J Cell 

Sci 118:265-267 
Lowe SW, Cepero E & Evan G (2004) Intrinsic tumour suppression 

Nature 432:307-31 5, 
Lum JJ, Bauer DE, Kong M et al (2005) Growth factor regulation of 

autophagy and cell survival in the absence of apoptosis. Cell 

120:237-48. 
McCall K & Steller H (1 997) Facing death in the fly: genetic analysis of 

apoptosis in Drosophila, Trends Genet 13:222-226 
Nagata S (1999) Fas ligand-induced apoptosis. Annu Rev Genet 

33:29-55 
Nagata S (2005) DNA degradation in development and programmed 

cell death. Annu Rev Immunol 23:853-875. 
Pop C, Timmer J, Sperandio S & Salvesen GS (2006) The apoptosome 

activates caspase-9 by dimerization, Mol Cell 22:269-275 
Raff MC (1999) Cell suicide for beginners. Nature 396:1 19-122. 
Rathmell JC & Thompson CB (2002) Pathways of apoptosis in 

lymphocyte development, homeostasis, and disease. Cell 

109:S97-107. 
Tittel JN & Steller H (2000) A comparison of programmed ceil death 

between species. Genome Biol 1 
Verhagen AM, Coulson EJ & Vaux DL (2001 ) Inhibitor of apoptosis 

proteins and their relatives: !APs and other BIRPs. Genome Biol 

2:3009.1-300910. 
Vousden KH (2005) Apoptosis. p53 and PUMA: a deadly duo, Science 

309:1685-1686 
Willis SN & Adams JM (2005) Life in the balance: how BH3-only proteins 

induce apoptosis. CurrOpin Cell Biol 1 7:61 7-625 




CELLS IN THEIR SOCIAL 
CONTEXT 



PartV 

Chapters 



Cell Junctions, Cell 

Adhesion, and the 
Extracellular Matrix 

Cancer 



Sexual Reproduction: 
Meiosis, Germ Cells, and 

Fertilization 

Development of 

Multicellular Organisms 

Specialized Tissues, 
Stem Cells, and Tissue 

Renewal 

Pathogens, Infection, 
and Innate Immunity 



The Adaptive Immune 
System 





Chapter 19 



Cell Junctions, Cell Adhesion, 
and the Extracellular Matrix 




Of all the social interactions between cells in a multicellular organism, the most 
fundamental are those that hold the cells together. Cells may cling to one 
another through direct cell-cell junctions, or they may be bound together by 
extracellular materials that they secrete; but by one means or another, they must 
cohere if they are to form an organized multicellular structure. 

The mechanisms of cohesion govern the architecture of the body — its 
shape, its strength, and the arrangement of its different cell typ es - The junctions 
between cells create pathways for communication, allowing the cells to 
exchange the signals that coordinate their behavior and regulate their patterns 
of gene expression. Attachments to other cells and to extracellular matrix con- 
trol the orientation of each cell's internal structure. The making and breaking of 
the attachments and the modeling of the matrix govern the way cells move 
within the organism, guiding them as the body grows, develops, and repairs 
itself. Thus, the apparatus of cell junctions, cell adhesion mechanisms, and 
extracellular matrix is critical for every aspect of the organization, function, and 
dynamics of multicellular structures. Defects in this apparatus underlie an enor- 
mous variety of diseases. 

As examples of structural engineering, large multicellular organisms repre- 
sent a most surprising feat. Cells are small, squishy, and often motile objects, 
filled with an aqueous medium and enclosed in a flimsy plasma membrane; yet 
they can combine in their millions to form a structure as massive, as strong, and 
as stable as a horse or a tree. How is this possible? 

The answer lies in two basic building strategies, corresponding to two ways 
in which stresses can be transmitted across a multicellular structure. One strat- 
egy depends on the strength of the extracellular matrix, a complex network of 
proteins and polysaccharide chains that the cells secrete. The other strategy 
depends on the strength of the cytoskeleton inside the cells and on cell-cell 
adhesions that tie the cytoskeletons of neighboring cells together. In plants, the 
extracellular matrix is all-important: plant tissues owe their strength to the cell 
walls that surround each cell. In animals, both architectural strategies are used, 
but to different extents in different tissues. 

Animal tissues are extraordinarily varied, as we shall see in Chapter 23, but 
most fall into one or other of two broad categories, representing two architec- 
tural extremes (Figure 19-1). In connective tissues, such as bone or tendon, the 
extracellular matrix is plentiful, and cells are sparsely distributed within it. The 
matrix is rich in fibrous polymers, especially collagen, and it is the matrix — 
rather than the cells — that bears most of the mechanical stress to which the tis- 
sue is subjected. Direct attachments between one cell and another are relatively 
rare, but the cells have important attachments to the matrix, allowing them to 
pull on it and to be pulled by it. 

By contrast, in epithelial tissues, such as the lining of the gut or the epider- 
mal covering of the skin, cells are closely bound together into sheets called 
epithelia. The extracellular matrix is scanty, consisting mainly of a thin mat 
called the basal lamina (or basement membrane) , underlying one face of the 
sheet. Within the epithelium, the cells are attached to each other directly by 
cell-cell adhesions, where cytoskeletal filaments are anchored, transmitting 
stresses across the interior of each cell, from adhesion site to adhesion site. 
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Figure 19-1 Two main ways in 
which animal cells are bound 
together. In connective tissue, the 
main stress-bearing component is 
the extracellular matrix. In epithelial 
tissue, it is the cytoskeletons of the 
cells themselves, linked from cell to 
cell by anchoring junctions. 
Cell-matrix attachments bond 
epithelial tissue to the connective 
tissue beneath it. 



collagen 
fibers 



Physical attachment is critical, both in epithelia and in nonepithelial tissues, 
but junctions between cell and cell or between cells and matrix are diverse in 
structure and do more than just transmit physical forces. Four main functions 
can be distinguished, each with a different molecular basis (Figure 19-2 and 
Table 19-1): 

1. Anchoring junctions, including both cell-cell adhesions and cell-matrix 
adhesions, transmit stresses and are tethered to cytoskeletal filaments 
inside the cell. 

2. Occluding junctions seal the gaps between cells in epithelia so as to make 
the cell sheet into an impermeable (or selectively permeable) barrier. 

3. Channel-forming junctions create passageways linking the cytoplasms of 
adjacent cells. 

4. Signal-relaying junctions allow signals to be relayed from cell to cell 
across their plasma membranes at sites of cell-to-cell contact. 

Chemical synapses in the nervous system (discussed in Chapter 11) and 
immunological synapses, where T lymphocytes interact with antigen-presenting 
cells (discussed in Chapter 25), are the most obvious examples of signal-relaying 
junctions, but they are not the only ones. Sites of cell-cell communication via 
transmembrane ligand-receptor pairs such as Delta and Notch, or ephrins and 
Eph receptors, as discussed in Chapter 15, fall under this heading: the cell mem- 
branes must be held in contact with one another for the ligands to activate the 
receptors. Moreover, we shall see that anchoring junctions, occluding junctions, 
and channel-forming junctions, in different ways, all can have important roles 
in signal transmission. 
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Figure 1 9-2 Four functional classes of cell junctions in animal tissues. (A) Anchoring junctions link cell to cell (typically 
via transmembrane cadherin proteins) or cell to matrix (typically via transmembrane integrin proteins). (B) Occluding 
junctions (involving claudin proteins) seal gaps between epithelial cells. (C) Channel-forming junctions (composed of 
connexin or Innexin proteins) form passageways for small molecules and ions to pass from cell to cell. (D) Signal-relaying 
junctions are complex structures, typically involving anchorage proteins alongside proteins mediating signal transduction. 
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Table 19-1 A Functional Classification of Cell Junctions 

ANCHORING JUNCTIONS 

Actin filament attachment sites 

1. cell-cell junctions [adherens junctions) 

2, cell-matrix junctions (actin-Hnked cell-matrix adhesions) 
intermediate filament attachment sites 

1. cell-cell junctions (desmosomes) 

2. cell-matrix junctions (hemicfesmosomes) 

OCCLUDING JUNCTIONS 

1 . tight junctions (in vertebrates) 

2. septate junctions (in invertebrates) 

CHANNEL-FORMING JUNCTIONS 

1. gap junctions (in animals) 

2. plasmodesmata (in plants) 

SIGNAL-RELAYING JUNCTIONS 

1, chemical synapses (in the nervous system) 

2, Immunological synapses (in the immune system) 

3, transmembrane Itgand-receptor ceJI-ceJI signaling contacts {Delta- 
Notch, ephrin-Eph, etc). Anchoring, occluding, and channel-forming 
junctions can all have signaling functions in addition to their 
structural roles 



The first part of this chapter will focus on animal cells and tissues, beginning 
with the cell-cell adhesions, occluding junctions, and channel-forming junc- 
tions that link cell to cell directly. As examples of signal-relaying junctions, we 
shall briefly examine neuronal synapses from the point of view of their adhesion 
mechanisms and assembly. We shall see how the different kinds of junctions 
together organize cells into polarized epithelial sheets. We shall then discuss the 
extracellular matrix in animals and the ways in which the cells interact with it 
through cell-matrix adhesions. Last, we shall turn to plants and the central role 
of the plant cell wall in their construction. 



CADHERINS AND CELL-CELL ADHESION 

The structures of cell-cell adhesions are most clearly seen in mature epithelia 
and in some other tissues, such as heart muscle, that are held together by strong 
direct anchorage of cell to cell. Study of these tissues by electron microscopy 
provided the first general classification of cell junctions. Biochemistry and 
molecular biology have since shown that the different structures seen in the 
electron microscope relate to distinct systems of molecules, important not only 
in adult epithelia but also in other tissues where the junctional specializations 
are not always so plainly visible. 

Figure 19-3 illustrates schematically the types of junctions that the electron 
microscope reveals in a section of mature epithelium and shows how the 
cell-cell adhesions (anchoring junctions) that will concern us in this section are 
distributed in relation to other types of junctions to be discussed later. The dia- 
gram shows the typical arrangement in a simple columnar epithelium such as 
the lining of the small intestine of a vertebrate. Here, a single layer of tall cells all 
stand on a basal lamina, with their uppermost surface, or apex, free and exposed 
to the extracellular medium. On their sides, or lateral surfaces, the cells make 
junctions with one another. Closest to the apex lie occluding junctions (known 
as tight junctions in vertebrates), preventing molecules from leaking across the 
epithelium through gaps between the cells. Below these are two types of cell-cell 
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tight junction seals gap between 
epithelial cells 



adherens junction connects actin 
filament bundle in one cell with 
that in the next cell 



desmosome connects intermediate 
filaments in one cell to those in 
the next cell 



gap junction allows the passage 
of small water-soluble molecules 
from cell to cell 



actin-linked cell-matrix adhesion 
anchors actin filaments in cell 
to extracellular matrix 



hemidesmosome anchors intermediate 
filaments in a cell to extracellular matrix 



adhesions. Adherens junctions are anchorage sites for actin filaments; desmo- 
some junctions are anchorage sites for intermediate filaments. Still lower, often 
mingled with additional desmosome junctions, lie channel-forming junctions, 
called gap junctions. 

Additional sets of adhesions attach the epithelial cells to the basal lamina 
and will be discussed in a later section. We classify these cell-matrix adhesions, 
like the cell-cell adhesions, according to their cytoskeletal connections: actin- 
linked cell-matrix adhesions (indistinct in the small intestine, but prominent 
elsewhere) anchor actin filaments to the matrix, while hemidesmosomes anchor 
intermediate filaments to it. 

At each of the four types of anchoring junctions, the central role is played by 
transmembrane adhesion proteins that span the membrane, with one end 
linking to the cytoskeleton inside the cell and the other end linking to other 
structures outside it (Figure 19-4). These cytoskeleton-linked transmembrane 
molecules fall neatly into two superfamilies, corresponding to the two basic 
kinds of external attachment (Table 19-2). Proteins of the cadherin superfamily 
chiefly mediate attachment of cell to cell. Proteins of the integrin superfamily 
chiefly mediate attachment of cells to matrix. Within each family there is spe- 
cialization: some cadherins link to actin and form adherens junctions, while 
others link to intermediate filaments and form desmosome junctions; likewise, 



Figure 19-3 A summary of the various 
cell junctions found in a vertebrate 
epithelial cell, classified according to 
their primary functions. In the most 
apical portion of the cell, the relative 
positions of the junctions are the same in 
nearly all vertebrate epithelia.The tight 
junction occupies the most apical 
position, followed by the adherens 
junction (adhesion belt) and then by a 
special parallel row of desmosomes; 
together these form a structure called a 
junctional complex. Gap junctions and 
additional desmosomes are less regularly 
organized.The drawing is based on 
epithelial cells of the small intestine. 
Specialized signal-relaying junctions are 
discussed later in the chapter. 
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Figure 19-4 Transmembrane adhesion 
proteins link the cytoskeleton to 
extracellular structures. The external 
linkage may be either to parts of other 
cells (cell-cell anchorage, mediated 
typically by cadherins) or to extracellular 
matrix (cell-matrix anchorage, mediated 
typically by integrins). The internal 
linkage to the cytoskeleton is generally 
indirect, via intracellular anchor proteins, 
to be discussed later. 
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Table 19-2 Anchoring Junctions 



JUNCTION 

1 


TRANSMEMBRANE 
ADHESION PROTEIN 


EXTRACELLULAR INTRACELLULAR 
LIGAND CYTOSKELETAL 
ATTACHMENT 


INTRACELLULAR 

ANCHOR 

PROTEINS 


CefKetl 


adherens junction 
desmosome 


cadherin 
(classical cadherin) 

cadherin (desmoglein, 
desmocollin) 


cadherin in 
neighboring cell 

desmoglein and 
desmocollin in 
neighboring cell 


actin filaments 

intermediate 
filaments 


a-catenin, (3-catenin, 
plakoglobin (y-catenin), 
p120-catenin, vinculin, 
a-actinin 

plakoglobin (y-catenin), 
plakophilin, 
desmoplakin 


Celt-Matrix 










actin-linked cell- 
matrix adhesion 

hemidesmosome 


integrin 

integrin a6(34, type XVII 
collagen (BP1 80) 


extracellular matrix 
proteins 

extracellular matrix 
proteins 


actin filaments 

intermediate 
filaments 


talin, vinculin, a-actinin, 
filamin, paxillin, focal 
adhesion kinase (FAK) 

plectin, dystonin (BP230) 



some integrins link to actin and form actin-linked cell-matrix adhesions, while 
others link to intermediate filaments and form hemidesmosomes. 

There are some exceptions to these rules. Some integrins, for example, 
mediate cell-cell rather than cell-matrix attachment. Moreover, there are other 
types of cell adhesion molecules that can provide attachments more flimsy than 
anchoring junctions, but sufficient to stick cells together in special circum- 
stances. Cell-cell adhesions based on cadherins, however, seem to be the most 
fundamentally important class, and we begin our account of cell-cell adhesion 
with them. <CGAA> 



Cadherins Mediate Ca 2+ -Dependent Cell-Cell Adhesion in All 
Animals 

Cadherins are present in all multicellular animals whose genomes have been 
analyzed, and in one other known group, the choanoflagellates. These creatures 
can exist either as free-living unicellular organisms or as multicellular colonies 
and are thought to be representatives of the group of protists from which all ani- 
mals evolved. Other eucaryotes, including fungi and plants, lack cadherins, and 
they are absent from bacteria and archaea also. Cadherins therefore seem to be 
part of the essence of what it is to be an animal. 

The cadherins take their name from their dependence on Ca 2+ ions: remov- 
ing Ca 2+ from the extracellular medium causes adhesions mediated by cad- 
herins to come adrift. Sometimes, especially for embryonic tissues, this is 
enough to let the cells be easily separated. In other cases, a more severe treat- 
ment is required, combining Ca 2+ removal with exposure to a protease such as 
trypsin. The protease loosens additional connections mediated by extracellular 
matrix and by other cell-cell adhesion molecules that do not depend on Ca 2+ . In 
either case, when the dissociated cells are put back into a normal culture 
medium, they will generally stick together again by reconstructing their adhe- 
sions. 

This type of cell-cell association provided one of the first assays that allowed 
cell-cell adhesion molecules to be identified. In these experiments, monoclonal 
antibodies were raised against the cells of interest, and each antibody was tested 
for its ability to prevent the cells from sticking together again after they had been 
dissociated. Rare antibodies that bound to the cell-cell adhesion molecules 
showed this blocking effect. These antibodies then were used to isolate the 
adhesion molecule that they recognized. 

Virtually all cells in vertebrates, and probably in other animals too, seem to 
express one or more proteins of the cadherin family, according to the cell type. 
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Several lines of evidence indicate that they are the main adhesion molecules hold- 
ing cells together in early embryonic tissues. For example, embryonic tissues in 
culture disintegrate when treated with anti-cadherin antibodies, and if cadherin- 
mediated adhesion is left intact, antibodies against other adhesion molecules 
have little effect. Studies of the early mouse embryo illustrate the role of cadherins 
in development. Up to the eight- cell stage, the mouse embryo cells are only very 
loosely held together, remaining individually more or less spherical; then, rather 
suddenly, in a process called compaction, they become tightly packed together 
and joined by cell-cell junctions, so that the outer surface of the embryo becomes 
smoother (Figure 19-5). Antibodies against a specific cadherin, called E-cadherin, 
block compaction, whereas antibodies that react with various other cell-surface 
molecules on these cells do not. Mutations that inactivate E-cadherin cause the 
embryos to fall apart and die early in development. 



Figure 19-5 Compaction of an early 
mouse embryo. The cells of the early 
embryo at first stick together only 
weakly. At about the eight-cell stage, 
they begin to express E-cadherin and as 
a result become strongly and closely 
adherent to one another. (Scanning 
electron micrographs courtesy of Patricia 
Calarco; 16-32-cell stage is from 
P. Calarco and C.J. Epstein, Dev. Biol. 
32:208-213, 1973. With permission from 
Academic Press.) 



The Cadherin Superfamily in Vertebrates Includes Hundreds of 
Different Proteins, Including Many with Signaling Functions 

The first three cadherins that were discovered were named according to the 
main tissues in which they were found: E-cadherin is present on many types of 
epithelial cells; N-cadherin on nerve, muscle, and lens cells; and P-cadherin on 
cells in the placenta and epidermis. All are also found in various other tissues; N- 
cadherin, for example, is expressed in fibroblasts, and E-cadherin is expressed in 
parts of the brain (Figure 19-6). These and other classical cadherins are closely 
related in sequence throughout their extracellular and intracellular domains. 
While all of them have well-defined adhesive functions, they are also important 
in signaling. Through their intracellular domains, as we shall see later, they relay 
information into the cell interior, enabling the cell to adapt its behavior accord- 
ing to whether it is attached or detached from other cells. 

There are also a large number of nonclassical cadherins more distantly 
related in sequence, with more than 50 expressed in the brain alone. The non- 
classical cadherins include proteins with known adhesive function, such as the 
diverse protocadherins found in the brain, and the desmocollins and desmogleins 
that form desmosome junctions. They also include proteins that appear to be 
primarily involved in signaling, such as T-cadherin, which lacks a transmem- 
brane domain and is attached to the plasma membrane of nerve and muscle 
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Figure 1 9-6 Cadherin diversity in the 
central nervous system. The diagram 
shows the expression patterns of three 
different classical cadherins in the 
embryonic mouse brain. More than 
70 other cadherins, both classical and 
nonclassical, are also expressed in the 
brain, in complex patterns that are 
thought to reflect their roles in guiding 
and maintaining the organization of this 
intricate oraan. 
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Figure 19-7 The cadherin superfamily. 

The diagram shows some of the diversity 
among cadherin superfamily members. 
These proteins all have extracellular 
portions containing multiple copies of 
the cadherin domain motif (green ovals), 
but their intracellular portions are more 
varied, reflecting interactions with a wide 
variety of intracellular ligands, including 
signaling molecules as well as 
components that anchor the cadherin to 
the cytoskeleton. The differently colored 
motifs in Fat, Flamingo, and Ret represent 
conserved domains that are also found in 
other protein families. 



INTRACELLULAR 



EXTRACELLULAR 



cells by a glycosylphosphatidylinositol (GPI) anchor, and the Fat and Flamingo 
proteins, which were first identified as the products of genes in Drosophila that 
regulate, respectively epithelial growth and cell polarity. Together, the classical 
and nonclassical cadherin proteins constitute the cadherin superfamily (Figure 
19-7 and Table 19-3), with more than 180 members in humans. How do the 
structures of these proteins relate to their functions, and why are there so many 
of them? 



Cadherins Mediate Homophilic Adhesion 

Anchoring junctions between cells are usually symmetrical: if the linkage is to 
actin, for example, in the cell on one side of the junction, it will be to actin in the 
cell on the other side also. In fact, the binding between cadherins is generally 
homophilic (like-to-like, Figure 19-8): cadherin molecules of a specific subtype 
on one cell bind to cadherin molecules of the same or closely related subtype on 
adjacent cells. According to a current model, the binding occurs at the N-termi- 
nal tips of the cadherin molecules — the ends that lie furthest from the mem- 
brane. The protein chain here forms a terminal knob and a nearby pocket, and 
the cadherin molecules protruding from opposite cell membranes bind by 
insertion of the knob of each one in the pocket of the other (Figure 19-9A). 

The spacing between the cell membranes at an anchoring junction is pre- 
cisely defined and depends on the structure of the participating cadherin 
molecules. All the members of the superfamily by definition, have an extracel- 
lular portion consisting of several copies of a motif called the cadherin domain. 
In the classical cadherins of vertebrates there are 5 of these repeats, and in 
desmogleins and desmocollins there are 4 or 5, but some nonclassical cadherins 
have more than 30. Each cadherin domain forms a more or less rigid unit, joined 
to the next cadherin domain by a hinge (Figure 19-9B). Ca 2+ ions bind to sites 
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Table 1 9-3 Some Members of the Cadherin Superfamily 



NAME 


MAIN LOCATION 


JUNCTION ASSOCIATION 


PHENOTYPE WHEN INACTIVATED 
IN MICE 


Classical cadherins 


E-cadherin 


many epithelia 


adherens junctions 


death at blastocyst stage; embryos 
fail to undergo compaction 


N-cadherin 


neurons, heart, skeletal 
muscle, lens, and fibroblasts 


adherens junctions and 
chemical synapses 


embryos die from heart defects 


P-cadherin 


placenta, epidermis, breast 
epithelium 


adherens junctions 


abnormal mammary gland development 


VE-cadherin 


endothelial cells 


adherens junctions 


abnormal vascular development 
(apoptosis of endothelial cells) 



Nondassical cadherins 


Desmocollin 


skin 


desmosomes 


blistering of skin 


Desmoglein 


skin 


desmosomes 


blistering skin disease due to loss of 
keratinocyte cell-cell adhesion 


T-cadherin 


neurons, muscle, heart 


none 


unknown 


Cadherin 23 


inner ear, other epithelia 


links between stereocilia in 
sensory hair cells 


deafness 


Fat (in Drosophila) 


epithelia and central nervous 


signal-relaying junction 


enlarged imaginal discs and tumors; 




system 


(planar cell polarity) 


disrupted planar cell polarity 


Fatl (in mammals) 


various epithelia and central 


slit diaphragm in kidney 


loss of slit diaphragm; malformation of 




nervous system 


glomerulus and other cell 
junctions 


forebrain and eye 


a, |3, and y- 


neurons 


chemical synapses and 


neuronal degeneration 


Protocadherins 




nonsynaptic membranes 




Flamingo 


sensory and some other 


cell-cell junctions 


disrupted planar cell polarity; neural tube 




epithelia 




defects 



near each hinge and prevent it from flexing, so that the whole string of cadherin 
domains behaves as a rigid, slightly curved, rod. When Ca 2+ is removed, the 
hinges can flex, and the structure becomes floppy. At the same time, the confor- 
mation at the N terminus is thought to change slightly, weakening the binding 
affinity for the matching cadherin molecule on the opposite cell. Cadherin 
molecules destabilized in this way by loss of Ca 2+ are rapidly degraded by prote- 
olytic enzymes. 

Unlike receptors for soluble signal molecules, which bind their specific lig- 
and with high affinity, cadherins (and most other cell-cell adhesion proteins) 
typically bind to their partners with relatively low affinity. Strong attachments 
result from the formation of many such weak bonds in parallel. When binding to 
oppositely oriented partners on another cell, cadherin molecules are often clus- 
tered side-to-side with many other cadherin molecules on the same cell. Many 
cadherin molecules packed side by side in this way collaborate to form an 
anchoring junction (Figure 19-9C). The strength of this junction is far greater 
than that of any individual intermolecular bond, and yet it can be easily disas- 
sembled by separating the molecules sequentially, just as two pieces of fabric 
can be strongly joined by Velcro and yet easily peeled apart. A similar "Velcro 
principle" also operates at cell-cell and cell-matrix adhesions formed by other 





HOMOPHILIC BINDING 



HETEROPHILIC BINDING 



Figure 19-8 Homophilic versus 
heterophilic binding. Cadherins in 
general bind homophilically; some other 
cell adhesion molecules, discussed later, 
bind heterophilically. 
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Figure 19-9 Cadherin structure and function. (A) The extracellular domain 
of a classical cadherin (C-cadherin) is shown here, illustrating how two such 
molecules on opposite cells are thought to bind homophilically, end-to- 
end. The structure was determined by x-ray diffraction of the crystallized 
C-cadherin extracellular domain. (B) The extracellular part of each 
polypeptide consists of a series of compact domains called cadherin 
repeats, joined by flexible hinge regions. Ca 2+ binds in the neighborhood 
of each hinge, preventing it from flexing. In the absence of Ca 2+ , the 
molecule becomes floppy and adhesion fails. (C) At a typical junction, 
many cadherin molecules are arrayed in parallel, functioning like Velcro to 
hold cells together. Cadherins on the same cell are thought to be coupled 
by side-to-side interactions between their N-terminal head regions, and via 
the attachments of their intracellular tails to a mat of other proteins (not 
shown here). (Based on T.J. Boggon et al., Science 296:1308-1313, 2002. 
With permission from AAAS.) 



types of transmembrane adhesion proteins. The making and breaking of 
anchoring junctions plays a vital part in development and in the constant 
turnover of tissues in many parts of the mature body <CGAA> 
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Selective Cell-Cell Adhesion Enables Dissociated Vertebrate Cells 
to Reassemble into Organized Tissues 

Cadherins form specific homophilic attachments, and this explains why there 
are so many different family members. Cadherins are not like glue, making cell 
surfaces generally sticky. Rather, they mediate highly selective recognition, 
enabling cells of a similar type to stick together and to stay segregated from 
other types of cells. 

This selectivity in the way that animal cells consort with one another was 
demonstrated more than 50 years ago, long before the discovery of cadherins, in 
experiments in which amphibian embryos were dissociated into single cells. 
These cells were then mixed up and allowed to reassociate. Remarkably, the dis- 
sociated cells often reassembled in vitro into structures resembling those of the 
original embryo (Figure 19-10). The same phenomenon occurs when dissoci- 
ated cells from two embryonic vertebrate organs, such as the liver and the retina, 
are mixed together and artificially formed into a pellet: the mixed aggregates 







Figure 19-10 Sorting out. Cells from 
different parts of an early amphibian 
embryo will sort out according to their 
origins. In the classical experiment shown 
here, mesoderm cells {green), neural plate 
cells (blue), and epidermal cells {red) have 
been disaggregated and then 
reaggregated in a random mixture. They 
sort out into an arrangement reminiscent 
of a normal embryo, with a "neural tube" 
internally, epidermis externally and 
mesoderm in between. (Modified from 
P.L. Townes and J. Holtfreter, J. Exp. Zooi. 
1 28:53-1 20, 1 955. With permission from 
Wiley-Liss.) 
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Figure 19-1 1 Selective dispersal and reassembly of cells to form tissues 
in a vertebrate embryo. Some cells that are initially part of the epithelial 
neural tube alter their adhesive properties and disengage from the 
epithelium to form the neural crest on the upper surface of the neural 
tube. The cells then migrate away and form a variety of cell types and 
tissues throughout the embryo. Here they are shown assembling and 
differentiating to form two clusters of nerve cells, called ganglia, in the 
peripheral nervous system. While some of the neural crest cells 
differentiate in the ganglion to become the neurons, others become 
satellite cells (specialized glial supporting cells) wrapped around the 
neurons. Changing patterns of expression of cell adhesion molecules 
underlie all these architectural rearrangements. 

gradually sort out according to their organ of origin. More generally, disaggre- 
gated cells are found to adhere more readily to aggregates of their own organ 
than to aggregates of other organs. Evidently there are cell-cell recognition sys- 
tems that make cells of the same differentiated tissue preferentially adhere to 
one another. 

Such findings suggest that tissue architecture in animals is not just a prod- 
uct of history but is actively organized and maintained by the system of affinities 
that cells have for one another and for the extracellular matrix. In the develop- 
ing embryo, we can indeed watch the cells as they differentiate, and see how 
they move and regroup to form new structures, guided by selective adhesion. 
Some of these movements are subtle, others more far-reaching, involving long- 
range migrations, as we shall describe in Chapter 22. In vertebrate embryos, for 
example, cells from the neural crest break away from the epithelial neural tube, 
of which they are initially a part, and migrate along specific paths to many other 
regions. There they reaggregate with other cells and with one another to form a 
variety of tissues, including those of the peripheral nervous system (Figure 
19-1 1). To find their way, the cells depend on guidance from the embryonic tis- 
sues along the path. This may involve chemotaxis or chemorepulsion, that is, 
movement under the influence of soluble chemicals that attract or repel migrat- 
ing cells. It may also involve contact guidance, in which the migrant cell touches 
other cells or extracellular matrix components, making transient adhesions that 
govern the track taken. Then, once the migrating cell has reached its destination, 
it must recognize and join other cells of the appropriate type to assemble into a 
tissue. In all these processes of sorting out, contact guidance, and tissue assem- 
bly, cadherins play a crucial part. 
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Cadherins Control the Selective Assortment of Cells 

The appearance and disappearance of specific cadherins correlate with steps in 
embryonic development where cells regroup and change their contacts to cre- 
ate new tissue structures. As the neural tube forms and pinches off from the 
overlying ectoderm, for example, neural tube cells lose E-cadherin and acquire 
other cadherins, including N-cadherin, while the cells in the overlying ectoderm 
continue to express E-cadherin (Figure 19-12A, B). Then, when the neural crest 
cells migrate away from the neural tube, these cadherins become scarcely 
detectable, and another cadherin (cadherin-7) appears that helps hold the 
migrating cells together as loosely associated cell groups (Figure 19-12Q. 
Finally when the cells aggregate to form a ganglion, they switch on expression 
of N-cadherin again (see Figure 19-11 J. If N-cadherin is artificially overex- 
pressed in the emerging neural crest cells, the cells fail to escape from the neu- 
ral tube. 

Studies with cultured cells support the suggestion that the homophilic bind- 
ing of cadherins controls these processes of tissue segregation. In a line of cul- 
tured fibroblasts called L cells, for example, cadherins are not expressed and the 
cells do not adhere to one another. When these cells are transfected with DNA 
encoding E-cadherin, however, they become adherent to one another, and the 
adhesion is inhibited by anti-E-cadherin antibodies. Since the transfected cells 
do not stick to untransfected L cells, we can conclude that the attachment 
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depends on E-cadherin on one cell binding to E-cadherin on another. If L cells 
expressing different cadherins are mixed together, they sort out and aggregate 
separately, indicating that different cadherins preferentially bind to their own 
type (Figure 19-13A), mimicking what happens when cells derived from tissues 
that express different cadherins are mixed together. A similar segregation of cells 
occurs if L cells expressing different amounts of the same cadherin are mixed 
together (Figure 19-13B). It therefore seems likely that both qualitative and 
quantitative differences in the expression of cadherins have a role in organizing 
tissues. 



Twist Regulates Epithelial-Mesenchymal Transitions 

The assembly of cells into an epithelium is a reversible process. By switching on 
expression of adhesion molecules, dispersed unattached cells — often called 
mesenchymal cells — can come together to form an epithelium. Conversely, 
epithelial cells can change their character, disassemble, and migrate away from 
their parent epithelium as separate individuals. Such epithelial-mesenchymal 
transitions play an important part in normal embryonic development; the origin 
of the neural crest is one example (see Figure 19-11). A control system involving 
a set of gene regulatory components called Slug, Snail, and Twist, with E-cad- 
herin as a downstream component, seems to be critical for such transitions: in 
several tissues, both in flies and vertebrates, switching on expression of Twist, for 
example, converts epithelial cells to a mesenchymal character, and switching it 
off does the opposite. 

Epithelial-mesenchymal transitions also occur as pathological events dur- 
ing adult life, in cancer. Most cancers originate in epithelia, but become danger- 
ously prone to spread — that is, malignant— only when the cancer cells escape 
from the epithelium of origin and invade other tissues. Experiments with malig- 
nant breast cancer cells in culture show that blocking expression of Twist can 
convert them back toward a nonmalignant character. Conversely, by forcing 
Twist expression, one can make normal epithelial cells undergo an epithe- 
lial-mesenchymal transition and behave like malignant cells. Twist exerts its 
effects, in part at least, by inhibiting expression of the cadherins that hold 
epithelial cells together. E-cadherin, in particular, is a target. Mutations that dis- 
rupt the production or function of E-cadherin are in fact often found in cancer 
cells and are thought to help make them malignant, as we shall discuss in Chap- 
ter 20. 



Figure 19-12 Changing patterns of cadherin expression during 
construction of the nervous system. The figure shows cross-sections of 
the early chick embryo, as the neural tube detaches from the ectoderm 
and then as neural crest cells detach from the neural tube. 
(A,B) Immunofluorescence micrographs showing the developing neural 
tube labeled with antibodies against (A) E-cadherin and (B) N-cadherin. 
(C) As the patterns of gene expression change, the different groups of cells 
segregate from one another according to the cadherins they express. 
(Micrographs courtesy of Kohei Hatta and Masatoshi Takeichi.) 
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Catenins Link Classical Cadherins to the Actin Cytoskeleton 

The extracellular domains of cadherins mediate homophilic binding. The intra- 
cellular domains of typical cadherins, including all classical and some nonclas- 
sical ones, provide anchorage for filaments of the cytoskeleton: anchorage to 
actin at adherens junctions, and to intermediate filaments at desmosome junc- 
tions, as mentioned earlier (see Figure 19-3). The linkage to the cytoskeleton is 
indirect and depends on a cluster of accessory intracellular anchor proteins that 
assemble on the tail of the cadherin. This linkage, connecting the cadherin fam- 
ily member to actin or intermediate filaments, includes several different com- 
ponents (Figure 19-14). These components vary somewhat according to the 
type of anchorage— but in general a central part is played by [1-catenin and/or its 
close relative y-catenin (plakoglobin). 

At adherens junctions, a remote relative of this pair of proteins, pl20- 
catenin, is also present and helps to regulate assembly of the whole complex. 
When pl20-catenin is artificially depleted, cadherin proteins are rapidly 
degraded, and cell-cell adhesion is lost. An artificial increase in the level of pl20- 
catenin has an opposite effect. It is possible that cells use changes in the level of 
pl20-catenin or in its phosphorylation state as one way to regulate their 
strength of adhesion. In any case, it seems that the linkage to actin is essential 
for efficient cell-cell adhesion, as classical cadherins that lack their cytoplasmic 
domain cannot hold cells strongly together. 



Adherens Junctions Coordinate the Actin-Based Motility of 
Adjacent Cells 

Adherens junctions are an essential part of the machinery for modeling the 
shapes of multicellular structures in the animal body. By indirectly linking the 
actin filaments in one cell to those in its neighbors, they enable the cells in the 
tissue to use their actin cytoskeletons in a coordinated way. 

Adherens junctions occur in various forms. In many nonepithelial tissues, 
they appear as small punctate or streaklike attachments that indirectly connect 
the cortical actin filaments beneath the plasma membranes of two interacting 
cells. In heart muscle (discussed in Chapter 23), they anchor the actin bundles 
of the contractile apparatus and act in parallel with desmosome junctions to link 
the contractile cells end-to-end. (The cell-cell interfaces in the muscle where 
these adhesions occur are so substantial that they show up clearly in stained 
light-microscope sections as so-called intercalated discs.) But the prototypical 
examples of adherens junctions occur in epithelia, where they often form a con- 
tinuous adhesion belt (or zonula adherens) close beneath the apical face of the 
epithelium, encircling each of the interacting cells in the sheet (Figure 19-15). 
Within each cell, a contractile bundle of actin filaments lies adjacent to the 
adhesion belt, oriented parallel to the plasma membrane and tethered to it by 
the cadherins and their associated intracellular anchor proteins. The actin bun- 
dles are thus linked, via the cadherins and anchor proteins, into an extensive 
transcellular network. This network can contract with the help of myosin motor 
proteins (discussed in Chapter 16), providing the motile force for a fundamental 
process in animal morphogenesis— the folding of epithelial cell sheets into 
tubes, vesicles, and other related structures (Figure 19-16). 



Figure 1 9-14 The linkage of classical cadherins to actin filaments. The 

cadherins are coupled indirectly to actin filaments via p-catenin and other 
anchor proteins. a-Catenin, vinculin, and plakoglobin (a relative of 
P-catenin, also called y-catenin) are probably also present in the linkage or 
involved in control of its assembly, but the details of the anchorage are not 
well understood. Another intracellular protein, called p120-catenin, also 
binds to the cadherin cytoplasmic tail and regulates cadherin function. 
P-Catenin has a second, and very important, function in intracellular 
signaling, as we discuss in Chapter 1 5 (see Figure 1 5-77). 
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Figure 19-13 Cadherin-dependent cell 
sorting. Cells in culture can sort 
themselves out according to the type and 
level of cadherins they express. This can 
be visualized by labeling different 
populations of cells with dyes of different 
colors. (A) Cells expressing N-cadherin 
sort out from cells expressing E-cadherin. 
(B) Cells expressing high levels of 
E-cadherin sort out from cells expressing 
low levels of E-cadherin. 




pl20-tatenin 
p-catenin 



other anchor 
proteins 




artfn 
filament 






CADHERINS AND CELL-CELL ADHESION 



1143 



actin filaments 
inside microvillus 



LUMEM 



microvilli extending 
from apical surface 




tight junction 

bundle of 
actin filaments 



lateral plasma 
membranes of 
adjacent 
epithelial cells 



Figure 19-15 Adherens junctions 
between epithelial cells in the small 
intestine. These cells are specialized for 
absorption of nutrients; at their apex, 
facing the lumen of the gut, they have 
many microvilli (protrusions that serve to 
increase the absorptive surface area). The 
adherens junction takes the form of an 
adhesion belt, encircling each of the 
interacting cells. Its most obvious feature 
is a contractile bundle of actin filaments 
running along the cytoplasmic surface of 
the junctional plasma membrane. The 
actin filament bundles are tethered by 
intracellular anchor proteins to cadherins. 
The cadherins span the plasma 
membrane, and their extracellular 
domains bind homophilically to those of 
the cadherins on the adjacent cell. In this 
way, the actin filament bundles in 
adjacent cells are tied together. 



*- basal surface 



Desmosome Junctions Give Epithelia Mechanical Strength 

Desmosome junctions are structurally similar to adherens junctions but link to 
intermediate filaments instead of actin. Their main function is to provide 
mechanical strength. Desmosome junctions are important in vertebrates but 
are not found, for example, in Drosophila. They are present in most mature ver- 
tebrate epithelia, and are extremely plentiful in the epidermis, the epithelium 
that forms the outer layer of the skin; a favorite source for biochemical studies is 
the epidermis of the snout of cows, which has to withstand constant battering as 
the animal grazes. 

Figure 19-17A shows the general structure of a desmosome, and Figure 
19-17B shows some of the proteins that form it. Desmosomes typically appear 
as buttonlike spots of intercellular adhesion, riveting the cells together (Figure 
19-17C). Inside the cell, the bundles of ropelike intermediate filaments that are 
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Figure 19-16 The folding of an 
epithelial sheet to form an epithelial 
tube. The oriented contraction of the 
bundles of actin filaments running along 
adhesion belts causes the epithelial cells 
to narrow at their apex and helps the 
epithelial sheet to roll up into a tube. An 
example is the formation of the neural 
tube in early vertebrate development 
(see Figure 1 9-1 2 and Chapter 22). 
Although not shown here, 
rearrangements of the cells within the 
epithelial sheet are also thought to have 
an important role in the process. 
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Figure 19-17 Desmosomes. (A) The structural components of a desmosome. On the cytoplasmic surface of 
each interacting plasma membrane is a dense plaque composed of a mixture of intracellular anchor 
proteins. A bundle of keratin intermediate filaments is attached to the surface of each plaque. 
Transmembrane adhesion proteins of the cadherin family bind to the plaques and interact through their 
extracellular domains to hold the adjacent membranes together by a Ca 2+ -dependent mechanism. 
(B) Some of the molecular components of a desmosome. Desmoglein and desmocollin are members of the 
cadherin family of adhesion proteins. Their cytoplasmic tails bind plakoglobin (y-catenin) and plakophilin 
(a distant relative of p120-catenin), which in turn bind to desmoplakin. Desmoplakin binds to the sides of 
intermediate filaments, thereby tying the desmosome to these filaments. (C) An electron micrograph of 
desmosome junctions between epidermal cells in the skin of a baby mouse. (D) Part of the same tissue at 
higher magnification, showing a single desmosome, with intermediate filaments attached to it. (C and D, 
from W. He, P. Cowin and D.L. Stokes, Science 302:1 09-1 1 3, 2003. With permission from AAAS.) 



anchored to the desmosomes form a structural framework of great tensile 
strength (Figure 19-17D), with linkage to similar bundles in adjacent cells, cre- 
ating a network that extends throughout the tissue (Figure 19-18). The particu- 
lar type of intermediate filaments attached to the desmosomes depends on the 
cell type: they are keratin filaments in most epithelial cells, for example, and 
desmin filaments in heart muscle cells. 

The importance of desmosome junctions is demonstrated by some forms of 
the potentially fatal skin disease pemphigus. Affected individuals make antibod- 
ies against one of their own desmosomal cadherin proteins. These antibodies 
bind to and disrupt the desmosomes that hold their epidermal cells (ker- 
atinocytes) together. This results in a severe blistering of the skin, with leakage 
of body fluids into the loosened epithelium. 



Figure 19-18 Desmosomes, hemidesmosomes, and the intermediate 
filament network. The keratin intermediate filament networks of adjacent 
cells— in this example, epithelial cells of the small intestine— are indirectly 
connected to one another through desmosomes, and to the basal lamina 
through hemidesmosomes. 
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Cell-Cell Junctions Send Signals into the Cell Interior 

The making and breaking of attachments are important events in the lives of 
cells and provoke large changes in their internal affairs. Conversely, changes in 
the internal state of a cell must be able to trigger the making or breaking of 
attachments. Thus there is a complex cross-talk between the adhesion machin- 
ery and chemical signaling pathways. We have described, for example, how 
changes in pl20-catenin may regulate the formation of adherens junctions, and 
several intracellular signaling pathways can control junction formation by phos- 
phorylating this and other junctional proteins. Later, we shall discuss how the 
making and breaking of adhesions can send signals into the cell interior through 
mechanisms involving scaffold proteins on the intracellular side of the junction. 

Another of the central players in the two-way interaction between adhesion 
and signaling is thought to be (J-catenin. In this chapter, we have mentioned it 
as an essential intracellular anchor protein at adherens junctions, linking cad- 
herins to actin filaments. In Chapter 15, we encountered it in another guise, as a 
component of the Wnt cell-cell signaling pathway, moving from the cytoplasm 
to the nucleus to activate the transcription of target genes. Separate parts of the 
molecule are responsible for the adhesive and gene-regulatory functions, but an 
individual molecule cannot do both things at once. Disintegration of an 
adherens junction can set (3-catenin molecules free to move from the cell surface 
into the nucleus as signaling molecules, and, conversely, the activities of com- 
ponents of the Wnt signaling pathway (which regulate the phosphorylation and 
degradation of p-catenin) may control the availability of (3-catenin to form 
adherens junctions. 

Some nonclassical cadherins transmit signals into the cell interior in yet 
other ways. Members of the Flamingo subfamily, for example, have a seven-pass 
transmembrane domain suggesting that they might function as G-protein-cou- 
pled receptors. Vascular endothelial cadherin (VE-cadherin) provides another 
example. This protein not only mediates adhesion between endothelial cells but 
also is required for endothelial cell survival. Although endothelial cells that do 
not express VE-cadherin still adhere to one another via N-cadherin, they fail to 
survive, because they are unable to respond to an extracellular protein called 
vascular endothelial growth factor (VEGF) that acts as a survival signal. VEGF 
binds to a receptor tyrosine kinase (discussed in Chapter 15) that requires VE- 
cadherin as a co-receptor. 

Selectins Mediate Transient Cell-Cell Adhesions in the 
Bloodstream 

The cadherin superfamily is central to cell-cell adhesion in animals, but at least 
three other superfamilies of cell-cell adhesion proteins are also important: the 
integrins, the selectins, and the adhesive immunoglobulin {Ig)-superfamily mem- 
bers. We shall discuss integrins in more detail later: their main function is in 
cell-matrix adhesion, but a few of them mediate cell-cell adhesion in specialized 
circumstances. Ca 2+ dependence provides one simple way to distinguish among 
these classes of proteins experimentally. Selectins, like cadherins and integrins, 
require Ca 2+ for their adhesive function; Ig- superfamily members do not. 

Selectins are cell-surface carbohydrate-binding proteins (lectins) that medi- 
ate a variety of transient, cell-cell adhesion interactions in the bloodstream. 
Their main role, in vertebrates at least, is in inflammatory responses and in gov- 
erning the traffic of white blood cells. White blood cells lead a nomadic life, rov- 
ing between the bloodstream and the tissues, and this necessitates special adhe- 
sive behavior. The selectins control the binding of white blood cells to the 
endothelial cells lining blood vessels, thereby enabling the blood cells to migrate 
out of the bloodstream into a tissue. 

Each selectin is a transmembrane protein with a conserved lectin domain 
that binds to a specific oligosaccharide on another cell (Figure 19-19A) . There are 
at least three types: L-selectin on white blood cells, P-selectin on blood platelets 
and on endothelial cells that have been locally activated by an inflammatory 
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response, and E-selectin on activated endothelial cells. In a lymphoid organ, 
such as a lymph node or a tonsil, the endothelial cells express oligosaccharides 
that are recognized by L-selectin on lymphocytes, causing the lymphocytes to 
loiter and become trapped. At sites of inflammation, the roles are reversed: the 
endothelial cells switch on expression of selectins that recognize the oligosac- 
charides on white blood cells and platelets, flagging the cells down to help deal 
with the local emergency. Selectins do not act alone, however; they collaborate 
with integrins, which strengthen the binding of the blood cells to the endothe- 
lium. The cell-cell adhesions mediated by both selectins and integrins are het- 
erophils— that is, the binding is to a molecule of a different type: selectins bind 
to specific oligosaccharides on glycoproteins and glycolipids, while integrins 
bind to other specific proteins. 

Selectins and integrins act in sequence to let white blood cells leave the 
bloodstream and enter tissues (Figure 19-19B). The selectins mediate a weak 
adhesion because the binding of the lectin domain of the selectin to its carbo- 
hydrate ligand is of low affinity. This allows the white blood cell to adhere weakly 
and reversibly to the endothelium, rolling along the surface of the blood vessel, 
propelled by the flow of blood. The rolling continues until the blood cell acti- 
vates its integrins. As we discuss later, these transmembrane molecules can be 
switched into an adhesive conformation that enables them to latch onto other 
molecules external to the cell— in the present case, proteins on the surfaces of 
the endothelial cells. Once it has attached in this way, the white blood cell 
escapes from the blood stream into the tissue by crawling out of the blood ves- 
sel between adjacent endothelial cells. 



Figure 19-1 9 The structure and function of 
selectins. (A) The structure of P-selectin. The 
selectin attaches to the actin cytoskeleton 
through anchor proteins that are still poorly 
characterized. (B) How selectins and 
integrins mediate the cell-cell adhesions 
required for a white blood cell to migrate 
out of the bloodstream into a tissue. First, 
selectins on endothelial cells bind to 
oligosaccharides on the white blood cell, so 
that it becomes loosely attached to the 
vessel wall. Then the white blood cell 
activates an integrin (usually one called 
LFA1) in its plasma membrane, enabling this 
integrin to bind to a protein called ICAM1, 
belonging to the immunoglobulin 
superfamily, in the membrane of the 
endothelial cell. This creates a stronger 
attachment that allows the white blood cell 
to crawl out of the vessel. 



Members of the Immunoglobulin Superfamily of Proteins 
Mediate Ca 2+ -lndependent Cell-Cell Adhesion 

The chief endothelial cell proteins that are recognized by the white blood cell 
integrins are called ICAMs (intercellular cell adhesion molecules) or VCAMs (vas- 
cular celt adhesion moIecidrs}, They are members of another large and ancient 
family of cell surface molecules— the immunoglobulin (Ig) superfamily. These 
contain one or more of the extracellular Tg-like domains chat are characteristic 
nl antibody molecules (discussed in Chapter 25). They have many functions out- 
side the immune system that are unrelated to immune defenses. 

While ICAMs and VCAMs on endothelial cells both mediate heterophil^ 
binding to integrins, many other Ig superfamily members appear to mediate 
hemophilic binding. An example is the neural cell adhesion molecule (NCAM), 
which is expressed by various cell types, including most nerve cells, and can take 
different forms, generated by alternative splicing of an RNA transcript produced 
from a single gene (Kgure 19-20), Some forms of NCAM carry an unusually 
large quantity of sialic acid (with chains containing hundreds of repeating sialic 
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Figure 1 9-20 Two members of the Ig superfamily of cell-cell adhesion 
molecules. NCAM is expressed on neurons and many other cell types, and 
mediates homophilic binding. Only the protein backbone of NCAM is 
shown here; this often has side chains of sialic acid (a polysaccharide) 
covalently attached to it, hindering adhesion. ICAM is expressed on 
endothelial cells and some other cell types and binds heterophilically to an 
integrin on white blood cells. 



acid units). By virtue of their negative charge, the long polysialic acid chains can 
interfere with cell adhesion (because like charges repel one another); NCAM 
heavily loaded with sialic acid may even serve to inhibit adhesion, rather than 
cause it. 

A cell of a given type generally uses an assortment of different adhesion pro- 
teins to interact with other cells, just as each cell uses an assortment of different 
receptors to respond to the many soluble extracellular signal molecules, such as 
hormones and growth factors, in its environment. Although cadherins and Ig 
family members are frequently expressed on the same cells, the adhesions medi- 
ated by cadherins are much stronger, and they are largely responsible for holding 
cells together, segregating cell collectives into discrete tissues, and maintaining 
tissue integrity. Molecules such as NCAM seem to contribute more to the fine- 
tuning of these adhesive interactions during development and regeneration, 
playing a part in various specialized adhesive phenomena, such as that discussed 
for blood and endothelial cells. Thus, while mutant mice that lack N-cadherin die 
early in development, those that lack NCAM develop relatively normally but 
show some mild abnormalities in the development of certain specific tissues, 
including parts of the nervous system. 
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Many Types of Cell Adhesion Molecules Act in Parallel to Create a 
Synapse 

Cells of the nervous system, especially, rely on complex systems of adhesion 
molecules, as well as chemotaxis and soluble signal factors, to guide axon out- 
growth along precise pathways and to direct the formation of specific nerve con- 
nections (discussed in Chapter 22). Adhesion proteins of the Ig superfamily 
along with many other classes of adhesion and signaling molecules, have impor- 
tant roles in these processes. Thus, for example, in flies with a mutation of Fas- 
ciclin2, related to NCAM, some axons follow aberrant pathways and fail to reach 
their proper targets. 

Another member of the Ig superfamily, Fasciclin3 y enables the neuronal 
growth cones to recognize their proper targets when they meet them. This pro- 
tein is expressed transiently on some motor neurons in Drosophila, as well as on 
the muscle cells they normally innervate. If Fasciclin3 is genetically removed 
from these motor neurons, they fail to recognize their muscle targets and do not 
make synapses with them. Conversely, if motor neurons that normally do not 
express Fasciclin3 are made to express this protein, they will synapse with Fasci- 
clin3-expressing muscle cells to which they normally do not connect. It seems 
that Fasciclin3 mediates these synaptic connections by a homophilic "match- 
making" mechanism. Ig superfamily proteins have similar roles in vertebrates. 
Proteins of the Sidekicks subfamily for example, mediate homophilic adhesion, 
and different Sidekicks proteins are expressed in different layers of the retina, 
with synapses forming between sets of retinal neurons that share expression of 
the same family member. When the pattern of expression of the proteins is arti- 
ficially altered, the pattern of synaptic connections changes accordingly. 

These Ig superfamily members are by no means the only adhesion 
molecules involved in initiating synapse formation. Misexpression of certain 
other synaptic adhesion proteins, unrelated to any of the types we have men- 
tioned so far, can even trick growth cones into synapsing on non-neuronal cells 
that would never normally be innervated. Thus, if non-neuronal cells are forced 
to express neuroligin, a transmembrane protein evolutionary related to the 
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enzyme acetylcholinesterase, neurons will synapse on them, as a consequence 
of binding of neuroligin to a protein called neurexin in the membrane of the 
presynaptic neuron. 



Scaffold Proteins Organize Junctional Complexes 

To make a synapse, the pre- and postsynaptic cells have to do more than recog- 
nize one another and adhere: they have to assemble a complex system of signal 
receptors, ion channels, synaptic vesicles, docking proteins, and other compo- 
nents, as described in Chapter 11. This apparatus for synaptic signaling could 
not exist without cell adhesion molecules to join the pre- and postsynaptic 
membranes firmly together and to help hold all the components of the signaling 
machinery in their proper positions. Thus, cadherins are generally present, con- 
centrated at spots around the periphery of the synapse and within it, as well as 
Ig superfamily members and various other types of adhesion molecules. In fact, 
about 20 different classical cadherins are expressed in the vertebrate nervous 
system, in different combinations in different subsets of neurons, and it is likely 
that selective binding of these molecules also plays a part in ensuring that neu- 
rons synapse with their correct partners. 

But how does the array of adhesion molecules recruit the other components 
of the synapse and hold them in place? Scaffold proteins are thought to have a 
central role here. These intracellular molecules consist of strings of protein- 
binding domains, typically including several PDZ domains— segments about 70 
amino acids long that can recognize and bind the C-terminal intracellular tails 
of specific transmembrane molecules (Figure 19-21). One domain of a scaffold 
protein may attach to a cell-cell adhesion protein, for example, while another 
latches onto a ligand-gated ion channel, and yet another binds a protein that 
regulates exocytosis or endocytosis or provides attachment to the cytoskeleton. 
Moreover, one molecule of scaffold protein can bind to another. In this way, the 
cell can assemble a mat of proteins, with all the components that are needed at 
the synapse woven into its fabric (Figure 19-22). Several hundred different types 
of proteins participate in this complex structure. Mutations in synaptic scaffold 
proteins alter the size and structure of synapses and can have severe conse- 
quences for the function of the nervous system. Among other things, such muta- 
tions can damage the molecular machinery underlying learning and memory, 
which depend on the ability of electrical activity to leave a long-lasting trace in 
the form of alterations of synaptic architecture. 

The scaffold proteins, with their many potential binding partners, are 
involved in organizing other structures and functions beside synapses and 
synaptic signaling. The Discs large (Dig) protein of Drosoph ila is an example (see 
Figure 19-21). Dig is needed for the construction of normal synapses; but we 
shall see that it, along with a set of other related scaffold proteins, also plays an 



voltage -gated 

JC channel 



glutamate receptors 
(NMDA receptors) 




other scaffold and 
scaffold-associated proteins 



13 domain PDZ domains 

- Dig4 (PSD95) scaffold proteins 



Figure 19-21 A scaffold protein. The 

diagram shows the domain structure of 
Dlg4, a mammalian homolog of the 
Drosophiia protein Discs-large, along with 
some of its binding partners. Dlg4 is 
concentrated beneath the postsynaptic 
membrane at synapses, and is also 
known as postsynaptic density protein 
95, or PSD95. With its multiple protein- 
binding domains, it can link together 
different components of the synapse. 
One molecule of Dlg4 can also bind to 
another or to scaffolding molecules of 
other types, thereby creating an 
extensive framework that holds together 
all the components of the synapse. 
Scaffold proteins also have important 
roles at other types of cell junctions. 
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essential part in almost every aspect of the organization of epithelia, including 
the formation of occluding junctions between the cells, the control of cell polar- 
ity, and even the control of cell proliferation. All these processes have a shared 
dependence on the same machinery, not only in flies, but also in vertebrates. 



Summary 

In epithelia, as well as in some other types of tissue, cells are directly attached to one 
another through strong cell-cell adhesions, mediated by transmembrane proteins that 
are anchored intracellularly to the cytoskeleton. At adherens junctions, the anchorage 
is to actin filaments; at desmosome junctions, it is to intermediate filaments. In both 
these structures, and in many less conspicuous cell-cell junctions, the adhesive trans- 
membrane proteins are members of the cadherin superfamily Cadherins generally 
bind to one another homophilically: the head of one cadherin molecule binds to the 
head of a similar cadherin on an opposite cell This selectivity enables mixed popula- 
tions of cells of different types to sort out from one another according to the specific 
cadherins they express, and it helps to control cell rearrangements during develop- 
ment, where many different cadherins are expressed in complex, changing patterns. 
Changes in cadherin expression can cause cells to undergo transitions between a cohe- 
sive epithelial state and a detached mesenchymal state — a phenomenon important in 
cancer as well as in embryonic development. 

The "classical" cadherins are linked to the actin cytoskeleton by intracellular pro- 
teins called catenins. These form an anchoring complex on the intracellular tail of the 



Figure 19-22 Organization of a synapse. 

(A) Electron micrograph and (B) line 
drawing of a cross section of two nerve 
terminals synapsing on a dendrite in the 
mammalian brain. Note the synaptic 
vesicles in the two nerve terminals and 
the dark-staining material associated 
with the pre- and postsynaptic 
membranes. (C) Schematic diagram 
showing some of the synaptic 
components that are assembled at these 
sites. Cell-cell adhesion molecules, 
including cadherins and neuroligins and 
neurexins, hold the pre- and postsynaptic 
membranes together. Scaffolding 
proteins help to form a mat 
(corresponding to the dark-staining 
material seen in (A)) that links the 
adhesion molecules by their intracellular 
tails to the components of the synaptic 
signal-transmission machinery such as 
ion channels and neurotransmitter 
receptors. The structure of this large, 
complex multiprotein assembly is not 
yet known in detail. It includes anchorage 
sites for hundreds of additional 
components, not shown here, including 
cytoskeletal molecules and various 
regulatory kinases and phosphatases. 
(A, courtesy of Cedric Raine.) 



1 1 50 Chapter 1 9: Cell Junctions, Cell Adhesion, and the Extracellular Matrix 



cadherin molecule, and are involved not only in physical anchorage but also in the 
genesis of intracellular signals. Conversely intracellular signals can regulate the for- 
mation of cadherin- mediated adhesions. p-Catenin, for example, is also a key compo- 
nent of the Wnt cell signaling pathway. 

In addition to cadherins, at least three other classes of transmembrane molecules 
are also important mediators of cell-cell adhesion: selectins, immunoglobulin (Ig) 
superfamily members, and integrins. Selectins are expressed on white blood cells, 
blood platelets, and endothelial cells, and they bind heterophilically to carbohydrate 
groups on cell surfaces. They help to trap circulating white blood cells at sites of 
inflammation. Ig- superfamily proteins also play a part in this trapping, as well as in 
many other adhesive processes; some of them bind homophilically some heterophili- 
cally. Integrins, though they mainly serve to attach cells to the extracellular matrix, can 
also mediate cell-cell adhesion by binding to the Ig- superfamily members. 

Many different Ig- superfamily members, cadherins, and other cell-cell adhesion 
molecules guide the formation of nerve connections and hold neuronal membranes 
together at synapses. In these complicated structures, as well as at other types of 
cell-cell junctions, intracellular scaffold proteins containing multiple PDZ protein- 
binding domains have an important role in holding the many different adhesive and 
signaling molecules in their proper arrangements. 



TIGHT JUNCTIONS AND THE ORGANIZATION OF 
EPITHELIA 

An epithelial sheet, with its cells joined side by side and standing on a basal lam- 
ina, may seem a specialized type of structure, but it is central to the construction 
of multicellular animals. In fact, more than 60% of the cell types in the vertebrate 
body are epithelial. Just as cell membranes enclose and partition the interior of 
the eucaryotic cell, so epithelia enclose and partition the animal body, lining all 
its surfaces and cavities, and creating internal compartments where specialized 
processes occur. The epithelial sheet seems to be one of the inventions that lie 
at the origin of animal evolution, diversifying in a huge variety of ways (as we see 
in Chapter 23), but retaining an organization based on a set of conserved molec- 
ular mechanisms that practically all epithelia have in common. 

Essentially all epithelia are anchored to other tissue on one side— the basal 
side— and free of such attachment on their opposite side— the apical side. A 
basal lamina lies at the interface with the underlying tissue, mediating the 
attachment, while the apical surface of the epithelium is generally bathed by 
extracellular fluid (but sometimes covered by material that the cells have 
secreted at their apices). Thus all epithelia are structurally polarized, and so are 
their individual cells: the basal end of a cell, adherent to the basal lamina below, 
differs from the apical end, exposed to the medium above. 

Correspondingly, all epithelia have at least one function in common: they 
serve as selective permeability barriers, separating the fluid that permeates the 
tissue on their basal side from fluid with a different chemical composition on 
their apical side. This barrier function requires that the adjacent cells be sealed 
together by occluding junctions, so that molecules cannot leak freely across the 
cell sheet. In this section we consider how the occluding junctions are formed, 
and how the polarized architecture of the epithelium is maintained. These two 
fundamental aspects of epithelia are closely linked: the junctions play a key part 
in organizing and maintaining the polarity of the cells in the sheet. 



Tight Junctions Form a Seal Between Cells and a Fence Between 
Membrane Domains 

The occluding junctions found in vertebrate epithelia are called tight junctions. 
The epithelium of the small intestine provides a good illustration of their struc- 
ture and function (see Figure 19-3). This epithelium has a simple columnar 
structure; that is, it consists of a single layer of tall (columnar) cells. These are of 
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several differentiated types, but the majority are absorptive cells, specialized for 
uptake of nutrients from the internal cavity, or lumen, of the gut. 

The absorptive cells have to transport selected nutrients across the epithe- 
lium from the lumen into the extracellular fluid that permeates the connective 
tissue on the other side. From there, these nutrients diffuse into small blood ves- 
sels to provide nourishment to the organism. This transcellular transport 
depends on two sets of transport proteins in the plasma membrane of the 
absorptive cell. One set is confined to the apical surface of the cell (facing the 
lumen) and actively transports selected molecules into the cell from the gut. The 
other set is confined to the basolateral (basal and lateral) surfaces of the cell, and 
it allows the same molecules to leave the cell by facilitated diffusion into the 
extracellular fluid on the other side of the epithelium. For this transport activity 
to be effective, the spaces between the epithelial cells must be tightly sealed, so 
that the transported molecules cannot leak back into the gut lumen through 
these spaces (Figure 19-23). Moreover, the proteins that form the pumps and 
channels must be correctly distributed in the cell membranes: the apical set of 
active transport proteins must be delivered to the cell apex (as discussed in 
Chapter 13) and must not be allowed to drift to the basolateral surface, and the 
basolateral set of channel proteins must be delivered to the basolateral surface 
and must not be allowed to drift to the apical surface. The tight junctions 
between epithelial cells, besides sealing the gaps between the cells, may also 
function as "fences" helping to separate domains within the plasma membrane 
of each cell, so as to hinder apical proteins (and lipids) from diffusing into the 
basal region, and vice versa (see Figure 19-23). 

The sealing function of tight junctions is easy to demonstrate experimentally: 
a low-molecular- weight tracer added to one side of an epithelium will generally 
not pass beyond the tight junction (Figure 19-24). This seal is not absolute, how- 
ever. Although all tight junctions are impermeable to macromolecules, their per- 
meability to small molecules varies. Tight junctions in the epithelium lining the 
small intestine, for example, are 10,000 times more permeable to inorganic ions, 
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Figure 1 9-23 The role of tight junctions 
in transcellular transport. Transport 
proteins are confined to different regions 
of the plasma membrane in epithelial cells 
of the small intestine. This segregation 
permits a vectorial transfer of nutrients 
across the epithelium from the gut lumen 
to the blood. In the example shown, 
glucose is actively transported into the cell 
by Na + -driven glucose symports at its 
apical surface, and it diffuses out of the 
cell by facilitated diffusion mediated by 
glucose carriers in its basolateral 
membrane. Tight junctions are thought to 
confine the transport proteins to their 
appropriate membrane domains by acting 
as diffusion barriers or "fences" within the 
lipid bilayer of the plasma membrane; 
these junctions also block the backflow of 
glucose from the basal side of the 
epithelium into the gut lumen. 
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Figure 19-24 The role of tight junctions 
in allowing epithelia to serve as barriers 
to solute diffusion. (A) The drawing shows 
how a small extracellular tracer molecule 
added on one side of an epithelium is 
prevented from crossing the epithelium by 
the tight junctions that seal adjacent cells 
together. (B) Electron micrographs of cells 
in an epithelium in which a small, 
extracellular, electron-dense tracer 
molecule has been added to either the 
apical side (on the left) or the basolateral 
side (on the right). In both cases, the tight 
junction blocks passage of the tracer. 
(B, courtesy of Daniel Friend.) 
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such as Na + , than the tight junctions in the epithelium lining the urinary bladder. 
These differences reflect differences in the proteins that form the junctions. 

Epithelial cells can also alter their tight junctions transiently to permit an 
increased flow of solutes and water through breaches in the junctional barriers. 
Such paracellular transport is especially important in the absorption of amino 
acids and monosaccharides from the lumen of the intestine, where the concen- 
tration of these nutrients can increase enough after a meal to drive passive 
transport in the proper direction. 

When tight junctions are visualized by freeze-fracture electron microscopy, 
they seem to consist of a branching network of sealing strands that completely 
encircles the apical end of each cell in the epithelial sheet (Figure 19-25A and B). 
In conventional electron micrographs, the outer leaflets of the two interacting 
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Figure 19-25 The structure of a tight junction between epithelial cells of the small intestine. The junctions are shown (A) schematically, 
(B) in a freeze-fracture electron micrograph, and (C) in a conventional electron micrograph. In (B), the plane of the micrograph is parallel to the 
plane of the membrane, and the tight junction appears as a band of branching sealing strands that encircle each cell in the epithelium. The 
sealing strands are seen as ridges of intramembrane particles on the cytoplasmic fracture face of the membrane (the P face) or as 
complementary grooves on the external face of the membrane (the E face) (see Figure 19-26A). In (C), the junction is seen in cross section as a 
series of focal connections between the outer leaflets of the two interacting plasma membranes, each connection corresponding to a sealing 
strand in cross section. (B and C, from N.B. Gilula, in Cell Communication [R.P. Cox, ed.], pp. 1 -29. New York: Wiley, 1 974.) 
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plasma membranes are seen to be tightly apposed where sealing strands are pre- 
sent (Figure 19-25C). Each tight junction sealing strand is composed of a long 
row of transmembrane adhesion proteins embedded in each of the two inter- 
acting plasma membranes. The extracellular domains of these proteins adhere 
directly to one another to occlude the intercellular space (Figure 19-26). 

The main transmembrane proteins forming these strands are the claudins, 
which are essential for tight junction formation and function. Mice that lack the 
claudin-1 gene, for example, fail to make tight junctions between the cells in 
the epidermal layer of the skin; as a result, the baby mice lose water rapidly by 
evaporation through the skin and die within a day after birth. Conversely, if 
nonepithelial cells such as fibroblasts are artificially caused to express claudin 
genes, they will form tight -junctional connections with one another. Normal 
tight junctions also contain a second major transmembrane protein called 
occludin, but the function of this protein is uncertain, and it does not seem to 
be as essential as the claudins. A third transmembrane protein, tricellulin 
(related to occludin), is required to seal cell membranes together and prevent 
transepithelial leakage at the points where three cells meet. 

The claudin protein family has many members (24 in humans), and these 
are expressed in different combinations in different epithelia to confer particu- 
lar permeability properties on the epithelial sheet. They are thought to form 
paracellular pores — selective channels allowing specific ions to cross the tight - 
junctional barrier, from one extracellular space to another. A specific claudin 
found in kidney epithelial cells, for example, is needed to let Mg 2+ pass between 
the cells of the sheet so that this ion can be resorbed from the urine into the 
blood. A mutation in the gene encoding this claudin results in excessive loss of 
Mg 2+ in the urine. 



_ 



tight-junction proteins 



Figure 1 9-26 A model of a tight junction. 

(A) The sealing strands hold adjacent 
plasma membranes together. The strands 
are composed of transmembrane proteins 
that make contact across the intercellular 
space and create a seal. (B) The molecular 
composition of a sealing strand. The 
claudins are the main functional 
components; the role of the occludins 
is uncertain. 



Scaffold Proteins in Junctional Complexes Play a Key Part in the 
Control of Cell Proliferation 

The claudins and occludins have to be held in the right position in the cell, so as 
to form the tight-junctional network of sealing strands. This network usually lies 
just apical to the adherens and desmosome junctions that bond the cells 
together mechanically, and the whole assembly is called a junctional complex 
(Figure 19-27). The parts of this junctional complex depend on each other for 
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their formation. For example, anti-cadherin antibodies that block the formation 
of adherens junctions also block the formation of tight junctions. The position- 
ing and organization of tight junctions in relation to these other structures is 
thought to depend on association with intracellular scaffold proteins of the Tjp 
(Tight junction protein) family, also called ZO proteins (a tight junction is also 
known as a zonula occludens). The vertebrate Tjp proteins belong to the same 
family as the Discs-large proteins that we mentioned earlier for their role at 
synapses, and they anchor the tight-junctional strands to other components 
including the actin cytoskeleton. 

In invertebrates such as insects and mollusks, occluding junctions have a 
different appearance and are called septate junctions. Like tight junctions, 
these form a continuous band around each epithelial cell, but the structure is 
more regular, and the interacting plasma membranes are joined by proteins that 
are arranged in parallel rows with a regular periodicity (Figure 19-28). Septate 
junctions are nevertheless based on proteins homologous to the vertebrate 
claudins, and they depend on scaffold proteins in a similar way, including in 
particular the same Discs-large protein that is present at synapses. Mutant flies 
that are deficient in Discs-large have defective septate junctions. 

Strikingly, these mutants also develop epithelial tumors, in the form of large 
overgrowths of the imaginal discs — the structures in the fly larva from which 
most of the adult body derives (as described in Chapter 22). The gene takes its 
name from this remarkable effect, which depends on the presence of binding 
sites for growth regulators on the Discs-large protein. But why should the appa- 
ratus of cell-cell adhesion be linked in this way with the control of cell prolifer- 
ation? The relationship seems to be fundamental: in vertebrates also, genes 



Figure 19-27 A junctional complex 
between two epithelial cells in the 
lining of the gut. Most apically, there is a 
tight junction; beneath this, an adherens 
junction; and beneath the adherens 
junction, a desmosomal junction. This 
example is from a vertebrate; in insects, 
the arrangement is different. (Courtesy of 
Daniel S. Friend.) 
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Figure 19-28 A septate junction. 

A conventional electron micrograph of a 
septate junction between two epithelial 
cells in a mollusk.The interacting plasma 
membranes, seen in cross section, are 
connected by parallel rows of junctional 
proteins. The rows, which have a regular 
periodicity, are seen as dense bars, or 
septa. (From N.B. Gilula, in Cell 
Communication [R.P. Cox, ed.] ( pp. 1-29. 
New York: Wiley, 1974.) 
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homologous to Discs large have this dual involvement. One possibility is that it 
reflects a basic mechanism for repair and maintenance of epithelia. If an 
epithelial cell is deprived of adhesive contacts with neighbors, its program of 
growth and proliferation is activated, thereby creating new cells to reconstruct 
a continuous multicellular sheet. In fact, a large body of evidence indicates that 
junctional complexes are important sites of cell-cell signaling not only via 
Discs-large but also through other components of these structures, including 
cadherins as we have seen. 
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Cell-Cell Junctions and the Basal Lamina Govern Apico-Basal 
Polarity in Epithelia 

Most cells in animal tissues are strongly polarized: they have a front that differs 
from the back, or a top that differs from the bottom. Examples include virtually 
all epithelial cells, as we have discussed, as well as neurons with their den- 
drite-axon polarity, migrating fibroblasts and white blood cells, with their loco- 
motor leading edge and trailing rear end, and many other cells in embryos as 
they prepare to divide asymmetrically to create daughter cells that are different. 
A core set of components is critical for cell polarity in all these cases, throughout 
the animal kingdom, from worms and flies to mammals. 

In the case of epithelial cells, these fundamental generators of cell polarity 
have to establish the difference between the apical and basal poles, and they have 
to do so in a properly oriented way, in accordance with the cell's surroundings. 
The basic phenomenon is nicely illustrated by experiments with a cultured line 
of epithelial cells, called MDCK cells (Figure 19-29A). These can be separated 
from one another and cultured in suspension in a collagen gel. A single isolated 
cell in these circumstances does not show any obvious polarity, but if it is allowed 
to divide to form a small colony of cells, these cells will organize themselves into 
a hollow epithelial vesicle where the polarity of each cell is clearly apparent. The 
vesicle becomes surrounded by a basal lamina, and all the cells orient themselves 
in the same way, with apex-specific marker molecules facing the lumen. Evi- 
dently, the MDCK cells have a spontaneous tendency to become polarized, but 
the mechanism is cooperative and depends on contacts with neighbors. 

To discover how the underlying molecular mechanism works, the first step 
is to identify its components. Studies in the worm C. elegans and in Drosophila 
have been most informative here. In the worm, a screen for mutations upsetting 
the organization of the early embryo has revealed a set of genes essential for nor- 
mal cell polarity and asymmetry of cell division (as discussed in Chapter 22). 
There are at least six of these genes, called Par (partitioning defective) genes. In 
all animal species studied, they and their homologs (along with other genes dis- 
covered through studies in Drosophila and vertebrates) have a fundamental role 
not just in asymmetric cell division in the early embryo, but in many other pro- 
cesses of cell polarization, including the polarization of epithelial cells. The Par4 
gene of C. elegans, for example, is homologous to a gene called Lkbl in mammals 
and Drosophila y coding for a serine/threonine kinase. In the fly, mutations of 
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Figure 19-29 Cooperative polarization of 
a cluster of epithelial cells in culture and 
its dependence on Rac and laminin. Cells 
of the MDCK line, derived from dog kidney 
epithelium, were dissociated, embedded 
in a collagen matrix, and allowed to 
proliferate, creating small isolated 
colonies, shown here schematically in 
cross section. (A) The cells in such a colony 
will normally organize themselves 
spontaneously into an epithelium 
surrounding a central cavity. Staining for 
actin (which marks apical microvilli), ZOI 
protein (a tight-junction protein), Golgi 
apparatus, and laminin (a basal lamina 
component) shows that the cells have all 
cooperatively become polarized, with 
apical components facing the lumen of 
the cavity and basal components facing 
the surrounding collagen gel. (B) When 
Rac function is blocked by expression of a 
dominant-negative form of the protein, 
the cells show inverted polarity, fail to 
form a cyst with a central cavity, and cease 
to deposit laminin in the normal manner 
around the periphery of the cell cluster. 
(C) When the cyst is embedded in a matrix 
rich in exogenous laminin, near-normal 
polarity is restored even though Rac 
function is still blocked. (Based on 
L.E. O'Brien et ah, Not. Cell Biol. 3:831-838, 
2001 . With permission from Macmillan 
Publishers Ltd.) 
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this gene disrupt the polarity of the egg cell and of cells in epithelia. In humans, 
such mutations give rise to Peutz-Jeghers syndrome, involving disorderly abnor- 
mal growths of the lining of the gut and a predisposition to certain rare types of 
cancer. When cultured human colon epithelial cells are prevented from express- 
ing LKB1, they fail to polarize normally. Moreover, when such cells in culture are 
artificially driven to express abnormally high levels of LKB1 activity, they can 
become individually polarized, even when isolated from other cells, and sur- 
rounded on all sides by a uniform medium (Figure 19-30). This suggests that 
normal epithelial polarity depends on two interlocking mechanisms: one that 
endows individual cells with a tendency to become polarized cell- 
autonomously, and another that orients their polarity axis in relation to their 
neighbors and the basal lamina. The latter mechanism would be peculiar to 
epithelia; the former could be much more general, operating also in other polar- 
ized cell types. 

The molecules known to be needed for epithelial polarity can be classified in 
relation to these two mechanisms. Central to the polarity of individual animal 
cells in general is a set of three membrane-associated proteins: Par3, Par6, and 
atypical protein kinase C (aPKC). Par3 and Par6 are both scaffold proteins con- 
taining PDZ domains, and they bind to one another and to aPKC. The complex of 
these three components also has binding sites for various other molecules, 
including the small GTPases Rac and Cdc42. These latter molecules play a crucial 
part. Thus, for example, when Rac function is blocked in a cluster of MDCK cells, 
the cells develop with inverted polarity (see Figure 19-29B). Rac and Cdc42 are 
key regulators of actin assembly, as explained in Chapter 16; through them, it 
seems, assembly of a Par3-Par6-aPKC complex in a specific region of the cell cor- 
tex is associated with polarization of the cytoskeleton towards that region. The 
assembly process is evidently cooperative and involves some positive feedback 
and spatial signaling, so that a small initial cluster of these components is able to 
recruit more of them and to inhibit the development of clusters of the same type 
elsewhere in the cell. One source of positive feedback may lie in the behavior of 
Cdc42 and Rac: a high activity of these molecules at a particular site, by organiz- 
ing the cytoskeleton, may direct intracellular transport so as to bring still more 
Cdc42 or Rac, or more of their activators, to the same site. This is suspected to be 
an essential part of the polarization mechanism in budding yeast cells, and it may 
be the way in which cells such as migrating fibroblasts establish the difference 
between their leading edge and the rest of their periphery. It could be the core of 
the eucaryotic cell polarization machinery, at least in evolutionary terms. 

The Par3-Par6-aPKC complex, combined with Cdc42 or Rac, seems to con- 
trol the organization of other protein complexes associated with the internal 
face of the cell membrane. In particular, in epithelial cells, it causes the Crumbs 
complex, held together by the PDZ-domain scaffold proteins Discs-lost and Star- 
dust, to become localized toward the apex of the cell, while a third such com- 
plex, called the Scribble complex, held together by the scaffold proteins Scribble 
and Discs-large (the same protein that we encountered previously) is localized 
more basally (Figure 19-31). These various protein assemblies interact with one 
another and with other cell components in ways that are only beginning to be 
understood. 

But how is this whole elaborate system oriented correctly in relation to 
neighboring cells? In an epithelium, the Par3-Par6-aPKC complex assembles at 



Figure 19-30 Development of polarity 
in single isolated epithelial cells. Cells of 
a line derived from intestinal epithelium 
were transfected with DNA constructs 
coding for regulatory components 
through which the activity of the LKB1 
protein could be switched on or off by a 
change in the composition of the culture 
medium. When LKB1 activity is low, the 
cells appear unpolarized; when it is high, 
they become individually polarized. Their 
polarity is manifest in the distribution of 
tight-junction proteins (Z01) and 
adherens-junction proteins (p120- 
catenin), which accumulate on one side 
of the cell, around a cap of actin-filled 
microvilli, even though the cells are 
isolated from one another and make no 
cell-cell junctions. This cell-autonomous 
polarization occurs even when the cells 
are cultured in suspension, without 
contact with any substratum that could 
tell them which way was up. (From 
A.F. Baas et al., Cell 1 1 6:457-466, 2004. 
With permission from Elsevier.) 
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cell-cell junctions — tight junctions in vertebrates, adherens junctions in 
Drosophila — because the scaffold proteins in the complex bind to the tails of 
certain of the junctional transmembrane adhesion proteins. Meanwhile, the 
cytoskeleton, under the influence of Rac or its relatives, directs the delivery of 
basal lamina components to the opposite end of the cell. These extracellular 
matrix molecules then act back on the cell to give that region a basal character 
(see Figure 19-29C). In this way, the polarity of the cell is coupled to its orienta- 
tion in the epithelial sheet and its relation to the basal lamina. 



A Separate Signaling System Controls Planar Cell Polarity 

Apico -basal polarity is a universal feature of epithelia, but the cells of some 
epithelia show an additional polarity at right angles to this axis: it is as if they had 
an arrow written on them, pointing in a specific direction in the plane of the 
epithelium. This type of polarity is called planar cell polarity (Figure 19-32A 
and B). In the wing of a fly, for example, each epithelial cell has a tiny asymmet- 
rical projection, called a wing-hair, on its surface, and the hairs all point toward 
the tip of the wing. Similarly, in the inner ear of a vertebrate, each mechanosen- 
sory hair cell has an asymmetric bundle of stereocilia (actin-filled rod-like pro- 
trusions) sticking up from its apical surface: tilting the bundle in one direction 
causes ion channels to open, stimulating the cell electrically; tilting in the oppo- 
site direction has the contrary effect. For the ear to function correctly, the hair 
cells must be correctly oriented. Planar cell polarity is important also in the res- 
piratory tract, for example, where every ciliated cell must orient its beating so as 
to sweep mucus up out of the lungs, and not down into them (see Chapter 23). 
Screens for mutants with disorderly wing hairs in Drosophila have identified 
a set of genes that are critical for planar cell polarity in the fly. Some of these, 



Figure 19-31 The coordinated 
arrangement of three membrane- 
associated protein complexes thought to 
be critical for epithelial polarity. 

A Drosophila epithelial cell is shown 
schematically on the left, and a vertebrate 
epithelial cell on the right. All three 
complexes — the Par3-Par6-aPkc complex, 
the Crumbs complex, and the Scribble 
complex—are organized around scaffold 
proteins containing PDZ domains. The 
detailed distribution of the complexes 
varies somewhat according to cell type. 



Figure 1 9-32 Planar cell polarity. (A) Wing 
hairs on the wing of a fly. Each cell in the 
wing epithelium forms one of these little 
spiky protrusions or"hairs"at its apex, and 
all the hairs point the same way, toward 
the tip of the wing. This reflects a planar 
polarity in the structure of each cell. 
(B) Sensory hair cells in the inner ear of a 
mouse similarly have a well-defined planar 
polarity, manifest in the oriented pattern 
of stereocilia (actin-filled protrusions) on 
their surface. The detection of sound 
depends on the correct, coordinated 
orientation of the hair cells. (C) A mutation 
in the gene Flamingo in the fly, coding for 
a non-classical cadherin, disrupts the 
pattern of planar cell polarity in the wing. 
(D) A mutation in a homologous Flamingo 
gene in the mouse randomizes the 
orientation of the planar cell polarity 
vector of the hair cells in the ear. The 
mutant mice are deaf. (A and C, from 
J. Chae et at, Development 126:5421-5429, 
1 999. With permission from The Company 
of Biologists; B and D, from J.A. Curtin et 
al„ Curr. Biol. 1 3:1 1 29-1 1 33, 2003. With 
permission from Elsevier.) 
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such as Frizzled, for example, and Dishevelled, code for proteins that have since 
been shown to be components of the Wnt signaling pathway (discussed in 
Chapter 15). Two others, Flamingo (see Figure 19-32C) and Dachsous, code for 
members of the cadherin superfamily. Still others are less easily classified func- 
tionally, but it is clear that planar cell polarity is organized by machinery formed 
from these components and assembled at cell-cell junctions in such a way that 
a polarizing influence can propagate from cell to cell. Essentially the same sys- 
tem of proteins controls planar cell polarity in vertebrates. Mice with mutations 
in a Flamingo homolog, for example, have incorrectly oriented hair cells in their 
ears (among other defects) and thus are deaf (see Figure 19-32D). 



Summary 

Occluding junctions — tight junctions in vertebrates, septate junctions in insects and 
molluscs — seal the gaps between cells in epithelia, creating a barrier to the diffusion of 
molecules across the cell sheet. They also form a bar to the diffusion of proteins in the 
plane of the membrane, and so help to maintain a difference between the populations 
of proteins in the apical and basolateral membrane domains of the epithelial cell The 
major transmembrane proteins forming occluding junctions are called claudins; dif- 
ferent members of the family are expressed in different tissues, conferring different per- 
meability properties on the various epithelial sheets. 

Intracellular scaffold proteins bind to the transmembrane components at occlud- 
ing junctions and coordinate these junctions with cadherin-based anchoring junc- 
tions, so as to create junctional complexes. The junctional scaffold proteins have at 
least two other crucial functions. They play a part in the control of epithelial cell pro- 
liferation; and, in conjunction with other regulatory molecules such as Rac and Cdc42, 
they govern cell polarity. Epithelial cells have an intrinsic tendency to develop a polar- 
ized apico-basal axis. The orientation of this axis in relation to the cell's neighbors in 
an epithelial sheet depends on protein complexes involving scaffold proteins that 
assemble at cell-cell junctions, as well as on cytoskeletal polarization controlled by 
Rac/Cdc42 and on influences from the basal lamina. 

The cells of some epithelia have an additional polarity in the plane of the epithelium, 
at right angles to the apico-basal axis. A separate set of conserved proteins, operating in a 
similar way in vertebrates and in insects, governs this planar cell polarity through poorly 
understood signaling processes that are likewise based on cell-cell junctions. 



PASSAGEWAYS FROM CELL TO CELL: GAP 
JUNCTIONS AND PLASMODESMATA 

Tight junctions block the passageways through the gaps between cells, prevent- 
ing extracellular molecules from leaking from one side of an epithelium to the 
other. Another type of junctional structure has a radically different function: it 
bridges gaps between adjacent cells so as to create direct passageways from the 
cytoplasm of one into that of the other. These passageways take quite different 
forms in animal tissues, where they are called gap junctions, and in plants, 
where they are called plasmodesmata (singular plasmodesma) . In both cases, 
however, the function is similar: the connections allow neighboring cells to 
exchange small molecules but not macromolecules (with some exceptions for 
plasmodesmata). Many of the implications of this cell coupling are only begin- 
ning to be understood. 



Gap Junctions Couple Cells Both Electrically and Metabolically 

Gap junctions are present in most animal tissues, including connective tissues 
as well as epithelia, allowing the cells to communicate with their neighbors. 
Each gap junction appears in conventional electron micrographs as a patch 
where the membranes of two adjacent cells are separated by a uniform narrow 
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gap of about 2-4 nm. The gap is spanned by channel-forming proteins, of which 
there are two distinct families, called the connexins and the innexins. These are 
unrelated in sequence but similar in shape and function: in vertebrates, both 
families are present, but connexins predominate, with 21 members in humans. 
In Drosophila and C. elegans, only innexins are present, with 15 family members 
in the fly and 25 in the worm. 

The channels formed by the gap-junction proteins allow inorganic ions and 
other small water-soluble molecules to pass directly from the cytoplasm of one 
cell to the cytoplasm of the other, thereby coupling the cells both electrically and 
metabolically. Thus, when a suitable dye is injected into one cell, it diffuses read- 
ily into the other, without escaping into the extracellular space. Similarly, an 
electric current injected into one cell through a microelectrode causes an almost 
instantaneous electrical disturbance in the neighboring cell, due to the flow of 
ions carrying electric charge through gap junctions. With microelectrodes 
inserted into both cells, one can easily monitor this effect and measure proper- 
ties of the gap junctions, such as their electrical resistance and the ways in which 
the coupling changes as conditions change. In fact, some of the earliest evidence 
of gap-junctional communication came from electrophysiological studies that 
demonstrated this type of rapid, direct electrical coupling between some types 
of neurons. Similar methods were used to identify connexins as the proteins that 
mediate the gap-junctional communication: when connexin mRNA is injected 
into either frog oocytes or gap-junction-deficient cultured cells, channels with 
the properties expected of gap-junction channels can be demonstrated electro- 
physiologically where pairs of injected cells make contact. 

From experiments with injected dye molecules of different sizes, it seems 
that the largest functional pore size for gap-junctional channels is about 1.5 nm. 
Thus, the coupled cells share their small molecules (such as inorganic ions, sug- 
ars, amino acids, nucleotides, vitamins, and the intracellular mediators cyclic 
AMP and inositol trisphosphate) but not their macromolecules (proteins, 
nucleic acids, and polysaccharides) (Figure 19-33). 




Figure 19-33 Determining the size of a 
gap-junction channel. When fluorescent 
molecules of various sizes are injected into 
one of two cells coupled by gap junctions, 
molecules with a mass of less than about 
1000 daltons can pass into the other cell, 
but larger molecules cannot. 



A Gap-Junction Connexon Is Made Up of Six Transmembrane 
Connexin Subunits 

Connexins are four-pass transmembrane proteins, six of which assemble to 
form a hemichannel, or connexon. When the connexons in the plasma mem- 
branes of two cells in contact are aligned, they form a continuous aqueous chan- 
nel that connects the two cell interiors (Figure 19-34A and Figure 19-35). A gap 
junction consists of many such connexon pairs in parallel, forming a sort of 
molecular sieve. The connexons hold the interacting plasma membranes a fixed 
distance apart — hence the gap. 

Gap junctions in different tissues can have different properties because they 
are formed from different combinations of connexins, creating channels that 
differ in permeability. Most cell types express more than one type of connexin, 
and two different connexin proteins can assemble into a heteromeric connexon, 
with its own distinct properties. Moreover, adjacent cells expressing different 



Figure 19-34 Gap junctions. (A) A three- 
dimensional drawing showing the 
interacting plasma membranes of two 
adjacent cells connected by gap 
junctions. Each lipid bilayer is shown as a 
pair of red sheets. Protein assemblies 
called connexons [green), each of which 
is formed by six connexin subunits, 
penetrate the apposed lipid bilayers {red). 
Two connexons join across the 
intercellular gap to form a continuous 
aqueous channel connecting the two 
cells. (B) The organization of connexins 
into connexons and connexons into 
intercellular channels. The connexons can 
be homomeric or heteromeric, and the 
intercellular channels can be homotypic 
or heterotypic. 
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Figure 19-35 Gap junctions as seen in 
the electron microscope. (A) Thin-section 
and (B) freeze-fracture electron 
micrographs of a large and a small gap 
junction between fibroblasts in culture. In 
(B), each gap junction is seen as a cluster 
of homogeneous intramembrane 
particles. Each intramembrane particle 
corresponds to a connexon. (From 
N.B. Gilula, in Cell Communication 
[R.P. Cox, ed.], pp. 1-29. New York: Wiley, 
1974.) 



100 nm 



connexins can form intercellular channels in which the two aligned half-chan- 
nels are different (Figure 19-34B). 

Each gap-junctional plaque is a dynamic structure that can readily assem- 
ble, disassemble, or be remodelled, and it can contain a cluster of a few to many 
thousands of connexons (see Figure 19-35B). Studies with fluorescently labeled 
connexins in living cells show that new connexons are continually added around 
the periphery of an existing junctional plaque, while old connexons are removed 
from the middle of it and destroyed (Figure 19-36). This turnover is rapid: the 
connexin molecules have a half- life of a few hours. 

The mechanism of removal of old connexons from the middle of the plaque 
is not known, but the route of delivery of new connexons to its periphery seems 
clear: they are inserted into the plasma membrane by exocytosis, like other inte- 
gral membrane proteins, and then diffuse in the plane of the membrane until 
they bump into the periphery of a plaque and become trapped. This has a corol- 
lary: the plasma membrane away from the gap junction should contain connex- 
ons — hemichannels— that have not yet paired with their counterparts on 
another cell. It is thought that these unpaired hemichannels are normally held 



Figure 19-36 Connexin turnover at a gap junction. Cells were transfected 
with a slightly modified connexin gene, coding for a connexin with a short 
amino-acid tag containing four cysteines in the sequence ...Cys-Cys-X-X- 
Cys-Cys (where X denotes an arbitrary amino acid). This tetracysteine tag 
can bind strongly, and in effect irreversibly, to certain small fluorescent dye 
molecules that can be added to the culture medium and will readily enter 
cells by diffusing across the plasma membrane. In the experiment shown, a 
green dye was added first, and the cells were then washed and incubated 
for 4 or 8 hours. At the end of this time, a red dye was added to the 
medium and the cells were washed again and fixed. Connexin molecules 
already present at the beginning of the experiment are labeled green (and 
take up no red dye because their tetracysteine tags are already saturated 
with green dye), while connexins synthesized subsequently, during the 
4- or 8-hour incubation, are labeled red. The fluorescence images show 
optical sections of gap junctions between pairs of cells prepared in this 
way. The central part of the gap-junction plaque is green, indicating that it 
consists of old connexin molecules, while the periphery is red, indicating 
that it consists of connexins synthesized during the past 4 or 8 hours. The 
longer the time of incubation, the smaller the green central patch of old 
molecules, and the larger the peripheral ring of new molecules that have 
been recruited to replace them. (From G. Gaietta et al., Science 
296:503-507, 2002. With permission from AAAS.) 
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in a closed conformation, preventing the cell from losing its small molecules by 
leakage through them. But there is also evidence that in some physiological cir- 
cumstances they can open and serve as channels for the release of small 
molecules, such as the neurotransmitter glutamate, to the exterior, or for the 
entry of small molecules into the cell. 

Gap Junctions Have Diverse Functions 

In tissues containing electrically excitable cells, cell-cell coupling via gap junc- 
tions serves an obvious purpose. Some nerve cells, for example, are electrically 
coupled, allowing action potentials to spread rapidly from cell to cell, without 
the delay that occurs at chemical synapses. This is advantageous when speed 
and reliability are crucial, as in certain escape responses in fish and insects, or 
where a set of neurons need to act in synchrony. Similarly, in vertebrates, elec- 
trical coupling through gap junctions synchronizes the contractions of heart 
muscle cells as well as those of the smooth muscle cells responsible for the peri- 
staltic movements of the intestine. 

Gap junctions also occur in many tissues whose cells are not electrically 
excitable. In principle, the sharing of small metabolites and ions provides a 
mechanism for coordinating the activities of individual cells in such tissues and 
for smoothing out random fluctuations in small-molecule concentrations in dif- 
ferent cells. Gap junctions are required in the liver, for example, to coordinate 
the response of the liver cells to signals from nerve terminals that contact only a 
part of the cell population (see Figure 15-7). The normal development of ovar- 
ian follicles also depends on gap-junction-mediated communication — in this 
case, between the oocyte and the surrounding granulosa cells. A mutation in the 
gene that encodes the connexin that normally couples these two cell types 
causes infertility. 

Mutations in connexins, especially connexin-26, are the commonest of all 
genetic causes of congenital deafness: they result in the death of cells in the 
organ of Corti, probably because they disrupt functionally important pathways 
for the flow of ions from cell to cell in this electrically active sensory epithelium. 
Connexin mutations are responsible for many other disorders besides deafness, 
ranging from cataracts in the lens of the eye to a form of demyelinating disease 
in peripheral nerves. 

Cell coupling via gap junctions also seems to play a part in embryogenesis. 
In early vertebrate embryos (beginning with the late eight-cell stage in mouse 
embryos), most cells are electrically coupled to one another. As specific groups 
of cells in the embryo develop their distinct identities and begin to differenti- 
ate, they commonly uncouple from surrounding tissue. As the neural plate 
folds up and pinches off to form the neural tube, for instance (see Figure 
19-16), its cells uncouple from the overlying ectoderm. Meanwhile, the cells 
within each group remain coupled with one another and therefore tend to 
behave as a cooperative assembly, all following a similar developmental path- 
way in a coordinated fashion. 

Cells Can Regulate the Permeability of Their Gap Junctions 

Like conventional ion channels (discussed in Chapter 11), individual gap-junc- 
tion channels do not remain continuously open; instead, they flip between open 
and closed states. Moreover, the permeability of gap junctions is rapidly (within 
seconds) and reversibly reduced by experimental manipulations that decrease 
the cytosolic pH or increase the cytosolic concentration of free Ca 2+ to very high 
levels. 

The purpose of the pH regulation of gap-junction permeability is unknown. 
In one case, however, the purpose of Ca 2+ control seems clear. When a cell is 
damaged, its plasma membrane can become leaky. Ions present at high concen- 
tration in the extracellular fluid, such as Ca 2+ and Na + , then move into the cell, 
and valuable metabolites leak out. If the cell were to remain coupled to its 
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Figure 19-37 The regulation of gap- 
junction coupling by a neurotransmitter. 

(A) A neuron in a rabbit retina was injected 
with the dye Lucifer yellow, which passes 
readily through gap junctions and labels 
other neurons of the same type that are 
connected to the injected cell by gap 
junctions. (B) The retina was first treated 
with the neurotransmitter dopamine, 
before the neuron was injected with dye. 
As can be seen, the dopamine treatment 
greatly decreased the permeability of the 
gap junctions. Dopamine acts by 
increasing intracellular cyclic AMP levels. 
(Courtesy of David Vaney.) 



healthy neighbors, these too would suffer a dangerous disturbance of their 
internal chemistry. But the large influx of Ca 2+ into the damaged cell causes its 
gap-junction channels to close immediately, effectively isolating the cell and 
preventing the damage from spreading to other cells. 

Gap-junction communication can also be regulated by extracellular signals. 
The neurotransmitter dopamine, for example, reduces gap-junction communi- 
cation between a class of neurons in the retina in response to an increase in light 
intensity (Figure 19-37). This reduction in gap-junction permeability helps the 
retina switch from using rod photoreceptors, which are good detectors of low 
light, to cone photoreceptors, which detect color and fine detail in bright light. 



In Plants, Plasmodesmata Perform Many of the Same Functions 
as Gap Junctions 

The tissues of a plant are organized on different principles from those of an ani- 
mal. This is because plant cells are imprisoned within tough cell walls composed 
of an extracellular matrix rich in cellulose and other polysacharides, as we dis- 
cuss later. The cell walls of adjacent cells are firmly cemented to those of their 
neighbors, which eliminates the need for anchoring junctions to hold the cells 
in place. But a need for direct cell-cell communication remains. Thus, plant cells 
have only one class of intercellular junctions, plasmodesmata. Like gap junc- 
tions, they directly connect the cytoplasms of adjacent cells. 

In plants, the cell wall between a typical pair of adjacent cells is at least 0.1 
(im thick, and so a structure very different from a gap junction is required to 
mediate communication across it. Plasmodesmata solve the problem. With a 
few specialized exceptions, every living cell in a higher plant is connected to its 
living neighbors by these structures, which form fine cytoplasmic channels 
through the intervening cell walls. As shown in Figure 19-38A, the plasma mem- 
brane of one cell is continuous with that of its neighbor at each plasmodesma, 
which connects the cytoplasms of the two cells by a roughly cylindrical channel 
with a diameter of 20-40 nm. 

Running through the center of the channel in most plasmodesmata is a nar- 
rower cylindrical structure, the desmotubule, which is continuous with elements 
of the smooth endoplasmic reticulum in each of the connected cells (Figure 
19-38B-D). Between the outside of the desmotubule and the inner face of the 
cylindrical channel formed by plasma membrane is an annulus of cytosol 
through which small molecules can pass from cell to cell. As each new cell wall 
is assembled during the cytokinesis phase of cell division, plasmodesmata are 
created within it. They form around elements of smooth ER that become 
trapped across the developing cell plate (discussed in Chapter 17). They can also 
be inserted de novo through preexisting cell walls, where they are commonly 
found in dense clusters called pit fields. When no longer required, plasmodes- 
mata can be readily removed. 
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Figure 1 9-38 Plasmodesmata. (A) The cytoplasmic channels of 
plasmodesmata pierce the plant cell wall and connect all cells in a plant 
together. (B) Each plasmodesma is lined with plasma membrane that is 
common to two connected cells. It usually also contains a fine tubular 
structure, the desmotubule, derived from smooth endoplasmic reticulum. 
(C) Electron micrograph of a longitudinal section of a plasmodesma from a 
water fern. The plasma membrane lines the pore and is continuous from one 
cell to the next. Endoplasmic reticulum and its association with the central 
desmotubule can also be seen. (D) A similar plasmodesma seen in cross 
section. (C and D, from R. Overall, J. Wolfe and B.E.S. Gunning, in Protoplasma 
9, pp. 137 and 140. Heidelberg: Springer-Verlag, 1982.) 



In spite of the radical difference in structure between plasmodesmata and 
gap junctions, they seem to function in remarkably similar ways. Evidence 
obtained by injecting tracer molecules of different sizes suggests that plasmo- 
desmata allow the passage of molecules with a molecular weight of less than 
about 800, which is similar to the molecular-weight cutoff for gap junctions. As 
with gap junctions, transport through plasmodesmata is regulated. Dye-injec- 
tion experiments, for example, show that there can be barriers to the movement 
of even low-molecular-weight molecules between certain cells, or groups of 
cells, that are connected by apparently normal plasmodesmata; the mecha- 
nisms that restrict communication in these cases are not understood. 

During plant development, groups of cells within the shoot and root meri- 
stems signal to one another in the process of defining their future fates (dis- 
cussed in Chapter 22). Some gene regulatory proteins involved in this process 
of cell fate determination pass from cell to cell through plasmodesmata. They 
bind to components of the plasmodesmata and override the size exclusion 
mechanism that would otherwise prevent their passage. In some cases, the 
mRNAthat encodes the protein can also pass through. Some plant viruses also 
exploit this route: infectious viral RNA, or even intact virus particles, can pass 
from cell to cell in this way These viruses produce proteins that bind to com- 
ponents of the plasmodesmata in ways that increase the effective pore size of 
the channel dramatically The functional components of plasmodesmata that 
mediate this effect are unknown, and it is still a mystery how endogenous or 
viral macromolecules regulate the transport properties of the channel so as to 
pass through it. 
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Summary 

The cells of many animal tissues, both epithelial and nonepithelial, are coupled by 
channel- forming junctions called gap junctions. These take the form of plaques of clus- 
tered connexons that usually allow molecules smaller than about 1000 daltons to pass 
directly from the inside of one cell to the inside of the next. The cells can regulate the per- 
meability of these junctions. Gap junctions are dynamic structures: new connexons are 
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continually recruited to the periphery of the plaque, while old connexons are continu- 
ally removed from its center. 

Cells connected by gap junctions share many of their inorganic ions and other 
small molecules and are therefore chemically and electrically coupled. Gap junctions 
are important in coordinating the activities of electrically active cells, and they have a 
coordinating role in other groups of cells as well In plants, cells are linked by commu- 
nicating junctions called plasmodesmata. Although their structure is entirely different 
from that of gap junctions, and they can sometimes transport informational macro- 
molecules, they function remarkably like gap junctions in permitting small molecules 
to pass from cell to cell while blocking the passage of most large molecules. 



THE BASAL LAMINA 

Tissues are not made up solely of cells. A part of their volume — sometimes a very 
large part — is extracellular space, which is occupied by an intricate network of 
macromolecules constituting the extracellular matrix. This matrix is composed 
of various proteins and polysaccharides that are secreted locally and assembled 
into an organized meshwork in close association with the surfaces of the cells 
that produced them. 

In our own bodies, the most plentiful forms of extracellular matrix are 
found in bulky connective tissues such as bone, tendon, and the dermal layer 
of the skin. For animals in general, however, from an evolutionary point of view, 
pride of place goes to the extracellular matrix that forms a much less obvious 
structure — the basal lamina (also referred to as the basement membrane). 
This exceedingly thin, tough, flexible sheet of matrix molecules is an essential 
underpinning of all epithelia. Small as it is in volume, it has a critical role in the 
architecture of the body. Like the cadherins, it seems to be one of the defining 
features common to all multicellular animals. Other forms of extracellular 
matrix are more variable from one animal phylum to another, in both compo- 
sition and quantity. 

In this section, we discuss the basal lamina itself. In the next section we shall 
consider how epithelial cells and basal lamina interact with one another, 
through integrin proteins in the epithelial cell membranes, and we shall see that 
integrins are also present in other cell types, mediating their interactions with 
the varied types of extracellular matrix found in connective tissues. These other 
forms of extracellular matrix will be discussed in detail later. 



Basal Laminae Underlie All Epithelia and Surround Some 
Nonepithelial Cell Types 

Basal laminae are typically 40-120 nm thick. A sheet of basal lamina not only 
underlies all epithelia but also surrounds individual muscle cells, fat cells, and 
Schwann cells (which wrap around peripheral nerve cell axons to form myelin). 
The basal lamina thus separates these cells and epithelia from the underlying or 
surrounding connective tissue and forms the mechanical connection between 
them. In other locations, such as the kidney glomerulus, a basal lamina lies 
between two cell sheets and functions as a selective filter (Figure 19-39). Basal 
laminae have more than simple structural and filtering roles, however. They are 
able to determine cell polarity, influence cell metabolism, organize the proteins 
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Figure 1 9-39 Three ways in 
which basal laminae are 
organized. Basal laminae 
(yellow) surround certain cells 
(such as skeletal muscle cells), 
underlie epithelia, and are 
interposed between two cell 
sheets (as in the kidney 
glomerulus). Note that, in the 
kidney glomerulus, both cell 
sheets have gaps in them, and 
the basal lamina has a filtering 
as well as a supportive function, 
helping to determine which 
molecules will pass into the 
urine from the blood. The 
filtration also depends on other 
protein-based structures, called 
slit diaphragms, that span the 
intercellular gaps in the 
epithelial sheet. 
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in adjacent plasma membranes, promote cell survival, proliferation, or differen- 
tiation, and serve as highways for cell migration. 

The mechanical role is nevertheless essential. In the skin, for example, the 
epithelial outer layer — the epidermis — depends on the strength of the basal 
lamina to keep it attached to the underlying connective tissue — the dermis. In 
people with genetic defects in certain basal lamina proteins or in a special type 
of collagen that anchors the basal lamina to the underlying connective tissue, 
the epidermis becomes detached from the dermis. This causes a blistering dis- 
ease called junctional epidermolysis bullosa, a severe and sometimes lethal con- 
dition. 



Laminin Is a Primary Component of the Basal Lamina 

The basal lamina is synthesized by the cells on each side of it: the epithelial cells 
contribute one set of basal lamina components, while cells of the underlying bed 
of connective tissue (called the stroma, Greek for "bedding") contribute another 
set (Figure 19-40). Like other extracellular matrices in animal tissues, the basal 
lamina consists of two main classes of extracellular macromolecules: (1) fibrous 
proteins (usually glycoproteins, which have short oligosaccharide side chains 
attached) and (2) polysaccharide chains of the type called glycosaminoglycans 
(GAGs), which are usually found covalently linked to specific core proteins to form 
proteoglycans (Figure 19-41). In a later section, we shall discuss these two large 
and varied classes of matrix molecules in greater detail. We introduce them here 
through the special subset that are found in basal laminae. 

Although the precise composition of the mature basal lamina varies from 
tissue to tissue and even from region to region in the same lamina, it typically 
contains the glycoproteins laminin, type IV collagen, and nidogen, along with 
the proteoglycan perlecan. Together with these key components, present in the 
basal laminae of virtually all animals from jellyfish to mammals, it holds in its 
meshes, or is closely associated with, various other molecules. These include 
collagen XVIII (an atypical member of the collagen family, forming the core pro- 
tein of a proteoglycan) and fibronectin, a fibrous protein important in the adhe- 
sion of connective-tissue cells to matrix. 

The laminin is thought to be the primary organizer of the sheet structure, 
and early in development, basal laminae consist mainly of laminin molecules. 
Laminin- 1 (classical laminin) is a large, flexible protein composed of three very 
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Figure 19-40 The basal lamina in the 
cornea of a chick embryo. In this 
scanning electron micrograph, some of 
the epithelial cells have been removed to 
expose the upper surface of the matlike 
basal lamina. A network of collagen fibrils 
in the underlying connective tissue 
interacts with the lower face of the 
lamina. (Courtesy of Robert Trelstad.) 
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Figure 1 9-41 The comparative shapes 
and sizes of some of the major 
extracellular matrix macromolecules. 

Protein is shown in green, and 
glycosaminoglycan in red. 



long polypeptide chains (a, p, and y) held together by disulfide bonds and 
arranged in the shape of an asymmetric bouquet, like a bunch of three flowers 
whose stems are twisted together at the foot but whose heads remain separate 
(Figure 19-42). These heterotrimers can self-assemble in vitro into a network, 
largely through interactions between their heads, although interaction with cells 
is needed to organize the the network into an orderly sheet. Since there are sev- 
eral isoforms of each type of chain, and these can associate in different combi- 
nations, many different laminins can be produced, creating basal laminae with 
distinctive properties. The laminin y-1 chain is, however, a component of most 
laminin heterotrimers; mice lacking it die during embryogenesis because they 
are unable to make basal lamina. 



Type IV Collagen Gives the Basal Lamina Tensile Strength 

Type IV collagen is a second essential component of mature basal laminae, and 
it, too, exists in several isoforms. Like the fibrillar collagens that constitute the 
bulk of the protein in connective tissues such as bone or tendon (discussed 
later), type IV collagen molecules consist of three separately synthesized long 
protein chains that twist together to form a ropelike superhelix; but they differ 
from the fibrillar collagens in that the triple-stranded helical structure is inter- 
rupted in more than 20 regions, allowing multiple bends. The type IV collagen 
molecules interact via their terminal domains to assemble extracellularly into a 
flexible, felt-like network. In this way, type IV collagen gives the basal lamina 
tensile strength. 

But how do the networks of laminin and type IV collagen bond to one 
another and to the surfaces of the cells that sit on the basal lamina? Why do they 
form a two-dimensional sheet, rather than a three-dimensional gel? The 
molecules of laminin have several functional domains, including one that binds 
to the perlecan proteoglycan, one that binds to the nidogen protein, and two or 
more that bind to laminin receptor proteins on the surface of cells. Type IV col- 
lagen also has domains that bind nidogen and perlecan. It is thought, therefore, 
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Figure 1 9-42 The structure of laminin. 

(A) The subunits of a laminin-1 molecule, 
and some of their binding sites for other 
molecules (yellow boxes). Laminin is a 
multidomain glycoprotein composed of 
three polypeptides (a, p, and y) that are 
disulfide-bonded into an asymmetric 
crosslike structure. Each of the polypeptide 
chains is more than 1 500 amino acids long. 
Five types of a chains, three types of 

p chains, and three types of 7 chains are 
known; in principle, they can assemble to 
form 45 (5 x 3 x 3) laminin isoforms. 
Several such isoforms have been found, 
each with a characteristic tissue 
distribution. Through their binding sites for 
other proteins, laminin molecules play a 
central part in organizing the assembly of 
basal laminas and anchoring them to cells. 

(B) Electron micrographs of laminin 
molecules shadowed with platinum. 
(B, from J. Engel et al., J. Moi. Biol. 

1 50:97-1 20, 1 981 . With permission from 
Academic Press.) 
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that nidogen and perlecan serve as linkers to connect the laminin and type IV 
collagen networks once the laminin is in place (Figure 19-43). 

The laminin molecules that generate the initial sheet structure first join to 
each other while bound to receptors on the surface of the cells that produce 
them. The cell-surface receptors are of several sorts. Many of them are members 
of the integrin family; another important type of laminin receptor is dystrogly- 
can, a proteoglycan with a core protein that spans the cell membrane, dangling 
its glycosaminoglycan polysaccharide chains in the extracellular space. 
Together, these receptors organize basal lamina assembly: they hold the laminin 
molecules by their feet, leaving the laminin heads positioned to interact so as to 
form a two-dimensional network. This laminin network then presumably coor- 
dinates the assembly of the other basal lamina components. 



Basal Laminae Have Diverse Functions 

As we have mentioned, in the kidney glomerulus, an unusually thick basal lam- 
ina acts as one of the layers of a molecular filter, helping to prevent the passage 
of macromolecules from the blood into the urine as urine is formed (see Figure 
19-39). The proteoglycan in the basal lamina seems to be important for this func- 
tion: when its GAG chains are removed by specific enzymes, the filtering proper- 
ties of the lamina are destroyed. Type IV collagen also has a role: in a human 
hereditary kidney disorder (Alport syndrome), mutations in type IV collagen 
genes result in an irregularly thickened and dysfunctional glomerular filter. 
Laminin mutations, too, can disrupt the function of the kidney filter, but in a dif- 
ferent way — by interfering with the differentiation of the cells that contact it and 
support it. 



integrin 



Figure 19-43 A model of the molecular 
structure of a basal lamina. (A) The basal 
lamina is formed by specific interactions 
(B) between the proteins laminin, type IV 
collagen, and nidogen, and the 
proteoglycan perlecan. Arrows in (B) 
connect molecules that can bind directly 
to each other. There are various isoforms 
of type IV collagen and laminin, each with 
a distinctive tissue distribution. 
Transmembrane laminin receptors 
(integrins and dystroglycan) in the plasma 
membrane are thought to organize the 
assembly of the basal lamina; only the 
integrins are shown. (Based on 
H. Colognato and P.D. Yurchenco, Dev. Dyn. 
218:213-234, 2000. With permission from 
Wiley-Liss.) 
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The basal lamina can act as a selective barrier to the movement of cells, as 
well as a filter for molecules. The lamina beneath an epithelium, for example, 
usually prevents fibroblasts in the underlying connective tissue from making 
contact with the epithelial cells. It does not, however, stop macrophages, lym- 
phocytes, or nerve processes from passing through it, using specialized protease 
enzymes to cut a hole for their transit. The basal lamina is also important in tis- 
sue regeneration after injury. When cells in tissues such as muscles, nerves, and 
epithelia are damaged or killed, the basal lamina often survives and provides a 
scaffold along which regenerating cells can migrate. In this way, the original tis- 
sue architecture is readily reconstructed. 

A particularly striking example of the role of the basal lamina in regenera- 
tion comes from studies on the neuromuscular junction, the site where the 
nerve terminals of a motor neuron form a chemical synapse with a skeletal mus- 
cle cell (discussed in Chapter 1 1). In vertebrates, the basal lamina that surrounds 
the muscle cell separates the nerve and muscle cell plasma membranes at the 
synapse, and the synaptic region of the lamina has a distinctive chemical char- 
acter, with special isoforms of type IV collagen and laminin and a proteoglycan 
called agrin. 

This basal lamina at the synapse has a central role in reconstructing the 
synapse after nerve or muscle injury. If a frog muscle and its motor nerve are 
destroyed, the basal lamina around each muscle cell remains intact and the sites 
of the old neuromuscular junctions are still recognizable. If the motor nerve, but 
not the muscle, is allowed to regenerate, the nerve axons seek out the original 
synaptic sites on the empty basal lamina and differentiate there to form normal- 
looking nerve terminals. Thus, the junctional basal lamina by itself can guide the 
regeneration of motor nerve terminals. 

Similar experiments show that the basal lamina also controls the localiza- 
tion of the acetylcholine receptors that cluster in the muscle cell plasma mem- 
brane at a neuromuscular junction. If the muscle and nerve are both destroyed, 
but now the muscle is allowed to regenerate while the nerve is prevented from 
doing so, the acetylcholine receptors synthesized by the regenerated muscle 
localize predominantly in the region of the old junctions, even though the nerve 
is absent (Figure 19-44). Thus, the junctional basal lamina apparently coordi- 
nates the local spatial organization of the components in each of the two cells 
that form a neuromuscular junction. Some of the molecules responsible for 
these effects have been identified. Motor neuron axons, for example, deposit 
agrin in the junctional basal lamina, where it regulates the assembly of acetyl- 
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Figure 19-44 Regeneration experiments 
demonstrating the special character of 
the junctional basal lamina at a 
neuromuscular junction. When the nerve, 
but not the muscle, is allowed to 
regenerate after both the nerve and 
muscle have been damaged [upper part of 
figure), the junctional basal lamina directs 
the regenerating nerve to the original 
synaptic site. When the muscle, but not 
the nerve, is allowed to regenerate [lower 
port of figure), the junctional basal lamina 
causes newly made acetylcholine 
receptors (blue) to accumulate at the 
original synaptic site. The muscle 
regenerates from satellite cells (discussed 
in Chapter 23) located between the basal 
lamina and the original muscle cell (not 
shown). These experiments show that the 
junctional basal lamina controls the 
localization of synaptic components on 
both sides of the lamina. 
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choline receptors and other proteins in the junctional plasma membrane of the 
muscle cell. Reciprocally, muscle cells deposit a particular isoform of laminin in 
the junctional basal lamina, and some evidence suggests that this binds directly 
to the extracellular domain of voltage-gated Ca 2+ channels in the presynaptic 
membrane of the nerve cell, helping to hold them at the synapse where they are 
needed. Both agrin and the synaptic isoform of laminin are essential for the for- 
mation of normal neuromuscular junctions. Defects in components of the basal 
lamina or in proteins that tether muscle cell components to it at the synapse are 
responsible for many of the forms of muscular dystrophy, in which muscles at 
first develop normally but then degenerate in later years of life. 

Summary 

The basal lamina is a thin tough sheet of extracellular matrix that closely underlies 
epithelia in all multicellular animals. It also wraps around certain other cell types, 
such as muscle cells. All basal laminae are organized on a framework of laminin 
molecules, linked together by their side-arms and held close beneath the basal ends of 
the epithelial cells by attachment to integrins and other receptors in the basal plasma 
membrane. Type TV collagen molecules are recruited into this structure, assembling 
into a sheetlike mesh that is an essential component of all mature basal laminae. The 
collagen and laminin networks in mature basal laminae are bridged by the protein 
nidogen and the large heparan sulfate proteoglycan perlecan. 

Basal laminae provide mechanical support for epithelia; they form the interface 
and the attachment between epithelia and connective tissue; they serve as filters in the 
kidney; they act as barriers to keep cells in their proper compartments; they influence 
cell polarity and cell differentiation; they guide cell migrations; and molecules embed- 
ded in them help to organize elaborate structures such as neuromuscular synapses. 
When cells are damaged or killed, basal laminae often survive and can help guide tis- 
sue regeneration. 



INTEGRINS AND CELL-MATRIX ADHESION 

Cells make extracellular matrix, organize it, and degrade it. The matrix in its turn 
exerts powerful influences on the cells. The influences are exerted chiefly 
through transmembrane cell adhesion proteins that act as matrix receptors. 
These tie the matrix outside the cell to the cytoskeleton inside it, but their role 
goes far beyond simple passive mechanical attachment. Through them, compo- 
nents of the matrix can affect almost any aspect of a cell's behavior. The matrix 
receptors have a crucial role in epithelial cells, mediating their interactions with 
the basal lamina beneath them; and they are no less important in connective- 
tissue cells, for their interactions with the matrix that surrounds them. 

Several types of molecules can function as matrix receptors or co-receptors, 
including the transmembrane proteoglycans. But the principal receptors on ani- 
mal cells for binding most extracellular matrix proteins are the integrins. Like 
the cadherins and the key components of the basal lamina, integrins are part of 
the fundamental architectural toolkit that is characteristic of multicellular ani- 
mals. The members of this large family of homologous transmembrane adhe- 
sion molecules have a remarkable ability to transmit signals in both directions 
across the cell membrane. The binding of a matrix component to an integrin can 
send a message into the interior of the cell, and conditions in the cell interior 
can send a signal outward to control binding of the integrin to matrix (or, in 
some cases, to a cell-surface molecule on another cell, as we saw in the case of 
white blood cells binding to endothelial cells). Tension applied to an integrin 
can cause it to tighten its grip on intracellular and extracellular structures, and 
loss of tension can loosen its hold, so that molecular signaling complexes fall 
apart on either side of the membrane. In this way, integrins can also serve not 
only to transmit mechanical and molecular signals, but also to convert the one 
type of signal into the other. Studies of the structure of integrin molecules have 
begun to reveal how they perform these tasks. 
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Figure 1 9-45 The subunit structure of 
an active integrin molecule, linking 
extracellular matrix to the actin 
cytoskeleton. The head of the integrin 
molecule attaches directly to an 
extracellular protein such as fibronectin; 
the intracellular tail of the integrin binds 
to talin, which in turn binds to 
filamentous actin. A set of other 
intracellular anchor proteins, including 
a-actinin, filamin, and vinculin, help to 
reinforce the linkage. 



Integrins Are Transmembrane Heterodimers That Link to the 
Cytoskeleton 

There are many varieties of integrins — at least 24 in humans — but they all con- 
form to a common plan. An integrin molecule is composed of two noncovalently 
associated glycoprotein subunits called a and p. Both subunits span the cell 
membrane, with short intracellular C-terminal tails and large N-terminal extra- 
cellular domains. The extracellular portion of the integrin dimer binds to spe- 
cific amino acid sequences in extracellular matrix proteins such as laminin or 
fibronectin or, in some cases, to ligands on the surfaces of other cells. The intra- 
cellular portion binds to a complex of proteins that form a linkage to the 
cytoskeleton. 

For all but one of the 24 varieties of human integrins, this intracellular link- 
age is to actin filaments, via talin and a set of other intracellular anchorage pro- 
teins (Figure 19-45); talin, as we shall see later, seems to be the key component 
of the linkage. Like the actin-linked cell-cell junctions formed by cadherins, the 
actin-linked cell-matrix junctions formed by integrins may be small, inconspic- 
uous and transient, or large, prominent, and durable. Examples of the latter are 
the focal adhesions that form when fibroblasts have sufficient time to form 
strong attachments to the rigid surface of a culture dish, and the myotendinous 
junctions that attach muscle cells to their tendons. 

In epithelia, the most prominent cell-matrix attachment sites are the 
hemidesmosomes, where a specific type of integrin (a6(J4) anchors the cells to 
laminin in the basal lamina. Here, uniquely, the intracellular attachment is to 
keratin filaments, via the intracellular anchor proteins plectin and dystonin 
(Figure 19-46). 



Integrins Can Switch Between an Active and an Inactive 
Conformation 

A cell crawling through a tissue — a fibroblast or a macrophage, for example, or an 
epithelial cell migrating along a basal lamina— has to be able both to make and 
to break attachments to the matrix, and to do so rapidly if it is to travel quickly. 
<TGAT> Similarly, a circulating white blood cell has to be able to switch on or off 
its tendency to bind to endothelial cells in order to crawl out of a blood vessel at 
a site of inflammation under the appropriate circumstances. Furthermore, if 
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force is to be applied where it is needed, the making and breaking of the extra- 
cellular attachments in all these cases has to be coupled to the prompt assembly 
and disassembly of cytoskeletal attachments inside the cell. The integrin 
molecules that span the membrane and mediate the attachments cannot simply 
be passive, rigid objects with sticky patches at their two ends. They must be able 
to switch between an active state, where they readily form attachments, and an 
inactive state, where they do not; and the binding of their ligands on one side of 
the membrane must alter their propensity to bind to a different set of ligands on 
the opposite side. 

The basis for these dynamic phenomena is allosteric regulation: as an inte- 
grin binds to or detaches from its ligands, it undergoes conformational changes 
that affect both the intracellular and the extracellular ends of the molecule. 
Structural change at one end is coupled to structural change at the other, so that 
influences can be transmitted in either direction across the cell membrane. The 
timber tongs that lumberjacks use to grab hold of logs of wood provide a simple 
mechanical analogy (Figure 19-47). 

The structural changes in integrins can be demonstrated by taking a purified 
preparation of integrin molecules and examining them at high resolution by 
electron microscopy If the integrins are kept in a calcium-rich medium similar 
to normal extracellular fluid, but without any extracellular ligand, and then 
rapidly prepared for microscopy, they appear as tightly folded V-shaped objects. 
But if a small synthetic peptide containing a sequence that mimics the integrin - 
binding domain of a natural extracellular matrix protein is added to the 
medium, the integrins bind this molecule and extend into a different shape, with 
two legs that are no longer tightly bent, but are now straightened and separated 
from each other, supporting a head region high above them (Figure 19-48A). 
This pair of structures can be compared with more detailed data from x-ray crys- 
tallography, which reveals that the two legs correspond to the integrin a and p 
chains. The head region, where they meet, contains the binding site for the 
extracellular ligand. Binding of the ligand distorts this region so as to favor adop- 
tion of the extended, "active" conformation; conversely, adoption of the 
extended conformation creates a more favorable binding site, with a higher 
affinity for ligand (Figure 19-48B). 

But how do these changes in the extracellular region of the integrin relate to 
events at the intracellular end of the integrin molecule? In its folded, inactive 



Figure 19-47Timber tongs. Holding the handles together causes the 
claws to grip the log; and closing the claws on the log causes the 
handles to come together. Moreover, the greater the pull on the tongs, 
the tighter the grip at both ends. In an integrin molecule, the details of 
the linkage are different, but the mechanical principles are similar: 
conformational changes at opposite ends of the molecule are coupled, 
and pulling tightens the grip. 



Figure 19-46 Hemidesmosomes. 

(A) Hemidesmosomes spot-weld epithelial 
cells to the basal lamina, linking laminin 
outside the cell to keratin filaments inside 
it. (B) Molecular components of a 
hemisdesmosome. A specialized integrin 
(ct6|34 integrin) spans the membrane, 
attaching to keratin filaments 
intracellular^ via anchor proteins called 
plectin and dystonin, and to laminin 
extracellularly. The adhesive complex also 
contains, in parallel with the integrin, an 
unusual collagen family member, collagen 
type XVII; this has a membrane-spanning 
domain attached to its extracellular 
collagenous portion. Defects in any of 
these components can give rise to a 
blistering disease of the skin. 
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state, the intracellular portions of its a and (3 chains lie close together and adhere 
to one another. When the extracellular domain unfolds, this contact is broken 
and the intracellular (and transmembrane) portions of these chains move apart. 
As a result, a binding site for talin on the tail of the p chain is exposed. The bind- 
ing of talin then leads to assembly of actin filaments anchored to the intracellu- 
lar end of the integrin molecule (see Figure 19-45). In this way, when an integrin 
catches hold of its ligand outside the cell, the cell reacts by tying its cytoskeleton 
to the integrin molecule, so that force can be applied at the point of attachment. 
This is referred to as "outside-in activation". 

The chain of cause and effect can also operate in reverse, from inside to out- 
side instead of outside to inside. Talin competes with the integrin a chain for its 
binding site on the tail of the (3 chain. Thus when talin binds to the (J chain it 
undoes the intracellular a-p linkage, allowing the two legs of the integrin 
molecule to spring apart. This drives the extracellular portion of the integrin into 
its extended, active conformation. 

This "inside-out activation" is triggered by intracellular regulatory 
molecules. These include the phosphoinositide PIP 2 (discussed in Chapter 15), 
which is thought to be capable of activating talin so that it binds to the integrin 
p chain strongly. In this way, a signal generated inside the cell can trigger its inte- 
grin molecules to reach out and grab hold of their extracellular ligands. 

Intracellular signal molecules such as PIP 2 are themselves produced in 
response to signals received from outside the cell via other types of cell-surface 
receptors, such as G-protein-coupled receptors and receptor tyrosine kinases, 
which can thus control integrin activation (Figure 19-49). Conversely, the acti- 
vation of integrins by attachment to matrix can influence the reception of sig- 
nals by other pathways. The cross-talk between all these communication path- 
ways, transmitting signals in both directions across the cell membrane, allows 
for some complex interactions between the cell and its physical and chemical 
environment. 



Figure 19-48 Change in conformation 
of an integrin molecule when it binds its 
ligand. (A) Images were produced by 
averaging many similarly aligned electron 
micrographs of individual integrin 
molecules. In the absence of extracellular 
ligand, the integrin molecules appear 
small and tightly folded. When incubated 
with an RGD peptide, the integrins unfold 
into an extended structure with two 
distinct legs. (B) Active (extended) and 
inactive (folded) structures of an integrin 
molecule, based on data from x-ray 
crystallography. Although it is difficult to 
crystallize the intact molecule in its 
natural conformations, with and without 
ligand bound, the complete structure can 
be inferred with reasonable confidence 
from x-ray crystallography of defined 
molecular fragments. (A, From J.Takagi et 
al., Cell 1 10:599-61 1, 2002. With 
permission from Elsevier; B, based on 
T. Xiao at al., Nature 432:59-67, 2004. 
With permission from Macmillan 
Publishers Ltd.) 



Integrin Defects Are Responsible for Many Different Genetic 
Diseases 

The 24 types of integrins found in a human are formed from the products of 8 
different (3-chain genes and 18 different a-chain genes, dimerized in different 
combinations. Each integrin has distinctive properties and functions. Moreover, 
because the same integrin molecule in different cell types can have different lig- 
and-binding specificities, it seems that additional cell-type-specific factors can 
interact with integrins to modulate their binding activity. The binding of inte- 
grins to their ligands is also affected by the concentration of Ca 2+ and Mg 2+ in the 
extracellular medium, reflecting the presence of divalent-cation-binding 
domains in the a and (3 subunits. The divalent cations can influence both the 
affinity and the specificity of the binding of an integrin to its ligands. 
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Figure 19-49 Activation of integrins by cross-talk from other signaling pathways. Signals received from outside the cell 
via other types of cell surface receptors, such as G-protein-coupled receptors and receptor tyrosine kinases, can alter the 
conformation of talin and thereby activate the cell's integrins. 



Although there is some overlap in the activities of the different integrins — at 
least five bind laminin, for example — it is the diversity of integrin functions that 
is more remarkable. Table 19-4 lists some of varieties of integrins and the prob- 
lems that result when individual integrin a or P chains are defective. 

The |3l subunits form dimers with at least 12 distinct a subunits and are 
found on almost all vertebrate cells: a5pi is a fibronectin receptor and oc6pl a 
laminin receptor on many types of cells. Mutant mice that cannot make any pi 
integrins die at implantation (very early in embryonic development). Mice that 
are only unable to make the cc7 subunit (the partner for pi in muscle) survive but 
develop muscular dystrophy (as do mice that cannot make the laminin ligand 
for the a7pi integrin). 



Table 1 9-4 Some Types of Integrins 
INTEGRIN LIGAND* DISTRIBUTION 




PHENOTYPE WHEN 

a SUBINUT IS MUTATED 




a5pi 


fibronectin 


ubiquitous 


death of embryo; defects in blood 
vessels, somites, neural crest 


a6pi 


laminin 


ubiquitous 


severe skin blistering; defects in 
other epithelia also 


a7p1 


laminin 


muscle 


muscular dystrophy; defective 
myotendinous junctions 


al_p2 


Ig superfamily 


white blood cells 


impaired recruitment of 


(LFA1) 


counterreceptors 

(ICAM) 




leucocytes 


allbp3 


fibrinogen 


platelets 


bleeding; no platelet aggregation 
(Glanzmann's disease) 


a6p4 


laminin 


hemidesmosomes 


severe skin blistering; defects in 






in epithelia 


other epithelia also 




PHENOTYPE WHEN p SUBUNI 
IS MUTATED 



early death of embryo (at 

implantation) 
early death of embryo (at 

implantation) 
early death of embryo (at 

implantation) 
leucocyte adhesion deficiency (LAD) 

impaired inflammatory responses; 

recurrent life-threatening infections 
bleeding; no platelet aggregation 

(Glanzmann's disease); mild 

osteopetrosis 
severe skin blistering; defects in 

other epithelia also 



*Not all ligands are listed. 
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The (32 subunits form dimers with at least four types of a subunit and are 
expressed exclusively on the surface of white blood cells, where they have an 
essential role in enabling these cells to fight infection. The (32 integrins mainly 
mediate cell-cell rather than cell-matrix interactions, binding to specific ligands 
on another cell, such as an endothelial cell. The ligands, sometimes referred to 
as counterreceptors, are members of the Ig superfamily of cell-cell adhesion 
molecules. We have already described an example earlier in the chapter: an inte- 
grin of this class (aL[32, also known as LFA1) on white blood cells enables them 
to attach firmly to the Ig-family protein ICAM on endothelial cells at sites of 
infection and, through this attachment, to migrate out of the bloodstream into 
the infected tissue (see Figure 19-19B). People with the genetic disease called 
leucocyte adhesion deficiency fail to synthesize functional (32 subunits. As a con- 
sequence, their white blood cells lack the entire family of P2 receptors, and they 
suffer repeated bacterial infections. 

The p3 integrins are found on blood platelets (as well as various other cells), 
and they bind several matrix proteins, including the blood clotting factor fib- 
rinogen. Platelets have to interact with fibrinogen to mediate normal blood clot- 
ting, and humans with Glanzmann's disease, who are genetically deficient in P3 
integrins, suffer from defective clotting and bleed excessively 

In both white blood cells and platelets, the ability to regulate integrin activ- 
ity via inside-out signaling is particularly important. Regulated adhesion allows 
the cells to circulate unimpeded until they are activated by an appropriate stim- 
ulus. Because the integrins do not need to be synthesized de novo, the signaled 
adhesion response can be rapid. Platelets, for example, respond to contact with 
the wall of a damaged blood vessel and to various soluble signaling molecules, 
triggering activation of the p3 integrin in the platelet membrane. The resulting 
interaction of platelets with fibrinogen leads to formation of a platelet plug, 
which helps to stop the bleeding at just the site where it is needed. Similarly, the 
binding of a T lymphocyte to its specific antigen on the surface of an antigen- 
presenting cell (discussed in Chapter 25) switches on intracellular signaling 
pathways in the T cell that activate its P2 integrins (Figure 19-50). The activated 
integrins then enable the T cell to adhere strongly to the antigen-presenting cell 
so that it remains in contact long enough to become stimulated fully. The inte- 
grins may then return to an inactive state, allowing the T cell to disengage. 



Figure 19-50 Integrin activation in the 
encounter of a T lymphocyte with an 
antigen-presenting cell. The two cells at 
first adhere weakly through binding of 
the LFA1 integrin in theT cell to the 
Ig-superfamily molecule ICAM in the 
membrane of the antigen-presenting cell. 
If the T-cell receptor at the same time 
recognizes its specific antigen, presented 
to it by the MHC molecule on the 
antigen-presenting cell, an intracellular 
signal is generated from the T cell 
receptor to activate the LFA1 integrin. As 
a result, LFA1 binds more strongly and 
persistently to ICAM. This gives the 
antigen-presenting cell time to activate 
the T cell and thereby elicit a specific 
immune response. (Adapted from 
K. Murphy et al., Janeway's 
Immunobiology, 7th ed. New York: 
Garland Science, 2008.) 



Integrins Cluster to Form Strong Adhesions 

Integrins, like other cell adhesion molecules, differ from cell-surface receptors 
for hormones and for other extracellular soluble signal molecules in that they 
usually bind their ligand with lower affinity and are usually present at a 10- to 
100-fold higher concentration on the cell surface. The Velcro principle, men- 
tioned earlier, operates here too. Strong adhesion depends on clustering of 
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integrins, creating a plaque in which many cytoskeletal filaments are anchored, 
as at a hemidesmosome in the epidermis or at a focal adhesion made by a 
fibroblast on a culture dish. At focal adhesions, and probably also in the less 
prominent actin-linked cell-matrix adhesions that cells mainly make in normal 
tissues, activation of the small GTPase Rho plays a part in the maturation of the 
adhesive complex, by promoting recruitment of actin filaments and integrins to 
the contact site. Artificially mutated integrins that lack an intracellular tail fail to 
connect with cytoskeletal filaments, fail to cluster, and are unable to form strong 
adhesions. 



Extracellular Matrix Attachments Act Through Integrins to 
Control Cell Proliferation and Survival 

Like other transmembrane cell adhesion proteins, integrins do more than just 
create attachments. They also activate intracellular signaling pathways and 
thereby allow control of almost any aspect of the cell's behavior according to the 
nature of the surrounding matrix and the state of the cell's attachments to it. 

Studies in culture show that many cells will not grow or proliferate unless 
they are attached to extracellular matrix; nutrients and soluble growth factors in 
the culture medium are not enough. For some cell types, including epithelial, 
endothelial, and muscle cells, even cell survival depends on such attachments. 
When these cells lose contact with the extracellular matrix, they undergo pro- 
grammed cell death, or apoptosis. This dependence of cell growth, proliferation, 
and survival on attachment to a substratum is known as anchorage depen- 
dence, and it is mediated mainly by integrins and the intracellular signals they 
generate. Anchorage dependence is thought to help ensure that each type of cell 
survives and proliferates only when it is in an appropriate situation. Mutations 
that disrupt or override this form of control, allowing cells to escape from 
anchorage dependence, occur in cancer cells and play a major part in their inva- 
sive behavior. 

The physical spreading of a cell on the matrix also has a strong influence on 
intracellular events. Cells that are forced to spread over a large surface area by 
the formation of multiple adhesions at widely separate sites survive better and 
proliferate faster than cells that are not so spread out (Figure 19-51). The stim- 
ulatory effect of cell spreading presumably helps tissues to regenerate after 
injury. If cells are lost from an epithelium, for example, the spreading of the 
remaining cells into the vacated space will help stimulate these survivors to pro- 
liferate until they fill the gap. It is uncertain how a cell senses its extent of spread- 
ing so as to adjust its behavior accordingly, but the ability to spread depends on 
integrins, and signals generated by integrins at the sites of adhesion must play a 
part in providing the spread cells with stimulation. 

Our understanding of anchorage dependence and of the effects of cell 
spreading has come mainly from studies of cells living on the surface of matrix- 
coated culture dishes. For connective-tissue cells that are normally surrounded 
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Figure 19-51 The importance of cell 
spreading. In this experiment, cell growth 
and survival are shown to depend on the 
extent of cell spreading on a substratum, 
rather than the mere fact of attachment or 
the number of matrix molecules the cell 
contacts. (Based on C.S. Chen et al., 
Science 276:1425-1428, 1997. With 
permission from AAAS.) 
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by matrix on all sides, this is a far cry from the natural environment. Walking 
over a plain is very different from clambering through a jungle. The types of con- 
tacts that cells make with a rigid substratum are not the same as those, much 
less well studied, that they make with the deformable web of fibers of the extra- 
cellular matrix, and there are substantial differences of cell behavior between 
the two contexts. Nevertheless, it is likely that the same basic principles apply. 
Both in vitro and in vivo, intracellular signals generated at cell-matrix adhesion 
sites, by molecular complexes organized around integrins, are crucial for cell 
proliferation and survival. 



Integrins Recruit Intracellular Signaling Proteins at Sites of 
Cell-Substratum Adhesion 

The mechanisms by which integrins signal into the cell interior are complex, 
involving several different pathways, and integrins and conventional signaling 
receptors often influence one another and work together to regulate cell behav- 
ior, as we have already emphasized. The Ras/MAP kinase pathway (see Figure 
15-61), for example, can be activated both by conventional signaling receptors 
and by integrins, but cells often need both kinds of stimulation of this pathway 
at the same time to give sufficient activation to induce cell proliferation. Inte- 
grins and conventional signaling receptors also cooperate in activating similar 
pathways to promote cell survival (discussed in Chapters 15 and 17). 

One of the best-studied modes of integrin signaling depends on a cytoplas- 
mic protein tyrosine kinase called focal adhesion kinase (FAK). In studies of 
cells cultured in the normal way on rigid substrata, focal adhesions are often 
prominent sites of tyrosine phosphorylation (Figure 19-52A), and FAK is one of 
the major tyrosine-phosphorylated proteins found at these sites. When inte- 
grins cluster at cell-matrix contacts, FAK is recruited by intracellular anchor 
proteins such as talin (binding to the integrin p subunit) or paxillin (which 
binds to one type of integrin a subunit). The clustered FAK molecules cross- 
phosphorylate each other on a specific tyrosine, creating a phosphotyrosine 
docking site for members of the Src family of cytoplasmic tyrosine kinases. In 
addition to phosphorylating other proteins at the adhesion sites, these kinases 
then phosphorylate FAK on additional tyrosines, creating docking sites for a 
variety of additional intracellular signaling proteins. In this way, outside-in sig- 
naling from integrins, via FAK and Src-family kinases, is relayed into the cell (as 
discussed in Chapter 15). 

One way to analyze the function of FAK is to examine focal adhesions in cells 
from mutant mice that lack the protein. FAK- deficient fibroblasts still adhere to 



Figure 19-52 Focal adhesions and the 
role of focal adhesion kinase (FAK). 

(A) A fibroblast cultured on a fibronectin- 
coated substratum and stained with 
fluorescent antibodies: actin filaments are 
stained green and activated proteins that 
contain phosphotyrosine are red, giving 
orange where the two components 
overlap. The actin filaments terminate at 
focal adhesions, where the cell attaches 
to the substratum by means of integrins. 
Proteins containing phosphotyrosine are 
also concentrated at these sites, 
reflecting the local activation of FAK and 
other protein kinases. Signals generated 
at such adhesion sites help regulate cell 
division, growth, and survival. (B, C) The 
influence of FAK on formation of focal 
adhesions is shown by a comparison of 
normal and FAK-deficient fibroblasts, 
stained with an antibody against vinculin 
to reveal the focal adhesions. (B)The 
normal fibroblasts have fewer focal 
adhesions and have spread after 2 hours 
in culture. (C) At the same time point, the 
FAK-deficient fibroblasts have more focal 
adhesions and have not spread. 
(A, courtesy of Keith Burridge; B, C, from 
D. Ilic et al., Nature 377:539-544, 1995. 
With permission from Macmillan 
Publishers Ltd.) 
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fibronectin and form focal adhesions. In fact, they form too many focal adhe- 
sions; as a result, cell spreading and migration are slowed (Figure 19-52B and C). 
This unexpected finding suggests that FAK normally helps disassemble focal 
adhesions and that this loss of adhesions is required for normal cell migration. 
Many cancer cells have elevated levels of FAK, which may help explain why they 
are often more motile than their normal counterparts. 



Integrins Can Produce Localized Intracellular Effects 

Through FAK and other pathways, activated integrins, like other signaling recep- 
tors, can induce global cell responses, often including changes in gene expres- 
sion. But the integrins are especially adept at stimulating localized changes in 
the cytoplasm close to the cell-matrix contact. We have already mentioned an 
important example in our discussion of epithelial cell polarity: it is through inte- 
grins that the basal lamina plays its part in directing the internal apico-basal 
organization of epithelial cells. 

Localized intracellular effects may be a common feature of signaling by 
transmembrane cell adhesion proteins in general. In the developing nervous 
system, for example, the growing tip of an axon is guided mainly by its responses 
to local adhesive (and repellent) cues in the environment that are recognized by 
transmembrane cell adhesion proteins, as discussed in Chapter 22. The primary 
effects of the adhesion proteins are thought to result from the activation of intra- 
cellular signaling pathways that act locally in the axon tip, rather than through 
cell-cell adhesion itself or signals conveyed to the cell body. Through localized 
activation of the Rho family of small GTPases, for example (as discussed in 
Chapters 15 and 16), the transmembrane adhesion proteins can control motility 
and guide forward movement. In this and other ways, practically all the classes 
of cell-cell and cell-matrix adhesion molecules that we have mentioned, includ- 
ing integrins, are deployed to help guide axon outgrowth in the developing ner- 
vous system. 

Table 19-5 summarizes the categories of cell adhesion molecules that we 
have considered in this chapter. In the next section, we turn from the adhesion 
molecules in cell membranes to look in detail at the extracellular matrix that sur- 
rounds cells in connective tissues. 
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Summary 

Integrins are the principal receptors used by animal cells to bind to the extracellular 
matrix: they function as transmembrane linkers between the extracellular matrix and 
the cytoskeleton connecting usually to actin } but to intermediate filaments for the spe- 
cialized integrins at hemidesmosomes. Integrin molecules are heterodimers, and the 
binding ofligands is associated with dramatic changes of conformation. This creates 
an allosteric coupling between binding to matrix outside the cell and binding to the 
cytoskeleton inside it, allowing the integrin to convey signals in both directions across 
the plasma membrane— from inside to out and from outside to in. Binding of the 
intracellular anchor protein talin to the tail of an integrin molecule tends to drive the 
integrin into an extended conformation with increased affinity for its extracellular lig- 
and. Conversely binding to an extracellular ligand, by promoting the same conforma- 
tional change, leads to binding of talin and formation of a linkage to the actin 
cytoskeleton. Complex assemblies of proteins become organized around the intracel- 
lular tails of integrins, producing intracellular signals that can influence almost any 
aspect of cell behavior, from proliferation and survival, as in the phenomenon of 
anchorage dependence, to cell polarity and guidance of migration. 



THE EXTRACELLULAR MATRIX OF ANIMAL 
CONNECTIVE TISSUES 

We have already discussed the basal lamina as an archetypal example of extra- 
cellular matrix, common to practically all multicellular animals and an essential 
feature of epithelial tissues. We now turn to the much more varied and bulky 
forms of extracellular matrix found in connective tissues (Figure 19-53). Here, 
the extracellular matrix is generally more plentiful than the cells it surrounds, 
and it determines the tissue's physical properties. 

The classes of macromolecules constituting the extracellular matrix in ani- 
mal tissues are broadly similar, whether we consider the basal lamina or the 
other forms that matrix can take, but variations in the relative amounts of these 
different classes of molecules and in the ways in which they are organized give 
rise to an amazing diversity of materials. The matrix can become calcified to 
form the rock-hard structures of bone or teeth, or it can form the transparent 
substance of the cornea, or it can adopt the ropelike organization that gives ten- 
dons their enormous tensile strength. It forms the jelly in a jellyfish. Covering the 
body of a beetle or a lobster, it forms a rigid carapace. Moreover, the extracellu- 
lar matrix is more than a passive scaffold to provide physical support. It has an 
active and complex role in regulating the behavior of the cells that touch it, 
inhabit it, or crawl through its meshes, influencing their survival, development, 
migration, proliferation, shape, and function. 

In this section, we focus our discussion on the extracellular matrix of con- 
nective tissues in vertebrates, but bulky forms of extracellular matrix play an 
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Figure 1 9-53 The connective tissue 
underlying an epithelium. This tissue 
contains a variety of cells and extracellular 
matrix components. The predominant cell 
type is the fibroblast, which secretes 
abundant extracellular matrix. 
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important part in virtually all multicellular organisms; examples include the 
cuticles of worms and insects, the shells of mollusks, the cell walls of fungi, and, 
as we discuss later, the cell walls of plants. 



The Extracellular Matrix Is Made and Oriented by the Cells 
Within It 

The macromolecules that constitute the extracellular matrix are mainly pro- 
duced locally by cells in the matrix. As we discuss later, these cells also help to 
organize the matrix: the orientation of the cytoskeleton inside the cell can con- 
trol the orientation of the matrix produced outside. In most connective tissues, 
the matrix macromolecules are secreted largely by cells called fibroblasts (Fig- 
ure 19-54). In certain specialized types of connective tissues, such as cartilage 
and bone, however, they are secreted by cells of the fibroblast family that have 
more specific names: chondroblasts, for example, form cartilage, and osteoblasts 
form bone. 

The matrix in connective tissue is constructed from the same two main 
classes of macromolecules as in basal laminae: (1) glycosaminoglycan polysac- 
charide chains, usually covalently linked to protein in the form of proteoglycans, 
and (2) fibrous proteins such as collagen. We shall see that the members of both 
classes come in a great variety of shapes and sizes. 

The proteoglycan molecules in connective tissue typically form a highly 
hydrated, gel-like "ground substance" in which the fibrous proteins are embed- 
ded. The polysaccharide gel resists compressive forces on the matrix while per- 
mitting the rapid diffusion of nutrients, metabolites, and hormones between the 
blood and the tissue cells. The collagen fibers strengthen and help organize the 
matrix, while other fibrous proteins, such as the rubberlike elastin, give it 
resilience. Finally, many matrix proteins help cells migrate, settle, and differen- 
tiate in the appropriate locations. 
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Figure 19-54 Fibroblasts in connective 
tissue. This scanning electron 
micrograph shows tissue from the cornea 
of a rat. The extracellular matrix 
surrounding the fibroblasts is here 
composed largely of collagen fibrils. The 
glycoproteins, hyaluronan, and 
proteoglycans, which normally form a 
hydrated gel filling the interstices of the 
fibrous network, have been removed by 
enzyme and acid treatment. (From 
T. Nishida et al., invest Ophthalmol. Vis. 
Sci, 29:1887-1890, 1988. With permission 
from Association for Research in Vision 
and Opthalmology.) 



Glycosaminoglycan (GAG) Chains Occupy Large Amounts of 
Space and Form Hydrated Gels 

Glycosaminoglycans (GAGs) are unbranched polysaccharide chains composed 
of repeating disaccharide units. They are called GAGs because one of the two 
sugars in the repeating disaccharide is always an amino sugar (Af-acetylglu- 
cosamine or ^-acetylgalactosamine), which in most cases is sulfated. The sec- 
ond sugar is usually a uronic acid (glucuronic or iduronic). Because there are 
sulfate or carboxyl groups on most of their sugars, GAGs are highly negatively 
charged (Figure 19-55). Indeed, they are the most anionic molecules produced 
by animal cells. Four main groups of GAGs are distinguished by their sugars, the 
type of linkage between the sugars, and the number and location of sulfate 
groups: (1) hyaluronan, (2) chondroitin sulfate and dermatan sulfate, (3) hep- 
aran sulfate, and (4) keratan sulfate. 

Polysaccharide chains are too stiff to fold up into the compact globular 
structures that polypeptide chains typically form. Moreover, they are strongly 
hydrophilic. Thus, GAGs tend to adopt highly extended conformations that 
occupy a huge volume relative to their mass (Figure 19-56), and they form gels 
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Figure 19-55 The repeating disaccharide 
sequence of a heparan sulfate 
glycosaminoglycan (GAG) chain. These 

chains can consist of as many as 200 
disaccharide units, but are typically less 
than half that size. There is a high density 
of negative charges along the chain due 
to the presence of both carboxyl and 
sulfate groups. The proteoglycans of the 
basal lamina— perlecan, dystroglycan, and 
collagen XVIII— all carry heparan sulfate 
GAGs. The molecule is shown here with its 
maximal number of sulfate groups, in vivo, 
the proportion of sulfated and 
nonsulfated groups is variable. Heparin 
typically has >70% sulfation, while 
heparan sulfate has <50%. 
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even at very low concentrations. The weight of GAGs in connective tissue is usu- 
ally less than 10% of the weight of the fibrous proteins. But, because they form 
porous hydrated gels, the GAG chains fill most of the extracellular space. Their 
high density of negative charges attracts a cloud of cations, especially Na + , that 
are osmotically active, causing large amounts of water to be sucked into the 
matrix. This creates a swelling pressure, or turgor, that enables the matrix to 
withstand compressive forces (in contrast to collagen fibrils, which resist 
stretching forces) . The cartilage matrix that lines the knee joint, for example, can 
support pressures of hundreds of atmospheres in this way. 

Defects in the production of GAGs can affect many different body systems. 
In one rare human genetic disease, for example, there is a severe deficiency in 
the synthesis of dermatan sulfate disaccharide. The affected individuals have a 
short stature, prematurely aged appearance, and generalized defects in their 
skin, joints, muscles, and bones. 

In invertebrates, plants, and fungi, other types of polysaccharides, rather 
than GAGs, often dominate the extracellular matrix. Thus, in higher plants, as we 
discuss later, cellulose (polyglucose) chains are packed tightly together in rib- 
bonlike arrays to form the main component of the cell wall. In insects, crus- 
taceans, and other arthropods, chitin (poly-iV-acetylglucosamine) similarly 
forms the main component of the exoskeleton. Fungi, too, make their cell walls 
mainly out of chitin. Together, cellulose and chitin are the most abundant 
biopolymers on Earth. 



Hyaluronan Acts as a Space Filler and a Facilitator of Cell 
Migration During Tissue Morphogenesis and Repair 

Hyaluronan (also called hyaluronic acid or hyaluronate) is the simplest of the 
GAGs (Figure 19-57). It consists of a regular repeating sequence of up to 25,000 
disaccharide units, is found in variable amounts in all tissues and fluids in adult 
animals, and is especially abundant in early embryos. Hyaluronan is not typical 
of the majority of GAGs. In contrast with all of the others, it contains no sulfated 
sugars, all its disaccharide units are identical, its chain length is enormous 
(thousands of sugar monomers), and it is not generally linked covalently to any 
core protein. Moreover, whereas other GAGs are synthesized inside the cell and 
released by exocytosis, hyaluronan is spun out directly from the cell surface by 
an enzyme complex embedded in the plasma membrane. 

Hyaluronan is thought to have a role in resisting compressive forces in tis- 
sues and joints. It is also important as a space filler during embryonic develop- 
ment, where it can be used to force a change in the shape of a structure, as a 
small quantity expands with water to occupy a large volume (see Figure 19-56). 
Hyaluronan synthesized locally from the basal side of an epithelium can deform 
the epithelium by creating a cell-free space beneath it, into which cells subse- 
quently migrate. In the developing heart, for example, hyaluronan synthesis 
helps in this way to drive formation of the valves and septa that separate the 
heart's chambers. Similar processes occur in several other organs. When cell 
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Figure 1 9-56 The relative dimensions 
and volumes occupied by various 
macromolecules. Several proteins, a 
glycogen granule, and a single hydrated 
molecule of hyaluronan are shown. 
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Figure 19-57 The repeating disaccharide 
sequence in hyaluronan, a relatively 
simple GAG. This ubiquitous molecule in 
vertebrates consists of a single long chain 
of up to 25,000 sugar monomers. Note the 
absence of sulfate groups. 



THE EXTRACELLULAR MATRIX OF ANIMAL CONNECTIVE TISSUES 



1181 



migration ends, the excess hyaluronan is generally degraded by the enzyme 
hyaluronidase. Hyaluronan is also produced in large quantities during wound 
healing, and it is an important constituent of joint fluid, in which it serves as a 
lubricant. 

Many of the functions of hyaluronan depend on specific interactions with 
other molecules, including both proteins and proteoglycans. Some of these 
molecules that bind to hyaluronan are constituents of the extracellular matrix, 
while others are integral components of cell surfaces. 



Proteoglycans Are Composed of GAG Chains Covalently Linked to 
a Core Protein 

Except for hyaluronan, all GAGs are covalently attached to protein as proteogly- 
cans, which are produced by most animal cells. Membrane-bound ribosomes 
make the polypeptide chain, or core protein, of a proteoglycan, which is then 
threaded into the lumen of the endoplasmic reticulum. The polysaccharide 
chains are mainly assembled on this core protein in the Golgi apparatus before 
delivery to the exterior of the cell by exocytosis. First, a special linkage tetrasac- 
charide is attached to a serine side chain on the core protein to serve as a primer 
for polysaccharide growth; then, one sugar at a time is added by specific glyco- 
syl transferases (Figure 19-58). While still in the Golgi apparatus, many of the 
polymerized sugars are covalently modified by a sequential and coordinated 
series of reactions. Epimerizations alter the configuration of the substituents 
around individual carbon atoms in the sugar molecule; sulfations increase the 
negative charge. 

Proteoglycans are usually clearly distinguished from other glycoproteins by 
the nature, quantity, and arrangement of their sugar side chains. By definition, 
at least one of the sugar side chains of a proteoglycan must be a GAG. Whereas 
glycoproteins contain 1-60% carbohydrate by weight, and usually only a few 
percent, in the form of numerous relatively short, branched oligosaccharide 
chains, proteoglycans can contain as much as 95% carbohydrate by weight, 
mostly in the form of long, unbranched GAG chains, each typically about 80 sug- 
ars long. 

In principle, proteoglycans have the potential for almost limitless hetero- 
geneity. Even a single type of core protein can carry highly variable numbers and 
types of attached GAG chains. Moreover, the underlying repeating sequence of 
disaccharides in each GAG can be modified by a complex pattern of sulfate 
groups. The core proteins, too, are diverse, though many of them share some 
characteristic domains such as the LINK domain, involved in binding to GAGs. 

Proteoglycans can be huge. The proteoglycan aggrecan, for example, which 
is a major component of cartilage, has a mass of about 3 x 10 6 daltons with over 
100 GAG chains. Other proteoglycans are much smaller and have only 1-10 GAG 
chains; an example is decorin, which is secreted by fibroblasts and has a single 
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Figure 19-58 The linkage between a 
GAG chain and its core protein in a 
proteoglycan molecule. A specific link 
tetrasaccharide is first assembled on a 
serine side chain. In most cases, it is 
unclear how the particular serine is 
selected, but it seems that a specific local 
conformation of the polypeptide chain, 
rather than a specific linear sequence of 
amino acids, is recognized. The rest of the 
GAG chain, consisting mainly of a 
repeating disaccharide unit, is then 
synthesized, with one sugar being added 
at a time. In chondroitin sulfate, the 
disaccharide is composed of D-glucuronic 
acid and A/-acetyl-D-galactosamine; in 
heparan sulfate, it is D-glucosamine (or 
L-iduronic acid) and N-acetyl-D- 
glucosamine; in keratan sulfate, it is 
D-galactose and /V-acetyl-D-glucosamine. 
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GAG chain (Figure 19-59). Decorin binds to collagen fibrils and regulates fibril 
assembly and fibril diameter; mice that cannot make decorin have fragile skin 
that has reduced tensile strength. The GAGs and proteoglycans of these various 
types can associate to form even larger polymeric complexes in the extracellular 
matrix. Molecules of aggrecan, for example, assemble with hyaluronan in carti- 
lage matrix to form aggregates that are as big as a bacterium (Figure 19-60). 
Moreover, besides associating with one another, GAGs and proteoglycans associ- 
ate with fibrous matrix proteins such as collagen and with protein meshworks 
such as the basal lamina, creating extremely complex composites (Figure 19-61). 



Proteoglycans Can Regulate the Activities of Secreted Proteins 

Proeteoglycans are as diverse in function as they are in chemistry and struc- 
ture. Their GAG chains, for example, can form gels of varying pore size and 
charge density; one possible function, therefore, is to serve as selective sieves 



Figure 19-59 Examples of a small 
(decorin) and a large (aggrecan) 
proteoglycan found in the extracellular 
matrix. The figure compares these two 
proteoglycans with a typical secreted 
glycoprotein molecule, pancreatic 
ribonuclease B. All three are drawn to 
scale. The core proteins of both aggrecan 
and decorin contain oligosaccharide 
chains as well as the GAG chains, but 
these are not shown. Aggrecan typically 
consists of about 100 chondroitin sulfate 
chains and about 30 keratan sulfate 
chains linked to a serine-rich core protein 
of almost 3000 amino acids. Decorin 
"decorates" the surface of collagen fibrils, 
hence its name. 
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Figure 19-60 An aggrecan aggregate from fetal bovine cartilage. (A) An electron micrograph of an aggrecan aggregate 
shadowed with platinum. Many free aggrecan molecules are also visible. (B) A drawing of the giant aggrecan aggregate 
shown in (A). It consists of about 1 00 aggrecan monomers (each like the one shown in Figure 1 9-59) noncovalently bound 
through the N-terminal domain of the core protein to a single hyaluronan chain. A link protein binds both to the core 
protein of the proteoglycan and to the hyaluronan chain, thereby stabilizing the aggregate. The link proteins are members 
of a family of hyaluronan-binding proteins, some of which are cell-surface proteins. The molecular weight of such a 
complex can be 1 8 or more, and it occupies a volume equivalent to that of a bacterium, which is about 2 x 1 0" 12 cm 3 . 
(A, courtesy of Lawrence Rosenberg.) 
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to regulate the traffic of molecules and cells according to their size and charge, 
as in the thick basal lamina of the kidney glomerulus (see p. 1167). 

Proteoglycans have an important role in chemical signaling between cells. 
They bind various secreted signal molecules, such as certain protein growth fac- 
tors — controlling their diffusion through the matrix, their range of action, and 
their lifetime, as well as enhancing or inhibiting their signaling activity For 
example, the heparan sulfate chains of proteoglycans bind to fibroblast growth 
factors (FGFs), which (among other effects) stimulate a variety of cell types to 
proliferate; this interaction oligomerizes the growth factor molecules, enabling 
them to cross-link and activate their cell-surface receptors, which are trans- 
membrane tyrosine kinases (see Figure 15-54A). In inflammatory responses, 
heparan sulfate proteoglycans immobilize secreted chemotactic attractants 
called chemokines (discussed in Chapter 25) on the endothelial surface of a 
blood vessel at an inflammatory site. This allows the chemokines to remain there 
for a prolonged period, stimulating white blood cells to leave the bloodstream 
and migrate into the inflamed tissue.Whereas in most cases the signal molecules 
bind to the GAG chains of the proteoglycan, this is not always so. Some members 
of the transforming growth factor [3 (TGFfi) family bind to the core proteins of 
several matrix proteoglycans, including decorin; binding to decorin inhibits the 
growth factor activity. Signal molecules (including TGF(3) can also bind to 
fibrous proteins in the matrix. Vascular endothelial growth factor (VEGF), for 
example, binds to fibronectin. 

Proteoglycans also bind, and regulate the activities of, other types of 
secreted proteins, including proteolytic enzymes (proteases) and protease 
inhibitors. Thus they play a part in controlling both the assembly and the degra- 
dation of other components of the extracellular matrix, including collagen. 



Figure 19-61 Proteoglycans in the 
extracellular matrix of rat cartilage. The 

tissue was rapidly frozen at -1 96°C, and 
fixed and stained while still frozen (a 
process called freeze substitution) to 
prevent the GAG chains from collapsing. 
In this electron micrograph, the 
proteoglycan molecules are seen to form 
a fine filamentous network in which a 
single striated collagen fibril is 
embedded. The more darkly stained parts 
of the proteoglycan molecules are the 
core proteins; the faintly stained threads 
are the GAG chains. (Reproduced from 
E.B. Hunziker and R.K. Schenk, J. Cell Biol. 
98:277-282, 1984. With permission from 
The Rockefeller University Press.) 



Cell-Surface Proteoglycans Act as Co-Receptors 

Not all proteoglycans are secreted components of the extracellular matrix. Some 
are integral components of plasma membranes and have their core protein 
either inserted across the lipid bilayer or attached to the lipid bilayer by a glyco- 
sylphosphatidylinositol (GPI) anchor. Some of these plasma membrane proteo- 
glycans act as co-receptors that collaborate with conventional cell-surface recep- 
tor proteins. In addition, some conventional receptors have one or more GAG 
chains and are therefore proteoglycans themselves. 

Among the best-characterized plasma membrane proteoglycans are the syn- 
decans, which have a membrane-spanning core protein whose intracellular 
domain is thought to interact with the actin cytoskeleton and with signaling 
molecules in the cell cortex. Syndecans are located on the surface of many types 
of cells, including fibroblasts and epithelial cells. In fibroblasts, syndecans can 
be found in cell-matrix adhesions, where they modulate integrin function by 
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Table 19-6 Some Common Proteoglycans 



PROTEOGLYCAN APPROXIMATE 


TYPE OF GAG 


NUMBER OF 


LOCATION 


FUNCTIONS 


MOUCULAR 


CHAINS 


GAG CHAINS 






WEJGHTOF 










CORE PROTEIN 











Aggrecan 

Betaglycan 
Decorin 
Perlecan 
Syndecan-1 



Dally (in 
Drosophila) 



210,000 chondroitin sulfate -130 

+ keratan sulfate 

(in separate chains) 
36,000 chondroitin sulfate/ 1 

dermatan sulfate 
40,000 chondroitin sulfate/ 1 

dermatan sulfate 
600,000 heparan sulfate 2-1 5 

32,000 chondroitin sulfate 1-3 

+ heparan sulfate 
(in separate chains) 

60,000 heparan sulfate 1-3 



cartilage 



cell surface 

and matrix 
widespread in 

connective tissues 
basal laminae 

cell surface 



cell surface 



mechanical support; 

forms large aggregates 

with hyaluronan 
binds TGFp 

binds to type I collagen 
fibrils and TGFp 

structural and filtering 
function in basal lamina 

cell adhesion; binds FGF 
and other growth factors 

co-receptor for Wingless 
and Decapentaplegic 
signaling proteins 



interacting with fibronectin on the cell surface and with cytoskeletal and sig- 
naling proteins inside the cell. Syndecans also bind FGFs and present them to 
FGF receptor proteins on the same cell. Similarly, another plasma membrane 
proteoglycan, called betaglycan, binds TGFp and presents it to TGFp receptors. 

The importance of proteoglycans as co-receptors and as regulators of the 
distribution and activity of signal molecules is illustrated by the severe develop- 
mental defects that can occur when specific proteoglycans are inactivated by 
mutation. In Drosophila, for example, the products of the Dally and Dally-like 
genes, coding for members of the glypican family, with heparan sulfate side 
chains, are needed for signaling by no less than four of the main signal proteins 
that govern the patterning of the embryo (Wingless, Hedgehog, FGF, and 
Decapentaplegic (Dpp), as discussed in Chapter 22). For some of these signal 
pathways, mutations in Dally or Dally-like mimic the effects of mutations in the 
genes that code for the signal proteins themselves. 

The structure, function, and location of some of the proteoglycans discussed 
in this chapter are summarized in Table 19-6. 



Collagens Are the Major Proteins of the Extracellular Matrix 

The fibrous proteins are no less important than the proteoglycans as compo- 
nents of the extracellular matrix. Foremost among them are the collagens, a 
family of fibrous proteins found in all multicellular animals. Collagens are 
secreted in large quantities by connective tissue cells, and in smaller quantities 
by many other cell types. As a major component of skin and bone, they are the 
most abundant proteins in mammals, where they constitute 25% of the total 
protein mass. 

The primary feature of a typical collagen molecule is its long, stiff, triple- 
stranded helical structure, in which three collagen polypeptide chains, called a 
chains, are wound around one another in a ropelike superhelix (Figure 19-62). 
Collagens are extremely rich in proline and glycine, both of which are important 
in the formation of the triple-stranded helix. Proline, because of its ring struc- 
ture, stabilizes the helical conformation in each a chain, while glycine is regu- 
larly spaced at every third residue throughout the central region of the a chain. 
Being the smallest amino acid (having only a hydrogen atom as a side chain), 
glycine allows the three helical a chains to pack tightly together to form the final 
collagen superhelix. 

The human genome contains 42 distinct genes coding for different collagen 
a chains. Different combinations of these genes are expressed in different tis- 
sues. Although in principle thousands of types of triple-stranded collagen 
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Figure 19-62 The structure of a typical collagen molecule. (A) A model of 
part of a single collagen a chain in which each amino acid is represented 
by a sphere. The chain is about 1 000 amino acids long. It is arranged as a 
left-handed helix, with three amino acids per turn and with glycine as 
every third amino acid. Therefore, an a chain is composed of a series of 
triplet Gly-X-Y sequences, in which X and Y can be any amino acid 
(although X is commonly proline and Y is commonly hydroxyproline). 
(B) A model of part of a collagen molecule in which three a chains, each 
shown in a different color, are wrapped around one another to form a 
triple-stranded helical rod. Glycine is the only amino acid small enough to 
occupy the crowded interior of the triple helix. Only a short length of the 
molecule is shown; the entire molecule is 300 nm long. (From a model by 
B.LTrus.) 



molecules could be assembled from various combinations of the 42 a chains, 
only a limited number of triple-helical combinations are possible, and less than 
40 types of collagen molecules have been found. Type I is by far the most com- 
mon, being the principal collagen of skin and bone. It belongs to the class of fib- 
rillar collagens, or fibril-forming collagens, which have long ropelike structures 
with few or no interruptions. After being secreted into the extracellular space, 
these collagen molecules assemble into higher-order polymers called collagen 
fibrils, which are thin structures (10-300 nm in diameter) many hundreds of 
micrometers long in mature tissues and clearly visible in electron micrographs 
(Figure 19-63; see also Figure 19-61). Collagen fibrils often aggregate into larger, 
cablelike bundles, several micrometers in diameter, that are visible in the light 
microscope as collagen fibers. 

Collagen types IX and XII are called fibril-associated collagens because they 
decorate the surface of collagen fibrils. They are thought to link these fibrils to 
one another and to other components in the extracellular matrix. Type IV, as we 
have already seen, is a network-forming collagen, forming a major part of basal 
laminae, while type VII molecules form dimers that assemble into specialized 
structures called anchoring fibrils. Anchoring fibrils help attach the basal lamina 
of multilayered epithelia to the underlying connective tissue and therefore are 
especially abundant in the skin. 

There are also a number of "collagen-like" proteins, including type XVII, 
which has a transmembrane domain and is found in hemidesmosomes, and 
type XVIII, which we mentioned earlier as the core protein of a proteoglycan in 
basal laminae. 

Many proteins appear to have evolved by repeated duplications of an origi- 
nal DNA sequence, giving rise to a repetitive pattern of amino acids. The genes 
that encode the a chains of most of the fibrillar collagens provide a good exam- 
ple: they are very large (up to 44 kilobases in length) and contain about 50 exons. 
Most of the exons are 54, or multiples of 54, nucleotides long, suggesting that 
these collagens originated through multiple duplications of a primordial gene 
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Figure 1 9-63 A fibroblast surrounded 
by collagen fibrils in the connective 
tissue of embryonic chick skin. In this 
electron micrograph, the fibrils are 
organized into bundles that run 
approximately at right angles to one 
another. Therefore, some bundles are 
oriented longitudinally, whereas others 
are seen in cross section. The collagen 
fibrils are produced by the fibroblasts, 
which contain abundant endoplasmic 
reticulum, where secreted proteins such 
as collagen are synthesized. (From 
C. Ploetz, E.I. Zycband and D.E. Birk, 
J. Struct. Biol. 1 06:73-81 , 1 991 . With 
permission from Elsevier.) 
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Table 1 9-7 Some Types of Collagen and Their Properties 



fYPE POLYMERIZED FORM 



TISSUE DISTRIBUTE 



MUTANT PHENOTYPE 



Fibril-forming 


1 


fibril 


bone, skin, tendons, ligaments, cornea, 


severe bone defects, fractures 


(fibrillar) 






internal organs (accounts for 90% of 
body collagen) 






II 


fibril 


cartilage, invertebral disc, notochord, 
vitreous humor of the eye 


cartilage deficiency, dwarfism 




ill 


fibril 


skin, blood vessels, internal organs 


fragile skin, loose joints, blood 
vessels prone to rupture 




V 


fibril (with type 1) 


as for type 1 


fragile skin, loose joints, blood 
vessels prone to rupture 




XI 


fibril (with type II) 


as for type II 


myopia, blindness 


Fibril-associated 


IX 


lateral association 
with type II fibrils 


cartilage 


osteoarthritis 


Network-forming 


IV 


sheetlike network 


basal lamina 


kidney disease 
(glomerulonephritis), deafness 




VII 


anchoring fibrils 


beneath stratified squamous epitheita 


skin blistering 


Transmembrane 


XVII 


non-fibrillar 


hemidesmosomes 


skin blistering 


Proteoglycan core 


XVIII 


non-fibrillar 


basal lamina 


myopia, detached retina, 


protein 








hydrocephalus 



Note that types I, IV, V, IX, and XI are each composed of two or three types of a chains (distinct, nonoverlapping sets in each case), whereas types I 
III, VII, XII, XVII, and XVIII are composed of only one type of a chain each Only 10 types of collagen are shown, but about 27 types of collagen and 
42 types of a chains have been identified in humans 



containing 54 nucleotides and encoding exactly 6 Gly-X-Y repeats (see Figure 
19-62). 

Table 19-7 provides additional details for some of the collagen types dis- 
cussed in this chapter. 



Collagen Chains Undergo a Series of Post-Translational 
Modifications 

Individual collagen polypeptide chains are synthesized on membrane-bound 
ribosomes and injected into the lumen of the endoplasmic reticulum (ER) as 
larger precursors, called pro-a chains. These precursors not only have the short 
amino-terminal signal peptide required to direct the nascent polypeptide to 
the ER, but also have, at both their N- and C-terminal ends, additional amino 
acids, called propeptides, that are clipped off at a later step of collagen assem- 
bly. Moreover, in the lumen of the ER, selected prolines and lysines are hydrox- 
ylated to form hydroxyproline and hydroxyzine, respectively, and some of the 
hydroxy-lysines are glycosylated. Each pro-a chain then combines with two 
others to form a hydrogen-bonded, triple-stranded, helical molecule known as 
procollagen. 

Hydroxylysines and hydroxyprolines (Figure 19-64) are infrequently found 
in other animal proteins, although hydroxyproline is abundant in some pro- 
teins in the plant cell wall. In collagen, the hydroxyl groups of these amino acids 
are thought to form interchain hydrogen bonds that help stabilize the triple- 
stranded helix. Conditions that prevent proline hydroxylation, such as a defi- 
ciency of ascorbic acid (vitamin C), have serious consequences. In scurvy, the 
often fatal disease caused by a dietary deficiency of vitamin C that was com- 
mon in sailors until the nineteenth century, the defective pro-a chains that are 
synthesized fail to form a stable triple helix and are immediately degraded 
within the cell, and synthesis of new collagen is inhibited. In healthy tissues, 
collagen is continually degraded and replaced (with a turnover time of months 
or years, depending on the tissue). In scurvy, replacement fails, and within a 
few months, with the gradual loss of the preexisting normal collagen in the 
matrix, blood vessels become fragile, teeth become loose in their sockets, and 
wounds cease to heal. 
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Figure 1 9-64 Hydroxylysine and hydroxyproline. These modified amino 
acids are common in collagen. They are formed by enzymes that act after 
the lysine and proline have been incorporated into procollagen molecules. 



Propeptides Are Clipped Off Procollagen After its Secretion, to 
Allow Assembly of Fibrils 

After secretion, the propeptides of the fibrillar procollagen molecules are 
removed by specific proteolytic enzymes outside the cell. This converts the pro- 
collagen molecules to collagen molecules, which assemble in the extracellular 
space to form much larger collagen fibrils. The propeptides have at least two 
functions. First, they guide the intracellular formation of the triple-stranded col- 
lagen molecules. Second, because they are retained until after secretion, they 
prevent the intracellular formation of large collagen fibrils, which could be 
catastrophic for the cell. 

The process of fibril formation is driven, in part, by the tendency of the col- 
lagen molecules, which are more than a thousandfold less soluble than procol- 
lagen molecules, to self- assemble. The fibrils begin to form close to the cell sur- 
face, often in deep infoldings of the plasma membrane formed by the fusion of 
secretory vesicles with the cell surface. The underlying cortical cytoskeleton can 
influence the sites, rates, and orientation of fibril assembly 

When viewed in an electron microscope, collagen fibrils have characteristic 
cross-striations every 67 nm, reflecting the regularly staggered packing of the 
individual collagen molecules in the fibril. After the fibrils have formed in the 
extracellular space, they are greatly strengthened by the formation of covalent 
cross-links between lysine residues of the constituent collagen molecules (Fig- 
ure 19-65). The types of covalent bonds involved are found only in collagen and 
elastin. If cross-linking is inhibited, the tensile strength of the fibrils is drastically 
reduced: collagenous tissues become fragile, and structures such as skin, ten- 
dons, and blood vessels tend to tear. The extent and type of cross-linking vary 
from tissue to tissue. Collagen is especially highly cross-linked in the Achilles 
tendon, for example, where tensile strength is crucial. 

Figure 19-66 summarizes the steps in the synthesis and assembly of colla- 
gen fibrils. Given the large number of enzymatic processes involved, it is not sur- 
prising that there are many human genetic diseases that affect fibril formation. 
Mutations affecting type I collagen cause osteogenesis imperfecta, characterized 
by weak bones that fracture easily. Mutations affecting type II collagen cause 
chondrodysplasias, characterized by abnormal cartilage, which leads to bone 
and joint deformities. And mutations affecting type III collagen cause 
Ehlers-Danlos syndrome, characterized by fragile skin and blood vessels and 
hypermobile joints. 
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Secreted Fibril-Associated Collagens Help Organize the Fibrils 

In contrast to GAGs, which resist compressive forces, collagen fibrils form struc- 
tures that resist tensile forces. The fibrils have various diameters and are orga- 
nized in different ways in different tissues. In mammalian skin, for example, they 
are woven in a wickerwork pattern so that they resist tensile stress in multiple 
directions; leather consists of this material, suitably preserved. In tendons, col- 
lagen fibrils are organized in parallel bundles aligned along the major axis of 
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Figure 19-65 Cross-links formed 
between modified lysine side chains 
within a collagen fibril. Covalent 
intramolecular and intermolecular 
cross-links are formed in several steps. 
First, the extracellular enzyme lysyl 
oxidase deaminates certain lysines and 
hydroxyzines, to yield highly reactive 
aldehyde groups. The aldehydes then 
react spontaneously to form covalent 
bonds with each other or with other 
lysines or hydroxy lysines. Most of the 
cross-links form between the short 
nonhelical segments at each end of the 
collagen molecules. 
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1. SYNTHESIS OF PRO-oc CHAIN 




collagen fibril 



Figure 1 9-66 The intracellular and extracellular events in the formation 
of a collagen fibril. (A) Note that procollagen assembles into collagen 
fibrils in the extracellular space, often within large infoldings in the plasma 
membrane (not shown). As one example of how collagen fibrils can form 
ordered arrays in the extracellular space, they are shown further 
assembling into large collagen fibers, which are visible in the light 
microscope. The covalent cross-links that stabilize the extracellular 
assemblies are omitted in this picture. (B) Electron micrograph of a 
negatively stained collagen fibril reveals its typical striated appearance. 
(B, courtesy of Robert Home.) 
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tension. In mature bone and in the cornea, they are arranged in orderly ply- 
woodlike layers, with the fibrils in each layer lying parallel to one another but 
nearly at right angles to the fibrils in the layers on either side. The same arrange- 
ment occurs in tadpole skin (Figure 19-67). 

The connective tissue cells themselves determine the size and arrangement 
of the collagen fibrils. The cells can express one or more genes for the different 
types of fibrillar procollagen molecules. But even fibrils composed of the same 
mixture of fibrillar collagen molecules have different arrangements in different 
tissues. How is this achieved? Part of the answer is that cells can regulate the dis- 
position of the collagen molecules after secretion by guiding collagen fibril for- 
mation in close association with the plasma membrane (see Figure 19-66). In 
addition, cells can influence this organization by secreting, along with their fib- 
rillar collagens, different kinds and amounts of other matrix macromolecules. In 
particular, they secrete the fibrous protein fibronectin, as we shall discuss later, 
and this precedes the formation of collagen fibrils and helps guide their organi- 
zation. 

Fibril- associated collagens, such as types IX and XII collagens, are thought 
to be especially important in this regard. They differ from fibrillar collagens in 
several ways. 

1. Their triple-stranded helical structure is interrupted by one or two short 
nonhelical domains, which makes the molecules more flexible than fibril- 
lar collagen molecules. 
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2. They are not cleaved after secretion and therefore retain their propeptides. 

3. They do not aggregate with one another to form fibrils in the extracellular 
space. Instead, they bind in a periodic manner to the surface of fibrils 
formed by the fibrillar collagens. Type IX molecules bind to type-II-colla- 
gen- containing fibrils in cartilage, the cornea, and the vitreous of the eye 
(Figure 19-68), whereas type XII molecules bind to type-I-collagen-con- 
taining fibrils in tendons and various other tissues. 

Fibril-associated collagens are thought to mediate the interactions of colla- 
gen fibrils with one another and with other matrix macromolecules to help 
determine the organization of the fibrils in the matrix. 



Cells Help Organize the Collagen Fibrils They Secrete by Exerting 
Tension on the Matrix 

Cells interact with the extracellular matrix mechanically as well as chemically, 
and studies in culture suggest that this mechanical interaction can have dra- 
matic effects on the architecture of connective tissue. Thus, when fibroblasts are 
mixed with a meshwork of randomly oriented collagen fibrils that form a gel in 
a culture dish, the fibroblasts tug on the meshwork, drawing in collagen from 
their surroundings and thereby causing the gel to contract to a small fraction of 
its initial volume. By similar activities, a cluster of fibroblasts surrounds itself 
with a capsule of densely packed and circumferentially oriented collagen fibers. 

If two small pieces of embryonic tissue containing fibroblasts are placed far 
apart on a collagen gel, the intervening collagen becomes organized into a com- 
pact band of aligned fibers that connect the two explants (Figure 19-69). The 
fibroblasts subsequently migrate out from the explants along the aligned colla- 
gen fibers. Thus, the fibroblasts influence the alignment of the collagen fibers, 
and the collagen fibers in turn affect the distribution of the fibroblasts. 

Fibroblasts may have a similar role in organizing the extracellular matrix 
inside the body, first of all synthesizing the collagen fibrils and depositing them 
in the correct orientation, then working on the matrix they have secreted, crawl- 
ing over it and tugging on it so as to create tendons and ligaments and the tough, 
dense layers of connective tissue that ensheathe and bind together most organs. 
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Figure 19-67 Collagen fibrils in the 
tadpole skin. This electron micrograph 
shows the plywoodlike arrangement of 
the fibrils: successive layers of fibrils are 
laid down nearly at right angles to each 
other. This organization is also found in 
mature bone and in the cornea. (Courtesy 
of Jerome Gross.) 



Elastin Gives Tissues Their Elasticity 

Many vertebrate tissues, such as skin, blood vessels, and lungs, need to be both 
strong and elastic in order to function. A network of elastic fibers in the extra- 
cellular matrix of these tissues gives them the required resilience so that they 
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Figure 1 9-68 Type IX collagen. (A) Type IX collagen molecules binding in a 
periodic pattern to the surface of a fibril containing type II collagen. 
(B) Electron micrograph of a rotary-shadowed type-ll-collagen-containing 
fibril in cartilage, sheathed in type IX collagen molecules. (C) An individual 
type IX collagen molecule. (B and C, from L Vaughan et al., J. Cell Biol. 
106:991-997, 1988. With permission from The Rockefeller University Press.) 
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can recoil after transient stretch (Figure 19-70). Elastic fibers are at least five 
times more extensible than a rubber band of the same cross-sectional area. 
Long, inelastic collagen fibrils are interwoven with the elastic fibers to limit the 
extent of stretching and prevent the tissue from tearing. 

The main component of elastic fibers is elastin, a highly hydrophobic pro- 
tein (about 750 amino acids long), which, like collagen, is unusually rich in pro- 
line and glycine but, unlike collagen, is not glycosylated and contains some 
hydroxyproline but no hydroxylysine. Soluble tropoelastin (the biosynthetic pre- 
cursor of elastin) is secreted into the extracellular space and assembled into 
elastic fibers close to the plasma membrane, generally in cell-surface infoldings. 
After secretion, the tropoelastin molecules become highly cross-linked to one 
another, generating an extensive network of elastin fibers and sheets. A mecha- 
nism similar to the one that operates in cross-linking collagen molecules forms 
cross-links between the lysines. 

The elastin protein is composed largely of two types of short segments that 
alternate along the polypeptide chain: hydrophobic segments, which are 
responsible for the elastic properties of the molecule; and alanine- and lysine- 
rich oc-helical segments, which form cross-links between adjacent molecules. 
Each segment is encoded by a separate exon. There is still uncertainty concern- 
ing the conformation of elastin molecules in elastic fibers and how the structure 
of these fibers accounts for their rubberlike properties. However, it seems that 
parts of the elastin polypeptide chain, like the polymer chains in ordinary rub- 
ber, adopt a loose "random coil" conformation, and it is the random coil nature 
of the component molecules cross-linked into the elastic fiber network that 
allows the network to stretch and recoil like a rubber band (Figure 19-71). 

Elastin is the dominant extracellular matrix protein in arteries, comprising 
50% of the dry weight of the largest artery— the aorta. Mutations in the elastin 
gene causing a deficiency of the protein in mice or humans result in narrowing 
of the aorta or other arteries and excessive proliferation of smooth muscle cells 
in the arterial wall. Apparently the normal elasticity of an artery is required to 
restrain the proliferation of these cells. 

Elastic fibers do not consist solely of elastin. The elastin core is covered with 
a sheath of microfibrils, each of which has a diameter of about 10 nm. The 
microfibrils appear before elastin in developing tissues and seem to provide 
scaffolding to guide elastin deposition. Arrays of microfibrils are elastic in their 
own right, and in some places they persist in the absence of elastin: they help to 
hold the lens in its place in the eye, for example. Microfibrils are composed of a 
number of distinct glycoproteins, including the large glycoprotein fibrillin, 
which binds to elastin and is essential for the integrity of elastic fibers. Muta- 
tions in the fibrillin gene result in Marfan's syndrome, a relatively common 
human genetic disorder. In the most severely affected individuals the aorta is 
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Figure 1 9-69 The shaping of the 
extracellular matrix by cells. This 
micrograph shows a region between two 
pieces of embryonic chick heart (rich in 
fibroblasts as well as heart muscle cells) 
that were cultured on a collagen gel for 
4 days. A dense tract of aligned collagen 
fibers has formed between the explants, 
presumably as a result of the fibroblasts 
in the explants tugging on the collagen. 
(From D. Stopak and A.K. Harris, Dev. Biol. 
90:383-398, 1982. With permission from 
Academic Press.) 




1 rum 



100 (j m 



Figure 19-70 Elastic fibers. These 
scanning electron micrographs show (A) 
a low- power view of a segment of a dog's 
aorta and (B) a high-power view of the 
dense network of longitudinally oriented 
elastic fibers in the outer layer of the 
same blood vessel. All the other 
components have been digested away 
with enzymes and formic acid. (From 
K.S. Haas et al., Anal Rec 230:86-96, 
1 991 . With permission from Wiley-Liss.) 
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prone to rupture; other common effects include displacement of the lens and 
abnormalities of the skeleton and joints. Affected individuals are often unusu- 
ally tall and lanky: Abraham Lincoln is suspected to have had the condition. 

Fibronectin Is an Extracellular Protein That Helps Cells Attach to 
the Matrix 

The extracellular matrix contains a number of noncollagen proteins that typi- 
cally have multiple domains, each with specific binding sites for other matrix 
macromolecules and for receptors on the surface of cells. These proteins there- 
fore contribute to both organizing the matrix and helping cells attach to it. Like 
the proteoglycans, they also guide cell movements in developing tissues, by 
serving as tracks along which cells can migrate or as repellents that keep cells 
out of forbidden areas. 

The first of this class of matrix proteins to be well characterized was 
fibronectin, a large glycoprotein found in all vertebrates and important for 
many cell-matrix interactions. Thus, for example, mutant mice that are unable 
to make fibronectin die early in embryogenesis because their endothelial cells 
fail to form proper blood vessels. The defect is thought to result from abnormal- 
ities in the interactions of these cells with the surrounding extracellular matrix, 
which normally contains fibronectin. 

Fibronectin is a dimer composed of two very large subunits joined by disul- 
fide bonds at one end. Each subunit is folded into a series of functionally distinct 
domains separated by regions of flexible polypeptide chain (Figure 19-72). The 
domains in turn consist of smaller modules, each of which is serially repeated 
and usually encoded by a separate exon, suggesting that the fibronectin gene, 
like the collagen genes, evolved by multiple exon duplications. In the human 
genome, there is only one fibronectin gene, containing about 50 exons of simi- 
lar size, but the transcripts can be spliced in different ways to produce many dif- 
ferent fibronectin isoforms. Sites in the main type of module, called the type III 
fibronectin repeat, bind to integrins and thereby to cell surfaces. This module is 
about 90 amino acids long and occurs at least 15 times in each subunit. The type 
III fibronectin repeat is among the most common of all protein domains in ver- 
tebrates. 



Figure 19-71 Stretching a network of 
elastin molecules. The molecules are 
joined together by covalent bonds (red) 
to generate a cross-linked network. In 
this model, each elastin molecule in the 
network can extend and contract in a 
manner resembling a random coil, so that 
the entire assembly can stretch and recoil 
like a rubber band. 



Tension Exerted by Cells Regulates the Assembly of Fibronectin 
Fibrils 

Fibronectin can exist both in a soluble form, circulating in the blood and other 
body fluids, and as insoluble fibronectin fibrils, in which fibronectin dimers are 
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Figure 1 9-72 The structure of a fibronectin dimer. (A) Electron micrographs of individual fibronectin dimer molecules 
shadowed with platinum; red arrows mark the C-termini. (B) The two polypeptide chains are similar but generally not 
identical (being made from the same gene but from differently spliced mRNAs). They are joined by two disulfide bonds 
near the C-termini. Each chain is almost 2500 amino acids long and is folded into five or six domains connected by flexible 
polypeptide segments. Individual domains contain modules specialized for binding to a particular molecule or to a cell, as 
indicated for five of the domains. For simplicity, not all of the known binding sites are shown (there are other cell-binding 
sites, for example). (C) The three-dimensional structure of two type III fibronectin repeat modules as determined by x-ray 
crystallography. The type III repeat is the main repeating module in fibronectin. Both the Arg-Gly-Asp (RGD) and the 
"synergy" sequences shown in red form part of the major cell-binding site, and are important for binding. (A, from J. Engel 
et al., J. Mol. Biol. 1 50:97-1 20, 1 981 . With permission from Academic Press; C, from Daniel J. Leahy, Annu. Rev. Ceil Dev. Biol. 
13:363-393, 1997. With permission from Annual Reviews.) 



cross-linked to one another by additional disulfide bonds and form part of the 
extracellular matrix. Unlike fibrillar collagen molecules, however, which can 
self-assemble into fibrils in a test tube, fibronectin molecules assemble into fib- 
rils only on the surface of cells, and only where those cells possess appropriate 
fibroneetin-binding proteins^in particular, integrins. The integrins provide a 
linkage from the fibronectin outside the cell to the actin cytoskeleton inside it 
The linkage transmits tension to the fibronectin molecules— provided that they 
also have an attachment to some other structure— and stretches them, exposing 
a cryptic (hidden) binding site in the fibronectin molecules (Figure 19-73). This 
allows them to bind directly to one another and to recruit additional fibronectin 
molecules to form a fibril (Figure 19-74). This dependence on tension and inter- 
action with cell surfaces ensures that fibronectin fibrils assemble where there is 
a mechanical need for them and not in inappropriate locations such as the 
bloodstream. 




Figure 19-73 Unfolding of a type III 
fibronectin domain in response to 
tension. The stretching of fibronectin in 
this way exposes cryptic binding sites 
that cause the stretched molecules to 
assemble into filaments, as shown in 
Figure 1 9-74. (From V. Vogel and 
M. Sheetz, Nat. Rev. Mol. Cell Biol. 
7:265-275, 2006. With permission from 
Macmillan Publishers Ltd.) 
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Many other extracellular matrix proteins have multiple repeats similar to the 
type III fibronectin repeat, and it is possible that tension exerted on these pro- 
teins also uncovers cryptic binding sites and thereby influences their polymer- 
ization. 



Fibronectin Binds to Integrins Through an RGD Motif 

One way to analyze a complex multifunctional protein molecule such as 
fibronectin is to chop it into pieces and determine the function of its individual 
domains. When fibronectin is treated with a low concentration of a proteolytic 
enzyme, the polypeptide chain is cut in the connecting regions between the 
domains, leaving the domains themselves intact. One can then show that one of 
its domains binds to collagen, another to heparin, another to specific receptors 
on the surface of various types of cells, and so on (see Figure 19-72B). Synthetic 
peptides corresponding to different segments of the cell-binding domain have 
been used to identify a specific tripeptide sequence [Arg-Gly-Asp, or RGD), 
which is found in one of the type III repeats (see Figure 19-72C), as a central fea- 
ture of the cell-binding site. Even very short peptides containing this RGD 
sequence can compete with fibronectin for the binding site on cells, thereby 
inhibiting the attachment of the cells to a fibronectin matrix. If these peptides 
are coupled to a solid surface, they cause cells to adhere to it. 

Several extracellular proteins besides fibronectin also have an RGD 
sequence that mediates cell-surface binding. Some of these are involved in 
blood clotting, and peptides containing the RGD sequence have been useful in 
the development of anti-clotting drugs. Some snakes use a similar strategy to 
cause their victims to bleed: they secrete RGD-containing anti-clotting proteins 
called disintegrins into their venom. 

The cell-surface receptors that bind RGD-containing proteins are members 
of the integrin family. Each integrin, however, specifically recognizes its own 
small set of matrix molecules, indicating that tight binding requires more than 
just the RGD sequence. Moreover, RGD sequences are not the only sequence 
motifs used for binding to integrins: many integrins recognize and bind to other 
motifs instead. 




Figure 19-74 Organization of 
fibronectin into fibrils at the cell 
surface. The picture shows the front end 
of a migrating mouse fibroblast. The 
extracellular adhesion protein fibronectin 
is stained green, and the intracellular 
filaments of actin, red. The fibronectin is 
initially present as small dot-like 
aggregates near the leading edge of the 
cell. It accumulates at focal adhesions 
(sites of anchorage of actin filaments) 
and becomes organized into fibrils 
parallel to the actin filaments. Integrin 
molecules spanning the cell membrane 
link the fibronectin outside the cell to the 
actin filaments inside it. Tension exerted 
on the fibronectin molecules through this 
linkage is thought to stretch them, 
exposing binding sites that promote fibril 
formation. {Courtesy of Roumen Pankov 
and Kenneth Yamada.) 



Cells Have to Be Able to Degrade Matrix, as Well as Make it 

The ability of cells to degrade and destroy extracellular matrix is as important 
as their ability to make it and bind to it. Rapid matrix degradation is required 
in processes such as tissue repair, and even in the seemingly static extracellu- 
lar matrix of adult animals there is a slow, continuous turnover, with matrix 
macromolecules being degraded and resynthesized. This allows bone, for 
example, to be remodelled so as to adapt to the stresses on it, as discussed in 
Chapter 23. 

From the point of view of individual cells, the ability to cut through matrix is 
crucial in two ways: it enables them to divide while embedded in matrix, and it 
enables them to travel through it. As we have already mentioned, cells in con- 
nective tissues generally need to be able to stretch out in order to divide. If a cell 
lacks the enzymes needed to cut through the surrounding matrix or is embed- 
ded in a matrix that resists their action, it remains rounded, unable to extend 
processes because the matrix is impenetrable; as a result, the cell is strongly 
inhibited from dividing, as well as being hindered from migrating. 

Localized degradation of matrix components is also required wherever cells 
have to escape from confinement by a basal lamina. It is needed during normal 
branching growth of epithelial structures such as glands, for example, to allow 
the population of epithelial cells to expand, and needed also when white blood 
cells migrate across the basal lamina of a blood vessel into tissues in response to 
infection or injury. Less benignly, matrix degradation is important both for the 
spread of cancer cells through the body and for the ability of the cancer cells to 
proliferate in the tissues that they invade (discussed in Chapter 20). 
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Matrix Degradation Is Localized to the Vicinity of Cells 

In general, matrix components are degraded by extracellular proteolytic 
enzymes (proteases) that act close to the cells that produce them. Thus, anti- 
bodies that recognize the products of proteolytic cleavage stain matrix only 
around cells. Many of these proteases belong to one of two general classes. Most 
are matrix metalloproteases, which depend on bound Ca 2+ or Zn 2+ for activity; 
the others are serine proteases, which have a highly reactive serine in their 
active site. Together, metalloproteases and serine proteases cooperate to 
degrade matrix proteins such as collagen, laminin, and fibronectin. Some met- 
alloproteases, such as the collagenases, are highly specific, cleaving particular 
proteins at a small number of sites. In this way the structural integrity of the 
matrix is largely retained, while the limited amount of proteolysis that occurs is 
sufficient for cell migration. Other metalloproteases may be less specific, but, 
because they are anchored to the plasma membrane, they can act just where 
they are needed; it is this type of matrix metalloprotease that is crucial for a cell's 
ability to divide when embedded in matrix. 

Clearly, the activity of the proteases that degrade the matrix components has 
to be tightly controlled, if the fabric of the body is not to collapse in a heap. Three 
basic control mechanisms operate: 

Local activation: Many proteases are secreted as inactive precursors that can 
be activated locally when needed. An example is plasminogen, an inactive pro- 
tease precursor that is abundant in the blood. Proteases called plasminogen 
activators cleave plasminogen locally to yield the active serine protease plasmin, 
which helps break up blood clots. Tissue-type plasminogen activator (tPA) is 
often given to patients who have just had a heart attack or thrombotic stroke; it 
helps dissolve the arterial clot that caused the attack, thereby restoring blood- 
flow to the tissue. 

Confinement by cell-surface receptors: Many cells have receptors on their 
surface that bind proteases, thereby confining the enzyme to the sites where it is 
needed. (Jmkinase-rype plasminogen activator (uPA) is an example. It is found 
bound to receptors on the growing tips of axons and at the leading edge of some 
migrating cells, where it may serve to clear a pathway for their migration. Recep- 
tor-bound uPA may also help some cancer cells metastasize {Figure 19-75). 

Secretion ofitihibitors;Thv action of proteases is confined to specific areas by 
various secreted protease inhibitors, including the tissue inhibitors of metallo- 
proteases (TIMPs) and the serine protease inhibitors known as serpins. The 
inhibitors are protease-specific and bind tightly to the activated enzyme, block- 
ing its activity An attractive idea is that the inhibitors are secreted by cells at the 
margins of areas of active protein degradation to protect uninvolved matrix; 
they may also protect cell-surface proteins required for cell adhesion and migra- 
tion. The overexpression of TIMPs inhibits the migration of some cell types, 
emphasizing the importance of metalloproteases for the migration. 
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Figure 19-75 The importance of 
proteases bound to cell-surface 
receptors. (A) Human prostate cancer 
cells make and secrete the serine 
protease uPA, which binds to cell-surface 
uPA receptor proteins. (B) The same cells 
have been transfected with DNA that 
encodes an excess of an inactive form of 
uPA, which binds to the uPA receptors 
but has no protease activity. By 
occupying most of the uPA receptors, the 
inactive uPA prevents the active protease 
from binding to the cell surface. Both 
types of cells secrete active uPA, grow 
rapidly, and produce tumors when 
injected into experimental animals. But 
the cells in (A) metastasize widely, 
whereas the cells in (B) do not. To 
metastasize via the blood, tumor cells 
have to crawl through basal laminae and 
other extracellular matrices on the way 
into and out of the bloodstream. This 
experiment suggests that proteases must 
be bound to the cell surface to facilitate 
migration through the matrix. 
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Summary 

Cells in connective tissues are embedded in an intricate extracellular matrix that not 
only binds the cells together but also influences their survival, development, shape, 
polarity, and migratory behavior. The matrix contains various protein fibers interwo- 
ven in a hydrated gel composed of a network of glycosaminoglycan (GAG) chains. 

GAGs are a heterogeneous group of negatively charged polysaccharide chains that 
(except for hyaluronan) are covalently linked to protein to form proteoglycan 
molecules. The negative charges attract counterions, which have a powerful osmotic 
effect, drawing water into the matrix and keeping it swollen so as to occupy a large vol- 
ume of extracellular space. Proteoglycans are also found on the surface of cells, where 
they often function as co-receptors to help cells respond to secreted signal proteins. 

Fiber-forming proteins give the matrix strength and resilience. They also form 
structures to which cells can be anchored, often via large multidomain glycoproteins 
such as laminin and fibronectin that have multiple binding sites for integrins on the 
cell surface. Elasticity is provided by elastin molecules, which form an extensive cross- 
linked network of fibers and sheets that can stretch and recoil. The fibrillar collagens 
(types I, II, III, V, and XI) provide tensile strength. They are ropelike, triple- stranded 
helical molecules that aggregate into long fibrils in the extracellular space. The fibrils 
in turn can assemble into various highly ordered arrays. Fibril-associated collagen 
molecules, such as types IX and XII, decorate the surface of collagen fibrils and influ- 
ence the interactions of the fibrils with one another and with other matrix compo- 
nents. 

Matrix components are degraded by extracellular proteolytic enzymes. Most of 
these are matrix metalloproteases, which depend on bound Ca 2+ or Zn 2+ for activity, 
while others are serine proteases, which have a reactive serine in their active site. The 
degradation of matrix components is subject to complex controls, and cells can, for 
example, cause a localized degradation of matrix components to clear a path through 
the matrix. 

THE PLANT CELL WALL 

Each cell in a plant deposits, and is in turn completely enclosed by, an elaborate 
extracellular matrix called the plant cell wall It was the thick cell walls of cork, 
visible in a primitive microscope, that in 1663 enabled Robert Hooke to distin- 
guish and name cells for the first time. The walls of neighboring plant cells, 
cemented together to form the intact plant (Figure 19-76), are generally thicker, 
stronger, and, most important of all, more rigid than the extracellular matrix pro- 
duced by animal cells. In evolving relatively rigid walls, which can be up to many 
micrometers thick, early plant cells forfeited the ability to crawl about and 
adopted a sedentary lifestyle that has persisted in all present-day plants. 

The Composition of the Cell Wall Depends on the Cell Type 

All cell walls in plants have their origin in dividing cells, as the cell plate forms 
during cytokinesis to create a new partition wall between the daughter cells (dis- 
cussed in Chapter 17). The new cells are usually produced in special regions 
called meristems (discussed in Chapter 22), and they are generally small in com- 
parison with their final size. To accommodate subsequent cell growth, the walls 
of the newborn cells, called primary cell walls, are thin and extensible, although 
tough. Once growth stops, the wall no longer needs to be extensible: sometimes 
the primary wall is retained without major modification, but, more commonly, 
a rigid, secondary cell wall is produced by depositing new layers of matrix inside 
the old ones. These new layers generally have a composition that is significantly 
different from that of the primary wall. The most common additional polymer in 
secondary walls is lignin, a complex network of covalently linked phenolic com- 
pounds found in the walls of the xylem vessels and fiber cells of woody tissues. 
Although the cell walls of higher plants vary in both composition and orga- 
nization, they are all constructed, like animal extracellular matrices, using a 
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Figure 19-76 Plant cell walls. (A) Electron 
micrograph of the root tip of a rush, 
showing the organized pattern of cells 
that results from an ordered sequence of 
cell divisions in cells with relatively rigid 
cell walls. In this growing tissue, the cell 
walls are still relatively thin, appearing as 
fine black lines between the cells in the 
micrograph. (B) Section of a typical cell 
wall separating two adjacent plant cells. 
The two dark transverse bands correspond 
to plasmodesmata that span the wall (see 
Figure 1 9-38). {A, courtesy of C. Busby and 
B. Gunning, Eur. J. Cell Biol. 21:214-233, 
1980. With permission from Elsevier; 
B, courtesy of Jeremy Burgess.) 
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structural principle common to all fiber-composites, including fiberglass and 
reinforced concrete. One component provides tensile strength, while another, in 
which the first is embedded, provides resistance to compression. While the prin- 
ciple is the same in plants and animals, the chemistry is different. Unlike the ani- 
mal extracellular matrix, which is rich in protein and other nitrogen-containing 
polymers, the plant cell wall is made almost entirely of polymers that contain no 
nitrogen, including cellulose and lignin. For a sedentary organism that depends 
on C0 2 , H 2 0, and sunlight, these two abundant biopolymers represent "cheap," 
carbon-based, structural materials, helping to conserve the scarce fixed nitro- 
gen available in the soil that generally limits plant growth. Thus trees, for exam- 
ple, make a huge investment in the cellulose and lignin that comprise the bulk 
of their biomass. 

In the cell walls of higher plants, the tensile fibers are made from the 
polysaccharide cellulose, the most abundant organic macromolecule on Earth, 
tightly linked into a network by cross-linking glycans. In primary ceil walls, the 
matrix in which the cross-linked cellulose network is embedded is composed of 
pectin, a highly hydrated network of polysaccharides rich in galaeturonic acid. 
Secondary cell walls contain additional molecules to make them rigid and per- 
manent: lignin, in particular, forms a hard, waterproof filler in the interstices 
between the other components. All of these molecules are held together by a 
combination of covalent and noncovalent bonds to form a highly complex 
structure, whose composition, thickness and architecture depend on the cell 
type. 

The plant cell wall thus has a "skeletal" role in supporting the structure of 
the plant as a whole, a protective role as an enclosure for each cell individually, 
and a transport role, helping to form channels for the movement of fluid in the 
plant. When plant cells become specialized, they generally adopt a specific 
shape and produce specially adapted types of walls, according to which the dif- 
ferent types of cells in a plant can be recognized and classified (Figure 19-77; 
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Figure 19-77 Some specialized plant cell 
types with appropriately modified cell 
walls. (A) A trichome, or hair, on the upper 
surface of an Arabidopsis leaf. This spiky, 
protective single cell is shaped by the 
local deposition of a tough, cellulose-rich 
wall. (B) Surface view of tomato leaf 
epidermal cells. The cells fit together 
snugly like the pieces of a jigsaw puzzle, 
providing a strong outer covering for the 
leaf. The outer cell wall is reinforced with a 
cuticle and with waxes that waterproof 
the leaf and help defend it against 
pathogens. (C) This view into young xylem 
elements shows the thick, lignified, hoop- 
reinforced secondary cell wall that creates 
robust tubes for the transport of water 
throughout the plant. (A, courtesy of Paul 
Linstead; B and C, courtesy of Kim Findlay.) 



see also Panel 22-2, pp. 1404-1405) .We focus here, however, on the primary cell 
wall and the molecular architecture that underlies its remarkable combination 
of strength, resilience, and plasticity, as seen in the growing parts of a plant. 



The Tensile Strength of the Cell Wall Allows Plant Cells to Develop 
Turgor Pressure 

The aqueous extracellular environment of a plant cell consists of the fluid con- 
tained in the walls that surround the cell. Although the fluid in the plant cell wall 
contains more solutes than does the water in the plant's external milieu (for 
example, soil), it is still hypotonic in comparison with the cell interior. This 
osmotic imbalance causes the cell to develop a large internal hydrostatic pres- 
sure, or turgor pressure, which pushes outward on the cell wall, just as an inner 
tube pushes outward on a tire. The turgor pressure increases just to the point 
where the cell is in osmotic equilibrium, with no net influx of water despite the 
salt imbalance (see Panel 11-1, p. 664). The turgor pressure generated in this way 
may reach 10 or more atmospheres, about five times that in the average car tire. 
This pressure is vital to plants because it is the main driving force for cell expan- 
sion during growth, and it provides much of the mechanical rigidity of living 
plant tissues. Compare the wilted leaf of a dehydrated plant, for example, with 
the turgid leaf of a well- watered one. It is the mechanical strength of the cell wall 
that allows plant cells to sustain this internal pressure. 

The Primary Cell Wall Is Built from Cellulose Microfibrils 
Interwoven with a Network of Pectic Polysaccharides 

Cellulose gives the primary cell wall tensile strength. Each cellulose molecule 
consists of a linear chain of at least 500 glucose residues that are covalently 
linked to one another to form a ribbonlike structure, which is stabilized by 
hydrogen bonds within the chain (Figure 19-78). In addition, hydrogen bonds 
between adjacent cellulose molecules cause them to stick together in overlap- 
ping parallel arrays, forming bundles of about 40 cellulose chains, all of which 
have the same polarity. These highly ordered crystalline aggregates, many 
micrometers long, are called cellulose microfibrils, and they have a tensile 
strength comparable to steel (see Figure 19-78). Sets of microfibrils are 
arranged in layers, or lamellae, with each microfibril about 20-40 nm from its 
neighbors and connected to them by long cross-linking glycan molecules that 
are attached by hydrogen bonds to the surface of the microfibrils. The primary 
cell wall consists of several such lamellae arranged in a plywoodlike network 
(Figure 19-79). 
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Figure 19-78 Cellulose. Cellulose 
molecules are long, unbranched chains of 
(51,4-linked glucose units. Each glucose 
residue is inverted with respect to its 
neighbors, and the resulting disaccharide 
repeat occurs hundreds of times in a 
single cellulose molecule. About 40 
individual cellulose molecules assemble 
to form a strong, hydrogen-bonded 
cellulose microfibril. 
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The cross-linking glycans are a heterogeneous group of branched polysac- 
charides that bind tightly to the surface of each cellulose microfibril and thereby 
help to cross-link the microfibrils into a complex network. Their function is 
analogous to that of the fibril-associated collagens discussed earlier (see Figure 
19-68). There are many classes of cross-linking glycans, but they all have a long 
linear backbone composed of one type of sugar (glucose, xylose, or mannose) 
from which short side chains of other sugars protrude. It is the backbone sugar 
molecules that form hydrogen bonds with the surface of cellulose microfibrils, 
cross-linking them in the process. Both the backbone and the side-chain sugars 
vary according to the plant species and its stage of development. 

Coextensive with this network of cellulose microfibrils and cross-linking gly- 
cans is another cross-linked polysaccharide network based on pectins (see Fig- 
ure 19-79). Pectins are a heterogeneous group of branched polysaccharides that 
contain many negatively charged galacturonic acid units. Because of their neg- 
ative charge, pectins are highly hydrated and associated with a cloud of cations, 
resembling the glycosaminoglycans of animal cells in the large amount of space 
they occupy (see Figure 19-56). When Ca 2+ is added to a solution of pectin 
molecules, it cross-links them to produce a semirigid gel (it is pectin that is 
added to fruit juice to make jam set). Certain pectins are particularly abundant 
in the middle lamella, the specialized region that cements together the walls of 
adjacent cells (see Figure 19-79); here, Ca 2+ cross-links are thought to help hold 
cell-wall components together. Although covalent bonds also play a part in link- 
ing the components, very little is known about their nature. Regulated separa- 
tion of cells at the middle lamella underlies such processes as the ripening of 
tomatoes and the abscission (detachment) of leaves in the fall. 

In addition to the two polysaccharide-based networks that form the bulk of 
all plant primary cell walls, proteins are present, contributing up to about 5% of 
the wall's dry mass. Many of these proteins are enzymes, responsible for wall 
turnover and remodeling, particularly during growth. Another class of wall pro- 
teins, like collagen, contain high levels of hydroxyproline. These proteins are 
thought to strengthen the wall, and they are produced in greatly increased 
amounts as a local response to attack by pathogens. From the genome sequence 
of Arabidopsis, it has been estimated that more than 700 genes are required to 
synthesize, assemble, and remodel the plant cell wall. Some of the main poly- 
mers found in the primary and secondary cell wall are listed in Table 19-8. 
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Figure 19-79 Scale model of a portion 
of a primary plant cell wall showing the 
two major polysaccharide networks. The 

orthogonally arranged layers of cellulose 
microfibrils (green) are tied into a network 
by the cross-linking glycans (red) that 
form hydrogen bonds with the 
microfibrils. This network is coextensive 
with a network or matrix of pectic 
polysaccharides (blue). The network of 
cellulose and cross-linking glycans 
provides tensile strength, while the 
pectin network resists compression. 
Cellulose, cross-linking glycans, and 
pectin are typically present in roughly 
equal amounts in a primary cell wall. The 
middle lamella is especially rich in pectin, 
and it cements adjacent cells together. 
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Table 1 9-8 The Polymers of the Plant Cell Wall 
POLYMER COMPOSITION 



Cellulose 


linear polymer of glucose 


Cross-linking glycans 


xyloglucan, glucuronoarabinoxylan, 
and mannan 


Pectin 


homogalacturonans and 
rhamnogalacturonans 


Lignin 


cross-linked coumaryl, coniferyl, and 
sinapyl alcohols 


Proteins and 
glycoproteins 


enzymes, hydroxyproline-rich 
proteins 



FUNCTIONS 

fibrils confer tensile strength on all walls 
cross-link cellulose fibrils into robust network 

forms negatively charged, hydrophilic network that gives 
compressive strength to primary walls; cell-cell adhesion 

forms strong waterproof polymer that reinforces secondary 
cell walls 

responsible for wall turnover and remodeling; help defend 
against pathogens 



Oriented Cell-Wall Deposition Controls Plant Cell Growth 

Once a plant cell has left the meristem where it is generated, it can grow dra- 
matically, commonly by more than a thousand times in volume. The manner of 
this expansion determines the final shape of each cell, and hence the final form 
of the plant as a whole. Turgor pressure inside the cell drives the expansion, but 
it is the behavior of the cell wall that governs its direction and extent. Complex 
wall-remodeling activities are required, as well as the deposition of new wall 
materials. Because of their crystalline structure, the individual cellulose 
microfibrils in the wall are unable to stretch, and this gives them a crucial role in 
the process. For the cell wall to stretch or deform, the microfibrils must either 
slide past one another or become more widely separated, or both. The orienta- 
tion of the microfibrils in the innermost layers of the wall governs the direction 
in which the cell expands. Cells in plants therefore anticipate their future mor- 
phology by controlling the orientation of the cellulose microfibrils that they 
deposit in the wall (Figure 19-80). 

Unlike most other matrix macromolecules, which are made in the endo- 
plasmic reticulum and Golgi apparatus and are secreted, cellulose, like 
hyaluronan in animals, is spun out from the surface of the cell by a plasma- 
membrane-bound enzyme complex (cellulose synthase), which uses as its sub- 
strate the sugar nucleotide UDP-glucose supplied from the cytosol. Each 
enzyme complex, or rosette, has a six-fold symmetry and contains the protein 
products of three separate cellulose synthase (CESA) genes. Each CESA protein 




Figure 1 9-80 Cellulose microfibrils 
influence the direction of cell elongation. 

(A) The orientation of cellulose microfibrils 
in the primary cell wall of an elongating 
carrot cell is shown in this electron 
micrograph of a shadowed replica from a 
rapidly frozen and deep-etched cell wall. 
The cellulose microfibrils are aligned 
parallel to one another and perpendicular 
to the axis of cell elongation. The 
microfibrils are cross-linked by and 
interwoven with, a complex web of matrix 
molecules (compare with Figure 1 9-79). 
(B,C)The cells in (A) and (B) start off with 
identical shapes (shown here as cubes) but 
with different net orientations of cellulose 
microfibrils in their walls. Although turgor 
pressure is uniform in all directions, cell 
wall loosening allows each cell to elongate 
only in a direction perpendicular to the 
orientation of the innermost layer of 
microfibrils, which have great tensile 
strength. Cell expansion occurs in concert 
with the insertion of new wall material. 
The final shape of an organ, such as a 
shoot, is determined in part by the 
direction in which its component cells can 
expand. (A, courtesy of Brian Wells and 
Keith Roberts.) 
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is essential for the production of a cellulose microfibril. Three CESA genes are 
required for primary cell wall synthesis and a different three for secondary cell 
wall synthesis. 

As they are being synthesized, the nascent cellulose chains assemble into 
microfibrils. These are spun out on the extracellular surface of the plasma mem- 
brane, forming a layer, or lamella, in which all the microfibrils have more or less 
the same alignment (see Figure 19-79). Each new lamella is deposited internally 
to the previous one, so that the wall consists of concentrically arranged lamellae, 
with the oldest on the outside. The most recently deposited microfibrils in elon- 
gating cells commonly lie perpendicular to the axis of cell elongation, although 
the orientation of the microfibrils in the outer lamellae that were laid down ear- 
lier may be different (see Figure 19-80B and C). 



Microtubules Orient Cell-Wall Deposition 

An important clue to the mechanism that dictates this orientation came from 
observations of the microtubules in plant cells. These are frequently arranged 
in the cortical cytoplasm with the same orientation as the cellulose microfibrils 
that are currently being deposited in the cell wall in that region. These cortical 
microtubules form a cortical array close to the cytosolic face of the plasma 
membrane, held there by poorly characterized proteins (Figure 19-81). The 
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Figure 19-81 The cortical array of 
microtubules in a plant cell. (A) A grazing 
section of a root-tip cell from timothy 
grass, showing a cortical array of 
microtubules lying just below the plasma 
membrane. These microtubules are 
oriented perpendicularly to the long axis of 
the cell. (B) Epidermal cells in a root tip of 
Arabidopsis that has been stained, by 
immunofluorescence, to reveal the entire 
cortical microtubule arrays in these rapidly 
elongating cells. The clear transverse arrays 
are at right angles to the axis of root 
elongation. (A, courtesy of Brian Gunning; 
B, courtesy of Keiko Sugimoto-Shirasu.) 
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congruent orientation of the cortical array of microtubules (lying just inside the 
plasma membrane) and cellulose microfibrils (lying just outside) is seen in 
many types and shapes of plant cells and is present during both primary and 
secondary cell-wall deposition, suggesting a causal relationship. 

This relationship can be tested by treating a plant tissue with a microtubule- 
depolymerizing drug so as to disassemble the entire system of cortical micro- 
tubules. The consequences for subsequent cellulose deposition, however, are not 
as straightforward as might be expected. The drug treatment does not disrupt the 
production of new cellulose microfibrils, and in some cases cells can continue to 
deposit new microfibrils in the preexisting orientation. Any developmental 
switch in the orientation of the microfibril pattern that would normally occur 
between successive lamellae, however, is invariably blocked. It seems that a pre- 
existing orientation of microfibrils can be propagated even in the absence of 
microtubules, but any change in the deposition of cellulose microfibrils requires 
that intact microtubules be present to determine the new orientation. 

These observations are consistent with the following model. The cellulose- 
synthesizing rosettes embedded in the plasma membrane spin out long cellu- 
lose molecules. As the synthesis of cellulose molecules and their self-assembly 
into microfibrils proceeds, the distal end of each microfibril presumably forms 
indirect cross-links to the previous layer of wall material, orienting the new 
microfibril in parallel with the old ones as it becomes integrated into the texture 
of the wall. Since the microfibril is stiff, the rosette at its growing, proximal end 
has to move as it deposits the new material, traveling in the plane of the mem- 
brane in the direction defined by the way in which the far end of the microfibril 
is anchored in the existing wall. In this way, each layer of microfibrils would tend 
to be spun out from the membrane in the same orientation as the layer laid 
down previously, with the rosettes following the direction of the preexisting ori- 
ented microfibrils outside the cell. Oriented microtubules inside the cell, how- 
ever, can force a change in the direction in which the rosettes move: they can 
create boundaries in the plasma membrane that act like the banks of a canal to 
constrain rosette movement (Figure 19-82). In this view, cellulose synthesis can 
occur independently of microtubules; but it is constrained spatially when corti- 
cal microtubules are present to define membrane microdomains within which 
the enzyme complex can move. 

In this way, plant cells can change their direction of expansion by a sudden 
change in the orientation of their cortical array of microtubules. Because plant 
cells cannot move (being constrained by their walls), the entire morphology of a 
multicellular plant presumably depends on a coordinated, highly patterned 
deployment of cortical microtubule orientations during plant development. It is 
not known how these orientations are controlled, although it has been shown 
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Figure 1 9-82 One model of how the 
orientation of newly deposited cellulose 
microfibrils might be determined by the 
orientation of cortical microtubules. 

(A) The large cellulose synthase 
complexes are integral membrane 
proteins that continuously synthesize 
cellulose microfibrils on the outer face of 
the plasma membrane. The distal ends of 
the stiff microfibrils become integrated 
into the texture of the wall, and their 
elongation at the proximal end pushes 
the synthase complex along in the plane 
of the membrane. Because the cortical 
array of microtubules is attached to the 
plasma membrane in a way that confines 
this complex to defined membrane 
channels, the orientation of these 
microtubules — when they are present — 
determines the axis along which the new 
microfibrils are laid down. (B,C)Two 
electron micrographs show the tight 
association of the cortical microtubules 
with the plasma membrane. One shows 
the microtubules in cross section while 
the other shows a longitudinal section. 
Both emphasize the constant gap of 
about 20 nm between membrane and 
microtubule; the connecting molecules 
responsible remain obscure. (B and C, 
courtesy of Andrew Staehelin.) 
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that the microtubules can reorient rapidly in response to extracellular stimuli, 
including low- molecular- weight plant growth regulators such as ethylene and 
gibberellic acid (see Figure 22-119). 

Microtubules are not, however, the only cytoskeletal elements that influence 
wall deposition. Local foci of cortical actin filaments can also direct the deposi- 
tion of new wall material at specific sites on the cell surface, contributing to the 
elaborate final shaping of many differentiated plant cells, such as the leaf tri- 
chomes and epidermal cells shown in Figure 19-77. In cells that cannot organize 
their cortical actin network, as in Arp2/3 mutants, the cells lose their character- 
istic shapes, as illustrated in Figure 16-35. 



Summary 

Plant cells are surrounded by a tough extracellular matrix, or cell wall which is 
responsible for many of the unique features of a plant's life style. The wall is composed 
of a network of cellulose microfibrils and cross - linking gly cans, embedded in a highly 
cross-linked matrix ofpectic polysaccharides. In secondary cell walls, lignin may be 
deposited making them waterproof hard, and woody. A cortical array of microtubules 
can control the orientation of newly deposited cellulose microfibrils, which in turn 
determine the direction of cell expansion and therefore the final shape of the cell and, 
ultimately, of the plant as a whole. 



PROBLEMS 

Which statements are true? Explain why or why not. 

19-1 Given the numerous processes inside cells that are 
regulated by changes in Ca 2+ concentration, it seems likely 
that Ca 2+ -dependent cell-cell adhesions are also regulated 
by changes in Ca 2+ concentration. 

19-2 Tight junctions perform two distinct functions: they 
seal the space between cells to restrict paracellular flow and 
they fence off membrane domains to prevent mixing of api- 
cal and basolateral proteins. 

19-3 Integrins can convert mechanical signals into molec- 
ular signals. 

1 9-4 The elasticity of elastin derives from its high content 
of a helices, which act as molecular springs. 

Discuss the following problems. 

19-5 Comment on the following (1922) quote from Warren 
Lewis, who was one of the pioneers of cell biology. "Were the 
various types of cells to lose their stickiness for one another 
and for the supporting extracellular matrix, our bodies 
would at once disintegrate and flow off into the ground in a 
mixed stream of cells." 

19-6 Cell adhesion molecules were originally identified 
using antibodies raised against cell-surface components to 
block cell aggregation. In the adhesion-blocking assays, the 
researchers found it necessary to use antibody fragments, 
each with a single binding site (so-called Fab fragments), 
rather than intact IgG antibodies, which are Y-shaped 
molecules with two identical binding sites. The Fab frag- 
ments were generated by digesting the IgG antibodies with 
papain, a protease, to separate the two binding sites (Figure 
Q19-1). Why do you suppose it was necessary to use Fab 
fragments to block cell aggregation? 



19-7 The food poisoning bacterium Clostridium perfrin- 
gens makes a toxin that binds to members of the claudin 
family of proteins, which are the main constituents of tight 
junctions. When bound to a claudin, the C-terminus of the 
toxin allows the N-terminus to insert into the adjacent cell 
membrane, forming holes that kill the cell. The portion of 
the toxin that binds to the claudins has proven to be a valu- 
able reagent for investigating the properties of tight junc- 
tions. MDCK cells are a common choice for studies of tight 
junctions because they can form an intact epithelial sheet 
with high transepithelial resistance. MDCK cells express two 
claudins: claudin- 1, which is not bound by the toxin, and 
claudin-4, which is. 

When an intact MDCK epithelial sheet is incubated with 
the C-terminal toxin fragment, claudin-4 disappears, 
becoming undetectable within 24 hours. In the absence of 
claudin-4, the cells remain healthy and the epithelial sheet 
appears intact. The mean number of strands in the tight 
junctions that link the cells also decreases over 24 hours 
from about four to about two, and they are less highly 
branched. A functional assay for the integrity of the tight 
junctions shows that transepithelial resistance decreases 
dramatically in the presence of the toxin, but the resistance 
can be restored by washing it out (Figure Q19-2A). Curi- 
ously, the toxin produces these effects only when it is added 
to the basolateral side of the sheet; it has no effect when 
added to the apical surface (Figure Q19-2B). 
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Figure Q19-1 Production of Fab fragments from IgG antibodies by 

digestion with papain (Problem 19-6). 
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Figure Q1 9-2 Effects of Clostridium toxin on the barrier function of 
MDCK cells (Problem 1 9-7). (A) Addition of toxin from the basolateral 
side of the epithelial sheet. (B) Addition of toxin from the apical side 
of the epithelial sheet. The higher the resistance 
(ohms cm 2 ), the less the paracellular current for a given voltage. 



A. How can it be that two tight junction strands remain, 
even though all of the claudin-4 has disappeared? 

B. How do you suppose the toxin fragment causes the tight 
junction strands to disintegrate? 

C. Why do you suppose the toxin works when it is added to 
the basolateral side of the epithelial sheet, but not when 
added to the apical side? 

1 9-8 It is not an easy matter to assign particular functions 
to specific components of the basal lamina, since the overall 
structure is a complicated composite material with both 
mechanical and signaling properties. Nidogen, for example, 
cross-links two central components of the basal lamina by 
binding to the laminin-yl chain and to type IV collagen. 
Given such a key role, it was surprising that mice with a 
homozygous knockout of the gene for nidogen- 1 were 
entirely healthy, with no abnormal phenotype. Similarly, 
mice homozygous for a knockout of the gene for nidogen-2 
also appeared completely normal. By contrast, mice that 
were homozygous for a defined mutation in the gene for 
laminin-yl, which eliminated just the binding site for nido- 
gen, died at birth with severe defects in lung and kidney for- 
mation. The mutant portion of the laminin-yl chain is 
thought to have no other function than to bind nidogen, and 
does not affect laminin structure or its ability to assemble 
into basal lamina. How would you explain these genetic 
observations, which are summarized in Table Q19-1? What 
would you predict would be the phenotype of a mouse that 
was homozygous for knockouts of both nidogen genes? 

19-9 Discuss the following statement: "The basal lamina 
of muscle fibers serves as a molecular bulletin board, in 
which adjoining cells can post messages that direct the dif- 
ferentiation and function of the underlying cells." 

Table Q1 9-1 Phenotypes of mice with genetic defects in 
components of the basal lamina (Problem 19-8). 



PROTEIN GENETIC OEFEC 



GENOTYPE 



nidogen-1 gene knockout (-/-) none 

nidogen-2 gene knockout (-/-) none 

laminin-y1 nidogen binding-site deletion (+/-) none 

laminin-yl nidogen binding-site deletion (-/-) dead at birth 

+/- stands for heterozygous, -/- stands for homozygous. 



19-10 The affinity of integrins for matrix components can 
be modulated by changes to their cytoplasmic domains: a 
process known as inside-out signaling. You have identified a 
key region in the cytoplasmic domains of allbp3 integrin 
that seems to be required for inside-out signaling (Figure 
Q19-3). Substitution of alanine for either D723 in the p 
chain or R995 in the a chain leads to a high level of sponta- 
neous activation, under conditions where the wild-type 
chains are inactive. Your advisor suggests that you convert 
the aspartate in the p chain to an arginine (D723R) and the 
arginine in the a chain to an aspartate (R995D). You com- 
pare all three a chains (R995, R995A, and R995D) against all 
three P chains (D723, D723A, and D723R). You find that all 
pairs have a high level of spontaneous activation, except 
D723 vs R995 (the wild type) and D723R vs R995D, which 
have low levels. Based on these results, how do you think the 
allbp3 integrin is held in its inactive state? 
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Figure Q1 9-3 Schematic representation of allbp3 integrin (Problem 
1 9 1 0). The D723 and R995 residues are indicated. (From RE. Hughes 
et al M BioL Chem. 271:6571-6574, 1996. With permission from the 
American Society for Biochemistry and Molecular Biology.) 

19-11 Carboxymethyl Sephadex is a negatively charged 
crosslinked dextran that is commonly used for purifying 
proteins. It comes in the form of dry beads that swell 
tremendously when added to water. You have packed a 
chromatography column with the swollen gel. When you 
start equilibrating the column with a buffer that contains 50 
mJVI NaCl at neutral pH, you are alarmed to see a massive 
shrinkage in gel volume. Why does the dry Sephadex swell so 
dramatically when it is placed in water? Why does the 
swollen gel shrink so much when a salt solution is added? 

19-12 At body temperature, L-aspartate in proteins racem- 
izes to D-aspartate at an appreciable rate. Most proteins in 
the body have a very low level of D-aspartate, if it can be 
detected at all. Elastin, however, has a fairly high level of D- 
aspartate. Moreover, the amount of D-aspartate increases in 
direct proportion to the age of the person from whom the 
sample was taken. Why do you suppose that most proteins 
have little if any D-aspartate, while elastin has high, age- 
dependent levels? 

1 9-1 3 Your boss is coming to dinner! All you have for a salad 
is some wilted, day- old lettuce. You vaguely recall that there is 
a trick to rejuvenating wilted lettuce, but you can't remember 
what it is. Should you soak the lettuce in salt water, soak it in 
tap water, or soak it in sugar water, or maybe just shine a 
bright light on it and hope that photosynthesis will perk it up? 

19-14 The hydraulic conductivity of a single water channel 
is 4.4 x 10~ 22 m 3 per second per MPa (megapascal) of pres- 
sure. What does this correspond to in terms of water 
molecules per second at atmospheric pressure? [Atmos- 
pheric pressure is 0.1 MPa (1 bar) and the concentration of 
water is 55.5 M.] 
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Chapter 20 



Cancer 




About one in five of us will die of cancer, but that is not why we devote a chap- 
ter to this disease. Cancer cells break the most basic rules of cell behavior by 
which multicellular organisms are built and maintained, and they exploit every 
kind of opportunity to do so. In studying these transgressions, we discover what 
the normal rules are and how they are enforced. Thus, in the context of cell biol- 
ogy, cancer has a unique importance, and the emphasis given to cancer research 
has profoundly benefited a much wider area of biomedical science than that of 
cancer alone. 

The effort to combat cancer has led to many fundamental discoveries in cell 
biology. Many proteins have been discovered because abnormalities in their 
function can lead to uncontrolled growth, increased division, decreased death, 
or other aberrant characteristics of cancer cells. Among them are proteins 
involved in DNA repair (Chapter 5), cell signaling (Chapter 15), the cell cycle and 
cell growth (Chapter 17), programmed cell death (apoptosis, Chapter 18), and 
tissue architecture (Chapter 19). 

In this chapter, we first examine the nature of cancer by describing the nat- 
ural history of the disease from a cellular standpoint. In a second section, we dis- 
cuss the molecular changes that make a cell cancerous. Finally, we end the chap- 
ter by discussing how our enhanced understanding of the molecular basis of 
cancer is leading to improved methods for its prevention and treatment. 
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CANCER AS A MICROEVOLUTIONARY PROCESS 

The body of an animal operates as a society or ecosystem. The individual mem- 
bers are cells that reproduce by cell division and organize into collaborative 
assemblies called tissues. This society is very peculiar, however, because self- 
sacrifice — as opposed to survival of the fittest — is the rule. Ultimately all of the 
somatic cell lineages in animals are committed to die: they leave no progeny and 
instead dedicate their existence to support of the germ cells, which alone have a 
chance of continued survival (discussed in Chapter 21). There is no particular 
mystery in this, for the body is a clone derived from a fertilized egg, and the 
genome of the somatic cells is the same as that of the germ cell lineage that gives 
rise to sperm or eggs. By their self-sacrifice for the sake of the germ cells, the 
somatic cells help to propagate copies of their own genes. 

Thus, unlike free-living cells such as bacteria, which compete to survive, the 
cells of a multicellular organism are committed to collaboration. To coordinate 
their behavior, the cells send, receive, and interpret an elaborate set of extracel- 
lular signals that serve as social controls, directing each of them how to act (dis- 
cussed in Chapter 15). As a result, each cell behaves in a socially responsible 
manner — resting, growing, dividing, differentiating, or dying — as needed for the 
good of the organism. Molecular disturbances that upset this harmony mean 
trouble for a multicellular society. In a human body with more than 10 14 cells, 
billions of cells experience mutations every day, potentially disrupting the social 
controls. Most dangerously, a mutation may give one cell a selective advantage, 
allowing it to grow and divide more vigorously and survive more readily than its 
neighbors and to become a founder of a growing mutant clone. A mutation that 
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promotes such selfish behavior by individual members of the cooperative can 
jeopardize the future of the whole enterprise. Over time, repeated rounds of 
mutation, competition, and natural selection operating within the population of 
somatic cells can cause matters to go from bad to worse. These are the basic 
ingredients of cancer: it is a disease in which an individual mutant clone of cells 
begins by prospering at the expense of its neighbors, but in the end the descen- 
dants of this clone can destroy the whole cellular society. 

In this section, we discuss the development of cancer as a microevolution- 
ary process. In a human, this process occurs on a time scale of years or decades, 
in a subpopulation of cells in the body. But the final result depends on the same 
principles of mutation and natural selection that have driven the evolution of 
living organisms for billions of years. 



Cancer Cells Reproduce Without Restraint and Colonize Other 
Tissues 

Cancer cells are defined by two heritable properties: (1) they reproduce in defi- 
ance of the normal restraints on cell growth and division, and (2) they invade 
and colonize territories normally reserved for other cells. It is the combination 
of these properties that makes cancers particularly dangerous. An abnormal cell 
that grows (increases in mass) and proliferates (divides) out of control will give 
rise to a tumor, or neoplasm— literally, a new growth. As long as the neoplastic 
cells do not become invasive, however, the tumor is said to be benign, and 
removing or destroying the mass locally usually achieves a complete cure. A 
tumor is considered a cancer only if it is malignant, that is, only if its cells have 
acquired the ability to invade surrounding tissue. Invasiveness is an essential 
characteristic of cancer cells. It allows them to break loose, enter blood or lym- 
phatic vessels, and form secondary tumors, called metastases, at other sites in 
the body (Figure 20-1) . The more widely a cancer spreads, the harder it becomes 
to eradicate, and it is generally metastases that kill the cancer patient. 

Cancers are classified according to the tissue and cell type from which they 
arise. Carcinomas are cancers arising from epithelial cells, and they are by far 
the most common cancers in humans. Sarcomas arise from connective tissue or 
muscle cells. Cancers that do not fit in either of these two broad categories 
include the various leukemias and lymphomas, derived from white blood cells 
and their precursors (hemopoietic cells), as well as cancers derived from cells of 
the nervous system. Figure 20-2 shows the types of cancers that are common in 
the United States, together with their incidence and death rate. Each broad cat- 
egory has many subdivisions according to the specific cell type, location in the 
body, and the microscopic appearance of the tumor. 

In parallel with the set of names for malignant tumors, there is a related set 
of names for benign tumors: an adenoma, for example, is a benign epithelial 
tumor with a glandular organization; the corresponding type of malignant 
tumor is an adenocarcinoma (Figure 20-3). Similarly a chondroma and a chon- 
drosarcoma are, respectively, benign and malignant tumors of cartilage. 

Most cancers have characteristics that reflect their origin. Thus, for example, 
the cells of a basal-cell carcinoma, derived from a keratinocyte stem cell in the 
skin, generally continue to synthesize cytokeratin intermediate filaments, 
whereas the cells of a melanoma, derived from a pigment cell in the skin, will 

Figure 20-1 Metastasis. Malignant tumors typically give rise to metastases, 
making the cancer hard to eradicate. Shown in this fusion image is a 
whole-body scan of a patient with metastatic non-Hodgkins lymphoma 
(NHL). The background image of the body's tissues was obtained by CT 
(computed X-ray tomography) scanning. Overlaid on this image, a PET 
(positron emission tomography) scan that detects the uptake of 
radioactively labeled fluorodeoxyglucose (FDG) in various tissues reveals 
the tumor tissue {yellow). High FDG uptake indicates cells with unusually 
active glucose uptake and metabolism, a characteristic of tumors. The 
yellow spots in the abdominal region reveal multiple NHL metastases. 
(Courtesy of S. Gambhir.) 
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Figure 20-2 Cancer incidence and 
mortality in the United States. The total 
number of new cases diagnosed in 2004 
in the United States was 1,368,030, and 
total cancer deaths were 563,700. Note 
that deaths reflect cases diagnosed at 
many different times and that somewhat 
less than half of the people who develop 
cancer die of it. In the world as a whole, 
the five most common cancers are those 
of the lung, stomach, breast, 
colon/rectum, and uterine cervix 
(included in the figure under the heading 
of reproductive tract), and the total 
number of new cancer cases recorded 
per year is just over 6 million. Skin 
cancers other than melanomas are not 
included in these figures, since almost all 
are cured easily and many are 
unrecorded. (Data from American Cancer 
Society, Cancer Facts and Figures, 2004.) 



often (but not always) continue to make pigment granules. Cancers originating 
from different cell types are, in general, very different diseases. Basal-cell carci- 
nomas of the skin, for example, are only locally invasive and rarely metastasize, 
whereas melanomas can become much more malignant and often form metas- 
tases (reflecting the migratory behavior of normal pigment-cell precursors dur- 
ing development, discussed in Chapter 22). Basal-cell carcinomas are readily 
cured by surgery or local irradiation, whereas malignant melanomas, once they 
have metastasized widely, are usually fatal. 

About 80% of human cancers are carcinomas, perhaps because most of the 
cell proliferation in adults occurs in epithelia or because epithelial tissues are 
most frequently exposed to the various forms of physical and chemical damage 
that favor the development of cancer. 



Most Cancers Derive from a Single Abnormal Cell 

Even when a cancer has metastasized, we can usually trace its origins to a single 
primary tumor, arising in a specific organ. The primary tumor is thought to 
derive by cell division from a single cell that initially experienced some heritable 
change. Subsequently, additional changes accumulate in some of the descen- 
dants of this cell, allowing them to outgrow, out-divide, and often outlive their 
neighbors. By the time it is first detected, a typical human cancer will have been 
developing for many years and will already contain about a billion cancer cells 
or more (Figure 20-4). Tumors will usually also contain a variety of other cell 
types — fibroblasts, for example, in the supporting connective tissue associated 
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Figure 20-3 Benign versus malignant 
tumors. A benign glandular tumor (an 
adenoma) remains inside the basal 
lamina that marks the boundary of the 
normal structure (a duct, in this example), 
whereas a malignant glandular tumor (an 
adenocarcinoma) destroys duct integrity 
as shown. There are many forms that 
such tumors may take. 
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with a carcinoma, as well as inflammatory cells. How can we be sure that the 
cancer cells are a clone descended from a single abnormal cell? 

Molecular analyses of chromosomes in tumor cells clearly demonstrate the 
clonal origin of those cells. In almost all patients with chronic myelogenous 
leukemia (CML), for example, we can distinguish leukemic white blood cells 
from normal cells by a specific chromosomal abnormality: the so-called 
Philadelphia chromosome, created by a translocation between the long arms of 
chromosomes 9 and 22 (Figure 20-5). When the DNA at the site of translocation 
is cloned and sequenced, it is found that the site of breakage and rejoining of the 
translocated fragments is identical in all the leukemic cells in any given patient, 
but that this site differs slightly (by a few hundred or thousand base pairs) from 
one patient to another. This is the expected result if, and only if, the cancer in 
each patient arises from a unique accident occurring in a single cell. We will see 
later how this translocation promotes the development of CML by creating a 
novel hybrid gene encoding a protein that promotes cell proliferation. 

Many other lines of evidence, from a variety of cancers, point to the same 
conclusion: most cancers originate from a single aberrant cell (Figure 20-6). 



Cancer Cells Contain Somatic Mutations 

If a single abnormal cell is to give rise to a tumor, it must pass on its abnormality 
to its progeny: the aberration has to be heritable. One problem in understanding 
a cancer is to discover whether a particular heritable aberration is due to a 
genetic change— that is, an alteration in the cell's DNA sequence— or to an epige- 
netic change— that is, a persistent change in the pattern of gene expression with- 
out a change in the DNA sequence. Epigenetic changes occur during normal 
development, and they are responsible for the stability of the differentiated state 
and for such phenomena as X-chromosome inactivation and genomic imprint- 
ing (discussed in Chapter 7). As we shall discuss later, epigenetic changes are now 
known to play an important part in the development of cancers. 

But it is also clear that the development of a clone of cancer cells also 
involves genetic changes. The cells of many cancers can be shown to have one 
or more shared detectable abnormalities in their DNA sequence that distinguish 
them from the normal cells surrounding the tumor, as in the example of CML 
just described, and advances in DNA analysis have identified such genetic faults 
in an increasing proportion of cancer types. This agrees with the finding that 
many of the agents that provoke the development of cancer also cause genetic 
changes. Thus, carcinogenesis (the generation of cancer) appears to be linked to 
mutagenesis (the production of a change in the DNA sequence). This correlation 
is particularly clear for two classes of agents: (1) chemical carcinogens (which 
typically cause simple local changes in the nucleotide sequence), and (2) radia- 
tion such as x-rays (which typically cause chromosome breaks and transloca- 
tions) or UV light (which causes specific DNA base alterations). We shall discuss 
these carcinogenic agents in detail later. 

The conclusion that the development of cancer depends on somatic muta- 
tions is further supported by the finding that people who have inherited a 
genetic defect in one of several DNA repair mechanisms, causing their cells to 
accumulate mutations at an elevated rate, show a strong predisposition to 
develop cancer. People with the disease xeroderma pigmentosum, for example, 
have defects in the system that repairs DNA damage induced by UV light, and 
they have a greatly increased incidence of skin cancers. Likewise, mice lacking 
specific DNA repair genes are abnormally prone to cancer. 



Figure 20-5 The translocation between chromosomes 9 and 22 
responsible for chronic myelogenous leukemia. The normal structures of 
chromosomes 9 and 22 are shown at the left. When a translocation occurs 
between them at the indicated site, the result is the abnormal pair at the 
right. The smaller of the two resulting abnormal chromosomes (22q") is 
called the Philadelphia chromosome, after the city where the abnormality 
was first recorded. 
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A Single Mutation Is Not Enough to Cause Cancer 

An estimated 10 16 cell divisions occur in a normal human body in the course of 
a typical lifetime; in a mouse, with its smaller number of cells and its shorter life 
span, the number is about 10 12 . Even in an environment that is free of mutagens, 
mutations would occur spontaneously at an estimated rate of about 10~ 6 muta- 
tions per gene per cell division — a value set by fundamental limitations on the 
accuracy of DNA replication and repair (see p. 263). Thus, in a typical lifetime, 
every single gene is likely to have undergone mutation on about 10 10 separate 
occasions in a human, or on about 10 6 occasions in a mouse. Among the result- 
ing mutant cells, we might expect a large number that have sustained deleteri- 
ous mutations in genes that regulate cell growth and division, causing the cells 
to disobey the normal restrictions on cell proliferation. From this point of view, 
the problem of cancer seems to be not why it occurs, but why it occurs so infre- 
quently 

Clearly, if a single mutation were enough to convert a typical healthy cell 
into a cancer cell that proliferates without restraint, we would not be viable 
organisms. Many lines of evidence indicate that the development of a cancer 
typically requires that a substantial number of independent, rare genetic acci- 
dents occur in the lineage of one cell. One comes from epidemiological studies 
of the incidence of cancer as a function of age (Figure 20-7). If a single mutation 
were responsible, occurring with a fixed probability per year, the chance of 
developing cancer in any given year of life should be independent of age. In fact, 
for most types of cancer, the incidence rises steeply with age — as would be 
expected if cancer is caused by a progressive accumulation of random muta- 
tions in a single lineage of cells. 

Now that many of the specific mutations responsible for the development of 
cancer have been identified, we can test directly for their presence in the 
genome of a particular type of cancer cell. Such tests indicate that a human 
malignant cell generally harbors many mutations. Animal models confirm that 
a single genetic alteration is insufficient to cause cancer: in genetically engi- 
neered mice carrying one cancer-causing mutation, the disease occurs only 



Figure 20-7 Cancer incidence as a function of age. The number of newly 
diagnosed cases of colon cancer in women in England and Wales in 1 year 
is plotted as a function of age at diagnosis and expressed relative to the 
total number of individuals in each age group. The incidence of cancer 
rises steeply as a function of age. If only a single mutation were required to 
trigger the cancer and this mutation had an equal chance of occurring at 
any time, the incidence would be independent of age. (Data from C. Muir 
etal., Cancer Incidence in Five Continents, Vol. V. Lyon: International 
Agency for Research on Cancer, 1987.) 



Figure 20-6 Evidence from X-inactivation 
mosaics that demonstrates the 
monoclonal origin of cancers. (A) As a 

result of a random process that occurs in 
the early embryo, practically every normal 
tissue in a woman's body is a mixture of 
cells with different X chromosomes 
heritably inactivated (indicated here by 
the mixture of red cells and blue cells in 
the normal tissue). (B) When the cells of a 
cancer are tested for their expression of an 
X-linked marker gene (a specific form of 
the enzyme G6PD), however, they are 
usually all found to have the same 
X chromosome inactivated, as shown in 
(C). This implies that they are all derived 
from a single cancerous founder cell. 
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after a lag period of months and arises only in a small scattered subset of cells in 
the body, implying that other genetic events are necessary for cancer to develop. 
Furthermore, engineered mice with mutations in more than one cancer-critical 
gene develop cancer faster than mice with single mutations, demonstrating that 
multiple genetic events in a single cell can cooperate to promote cancerous cell 
growth and the development of a tumor. 



Cancers Develop Gradually from Increasingly Aberrant Cells 

For those cancers known to have a specific external cause, the disease does not 
usually become apparent until long after exposure to the causal agent. The inci- 
dence of lung cancer, for example, does not begin to rise steeply until after 20 
years of heavy smoking. Similarly, the incidence of leukemias in Hiroshima and 
Nagasaki did not show a marked rise until about 5 years after the explosion of 
the atomic bombs, and industrial workers exposed for a limited period to chem- 
ical carcinogens do not usually develop the cancers characteristic of their occu- 
pation until 10, 20, or even more years after the exposure (Figure 20-8). During 
this long incubation period, the prospective cancer cells undergo a succession 
of genetic and epigenetic changes. The same seems to apply to cancers where 
the initial genetic lesion has no such obvious external cause. 

The concept that the development of a cancer requires a gradual accumula- 
tion of mutations in a number of different genes — different for different cancers, 
but usually at least five — helps to explain the well-known phenomenon of 
tumor progression, whereby an initial mild disorder of cell behavior evolves 
gradually into a full-blown cancer. Chronic myelogenous leukemia provides a 
clear example. It begins as a disorder characterized by a nonlethal overproduc- 
tion of white blood cells and continues in this form for several years before 
changing into a much more rapidly progressing illness that usually ends in death 
within a few months. In the early chronic phase, the leukemic cells are distin- 
guished mainly by the chromosomal translocation (the Philadelphia chromo- 
some) mentioned previously, although there may well be other less visible 
genetic or epigenetic changes. In the subsequent acute phase, cells that show 
not only the translocation but also several other chromosomal abnormalities 
overrun the hemopoietic (blood-forming) system. It appears as though cells of 
the initial mutant clone have undergone further mutations that make them pro- 
liferate even more extensively, so that they come to outnumber both the normal 
blood cells and the ancestor cells that have only the primary chromosomal 
translocation. 

Carcinomas and other solid tumors are thought to evolve in a similar way. 
Although many such cancers in humans are not diagnosed until a relatively late 
stage, in some cases it is possible to observe the earlier steps. Cancers of the 
uterine cervix (the neck of the womb) are one such example, because of the rou- 
tine screening for this cancer by cervical scraping. 
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Figure 20-8 Delayed onset of cancer 
following exposure to a carcinogen. 

The graph shows the length of the delay 
before onset of bladder cancer in a set of 
78 workers in the chemical industry who 
had been exposed to the carcinogen 
2-naphthylamine, grouped according to 
the duration of their exposure. (Modified 
from J. Cairns, Cancer: Science and 
Society. San Francisco: W.H. Freeman, 
1 978. After M.H.C. Williams, in Cancer, 
Vol. Ill [R.W. Raven, ed.]. London: 
Butterworth Heinemann, 1958.) 
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Figure 20-9 The stages of progression in the development of cancer of the epithelium of the uterine cervix. 

Pathologists use standardized terminology to classify the types of disorders they see, so as to guide the choice of 
treatment. (A) In the normal stratified squamous epithelium, dividing cells are confined to the basal layer. (B) In low-grade 
intraepithelial neoplasia, dividing cells can be found throughout the lower third of the epithelium; the superficial cells are 
still flattened and show signs of differentiation, but this is incomplete. (C) In high-grade intraepithelial neoplasia, cells in all 
the epithelial layers are proliferating and exhibit defective differentiation. (D) True malignancy begins when the cells move 
through or destroy the basal lamina that underlies the basal layer of epithelium and invade the underlying connective 
tissue. (Photographs courtesy of Andrew J. Connolly.) 



Cervical Cancers Are Prevented by Early Detection 

The epithelium covering the cervix is initially organized as a stratified (multilay- 
ered) squamous epithelium (Figure 20-9A), which is similar in structure to the 
epidermis of the skin (see Figure 23-3). In such stratified epithelia, cell prolifer- 
ation normally occurs only in the basal layer, generating cells that stop dividing 
and then move out toward the surface, differentiating as they move to form flat- 
tened, keratin-rich cells that are eventually sloughed off when they reach the 
surface. When specimens of cervical epithelium from different women are 
examined, however, it is not unusual to find patches in which this organization 
is disturbed in a way that suggests the beginnings of a cancerous transforma- 
tion. Pathologists describe these changes as intraepithelial neoplasia, and clas- 
sify them as low-grade (mild) or high-grade (moderate to severe). 

In the low-grade lesions, the undifferentiated dividing cells are no longer 
confined to the basal layer but occupy other layers in the lower third of the 
epithelium; although differentiation proceeds in the upper epithelial layers, it 
is somewhat disordered (Figure 20-9B). Left alone, most of these mild lesions 
will spontaneously regress, but about 10% progress to become high-grade 
lesions, in which most or all of the epithelial layers are occupied by undifferen- 
tiated dividing cells, which are usually highly variable in size and shape. Abnor- 
mal mitotic structures are frequently seen, and the karyotype (the display of the 
full set of chromosomes) is usually abnormal. But the aberrant cells are still 
confined to the epithelial side of the basal lamina (Figure 20-9C). At this stage, 
it is still easy to cure the condition by destroying or surgically removing the 
abnormal tissue. Fortunately, the presence of such lesions can be detected by 
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scraping off a sample of cells from the surface of the cervix and viewing it under 
the microscope (the "Pap smear" technique — Figure 20-10), thereby saving 
many lives. 

Without treatment, the abnormal patch of tissue may simply persist and 
progress no further or may even regress spontaneously In at least 30-40% of 
cases, however, progression will occur, giving rise, over a period of several years, 
to a frank invasive carcinoma (see Figure 20-9D): the cancer cells cross or 
destroy the basal lamina, invade the underlying tissue, and may metastasize to 
local lymph nodes via lymphatic vessels. Surgical cure becomes progressively 
more difficult as the invasive growth spreads. 



Figure 20-1 Photographs of cells 
collected by scraping the surface of the 
uterine cervix (the Papanicolaou or "Pap 
smear" technique). (A) Normal: the cells 
are large and well differentiated, with 
highly condensed nuclei. (B) Precancerous 
lesion: differentiation and proliferation are 
abnormal but the lesion is not yet invasive; 
the cells are in various stages of 
differentiation, some quite immature. 
(C) Invasive carcinoma: the cells all appear 
undifferentiated, with scanty cytoplasm 
and a relatively large nucleus. For all three 
panels, debris in the background includes 
some white blood cells. (Courtesy of 
Winifred Gray.) 



Tumor Progression Involves Successive Rounds of Random 
Inherited Change Followed by Natural Selection 

From all the evidence, it seems that cancers arise by a process in which an initial 
population of slightly abnormal cells, descendants of a single abnormal ances- 
tor, evolve from bad to worse through successive cycles of random inherited 
change followed by natural selection. At each stage, one cell acquires an addi- 
tional mutation or epigenetic change that gives it a selective advantage over its 
neighbors, making it better able to thrive in its environment — an environment 
that, inside a tumor, may be harsh, with low levels of oxygen, scarce nutrients, 
and the natural barriers to growth presented by the surrounding normal tissues. 
The offspring of the best-adapted cell continue to divide, eventually taking over 
the tumor and becoming the dominant clone in the developing lesion (Figure 
20-11). Tumors grow in fits and starts, as additional advantageous inherited 
changes arise and the cells bearing them flourish. Tumor progression involves a 
large element of chance and usually takes many years, which is why the major- 
ity of us will die of causes other than cancer. 

Why are so many changes needed? Clearly large animals have had to evolve 
a very complex set of regulatory mechanisms to keep their cells in check. Without 
multiple controls, inevitable errors in the maintenance of DNA sequences would 

Figure 20-1 1 Clonal evolution. In this schematic diagram, a tumor 
develops through repeated rounds of mutation and proliferation, giving 
rise eventually to a clone of fully malignant cancer cells. At each step, a 
single cell undergoes a mutation that either enhances cell proliferation or 
decreases cell death, so that its progeny become the dominant clone in the 
tumor. Proliferation of each clone hastens the occurrence of the next step 
of tumor progression by increasing the size of the cell population at risk of 
undergoing an additional mutation. The final step depicted here is invasion 
through the basement membrane, an initial step in metastasis. In reality, 
there are more than the three steps shown here, and a combination of 
genetic and epigenetic changes are involved. 



epithelial cells 
accidental production growing on 

of mutant cell basal lamina 



CELL PROLIFERATION 



cell with 2 mutations 



cell with 

3 mutations 

••••••• 




CELL PROLIFERATION 



• ••*•••• 



! DANGEROUS 
CELL PROLIFERATION 




CANCER AS A MICROEVOLUTIONARY PROCESS 1213 

produce numerous tumors early in life and quickly destroy any large multicellu- 
lar organism. Thus, we should not be surprised that cells employ multiple regu- 
latory mechanisms to help them maintain tight and precise control over their 
behavior, and that many different regulatory systems have to be disrupted before 
a cell can throw off its normal restraints and behave as an asocial cancer cell. The 
restraints are of many types, and tumor cells meet new barriers to further expan- 
sion at each stage of their evolution. Oxygen and nutrients, for example, do not 
become limiting until a tumor is one or two millimeters in diameter, at which 
point the cells in the tumor interior may not have adequate access to these nec- 
essary resources. Cells must acquire additional mutations, epigenetic changes (or 
both) to overcome each new barrier, whether physical or physiological. 

In general, the rate of evolution in any population of organisms on Earth 
would be expected to depend mainly on four parameters: (1) the mutation rate, 
that is, the probability per gene per unit time that any given member of the pop- 
ulation will undergo genetic change; (2) the number of reproducing individuals 
in the population; (3) the rate of reproduction, that is, the average number of 
generations of progeny produced per unit time; and (4) the selective advantage 
enjoyed by successful mutant individuals, that is, the ratio of the number of sur- 
viving fertile progeny they produce per unit time to the number of surviving fer- 
tile progeny produced by nonmutant individuals. The same types of factors are 
also crucial for the evolution of cancer cells in a multicellular organism, except 
that both genetic and epigenetic changes help drive the evolutionary process. 

The Epigenetic Changes That Accumulate in Cancer Cells Involve 
Inherited Chromatin Structures and DNA Methylation 

As just stated, the progression toward cancer differs from normal biological evolu- 
tion in one important respect: epigenetic changes also occur that give the cells a 
selective advantage. 

For many years, pathologists have used an abnormal appearance of the cell 
nucleus to identify and classify cancer cells in tumor biopsies. For example, can- 
cer cells sometimes contain an unusually large amount of heterochromatin — a 
condensed form of interphase chromatin that silences genes (see p. 238). We 
now understand some of the molecular mechanisms involved in forming this 
chromatin, and it is possible to associate new heterochromatin formation with 
the epigenetic silencing of specific genes that would otherwise block tumor pro- 
gression. 

Heterochromatin formation and maintenance involve specific covalent 
modifications of histones; these in turn attract complexes of chromatin-binding 
proteins that are stably maintained following DNA replication (discussed in 
Chapter 4). In this way, genes can be turned off in a cell-to-cell inherited man- 
ner without any change in the DNA sequence. This epigenetic form of gene reg- 
ulation plays a major role in driving the orderly pattern of cell specializations 
that occur during embryonic development (discussed in Chapter 22). Errors that 
occur in this process are potentially dangerous because they can be transmitted 
to the progeny of the cell in which the original change occurred. 

We now know that many of the mutations that make a cell cancerous alter 
the proteins that determine chromatin structures. These include not only 
enzymes that modify the histones in nucleosomes, but also proteins in the "code 
reader-writer" complexes that interpret the histone code (see Figure 4-43). 
These findings provide strong evidence for the importance of chromatin-based 
epigenetic changes in tumor progression. 

During normal developmental processes, an inherited pattern of DNA 
methylation reinforces many of the gene silencing events caused by the packag- 
ing of genes into heterochromatin (see p. 467). Analyses reveal that a large 
amount of DNA methylation also occurs on selected genes during tumor pro- 
gression. In summary, although recognized relatively recently, it now appears 
that the abnormal epigenetic silencing of genes is no less important than muta- 
tions in DNA sequences for the development of most cancers (Figure 20-12). 
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Figure 20-12 Comparison of the genetic and epigenetic changes observed in tumors. A mutation results from an 
irreversible change in DNA sequence. In contrast to such genetic changes, epigenetic changes are based on alterations 
that, while heritable from cell to cell, can be reversed either by site-specific changes in histone modification 
(heterochromatin pathway) or by site-specific DNA demethylation (methylation pathway). Because the epigenetic marks on 
genes are generally reversed during the formation of eggs and sperm, they are not inherited between generations and 
therefore have not been extensively studied by geneticists. 

As discussed in Chapter 7, it is thought that DNA methylation (an inherited pattern of methylation of C nucleotides in CpG 
sequences) is a mechanism that is used to more permanently silence genes that have already been turned off, and DNA 
methylation seems to be a silencing mechanism that normally follows the formation of heterochromatin. But the 
misregulation that arises during tumor progression may be able to produce gene silencing independently of 
heterochromatinization, as schematically illustrated here. 



Human Cancer Cells Are Genetically Unstable 

Most human cancer cells accumulate genetic changes at an abnormally rapid 
rate and are said to be genetically unstable. This instability can take various 
forms. Some cancer cells are unable to repair certain types of DNA damage or to 
correct replication errors of various kinds. These cells tend to accumulate more 
point mutations or other DNA sequence changes than do normal cells. Other 
cancer cells fail to maintain either the number or the integrity of their chromo- 
somes, and they consequently accumulate gross abnormalities in their kary- 
otype that are visible at mitosis (Figure 20-13). The genetic instability is further 
amplified when some of the DNA changes alter epigenetic control mechanisms 
in ways that produce extra heterochromatin and DNA methylation. And epige- 
netic changes by themselves, arising accidentally and independently of prior 
genetic changes, may also lead— in principle at least— to destabilization of nor- 
mal patterns of cellular inheritance by facilitating other changes either at the 
genetic or the epigenetic level. From an evolutionary perspective, none of this 
should be a surprise: anything that increases the probability of random changes 
in gene function that are heritable from one cell generation to the next is likely 
to speed the evolution of a clone of cells toward malignancy. 

Although the epigenetic changes in cancer cells have not been so thor- 
oughly analyzed, a great deal is now known about the genetic changes. Differ- 
ent tumors — even those from the same tissues— can show different kinds of 
genetic instability, caused by heritable alterations in any one of a large number 
of so-called DNA maintenance genes involved in propagating DNA or chromo- 
somes. It is rare for a person to inherit a mutation in one of these genes, but 
those people that do so have a raised incidence of cancer, confirming that a loss 
of genetic stability can cause cancer. Generally, such destabilizing mutations 
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are not inherited but arise de novo in the clone of cells in which a tumor devel- 
ops, helping the cancer cell to accumulate mutations much more rapidly than 
its neighbors. Recent analyses show that the cells in a variety of human cancers 
experience single nucleotide substitutions at a rate that is 10-20 times higher 
than the rate observed in normal cells. As a result, the sequencing of more than 
10,000 genes in a set of breast and colorectal cancers reveals that cancer cells 
have accumulated enough mutations to cause an amino acid change in the pro- 
teins of roughly 100 genes. Most are random changes that affect different genes 
in different individual tumors. But a subset of genes are found to be repeatedly 
mutated in a particular type of cancer, suggesting that alterations in as many as 
20 genes are needed to drive tumor progression. 

Genetic instability does not in itself give a cell a selective advantage. It seems 
that there is some optimum level of genetic instability for the development of 
cancer, making a cell mutable enough to evolve rapidly, but not so mutable that 
it accumulates too many harmful changes and dies. To be favored by selection, 
a genetically unstable cell must acquire properties that confer some competitive 
advantage. 



Figure 20-1 3 Chromosomes from a 
breast tumor displaying abnormalities in 
structure and number. Chromosomes 
were prepared from a breast tumor cell in 
metaphase, spread on a glass slide, and 
stained with (A) a general DNA stain or 
(B) a combination of fluorescent stains 
that color each normal human 
chromosome differently. The staining 
(displayed in false color) shows multiple 
translocations, including a doubly 
translocated chromosome (white arrow) 
made up of two pieces of chromosome 8 
(brown) and a piece of chromosome 17 
(purple). The karyotype also contains 
48 chromosomes, instead of the normal 
46. {Courtesy of Joanne Davidson and 
Paul Edwards.) 



Cancerous Growth Often Depends on Defective Control of Cell 
Death, Cell Differentiation, or Both 

Just as an increased mutation rate per cell can raise the probability of cancer, so 
can any circumstance that increases the number of proliferating cells available 
for mutating. People who are clinically obese, for example, have a strongly 
increased risk of many types of cancer, compared with people of normal weight; 
and this is presumably due, in part at least, to an increase in both the number of 
cells in the body and the rate at which these cells divide when overnourished or 
overstimulated by growth factors. The same principle applies to both cancer ini- 
tiation and cancer progression: the bigger the clone of altered cells resulting 
from an early inherited change, the greater the chance that at least one of these 
cells will undergo an additional mutation or epigenetic change that will allow 
the cancer to progress. Thus, at every stage in the development of cancer, any 
condition that helps the altered cells to increase in number favors the progres- 
sion of the tumor. 

An early mutation or epigenetic change can have this effect by increasing 
the rate at which a clone of cells proliferates, as we discuss in detail later. Such 
changes, however, are not the only — or necessarily the most important — mech- 
anism for increasing cell number. In normal adult tissues, especially those at risk 
of cancer, cells may proliferate continually; but their numbers remain steady 
because cell production is balanced by cell loss, as part of the body's homeo- 
static control mechanisms. Programmed cell death by apoptosis usually plays an 
essential part in this balance, as we discuss in Chapters 18 and 23. If too many 
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cells are generated, the rate of apoptosis increases to dispose of the surplus. One 
of the most important properties of many types of cancer cells is that they fail to 
undergo apoptosis when a normal cell would do so. This greatly contributes to 
the growth of a tumor (Figure 20-14). 

Inherited changes can also increase the size of a clone of mutant cells by 
altering their ability to differentiate, as illustrated by the situation in the uterine 
cervix, discussed earlier. When a stem cell in the basal layer divides, each daugh- 
ter cell has a choice— it can either remain a stem cell or commit to a pathway 
leading to differentiation; the committed cells initially proliferate and then stop 
dividing and terminally differentiate (the committed, proliferating cells are 
called transit amplifying cells). If the differentiation program is blocked at some 
stage, proliferating cells accumulate, contributing to the progression from low- 
grade intraepithelial neoplasia of the cervix to high-grade intraepithelial neo- 
plasia and malignant cancer (see Figure 20-9). Similar considerations apply to 
the development of cancer in other tissues that rely on stem cells, such as the 
skin, the lining of the gut, and the hemopoietic system. Several forms of 
leukemia, for example, seem to arise from a disruption of the normal program of 
differentiation, such that a committed progenitor of a particular type of blood 
cell eventually becomes able to divide indefinitely, instead of undergoing termi- 
nal differentiation in the normal way and dying after a limited number of cell 
divisions (as discussed in Chapter 23). 

Thus, the accumulation of mutations and epigenetic changes that lead to 
defects in the normal controls of cell division, apoptosis, and differentiation can 
all contribute to the development and progression of cancers. 



Figure 20-14 Both increased cell 
division and decreased apoptosis can 
contribute to tumorigenesis. In normal 
tissues, apoptosis balances cell division to 
maintain homeostasis. <GCCC> During 
the development of cancer, either an 
increase in cell division or an inhibition of 
apoptosis can lead to the increased cell 
numbers important for tumorigenesis. 
Cells fated to undergo apoptosis are 
shown in gray. 



Cancer Cells Are Usually Altered in Their Responses to DNA 
Damage and Other Forms of Stress 

As just discussed, many normal cells permanently stop dividing when they dif- 
ferentiate into specialized cells. Differentiation, however, is not the only reason 
that proliferating cells stop dividing; they can also do so in response to stress or 
to damage to their DNA. As described in Chapter 17, normal cells contain a set of 
checkpoint control mechanisms. These arrest the cell cycle temporarily when 
something goes awry, providing time to correct the problem. Chromosome 
breakage and other types of damage to DNA generate intracellular signals that 
activate these checkpoint mechanisms, causing a normal cell to halt its cycle, and 
thereby giving it a chance to repair the damage before it attempts to progress 
through the cycle and divide. And if the damage is irreparable, a normal cell, 
rather than forge ahead and generate daughters with a damaged genome, will 
either permanently withdraw from the cell cycle or commit suicide by apoptosis. 
Cancer cells usually acquire mutations and epigenetic changes that inacti- 
vate such responses to DNA damage. From the selfish standpoint of the cancer 
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cells, the loss of these restraints confers an advantage, allowing them to con- 
tinue to multiply even with damaged DNA. At the same time, this accelerates the 
rate at which mutations accumulate in the tumor cell population and can 
thereby hasten the progression to greater malignancy. The cancer cells are also 
often defective in other checkpoint control mechanisms that help regulate the 
cell cycle. Thus, they wiil often stagger on through the cycle when tilings go 
wrong, and in this way will do themselves still further genetic damage, con- 
tributing to the chromosome abnormalities that they often display Even though 
such genetically damaged ceils usually take longer to complete each cell cycle 
than normal cycling cells, tiiey accumulate over time because diey display a 
greater propensity to proliferate. It is perhaps remarkable that these cells can 
manage to ou [compete their normal neighbors when they are so messed up, 

Human Cancer Cells Escape a Built-in Limit to Cell Proliferation 

Many normal human cells have a built-in limit to the number of times they can 
divide when stimulated to proliferate in culture: they permanently stop dividing 
after a certain number of population doublings (25-50 for human fibroblasts, for 
example), a phenomenon termed replicative cell senescence (discussed in 
Chapter 17). This cell-divisinn-counting mechanism generally depends on the 
progressive shortening of the telomeres at the ends of chromosomes, which 
eventually changes their structure. As discussed in Chapter 5, the replication of 
telomere DNA during S phase depends on the enzyme telomerase, which main- 
tains the special telomeric DNA sequence and promotes the formation of pro- 
tein cap structures that protect the chromosome ends. Because many prolifer- 
ating human cells (but not stem cells) are deficient in telomerase, their telom- 
eres shorten with every division, and their protective caps deteriorate. Eventu- 
ally the altered chromosome ends trigger a permanent cell-cycle arrest. 

The cell-division-counting mechanism just described sets a physiological 
limit to cell proliferation for most of the types of cells in the body. Human can- 
cer cells must somehow avoid or overcome this barrier to form large tumors. 
Some rodent cells, by contrast, maintain telomerase activity and normal telom- 
eres as they proliferate and therefore lack this barrier. It has been proposed that 
humans need replicative cell senescence to help prevent cancer because our 
comparatively long lifespan would otherwise provide enormous opportunities 
for tumor progression. 

Human cancer cells avoid replicative cell senescence in two ways. First, 
they acquire genetic and epigenetic changes that disable the checkpoint con- 
trol so that the cells continue to cycle even when telomeres become uncapped. 
Mutations that inactivate the p53 pathway have this effect and are very com- 
mon in cancer cells, as we shall discuss below. As another strategy to escape 
replicative senescence, cancer cells often maintain telomerase activity as they 
proliferate, so that their telomeres do not shorten or become uncapped. In 
some cases, the cancer cells may maintain telomerase activity because the can- 
cer originated in stem cells that have this activity. In other cases, although the 
cancer originated in cells without appreciable telomerase activity, the cancer 
cells have acquired the activity as a result of genetic or epigenetic changes that 
were selected for as their telomeres shortened. Yet other cancer cells evolve 
alternate mechanisms for elongating chromosome ends. Regardless of the 
strategy used, the result is that the cancer cells continue to proliferate under 
conditions when normal cells would stop. 

A Small Population of Cancer Stem Cells Maintains Many Tumors 

As we discuss in Chapter 23, in a normal adult cell lineage with substantial cell 
turnover, such as the hemopoietic cell lineage, the cells are typically organized 
as a hierarchy: rare, slowly dividing stem cells produce cells (the transit amplify- 
ing cells) that proliferate extensively and eventually differentiate into the spe- 
cialized cells characteristic of the lineage. The stem cells maintain the lineage by 
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producing some daughter cells that can remain stem cells (a process called self- 
renewal), while producing others that commit to a pathway leading to differen- 
tiation. There is increasing evidence that many cancers are organized in the 
same hierarchical way with rare cancer stem cells at the top of the hierarchy 
responsible for maintaining the population of cells in a tumor. These stem cells 
are capable of indefinite self- renewal, but they also give rise to rapidly dividing 
transit amplifying cells that have a limited capacity for self-renewal. It has only 
recently become widely recognized that these cells with a limited growth poten- 
tial make up the vast majority of the cells in many cancers. 

It has been known for over 40 years that there is only a small chance — usu- 
ally much less than 1%— that a randomly chosen individual cell from a cancer 
will be able to generate a new tumor when tested for its ability to do so— for 
example, by implanting it into an immunodeficient mouse. New technologies 
for sorting cells have shown that, in many cases, this is because the cancer cells 
are heterogeneous, and only a small subset of them have the special property 
needed for tumor propagation. One way to enrich the small population of pre- 
sumptive cancer stem cells in some cancers uses a cell-permeable fluorescent 
dye and a fluorescence-activated cell sorter (see Figure 8-2). When a cell sus- 
pension prepared from a normal tissue is exposed to the dye and then sorted in 
this way, the stem cells of the tissue are often found to be highly enriched in a 
small side population of weakly fluorescent cells. The stem cells are only weakly 
fluorescent because they have an ABC transporter (discussed in Chapter 11) in 
their plasma membrane that actively pumps the dye out of the cell. 

When some cancers are analyzed in the same way, a small side population 
with an ABC transporter is found to contain most, if not all, of the cells capable 
of forming new cancers after transplantation into immunodeficient mice. 
Remarkably, this is also the case for some cancer cell lines that have been prop- 
agated for more than thirty years in culture: whereas the side population of each 
of these lines readily forms tumors after transplantation, the remaining highly 
fluorescent population, which includes the vast majority of the cells in the line, 
does not (Figure 20-15). Moreover, when the side population isolated from such 
a cancer cell line is returned to culture, a small side population rapidly reappears 
and is maintained indefinitely, whereas a side population does not develop in 
cultures of the remaining cells. 

In other studies, cell fractionations using monoclonal antibodies that recog- 
nize normal stem cells in the tissue of origin of the cancer have given similar 
results. 




red light emission — »- 

Figure 20-1 5 Cell sorting reveals cancer 
stem cells in a small side population of 
cells. In this example, the cells in an 
established cell line obtained from a rat 
glioma tumor were treated with a dye that 
can pass through the plasma membrane 
(Hoechst 33342) and then analyzed by cell 
sorting. About 0.4% of the cells pump out 
the dye and are therefore only lightly 
stained, as indicated in yellow. 
Experiments comparing these side 
population (SP) cells with the remaining 
cells showed that only the side population 
cells could proliferate extensively and form 
tumors. Moreover, whereas the SP cells 
produced both SP and non-SP cells, the 
non-SP cells could not produce SP cells. 
(Adapted from T. Kondo, T. Setoguchi and 
T.Taga, Proc. Natl Acad. Sci. U.S.A. 
1 01 :781 -786, 2004. With permission from 
National Academy of Sciences.) 



How Do Cancer Stem Cells Arise? 

The finding that many cancers are maintained by a small population of cancer 
stem cells has extremely important implications, both for understanding can- 
cer and for treating it, and it raises a number of intriguing questions. How do 
these stem cells arise? What is the nature of the cancer stem cell, and what dis- 
tinguishes it in molecular terms from the majority of cells in the cancer? From 
the results just described with cultured cancer cell lines, many of these differ- 
ences must result from epigenetic rather than from genetic changes. 

It is clear for some tumors that the cancer originates in a normal tissue stem 
cell, which gradually accumulates the mutations and epigenetic changes 
responsible for the asocial behavior of the cancer. For example, this sequence of 
events is almost certainly the way chronic myelogenous leukemias arise. Thus, 
when the multipotential hemopoietic stem cells (see p. 1456) are purified from 
the bone marrow of these patients, many contain the Philadelphia chromo- 
some, with the translocation characteristic of this cancer. Likewise, many 
human cancers probably arise from normal stem cells, because they develop in 
epithelia that undergo substantial cell turnover. In these tissues, only stem cells 
remain in the body and proliferate for long enough to accumulate the number 
of mutations required to develop into a cancer. 

The second way that a cancer stem cell can arise is through a change in a 
proliferating cell that is more differentiated, such as a transit amplifying cell. The 
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Figure 20-1 6 Cancers may arise from 
cancer stem cells. Cancer stem cells are 
defined as those cancer cells that can 
self-renew to produce additional 
malignant stem cells, and at the same 
time generate non-tumorigenic cells such 
as transit amplifying cells. Cancer stem 
cells can arise from normal stem cells that 
have sustained mutations to make them 
cancerous; or from more differentiated 
cells that have undergone mutations or 
epigenetic changes that give them stem 
cell properties. 
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changes must confer the two crucial stem-cell properties: the ability to be 
retained in the body, and the capacity for prolonged self-renewal. Then, and 
only then, the altered cell will have the time to accumulate the other mutations 
and epigenetic changes needed to become a full-blown cancer cell. Some B lym- 
phocyte leukemias, for example, seem to arise in this way. All of the cells in such 
a leukemia contain the same genetic rearrangements in their antibody genes 
(discussed in Chapter 25), suggesting that the cancer originated in a committed 
B lymphocyte precursor cell, rather than in an uncommitted stem cell. Thus, it 
seems likely that, while most human cancers arise from a stem cell, others arise 
from a cell that has acquired the capacity for sustained self-renewal through 
mutation, epigenetic change, or both (Figure 20-16). 

Most current cancer therapies such as radiation and cytotoxic drugs are 
likely to preferentially kill the most rapidly proliferating cancer cells in a tumor. 
Because the cancer stem cells usually divide much more slowly, they may be less 
sensitive to these treatments. Moreover, in the 1970s, it was proposed that evo- 
lution would have selected for mechanisms that protect stem cells from accu- 
mulating mutations. In particular, since many mutations arise during DNA 
replication, could each division of a stem cell utilize a special mechanism that 
allows the old DNA template strand in each chromosome to be segregated into 
the one daughter cell that will remain a stem cell? Recent experiments suggest 
that stem cells, and only stem cells, can indeed coordinate the segregation of 
their chromosomes in this way (see Figure 23-16). If this is also the case for can- 
cer stem cells, it could make them unusually resistant to many treatments that 
kill other cells. 

If the stem cells are not eradicated, they will very likely regenerate the can- 
cer. This may be a major reason why cancers commonly recur after a dramatic 
initial response to therapy. To cure cancers, it is clearly crucial to find better ways 
to target the cancer stem cells and to kill them. 
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To Metastasize, Malignant Cancer Cells Must Survive and 
Proliferate in a Foreign Environment 

Metastasis is the most deadly— and least understood— aspect of cancer, being 
responsible for 90% of cancer- associated deaths. By spreading throughout the 
body a cancer becomes almost impossible to eradicate by either surgery or 
localized irradiation. Metastasis is itself a multistep process: the cancer cells 
have to invade local tissues and vessels, move through the circulation, leave the 
vessels, and then establish new cellular colonies at distant sites. Each of these 
events is, in itself, complex, and most of the molecular mechanisms involved are 
not yet clear. 

For a cancer cell to metastasize, it must break free of constraints that keep 
normal cells in their proper places and prevent them from invading neighboring 
tissues. Invasiveness is thus one of the defining properties of malignant tumors, 
which show a disorganized pattern of growth and ragged borders, with exten- 
sions into the surrounding tissue (see, for example, Figure 20-9). Although the 
underlying molecular changes are not well understood, invasiveness almost cer- 
tainly requires a disruption of the adhesive mechanisms that normally keep cells 
tethered to their proper neighbors and to the extracellular matrix. As we discuss 
later, for carcinomas, this change resembles the epithelial-to-mesenchymal tran- 
sition (EMT), which occurs in some epithelial tissues during normal develop- 
ment (see Figure 19-12). 

The next step in metastasis — the establishment of colonies in distant 
organs — is a complex, slow, and inefficient operation; few cells achieve it. Before 
it can metastasize successfully, a cell must penetrate a blood vessel or a lym- 
phatic vessel by crossing the basal lamina and the endothelial lining of the ves- 
sel so as to enter the blood or lymph, exit from a vessel elsewhere in the body, 
and grow in the new site, forming at first a small clump of cells known as a 
micrometastasis. To complete the metastatic process, some of these 
micrometastases must produce cells that survive and proliferate extensively in 
the new environment, a difficult process known as colonization (Figure 20-17). 

The vasculature that grows into tumors is leaky, and it seems that many of 
the cells in a tumor manage to escape into lymphatic or blood vessels. However, 
experiments show that only a tiny proportion of these, fewer than one in thou- 
sands, perhaps one in millions, survive to form anything more than 
micrometastases. Many cancers are discovered before they have managed to 
found metastatic colonies and can be cured if the primary tumor is removed. 

While it is possible in principle that each cancer cell entering the bloodstream 
might have the same tiny chance of surviving, settling, and growing in an alien 
site, this seems unlikely. More likely, the low rate of colonization reflects the rarity 
of cells that have the special properties needed. That is, it may be that only a very 
small population of the cells in a typical cancer have both the stem-cell character 
that will enable them to divide without limit, and the ability to setde and survive 
in an alien environment. The rarity of cancer stem cells would explain why so few 
cells acheive colonization, even after forming a micrometastasis. 

The need for special survival capabilities is likely to be an additional 
restraint. Normal cells depend on extracellular survival signals that are relatively 
abundant in their normal environment; when deprived of these signals, they 
activate their cell death machinery and undergo apoptosis (discussed in Chap- 
ter 18). Cancer cells capable of metastasis are often resistant to apoptosis com- 
pared with normal cells and therefore can more readily survive after escaping 
from their proper home. They are also less dependent on signals from other cells 
to grow and divide. 



Tumors Induce Angiogenesis 

In addition to all the requirements just described, to grow large, a tumor must 
recruit an adequate blood supply to ensure that it gets sufficient oxygen and 
nutrients. Thus, angiogenesis, the formation of new blood vessels, is required for 
tumor growth beyond a certain size. Like normal tissues, tumors attract a blood 
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Figure 20-1 7 Steps in the process of metastasis. This example illustrates the spread of a tumor from an 
organ such as the bladder to the liver. Tumor cells may enter the bloodstream directly by crossing the wall of 
a blood vessel, as diagrammed here, or, more commonly perhaps, by crossing the wall of a lymphatic vessel 
that ultimately discharges its contents (lymph) into the bloodstream. Tumor cells that have entered a 
lymphatic vessel often become trapped in lymph nodes along the way, giving rise to lymph-node 
metastases. Studies in animals show that typically far fewer than one in every thousand malignant tumor 
cells that enter the bloodstream will colonize a new tissue so as to produce a detectable tumor at a new site. 



supply by secreting angiogenic signals. These signals are produced in response to 
hypoxia, which begins to affect the cells as the tumor enlarges beyond a millime- 
ter or two in diameter. The hypoxia activates an angiogenic switch to increase the 
blood supply by provoking an increase in the level of Hypoxia Inducible Factor- la 
(HIF-la), a gene regulatory protein described in Chapter 23; this protein, in turn, 
activates the transcription of genes that encode pro-angiogenic factors, such as 
vascular endothelial growth factor (VEGF). These are secreted proteins that 
attract endothelial cells and stimulate the growth of new blood vessels (see Fig- 
ure 23-34) . The vessels not only help supply the tumor with nutrients and oxygen, 
but also provide an escape route for its cells to metastasize. 

However, the induced vessels are tortuous, heterogeneous in diameter, and 
leaky, and they have many branches with dead ends. These abnormalities, which 
probably result from an abnormal balance of signaling molecules, lead to irreg- 
ular blood flow within the tumor, helping to create additional regions of hypoxia 
(Figure 20-18). Hypoxia, in turn, selects for mutant cancer cells that are better 
able to survive in these harsh and stressful conditions, which usually means that 
they are more malignant. Ultimately, however, tumor growth relies on an ade- 
quate blood supply, and the abnormal vessels that are attracted into tumors are 
one obvious target for drug therapy, as we discuss later. 
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The Tumor Microenvironment Influences Cancer Development 

As mentioned earlier, carcinomas are complex entities, containing many cell 
types in addition to the cancer cells. While the cancer cells are grossly abnormal, 
being the bearers of mutations in these tumors, the other cells in the tumor— 
especially those of the supporting connective tissue, or stroma — are far from 
innocent bystanders. This is because the development of a tumor relies on 
cross-talk between the tumor cells and the tumor stroma, just as the normal 
development of epithelial organs relies on cross-talk between epithelial cells 
and mesenchymal cells (discussed in Chapter 22). 

The stroma provides the framework for the tumor and is composed of normal 
connective tissue containing fibroblasts, myofibroblasts, inflammatory white 
blood cells, the endothelial cells of blood and lymphatic vessels, and their atten- 
dant pericytes and smooth muscle cells (Figure 20-19). As a carcinoma pro- 
gresses, the cancer cells induce changes in the stroma by secreting both signal 
proteins that alter the behavior of the stromal cells and proteolytic enzymes that 
modify the extracellular matrix. The stromal cells in turn act back on the tumor 
cells in various ways. They secrete signal proteins that stimulate cancer cell 
growth and division, and they secrete proteases that further remodel the extra- 
cellular matrix. In these ways, the tumor and its stroma evolve together, and the 
tumor becomes dependent on its particular stromal cells. Experiments in mice 
indicate that the growth of some transplanted carcinomas depends on the 
tumor-associated fibroblasts, and normal fibroblasts will not do. As a second 
example, the growth of skin tumors is significantly retarded in mice lacking a spe- 
cific matrix metalloproteinase that is normally secreted by stromal mast cells; the 
protease is responsible for releasing angiogenic factors from the extracellular 
matrix that stimulate the angiogenesis required for the skin tumor to grow. These 
and other results suggest that cancer treatments could be designed to inhibit 
activated stromal cells, in addition to targeting the tumor cells themselves. 



Figure 20-18 Colon adenocarcinoma 
metastasis in the lung. This tissue slice 
shows well-differentiated colorectal 
cancer cells, still forming cohesive glands 
in the lung. This metastasis has central 
pink areas of necrosis where the cancer 
cells have outgrown their blood supply. 
(Courtesy of Andrew J. Connolly.) 
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Figure 20-1 9 The tumor microenvironment 
plays a role in tumorigenesis. Tumors 
consist of many cell types including cancer 
cells, vascular epithelial cells, fibroblasts, and 
inflammatory white blood cells. Cross-talk 
between these and other cell types plays an 
important part in tumor development. 
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Many Properties Typically Contribute to Cancerous Growth 

Clearly, to produce a cancer, a cell must acquire a range of aberrant properties — 
a collection of subversive new skills — as it evolves. Different cancers require dif- 
ferent combinations of these properties. Nevertheless, we can draw up a short 
list of the key behaviors of cancer cells in general: 

1. They are more self-sufficient than normal cells for their growth and prolif- 
eration. For example, unlike most normal cells, they can often survive and 
proliferate in cell culture even when not adherent to a substratum and 
floating free in suspension. 

2. They are relatively insensitive to anti-proliferative extracellular signals. 

3. They are less prone than normal cells to undergo apoptosis. 

4. They are defective in the intracellular control mechanisms that normally 
stop cell division permanently in response to stress (such as hypoxia) or 
DNA damage. 

5. They induce help from the normal stromal cells in their local environment. 

6. They induce angiogenesis. 

7. They escape from their home tissues (that is, they are invasive) and survive 
and proliferate in foreign sites (that is, they metastasize). 

8. They are genetically unstable. 

9. They either produce telomerase, or acquire another way of stabilizing their 
telomeres. 

In the later sections of the chapter, we examine the mutations and molecu- 
lar mechanisms that underlie some of these properties, as well as the environ- 
mental factors that promote the development of cancer. 

Summary 

Cancer cells, by definition, grow and proliferate in defiance of normal controls (that is, 
they are neoplastic) and are able to invade surrounding tissues and colonize distant 
organs (that is, they are malignant). By giving rise to secondary tumors, or metastases, 
they become difficult to eradicate by surgery or local irradiation. Cancers are thought to 
originate from a single cell that has experienced an initial mutation, but the progeny of 
this cell must undergo many further changes, requiring numerous additional muta- 
tions and epigenetic events, to become cancerous. The cell of origin of a cancer may be 
either a tissue stem cell, which is already endowed with the ability to self-renew indefi- 
nitely, or a more differentiated cell that acquires the capacity for indefinite self-renewal 
Tumor progression usually takes many years and reflects the operation of a Darwinian- 
like process of evolution, in which somatic cells undergo mutation and epigenetic 
changes accompanied by natural selection. 

Cancer cells acquire a variety of special properties as they evolve, multiply, and 
spread. These include alterations in cell signaling pathways, enabling the cells to 
ignore the signals from their environment that normally keep cell proliferation under 
tight control. As part of the evolutionary process of tumor progression, cancer cells 
acquire defects in differentiation and in the control mechanisms that permanently 
stop cell division or induce apoptosis in response to cell stress or DNA damage. All of 
these changes increase the ability of cancer cells to survive, grow, and divide in their 
original tissue and then to metastasize — which requires survival and proliferation in 
foreign environments. However, the evolution of a tumor does not simply rely on 
changes in the cancer cells themselves; it also depends on other cells present in the 
tumor microenvironment, collectively called stromal cells, including new blood vessels 
that allow the tumor to grow large and metastasize via the bloodstream. 

Since many mutations and epigenetic changes are needed to confer this collection 
of asocial behaviors, it is not surprising that nearly all cancer cells are genetically 
unstable. The genetic instability can arise from defects in the ability to repair DNA 
damage or to correct replication errors of various kinds, which lead to changes in DNA 
sequence. Also common are defects in chromosome segregation during mitosis, which 
lead to chromosome instability and changes in karyotype. This genetic instability is 
selected for in the clones of aberrant cells that are able to produce tumors, because it 
greatly accelerates the accumulation of the further genetic and epigenetic changes that 
are required for tumor progression. 
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THE PREVENTABLE CAUSES OF CANCER 

The development of a cancer generally requires many steps, each governed by 
multiple factors — some dependent on the genetic constitution of the individual, 
others dependent on his or her environment and way of life. A certain irre- 
ducible background incidence of cancer is to be expected regardless of circum- 
stances: mutations can never be absolutely avoided because they are an 
inescapable consequence of fundamental limitations on the accuracy of DNA 
replication and repair, as discussed in Chapter 5. If a human could live long 
enough, it is inevitable that at least one of his or her cells would eventually accu- 
mulate a set of mutations sufficient for cancer to develop. 

Nevertheless, there is evidence that environmental factors, including the 
food we eat, accelerate the onset of most cancers. This is demonstrated most 
clearly by a comparison of cancer incidence in different countries: for almost 
every cancer that is common in one country, there is another country where the 
incidence is much lower. Because migrant populations tend to adopt the pat- 
tern of cancer incidence typical of the host country, the differences appear to 
be due mostly to environmental, not genetic, factors. From such findings, it is 
estimated that 80-90% of cancers should be avoidable, or at least postponable 
(Figure 20-20). 

Unfortunately, different cancers have different environmental risk factors, 
and a population that escapes one such danger is usually exposed to another. 
This is not, however, inevitable. There are some human subgroups whose way of 
life substantially reduces the total cancer death rate among individuals of a 
given age. Under the current conditions in the United States and Europe, 
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Figure 20-20 Cancer incidence is related 
to environmental influences. (A) This map 
of the world shows the rates of cancer 
increasing {red arrows) or decreasing {blue 
arrows) when specific populations move 
from one location to another. These 
observations suggest the importance of 
environmental factors, including diet, in 
dictating cancer risk. (B) Some estimated 
effects of environment and lifestyle on 
cancer in the United States. The table 
shows the percentage of total cancer cases 
attributable to each specified factor. 
(B, adapted from Cancer Facts and Figures, 
American Cancer Society, 1990. With 
permission from American Cancer Society.) 
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approximately one in five people will die of cancer. But the incidence of cancer 
among strict Mormons in Utah (but, importantly, not among non-practicing 
members of the same family), who avoid alcohol, coffee, cigarettes, drugs, and 
casual sex, is only about half the incidence among Americans in general. Cancer 
incidence is also low in certain relatively affluent populations in Africa. 

Although such observations on human populations indicate that cancer can 
often be avoided, it has been difficult in most cases — with the exception of 
tobacco — to identify the specific environmental risk factors or to establish how 
they act. We will first look at what has been learned about the external agents 
that are known to cause cancer. We will then consider some of the triumphs, and 
difficulties, in finding ways to prevent human cancer. We will discuss the treat- 
ment of cancer in the last section, after we have examined the molecular biology 
of the disease. 



Many, But Not All, Cancer-Causing Agents Damage DNA 

The agents that can cause cancer, known as carcinogens, are many and varied, 
but the easiest to understand are those that damage DNA and so generate muta- 
tions. These cancer- causing mutagens include chemicals and various forms of 
radiation — UV light from sunshine, as well as ionizing radiation, such as y rays 
and a particles from radioactive decay. 

Many quite disparate chemicals are carcinogenic when they are fed to 
experimental animals or painted repeatedly on their skin. Examples include a 
range of aromatic hydrocarbons and derivatives of them such as aromatic 
amines, nitrosamines, and alkylating agents such as mustard gas. Although 
these chemical carcinogens have diverse structures, they have at least one com- 
mon property — they cause mutations. In one common test for mutagenicity, the 
carcinogen is mixed with an activating extract prepared from rat liver cells (to 
mimic the biochemical processing that occurs in an intact animal — discussed 
below) and is added to a culture of specially designed test bacteria; the resulting 
mutation rate of the bacteria is then measured (Figure 20-21). Most of the com- 
pounds scored as mutagenic by this rapid and convenient assay in bacteria also 
cause mutations or chromosome aberrations when tested on mammalian cells. 
When mutagenicity data from various sources are analyzed, one finds that many 
carcinogens are mutagens. 

A few of these carcinogens act directly on DNA, but generally the more 
potent ones are relatively inert chemically; they become damaging only after 
they have been changed to a more reactive form by metabolic processes in the 
liver, catalyzed by a set of intracellular enzymes known as the cytochrome P-450 
oxidases. These enzymes normally help to convert ingested toxins into harmless 
and easily excreted compounds. Unfortunately, their activity on certain chemi- 
cals generates products that are highly mutagenic. Examples of carcinogens 
activated in this way include the fungal toxin aflatoxin Bl and benzo[a]pyrene, a 
cancer- causing chemical present in coal tar and tobacco smoke (Figure 20-22). 
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Figure 20-21 The Ames test for 
mutagenicity. The test uses a strain of 
Salmonella bacteria that require histidine 
in the medium because of a defect in a 
gene necessary for histidine synthesis. 
Mutagens can cause a further change in 
this gene that reverses the defect, 
creating revertant bacteria that do not 
require histidine. To increase the 
sensitivity of the test the bacteria also 
have a defect in their DNA repair 
machinery that makes them especially 
susceptible to agents that damage DNA. 
A majority of compounds that are 
mutagenic in tests such as this are also 
carcinogenic. 
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Figure 20-22 Some known carcinogens. (A) Carcinogen activation. 
A metabolic transformation must activate many chemical carcinogens 
before they will cause mutations by reacting with DNA. The compound 
illustrated here is aflatoxin Bl, a toxin from a mold (Aspergillus flavus oryzae) 
that grows on grain and peanuts when they are stored under humid 
tropical conditions. It is thought to be a contributory cause of liver cancer 
in the tropics and is associated with characteristic mutations of the p53 
tumor suppressor gene. (B) Different carcinogens cause different types of 
cancer. (B, data from Cancer and the Environment: Gene Environment 
Interactions, National Academies Press, 2002. With permission from 
National Academies Press.) 

Tumor Initiators Damage DNA; Tumor Promoters Do Not 

Not all substances that favor the development of cancer are mutagens. Some of 
the clearest evidence comes from studies done long ago on the effects of cancer- 
causing chemicals on mouse skin, where it is easy to observe the stages of tumor 
progression. Skin cancers can be elicited in mice by repeatedly painting the skin 
with a DNA-damaging carcinogen such as benzo[a]pyrene or the related com- 
pound dimethylbenz [a] anthracene (DMBA). A single application of these muta- 
gens, however, does not by itself give rise to a tumor or any other obvious lasting 
abnormality. Yet, it does cause latent genetic damage— mutations that set the 
stage for a greatly increased incidence of cancer when the cells are exposed 
either to further treatments with the carcinogen or to certain other, quite differ- 
ent, insults. A carcinogen that sows the seeds of cancer in this way acts as a 
tumor initiator. 

Simply wounding skin that has been exposed once to such an initiator can, 
by stimulating cell proliferation, cause cancers to develop from some of the cells 
at the edge of the wound. Inflammatory white blood cells are attracted to a 
wound, and it is thought that inflammatory cells play a part in the cancer-pro- 
moting effect, as we mentioned earlier. Alternatively, repeated exposure over a 
period of months to substances known as tumor promoters, chemicals that are 
not themselves mutagenic, can cause cancer selectively in skin previously 
exposed to a tumor initiator. The most widely studied tumor promoters are 
phorbol esters, such as tetradecanoylphorbol acetate (TPA), which behave as 
artificial activators of protein kinase C (PKC) and hence activate part of the phos- 
phatidylinositol intracellular signaling pathway (discussed in Chapter 15). These 
substances cause cancers at high frequency only if they are applied after a treat- 
ment with a mutagenic initiator (Figure 20-23). 

The immediate effect of a tumor promoter is apparently to induce an inflam- 
matory response, causing the secretion of both growth factors and proteases in 
the local environment. These act both directly and indirectly on cells to stimulate 
cell division (or to cause cells that would normally differentiate and stop dividing 
to continue dividing instead) . In a region of skin that has previously been exposed 
to an initiator, this proliferation results in the development of many small, 
benign, wartlike tumors called papillomas. The greater the prior dose of initiator, 
the larger the number of papillomas induced; it is thought that each papilloma 
consists of a single clone of cells descended from a mutant cell that the initiator 
has produced. As one might expect for genetic damage, the hidden changes 
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Figure 20-23 Some possible schedules 
of exposure to a tumor initiator 
(mutagenic) and a tumor promoter 
(nonmutagenic) and their outcomes. 

Cancer ensues only if the exposure to the 
promoter follows exposure to the 
initiator and only if the intensity of 
exposure to the promoter exceeds a 
certain threshold. Cancer can also occur 
as a result of repeated exposure to the 
initiator alone. 
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caused by a tumor initiator are irreversible; for this reason, they can be uncov- 
ered by treatment with a tumor promoter even after a long delay. 

It is still not certain how tumor promoters work, and different promoters are 
likely to work in different ways. One possibility is that they simply provoke the 
expression of proliferation-inducing genes that had been mutated before the 
promoter was applied but were not expressed: a mutation that makes a gene 
product hyperactive will not show its effects until the gene is expressed. Alter- 
natively, the tumor promoter may temporarily alter the way the cell reacts to the 
product of the mutated gene, either by releasing the cell from a counteracting 
inhibitory influence or by triggering production of a co-factor necessary for pro- 
liferation of the mutated gene product. Whichever the mechanism, the result is 
that the mutant cell is enabled to grow and divide and produce a large cluster of 
cells (Figure 20-24). 

A typical papilloma might contain about 10 5 cells. If exposure to the tumor 
promoter is stopped, almost all the papillomas regress, and the skin regains a 
largely normal appearance. In a few of the papillomas, however, further changes 
occur that enable cell growth and division to continue in an uncontrolled way, 
even after the promoter has been withdrawn. These changes seem to originate 
in an occasional single papilloma cell, at about the frequency expected for spon- 
taneous mutations. In this way, a small proportion of the papillomas progress to 
become cancers. Thus, the tumor promoter apparently favors the development 
of cancer by expanding the population of cells that carry an initial mutation: the 
more such cells there are and the more times they divide, the greater is the 
chance that at least one of them will sustain another mutation, or an epigenetic 
change, that carries it one step further toward malignancy. 

Although naturally occurring cancers do not necessarily arise through the 
specific sequence of distinct initiation and promotion steps just described, their 
evolution must be governed by similar principles. They too will evolve at a rate 
that depends not only on the frequency of genetic or epigenetic changes, but 
also on the local influences affecting the survival, growth, proliferation, and 
spread of these altered cells. 



Viruses and Other Infections Contribute to a Significant 
Proportion of Human Cancers 

A small but significant proportion of human cancers, perhaps 15% in the world 
as a whole, are thought to arise by mechanisms that involve viruses, bacteria, or 
parasites. The main culprits, as shown in Table 20-1, are the DNA viruses. Evi- 
dence for their involvement comes partly from the detection of viruses in cancer 
patients and partly from epidemiology. Liver cancer, for example, is common in 
parts of the world (Africa and Southeast Asia) where hepatitis-B viral infections 
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Figure 20-24 The effect of a tumor 
promoter. The tumor promoter creates a 
local environment that expands the 
population of mutant cells, thereby 
increasing the probability of tumor 
progression by further genetic change. 
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Table 20-1 Viruses Associated with Human Cancers 



v 



mus 

DNA viruses 
Papovavirus family 
Papillomavirus 
(many distinct 
strains) 
Hepadnavirus family 
Hepatitis-B virus 

Hepatitis-C virus 

Herpesvirus family 
Epstein-Barr virus 



RNA viruses 
Retrovirus family 

Human T-cell 
leukemia virus 
typel(HTLV-l) 

Human immuno- 
deficiency virus 
(HIV, the AIDS virus) 



ASSOCIATED CANCER 



warts (benign) 
carcinoma of the uterine 
cervix 

liver cancer (hepatocellular 

carcinoma) 
liver cancer (hepatocellular 

carcinoma) 

Burkitt's lymphoma (cancer 

of B lymphocytes) 
nasopharyngeal carcinoma 



adult T-cell leukemia/ 
lymphoma 

Kaposi's sarcoma 



XREAS OF HIGH INCIDENCE 



worldwide 
worldwide 



Southeast Asia, tropical 
Africa 

worldwide 



West Africa, Papua New 

Guinea 
Southern China, Greenland 



Japan, West Indies 



Central and Southern Africa 



For all these viruses, the number of people infected is much larger than the numbers who develop 
cancer: the viruses must act in conjunction with other factors. Moreover, some of the viruses 
contribute to cancer only indirectly; HIV, for example, destroys helper T lymphocyte?,, which allows 
a herpes virus to transform endothelial cells. Similarly, hepatitis-C virus causes chronic hepatitis, 
which promotes the development of liver cancer. 



are common, and in those regions the cancer occurs almost exclusively in peo- 
ple who show signs of chronic hepatitis-B infection. Chronic infection with hep- 
atitis-C virus, which has infected 170 million people worldwide, is also clearly 
associated with the development of liver cancer. 

The precise role of a cancer-associated virus is often hard to decipher 
because there is a delay of many years from the initial viral infection to the 
development of the cancer. Moreover, the virus is responsible for only one of a 
series of steps in the progression to cancer, and other environmental factors and 
genetic accidents ;irc also involved. As we explain later, DNA viruses frequently 
carry genes thai can subvert the control of cell division in the host cell, causing 
uncontrolled cell proliferation. DNA viruses thar operate in this manner include 
the human papillomaviruses. Some of these cause harmless wans, but others 
infect the uterine cervix and are implicated in the development of carcinomas of 
the cervix. 

In some cancers, viruses seem to have additional, indirect tumor-promoting 
actions. The hepatitis-B and C viruses may, for example, favor the development 
of liver cancer by causing chronic inflammation (hepatitis), which stimulates 
cell division in the liver, and enhances the viruses' more direct effects on cell 
proliferation, in AIDS, the human immunodeficiency virus [HIV) promotes 
development of an otherwise rare cancer called Kaposi's sarcoma by destroying 
the immune system, thereby permitting a secondary infection with a human 
herpes virus (HHV-8) that has a direct carcinogenic action, Chronic infection 
with parasites and bacteria can also promote the development of some cancers. 
For example, chronic infection of the stomach with the bacterium Helicobacter 
pylori, which causes ulcers, appears to be a major cause of stomach cancer. And 
bladder cancer in some parts of the world is associated with chronic infection by 
the blood fluke, Schistosoma haematobium, a parasitic flatworm. 
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Identification of Carcinogens Reveals Ways to Avoid Cancer 

Tobacco smoke is by far the most important environmental cause of cancer in 
the world today. It contains both carcinogens and tumor promoters. Other com- 
parably important chemical causes of cancer in humans remain to be identified. 
It is sometimes thought that the main environmental causes of cancer are the 
products of a highly industrialized way of life — the rise in pollution, the 
enhanced use of food additives, and so on — but there is little evidence to sup- 
port this view. The idea may have come in part from the identification of some 
highly carcinogenic materials used in industry such as 2-naphthylamine and 
asbestos. Except for the increase in cancers caused by smoking, however, and a 
remarkable decrease in stomach cancer that may reflect decreasing rates of 
infection with Helicobacter pylori, the incidence of the most common cancers 
for individuals of a given age has not changed very much in the past hundred 
years (Figure 20-25). 

Most of the carcinogenic factors that are known to be significant are by no 
means specific to the modern world. The most potent known carcinogen, by cer- 
tain assays at least, is aflatoxin Bl (see Figure 20-22). It is produced by fungi that 
naturally contaminate foods such as tropical peanuts and is an important cause 
of liver cancer in Africa and Asia. Some factors that accelerate cancer are intrin- 
sic to our bodies. Thus, in women, the risk of cancer is powerfully influenced by 
the reproductive hormones produced at different stages of life, and there is a 
striking correlation between reproductive history and the occurrence of breast 
cancer (Figure 20-26). The reproductive hormones presumably affect breast 
cancer incidence through their influence on cell proliferation in the breast. 
Clearly, when attempting to identify the environmental causes of cancer, we 
need to keep an open mind. 

Epidemiology — the analysis of disease frequency in populations — remains 
the principal tool for finding environmental causes of human cancer. The 
approach has enjoyed some notable successes, and it promises more to come. 
By revealing the role of smoking, for example, epidemiology has shown a way to 
reduce the total cancer death rate in North America and Europe by as much as 
25%. The approach works best when applied to a fairly homogeneous popula- 
tion in which it is easy to distinguish between individuals who were exposed to 
the agent and those who were not, and when the agent under investigation is 
responsible for most of the cases of a certain kind of cancer. In the early part of 
the twentieth century for example, in one British factory all of the men who had 
been employed in distilling 2-napthylamine (and were thereby subjected to pro- 
longed exposure) eventually developed bladder cancer (see Figure 20-8); the 
connection was relatively easy to establish because both the chemical and the 
form of cancer were uncommon in the general population. 

In contrast, it is very hard to identify, by epidemiology alone, everyday envi- 
ronmental factors that favor the development of common cancers: most of these 
factors are probably agents to which we are all exposed to some extent, and 



Figure 20-25 Age-adjusted cancer 
death rates, United States, 1930-2001. 

Selected death rates adjusted to the age 
distribution of the US population in 1970, 
are plotted for (A) females and (B) males. 
Note the dramatic rise in lung cancer for 
both sexes, following the pattern of 
tobacco smoking, and the fall in deaths 
from stomach cancer, possibly related to 
changes in diet or in patterns of infection 
with Helicobacter. Recent reductions in 
other cancer death rates may correspond 
to improvements in detection and 
treatment. Age-adjusted data like these 
are needed to compensate for the 
inevitable increase in cancer rates as 
people live longer, on average. (Adapted 
from Cancer Facts and Figures, 2005. 
With permission from American Cancer 
Society.) 
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Figure 20-26 Effects of childbearing on 
the risk of breast cancer. The relative 
probability of breast cancer developing at 
some time in a woman's life is plotted as a 
function of the age at which she gives 
birth to her first child. The graph shows the 
probability relative to that for a childless 
woman. The longer the period of exposure 
to reproductive hormones up to birth of 
the first child, the greater the risk. It is 
thought that the first full-term pregnancy 
may result in a permanent change in the 
state of differentiation of the cells of the 
breast, altering their subsequent 
responses to hormones. Several other lines 
of epidemiological evidence also support 
the view that exposure to certain 
combinations of reproductive hormones, 
especially estrogen, can promote 
development of breast cancer. (From 
J. Cairns, Cancer: Science and Society. San 
Francisco: W.H. Freeman, 1978. After 
B. MacMahon, P. Cole and J. Brown, J. Natl 
Cancer Inst. 50:21-42, 1973. With 
permission from Oxford University Press.) 
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many of them probably contribute together to a given cancer's incidence. If, say, 
eating oranges either doubled or halved the risk of colorectal cancer, it is 
unlikely that we would find it out, unless we had some prior reason to suspect a 
connection. The same goes for the countless other substances that we eat, drink, 
breathe, and put on our bodies. The epidemiological evidence is, however, fairly 
clear on one dietary point: as mentioned earlier, too much food, leading to obe- 
sity, significantly increases the risk of cancer. 

Even when evidence is obtained that a substance may be carcinogenic, 
either from epidemiology or from laboratory tests, it may be difficult to decide 
what level of human exposure is acceptable. Estimating how many cases of 
human cancer a certain amount of a substance is likely to cause is difficult; bal- 
ancing this risk against the utility of the substance is more difficult still. Certain 
agricultural fungicides, for example, appear to be mildly carcinogenic at high 
doses in animal tests. But it has been calculated that if they were not used in 
agriculture, the contamination of food by fungal metabolites such as aflatoxin 
Bl would cause far more cases of cancer than the fungicide residues in food ever 
would. 

Nevertheless, efforts to identify potential carcinogens still have a central 
place in our struggle to prevent cancer. Prevention is not only better than a cure, 
but for many types of cancer, it may be more readily attainable. 



Summary 

Environmental agents accelerate the rate of tumor development and progression. 
These can act either as tumor initiators or tumor promoters. Tumor promoters induce 
an inflammatory response, and they create a local environment that alters gene 
expression, stimulates cell proliferation, and increases the population of mutant cells 
created by the initiator The majority of the known environmental agents that increase 
cancer development are mutagens, including chemical carcinogens and various forms 
of radiation, including UV light and ionizing radiation. Because many environmental 
factors contribute to the development of a given cancer and some of these are under 
our control, a large proportion of cancers are in principle preventable. 

The finding that immigrant populations adopt the cancer patterns of their new 
host country strongly suggests that the majority of cancers could be avoided by 
changing the food we eat and other environmental exposures. Epidemiology can be 
a powerful tool for identifying these environmental effects on human cancer. The 
epidemiological approach does not require knowing how the environmental agents 
work, and it can uncover factors that are not chemical entities, such as lifestyle and 
certain patterns of child-bearing. Individuals can avoid many of the environmental 
risk factors that are identified in this way. These include smoking tobacco and 
falling prey to infection with cancer-causing viruses such as papillomaviruses or 
hepatitis B or C. However, because many factors that affect cancer risk are very diffi- 
cult to identify, we remain largely ignorant of the principal environmental factors 
that affect cancer incidence. 



FINDING THE CANCER-CRITICAL GENES 

As we have seen, cancer depends on the accumulation of inherited changes in 
somatic cells. To understand it at a molecular level we need to identify the muta- 
tions and epigenetic changes involved and to discover how they give rise to can- 
cerous cell behavior. Finding the relevant cells is often easy; they are favored by 
natural selection and call attention to themselves by giving rise to tumors. But 
how do we identify the small number of genes with the cancer-promoting 
changes among all the other genes in the cancerous cells? A similar needle-in- 
haystack problem arises in any search for a gene underlying a given mutant phe- 
notype, but for cancer the task is particularly complex. A typical cancer depends 
on a whole set of mutations and epigenetic changes — usually a somewhat dif- 
ferent set in each individual patient — and introduction of any single one of these 
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into a normal cell is not enough to make the cell cancerous. In addition, the 
cooperation between different altered genes makes it harder to test the signifi- 
cance of an inherited change in any individual gene. To make matters worse, a 
given cancer cell will also contain a large number of somatic mutations that are 
accidental by-products of its genetic instability, and it can be difficult to distin- 
guish these meaningless changes from those changes that have a causative role 
in the disease. 

Despite these difficulties, many genes that are repeatedly altered in human 
cancers have been identified — several hundred of them — although it is clear 
that many more remain to be discovered. We will call such genes, for want of a 
better term, cancer-critical genes, meaning all genes whose alteration fre- 
quently contributes to the causation of cancer. Our knowledge of these genes 
has accumulated piecemeal through many different and sometimes circuitous 
approaches, ranging from early studies of cancer-causing infections in chickens 
to investigations of embryonic development. Analyses of rare but highly reveal- 
ing inherited forms of cancer have also added to our understanding. More 
recently, sequencing of DNA from multiple cases of specific types of cancers has 
begun to give a more systematic picture of the genetic changes that are a regu- 
lar feature of those diseases. 

In this section, we discuss both the methods used for identifying cancer- 
critical genes and the varied kinds of inherited changes that occur in them dur- 
ing the development of cancer. 

The Identification of Gain-of-Function and Loss-of-Function 
Mutations Requires Different Methods 

Cancer-critical genes are grouped into two broad classes, according to 
whether the cancer risk arises from too much activity of the gene product, or 
too little. Genes of the first class, in which a gain- of- function mutation can 
drive a cell toward cancer, are called proto-oncogenes; their mutant, overac- 
tive or overexpressed forms are called oncogenes. Genes of the second class, 
in which a loss-of-function mutation can contribute to cancer, are called 
tumor suppressor genes. A third class, whose effects are more indirect, are 
those genes whose mutation results in genomic instability, a class we describe 
as DNA maintenance genes. 

As we shall see, mutations in both oncogenes and tumor suppressor genes 
can have similar effects in enhancing cell proliferation and cell survival, as well 
as in promoting tumor development. Thus, from the point of view of a cancer 
cell, oncogenes and tumor suppressor genes— and the mutations that affect 
them— are flip sides of the same coin. The techniques needed to find these 
genes, however, differ — depending on whether the genes are overactive or 
underactive in cancer. 

Mutation of a single copy of a proto-oncogene that converts it to an onco- 
gene has a dominant, growth-promoting effect on a cell (Figure 20-27A). 
Thus, we can identify the oncogene by its effect when it is added— by DNA 
transfection, for example, or through infection with a viral vector— to the 
genome of a suitable type of tester cell. In the case of the tumor suppressor 
gene, on the other hand, the cancer-causing alleles produced by the change 
are generally recessive: often (but not always) both copies of the normal gene 
must be removed or inactivated in the diploid somatic cell before an effect is 
seen (Figure 20-27B). This calls for a different approach, aimed at detecting 
what is missing in the cancer cell. 

In some cases, a specific gross chromosomal abnormality, visible under the 
microscope, is repeatedly associated with a particular type of cancer. This can 
give a clue to the location of an oncogene that is activated as a result of the chro- 
mosomal rearrangement (as in the example of the chromosomal translocation 
responsible for chronic myelogenous leukemia, discussed earlier). Alternatively, 
a visible deletion of a chromosomal segment may reveal the site of a deleted 
tumor suppressor gene. 
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Retroviruses Can Act as Vectors for Oncogenes That Alter Cell 
Behavior 

Tumor viruses have played a remarkable part in the search for the genetic 
causes of human cancer. Although viruses have no role in the majority of com- 
mon human cancers, they are more prominent as causes of cancer in some ani- 
mal species. Analysis of these animal tumor viruses provided a critical key to 
understanding the mechanisms of cancer in general and to the discovery of 
oncogenes in particular. 

One of the first animal viruses to be implicated in cancer was discovered 
nearly 100 years ago in chickens, which are subject to infections that cause con- 
nective-tissue tumors, or sarcomas. The infectious agent was characterized as a 
virus — the Rous sarcoma virus, which we now know to be an RNA virus. Like all 
the other RNA tumor viruses discovered since, it is a retrovirus. When it infects 
a cell, its RNA is copied into DNA by reverse transcription, and the DNA is 
inserted into the host genome, where it can persist and be inherited by subse- 
quent generations of cells. The Rous sarcoma virus carries an oncogene that 
causes cancer in chickens. This oncogene is not necessary for the virus's own 
survival or reproduction, as demonstrated by the discovery of mutant forms of 
the virus that multiply normally but no longer make their host cells cancerous— 
a process called cell transformation. Some of these mutants were found to have 
deleted all or part of a gene that codes for a protein called Src (pronounced 
"Sarc"). Other mutations in this gene made the transforming effect of the virus 
temperature-sensitive: infected cells show a transformed phenotype at 34°C, but 
return to the normal phenotype within a few hours when the temperature is 
raised to 39°C. 

What was the origin of this gene? In 1975, when a radioactive DNA copy of 
the viral Src gene sequences was used as a probe to search for closely related 
sequences by DNA-DNA hybridization (see Figure 8-36), the genomes of verte- 
brate cells were found to contain sequences that are closely similar, but not 
identical, to this viral gene. The viral oncogene had evidently been picked up 
accidentally by the retrovirus from the genome of a previous infected host cell, 
but the host cell proto- oncogene [c-Src] had undergone mutation in the process 
to become an oncogene [v-Src] (Figure 20^28). This finding that cancer could 
originate from mutation of a specific gene that was present in the genome ola 
normal animal profoundly altered the field of cancer research. 

A large number of other oncogenes have since been identified in other retro- 
viruses and analyzed in similar ways. Each has led to the discovery of a corre- 
sponding proto-oncogene that is present in normal animal cells. 



Figure 20-27 Cancer-critical mutations 
fall into two readily distinguishable 
categories, dominant and recessive. In 

this diagram, activating mutations are 
represented by solid red boxes, 
inactivating mutations by hollow red 
boxes. (A) Oncogenes act in a dominant 
manner: a gain-of-function mutation in a 
single copy of the cancer-critical gene 
can drive a cell toward cancer. 
(B) Mutations in tumor suppressor genes, 
on the other hand, generally act in a 
recessive manner: the function of both 
alleles of the cancer-critical gene must be 
lost to drive a cell toward cancer. 
Although in this diagram the second 
allele of the tumor suppressor gene is 
inactivated by mutation, it is often 
inactivated instead by loss of the second 
chromosome. Not shown is the fact that 
mutation of some tumor suppressor 
genes can have an effect even when only 
one of the two gene copies is damaged. 
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Different Searches for Oncogenes Have Converged on the Same 
Gene — Ras 

While some researchers searched for oncogenes in retroviruses, others took a 
more direct approach and searched for DNA sequences in human cancer cells 
that could provoke uncontrolled proliferation when introduced into noncancer- 
ous cell lines. As tester cells for the assay, they usually used cell lines derived 
from mouse fibroblasts. These cells, previously selected to proliferate indefi- 
nitely in culture, are thought to already contain genetic alterations that take 
them part of the way toward malignancy. For this reason, the addition of a sin- 
gle oncogene can be enough to produce a dramatic effect. 

To detect an oncogene in this way DNA is extracted from the tumor cells, 
broken into fragments, and introduced into the cultured cells. If any of the frag- 
ments contains an oncogene, small colonies of abnormally proliferating cells 
begin to appear; because the cells forming such colonies display cancerous 
properties, they are said to be transformed or to have undergone transforma- 
tion. <CCGA> Each colony is a clone of cells that originated from a single cell, 
carrying an added gene that released the cell from some of the social controls on 
cell proliferation; the transformed cells forming the colony outgrow the untrans- 
formed cells in the culture and pile up layer upon layer as they proliferate (Fig- 
ure 20-29). 



Figure 20-28 The structure of the Rous 
sarcoma virus. (A) The organization of 
the viral genome as compared with that 
of a more typical retrovirus (murine 
leukemia virus). Rous sarcoma virus is 
unusual among the retroviruses that 
carry oncogenes in that it has retained 
the three viral genes required for the 
ordinary viral life cycle: Gog (which 
produces a polyprotein that is cleaved to 
generate the capsid proteins), Pol (which 
produces reverse transcriptase and an 
enzyme involved in integrating the viral 
chromosome into the host genome), and 
Env (which produces the envelope 
glycoprotein). In other oncogenic 
retroviruses, one or more of these viral 
genes are wholly or partly lost when the 
retrovirus acquires the transforming 
oncogene. Therefore the transforming 
virus can generate infectious particles 
only in a cell that is simultaneously 
infected with a nondefective, 
nontransforming helper virus, which 
supplies the missing functions. (B) The 
relationship between the v-Src oncogene 
and the cellular Src proto-oncogene from 
which it has been derived. The introns 
present in cellular Src have been spliced 
out of v-Src; in addition, v-Src contains 
mutations that alter the amino acid 
sequence of the protein (not shown), 
making it hyperactive and unregulated as 
a tyrosine-specific protein kinase (see 
Figure 3-69). The v-Src protein is also 
greatly overproduced because the virus 
makes large amounts of RNA. 
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Figure 20-29 Loss of contact inhibition 
in cell culture. Most normal cells stop 
proliferating once they have carpeted the 
dish with a single layer of cells: 
proliferation seems to depend on contact 
with the dish, and to be inhibited by 
contacts with other cells — a 
phenomenon known as "contact 
inhibition." Cancer cells, in contrast, 
usually disregard these restraints and 
continue to grow, so that they pile up on 
top of one another. 
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This assay led to the isolation and sequencing of the first human oncogene, 
a mutant version of the proto-oncogene Ras. The Ras gene is now known to be 
mutated in about one in five human cancers. The discovery of the human Ras 
oncogene was all the more dramatic because, shortly before, a mutated Ras gene 
had been found to be the tumor-causing gene in a retrovirus that causes sarco- 
mas in rodents. This discovery in the early 1980s of the same oncogene in 
human tumor cells and in an animal tumor virus was electrifying. The implica- 
tion that cancers are caused by mutations in a limited number of cancer- critical 
genes changed our understanding of the molecular biology of cancer. 

As discussed in Chapter 15, normal Ras proteins are monomeric GTPases 
that help transmit signals from cell-surface receptors to the cell interior. 
<GAAC> The Ras oncogenes isolated from human tumors contain point muta- 
tions that create a hyperactive Ras protein that cannot shut itself off by hydrolyz- 
ing its bound GTP to GDP. Because this makes the protein hyperactive, its effect 
is dominant — that is, only one of the cell's two gene copies needs to change to 
have an effect. Because the Ras genes are mutated in a wide range of human 
cancers, they remain one of the most important examples of cancer- critical 
genes. 

Through methods such as those we have just described, and various other 
routes, we now know of hundreds of proto-oncogenes — that is, genes that can 
be converted by activating mutations into oncogenes that help to cause cancer. 



Studies of Rare Hereditary Cancer Syndromes First Identified 
Tumor Suppressor Genes 

Identifying a gene that has been inactivated in the genome of a cancer cell 
requires a different strategy from finding a gene that has become hyperactive: 
one cannot, for example, use a cell transformation assay to identify something 
that simply is not there. The key insight that led to the discovery of the first 
tumor suppressor gene came from studies of a rare type of human cancer, 
retinoblastoma, which arises from cells in the retina of the eye that are con- 
verted to a cancerous state by an unusually small number of mutations. As often 
happens in biology, the discovery arose from examination of a special case, but 
it turned out to reveal a gene of widespread relevance. In fact, genes identified in 
rare familial cancer syndromes are commonly found to be relevant in more 
common sporadic cancers, where they often serve as tumor suppressors. 

Retinoblastoma occurs in childhood, and tumors develop from neural pre- 
cursor cells in the immature retina. About one child in 20,000 is afflicted. One 
form of the disease is hereditary, and the other is not. In the hereditary form, 
multiple tumors usually arise independently, affecting both eyes; in the non- 
hereditary form, only one eye is affected, and by only one tumor. A few individ- 
uals with retinoblastoma have a visibly abnormal karyotype, with a deletion of a 
specific band on chromosome 13 that, if inherited, predisposes an individual to 
the disease. Deletions of this same region are also encountered in tumor cells 
from some patients with the nonhereditary disease, which suggested that the 
cancer was caused by loss of a critical gene in that location. 

Using the location of this chromosomal deletion, it was possible to clone 
and sequence the Rb gene. It was then discovered that those who suffer from 
the hereditary form of the disease have a deletion or loss-of- function mutation 
present in one copy of the Rb gene in every somatic cell. These cells are pre- 
disposed to becoming cancerous, but do not do so if they retain one good copy 
of the gene. The retinal cells that are cancerous are defective in both copies of 
Rb because of a somatic event that has eliminated the function of the previ- 
ously good copy. 

In patients with the nonhereditary form of the disease, by contrast, the non- 
cancerous cells show no defect in either copy of Rb, while the cancerous cells have 
become defective in both copies. These nonhereditary retinoblastomas are very 
rare because they require two independent events that inactivate the same gene 
on two chromosomes in a single retinal cell lineage (Figure 20-30). The Rb gene is 
also missing in several common types of sporadic cancer, including carcinomas of 
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lung, breast, and bladder. These more common cancers arise by a more complex 
series of genetic changes than does retinoblastoma, and they make their appear- 
ance much later in life. But in all of them, it seems, loss of Rb function is fre- 
quently a major step in the progression toward malignancy 

The Rb gene encodes the Rb protein, which is a universal regulator of the 
cell cycle present in almost all cells of the body (see Figure 17-62). It acts as one 
of the main brakes on progress through the cell-division cycle, and its loss can 
allow cells to enter the cell cycle inappropriately, as we discuss later. 

Tumor Suppressor Genes Can Also Be Identified from Studies of 
Tumors 

The Rb story illustrates how rare hereditary cancer syndromes can be used to 
uncover tumor suppressor genes that are relevant to common sporadic cancers. 
However, only a few of the tumor suppressor genes now known to be important 
have been discovered in this way. A more direct approach to the identification of 
such genes involves comparing tumor cells with non-cancerous cells from the 
same patient so as to discover what exactly, out of the 3 billion nucleotides of the 
human genome, is missing, functionally defective, or abnormally silenced. 
Because of the genetic instability of cancer cells, there is usually a great deal 
changed. Most of the alterations will be random, accidental by-products of the 
genetic instability. Therefore, a relevant tumor suppressor gene can only be 
identified by the criterion that it is repeatedly missing, defective, or otherwise 
silenced in many independent cases of a particular type ot cancer. Tracking 
down tumor suppressor genes in this way is a hard task, but feasible with mod- 
ern techniques for large-scale DNA analysis (as we explain below]. Dozens of 
tumor suppressor genes have already been well characterized and many more 
are known. 



Figure 20-30 The genetic mechanisms 
that cause retinoblastoma. In the 

hereditary form, all cells in the body lack 
one of the normal two functional copies 
of the Rb tumor suppressor gene, and 
tumors occur where the remaining copy 
is lost or inactivated by a somatic event 
(either mutation or epigenetic silencing). 
In the nonhereditary form, all cells 
initially contain two functional copies of 
the gene, and the tumor arises because 
both copies are lost or inactivated 
through the coincidence of two somatic 
events in one line of cells. 



Both Genetic and Epigenetic Mechanisms Can Inactivate Tumor 
Suppressor Genes 

It is the inactivation of tumor suppressor genes that is dangerous. This inactiva- 
tion can occur in many ways, with different combinations of mishaps converg- 
ing to eliminate or cripple both gene copies. The first copy may, for example, be 
lost by a small chromosomal deletion or inactivated by a point mutation. The 
second copy is commonly eliminated by a less specific and more probable 
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Figure 20-31 Six ways of losing the 
remaining good copy of a tumor 
suppressor gene through changes in 
DNA sequences. A cell that is defective in 
only one of its two copies of a tumor 
suppressor gene — for example, the 
Rb gene— usually behaves as a normal, 
healthy cell; the diagrams below show 
how this cell may lose the function of the 
other gene copy as well and thereby 
progress toward cancer. A seventh 
possibility, frequently encountered with 
some tumor suppressors, is that the gene 
may be silenced by an epigenetic 
change, without alteration of the DNA 
sequence, as illustrated in Figure 20-32. 
(After W.K. Cavenee et al., Nature 
305:779-784, 1983. With permission from 
Macmillan Publishers Ltd.) 



mechanism: the chromosome carrying the remaining normal copy may be lost 
from the cell through errors in chromosome segregation, or the normal gene 
may be replaced by a mutant version through either mitotic recombination or a 
gene conversion event. 

Figure 20-31 summarizes the range of possible ways to lose the remaining 
good copy of a tumor suppressor gene through DNA sequence changes, using 
the Rb gene as an example. It is important to note that, except for the point 
mutation mechanism illustrated at the far right, these pathways all produce cells 
that carry only a single type of DNA sequence in the chromosomal region con- 
taining their Rb genes— a sequence that is identical to the sequence in the orig- 
inal mutant chromosome. 

As discussed in Chapter 4, normal human genetic variation makes each of 
our maternal and paternal chromosome sets distinguishably different. On aver- 
age, human DNA sequences differ— that is, we are heterozygous— at roughly 
one in every thousand nucleotides. Where a large segment of one chromosome 
has been either lost or converted to the DNA sequence in its homologous chro- 
mosome, as in Figure 20-31, there is a loss of heterozygosity (LOH): only one ver- 
sion of each variable DNA sequence in that neighborhood remains. Millions of 
common sites of heterozygosity in the human genome have been mapped as 
part of the Human Genome Project: each of these sites is characterized by a spe- 
cific DNA sequence that is known to be polymorphic— that is, to occur com- 
monly in two or more slightly different versions in the human population. Given 
a sample of tumor DNA, one can check which of the versions of these polymor- 
phic sequences are present. The same can be done with a sample of DNA from 
noncancerous tissue from the same patient, for comparison. A loss of heterozy- 
gosity throughout a region of the genome containing one or more polymorphic 
sites, or loss of a genetic marker sequence that is seen in the non-cancerous con- 
trol DNA, can point the way toward a chromosomal region that contains a rele- 
vant tumor suppressor gene. However, because of their genetic instability, can- 
cer cells often exhibit a loss of heterogeneity for many different chromosomal 
regions. Therefore the detection of tumor suppressor genes by this approach 
generally requires the subtraction of a large amount of random noise. 

Epigenetic changes provide another important way to permanently inacti- 
vate a tumor suppressor gene. Most commonly, the gene may become packaged 
into heterochromatin and the C nucleotides in CpG sequences in its promoter 
may become methylated in a heritable manner (see Figure 20-12). These mech- 
anisms can irreversibly silence the gene in a cell and in all of its progeny. Given 
a catalog of possible tumor suppressor genes, it is relatively easy to test their pro- 
moters for abnormal amounts of DNA methylation. Studies of this type suggest 
that epigenetic gene silencing is a frequent event in tumor progression, and epi- 
genetic mechanisms are now thought to help inactivate several different tumor 
suppressor genes in most human cancers (Figure 20-32). 
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Genes Mutated in Cancer Can Be Made Overactive in Many Ways 

In the case of a proto-oncogenes it is gene activation that leads toward cancer. 
Figure 20-33 summarizes the types of accidents that can convert a proto-onco- 
gene into an oncogene. (1) A small change in DNA sequence such as a point 
mutation or deletion may produce a hyperactive protein when it occurs within 
a protein-coding sequence, or lead to protein overproduction when it occurs 
within a regulatory region for that gene. (2) Gene amplification events, such as 
those that can be caused by errors in DNA replication, may produce extra gene 
copies; this can lead to overproduction of the protein (Figure 20-34). (3) A chro- 
mosomal rearrangement, involving the breakage and rejoining of the DNA helix, 
may either change the protein -coding region, resulting in a hyperactive fusion 
protein, or alter the control regions for a gene so that a normal protein is over- 
produced. 

Specific types of abnormality are characteristic of particular genes and of 
the responses to particular carcinogens. For example, 90% of the skin tumors 



Figure 20-32 The pathways leading to 
loss of tumor suppressor gene function 
in cancer involve both genetic and 
epigenetic changes. As discussed in 
Chapter 4, the packaging of a gene into 
condensed chromatin can prevent its 
expression in a way that is inherited 
when a cell divides (see Figure 4-52). As 
indicated, the changes that silence tumor 
suppressor genes can occur in any order. 
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Figure 20-33 The types of accidents that can make a proto-oncogene overactive and convert it into an oncogene. 



1238 Chapter 20: Cancer 





chromosome > 
(Q 



homogeneous - 
staining region 



aberrant replication 



si.*' 



double minute 
chromosomes 



a 



- 



chromosome B 



Figure 20-34 Chromosomal changes in cancer cells resulting from gene amplification. In these examples, the numbers 
of copies of a Myc proto-oncogene have been amplified. Amplification of oncogenes is common in carcinomas and is often 
visible as a curious change in the karyotype: the cell is seen to contain additional pairs of miniature chromosomes— so- 
called double minute chromosomes— or to have a homogeneously staining region interpolated in the normal banding 
pattern of one of its regular chromosomes. Both these aberrations consist of massively amplified numbers of copies of a 
small segment of the genome. The chromosomes are stained with a red fluorescent dye, while the multiple copies of the 
Myc gene are detected by in situ hybridization with a yellow fluorescent probe. (A) Karyotype of a cell in which the Myc 
gene copies are present as double minute chromosomes (paired yellow specks). (B) Karyotype of a cell in which the multiple 
Myc gene copies appear as a homogeneously staining region (yellow) interpolated in one of the regular chromosomes. 
(Ordinary single-copy Myc genes can be detected as tiny yellow dots elsewhere in the genome.) (C) Schematic of how gene 
amplification occurs. It is thought that a rare, abnormal DNA replication event produces a chromosome with extra copies of 
one chromosomal region, as shown. The repair of this structure releases DNA circles that can replicate to form long 
tandemly repeated sequences, producing double minute chromosomes. As the result of a second rare event, the DNA 
from one of these chromosomes can become integrated into a new site on a normal chromosome to produce a 
homogeneously staining region. Other pathways can also amplify genes, such as that described later in Figure 20-41 
(A and B, courtesy of Denise Sheer.) 



evoked in mice by painting the skin with the tumor initiator dimethyl- 
benz [a] anthracene (DMBA) have an A-to-T alteration at exactly the same site in 
a mutant Ras gene; presumably, of the many mutations caused by DMBA, only 
those at this site efficiently stimulate skin cells to form a tumor. 

The receptor for the extracellular signal protein epidermal growth factor 
(EOF), by contrast, can be activated by a deletion that removes part of its extra- 
eel hilar domain. These mutant receptors are able to form active dimers even in 
the absence of EGH and thus they produce an inappropriate stimulatory signal, 
analogous to a faulty doorbell that rings even when nobody is pressing the but- 
ton. Mutations of this type are found in the most common type of human brain 
tumor, called glioblastoma. 

The Myc protein, on the other hand, generally contributes to cancer by 
being overproduced in its normal form. The Myc protein acts in the nucleus to 
stimulate cell growth and division, as discussed in Chapter 17; dius, excessive 
quantities of Myc cause cells to proliferate in circumstances where a normal cell 
would halt. Overproduction of Myc can occur in various ways, in some cases, the 
gene is amplified— that is, errors of DNA replication lead to the creation of large 
numbers of gene copies in a single cell (see Figure 20-34). More commonly the 
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overproduction appears to be due to a change in a regulatory element that acts 
on the gene. For example, a chromosomal translocation can inappropriately 
bring powerful gene regulatory sequences next to the Myc protein-coding 
sequence, so as to produce unusually large amounts of Myc mRNA. Thus, in 
Burkitt's lymphoma a translocation brings the Myc gene under the control of 
sequences that normally drive the expression of antibody genes in B lympho- 
cytes. As a result, the mutant B cells tend to proliferate excessively and form a 
tumor. Similar specific chromosome translocations are common in other lym- 
phomas and leukemias. 



The Hunt for Cancer-critical Genes Continues 



The sequencing of the human genome has opened up new avenues for the sys- 
tematic discovery of cancer-critical genes. It is now possible, in principle, to 
examine every one of the approximately 25,000 human genes in a given cancer 
cell line, or in samples of tissue from a set of cases of cancer of a given type, look- 
ing for potentially significant abnormalities, using automated analysis of either 
their genomic DNA or the mRNAs the cells produce. By analyzing substantial 
numbers of different cancers, it should be possible eventually to identify all the 
genes that are commonly altered in human cancer. Although enormously costly, 
large-scale DNA sequencing efforts have already begun to identify new human 
oncogenes. One example is the finding that a hyperactive form of the Raf protein 
kinase (discussed in Chapter 15) is present in a high percentage of melanomas 
and, at lower frequency, in other cancers. 

Straightforward DNA sequencing is not the only way to tackle the problem, 
however, and there are other powerful new methods for identifying new onco- 
genes and tumor suppressor genes that seem more efficient. Three different 
types of approaches appear particularly promising. 
1. Comparative genomic hybridization (CGH) uses fluorescent labeling of 
fragments of DNA extracted from normal cells and from cancer cells to 
identify regions of the genome that are amplified or deleted in a given type 
of cancer (Figure 20-35). The labeled fragments are allowed to hybridize to 
DNA microarrays in which each spot corresponds to a known location in 
the normal genome. Different spots light up in different fluorescent colors, 
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Figure 20-35 Comparative genomic 
hybridization for detection of the DNA 
changes in tumor cells. (A) DNA 

fragments from tumor cells and normal 
cells are labeled with two different 
fluorescent: molecules [red for the tumor, 
green for the normal control) and 
hybridized to a DNA microarray in which 
each spot corresponds to a defined 
position in the normal genome. (B) The 
ratios of red to green fluorescence for 
each spot is plotted as shown below to 
define those regions amplified or deleted 
in the tumor cells. A red signal indicates 
an amplification, while a green signal 
signifies a deletion. 
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according to the ratio of normal to cancerous labeled fragments that bind to 
them. In this way, one can locate regions of the cancer cell genome that have 
been amplified or deleted. One then searches more narrowly for candidate 
genes within these regions that may contribute to cancer development. 

2. DNA microarrays can also be used to reveal specific changes in gene 
expression associated with cancer. In this case, a cell's population of 
mRNAs is used to prepare the probe for hybridization, rather than its chro- 
mosomal DNA (see Figure 8-76). 

S. Finally large genetic screens based on RNA interference (RNAi) technology 
(discussed in Chapter 8) provide a powerful new functional approach for 
identifying tumor suppressor genes. The expression of small interfering 
RNAs (siRNA] in a cell can inactivate both copies of a gene by either 
destroying the corresponding mRNA or inhibiting its translation (see 
p. 571). This provides an efficient way in principle, to identify any tumor 
suppressor gene whose loss promotes cancerous transformation in the 
particular animal or cell line being tested. 

Whatever the approach used to identify a new candidate cancer- critical 
gene, it remains a challenge to determine whether the gene really contributes 
to cancer causation. Testing for the effects of an overexpression of candidate 
oncogenes or an inhibition of candidate tumor suppressor genes in cell culture 
assays can help. But more convincing tests use transgenic mice that overex- 
press candidate oncogenes and knockout mice that lack candidate tumor sup- 
pressor genes. In both types of mice, cancer development should be acceler- 
ated if the genes are real culprits. 

To search for novel cancer-critical genes in the ways just described, there is 
in principle no need to know at die outset what the normal functions of the gene 
are or how altering them promotes cancer development. As our understanding 
of cell biology improves, however, it becomes easier to guess which genes are 
likely suspects and to test their culpability by directly determining if they are fre- 
quently altered In specific cancers. The task of finding cancer-critical genes is 
therefore closely entangled with the problem of discovering what they normally 
do and how, when altered, they contribute to the development of cancer, This is 
the topic of the next section. 



Summary 

Cancer-critical genes can be classified into two groups, according to whether their gain 
of function or their loss of function contributes to cancer development. Gain-of func- 
tion mutations that convert proto- oncogenes to oncogenes stimulate cells to increase 
their numbers when they should not; loss- of function mutations of tumor suppressor 
genes abolish the inhibitory controls that normally help to hold cell numbers in check. 
Oncogenes have a dominant genetic effect, and many of them were first discovered 
because they cause cancer in animals when introduced by a retrovirus that originally 
picked up a normal form of the gene (a proto- oncogene) from a previous host cell 
Oncogenes can also be identified by characteristic chromosomal aberrations that can 
activate a proto-oncogene. 

Mutations in tumor suppressor genes are generally recessive, in that cells tend to 
behave normally until both gene copies are deleted, inactivated, or silenced epigeneti- 
cally. Several of these genes were initially identified in rare, hereditary cancer syn- 
dromes, but their loss or inactl 'nation is a common feature of many sporadic cancers. 
Individuals who inherit one defective and me functional copy of a tumor suppressor 
gene have an increased predisposition toward developing cancer because a single 
change, anywhere in the body, that eliminates or inactivates the remaining good gene 
copy is enough to produce a cell that totally lacks the tumor-suppressor function. 

The quest for cancer-critical genes continues, with increasingly powerful tools 
available for systematically searching die DNA or mRNAs of cancer cells for either sig- 
nificant mutations or altered expression. Once a candidate gene has been identified, its 
importance for cancer development can be assessed in mice— by overexposing it in 
the case of a candidate oncogene, or by inactivating it in the case of a candidate tumor 
suppressor gene. 
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THE MOLECULAR BASIS OF CANCER-CELL 
BEHAVIOR 

The cancer-gene hunter views oncogenes and tumor suppressor genes — and the 
mutations and epigenetic changes that affect them — differently. But, from a 
cancer cell's point of view, they are two sides of the same coin. The same kinds 
of effects on cell behavior can result from mutations in either class of genes, 
because most cellular control mechanisms have both stimulatory (proto-onco- 
gene) and inhibitory (tumor suppressor) components. If the aim is to under- 
stand how cancer cells function, and to do this in terms of molecular genetics, 
the important distinction is not between tumor-suppressor genes and proto- 
oncogenes, but between cancer- critical genes that act in different biochemical 
and regulatory pathways. 

Some of the pathways important in cancer convey signals from a cell's envi- 
ronment (discussed in Chapter 15); others are responsible for the cell's internal 
programs, such as those that control the cell cycle (discussed in Chapter 17) or 
cell death (discussed in Chapter 18); still others govern the cell's movements 
(discussed in Chapter 16) or mechanical interactions with its neighbors (dis- 
cussed in Chapter 19). The various pathways are linked and interdependent in 
complex ways. Much of what we know about them has been learned as a 
byproduct of cancer research; conversely, research on basic aspects of cell biol- 
ogy has transformed our understanding of cancer. 

In the first section of this chapter, we summarized the general properties 
that make a cell cancerous and listed the kinds of misbehaviors that a cancer cell 
displays. In this section, we consider how these characteristic misbehaviors 
arise from mutations in cancer- critical genes and how the functions of these 
genes in the context of cancer can be determined. We end the section by dis- 
cussing colon cancer as an extended example, showing how a succession of 
changes in cancer-critical genes enables a tumor to evolve from one pattern of 
bad behavior to another that is worse. 



Studies of Both Developing Embryos and Genetically Engineered 
Mice Have Helped to Uncover the Function of Cancer-critical 
Genes 

Given a gene that is mutated in a cancer, we need to understand both how the 
gene functions in normal cells and how mutations in the gene contribute to the 
aberrant behaviors characteristic of cancer cells. When the Rb gene was origi- 
nally cloned, for example, all that was known was that it was mutated in 
retinoblastomas. In the case of Ras, the mutant gene was known to direct cells to 
proliferate excessively and inappropriately in culture, but how the Ras protein 
functioned in either normal or cancer cells was a mystery. For both Rb and Ras, 
cancer research was the starting point for studies that revealed the key role of 
these proteins in normal cells — Rb as a cell-cycle inhibitor and Ras as a central 
component of intracellular signaling pathways. 

Today, we know much more about cells, so that when a new gene is identi- 
fied as critical for cancer, it often turns out to be familiar from studies in another 
context. Many oncogenes and tumor suppressor genes, for example, are 
homologs of genes already known for their role in embryonic development. 
Examples include components of practically all the major signaling pathways 
through which cells communicate during development (discussed in Chapters 
15 and 22): the Wnt, Hedgehog, TGFP, Notch, and receptor tyrosine kinase sig- 
naling pathways all include components encoded by cancer-critical genes — 
with Ras being part of the last of these pathways. 

With hindsight, this is no surprise. As we discuss in Chapter 23, the same sig- 
naling mechanisms that control embryonic development operate in the normal 
adult body to control cell turnover and maintain homeostasis. Both the devel- 
opment of a multicellular animal and its maintenance as an adult depend on 
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cell-cell communication and on regulated cell growth, cell division, cell differ- 
entiation, cell death, cell movement, and cell adhesion — in other words, on all 
the aspects of cell behavior that are deranged in cancer. Developmental biology, 
often using model animals such as Drosophila and C. elegans, thus provides a 
key to the normal functions of many cancer-critical genes. 

Ultimately, however, we want to know what alterations in these genes do to 
cells in the tissues that give rise to the cancer. Some information can be obtained 
by studying cells in culture or by examining samples from human cancer patients. 
But to investigate how mutations in cancer- critical genes affect tissues in a whole 
organism, transgenic and knockout mice have proven particularly useful. 

Transgenic mice that carry an oncogene in all their cells can be generated by 
methods described in Chapter 8. Oncogenes introduced in this way may be 
expressed in many tissues or in only a select few, depending on the regulatory 
DNA associated with the transgene. Studies of such mice confirm that a single 
oncogene is generally not sufficient to turn a normal cell into a cancer cell. Typ- 
ically, in mice engineered to express a Myc or Ras oncogene, some of the tissues 
that express the oncogene may show enhanced cell proliferation, and, over time, 
occasional cells will undergo further changes to give rise to cancers. Most cells 
expressing the transgene, however, do not give rise to cancers. Nevertheless, 
from the point of view of the whole animal, the inherited oncogene is a serious 
menace because it creates a high risk that a cancer will arise somewhere in the 
body. Mice that express both Myc and Ras oncogenes (bred by mating a trans- 
genic mouse carrying a Myc oncogene with one carrying a Ras oncogene) 
develop cancers earlier and at a much higher rate than either parental strain 
(Figure 20-36); but, again, the cancers originate as scattered isolated tumors 
among noncancerous cells. Thus, even cells expressing these two oncogenes 
must undergo further, randomly generated changes to become cancerous, 
strongly suggesting that multiple mutations are required for tumorigenesis, as 
supported by other evidence and discussed earlier. 

Numerous tumor suppressor genes have been knocked out in mice, includ- 
ing Rb. As anticipated, many of the mutant strains that are missing one copy of 
a tumor suppressor gene are cancer prone, and the tumors that develop typi- 
cally show a loss of heterozygosity — that is, they have lost or inactivated the sec- 
ond copy of the gene. Deletion of both copies of a tumor suppressor gene often 
leads to death at an embryonic stage, reflecting the essential roles of these genes 
in normal development. To overcome this problem and test the effect of 
homozygous mutations in an adult tissue, one can engineer mice to carry con- 
ditional mutations (see Figure 5-79), such that only one organ or tissue — say, the 
liver — displays the genetic defect. Alternatively, mice can be engineered so that 
the defect is created in response to an experimental signal (a drug) delivered at 
a chosen time. Through the use of these techniques, mice not only provide 
insights into the mechanisms of tumor formation, but also models in which to 
test and develop new cancer therapies. 
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Figure 20-36 Oncogene collaboration 
in transgenic mice. The graphs show the 
incidence of tumors in three types of 
transgenic mouse strains, one carrying a 
Myc oncogene, one carrying a Ras 
oncogene, and one carrying both 
oncogenes. For these experiments two 
lines of transgenic mice were first 
generated. One carries an inserted copy 
of an oncogene created by fusing the 
proto-oncogene Myc with the mouse 
mammary tumor virus regulatory DNA 
(which then drives Myc overexpression in 
the mammary gland). The other line 
carries an inserted copy of the Ras 
oncogene under control of the same 
regulatory element. Both strains of mice 
develop tumors much more frequently 
than normal, most often in the mammary 
or salivary glands. Mice that carry both 
oncogenes together are obtained by 
crossing the two strains. These hybrids 
develop tumors at a far higher rate still, 
much greater than the sum of the rates 
for the two oncogenes separately. 
Nevertheless, the tumors arise only after 
a delay and only from a small proportion 
of the cells in the tissues where the two 
genes are expressed. Further accidental 
changes, in addition to the two 
oncogenes, are apparently required for 
the development of cancer. (After E. Sinn 
et al., Cell 49:465-475, 1 987. With 
permission from Elsevier.) 



Many Cancer-Critical Genes Regulate Cell Proliferation 

Many cancer- critical genes encode components of the pathways that regulate the 
social behaviors of cells in an animal — in particular, components of the signaling 
pathways through which influences from neighboring cells control whether a cell 
grows, divides, differentiates, or dies (Figure 20-37). In fact, many of the compo- 
nents of these signaling pathways were first identified through searches for can- 
cer-causing genes, and a full list of the proteins encoded by proto- oncogenes and 
tumor suppressor genes includes examples of practically every type of protein 
involved in cell signaling: secreted signal proteins, transmembrane receptors, 
intracellular GTP-binding proteins, protein kinases, gene regulatory proteins, and 
many other classes of proteins besides. Cancer mutations alter these components 
in a way that creates proliferative signals even when more cells are not needed, 
switching on cell growth, DNA replication, and cell division inappropriately 
Examples include the gain-of-function mutations that inappropriately activate 
the EGF receptor and the Ras protein, discussed earlier. 
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The components in signaling pathways that normally function to inhibit cell 
proliferation often appear as tumor suppressors. A well- studied example is the 
TGF(3 signaling pathway (discussed in Chapter 15). Loss-of-function mutations 
in this pathway contribute to the development of several types of human can- 
cers. The receptor TGFP-RII, for example, is mutated in some cancers of the 
colon, as is Smad4 — a key intracellular signaling protein in the pathway— which 
is also often inactivated in cancers of the pancreas and some other organs. These 
findings reflect the normal function of the TGF(3 pathway in restricting cell pro- 
liferation (see Figure 23-26). 

Not surprisingly mutations that directly affect the central cell-cycle con- 
trol system feature prominently in many cancers. The tumor suppressor pro- 
tein Rb, discussed earlier, controls a key point at which cells decide to enter 
the cell cycle and replicate their DNA. Rb serves as a brake that restricts entry 
into S phase by binding to and inhibiting gene regulatory proteins of the E2F 
family which are needed to transcribe genes that encode proteins required for 
entrance into S phase. Normally this inhibition by Rb is relieved at the appro- 
priate time by the phosphorylation of Rb by various cy din- dependent kinases 
(Cdks), which cause Rb to release its inhibitory grip on the E2F proteins (dis- 
cussed in Chapter 17). 

Many cancer cells proliferate inappropriately by eliminating Rb entirely, as 
we have already seen. Others achieve the same effect by acquiring mutations 
that alter other components of the Rb regulatory pathway (Figure 20-38). In 
normal cells, a complex of cyclin D and the cyclin-dependent kinase Cdk4 (Gi- 
Cdk) is responsible for phosphorylating Rb so as to allow progress through the 
cycle (see Figure 17-62 and Table 17-1, p. 1063). The pl6 (INK4) protein— which 
is produced when cells are stressed— inhibits cell-cycle progression by prevent- 
ing the formation of an active cyclin D-Cdk4 complex; it is an important com- 
ponent of the normal cell-cycle-arrest response to stress. Some glioblastomas 
and breast cancers have amplified the genes encoding Cdk4 or cyclin D, thus 
favoring cell proliferation. And deletion or inactivation of the pl6 gene is com- 
mon in many forms of human cancer. In cancers in which mutations do not 
inactivate the pi 6 gene, DNA methylation of its regulatory region often silences 
the gene; this is an example of an epigenetic change contributing to the devel- 
opment of cancer. Mutation in any one component of a given pathway is suffi- 
cient to inactivate the pathway and promote cancer. As expected, therefore, a 



Figure 20-37 Chart of the major 
signaling pathways relevant to cancer in 
human cells, indicating the cellular 
locations of some of the proteins 
modified by mutation in cancers. 
Products of both oncogenes and tumor 
suppressor genes often act within the 
same pathways. Individual signaling 
proteins are indicated by red circles, with 
the cancer-critical components and 
control mechanisms discussed in this 
chapter in green. Stimulatory and 
inhibitory interactions between proteins 
are designated as shown in the key. 
(Adapted from D. Hanahan and 
R.A. Weinberg, Cell 1 00:57-70, 2000. With 
permission from Elsevier.) 
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Figure 20-38 The pathway by which Rb controls cell cycle entry contains both proto-oncogenes and tumor suppressor 
genes. All the components of this pathway have been found to be altered by mutation in human cancers (products of 
proto-oncogenes, green; products of tumor suppressor genes, red; E2F is shown in blue because it has both inhibitory and 
stimulatory actions, depending on the other proteins that are bound to it). In most cases, only one of the components is 
altered in any individual tumor. (A) A simplified view of the dependency relationships in this pathway; see Figure 1 7-62 for 
further details. (B) The Rb protein inhibits entry into the cell-division cycle when it is unphosphorylated. The complex of 
Cdk4 and cyclin D phosphorylates Rb, thereby encouraging cell proliferation. When a cell is stressed, pi 6 inhibits the 
formation of an active Cdk4-cyclin D complex, preventing proliferation. Inactivation of Rb or pi 6 by mutation encourages 
cell division (thus each can be regarded as a tumor suppressor), while overactivity of Cdk4 or cyclin D encourages cell 
division (thus each can be regarded as a proto-oncogene). 

cancer rarely inactivates more than one component in a pathway: this would 
bring no additional benefit for the cancer's evolution. 

Distinct Pathways May Mediate the Disregulation of Cell-Cycle 
Progression and the Disregulation of Cell Growth in Cancer Cells 

As described in Chapter 17, a special cell-cycle control system ensures that 
eucaryotic cells make a copy of each of their chromosomes and segregate exactly 
one copy into each of the two daughter cells created by cell division. The initia- 
tion of this process requires cell-cycle progression signals that are carefully reg- 
ulated, as the default state for each cell in a multicellular organism is to remain 
in a quiescent G state (see p. 1103). But cell proliferation requires more than 
progression through the cell cycle; it also requires cell growth, which involves 
complex anabolic processes through which the cell synthesizes all the necessary 
macromolecules from small-molecule precursors. If a cell divides inappropri- 
ately without growing first, it will get smaller at each division and will ultimately 
halt or die. The continued growth of a cancer therefore requires heritable 
changes that not only disregulate cell-cycle progression but also provoke cell 
growth (Figure 20-39). 

The phosphoinosltide 3-kinase (PI 3 - kinase) /Akt intracellular signaling 
pathway is critical for cell growth control. As described in Chapter 15, various 
extracellular signal proteins, including insulin and insulin -like growth factors, 
normally stimulate this pathway. In cancer cells, however, the pathway is acti- 
vated by mutation so that the cell can grow in the absence of such signals. The 
resulting ah normal activation of the protein kinase Akt is central to this disregu- 
lated growth process, since this activation not only stimulates proiein synthesis 
(see Figure 17-65). but also greatly increases both glucose uptake (via mTOR 
activation) and the production of the acetyl CoA in the cytosol required for cell 
lipid synthesis (via ACL activation), as outlined in Figure 20-39. Thus, a common 
tumor-suppressor gene mutation, in many different cancers, is loss of the PTEN 
phosphatase, whose normal function is to limit Akt activation by dephosphory- 
lating the molecules that PI 3-kinase phosphorylates. 
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The abnormal activation of the PI 3-kinase/Akt pathway, which normally 
occurs early in the process of tumor progression, explains the excessive rate of 
glycolysis that is observed in tumor cells, known as the Warburg effect. Accom- 
panied by excretion of excess pyruvate as lactate, the excessive glucose uptake 
by cancer cells is used to locate tumors by modern whole-body imaging tech- 
niques (see Figure 20-1). 



Mutations in Genes That Regulate Apoptosis Allow Cancer Cells 
to Survive When They Should Not 

Control of cell numbers depends on maintaining a balance between cell prolif- 
eration and cell death. In the germinal centers of lymph nodes, for example, B 
cells proliferate rapidly, but most of their progeny are eliminated by apoptosis. 
Correct regulation of apoptosis is thus essential in maintaining the normal bal- 
ance of cell birth and death in tissues that undergo cell turnover. It also has a 
vital role in eliminating damaged or stressed cells. As described in Chapter 18, 
animal cells commit suicide by undergoing apoptosis when they sense that 
something has gone drastically wrong — when their DNA is severely damaged, 
for example, or when they are deprived of extracellular survival signals that tell 
them they are in their proper place. As previously discussed (see Figure 20-14), 
cancer cells are relatively resistant to apoptosis, which allows them to increase 
in number and survive where they should not. 

Mutations in apoptosis-control genes usually account for this resistance. One 
protein that normally inhibits apoptosis, called Bcl2, was discovered and named 
because its expression is activated by a chromosomal translocation in a B-cell 
lymphoma. The translocation places the Bcl2 gene under the control of a regula- 
tory DNA sequence that drives Bcl2 overexpression. This allows the survival of B 



Figure 20-39 Cells may require two 
types of signals to proliferate. 

(A) In order to multiply successfully, most 
normal cells are suspected to require 
both extracellular signals that drive cell 
cycle progression {here red mitogen) and 
extracellular signals that drive cell growth 
(here blue growth factor). (B) Diagram of 
the signaling system containing Akt that 
drives cell growth through greatly 
stimulating glucose uptake and 
utilization, including a conversion of the 
excess citric acid produced from sugar 
intermediates in mitochondria into the 
acetyl CoA that is needed in the cytosol 
for lipid synthesis and new membrane 
production. As indicated, protein 
synthesis is also increased. This system 
becomes abnormally activated early in 
tumor progression. 
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lymphocytes that would normally have died, greatly increasing the number of B 
cells and contributing to the development of the B-cell cancer. 

One of the genes involved in the control of apoptosis is mutated in an excep- 
tionally wide range of cancers. This tumor suppressor gene encodes a protein 
that stands at a crucial intersection in the network of intracellular control path- 
ways governing a cell's responses to DNA damage and various other cell stresses, 
including low oxygen (hypoxia) and growth factor deprivation. The protein is 
tumor protein p53, and it is produced by the Tp53 gene, more commonly known 
as p53. As we now explain, when p53 is defective, genetically damaged dividing 
cells do not merely fail to die; they persist in proliferating, accumulating yet 
more genetic damage that can increase malignancy. 



Mutations in the p53 Gene Allow Many Cancer Cells to Survive 
and Proliferate Despite DNA Damage 

The p53 gene— named for the molecular mass of its protein product— may be 
the most important gene preventing human cancer. Either this tumor suppres- 
sor gene or other components in the p53 pathway are mutated in nearly all 
human cancers. Why is p53 so critical? The answer lies in its multifaceted func- 
tion in cell-cycle control, in apoptosis, and in maintenance of genetic stability — 
all aspects of the fundamental role of the p53 protein in protecting the organism 
against the consequences of cell damage and the risk of cancer. 

In contrast to Rb, most body cells have very little p53 protein under normal 
conditions: although the protein is synthesized, it is rapidly degraded. Moreover, 
p53 is not essential for normal development. Mice in which both copies of the 
gene have been deleted or inactivated typically appear normal in all respects 
except one— they universally develop cancer before 10 months of age. These 
observations suggest that p53 may have a function that is required only in spe- 
cial circumstances. Indeed, when normal cells are deprived of oxygen or 
exposed to treatments that damage DNA, such as ultraviolet light or gamma 
rays, they raise their concentration of p53 protein by blocking the degradation of 
the molecule. The accumulation of p53 protein is seen also in cells where onco- 
genes such as Ras and Myc are unusually active, generating an abnormal stimu- 
lus for cell division. 

In all these cases, the high level of p53 protein acts to limit the harm done. 
Depending on circumstances and die severity of the damage, p53 may either 
drive the damaged or abnormally proliferating cell to commit suicide by apopto- 
sis— a relatively harmless event for the multicellular organism— or may trigger a 
mechanism that bars the cell from dividing so long as the damage remains unre- 
paired. Similarly when telomeres get too short. p53 is activated and inhibits fur- 
ther cell division, producing the phenomenon of replicative cell senescence (Fig- 
ure 20-40). The protection that p53 provides is part of the reason why mutations 
that activate oncogenes such as Ras and Myc are not sufficient to create a tumor. 

The p53 protein performs its job mainly by acting as a gene regulatory pro- 
tein ! ( t.V\> Indeed, the most common mutations observed in p53 in human 
tumors are in its DNA-binding domain, where they cripple the ability of p53 to 
bind to its DNA target sequences. As discussed in Chapter 17, the p53 protein 
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Figure 20-40 Modes of action of the 
p53 tumor suppressor. The p53 protein 
is a cellular stress sensor. In response to 
hyperproliferative signals, DNA damage, 
hypoxia, and/or telomere shortening, the 
p53 levels in the cell rise and cause cells 
to undergo cell cycle arrest, apoptosis, or 
replicative cell senescence. (As discussed 
in Chapter 1 7, a senescent cell 
progressively loses the ability to divide.) 
All of these outcomes keep tumor growth 
in check. 
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exerts its inhibitory effects on the cell cycle, in part at least, by binding to DNA 
to induce the transcription of p21, which encodes a protein that binds to and 
inhibits the Cdk complexes required for progression through the cell cycle. By 
blocking the kinase activity of these Cdk complexes, the p21 protein prevents 
the cell from entering S phase and replicating its DNA. 

The mechanisms by which p53 induces apoptosis are more complex; it can 
stimulate the expression of many pro-apoptotic genes, but it may also bind to 
and inactivate anti-apoptotic Bcl2 proteins on the surface of mitochondria, 
thereby promoting apoptosis (discussed in Chapter 18). 

Cells defective in p53 fail to show these p53-dependent stress responses. 
They tend to escape apoptosis, and, if their DNA is damaged, they may carry on 
dividing, plunging into DNA replication without pausing to repair the damage. As 
a result, they may either die or, far worse, survive and proliferate with a corrupted 
genome (Figure 20-41). Such genetic mayhem can lead both to loss of tumor 
suppressor genes and to activation of oncogenes. The loss of p53 also makes 
some cancer cells much less sensitive to irradiation and to many anticancer 
drugs, which would otherwise cause them to die or to halt their proliferation. 

In summary, p53 helps a multicellular organism cope safely with DNA dam- 
age and other cell stresses, acting as a check on cell proliferation in circum- 
stances where it would be dangerous. The loss of p53 activity is unusually dan- 
gerous because it can promote cancer for four different reasons. First, it allows 
cells with DNA damage to continue through the cell cycle. Second, it allows 
them to escape apoptosis. Third, by permitting division of cells with damaged 
chromosomes, it leads to the genetic instability characteristic of cancer cells, 
allowing further cancer-promoting mutations to accumulate as they divide. 
Fourth, it makes cells relatively resistant to anticancer drugs and irradiation in 
some tumor types. Many other mutations can contribute to one or another of 
these types of misbehavior, but p53 mutations contribute to them all. 



DNA Tumor Viruses Block the Action of Key Tumor Suppressor 
Proteins 

DNA tumor viruses cause cancer mainly by interfering with the cell's controls on 
the cell cycle and apoptosis, including those that depend on p53. To understand 
this type of viral carcinogenesis, it is important to understand the life history of 
viruses. Many DNA viruses use the host cell's DNA replication machinery to 
replicate their own genomes. To produce numerous infectious virus particles 
within a single host cell, the DNA virus has to commandeer this machinery and 
drive it hard, breaking through the normal constraints on DNA replication and 
usually killing the host cell in the process. Many DNA viruses reproduce only in 
this way. But some of them have a second option: they can propagate their 
genome as a quiet, well-behaved passenger in the host cell, replicating in paral- 
lel with the host cell's DNA in the course of ordinary cell division cycles. These 
viruses can switch between two modes of existence according to circumstances, 



Figure 20-41 How the replication of 
damaged DNA can lead to chromosome 
abnormalities, such as gene 
amplification and gene loss. The 

diagram shows one of several possible 
mechanisms. The process begins with 
accidental DNA damage in a cell that 
lacks functional p53 protein. Instead of 
halting at the p53-dependent checkpoint 
in the division cycle, where a normal cell 
with damaged DNA would pause until it 
repaired the damage, the p53-defective 
cell enters S phase, with the 
consequences shown. Once a 
chromosome carrying a duplication and 
lacking a telomere has been generated, 
repeated rounds of replication, chromatid 
fusion, and unequal breakage (the 
so-called breakage-fusion-bridge cycle) 
can increase the number of copies of the 
duplicated region still further. Selection 
in favor of cells with increased numbers 
of copies of a gene in the affected 
chromosomal region will thus lead to 
mutants in which the gene is amplified to 
a high copy number. The multiple copies 
may — either through a recombination 
event or through unrepaired DNA strand 
breakage — become excised from their 
original locus and so appear as 
independent double minute 
chromosomes (see Figure 20-34). The 
chromosomal disorder can also lead to a 
loss of genes, with selection in favor of 
cells that have lost tumor suppressors. 
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remaining latent and harmless for a long time, but then proliferating in occa- 
sional cells in a process that kills the host cell and generates large numbers of 
infectious particles. 

Those DNA viruses that have the ability to replicate their DNA as quiescent 
passengers in a latent phase of viral replication either integrate their viral 
genome into one or more host chromosomes, or they have the ability to form an 
extrachromosomal plasmid that replicates in step with the host chromosomes 
during the latent phase. 

No matter which way of life a DNA virus is following, it is not in its interest 
to kill the host organism. But for viruses with a latent phase, accidents can occur 
that prematurely activate some of the viral proteins that the virus would nor- 
mally use in its replicative phase to allow the viral DNA to replicate indepen- 
dently of the cell cycle. This type of accident can switch on the persistent prolif- 
eration of the host cell, leading to cancer, and the DNA viruses susceptible to 
such accidents have been designated as DNA tumor viruses. 

The papillomaviruses, for example, are the cause of human warts and are 
especially important as a key causative factor in carcinomas of the uterine cervix 
(about 6% of all human cancers). Papillomaviruses infect the cervical epithe- 
lium and maintain themselves in a latent phase in the basal layer of cells as 
extrachromosomal plasmids, which replicate in step with the chromosomes. 
Infectious virus particles are generated through a switch to a replicative phase in 
the outer epithelial layers, as progeny of these cells begin to differentiate before 
being sloughed from the surface. Here, cell division should normally stop, but 
the virus interferes with this cell-cycle arrest so as to allow replication of its own 
genome. Usually the effect is restricted to the outer layers of cells and is rela- 
tively harmless, as in a wart. Occasionally, however, a genetic accident causes 
the viral genes that encode the proteins that prevent cell-cycle arrest to integrate 
into the host chromosome and become active in the basal layer, where the stem 
cells of the epithelium reside. This can lead to cancer, with the viral genes acting 
as oncogenes (Figure 20-42). 

In papillomaviruses, the viral genes that are mainly to blame are called E6 
and E7. The protein products of these viral oncogenes interact with many host 
cell proteins, but, in particular, they bind to two key tumor suppressor proteins 
of the host cell, putting them out of action and so permitting the cell to replicate 
its DNA and divide in an uncontrolled way. One of these host proteins is Rb: by 
binding to Rb, the viral E7 protein prevents Rb from binding to and inhibiting 
the E2F proteins mentioned earlier, which allows uncontrolled entry into S 
phase. The other is p53; by binding to p53, the viral E6 protein triggers p53 
destruction (Figure 20-43), allowing the abnormal cell to survive, divide, and 
accumulate yet more abnormalities. Other DNA tumor viruses use similar 
mechanisms to inhibit Rb and p53, underlining the importance of inactivating 



Figure 20-42 How certain 
papillomaviruses are thought to give 
rise to cancer of the uterine cervix. 

Papillomaviruses have double-stranded 
circular DNA chromosomes of about 8000 
nucleotide pairs. These chromosomes are 
normally stably maintained in the basal 
cells of the epithelium as plasmids whose 
replication is regulated so as to keep step 
with the chromosomes of the host (left). 
Rare accidents can cause the integration 
of a fragment of such a plasmid into a 
chromosome of the host, altering the 
environment of the viral genes in the 
basal cells. This (or possibly some other 
cause) disrupts the normal control of viral 
gene expression. The unregulated 
production of viral proteins interferes 
with the control of cell division in the 
basal cells, thereby helping to generate a 
cancer (right). 
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both of these tumor suppressor pathways for releasing the normal constraints 
on cell proliferation. It is striking that virtually all human cancers have also inac- 
tivated both pathways, usually affecting only one component in each. 



Figure 20-43 Activation of cell 
proliferation by a DNA tumor virus. 

Papillomavirus produces two viral 
proteins, E6 and E7, which sequester the 
host cell's p53 and Rb proteins, 
respectively. E6 protein binding leads to 
ubiquitylation of its p53 partner, inducing 
p53 proteolysis (not shown). The SV40 
virus (a related virus that infects 
monkeys) uses a single dual-purpose 
protein, called large T antigen, to 
sequester these same two proteins. 



The Changes in Tumor Cells That Lead to Metastasis Are Still 
Largely a Mystery 

Perhaps the most significant gap in our understanding of cancer concerns inva- 
siveness and metastasis. We have yet to clearly identify mutations that specifi- 
cally permit cells to invade surrounding tissues, spread through the body, and 
form metastases. Indeed, it is not even clear exactly what properties a cancer cell 
must acquire to become metastatic. One extreme view is that the ability of can- 
cer cells to metastasize requires no further genetic changes beyond those 
needed to weaken the normal controls on cell growth, cell division, and cell 
death. Some experiments using DNA microarrays to compare the mRNAs 
expressed in a metastasis and in the primary tumor support this view. The alter- 
native view, more widely held, is that metastasis is a difficult and complex task 
that requires additional mutations, but that these may be so varied according to 
circumstance that they are hard to discover individually. 

One molecular change in metastatic cancer cells from a variety of tumors — 
including melanomas and carcinomas of breast, stomach, and liver — is an over- 
expression of the Rho-family GTPase RhoC, which helps to mediate actin-based 
cell motility. In some cases, inhibition of RhoC activity in these cells is sufficient 
to block their metastatic ability. It is not yet clear, however, exactly how RhoC 
contributes to the ability to metastasize. 

Metastasis presents different problems for different types of cells. For a can- 
cerous white blood cell already roaming the body via the circulation, metastasis 
should be easier than for a carcinoma cell that has to escape from an epithelium. 
As we discussed earlier, it is helpful to distinguish two phases of tumor progres- 
sion required for a carcinoma to metastasize. First, the cells must escape the 
normal confines of their parent epithelium and begin to invade the tissue imme- 
diately beneath. Second, they must travel to distant sites via the blood or lymph 
and establish metastases. 

The first step, local invasiveness, requires a loss of the mechanisms that nor- 
mally hold epithelial cells together. As mentioned earlier, this step resembles the 
normal developmental process known as the epithelial-to-mesenchymal transi- 
tion (EMT), in which epithelial cells undergo a shift in character, becoming less 
adhesive and more migratory (discussed in Chapter 19). At the heart of this tran- 
sition are changes in expression of the E-cadherin gene. The primary function of 
the transmembrane E-cadherin protein is in cell-cell adhesion, and it is especially 
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important in binding epithelial cells together through adherens junctions (see 
Figure 19-15). In some carcinomas of the stomach and of the breast, the E-cad- 
herin gene has been identified as a tumor suppressor gene. The abnormal 
behavior of tumor cells lacking the protein can be partially rescued, at least in 
culture: if a functional E-cadherin gene is put back into these cancer cells, they 
lose some of their invasive characteristics and begin to cohere more like normal 
epithelial cells. Loss of E-cadherin t therefore, may promote cancer development 
by facilitating local invasiveness. 

The initial entry of tumor cells into the circulation is helped by the presence 
of a dense supply of blood vessels and sometimes lymphatic vessels, which 
tumors attract to themselves by secreting angiogenic factors such as VEGF, as 
discussed earlier; the abnormal fragility and leakiness of these new vessels may 
help the cells that have become invasive to enter and then move through the cir- 
culation with relative ease. The remaining steps in metastasis, involving exit 
from a blood or lymphatic vessel and the effective colonization of remote sites, 
are much harder to study. To discover which of the later steps in metastasis pre- 
sent cancer cells with the greatest difficulties, one can label the cells with a flu- 
orescent dye or green fluorescent protein (GFP), inject them into the blood- 
stream of a mouse, and then monitor their fate. In such experiments one 
observes that many cells survive in the circulation, lodge in small vessels, and 
exit into the surrounding tissue, regardless of whether they come from a tumor 
that metastasizes or one that does not. Some cells die immediately after they 
enter foreign tissue; others survive entry into the foreign tissue but fail to prolif- 
erate; still others divide a few times and then stop, forming micrometastases 
containing ten to several thousand cells. At this point, the metastasis-competent 
cells outperform their nonmetastatic relatives, as only they form growing 
tumors, raising the possibility that the ability to continue to proliferate in a for- 
eign tissue (the process called colonization) is a key property that cancer cells 
must acquire to become metastatic (Figure 20-44). 

The challenge is to use assays such as this to discover at a molecular level 
what, if anything, distinguishes the cells that colonize from those that do not. 
One possibility, as discussed earlier, is that the cells capable of colonization are 
just those rare members of the parent tumor population that have the special 
character of cancer stem cells. It seems that this must be at least part of the story, 
as we have already explained. But although we know of markers that help iden- 
tify cancer stem cells, at least for some cancers, we do not yet understand what 
it is that fundamentally gives them their special stem-cell properties. This is part 
of the wider problem of understanding stem cells in general, discussed at length 
in Chapter 23. In any case, it remains very hard to pinpoint what additional 
genetic or epigenetic changes, if any, are needed to enable cancer stem cells to 
colonize foreign tissue. 



Figure 20-44 The barriers to metastasis. 

Studies of labeled tumor cells leaving a 
tumor site, entering the circulation, and 
establishing metastases show which 
steps in the metastatic process, outlined 
in Figure 20-1 7, are difficult or 
'inefficient,' in the sense that they are 
steps in which large numbers of cells fail 
and are lost. It is in these difficult steps 
that highly metastatic cells are observed 
to have much greater success than 
nonmetastatic cells. It seems that the 
ability to escape from the parent tissue, 
and an ability to survive and grow in the 
foreign tissue, are key properties that 
cells must acquire to become metastatic. 
(Adapted from A.F. Chambers et al., Breast 
Cancer Res. 2:400-407, 2000. With 
permission from BioMed Central Ltd.) 



Colorectal Cancers Evolve Slowly Via a Succession of Visible 
Changes 

At the beginning of this chapter, we saw that most cancers develop gradually 
from a single aberrant cell, progressing from benign to malignant tumors by the 
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accumulation of a number of independent genetic accidents and epigenetic 
changes. We have discussed what some of these accidents and changes are in 
molecular terms and seen how they contribute to cancerous behavior. We now 
examine one of the common human cancers more closely, using it to illustrate 
and enlarge upon some of the general principles and molecular mechanisms we 
have introduced, attempting to make sense of the natural history of the disease 
in these terms. We take colorectal cancer as our example, in which the steps of 
tumor progression have been followed in vivo with a colonoscope (a fiber-optic 
device for viewing the interior of the colon and rectum) and carefully studied at 
the molecular level. This model, based on studies of both sporadic and inherited 
forms of the human disease, serves as a paradigm for illustrating the stepwise 
nature of tumor development, and it allows us to associate specific morpholog- 
ical stages with inherited changes in the function of particular genes. 

Colorectal cancers arise from the epithelium lining the colon (the large 
intestine) and rectum (the terminal segment of the gut). The organization of this 
tissue is broadly similar to that of the small intestine, discussed in detail in 
Chapter 23 (pp. 1436-1442). Both regions of gut epithelium are renewed at an 
extraordinarily rapid rate, taking about a week to completely replace most of the 
epithelial sheet. In both regions the renewal depends on stem cells that lie in 
deep pockets of the epithelium, called intestinal crypts. The signals that main- 
tain the stem cells and control the normal organization and renewal of the 
epithelium are beginning to be quite well understood, as explained in Chapter 
23. This is in large part thanks to insights that have come from cancer research. 
Mutations that disrupt the normal organizing signals begin the process of tumor 
progression for most colorectal cancers. 

Colorectal cancers are common, currendy causing nearly 60,000 deaths a year 
in the United States, or about 10% of total deaths from cancer. Like most cancers, 
they are not usually diagnosed until late in life (90% occur after the age of 55). 
However, routine examination of normal adults with a colonoscope often reveals 
a small benign tumor, or adenoma, of the gut epithelium in the form of a protrud- 
ing mass of tissue called a polyp (Figure 20-45A; see also Figure 23-23) . These ade- 
nomatous polyps are believed to be the precursors of a large proportion of col- 
orectal cancers. Because the progression of the disease is usually very slow, there 
is typically a period of about 10 years in which the slowly growing tumor is 
detectable but has not yet turned malignant. Thus, when people are screened by 
colonoscopy in their fifties and the polyps are removed through the colono- 
scope — a quick and easy surgical procedure — the subsequent incidence of col- 
orectal cancer is very low, according to some studies, less than a quarter of what it 
would be otherwise. 

Colon cancer provides a clear example of the phenomenon of tumor pro- 
gression discussed previously. In microscopic sections of polyps smaller than 1 
cm in diameter, the cells and their arrangement in the epithelium usually appear 
almost normal. The larger the polyp, the more likely it is to contain cells that look 
abnormally undifferentiated and form abnormally organized structures. Some- 
times, two or more distinct areas can be distinguished within a single polyp, with 
the cells in one area appearing relatively normal and those in the other appear- 
ing clearly cancerous, as though they have arisen as a mutant subclone within 
the original clone of adenomatous cells. At later stages in the disease, the tumor 
cells become invasive, first breaking through the epithelial basal lamina, then 
spreading through the layer of muscle that surrounds the gut (Figure 20-45B), 
and finally metastasizing to lymph nodes via lymphatic vessels and to liver, lung, 
and other organs via blood vessels. 

A Few Key Genetic Lesions Are Common to a Large Fraction of 
Colorectal Cancers 

What are the mutations that accumulate with time to produce this chain of 
events? Of those genes so far discovered to be involved in colorectal cancer, 
three stand out as most frequently mutated — the proto-oncogene K-Ras (a 
member of the Ras gene family) and the tumor suppressor genes p53 and Ape 
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Figure 20-45 Cross sections showing 
the stages in development of a typical 
colon cancer. (A) An adenomatous polyp 
from the colon. The polyp protrudes into 
the lumen — the space inside the colon. 
The rest of the wall of the colon is 
covered with normal colonic epithelium; 
the epithelium on the polyp appears 
mildly abnormal. (B) A carcinoma that is 
beginning to invade the underlying 
muscle layer. (Courtesy of Paul Edwards.) 
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(discussed below). Others are involved in smaller numbers of colon cancers 
(Table 20-2). Still others remain to be identified. 

One approach to discovery of the mutations responsible for colorectal can- 
cer is to screen the cells for abnormalities in genes already known or suspected 
to be involved in cancers elsewhere. This type of genetic screening has revealed 
that about 40% of colorectal cancers have a specific point mutation in K-Ras, 
activating it to become an oncogene, and about 60% have inactivating muta- 
tions or deletions of p53. 

As discussed earlier, another approach is to track down the genetic defects 
in those rare families that show a hereditary predisposition to colorectal cancer. 



Table 20-2 Some Genetic Abnormalities Detected in Colorectal Cancer Cells 







PATHWAY 


HUMAN COLON 






AFFECTED 


CANCERS (%) 


K-Ras 


oncogene 


receptor tyrosine-kinase signaling 


40 


p-catenin 1 


oncogene 


Wnt signaling 


5-10 


Apc } 


tumor suppressor 


Wnt signaling 


>80 


p53 


tumor suppressor 


response to stress and DNA damage 


60 


TGF/3 receptor II 2 


tumor suppressor 


TGFP signaling 


10 


Smad4 2 


tumor suppressor 


TGFp signaling 


30 


MLH1 and other DNA 


tumor suppressor 


DNA mismatch repair 


15 


mismatch repair genes 


(genetic stability) 




(often silenced by 
methylation) 



The genes with the same superscript act in the same pathway, and therefore only one of the components is mutated in an individual cancer. 



THE MOLECULAR BASIS OF CANCER-CELL BEHAVIOR 



1253 



Figure 20-46 Colon of familial adenomatous polyposis coli patient 
compared with normal colon. (A) The normal colon wall is a gently 
undulating but smooth surface. (B) The polyposis colon is completely 
covered by hundreds of projecting polyps (shown in section in Figure 
20-45), each resembling a tiny cauliflower when viewed with the naked 
eye. (Courtesy of Andrew Wyllie and Mark Arends.) 



The first hereditary colorectal cancer syndrome to be analyzed was familial ade- 
nomatous polyposis coli (FAP), in which hundreds or thousands of polyps 
develop along the length of the colon (Figure 20-46). These polyps start to 
appear in early adult life, and if they are not removed, one or more will almost 
always progress to become malignant; the average time from the first detection 
of polyps to the diagnosis of cancer is 12 years. The disease can be traced to a 
deletion or inactivation of the tumor suppressor gene Ape, named after the syn- 
drome. Individuals with FAP have inactivating mutations or deletions of one 
copy of the Ape gene in all their cells and show loss of heterozygosity in tumors, 
even in the benign polyps. Most patients with colorectal cancer do not have the 
hereditary condition. But more than 80% of their cancers (but not their normal 
cells) have inactivated both copies of the Ape gene through mutations acquired 
during their lifetime. Thus, by a route similar to that we discussed for retinoblas- 
toma, mutation of the Ape gene has been identified as one of the central ingre- 
dients of colorectal cancer. 

As explained earlier, even where there is no known hereditary syndrome, 
tumor suppressor genes can be tracked down by searching for genetic changes 
acquired during the evolution of the tumor cells. A systematic scan of a large 
number of colorectal cancers reveals that selected regions of certain chromo- 
somes frequently have precisely the same DNA sequences on both the mater- 
nally and paternally inherited chromosomes. This loss of heterozygosity sug- 
gests that those regions may harbor tumor suppressor genes. One of them is the 
region that includes Ape. Another includes the Smad4 gene, which is mutated in 
about 30% of colon cancers; its removal blocks the growth-inhibitory effects of 
the TGFfJ pathway. Specific parts of other chromosomes also show frequent 
losses or gains in colorectal cancers and are likely to be fruitful targets for find- 
ing additional cancer- critical genes. 

As our knowledge of genes and their functions has expanded, another use- 
ful approach has been to look at genes that encode proteins that interact with 
the protein encoded by a known cancer- critical gene, in the expectation that 
these genes, too, may be targets for mutation. The Ape protein is now known to 
be an inhibitory component of the Wnt signaling pathway (discussed in Chap- 
ter 15). It binds to the P-catenin protein, another component of the Wnt path- 
way, and helps to induce its degradation. By inhibiting P-catenin in this way, 
APC prevents the P-catenin from migrating to the nucleus, where it acts as a 
transcriptional regulator to maintain the stem-cell state, as discussed in Chap- 
ter 23. Loss of Ape results in an excess of free P-catenin and thus to an uncon- 
trolled expansion of the stem-cell population and a massive increase in the 
number and size of the intestinal crypts (see Figure 23-23). 

When the /5-catenin gene was sequenced in a collection of colorectal tumors, 
among the few tumors that did not have Ape mutations, a high proportion had 
activating mutations in p-catenin instead. Thus, it is excessive activity in the Wnt 
signaling pathway that is critical for the initiation of this cancer, rather than any 
single oncogene or tumor suppressor gene that the pathway contains. This is in 
accord with the general principle that an individual cancer rarely acquires muta- 
tions in more than one component in a pathway that is critical to tumorigene- 
sis, as there is no competitive advantage for it to do so. 

The being so, why is the Ape gene in particular so much the most common 
culprit in colorectal cancer? The Ape protein is large and it interacts not only 
with the p-catenin but also with various other cell components, including 
microtubules. Loss of Ape appears to increase the frequency of mitotic spindle 
defects, leading to chromosome abnormalities when cells divide. This addi- 
tional, independent cancer-promoting effect could explain why Ape mutations 
feature so prominently in the causation of colorectal cancer. 
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Some Colorectal Cancers Have Defects in DNA Mismatch Repair 

In addition to the hereditary disease (FAP) associated with Ape mutations, there 
is a second, more common, kind of hereditary predisposition to colon carci- 
noma in which the course of events differs from the one we have described for 
FAR In this condition, called hereditary nonpolyposis colorectal cancer (HNPCC), 
the probability of colon cancer is increased without any increase in the number 
of colorectal polyps (adenomas). Moreover, the cancer cells are unusual, in that 
they have a normal (or almost normal) karyotype; the majority of colorectal 
tumors in non-HNPCC patients, in contrast, have gross chromosomal abnor- 
malities, with multiple translocations, deletions, and other aberrations, as well 
as having many more chromosomes than normal (Figure 20-47). 

The mutations that predispose HNPCC individuals to colorectal cancer 
occur in one of several genes that code for central components of the DNA mis- 
match repair system, which are homologous in structure and function to the 
MutL and MutS genes in bacteria and yeast (see Figure 5-20). Only one of the 
two copies of the involved gene is defective, so the repair system is still able to 
remove the inevitable DNA replication errors that occur in the patient's cells. 
However, as discussed previously, these individuals are at risk, because the acci- 
dental loss or inactivation of the remaining good gene copy will immediately 
elevate the spontaneous mutation rate by a hundredfold or more (discussed in 
Chapter 5). These genetically unstable cells now can presumably speed through 
the standard processes of mutation and natural selection that allow clones of 
cells to progress to malignancy. 

This particular type of genetic instability produces invisible changes in the 
chromosomes— most notably changes in individual nucleotides and short 
expansions and contractions of mono- and dinucleotide repeats such as AAAA. . . 
or CACACA. . . . Once the defect in HNPCC patients was recognized, mutations in 
mismatch repair genes were found in about 15% of the colorectal cancers occur- 
ring in people with no inherited predisposing mutation. 

Thus, the genetic instability found in many colorectal cancers can be 
acquired in at least two ways. The majority of the cancers display a form of 
chromosomal instability that leads to visibly altered chromosomes, whereas in 
the others the instability occurs on a much smaller scale and reflects a defect in 
DNA mismatch repair. Indeed, many carcinomas show either chromosomal 
instability or defective mismatch repair— but rarely both. These findings clearly 
demonstrate that genetic instability is not an accidental byproduct of malig- 
nant behavior but a contributory cause — and that cancer cells can acquire this 
instability in multiple ways. 



Figure 20-47 Chromosome 
complements (karyotypes) of colon 
cancers showing different kinds of 
genetic instability. (A) The karyotype of a 
typical cancer shows many gross 
abnormalities in chromosome number 
and structure. Considerable variation can 
also exist from cell to cell (not shown). 
(B) The karyotype of a tumor that has a 
stable chromosome complement with 
few chromosomal anomalies. Its defects 
are mostly invisible, having been created 
by defects in DNA mismatch repair. All of 
the chromosomes in this figure were 
stained as in Figure 20-1 3, the DNA of 
each human chromosome being stained 
with a different combination of 
fluorescent dyes. {Courtesy Wael Abdel- 
Rahman and Paul Edwards.) 



The Steps of Tumor Progression Can Often Be Correlated with 
Specific Mutations 

In what order do K-Ras, p53, Ape, and the other identified colorectal cancer-crit- 
ical genes mutate, and what contribution does each of them make to the asocial 
behavior of the cancer cell? There is no single answer, because colorectal cancer 
can arise by more than one route: thus, we know that in some cases, the first 
mutation can be in a DNA mismatch repair gene; in others, it can be in a gene 
regulating cell proliferation. Moreover, as previously discussed, a general feature 



THE MOLECULAR BASIS OF CANCER-CELL BEHAVIOR 



1255 



alteration 



loss of 
Ape 



normal 
epithelium 



L 



activation of 
K-Ras 



loss of Smad4 

and other tumor 

suppressors 



p53 



otr»ei Ui 



hyperplastic 
epithelium 



early 
adenoma 



* intprmprli^tp * latp — ** ^ » 



intermediate 
adenoma 



late 
adenoma 



carcinoma ■ 



invasion and 
metastasis 



acquisition of increased genetic and epigenetk instability 



such as genetic instability or a tendency to proliferate abnormally can arise in a 
variety of ways, through mutations in different genes. 

Nevertheless, certain sets of mutations are particularly common in colorec- 
tal cancer, and they occur in a characteristic order. Thus, in most cases, muta- 
tions inactivating the Ape gene appear to be the first, or at least a very early step, 
as they are detected at the same high frequency in small benign polyps as in 
large malignant tumors. Inherited changes that lead to genetic and epigenetic 
instability are likely also to arise early in tumor progression, since they are 
needed to drive the later steps. 

Activating mutations in the K-Ras gene occur later, as they are rare in small 
polyps but common in larger ones that show disturbances in cell differentiation 
and histological pattern. Cultured colorectal carcinoma cells at this stage show 
the typical features of transformed cells, such as the ability to proliferate with- 
out anchorage to a substratum. Loss of the Smad4 tumor suppressor gene and 
inactivating mutations in p53 come later still, as they are rare in polyps but com- 
mon in carcinomas (Figure 20-48). As discussed earlier, loss of p53 function 
allows cancer cells to accumulate additional mutations and to avoid apoptosis 
and cell-cycle arrest. 

In summary, although the exact set of mutations will vary from one colorec- 
tal cancer to the next, colorectal carcinogenesis requires certain genetic alter- 
ations that disable particular control mechanisms, and the order in which these 
alterations occur is not random (Figure 20-49). 
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Figure 20-48 Suggested typical 
sequence of genetic changes 
underlying the development of a 
colorectal carcinoma. This over- 
simplified diagram provides a general 
idea of the way mutation and tumor 
development can fit together. But there 
are certainly other mutant genes of 
which we are not yet aware, and different 
colon cancers can progress through 
different sequences of mutations. 
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Figure 20-49 The general nature of tumor 
progression at the molecular level. Each 
row in this diagram represents a 
biochemical pathway, so that changing any 
one protein (rectangular box) by altering its 
gene is often enough to gain the full effect. 
The three paths (colored vertical lines) have 
been drawn to indicate the mutations that 
one evolving clone of tumor cells 
accumulates as it progresses toward 
cancer. This diagram is modeled after some 
of the known events in the development of 
colon cancer, but it should not be taken 
literally. The major points are that, while 
many different genes will be altered in 
individual colon tumors as they develop to 
metastasis, so that tumors are not 
expected to be exactly alike, there are 
some common patterns. In particular, both 
the signaling pathways that are altered and 
the general order in which these changes 
arise are often similar for different cancers. 
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Each Case of Cancer Is Characterized by Its Own Array of Genetic 
Lesions 

As illustrated by colorectal cancer, the traditional classification of cancers is sim- 
plistic. Even a single type of cancer is a heterogeneous collection of disorders; 
despite some common features, each disorder will be characterized by its own 
array of genetic lesions. In the form of lung cancer known as small-cell carci- 
noma, for example, one finds mutations in p53, Rb, Myc, and at least five other 
known proto-oncogenes and tumor suppressor genes. Different patients have 
different combinations of mutations, and these genetic differences often corre- 
spond to different responses to treatment. 

In principle, molecular biology provides the tools to find out precisely which 
genes are amplified, deleted, mutated, or misregulated by epigenetic mechanisms 
in the tumor cells of any given patient. As we discuss next, such information may 
soon prove to be as important for the diagnosis and treatment of cancer as was the 
identification of microorganisms for patients with infectious diseases. 



Summary 

Studies of developing embryos and genetically engineered mice have helped to reveal 
the functions of a large number of cancer-critical genes. Many of the oncogenes and 
tumor suppressor genes mutated in cancer code for components of the social control 
pathways that regulate when cells grow, divide, differentiate, or die. Other genes, 
loosely categorized as "DNA maintenance genes" help maintain the integrity of the 
genome. The molecular changes that allow cancers to metastasize are still largely 
unknown. 

DNA tumor viruses such as papillomaviruses can promote the development of 
cancer by encoding proteins that inhibit the products of some tumor suppressor genes. 
The viral proteins bind and inhibit the Rb protein, which normally acts as a brake on 
cell division, and the p53 protein, which normally induces cell-cycle arrest or apopto- 
sis in response to DNA damage and other cell stresses. Loss or inactivation of the p53 
pathway, which occurs in nearly all spontaneous human cancers, is especially danger- 
ous, as it allows genetically damaged cells to escape apoptosis and continue to prolif- 
erate. Inactivation of the Rb pathway also occurs in most human cancers, illustrating 
how fundamental each of these pathways is in protecting us against cancer. 

We can often correlate the steps of tumor progression with mutations that activate 
specific oncogenes and inactivate specific tumor suppressor genes, with colon cancer 
providing the best-understood example. But different combinations of mutations and 
epigenetic changes are found in different types of cancer and even in different patients 
with the same type of cancer, reflecting the random way in which these inherited 
changes occur. Nevertheless, many of the same types of changes are encountered 
repeatedly, suggesting that there is only a limited number of ways in which our 
defenses against cancer can be breached. 



CANCER TREATMENT: PRESENT AND FUTURE 

We can apply our growing understanding of the biology of cancer to sharpen our 
attack on the disease at three levels: prevention, diagnosis, and treatment. Pre- 
vention is always better than cure, and, as discussed earlier, many cancers can 
be prevented, especially by avoiding smoking. Moreover, cancers can often be 
nipped in the bud by screening for small primary tumors, which can be removed 
before they have metastasized, as we saw for cervical cancers. Highly sensitive 
molecular assays promise to create new opportunities for better prevention and 
treatment through earlier and more precise diagnosis. Advances in screening 
methods along with changes in lifestyle probably offer the most immediate 
prospects for reducing the cancer death rate, but they can never be perfectly 
effective. Thus, for many years to come, full-blown malignant disease will con- 
tinue to be common and cancer treatments will continue to be needed. 
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The Search for Cancer Cures Is Difficult but Not Hopeless 

The difficulty of curing a cancer is similar to the difficulty of getting rid of weeds. 
Cancer cells can be removed surgically or destroyed with toxic chemicals or radi- 
ation, but it is hard to eradicate every single one of them. Surgery can rarely fer- 
ret out every metastasis, and treatments that kill cancer cells are generally toxic 
to normal cells as well. In addition, as described earlier, whereas the great major- 
ity of the cancer cells can often be killed by irradiation or chemotherapy, the 
small population of slowly dividing cancer stem cells may be harder to eliminate 
in these ways; if even a few cancer stem cells remain, they can regenerate the 
tumor. Moreover, unlike normal cells, cancer cells mutate rapidly and will often 
evolve resistance to the poisons and irradiation used against them. 

In spite of these difficulties, effective cures using anticancer drugs (alone or 
in combination with other treatments) have already been found for some for- 
merly highly lethal cancers, including Hodgkin's lymphoma, testicular cancer, 
choriocarcinoma, and some leukemias and other cancers of childhood. Even for 
types of cancer where a cure at present seems beyond our reach, there are treat- 
ments that will prolong life or at least relieve distress. But what prospect is there 
of doing better and finding cures for the most common forms of cancer, which 
still cause great suffering and so many tragic deaths? 

Traditional Therapies Exploit the Genetic Instability and Loss of 
Cell-Cycle Checkpoint Responses in Cancer Cells 

Anticancer therapies need to take advantage of some molecular abnormality of 
cancer cells that distinguishes them from normal cells. One such property is 
genetic instability, caused by an abnormality in chromosome maintenance, cell 
cycle checkpoints, or DNA repair. Remarkably, most current cancer therapies 
work by exploiting these abnormalities, although this was not known by the sci- 
entists who developed the treatments. Most anticancer drugs and ionizing radi- 
ation damage DNA. They preferentially kill certain kinds of cancer cells because 
these mutant cells have a diminished ability to survive the damage. Normal cells 
treated with radiation, for example, arrest their cell cycle until they have 
repaired the damage to their DNA. This cell-cycle arrest is an example of a cell- 
cycle checkpoint response discussed in Chapter 17. Cancer cells generally have 
defects in many of these checkpoint responses and often continue to divide after 
irradiation; this causes many to die after a few days because of the severe genetic 
damage they experience. 

Unfortunately, while some of the molecular defects present in cancer cells 
enhance their sensitivity to cytotoxic agents, others increase their resistance. 
Some of the cell death that DNA damage induces, for example, occurs by apop- 
tosis, and cancer cells often acquire defects in the control systems that activate 
apoptosis in response to such damage. For example, as we discussed earlier, DNA 
damage induced by anticancer drugs or irradiation normally activates p53, which 
can then trigger apoptosis. Thus the inactivation of the p53 pathway that occurs 
in most cancers makes certain types of cancer cells less sensitive to these agents. 
Cancer cells vary widely in their response to various treatments, probably reflect- 
ing the particular kinds of defects they have in DNA repair, cell-cycle check- 
points, and the control of apoptosis. 

New Drugs Can Exploit the Specific Cause of a Tumor's Genetic 
Instability 

As we become increasingly able to pinpoint the specific alterations in cancer 
cells that make them different from their normal neighbors, we can use this 
knowledge to design therapies that kill the cells in the cancer without killing 
normal cells. One of the characteristics of cancer cells is their genetic instabil- 
ity; as we have explained, this is one of the features that helps them to evolve 
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and proliferate dangerously. But it is at the same time a defect — a vulnerability 
that we can exploit to kill them. 

We have seen that cancer cells are forced to walk a difficult line as they 
evolve toward metastasis: they need to have a defect in their DNA maintenance 
processes that is severe enough to allow them to accumulate new mutations at 
a significantly increased rate, but not so severe that they kill themselves by fre- 
quent loss or genes necessary for cell survival. Since there are hundreds of dif- 
ferent genes required to maintain DNA sequences and chromosome structures 
with high fidelity (discussed in Chapters 4 and 5), we would expect there to be at 
least dozens of different ways for a particular tumor cell to acquire its genetic 
instability. Moreover, these ways should be mutually exclusive: once a cell has 
become genetically unstable to a modest extent, it is likely to increase its risk of 
death if it inactivates additional DNA maintenance genes. Those cells that do so 
will die and be lost from the tumor population. 

Detailed studies of the mechanisms for DNA maintenance discussed in 
Chapter 5 reveal a surprising amount of apparent redundancy, with multiple 
pathways for repairing each type of DNA damage. Thus, knocking out a particu- 
lar pathway for DNA repair is generally less disastrous than one might expect, 
because alternate repair pathways exist. We have seen, for example, how stalled 
DNA replication forks can arise when the fork encounters a single-strand break 
in a template strand, and how cells avoid the disaster that would otherwise 
result. First, they have machinery to escape the problem by directly repairing 
single-strand breaks; and then, if that fails, they can repair stalled forks by 
homologous recombination (see Figure 5-53). Suppose that the cells in a partic- 
ular cancer have become genetically unstable by acquiring a mutation that 
reduces their ability to repair stalled forks by homologous recombination. Might 
it be possible to eradicate that cancer by treating it with a drug that inhibits the 
repair of single-strand breaks, thereby greatly increasing the number of forks 
that stall? The consequences would be expected to be harmless for normal cells, 
which can repair stalled forks, but lethal for the cancer cells, which cannot. 

While such a possibility might seem too good to be true, precisely this strat- 
egy appears to work to kill the cells in cancers that have inactivated both of their 
Brcal or Brca2 tumor suppressor genes. As described in Chapter 5, Brca2 is an 
accessory protein that interacts with the Rad51 protein (the RecA analog in 
humans) in the initiation of general recombination events. Brca2 is another pro- 
tein that is also required for this repair process. Like Kb, Brcal and Brca2 were 
discovered as mutations that predispose humans to cancer — in this case, can- 
cers of the breast and ovaries (though unlike Rb f they seem to be involved in only 
a small proportion of such cancers). Individuals who inherit one mutant copy of 
Brcal or Brca2 develop tumors that have inactivated the second copy of the 
same gene, presumably because this change makes the cells genetically unsta- 
ble and speeds tumor progression. 

Drugs that inhibit an enzyme called PARP (poly ADP-ribose polymerase) 
have a dramatic effect on cells of such tumors, killing them with enormous 
selectivity. This is attributed to the fact that PARP is required for the repair of sin- 
gle-strand breaks in DNA. Perhaps surprisingly, PARP inhibition has very little 
effect on normal cells; in fact, mice that have been engineered to lack PARP1 — 
the major PARP family member involved in DNA repair — remain healthy under 
laboratory conditions. This result suggests that, while the repair pathway requir- 
ing PARP provides a first line of defense against breaks in a DNA strand, these 
breaks can easily be repaired by a genetic recombination pathway in normal 
cells (Figure 20-50). In contrast, tumor cells that have acquired their genetic 
instability by the loss of Brcal or Brca2 have lost this second line of defense, and 
they are therefore uniquely sensitive to PARP inhibitors (that is, they are missing 
repair pathway 2 in Figure 20-50). 

PARP inhibition is still only under trial as a cancer treatment in humans, and 
is likely to be applicable to only a small proportion of cancer cases. But it exem- 
plifies the type of rational, highly selective approach to cancer therapy that is 
beginning to be possible, and thus it offers hope for many other cancers. To 
extend this approach widely, we will need new tools for determining the precise 
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Figure 20-50 How a tumor's genetic 
instability can be exploited for cancer 
therapy. As explained in Chapter 5, the 
maintenance of DNA sequences is so 
critical for life that cells have evolved 
multiple pathways for repairing DNA 
damage and avoiding DNA replication 
errors. As illustrated, a replication fork will 
stall whenever it encounters a break in a 
DNA template strand. In this example, 
normal cells have two different repair 
pathways that can fix the problem and 
thereby prevent a mutation from arising 
in the newly synthesized DNA sequences. 
They are therefore not harmed by 
treatment with a drug that blocks repair 
pathway 1 . In contrast, the inactivation of 
repair pathway 2 was selected for in the 
evolution of the tumor cells (because it 
made them genetically unstable). 
Consequently, only the tumor cells are 
killed by a drug treatment that blocks 
repair pathway 1. If a treated cell does 
not die by apoptosis, its daughters will 
die because they inherit fragmented, 
incomplete chromosome sets. 

In the actual case that underlies this 
more general schematic example, the 
function of repair pathway 1 (requiring 
the PARP protein discussed in the text) is 
to remove accidental breaks that occur in 
a DNA single strand before they are 
encountered by a moving replication 
fork, and pathway 2 is the recombination- 
dependent process for repairing stalled 
replication forks illustrated in Figure 5-53 
(requiring the Brca2 and Brcal proteins; 
for details, see H.E. Bryant et al., Nature 
434:91 3-91 6, 2005 and H. Farmer et al., 
Afaftire434:917-921 f 2005.) 



cause of the genetic instability in individual tumors, as well as the development 
of many more drugs that target alternate DNA repair pathways. 



Genetic Instability Helps Cancers Become Progressively More 
Resistant to Therapies 

Genetic instability itself can be both good and bad for anticancer therapy. 
Although it seems to provide an Achilles' heel that therapies can exploit, it can 
also make eradicating cancer more difficult. An abnormally high mutation rate 
tends to make cancer cell populations heterogeneous, which may make it diffi- 
cult to kill off the entire population with a single type of treatment. Moreover, it 
allows many cancers to evolve resistance to therapeutic drugs at an alarming 
rate. 

To make matters worse, cells that are exposed to one anticancer drug often 
develop a resistance not only to that drug but also to other drugs to which they 
have never been exposed. This phenomenon of multidrug resistance frequently 
correlates with amplification of a part of the genome that contains a gene called 
Mdrl. This gene encodes a plasma-membrane-bound transport ATPase of the 
ABC transporter superfamily (discussed in Chapter 11), which pumps lipophilic 
drugs out of the cell. The overproduction of this protein (or some of its other 
family members) by a cancer cell can prevent the intracellular accumulation of 
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many cytotoxic drugs, making the cell insensitive to them. The amplification of 
other types of genes can likewise give the cancer cell a selective advantage. The 
gene that encodes the enzyme dihydrofolate reductase (DHFR), for example, 
can become amplified in cancer cells treated with the anticancer drug 
methotrexate. Methotrexate binds to and inhibits the ability of DHFR to bind 
folic acid, and the amplification greatly increases the amount of enzyme, reduc- 
ing the cells' sensitivity to the drug. 



New Therapies Are Emerging from Our Knowledge of Cancer 
Biology 

Our growing understanding of cancer cell biology and tumor progression is 
gradually leading to better methods for treating the disease, and not only by tar- 
geting defects in cell-cycle checkpoints and DNA repair processes. As an exam- 
ple, estrogen receptor antagonists (such as tamoxifen) and drugs that block 
estrogen synthesis are widely used to prevent or delay recurrence of breast can- 
cers that have been screened and found to express estrogen receptors. These 
antiestrogen treatments are also being tested for their ability to prevent the 
development of new breast cancers. These drugs do not directly kill the tumor 
cells, but instead prevent estrogen from promoting their proliferation. 

The greatest hopes lie, however, in finding more powerful and selective ways 
to exterminate cancer cells directly A variety of adventurous new ways to attack 
tumor cells have been shown to work in model systems — typically reducing or 
preventing the growth of human tumors transplanted into immunodeficient 
mice. Many of these strategies will turn out to be of no medical use, because they 
do not work in humans, have bad side effects, or are simply too difficult to 
implement. But some have turned out to be highly successful in the clinic. One 
strategy depends on the reliance of some cancer cells on a particular hyperac- 
tive protein they produce, a phenomenon known as oncogene addiction. Block- 
ing the activity of the protein may be an effective means of treating the cancer if 
it does not unduly damage normal tissues. About 25% of breast cancers, for 
example, express unusually high levels of the Her2 protein, a receptor tyrosine 
kinase related to the EGF receptor that plays a part in the normal development 
of mammary epithelium. Monoclonal antibodies that inhibit Her2 function slow 
the growth of breast tumors in humans that overexpress Her2, and they are now 
an approved therapy for these cancers. 

A similar approach uses antibodies to deliver toxic molecules directly to the 
cancer cells. Antibodies against proteins like Her2 that are abundant on the can- 
cer cell surface can be armed with a toxin or made to carry an enzyme that 
cleaves a harmless 'prodrug' and converts this into a toxic molecule. In the latter 
case, one molecule of enzyme can then generate a large number of toxic 
molecules on the surface of a tumor cell; these molecules can also diffuse to 
neighboring tumor cells, increasing the odds that they too will be killed, even if 
the antibody did not bind to them directly. 

Antibodies are hard to make in large amounts, are very expensive to produce 
and buy, and must be given by injection. The ultimate goal in cancer therapy is 
to develop small molecules that kill cancer cells specifically. The PARP inhibitors 
discussed above are one example; but to tackle the majority of cancers with sim- 
ple drug treatments, we shall need a large collection of different small 
molecules, tailored to the many different types of cancer. 



Small Molecules Can Be Designed to Inhibit Specific Oncogenic 
Proteins 

As our knowledge of specific molecules involved in the genesis of particular 
cancers builds, there is an increasing effort to devise targeted therapies 
directed against the oncogenic proteins that are essential for a cancer cell to 
survive and proliferate — thereby exploiting the phenomenon of oncogene 



CANCER TREATMENT: PRESENT AND FUTURE 



addiction mentioned previously. A particular dramatic success of this type has 
raised high hopes for the general utility of such targeted therapies in the future. 

As we saw earlier, chronic myelogenous leukemia (CML) is usually associ- 
ated with a particular chromosomal translocation, visible as the Philadelphia 
chromosome (see Figure 20-5). This is the consequence of chromosome break- 
age and rejoining at the sites of two specific genes, called Ab/ and Bcr. The fusion 
of these genes creates a hybrid gene that codes for a chimeric protein called Bcr- 
Abl, consisting of the N-terminal fragment of Bcr fused to the C-terminal portion 
of Abl (Figure 20-51). Abl is a tyrosine kinase involved in cell signaling. The sub- 
stitution of the Bcr fragment for the normal N-terminus of Abl makes it hyper- 
active, so that it stimulates inappropriate proliferation of the hemopoietic pre- 
cursor cells that contain it and prevents these cells from dying by apoptosis — 
which many of them would normally do. As a result, excessive numbers of white 
blood cells accumulate in the bloodstream, producing CML. 

The chimeric Bcr-Abl protein is an obvious target for therapeutic attack. 
Searches for synthetic drug molecules that can inhibit the activity of tyrosine 
kinases discovered one, called Gleevec, that blocks Bcr-Abl (Figure 20-52). 
When the drug was first given to patients with CML, nearly all of them showed a 
dramatic response, with an apparent disappearance of the cells carrying the 
Philadelphia chromosome in over 80% of patients. The response appears rela- 
tively durable: after years of continuous treatment, many patients have not pro- 
gressed to later stages of the disease — although Gleevec-resistant cancers 
emerge with a probability of at least 5% per year. 

Results are not so good for those patients who have already progressed to 
the acute phase of myeloid leukemia, known as blast crisis, where genetic insta- 
bility has set in and the march of the disease is far more rapid. These patients 
show a response at first and then relapse because the cancer cells develop a 
resistance to Gleevec. This resistance is usually associated with secondary muta- 
tions in the part of the Bcr-Abl gene that encodes the kinase domain, disrupting 
the ability of Gleevec to bind to Bcr-Abl kinase. Second-generation inhibitors 
that function effectively against these Gleevec-resistant mutants have now been 
developed. Ultimately a cocktail of multiple agents that cooperatively block Bcr- 
Abl action may provide the key to successful treatment, by preventing the selec- 
tion for drug- resistant cancer cells at all stages of the disease. 

Despite the complications with resistance, the extraordinary success of 
Gleevec for the patients in the chronic (early) stage of the disease is enough to 
prove the principle: once we understand precisely what genetic lesions have 
occurred in a cancer, we can begin to design effective rational methods to treat 
it. This success story has fueled efforts to identify small-molecule inhibitors for 
other oncogenic protein kinases that could be effective targets for new anti- 
cancer drugs. A second example of such a therapy is provided by a small- 
molecule inhibitor of the EGF receptor, currently approved for the treatment of 
some lung cancers. 
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Figure 20-51 The conversion of the Abl 
proto-oncogene into an oncogene in 
patients with chronic myelogenous 
leukemia. The chromosome translocation 
responsible joins the Bcr gene on 
chromosome 22 to the Abl gene from 
chromosome 9, thereby generating a 
Philadelphia chromosome (see Figure 
20-5). The resulting fusion protein has the 
N-terminus of the Bcr protein joined to the 
C-terminus of the Abl tyrosine protein 
kinase; in consequence, the Abl kinase 
domain becomes inappropriately active, 
driving excessive proliferation of a clone of 
hemopoietic cells in the bone marrow. 
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Figure 20-52 How Gleevec blocks the activity of Bcr-Abl protein and halts chronic myeloid leukemia. 

(A) The chemical structure of Gleevec (imatinib). The drug can be given by mouth; it has side effects but 
they are usually quite tolerable. (B) The structure of the complex of Gleevec (solid green object) with the 
tyrosine kinase domain of the Abl protein (ribbon diagram), as determined by X-ray crystallography. 
(C) Gleevec sits in the ATP-binding pocket of the tyrosine kinase domain of Bcr-Abl and thereby prevents 
Bcr-Abl from transferring a phosphate group from ATP onto a tyrosine residue in a substrate protein. This 
blocks onward transmission of a signal for cell proliferation and survival. (B, from T. Schindler et al., Science 
289:1938-1942, 2000. With permission from AAAS.) 



Tumor Blood Vessels Are Logical Targets for Cancer Therapy 

Another approach to destroying tumors does not directly target cancer cells at 
all but instead targets the blood vessels on which large tumors depend. As dis- 
cussed earlier, the growth of these vessels requires angiogenic signals such as 
VEGF, which, in animal models at least, can be blocked to prevent further tumor 
growth. Moreover, endothelial cells in the process of forming new vessels 
express distinctive cell-surface markers, which might provide an opportunity to 
attack developing tumor vessels specifically, without harming existing blood 
vessels in normal tissues. Clinical trials with various inhibitors of angiogenic sig- 
nals are now taking place, and several drugs that inhibit the VEGF receptor have 
recently been approved for treating kidney cancer. Similarly, a monoclonal anti- 
body against VEGF has been approved for treatment of colon cancer in combi- 
nation with chemotherapy, even though the added benefit is only modest. 



Many Cancers May Be Treatable by Enhancing the Immune 
Response Against a Specific Tumor 

In recent years, we have realized that the body's normal immune responses help 
to protect us against cancer. Mice lacking important parts of their immune sys- 
tem have elevated levels of several types of cancer. Similarly, humans who are 
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immunocompromised have two- to three-fold higher rates of certain solid 
tumors. The self-antigens that are recognized in these cases may attract the 
attention of the immune system because they are expressed in unusually large 
amounts in the particular tumor cells. Alternatively it is possible that these pro- 
teins provoke an autoimmune response because they are normally expressed 
only in embryos or in another immunologically privileged area such as the 
brain, so that the immune system has never had the opportunity to develop tol- 
erance towards them. 

Variant tumor cells with a reduced exposure to immune attack can defeat 
tumor immunosurveillance, and the process of tumor progression is thought to 
include heritable changes that reduce the antigenicity of the tumor. However, 
through our increased understanding of the complex mechanisms of both adap- 
tive and innate immunity (see Chapters 24 and 25), oncologists are learning how 
to manipulate the immune response to intensify the attack on specific tumors. 
The hope is that this will allow these cancer cells to be eliminated as if they were 
foreign tissue. 



Treating Patients with Several Drugs Simultaneously Has 
Potential Advantages for Cancer Therapy 

As already described, some cancers can be dramatically reduced in size by an 
initial drug treatment, with all of the detectable tumor cells seeming to disap- 
pear. But months or years later the cancer will reappear in an altered form that 
is resistant to the drug that was at first so successful. Given the hypermutable 
nature of tumor cells, this is not surprising. The initial drug treatment has pre- 
sumably failed to affect a tiny fraction of mutant cells in the original tumor cell 
population (some of the cancer stem cells, according to that view of tumors), 
and these cells eventually recreate the tumor by continuing to proliferate. 

In some cases, this problem can be prevented by treating patients with two 
or more drugs simultaneously. The logic is the same as that behind the current 
treatment of HIV-AIDS with a cocktail of three different protease inhibitors: 
whereas there may always be some cells in the initial populations resistant to 
any one drug treatment, there should be no cell resistant to two drugs that are 
different. In contrast, sequential drug treatments can allow the few cells resis- 
tant to the first drug to multiply to large numbers, thereby making it likely that 
cells with double resistance will arise (Figure 20-53). 

As our armament of possible drugs against cancer cells increases, it 
becomes increasingly likely that effective treatments involving simultaneous 
treatment with multiple drugs can be designed for many cancers. 
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Figure 20-53 Why multidrug treatments 
may be more effective than sequential 
treatments for cancer therapy. Because 
tumor cells are hypermutable, single 
drug treatments that are given 
sequentially often allow for the selection 
of mutant cell clones that are resistant to 
both drugs. 
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Gene Expression Profiling Can Help Classify Cancers into 
Clinically Meaningful Subgroups 

It is now possible to characterize each individual cancer at a molecular level in 
unprecedented detail For example, DNA microarray technology (discussed in 
Chapter 8) can be used to determine the pattern of expression of thousands of 
genes simultaneously in a single sample of a cancer and compare it with the pat- 
tern of expression of the same genes in normal control tissue. Each case of a 
given type of cancer, such as breast cancer, will have its own gene expression 
profile. However, comparing the profiles of many patients shows that they can 
be grouped into a small number of distinct classes, whose members share com- 
mon profile features. 

Why might such molecular profiles be useful? All medical progress depends 
on accurate diagnosis. If we cannot identify a disease correctly, we cannot dis- 
cover its causes, predict its outcome, select the appropriate treatment for a given 
patient, or conduct effective trials on a population of patients to judge whether 
a treatment is effective. Cancers, as we have seen, are an extraordinarily hetero- 
geneous collection of diseases. Gene expression profiling represents a tool to 
make diagnosis more precise through better tumor classification. Standard clas- 
sification schemes mainly rely on histological analysis and we now know that 
they can lump together cancers that behave differently. Thus, gene expression 
profiling can not only help to identify a tumor in uncertain cases, but also help 
to classify tumors more precisly, with important consequences for the patient. 
Analysis of a collection of diffuse large B-cell lymphomas, for example, demon- 
strated that there are two separate classes based on gene expression profiles, 
even though the classes could not be distinguished histologically. One class was 
associated with a poor prognosis and the other with a better prognosis, estab- 
lishing the usefulness of the classification and providing an explanation for the 
previous observation that some patients with this disease respond effectively to 
therapy and survive, whereas others do not. Gene expression profiling is still 
being developed for most cancers, but it is likely to find widespread use in the 
future for guiding treatments. 



There Is Still Much More to Do 

The molecular analysis of cancer promises to transform cancer treatment by 
enabling us to tailor therapy much more accurately to the individual patient. 
The discovery of a large number of cancer- critical genes brought an end to an 
era of groping in the dark for clues to the molecular basis of cancer. It has been 
encouraging to find that there are, after all, some general principles and that 
many forms of the disease share some key genetic abnormalities. Because we 
know the identities of many cancer-critical genes and their normal functions, it 
is beginning to be possible to devise precisely targeted, rational treatments. But 
we are still far from fully understanding the most common human cancers, and 
we still need a much better understanding of nearly all of the processes 
described in this chapter before we can conquer this lethal cell-biological dis- 
ease. Among other things, we need better ways to define cancer stem cells and 
isolate them, so that we can determine their special characteristics and the 
molecular basis of their "sternness;" most crucially, we need to find ways to kill 
them — otherwise, our therapies will ultimately fail. 

Cancer survivors and their families have long been important advocates for 
public investment in basic biomedical research. These groups occasionally 
express understandable frustration with the slow pace at which our greatly 
increased knowledge of cancer has been translated into effective cancer thera- 
pies. But the accumulation of knowledge brings an accelerating improvement in 
understanding, and there is a real sense in the cancer research community today 
that major progress in the treatment of this group of diseases is imminent. To 
take one example, in recent years we have obtained a much deeper understand- 
ing of the detailed molecular processes that underlie the all-or-none decision 
that cells make to commit suicide through apoptosis. How long will it be before 
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we can analyze the abnormal signaling network in an individual tumor well 
enough to allow us to select a tailor-made cocktail of drugs and growth factors 
that specifically causes all cells in that cancer to die? 

Looking back on the history of cell biology and contemplating the speed of 
recent progress, we can be hopeful. The desire to understand, which drives basic 
research, will surely reveal new ways to use our knowledge of the cell for human- 
itarian goals, not only in relation to cancer, but also with regard to infectious dis- 
ease, mental illness, agriculture, and other areas that we can scarcely foresee. 



Summary 

Our growing understanding of the cell biology of cancers has already begun to lead to 
better ways of preventing, diagnosing and treating these diseases. Anticancer therapies 
can be designed to destroy cancer cells preferentially by exploiting the properties that 
distinguish them from normal cells, including their dependence on oncogenic proteins 
and the defects they harbor in their DNA repair mechanisms, cell- cycle checkpoint 
mechanisms, and apoptosis control pathways. It may also be possible to control the 
growth of tumors by attacking their blood supply and depriving them of the help they 
require from stromal cells. We now have proof that, by understanding normal cell con- 
trol mechanisms and exactly how they are subverted in specific cancers, we can devise 
drugs to kill cancers precisely by attacking specific molecules critical for the growth 
and survival of the cancer cells. And, as we become better able to determine which 
genes are altered in the cells of any given tumor, we can begin to tailor treatments more 
accurately to each individual patient. 



PROBLEMS 



Which statements are true? Explain why or why not. 

20-1 Genetic instability in the form of point mutations, 
chromosome rearrangements, and epigenetic changes 
needs to be maximal to allow the development of cancer. 

20-2 Cancer therapies directed solely at killing the rapidly 
dividing cells that make up the bulk of a tumor are unlikely 
to eliminate the cancer from many patients. 

20-3 The main environmental causes of cancer are the 
products of our highly industrialized way of life such as pol- 
lution and food additives. 

20-4 Dimethylbenz [a] anthracene (DMBA) must be an 
extraordinarily specific mutagen since 90% of the skin 
tumors it causes have an A-to-T alteration at exactly the 
same site in the mutant Ras gene. 



20-5 In the cellular regulatory 
pathways that control cell growth 
and proliferation, the products of 
oncogenes are stimulatory compo- 
nents and the products of tumor 
suppressor genes are inhibitory 
components. 

Discuss the following problems. 

20-6 In contrast to colon cancer, 
whose incidence increases dra- 
matically with age, osteosar- 
coma — a tumor that occurs most 
commonly in the long bones — 
peaks during adolescence. Osteo- 



colon 
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(A) 



32 40 50 
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sarcomas are relatively rare in young children (up to age 9) 
and in adults (over 20). Why do you suppose that the inci- 
dence of osteosarcoma does not show the same sort of age- 
dependence as colon cancer? 

20-7 As shown in Figure Q20-1, plots of deaths due to 
breast cancer and cervical cancer in women differ dramati- 
cally from the same plot for colon cancer. At around age 50 
the age- dependent increase in death rates for breast and cer- 
vical cancer slows markedly, whereas death rates due to 
colon cancer (and most other cancers) continue to increase. 
Why do you suppose that the age-dependent increase in 
death rates for breast and cervical cancer slows after age 50? 

20-8 By analogy with automobiles, defects in cancer-criti- 
cal genes have been likened to broken brakes and stuck 
accelerators, which are caused in some cases through faulty 
service by bad mechanics. Using this analogy, decide how 
oncogenes, tumor suppressor genes, and DNA maintenance 
genes relate to broken brakes, stuck accelerators, and bad 
mechanics. Explain the basis for each of your choices. 
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Figure Q20-1 Cancer death rates as a function of age (Problem 20-7). (A) Death rates for colon 
cancer in females. (B) Death rates for breast cancer in females. (C) Death rates for cervical cancer. 
The data in all cases are plotted as log of the death rate versus the patient age (on a log scale) at 
death. The straight lines in B and C are fitted to the data for the earlier age groups, whereas the 
line in A is fitted to all the data points. (Data from P. Armitage and R. Doll, fir. J. Cancer 
91:1983-1989, 2004. With permission from Macmillan Publishers Ltd.) 
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Figure Q20-2 Cumulative 
risk of lung cancer mortality 
for nonsmokers, smokers, 
and former smokers 
(Problem 20-9). Cumulative 
risk is the running total of 
deaths, as a percentage, for 
each group. Thus, for 
continuing smokers, 1% died 
of lung cancer between ages 
45 and 55; an additional 4% 
died between 55 and 65 
(giving a cumulative risk of 
5%); and 11% more died 
between 65 and 75 (for a 
cumulative risk of 16%). 
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20-9 Mortality due to lung cancer was followed in groups 
of males in the United Kingdom for 50 years. Figure Q20-2 
shows the cumulative risk of dying from lung cancer as a 
function of age and smoking habits for four groups of males: 
those who never smoked, those who stopped at age 30, 
those who stopped at age 50, and those who continued to 
smoke. These data show clearly that an individual can sub- 
stantially reduce his cumulative risk of dying from lung can- 
cer by stopping smoking. What do you suppose is the bio- 
logical basis for this observation? 

20-1 The Tasmanian devil, a carnivorous Australian marsu- 
pial, is threatened with extinction by the spread of a fatal dis- 
ease in which a malignant oral-facial tumor interferes with 
the animal's ability to feed. You have been called in to analyze 
the source of this unusual cancer. It seems clear to you that 
the cancer is somehow spread from devil to devil, very likely 
by their frequent fighting, which is accompanied by biting 
around the face and mouth. To uncover the source of the 
cancer, you isolate tumors from 1 1 devils captured in widely 
separated regions and examine them. As might be expected, 
the karyotypes of the tumor cells are highly rearranged rela- 
tive to that of the wild-type devil (Figure Q20-3). Surpris- 
ingly, you find that the karyotypes from all 1 1 tumor samples 
are very similar. Moreover, one of the Tasmanian devils has 
an inversion on chromosome 5 that is not present in its facial 
tumor. How do you suppose this cancer is transmitted from 
devil to devil? Is it likely to arise as a consequence of an infec- 
tion by a virus or microorganism? Explain your reasoning. 

20-11 Now that DNA sequencing is so inexpensive, reliable, 
and fast, your mentor has set up a consortium of investiga- 
tors to pursue the ambitious goal of tracking down all the 
mutations in a set of human tumors. He has decided to 
focus on breast cancer and colorectal cancer because they 
cause 14% of all cancer deaths. For each of 1 1 breast cancers 
and 11 colorectal cancers, you design primers to amplify 
120,839 exons in 14,661 transcripts from 13,023 genes. As 
controls, you amplify the same regions from DNA samples 
taken from two normal individuals. You sequence the PCR 
products and use analytical software to compare the 456 Mb 
of tumor sequence with the published human genome 
sequence. You are astounded to find 816,986 putative muta- 
tions. This represents more than 37,000 mutations per 
tumor! Surely that can't be right. 

Once you think about it for a while, you realize the computer 
sometimes makes mistakes in calling bases. To test for that 
source of error, you visually inspect every sequencing read and 



find that you can exclude 353,738 changes, leaving you with 
463,248, or about 21,000 mutations per tumor. Still a lot! 

A. Can you suggest at least three other sources of apparent 
mutations that do not actually contribute to the tumor? 

B. After applying a number of criteria to filter out irrele- 
vant sequence changes, you find a total of 1307 mutations in 
the 22 breast and colorectal cancers, or about 59 mutations 
per tumor. How might you go about deciding which of these 
sequence changes are likely to be cancer-related mutations 
and which are probably "passenger" mutations that 
occurred in genes with nothing to do with cancer (but were 
found in the tumors because they happened to occur in the 
same cells with true cancer mutations)? 

C. Will your comprehensive sequencing strategy detect all 
possible genetic changes that affect the targeted genes in 
the cancer cells? 

20-12 Virtually all cancer treatments are designed to kill 
cancer cells, usually by inducing apoptosis. However, one 
particular cancer — acute promyelocyte leukemia (APL) — 
has been successfully treated with all- rrans-retinoic acid, 
which causes the promyelocytes to differentiate into neu- 
trophils. How might a change in the state of differentiation 
of APL cancer cells help the patient? 

20-1 3 One major goal of modern cancer therapy is to iden- 
tify small molecules — anticancer drugs — that can be used to 
inhibit the products of specific cancer- critical genes. If you 
were searching for such molecules, would you design 
inhibitors for the products of oncogenes, tumor suppressor 
genes, or DNA maintenance genes? Explain why you would 
(or would not) select each type of gene. 
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Figure Q20-3 Karyotypes of cells from Tasmanian devils (Problem 
20 1 0). (A) A Tasmanian devil. (B) Normal karyotype for a male 
Tasmanian devil. The karyotype has 14 chromosomes, including XY. 
(C) Karyotype of cancer cells found in each of the 1 1 facial tumors 
studied. The karyotype has 13 chromosomes, no sex chromosomes, 
no chromosome-2 pair, one chromosome 6, two chromosomes 1 
with deleted long arms, and four highly rearranged marker 
chromosomes (M1-M4). (From A.M. Pearse and K. Swift, Nature 
439:549, 2006. With permission from Macmillan Publishers Ltd.) 
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Sexual Reproduction: Meiosis, 
Germ Cells, and Fertilization 




Sex is not absolutely necessary. Single-celled organisms can reproduce by sim- 
ple mitotic division, and many plants propagate vegetatively by forming multi- 
cellular offshoots that later detach from the parent. Likewise, in the animal king- 
dom, a solitary multicellular Hydra can produce offspring by budding (Figure 
21-1), and sea anemones and marine worms can split into two half- organisms, 
each of which then regenerates its missing half. There are even some lizard 
species that consist only of females that reproduce without mating. Although 
such asexual reproduction is simple and direct, it gives rise to offspring that are 
genetically identical to their parent. Sexual reproduction, by contrast, mixes the 
genomes from two individuals to produce offspring that differ genetically from 
one another and from both parents. This mode of reproduction apparently has 
great advantages, as the vast majority of plants and animals have adopted it. 
Even many procaryotes and eucaryotes that normally reproduce asexually 
engage in occasional bouts of genetic exchange, thereby producing offspring 
with new combinations of genes. This chapter describes the cellular machinery 
of sexual reproduction. Before discussing in detail how the machinery works, 
however, we will briefly consider what sexual reproduction involves and what its 
benefits might be. 



OVERVIEW OF SEXUAL REPRODUCTION 

Sexual reproduction occurs in diploid organisms, in which each cell contains 
two sets of chromosomes, one inherited from each parent. The specialized cells 
that carry out sexual reproduction, however, are haploid; that is, they each con- 
tain only one set of chromosomes. In the final step of sexual reproduction, a 
haploid cell of one individual fuses with a haploid cell of another, mixing the two 
genomes and restoring the diploid state. Sexual reproduction, therefore, 
requires a specialized type of cell division called meiosis, in which a diploid pre- 
cursor cell gives rise to haploid progeny cells, rather than to diploid cells as 
occurs in ordinary mitotic cell division. 

In sexually reproducing multicellular organisms, the haploid cells produced 
by meiosis develop into highly specialized gametes — eggs (or ova), sperm (or 
spermatozoa), pollen, or spores. In animals, females typically produce large and 
nonmotile eggs, whereas males typically produce small and motile sperm (Fig- 
ure 21-2). At fertilization, a haploid sperm fuses with a haploid egg to form a 
diploid cell (a fertilized egg, or zygote), which contains a new combination of 
chromosomes. The zygote then develops into a new multicellular organism 
through repeated rounds of ordinary mitosis, followed by cell specialization, 
which includes the production of gametes (Figure 2 1-3A). 



The Haploid Phase in Higher Eucaryotes Is Brief 

In most organisms that reproduce sexually, diploid cells proliferate by mitotic 
cell division, and the haploid cells that form by meiosis do not proliferate. Some 
simple organisms, such as fission yeasts, are exceptional in that haploid cells 
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Figure 21-1 Photograph of a Hydra from 
which two new organisms are budding 
(arrows). The offspring, which are 
genetically identical to their parent, will 
eventually detach and live 
independently. (Courtesy of Amata 
Hornbruch.) 
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Figure 21-2 Scanning electron 
micrograph of an egg with many human 
sperm bound to its surface. Whereas the 
egg is immotile, the sperm are highly 
motile. Although many sperm are bound 
to the egg, only one will fertilize it, as we 
discuss later. (Courtesy of D. Phillips/ 
Science Photo Library.) 



proliferate by mitotic cell division, and the diploid cells formed by the fusion of 
haploid cells proceed directly to meiosis to produce new haploid cells (Figure 
21-3B). A less extreme exception occurs in plants, where both haploid and diploid 
cells proliferate. In all but the most primitive plants, such as mosses and ferns, 
however, the haploid phase is very brief and simple, while the diploid phase is 
extended into a long period of development and cell proliferation. 

For almost all animals, including all vertebrates, only the diploid cells prolif- 
erate: the haploid gametes exist only briefly, do not divide at all, and are highly 
specialized for sexual fusion. In these organisms, it is useful to distinguish 
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Figure 21-3 Haploid and diploid cells in 
the life cycles of some complex and 
simple eucaryotes. The haploid cells are 
shown in red and the diploid cells in blue. 

(A) Cells in most animals and plants 
usually proliferate in the diploid phase to 
form a multicellular organism; only the 
gametes (eggs and sperm in animals) are 
haploid, and they fuse at fertilization to 
form a diploid zygote, which develops 
into a new individual. The gametes 
develop from diploid germ-line cells 
(gray) in the gonads; all the rest of the 
cells in the organism are somatic cells. 

(B) In some simple eucaryotes such as 
fission yeast and the green alga 
Chlamydomonas, by contrast, the haploid 
cells proliferate, and the only diploid cell 
is the zygote, which exists transiently 
after mating. 
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between the cells of the germ line (or germ cells), which include gametes and 
their specified diploid precursor cells, and the somatic cells, which form the rest 
of the body and ultimately leave no progeny (see Figure 21-3A). In a sense, the 
somatic cells exist only to help the germ-line cells survive, develop, and transmit 
their DNA to the next generation. 



Meiosis Creates Genetic Diversity 

Sexually reproducing organisms inherit two full sets of chromosomes, one from 
each parent. Each set contains autosomes, which are common to all members of 
the species, and sex chromosomes, which are differently distributed according to 
the sex of the individual. Therefore, each diploid nucleus contains two closely 
similar versions of each autosome, plus a set of sex chromosomes appropriate to 
the sex of the individual. The two copies of each autosome, one from the mother 
and one from the father, are called homologous chromosomes, or homologs, 
and in most cells they maintain a separate existence as independent chromo- 
somes. During meiosis, however, each chromosome must communicate with its 
unique homologous partner by physically pairing and undergoing genetic 
recombination. This communication is essential to enable the homologs to seg- 
regate accurately into different daughter cells during meiosis. 

A crucial feature of meiosis is that it generates haploid cells that are geneti- 
cally different from one another and from the two haploid cells that formed the 
organism in the first place. The genetic differences arise by two mechanisms. 
First, an individual gamete contains either the maternal or paternal version of 
each chromosome; because the choice of maternal or paternal occurs indepen- 
dently and randomly for each pair of homologs, the original maternal and pater- 
nal chromosomes are reshuffled into novel combinations in the haploid cells. 
Second, although the maternal and paternal versions of each chromosome have 
similar DNA sequences, they are not identical, and they undergo genetic recom- 
bination during meiosis — a process called crossing-over (discussed in Chapter 5) 
to produce novel hybrid versions of each chromosome; thus, each chromosome 
in a gamete contains a unique mixture of genetic information from both par- 
ents. We discuss these two mechanisms in more detail later (see Figure 21-13). 



Sexual Reproduction Gives Organisms a Competitive Advantage 

The machinery of sexual reproduction is elaborate, and the resources spent on it 
are large (Figure 21-4) . What are its benefits, and why did it evolve? Sexually repro- 
ducing individuals produce varied offspring, whose varied genotypes are at least 
as likely to represent a change for the worse as a change for the better. Why, then, 
should they have a competitive advantage over individuals that breed true, by an 
asexual process? This question continues to perplex evolutionary biologists. 

One advantage of sexual reproduction seems to be that the reshuffling of 
genes helps a species to survive in an unpredictably variable environment. If two 
parents produce many offspring with a wide variety of gene combinations, the 
chance that at least one of their progeny will have the combination of features 
necessary for survival in a changing environment is increased. Indeed, a popu- 
lation of budding yeast genetically engineered so that it cannot undergo meiotic 
genetic recombination and therefore cannot reproduce sexually adapts much 
less well over time to harsh environmental conditions than does the wild-type 
population, which can reproduce sexually. 

Another advantage of sexual reproduction seems to be that it can help elim- 
inate deleterious genes from a population: females generally mate with the 
fittest males, so that the least fit males leave no progeny and serve only as a sort 
of genetic trashcan. This stringent selection among males means that "good" 
genes are transmitted and "bad" genes are lost from the population more effi- 
ciently than they would otherwise be. As a result, members of the sexually repro- 
ducing population are expected to have much higher average fitness than mem- 
bers of an equivalent population that reproduces asexually 




Figure 21 -4 A peacock displaying his 
elaborate tail. This extravagant plumage 
serves to attract females for the purpose 
of sexual reproduction. It has evolved 
because only the fittest and most 
handsome males leave progeny. 
(Courtesy of Cyril Laubscher.) 
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Whatever the benefits of sexual reproduction may be, it is striking that prac- 
tically all complex present-day organisms have evolved largely through genera- 
tions of sexual, rather than asexual, reproduction. Asexual organisms, although 
plentiful, seem mostly to have remained comparatively simple and primitive. 

We now turn to the cellular mechanisms of sex, beginning with the events of 
meiosis. We then focus our discussion mainly on mammals. We first consider the 
diploid cells of the germ line that give rise to the gametes and how the sex of a 
mammal is determined. We then discuss the nature of the gametes themselves. 
Finally, we consider the process of fertilization, in which an egg and a sperm fuse 
to form a new diploid organism. 



Summary 

The sexual reproductive cycle involves an alternation of diploid and haploid states: 
diploid cells divide by meiosis to form haploid cells, and the haploid cells from two 
individuals fuse in pairs to form new diploid zygotes. In the process, genomes are 
mixed and recombined to produce individuals with novel genetic combinations. In 
most higher eucaryotes, diploid cells proliferate by mitosis, and only a small propor- 
tion of them (those of the germ line) undergo meiosis to produce haploid cells; the hap- 
loid cells develop into gametes, which are specialized for sexual reproduction, exist 
only briefly, and do not divide. Sexual reproduction is thought to be advantageous 
both because it produces individuals with novel genetic combinations, some of which 
can survive and procreate in an unpredictably variable environment, and because it 
provides an efficient way to eliminate harmful mutations from a population. 



MEIOSIS 

The realization that gametes are haploid came from an observation that also 
suggested that chromosomes carry genetic information. In 1883, it was discov- 
ered in a study of roundworms that the nucleus of an unfertilized egg and that 
of a sperm each contain two chromosomes, whereas the fertilized egg (zygote) 
contains four. This led to the chromosome theory of heredity, which explained 
the long-standing paradox that the maternal and paternal contributions to the 
character of the progeny seem to be equal, despite the enormous difference in 
size between the egg and sperm (see Figure 21-2). 

The finding also implied that haploid germ cells arise from a special kind of 
cell division in which the number of chromosomes is precisely halved. This type 
of division, called meiosis — the Greek word for diminution or lessening — begins 
in animals in diploid germ-line cells in the ovaries or testes. It might seem as if 
meiosis could occur by a simple modification of mitosis, in which DNA synthe- 
sis (S phase) is omitted and a single cell division produces two haploid cells 
directly. Meiosis, however, is more complex than this and involves two cell divi- 
sions rather than one, but with only one round of DNA synthesis. It was not until 
the early 1930s, as a result of painstaking cytological and genetic studies, that the 
basic features of meiosis were established. More recently, genetic and molecular 
studies have begun to identify the various meiosis-specific proteins that cause 
meiotic chromosomes to behave differently from mitotic chromosomes and 
help mediate the crucial genetic recombination events that occur in meiosis. We 
will see that the recombination events are important not only for genetic mixing, 
but also for accurate chromosome segregation during meiosis. 



Gametes Are Produced by Two Meiotic Cell Divisions 

Meiosis uses much of the same molecular machinery and control systems that 
operate in ordinary mitosis. In this chapter, however, we focus on the special fea- 
tures of meiosis that distinguish it from mitosis. At the beginning of meiosis, as in 
mitosis, the chromosomes have replicated their DNA (in meiotic S phase), and the 
two copies are tightly bound together by cohesin complexes along their entire 
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Figure 21-5 Comparison of 
meiosis and mitotic cell 
division. For clarity, only 
one pair of homologous 
chromosomes (homologs) is 
shown. (A) In meiosis, after 
DNA replication, two 
nuclear (and cell) divisions 
are required to produce the 
haploid gametes. The 
duplicated homologs, each 
consisting of tightly bound 
sister chromatids, pair up 
and are segregated into 
different daughter cells in 
meiosis I; the sister 
chromatids separate only in 
meiosis II. As indicated by 
the formation of 
chromosomes that are 
partly red and partly gray, 
homolog pairing in meiosis 
leads to genetic 
recombination (crossing- 
over) during meiosis I, as 
discussed later. Each diploid 
cell that enters meiosis 
therefore produces four 
genetically different haploid 
cells. <AGTG> (B) In mitosis, 
by contrast, homologs do 
not pair up, and the sister 
chromatids separate during 
the single division. Thus, 
each diploid cell that 
divides by mitosis produces 
two genetically identical 
diploid daughter cells. 
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length (see Figure 17-24) and are called sister chromatids. Unlike mitosis, how- 
ever, meiosis has to produce gametes with half as many chromosomes as their 
diploid precursor cells. This is achieved by modifying the mitotic program so that 
a single round of DNA replication is followed by two successive rounds of chro- 
mosome segregation (Figure 21-5A). Recall that in mitosis (discussed in Chapter 
17), the duplicated chromosomes line up in random order at the equator of the 
mitotic spindle, and the sister chromatids are pulled apart and segregated into the 
two daughter cells, so that each daughter inherits a complete diploid set of chro- 
mosomes and is genetically identical to the parent cell (Figure 21-5B). In divi- 
sion I of meiosis (meiosis I), by contrast, the duplicated paternal and maternal 
homologs (including the two replicated sex chromosomes) pair up along side 
each other and exchange genetic information through the process of genetic 
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recombination. They then line up at the equator of the meiotic spindle, after 
which the duplicated homologs rather than the sister chromatids are pulled 
apart and segregated into the two daughter cells. Only in division II of meiosis 
(meiosis II), which occurs without further DNA replication, are the sister chro- 
matids pulled apart and segregated to produce haploid daughter cells. In this 
way, each diploid cell that enters meiosis produces four haploid cells, each of 
which inherits either the maternal or paternal copy of each chromosome, but 
not both (see Figure 21-5A). 



Duplicated Homologs (and Sex Chromosomes) Pair During Early 
Prophase I 

During mitosis in most organisms, homologous chromosomes behave indepen- 
dently of each other. During meiosis I, however, it is crucial that homologs rec- 
ognize each other and associate physically in order for the maternal and pater- 
nal homologs to undergo genetic recombination and to segregate to different 
daughter cells at anaphase I. Special mechanisms mediate these intimate inter- 
actions between homologs. 

The progressive juxtaposition of homologs occurs during a very prolonged 
meiotic prophase (prophase I), which can take hours in yeasts, days in mice, and 
weeks in higher plants. Like their mitotic counterparts, duplicated meiotic 
prophase chromosomes initially appear as long threadlike structures, in which 
the sister chromatids are so tightly glued together that they appear as one. It is 
during early prophase I that the homologs begin to associate along their length 
in a process called pairing, which, in some organisms at least, occurs initially 
through interactions between complementary DNA sequences (called pairing 
sites) in the two homologs; in most organisms, stable pairing requires genetic 
recombination between the homologs. As prophase I progresses, the homologs 
become more closely juxtaposed, forming a four- chromatid structure called a 
bivalent (Figure 21-6A). As we discuss later, genetic recombination begins dur- 
ing pairing in early prophase I, with the production of programmed double- 
strand breaks in chromatid DNA; some of these recombination events will later 
resolve into crossovers, where a fragment of a maternal chromatid is exchanged 
for a corresponding fragment of a homologous paternal chromatid (Figure 
21-6B; also see Figure 5-64). 

The pairing of homologs requires chromosome movements, but it is not 
known what drives these movements. The replicated chromosomes undergo 
major rearrangements within the nucleus during prophase I. The ends of the 
chromosomes (the telomeres) are tightly bound to the inner surface of the nuclear 
envelope. They are initially distributed diffusely there, but they then cluster tran- 
siently at one spot on the envelope and, later still, disperse again (Figure 21-7). 
Neither the mechanism nor the roles of these rearrangements are known, 
although they are thought to make prophase I faster and more efficient. One pos- 
sibility is that they help prevent chromosome entanglements during prophase I. 
In fission yeast, telomere clustering is required for homolog pairing and crossing- 
over, but in some organisms it occurs after pairing is well underway. 
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Figure 21-6 Homolog alignment and 
crossing-over. (A) The structure formed 
by two closely aligned duplicated 
homologs is called a bivalent. As in 
mitosis, the sister chromatids in each 
homolog are tightly connected along 
their entire lengths, as well as at their 
centromeres. At this stage, the homologs 
are usually joined together by a protein 
complex called the synaptonemal 
complex (not shown; see Figure 21-9). 
(B) A later-stage bivalent in which a single 
crossover event has occurred between 
non-sister chromatids. It is only when the 
synaptonemal complex disassembles and 
the paired homologs separate a little at 
the end of prophase I, as shown, that the 
crossover is seen microscopically as a thin 
connection between the homologs called 
a chiasma. 
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Figure 21-7 Rearrangements of 
telomeres during prophase in 
developing bovine eggs. The nucleus is 
stained blue and the telomeres are 
stained red. During prophase I, the 
telomeres are bound to the inner surface 
of the nuclear envelope. At first, they are 
dispersed around the nuclear envelope 
(not shown). Then they become clustered 
at one region of the envelope (A); finally, 
toward the end of prophase I, they 
disperse again (B). (From C. Pfeifer et al., 
Dev. Biol. 255:206-21 5, 2003. With 
permission from Elsevier.) 
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We have described the pairing of homologous autosomes during prophase 
I, but what happens to the sex chromosomes? This varies between different 
organisms. Female mammals have two X chromosomes, which pair and segre- 
gate like other homologs. But the males have one X and one Y chromosome. 
Although these chromosomes are not homologous, they too must pair and 
undergo crossing-over during prophase if they are to segregate normally at 
anaphase I. Pairing, crossing- over, and segregation are possible because of a 
small region of homology between the X and the Y at one or both ends of these 
chromosomes. The two chromosomes pair and crossover in this region during 
prophase I, ensuring that each sperm receives either one Y or one X chromo- 
some and not both or neither. Thus, only two types of sperm are normally pro- 
duced: those containing one Y chromosome, which will give rise to male 
embryos, and those containing one X chromosome, which will give rise to 
female embryos. 



Homolog Pairing Culminates in the Formation of a Synaptonemal 
Complex 

The paired homologs are brought into closer juxtaposition, with their structural 
axes {axial cores) about 400 nm apart, by a mechanism that depends in most 
species on the programmed double-strand DNA breaks that occur in sister chro- 
matids. What pulls the axes together? One possibility is that the large protein 
machine, called a recombination complex, which assembles on a double-strand 
break in a chromatid, binds the matching DNA sequence in the nearby homolog 
and helps reel in this partner. This so-called presynaptic alignment of the 
homologs is followed by synapsis, in which the axial core of a homolog becomes 
tightly linked to the axial core of its partner by a closely packed array of trans- 
verse filaments to create a synaptonemal complex, which bridges the gap, now 
only 100 nm, between the homologs (Figure 21-8). Although crossing-over 
begins before the synaptonemal complex assembles, the final steps occur while 
the DNA is held in the complex (discussed in Chapter 5) . 

The morphological changes that occur during the pairing of meiotic chro- 
mosomes are the basis for dividing prophase I into five sequential stages — lep- 
totene, zygotene, pachytene, diplotene, and diakinesis. As shown in Figure 21-9, 
prophase I starts with leptotene, when homologs condense and pair, and genetic 
recombination begins. At zygotene, the synaptonemal complex begins to assem- 
ble in local regions along the homologs; assembly initiates at sites where the 
homologs are closely associated and recombination events are occurring. At 
pachytene, the assembly process is complete, and the homologs are synapsed 
along their entire lengths. The pachytene stage can persist for days or longer, 
until desynapsis begins at diplotene with the disassembly of the synaptonemal 
complexes and the concomitant condensation and shortening of the chromo- 
somes. It is only at this stage, after the complexes have disassembled, that the 
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Figure 21-8 Simplified schematic 
drawing of a synaptonemal complex. 

Before the synaptonemal complex forms, 
recombination complexes assemble on 
double-strand DNA breaks on sister 
chromatids and help catalyze crossing- 
over between nonsister chromatid loops 
from opposite sides of the complex (not 
shown). (Modified from K. Nasmyth, 
Annu. Rev. Genet 35:673-745, 2001 . With 
permission from Annual Reviews.) 
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individual crossover events between nonsister chromatids can be seen as inter- 
homolog connections called chiasmata (singular chiasma), which now play a 
crucial part in holding the compact homologs together (Figure 21-10). The 
homologs are now ready to begin the process of segregation. Prophase I ends 
with diakinesis — the stage of transition to metaphase I. 

The proteins that form the transverse filaments that bridge between the 
axial cores of the homologs have been identified in several species, including 
yeasts, worms, flies, and mammals. They form homodimers that interact with 
each other across the 100 nm gap between the homologs, as illustrated in Figure 
21-11. In most eucaryotes, these proteins are important for crossing- over, as 
mutants that lack them fail to form crossovers. The cohesin complexes that 
assemble on the DNA during S phase and bind the sister chromatids together 
during meiosis are major components of the axial core of each homolog (see Fig- 
ure 21-8). Some of the cohesin subunits that operate in meiosis are the same as 
those that function in mitosis, whereas others are specific for meiosis. Both the 
crossovers and the cohesin complexes play crucial parts in segregating the 
homologs during meiotic division I, as we now discuss. 
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Figure 21-9 Homolog synapsis and 
desynapsis during the different stages of 
prophase I. (A) A single bivalent is shown 
schematically. At leptotene, the two sister 
chromatids coalesce, and their chromatid 
loops extend out together from a common 
axial core. The synaptonemal complex 
begins to assemble focally in early 
zygotene. Assembly continues through 
zygotene and is complete in pachytene. 
The complex disassembles in diplotene. 
(B) An electron micrograph of a 
synaptonemal complex from a meiotic cell 
at pachytene in a lily flower. (C and D) 
Immunofluorescence micrographs of 
prophase I cells of the fungus Sordaria. 
Partially synapsed bivalents at zygotene 
are shown in (C) and fully synapsed 
bivalents ore shown in (D). Red arrowheads 
in (C) point to regions where synapsis is 
still incomplete. (B, courtesy of Brian Wells; 
C and D, from A. Storlazzi et a I., Genes Dev. 
17:2675-2687, 2003. With permission from 
Cold Spring Harbor Laboratory Press.) 



Homolog Segregation Depends on Meiosis-Specific, Kinetochore- 
Associated Proteins 

One fundamental difference between meiosis I and mitosis (and meiosis II) is that 
in meiosis I homologs rather than sister chromatids separate and then segregate 

Figure 21-10 A bivalent with three chiasmata resulting from three crossover 
events. (A) Light micrograph of a grasshopper bivalent. (B) Drawing showing 
the arrangement of the crossovers in (A). Note that chromatid 1 has 
undergone an exchange with chromatid 3, and chromatid 2 has undergone 
exchanges with chromatids 3 and 4. Note also how the combination of the 
chiasmata and the tight attachment of the sister chromatid arms to each other 
(mediated by cohesin complexes) hold the two homologs together after the 
synaptonemal complex has disassembled; if either the chiasmata or the sister- 
chromatid adhesion failed to form, the homologs would come apart at this 
stage and not be segregated properly when the cell divides at the end of 
meiosis I. (A, courtesy of Bernard John.) 
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into the two daughter cells (see Figure 21-5). This difference depends on three 
features of meiosis I that distinguish it from mitosis (Figure 21-12). First, the 
kinetochores (protein complexes associated with the centromeres, discussed in 
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Figure 21-1 1 A molecular model of how 
transverse filaments may be formed by 
a single type of protein. (A) A diagram of 
the polypeptide chain showing the 
N- and C-terminal globular domains, 
connected by a coiled-coil region. 
(B) It is proposed that the protein forms 
homodimers, which then interact across 
the 100 nm gap separating the axial 
cores of the two homologs. (Adapted 
from S.L. Page and R.S. Hawley, Science 
301:785-789, 2003. With permission 
from A A AS.) 
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Figure 21-12 Comparison of chromosome behavior in meiosis I, meiosis II, and mitosis. Chromosomes behave similarly in mitosis and 
meiosis II, but they behave very differently in meiosis I. (A) In meiosis I, the two sister kinetochores are located side-by-side on each homolog at 
the sister centromeres and attach to microtubules emanating from the same spindle pole. The proteolytic destruction of the cohesin 
complexes along the sister chromatid arms unglues the arms and resolves the crossovers, allowing the duplicated homologs to separate at 
anaphase I, while the residual cohesin complexes at the centromeres keep the sisters together. The proteolytic destruction of the residual 
cohesin complexes at the centromeres allows the sister chromatids to separate at anaphase II. (B) In mitosis, by contrast, the two sister 
kinetochores attach to microtubules emanating from different spindle poles, and the two sister chromatids come apart at the start of anaphase 
and segregate into separate daughter cells (discussed in Chapter 1 7). 
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Chapters 4 and 17) on the two sister chromatids of a homolog attach to micro- 
tubules emanating from the same pole of the meiosis I spindle and thus segre- 
gate together into the same daughter cell at anaphase I; this contrasts with mito- 
sis (and meiosis II), in which the kinetochores on the two sister chromatids of a 
chromosome attach to opposite poles of the spindle and therefore segregate 
into different daughter cells at anaphase. Second, a strong physical linkage is 
maintained between the homologs that resists the pulling forces of the meiosis 
I spindle until the bivalents are aligned at the equator of the spindle and the 
homologs separate at anaphase I. The chiasmata formed between nonsister 
chromatids and the cohesion between sister- chromatid arms cooperate in hold- 
ing the homologs together (see Figure 21-10). Third, the arms of the sister chro- 
matids separate at anaphase I, resolving the chiasmata and allowing the 
homologs to separate, but the sisters stay glued together in the region of their 
centromeres until anaphase II and therefore do not separate in anaphase I. 

In micromanipulation experiments, meiosis I chromosomes transferred to a 
meiosis II spindle behave as they do in meiosis I, indicating that the specialized 
behavior of meiosis I chromosomes is determined by the chromosomes them- 
selves rather than by the spindle or other cytoplasmic factors. Various meiosis- 
specific proteins associated with meiosis I chromosomes account for the special 
behavior, although they work together with non-meiosis-specific proteins that 
help mediate both mitosis and meiosis. Meiosis -specific protein complexes, for 
example, associate with the two kinetochores on each replicated homolog and 
help ensure that the two sister chromatids attach to microtubules emanating 
from a single spindle pole. Other proteins (called shugoshins) associated with 
kinetochores help ensure that sister kinetochores do not come apart at 
anaphase I when the proteolytic enzyme separase (discussed in Chapter 17) 
cleaves the cohesin complexes that tie the arms of sister chromatids together. 
One way that the shugoshins protect the cohesin complexes at centromeres is by 
recruiting a specific protein phosphatase to the centromeres; the phosphatase 
reverses the phosphorylation of the cohesin complexes that is required for sep- 
arase to cleave them. Thus, whereas the chromatid arms come apart at anaphase 

I, the centromeres do not. The sisters separate only when separase cleaves the 
remaining cohesin complexes at the centromeres at anaphase II (see Figure 
21-12A), when the shugoshins are gone. 

Unlike meiosis I, meiosis II occurs rapidly and closely resembles a mitotic 
cell division, although it occurs without DNA replication. Prophase II is brief: the 
nuclear envelope breaks down as the new spindle forms, after which metaphase 

II, anaphase II, and telophase II usually follow in quick succession. After nuclear 
envelopes have formed around the four haploid nuclei produced at telophase II, 
cytokinesis occurs, and meiosis is complete. 



Meiosis Frequently Goes Wrong 

The sorting of chromosomes that takes place during meiosis is a remarkable feat 
of intracellular bookkeeping. In humans, each meiosis requires that the starting 
cell keep track of 92 chromatids (46 chromosomes, each of which has duplicated), 
distributing one complete set of each type of chromosome to each of the four hap- 
loid progeny cells. Not surprisingly, mistakes can occur in allocating the chromo- 
somes during this elaborate process. Mistakes are especially common in human 
female meiosis, which arrests for years after diplotene: meiosis I is completed only 
at ovulation, and meiosis II only after the egg is fertilized. Indeed, such chromo- 
some segregation errors during egg development are the commonest cause of 
both spontaneous abortion (miscarriage) and mental retardation in humans. 

When homologs fail to separate properly — a phenomenon called nondis- 
junction — the result is that some of the haploid gametes produced lack a par- 
ticular chromosome, while others have more than one copy of it. (Cells with an 
abnormal number of chromosomes are said to be aneuploid, whereas those with 
the correct number are said to be euploid.) Upon fertilization, aneuploid 
gametes form abnormal embryos, most of which die. Some survive, however. 
Down syndrome in humans, for example, which is the leading single cause of 
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Figure 21-13 Two major contributions to 
the reassortment of genetic material that 
occurs in the production of gametes 
during meiosis. (A) The independent 
assortment of the maternal and paternal 
homologs during meiosis produces 2 n 
different haploid gametes for an organism 
with n chromosomes. Here n = 3, and there 
are 8 different possible gametes. 
(B) Crossing-over during prophase I 
exchanges DNA segments between 
homologous chromosomes and thereby 
re-assorts genes on individual 
chromosomes. Because of the many small 
differences in DNA sequence that always 
exist between any two homologs, both 
mechanisms increase the genetic variability 
of organisms that reproduce sexually. 



(A) 



(B) 



mental retardation, is caused by an extra copy of chromosome 21, usually result- 
ing from nondisjunction during meiosis I in the female ovary Segregation errors 
during meiosis I increase greatly with advancing maternal age. 

Despite its fallibility, almost all eucaryotes use meiosis, intermittently at 
least, to shuffle their genetic information before passing it on to the next gen- 
eration. Crossing-over makes a major contribution to this genetic shuffling 
process, as we now discuss. 



Crossing-Over Enhances Genetic Reassortment 

Unless they are identical twins, which develop from a single zygote, no two off- 
spring of the same parents are genetically the same. As we discussed earlier, 
this is because, long before the two gametes fuse at fertilization, two kinds of 
randomizing genetic reassortment have occurred in meiosis I, during the pro- 
duction of the gametes: the random distribution of maternal and paternal 
homologs, and crossing- over. The random distribution of maternal and pater- 
nal homologs (Figure 21-13A) could, in principle, produce 2 n genetically dif- 
ferent gametes, where n is the haploid number of chromosomes. In humans, 
for example, each individual can produce at least 2 23 = 8.4 x 10 6 genetically dif- 
ferent gametes. But the actual number of variants is very much greater than 
this because of chromosomal crossing-over (or simply crossing- over), which 
is an outcome of homologous recombination (discussed in Chapter 5), in 
which DNA segments of homologous chromosomes are exchanged. In meio- 
sis, when the exchange occurs between nonsister chromatids, it mixes the 
genetic constitution of each of the chromosomes (Figure 21-13B). On average, 
between two and three crossovers occur between each pair of human 
homologs (Figure 21-14). 



1 280 Chapter 21 : Sexual Reproduction: Meiosis, Germ Cells, and Fertilization 




10|um 



The molecular details of crossing- over are discussed in Chapter 5 (see Figure 
5-64). Briefly, a conserved meiosis-specific protein called Spoil initiates cross- 
ing-over by creating a double-strand break in the DNA of either a maternal or a 
paternal chromatid. A very large multienzyme recombination complex, contain- 
ing double-strand DNA repair enzymes, assembles on the break and catalyzes 
homologous recombination. In most cases, these events do not result in a 
crossover. In some cases, however, homologous recombination leads to a 
crossover, where DNA segments are exchanged between two nonsister chro- 
matids in a reciprocal fashion. As discussed earlier, after desynapsis, each 
crossover can be seen in the microscope as a chiasma (see Figure 21-10A). As 
illustrated in Figure 21-10B, each of the two sister chromatids of a homolog can 
form one or more crossovers with either of the two sister chromatids of its part- 
ner homolog. 



Crossing-Over Is Highly Regulated 

Crossing- over has two distinct functions in meiosis: it helps hold homologs 
together so that they are properly segregated to the two daughter cells produced 
by meiosis I, and it contributes to the genetic diversification of the gametes that 
are eventually produced. As might be expected, therefore, crossing-over is highly 
regulated: the number and location of double-strand breaks along each chro- 
mosome is controlled, as is the likelihood that a break will be converted into a 
crossover. Although the double-strand breaks that occur in meiosis I can be 
located almost anywhere along the chromosome (see Figure 21-14), they are not 
distributed uniformly: they cluster at "hot spots", where the chromatin is acces- 
sible, and occur only rarely in "cold spots", such as the heterochromatin regions 
around centromeres and telomeres. 

At least two kinds of regulation influence the location and number of 
crossovers that form, neither of which is well understood. Both operate before 
the synaptonemal complex assembles. One ensures that at least one crossover 
forms between the members of each homolog pair, as is necessary for normal 
homolog segregation in meiosis I. In the other, called crossover interference, the 
presence of one crossover event inhibits another from forming close by, perhaps 
by locally depleting proteins required for converting a double-strand DNA break 
into a stable crossover. 



Figure 21-14 Crossovers between 
homologs in the human testis. In these 
immunofluorescence micrographs, 
antibodies have been used to stain the 
synaptonemal complexes {red), the 
centromeres {blue), and the sites of 
crossing-over {green). Note that all of 
the bivalents have at least 1 crossover 
and none have more than 3. (Modified 
from A. Lynn et al., Science 
296:2222-2225, 2002. With permission 
from AAAS.) 



Meiosis Is Regulated Differently in Male and Female Mammals 

The basic mechanisms of meiosis have been conserved in evolution in all sexually 
reproducing eucaryotes. In all of these organisms, for example, most of meiosis is 
spent in prophase I, although the details of the timing of different stages vary 
among organisms (Figure 21-15). There are, however, some remarkable differ- 
ences in the regulation of meiosis in different species and in different sexes of the 
same species. The difference between the two sexes is very striking in mammals. 
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In mammalian females, egg precursor cells (oocytes) begin meiosis in the 
fetal ovary but arrest after diplotene, after the synaptonemal complex has disas- 
sembled in meiosis I. They complete meiosis I only after the female has become 
sexually mature and the oocyte is released from the ovary during ovulation; 
moreover, the released oocyte completes meiosis II only if it is fertilized. Thus, 
there are special stop and start mechanisms during meiosis in female mammals. 
In humans, some oocytes remain arrested in meiosis I for 40 years or more, 
which is presumably at least part of the reason why nondisjunction increases 
dramatically in older women. In mammalian males, by contrast, meiosis only 
begins in sperm precursor cells (spermatocytes) in the testes at puberty and then 
goes on continuously, without the stop and start mechanisms that operate in 
female meiosis. It takes about 24 days for a human spermatocyte to complete 
meiosis. 

There is also a big difference in the error rates of meiosis in mammalian 
females and males, and this is especially striking in humans. About 20% of 
human eggs are aneuploid, compared with 3-4% of human sperm, and, largely 
as a result of this, up to 25% of all human fetuses are aneuploid, and most of 
these result from nondisjunction in oocytes at meiosis I. Mammalian fertiliza- 
tion typically involves the ovulation of a small number of eggs at one end of the 
female reproductive tract and the entry of millions of sperm at the other. Given 
the relative scarcity of eggs, one might have expected that egg development 
would be subject to more stringent quality control than sperm development, but 
the opposite is the case. If meiosis goes wrong in male cells, a cell-cycle check- 
point mechanism (discussed in Chapter 17) is activated, which arrests meiosis 
and leads to cell death by apoptosis. Such checkpoint mechanisms apparently 
do not operate in female meiotic cells: if homolog segregation fails to occur nor- 
mally, the cells continue through meiosis and produce aneuploid eggs. The male 
germ line, on the other hand, is thought to be the main source of another type of 
genetic error. Because many more mitotic cell divisions occur on the way to the 
production of a sperm, and each round of DNA replication is liable to error, the 
average number of new mutations contributed by fathers is larger than the 
number contributed by mothers. 

The production of gametes involves more than just meiosis, and the other 
processes also differ for eggs and sperm. As we will see, by the end of meiosis, a 
mammalian egg is fully mature, whereas a sperm that has completed meiosis has 
only just begun its differentiation. Before discussing these gametes, however, we 
first consider how certain cells in the mammalian embryo initially become spec- 
ified to develop into germ cells and how these cells then become committed to 
developing into either sperm or eggs, depending on the sex of the individual. 



Summary 

Haploid gametes (eggs, sperm, pollen, and spores) are produced by meiosis, in which 
two successive cell divisions follow one round of DNA replication to give rise to four 
haploid cells from a single diploid cell. Meiosis is dominated by a prolonged prophase 
I, which can occupy 90% or more of the total meiotic period. At the start of prophase I, 
the chromosomes have replicated and consist of two tightly joined sister chromatids. 
Homologous chromosomes (homologs) then pair up side-by-side and become progres- 
sively more closely juxtaposed as prophase I proceeds. The tightly aligned homologs 
(bivalents) undergo genetic recombination, forming crossovers that can later be seen 
as chiasmata, which help hold each pair of homologs together during metaphase I. 
Both crossing-over and the independent segregation of the maternal and paternal 
copies of each chromosome during meiosis I have important roles in producing 
gametes that are genetically different from one another and from both parents. Meio- 
sis-specific, kinetochore- associated proteins help ensure that both sister chromatids in 
a homolog attach to the same spindle pole; other kinetochore- associated proteins 
ensure that the homologs remain connected at their centromeres during anaphase I, so 
that homologs rather than sister chromatids are segregated in meiosis I. After the long 
meiosis I, meiosis II follows rapidly, without DNA replication, in a process that resem- 
bles mitosis, in that sister chromatids are pulled apart at anaphase. 
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Figure 21-15 Comparison of times 
required for each of the stages of 
meiosis. (A) Approximate times for a 
male mammal (mouse). (B) Approximate 
times for the male tissue of a plant (lily). 
Times differ for male and female gametes 
(sperm and eggs, respectively) of the 
same species, as well as for the same 
gametes of different species. Meiosis in a 
human male, for example, lasts for 
24 days, compared with 1 2 days in the 
mouse. In human females, it can last 
40 years or more, because meiosis I 
arrests after diplotene. In all species, 
however, prophase I is always much 
longer than all the other meiotic stages 
combined. 
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PRIMORDIAL GERM CELLS AND SEX 
DETERMINATION IN MAMMALS 

Sexual reproductive strategies vary enormously among different organisms. In 
the rest of this chapter, we focus mainly on the strategies used by mammals. 

In all vertebrate embryos, certain cells are singled out early in development 
as progenitors of the gametes. These diploid primordial germ cells (PGCs) 
migrate to the developing gonads, which will form the ovaries in females and the 
testes in males. After a period of mitotic proliferation in the developing gonads, 
the PGCs undergo meiosis and differentiate into mature haploid gametes — 
either eggs or sperm. Later, the fusion of egg and sperm after mating initiates 
embryogenesis. The subsequent production of new PGCs in this new embryo 
begins the cycle again (see Figure 21-3A). 

In this section, we consider how mammalian PGCs arise, how the sex of a 
mammal is determined, and how sex determination dictates whether the PGCs 
develop into sperm or eggs. 



Signals from Neighbors Specify PGCs in Mammalian Embryos 

In many animals, including many vertebrates, the unfertilized egg contains spe- 
cific molecules localized in a particular region of the cytoplasm that determine 
which cells will become the germ cells. When the egg is fertilized and divides 
repeatedly to produce the cells of the early embryo, the cells that ultimately 
inherit these germ cell determinants become PGCs (Figure 21-16). Although the 
molecular nature and functions of the determinants are largely unknown, pro- 
teins of the Vasa family are a required component in all of these animals. Vasa 
proteins are structurally similar to ATP-dependent RNA helicases, but their pre- 
cise function in germ cell determination remains a mystery. 

By contrast, in other animals, including mammals, the egg cytoplasm does 
not contain localized germ cell determinants. Instead, signals from neighboring 
cells dictate which cells become PGCs. In mammals, all cells produced by the 
first few divisions of the fertilized egg are totipotent— that is, they have the 
potential to give rise to any of the cell types of the animal, including the germ 
cells and cells of extraembryonic tissues such as the placenta. Only later is a 
small group of cells induced by signals from neighbors to become PGCs. In mice, 
for instance, about 6 days after fertilization, signals (including bone mor- 
phogenic protein 4, BMP4) secreted by cells in tissue lying outside the embryo 
proper induce about 10 cells in the adjacent underlying embryo to become PGC 
precursors. These cells divide and mature to become PGCs, turning off the 
expression of a number of somatic cell genes and turning on the expression of 
genes involved in maintaining the special character of the germ cells. 

Although different mechanisms specify PGCs in different animals, some of 
the mechanisms that control their proliferation and development have been 



Figure 21-16 Segregation of germ cell 
determinants in the nematode 
G elegans. The micrographs in the upper 
row show the pattern of cell divisions, 
with the cell nuclei stained blue; below, 
the same cells are stained with an 
antibody that labels (in green) small 
granules (called P granules) that function 
as germ cell determinants. The P granules 
are composed of RNA and protein 
molecules and are distributed randomly 
throughout the cytoplasm of the 
unfertilized egg (not shown). As shown in 
the far left-hand panels, after fertilization, 
the granules accumulate at one pole of 
the zygote. The granules are then 
segregated into one of the two daughter 
cells at each cell division. The single cell 
containing the P granules in the embryo 
shown in the far right-hand panels is the 
precursor of the germ line. (Courtesy of 
Susan Strome.) 
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Figure 21-17 Migration of mammalian 
PGCs. (A) Fluorescence micrograph 
showing migrating PGCs in a cross 
section of an early mouse embryo. The 
PGCs are stained with a monoclonal 
antibody (in green) that specifically labels 
these cells at this stage of 
embryogenesis.The remaining cells in 
the embryo are stained with a lectin (in 
red) that binds to sialic acid, which is 
found on the surface of all cells. 
(B) Drawing corresponding to the 
micrograph shown in (A). (A, courtesy of 
Robert Anderson and Chris Wylie.) 



conserved in evolution from worms to humans. For example, PGC development 
in all animals that have been studied depends on the suppression of somatic cell 
fates by gene repression, as well as on the inhibition of translation of specific 
mRNAs by Nanos RNA-binding proteins. 



PGCs Migrate into the Developing Gonads 

After mammalian PGCs develop, they proliferate and migrate to their final des- 
tination in the developing gonads (Figure 21-17). As they migrate through the 
embryo, various extracellular signal proteins produced by adjacent somatic cells 
signal them to survive, proliferate, and migrate. Among the secreted signal pro- 
teins that help attract PGCs into the developing gonad are chemokines, which 
bind to G-protein-coupled receptors (GPCRs) and guide the migration of vari- 
ous cell types, including PGCs and white blood cells (discussed in Chapter 23). 

After the PGCs enter the developing gonad, which at this stage is called the 
genital ridge, they go through several more mitotic cell divisions, in the course of 
which they become specified to follow a pathway that will lead them to develop 
as either eggs or sperm. 

When the PGCs first migrate into the embryonic gonad, however, they are 
not irreversibly committed to becoming gametes. When removed from the 
embryo and cultured in the presence of appropriate extracellular signal pro- 
teins, they convert into cells that can be maintained in culture indefinitely as a 
cell line that can produce any of the cell types of the body of the animal, 
although not the extraembryonic cells that go on to form structures such as the 
placenta; for this reason, they are said to be pluripotent, rather than totipotent. 
In these respects, these so-called embryonic germ (EG) cells resemble embryonic 
stem (ES) cells (discussed in Chapter 23). EG and ES cells are promising sources 
of various human cell types — both for drug testing and for the treatment of dis- 
eases such as heart attacks, strokes, and various neurodegenerative diseases, in 
which specific cell types die. 

What determines whether the PGCs that migrate into the developing mam- 
malian gonad develop into eggs or sperm? Surprisingly, it is not their own sex 
chromosome constitution but rather whether the genital ridge has begun to 
develop into an ovary or a testis, respectively. The sex chromosomes in the 
somatic cells of the genital ridge determine which type of gonad the ridge 
becomes. Although many genes influence the outcome, a single gene on the Y 
chromosome has an especially important role. 



The Sry Gene Directs the Developing Mammalian Gonad to 
Become a Testis 

Aristotle believed that the temperature of the male during sexual intercourse 
determined the offspring's sex: the higher the temperature, the greater the 
chance of producing a male. Had he been referring to lizards or alligators rather 
than to humans, he would have been closer to the truth, as in many egg-laying 
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Figure 21-18 Chromosomes of a normal 
human male. The chromosomes have 
been labeled with Giemsa stain. See also 
Figures 4-10 and 4-1 1 . Note the 
difference in size of the two sex 
chromosomes. Whereas the 
X chromosome contains more than 
1 000 genes, the Y chromosome contains 
only about 80. (Courtesy of Julie 
Robertson of the Wisconsin State 
Laboratory of Hygiene.) 



reptiles the temperature of the incubating egg determines the sex of the off- 
spring; in lizards and alligators, males develop at warm temperatures and 
females develop at cool temperatures. We now know, however, that sex chromo- 
somes, rather than the temperature of the parents or embryo, determine the sex 
of a mammal. 

Female mammals have two X chromosomes in all of their somatic cells, 
whereas males have one X and one Y. The presence or absence of the Y chromo- 
some, which is the smallest human chromosome (Figure 21-18), determines the 
sex of the individual. Individuals with a Y chromosome develop as males no mat- 
ter how many X chromosomes they have, whereas individuals without a Y chro- 
mosome develop as females, even if they have only one X chromosome. The 
sperm that fertilizes the egg determines the sex of the resulting zygote: eggs have 
a single X chromosome, whereas sperm can have either an X or a Y. 

The Y chromosome influences the sex of the individual by directing the 
somatic cells of the genital ridge to develop into a testis instead of an ovary. 
Mammalian embryos are programmed to develop as females unless prevented 
from doing so by the testes, which direct the embryo to develop into a male. If 
the genital ridges are removed before they have started to develop into a testis or 
an ovary, a mammal develops into a female, regardless of the sex chromosomes 
it carries. This does not mean that signals are not required for the development 
of female- specific organs in mammals: the secreted signal protein Wnt4, for 
example, is required for normal mammalian ovary development. 

The crucial gene on the Y chromosome that directs the genital ridge to 
develop into a testis instead of an ovary is called Sry, for "sex- determining region 
of Y." Remarkably, when this gene is introduced into the genome of an XX mouse 
zygote, the transgenic embryo produced develops as a male, even though it lacks 
all of the other genes on the Y chromosome. Such sex-reversed mice, however, 
cannot produce sperm, because they lack the other genes on the Y chromosome 
that are required for sperm development. Similarly, XY humans with an inacti- 
vating mutation in Sry develop as females, even though they are otherwise 
genetically male. 

Sry is expressed in a subpopulation of the somatic cells of the developing 
gonad, and it causes these cells to differentiate into Sertoli cells, which are the 
main type of supporting cells in the testis (see Figure 21-29). The Sertoli cells 
then direct sexual development along a male pathway by influencing other cells 
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in the genital ridge and elsewhere in the embryo in at least four ways: 

1. They stimulate the newly arriving PGCs to develop along a pathway that 
produces sperm. They do so by inhibiting the cells from entering meiosis 
and developing along the pathway that produces eggs, as we discuss later. 

2. They secrete anti- Mullerian hormone, which circulates in the blood and 
suppresses the development of the female reproductive tract by causing 
the Mullerian duct to regress (this duct otherwise gives rise to the oviduct, 
uterus, and upper part of the vagina). 

3. They stimulate endothelial and smooth muscle cells in adjacent mes- 
enchymal tissue to migrate into the developing gonad. These cells form 
critical elements within the testis that are required for normal sperm pro- 
duction, which begins when the organism reaches sexual maturity. 

4. They help to induce other somatic cells in the developing gonad to become 
Leydig cells, which secrete the male sex hormone testosterone into the 
blood. The secretion of testosterone is responsible for inducing all male 
secondary sexual characteristics, including the structures of the male 
reproductive tract, such as the prostate and seminal vesicles, which 
develop from another duct called the Wolffian duct system. This duct sys- 
tem degenerates in the developing female because it requires testosterone 
to survive and develop. Testosterone secretion also helps masculinize the 
early developing brain, influencing sexual identity and sexual orientation, 
and thereby sexual behavior: female rats that are treated with testosterone 
around birth, for example, later display malelike sexual behavior. 

The Sry gene encodes a gene regulatory protein (Sry) that binds to DNA and 
influences the transcription of other genes involved in Sertoli cell development. 
One crucial downstream gene encodes another gene regulatory protein related to 
Sry, which is called Sox9. The Sox9 gene is not on the Y chromosome, but it is 
expressed in males in all vertebrates, unlike Sry, which is found only in mammals. 
If Sox9 is expressed ectopically in the developing gonads of an XX mouse embryo, 
the embryo develops as a male, even if it lacks the Sry gene, suggesting that Sry 
normally acts by inducing the expression of Sox9. The Sox9 protein directly acti- 
vates the transcription of at least some Sertoli- cell- specific genes, including the 
gene encoding anti- Mullerian hormone. 

In the absence of either Sry or Sox9, the genital ridge of an XY embryo devel- 
ops into an ovary instead of a testis. The supporting cells become follicle cells 
instead of Sertoli cells. Other somatic cells become theca cells (instead of Leydig 
cells), which, beginning at puberty, contribute to the production of the female 
sex hormone estrogen. The PGCs develop along a pathway that produces eggs 
rather than sperm (Figure 21-19), and the animal develops as a female. 

How do Sertoli cells induce the PGCs that migrate into the developing gonad 
in males to follow the pathway leading to sperm production rather than to egg 
production? The mechanism depends on the small signal molecule retinoic acid 
(see Figure 15-13), which, in both sexes, is produced by cells in a transient tubu- 
lar structure called the mesonephros that lies adjacent to the developing gonad. 
In the embryonic ovary, the retinoic acid induces the proliferating germ-line 
cells to enter meiosis and start down the pathway leading to egg production; the 
cells become arrested after diplotene of prophase I, where they remain until 
ovulation, beginning when the female reaches sexual maturity. In the embryonic 
testis, by contrast, Sertoli cells produce an enzyme that degrades retinoic acid, 
preventing the retinoic acid from the mesonephros from inducing the germ-line 
cells to enter meiosis and begin egg development. Only much later, when the 
male becomes sexually mature, do the germ-line cells in the testis begin pro- 
ducting sperm. 

Many Aspects of Sexual Reproduction Vary Greatly between 
Animal Species 

Although meiosis is highly conserved in all sexually reproducing eucaryotes, 
other aspects of sexual reproduction are extremely variable. We have seen that 
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Figure 21-19 Influence of Sry on gonad 
development. The germ-line cells are 
shaded in red, and the somatic cells are 
shaded in green or blue. The change from 
light to darker color indicates that the cell 
has differentiated. The Sry gene acts in a 
subpopulation of somatic cells in the 
developing gonad to direct them to 
differentiate into Sertoli cells instead of 
follicle cells. The Sertoli cells then prevent 
the germ-line cells from developing 
along the egg pathway and help direct 
them down the sperm pathway of 
development, beginning at puberty. They 
also secrete anti-Mullerian hormone, 
which causes the Mullerian duct to 
regress, and they help to induce other 
somatic cells to differentiate into Leydig 
cells, which secrete testosterone (see 
Figure 21-29). In the absence of Sry, the 
germ-line cells commit to egg 
development, and the somatic cells 
develop into either follicle cells, which 
support egg development, or theca cells, 
which produce progesterone; the 
progesterone is converted to estrogen by 
the follicle cells. Whereas the testis 
begins secreting testosterone in the 
fetus, the ovary does not begin secreting 
estrogen until puberty. 



testosterone 



estrogen 



an animal's sex can depend on either its chromosomes or the environment in 
which it develops. But even the genetic mechanisms of sex determination vary 
greatly. In C. elegans and Drosophila, for example, sex is determined by the ratio 
of X chromosomes to autosomes, rather than by the presence or absence of a Y 
chromosome, as in mammals. In C. elegans, sex determination depends mainly 
on transcriptional and translational controls on gene expression, whereas in 
Drosophila it depends on a cascade of regulated RNA splicing events, as dis- 
cussed in Chapter 7. In Drosophila, moreover, the sex-specific character of each 
cell in the body is programmed individually by its own chromosomes, instead of 
being controlled mainly by hormones. It remains a mystery why some aspects of 
sexual reproduction have been conserved in evolution, while others have 
become so fundamentally different. 



Summary 

A small number of cells in the early mammalian embryo are signaled by their neigh- 
bors to become germ-line cells. The resulting primordial germ cells (PGCs) proliferate 
and migrate into the developing gonads. Here, the germ-line cells commit to develop 
into either eggs, if the gonad is becoming an ovary, or sperm, if the gonad is becoming 
a testis. A developing gonad will develop into an ovary unless its somatic cells contain 
a Y chromosome, in which case it develops into a testis. The Sry gene on the mam- 
malian Y chromosome is crucial for testis development: it is expressed in a subpopula- 
tion of somatic cells in the developing gonad and directs them to differentiate into Ser- 
toli cells, which then produce signal molecules that promote the development of male 
characteristics, and suppress the development of female characteristics. Mammalian 
embryos are programmed to follow a female pathway of development unless they are 
diverted to follow the male pathway by Sertoli cells. 
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In one respect at least, an egg is the most remarkable of animal cells: once acti- 
vated, it can give rise to a complete new individual within a matter of days or 
weeks. No other cell in a higher animal has this capacity. Activation is usually the 
consequence of fertilization — fusion of a sperm with the egg — but eggs can also 
be activated artificially by various nonspecific chemical or physical treatments. 
Indeed, some organisms, including a few vertebrates such as some lizards, nor- 
mally reproduce from eggs that become activated in the absence of sperm — that 
is, parthenogenetically. Mammals are the only animals that cannot be pro- 
duced parthenogenetically; because of genomic imprinting (discussed in Chap- 
ter 7), they require both maternal and paternal genetic contributions. 

Although an egg can give rise to every cell type in the adult organism, it is 
itself a highly specialized cell, uniquely equipped for the single function of gen- 
erating a new individual. The cytoplasm of an egg can even reprogram a somatic 
cell nucleus so that the nucleus can direct the development of a new individual, 
although the egg components responsible are mostly unknown. That is how the 
famous sheep Dolly was produced. The nucleus of an unfertilized sheep egg was 
removed with a glass pipette and replaced with the nucleus of an adult somatic 
cell. An electric shock was used to activate the egg, and the resulting embryo was 
implanted in the uterus of a surrogate mother. The resulting adult sheep had the 
genome of the donor somatic cell nucleus and was therefore a clone of the donor 
sheep. 

The same approach, called reproductive cloning, has been used to produce 
clones of various mammals, including mice, rats, cats, dogs, goats, pigs, cattle, 
and horses (see Figure 21-38). In all cases, the efficiency is low: most of the 
clones die before birth and less than 5% of them develop to adulthood, probably 
because the transplanted somatic nucleus is not completely reprogrammed and 
so expresses many genes inappropriately. 

In this section, we briefly consider some of the specialized features of an egg, 
before discussing how it undergoes its final preparations for fertilization. 
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Figure 21 -20 The actual sizes of three 
eggs. The human egg is 0.1 mm in 
diameter. 



An Egg Is Highly Specialized for Independent Development 

The eggs of most animals are giant single cells. They contain stockpiles of all the 
materials needed for initial development of the embryo until the stage at which 
the new individual can begin feeding. Before the feeding stage, the giant cell 
cleaves into many smaller cells, but no net growth occurs. The mammalian 
embryo is an exception. It can start to grow early by taking up nutrients from the 
mother via the placenta. Thus, a mammalian egg, although still a large cell, is not 
as large as the egg of a frog or bird, for example. Eggs are typically spherical or 
ovoid, with a diameter of about 0.1 mm in humans and sea urchins (whose feed- 
ing larvae are tiny), 1-2 mm in frogs and fishes, and many centimeters in birds 
and reptiles (Figure 21-20). A typical somatic cell, by contrast, has a diameter of 
only about 10-30 urn (Figure 21-21). 

The egg cytoplasm usually contains nutritional reserves in the form of yolk, 
which is rich in lipids, proteins, and polysaccharides and is often contained 
within discrete structures called yolk granules. In some species, a membrane 
encloses each yolk granule. In eggs that develop into large animals outside the 
mother's body, yolk can account for more than 95% of the volume of the cell. In 
mammals, whose embryos are largely nourished by their mothers via the pla- 
centa, there is little, if any, yolk. 

The egg coat is another peculiarity of eggs. It is a specialized form of extra- 
cellular matrix consisting largely of glycoproteins — some secreted by the egg 
and some by surrounding cells. In many species, the major coat is a layer imme- 
diately surrounding the egg plasma membrane; in nonmammalian eggs, such as 
those of sea urchins or chickens, it is called the vitelline layer, whereas in mam- 
malian eggs it is called the zona pellucida (Figure 21-22). This layer protects the 
egg from mechanical damage, and in many eggs it also acts as a species-specific 
barrier to sperm, admitting only those of the same or closely related species. 



typical somatic cell 



© 



human or sea urchin egg 




typical frog or fish egg 



1 mm = 1000|um 

Figure 21-21 The relative sizes of 
various eggs, compared with that of a 
typical somatic cell. 
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Many eggs (including those of mammals) contain specialized secretory vesi- 
cles just under the plasma membrane in the outer region, or cortex, of the egg 
cytoplasm. When a sperm activates the egg, these cortical granules release their 
contents by exocytosis; the contents of the granules alter the egg coat so as to 
help prevent more than one sperm from fusing with the egg. 

Cortical granules are usually distributed evenly throughout the egg cortex. 
In many organisms, however, some egg cytoplasmic components have a strik- 
ingly asymmetrical distribution. Some of these localized components later serve 
as germ cell determinants (see Figure 21-16) or help establish the polarity of the 
embryo, as discussed in Chapter 22. 



Figure 21-22 The zona pellucida. 

(A) Scanning electron micrograph of a 
hamster egg, showing the zona pellucida. 

(B) A scanning electron micrograph of a 
similar egg in which the zona (to which 
many sperm are attached) has been 
peeled back to reveal the underlying 
plasma membrane, which contains 
numerous microvilli. The zona is made 
entirely by the developing oocyte. 
(From D.M. Phillips, 1 Ultrastruct. Res. 
72:1-12, 1980. With permission from 
Elsevier.) 



Eggs Develop in Stages 

A developing egg, or oocyte, differentiates into a mature egg (or ovum) through 
a progressive series of changes. The timing of these changes is coordinated with 
the steps of meiosis, in which the germ cells go through their two final, highly 
specialized, divisions. As discussed earlier, oocytes arrest for a prolonged period 
in meiosis I while they grow in size and progressively differentiate; in many 
cases, after completing meiosis I, they arrest again in metaphase II while await- 
ing fertilization (although they can await fertilization at various other points, 
depending on the species). 

While the details of oocyte development (oogenesis) vary from species to 
species, the general stages are similar, as outlined in Figure 21-23. Primordial 
germ cells migrate to the forming gonad to become oogonia, which proliferate 
by mitosis for a period before beginning meiosis I, at which point they are called 
primary oocytes; this usually occurs before birth in mammals. As discussed ear- 
lier, prior to the onset of meiosis I, the DNA replicates so that each chromosome 
consists of two sister chromatids; at the start of prophase I, the duplicated 
homologous chromosomes pair along their long axes, and crossing-over occurs 
between nonsister chromatids of these paired homologs (see Figure 21-10). 
After these events, the cell remains arrested after diplotene of prophase I for a 
period lasting from a few days to many years, depending on the species. During 
this long arrest period (or, in some cases, at the onset of sexual maturity), the 
primary oocytes synthesize a coat and cortical granules. The large oocytes of 
nonmammalian species also accumulate ribosomes, yolk, glycogen, lipid, and 
the mRNAs that will later direct the synthesis of proteins required for early 
embryonic growth and development. In many of these oocytes, we can observe 
the intensive biosynthetic activities in the structure of the chromosomes, which 
decondense and form lateral loops, taking on a characteristic "lampbrush" 
appearance, signifying that the genes in the loops are being busily transcribed 
(see Figures 4-54 and 4-55). 

The next phase of oocyte development, oocyte maturation, usually does 
not occur until sexual maturity, when hormones stimulate the oocyte. Under 
these hormonal influences, the cell resumes its progress through meiosis I. 
The chromosomes recondense, the nuclear envelope breaks down, the meiotic 
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spindle assembles, and the replicated homologous chromosomes segregate at 
anaphase I into two sets, each containing half the original number of chromo- 
somes. To end meiosis I, the cytoplasm divides asymmetrically to produce two 
cells that differ greatly in size: one is a small polar body, and the other is a large 
secondary oocyte, the precursor of the egg. At this stage, each chromosome is 
still composed of two sister chromatids held together at their centromeres. The 
sister chromatids do not separate until anaphase II, after which the cytoplasm of 
the large secondary oocyte again divides asymmetrically to produce the mature 
egg (or ovum) and a second small polar body, each with a haploid set of single 
chromosomes (see Figure 21-23). Because of these asymmetrical divisions of 
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Figure 21-23 The stages of oogenesis. 

Oogonia develop from primordial germ 
cells (PGCs) that migrate into the 
developing gonad early in embryogenesis. 
For clarity, only one pair of homologous 
chromosomes is shown. After several 
mitotic divisions, oogonia begin meiosis 
and are now called primary oocytes. In 
mammals, primary oocytes are formed 
very early (between 3 and 8 months of 
gestation in the human embryo) and 
remain arrested after diplotene of 
prophase I until the female becomes 
sexually mature. At this point, a small 
number of primary oocytes periodically 
mature under the influence of hormones, 
completing meiosis I to produce secondary 
oocytes, which eventually undergo meiosis 
II to produce mature eggs (ova). The stage 
at which the egg or oocyte is released from 
the ovary and is fertilized varies from 
species to species. In most vertebrates, 
oocyte maturation is arrested at 
metaphase II, and the secondary oocyte 
completes meiosis II only after fertilization. 
All of the polar bodies eventually 
degenerate. In most animals, the 
developing oocyte is surrounded by 
specialized accessory cells that help to 
isolate and nourish it (not shown). 
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their cytoplasm, oocytes maintain their large size despite undergoing the two 
meiotic divisions. Both of the polar bodies are small, and they eventually degen- 
erate. 

In most vertebrates, oocyte maturation proceeds to metaphase of meiosis II, 
at which point they become arrested. At ovulation, the arrested secondary 
oocyte is released from the ovary, ready to be fertilized. If fertilization occurs, the 
block is lifted and the cell completes meiosis, becoming a mature egg. Because 
it is fertilized, it is also called a zygote. 



Oocytes Use Special Mechanisms to Grow to Their Large Size 

A somatic cell with a diameter of 10-20 jiim typically takes about 24 hours to dou- 
ble its mass in preparation for cell division. At this rate of biosynthesis, such a 
cell would take a very long time to reach the thousand-fold greater mass of a 
mammalian egg with a diameter of 100 jam. It would take even longer to reach 
the million-fold greater mass of an insect egg with a diameter of 1000 jam. Yet, 
some insects live only a few days and manage to produce eggs with diameters 
even greater than 1000 jam. Eggs must have special mechanisms for achieving 
their large size. 

One simple strategy for rapid growth is to have extra gene copies in the cell. 
Most of the growth of an oocyte occurs after DNA replication, during the pro- 
longed arrest after diplotene in prophase I, when the diploid chromosome set is 
in duplicate (see Figure 21-23). In this way, it has twice as much DNA available 
for RNA synthesis as does an average somatic cell in the Gi phase of the cell 
cycle. The oocytes of some species go even further to accumulate extra DNA: 
they produce many extra copies of certain genes. As discussed in Chapter 6, the 
somatic cells of most organisms contain 100 to 500 copies of the ribosomal RNA 
genes so as to produce enough ribosomes for protein synthesis. Eggs need even 
greater numbers of ribosomes to support the increased rate of protein synthesis 
required during early embryogenesis, and in the oocytes of many animals the 
ribosomal RNA genes are specifically amplified; some amphibian eggs, for 
example, contain 1 or 2 million copies of these genes. 

Oocytes may also depend partly on the synthetic activities of other cells for 
their growth. Yolk, for example, is usually synthesized outside the ovary and 
imported into the oocyte. In birds, amphibians, and insects, yolk proteins are 
made by liver cells (or their equivalents), which secrete these proteins into the 
blood. Within the ovaries, oocytes use receptor-mediated endocytosis to take up 
the yolk proteins from the extracellular fluid (see Figure 13-46). Nutritive help 
can also come from neighboring accessory cells in the ovary. These can be of two 
types. In some invertebrates, some of the progeny of the oogonia become nurse 
cells instead of becoming oocytes. Cytoplasmic bridges connect these cells to 
the oocyte, allowing macromolecules to pass directly from the nurse cells into 
the oocyte cytoplasm (Figure 21-24). For the insect oocyte, the nurse cells man- 
ufacture many of the products — ribosomes, mRNA, protein, and so on — that 
vertebrate oocytes have to make for themselves. 

The other accessory cells in the ovary that help to nourish developing 
oocytes are ordinary somatic cells called follicle cells, which surround each 
developing oocyte in both invertebrates and vertebrates. They are arranged as 
an epithelial layer around the oocyte (Figure 21-25, and see Figure 21-24), and 
they are connected to each other and to the oocyte by gap junctions, which per- 
mit the exchange of small molecules but not macromolecules (discussed in 
Chapter 19). Although follicle cells are unable to provide the oocyte with pre- 
formed macromolecules through these junctions, they can supply the smaller 
precursor molecules from which macromolecules are made. The critical impor- 
tance of gap -junction communication has been elegantly demonstrated in the 
mouse ovary, where the gap -junction proteins (connexins) involved in connect- 
ing follicle cells to each other are different from those connecting follicle cells 
to the oocyte. If the genes that encode either of these proteins are disrupted in 
mice, both the follicle cells and oocytes fail to develop normally and the female 
mice are sterile. In many species, follicle cells secrete macromolecules that 
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Figure 21-24 Nurse cells and follicle 
cells associated with a Drosophila 
oocyte. The nurse cells and the oocyte 
arise from a common oogonium, which 
gives rise to one oocyte and 15 nurse 
cells (only 7 of which are seen in this 
plane of section). These cells remain 
joined by cytoplasmic bridges, which 
result from incomplete cell division. 
Eventually, the nurse cells dump their 
cytoplasmic contents into the developing 
oocyte and then kill themselves. The 
follicle cells develop independently from 
mesodermal cells. 
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either contribute to the egg coat, are taken up by receptor-mediated endocyto- 
sis into the growing oocyte, or act on egg cell- surface receptors to control the 
spatial patterning and axial asymmetries of the egg (discussed in Chapter 22). 
The communication between oocytes and their follicle cells occurs in both 
directions. The timing of developmental processes in the two sets of cells has to 
be coordinated, and it seems that this depends on signals from the oocyte to the 
follicle cells. Experiments in which young oocytes are combined with older fol- 
licle cells, or vice versa, show that an intrinsic developmental program in the 
oocyte normally controls the rate of follicle cell development. 



Most Human Oocytes Die Without Maturing 

Figure 21-26 outlines the stages in human oocyte development in the ovary. A 
single layer of follicle cells surrounds most of the primary oocytes in newborn 
girls. Such an oocyte, together with its surrounding follicle cells, is called a pri- 
mordial follicle (see Figure 21-25A). Periodically, beginning sometime before 
birth, a small proportion of primordial follicles begin to grow to become develop- 
ing follicles, in which multiple layers of follicle cells (now called granulosa cells) 



Figure 21-25 Electron micrographs of 
developing primary oocytes in the rabbit 
ovary. (A) An early stage of primary oocyte 
development. Neither a zona pellucida nor 
cortical granules have developed, and a 
single layer of flattened follicle cells 
surrounds the oocyte. (B) A more mature 
primary oocyte, which is shown at a sixfold 
lower magnification because it is much 
larger than the oocyte in (A). This oocyte 
has acquired a thick zona pellucida and is 
surrounded by several layers of follicle cells 
(now called granulosa cells) and a basal 
lamina, which isolate the oocyte from the 
other cells in the ovary. The granulosa cells 
are connected to one another and to the 
oocyte by gap junctions. (From The Cellular 
Basis of Mammalian Reproduction [J. Van 
Blerkom and P. Motta eds.]. Baltimore- 
Munich: Urban & Schwarzenberg, 1979.) 
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Figure 21-26 The stages in human oocyte development. Note that for most of its development the oocyte is surrounded by granulosa cells 
(green), which are separated from an outer layer of theca cells (blue) by an intervening basal lamina (black). After ovulation, the emptied follicle 
transforms into an endocrine structure, the corpus luteum, which secretes progesterone to help prepare the uterus for pregnancy. If fertilization 
does not occur, the corpus luteum regresses, and the lining of the uterus is sloughed off during menstruation. 
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surround the growing oocyte (see Figure 21-25B). It is not known what causes 
certain primordial follicles to begin growing. Some of these developing follicles 
go on to acquire a fluid-filled cavity, or antrum, to become antral follicles. 

After puberty, about once each month, the pituitary secretes a surge of folli- 
cle stimulating hormone (FSH), which accelerates the growth of about 10-12 
antral follicles. One of these antral follicles becomes dominant, and, toward the 
middle of the menstral cycle, a surge in FSH and luteinizing hormone (LH) trig- 
gers ovulation: the dominant primary oocyte completes meiosis I, and the 
resulting secondary oocyte arrests at metaphase II; the follicle rapidly enlarges 
and ruptures at the surface of the ovary, releasing the secondary oocyte, still sur- 
rounded by a shell of granulosa cells embedded in a hyaluronan-rich gel-like 
matrix. The released oocyte is triggered to complete meiosis II only if a sperm 
fertilizes it within a day or so. 

It remains a mystery why only a small proportion of the many antral follicles 
present in the ovaries at the time of the FSH surge each month are stimulated to 
accelerate their growth, and why only one of these matures and releases its 
oocyte, while the rest degenerate. Once the selected follicle has matured beyond 
a certain point, some feedback mechanism must operate to ensure that no other 
follicles complete maturation and ovulate during that cycle. Whatever the mech- 
anism, the result is that, during the 40 or so years of a woman's reproductive life, 
only 400 to 500 oocytes will be released. All of the other million or so primary 
oocytes present at birth die without maturing. It is still a puzzle why so many 
oocytes are formed only to die in the ovaries. 



Summary 

Oocytes develop in stages from primordial germ cells (PGCs) that migrate into the 
developing gonad, where they become oogonia. After a period of mitotic prolifera- 
tion, the oogonia begin meiosis I and are now called primary oocytes. Primary 
oocytes remain arrested after diplotene of prophase I for days to years, depending on 
the species. During this arrest period, they grow, synthesize a coat, and accumulate 
ribosomes, mRNAs, and proteins, often enlisting the help of other cells, including 
surrounding follicle cells. Bidirectional signaling between oocytes and their follicle 
cells is required for normal oocyte growth and development. In the process ofhor- 
monally induced oocyte maturation, primary oocytes complete meiosis I to form a 
small polar body and a large secondary oocyte, which proceeds to metaphase of 
meiosis II. In most vertebrates, the secondary oocyte arrests at metaphase II until 
stimulated by fertilization to complete meiosis and begin embryonic development. 



SPERM 

In most species, there are two radically different types of gametes. The egg is 
among the largest cells in an organism, whereas the sperm (spermatozoon, plu- 
ral spermatozoa) is often the smallest. The egg and the sperm are optimized in 
opposite ways for the propagation of the genes they carry. The egg is nonmotile 
and aids the survival of the maternal genes by providing large stocks of raw 
materials for embryo growth and development, together with an effective pro- 
tective wrapping. The sperm, by contrast, is optimized to propagate the paternal 
genes by exploiting this maternal investment: it is usually highly motile and 
streamlined for speed and efficiency in the task of fertilization. Competition 
between sperm is fierce, and the vast majority fail in their mission: of the billions 
of sperm released during a human male's reproductive life, only a few ever man- 
age to fertilize an egg. 



Sperm Are Highly Adapted for Delivering Their DNA to an Egg 

Typical sperm are "stripped -down" cells, equipped with a strong flageHum to 
propel them through an aqueous medium but unencumbered by cytoplasmic 
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organelles such as ribosomes, endoplasmic reticulum, or Golgi apparatus, 
which are unnecessary for the task of delivering the DNA to the egg. Sperm, 
however, contain many mitochondria strategically placed where they can most 
efficiently power the fiagellum. Sperm usually consist of two morphologically 
and functionally distinct regions enclosed by a single plasma membrane: the 
tail, which propels the sperm to the egg and helps it to burrow through the egg 
coat, and the head, which contains a highly condensed haploid nucleus (Figure 
21-27). The DNA in the nucleus is extremely tightly packed, to minimize its vol- 
ume for transport, and transcription is shut down. The chromosomes of many 
sperm lack the histones of somatic cells and are packaged instead with simple, 
highly positively charged proteins called protamines, as well as with sperm-spe- 
cific histones. 

In the head of most animal sperm, closely apposed to the anterior end of 
the nuclear envelope, is a specialized secretory vesicle called the acrosomal 
vesicle. This vesicle contains hydrolytic enzymes that are thought to help the 
sperm penetrate the egg's outer coat. When a sperm contacts the egg coat, the 
contents of the vesicle are released by exocytosis in the acrosome reaction. This 
reaction is required for the sperm to bind to the coat, burrow through it, and 
fuse with the egg. 

The motile tail of a sperm is a long fiagellum, whose central axoneme 
emanates from a basal body situated just behind the nucleus. As described in 
Chapter 16, the axoneme consists of two central singlet microtubules sur- 
rounded by nine evenly spaced microtubule doublets. The fiagellum of some 
sperm (including those of mammals) differs from other flagella in that the 9 + 2 
pattern of microtubules is surrounded by nine outer dense fibers (Figure 21-28). 
The dense fibers are stiff and noncontractile, and they are thought to restrict the 
flexibility of the fiagellum and protect it from shear forces; defects in these fibers 
lead to abnormal sperm morphology and to infertility. The active bending of the 
fiagellum is caused by the sliding of adjacent microtubule doublets past one 
another, driven by dynein motor proteins, which use the energy of ATP hydroly- 
sis to slide the microtubules. The ATP is generated by a large number of highly 
specialized mitochondria that are concentrated in the anterior part of the sperm 
tail (called the midpiece), where the ATP is needed. 
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Figure 21-27 A human sperm. It is 

shown in longitudinal section. 



Sperm Are Produced Continuously in the Mammalian Testis 

In contrast to oocytes, which begin meiosis before birth and remain arrested 
after diplotene of prophase I until a human female reaches puberty, meiosis and 
sperm production (spermatogenesis) do not begin in the testes of human males 
until puberty They then go on continuously in the epithelial lining of very long, 
tightly coiled tubes, called seminiferous tubules. Immature germ cells, called 
spermatogonia (singular, spermatogonium), are located around the outer edge 
of these tubes next to the basal lamina (Figure 21-29A). Most of these cells 
divide a limited number of times by mitosis before they stop proliferating and 
begin meiosis I, at which point they are now called primary spermatocytes; the 
primary spermatocytes give rise to secondary spermatocytes, which differentiate 
into spermatids and ultimately sperm (Figure 21-29B). A small proportion of the 
spermatogonia serve as stem cells, which divide slowly by mitosis throughout 
life, producing daughter cells that either remain stem cells or commit to matu- 
ration. 

The stages of spermatogenesis and their relationship to meiosis are illus- 
trated in Figure 21-30. During prophase I, the paired homologous chromo- 
somes participate in crossing- over. The primary spermatocytes then complete 

Figure 21-28 Drawing of the midpiece of a mammalian sperm as seen in 
cross section in an electron microscope. The core of the fiagellum is 
composed of an axoneme surrounded by nine dense fibers. The axoneme 
consists of two singlet microtubules surrounded by nine microtubule 
doublets. The mitochondria (shown in green) are well placed for providing 
the ATP required for flagellar movement; they are distributed in an unusual 
spiral arrangement around the axoneme (see Figure 21-27). 
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meiosis I to produce two secondary spermatocytes, each containing 22 dupli- 
cated autosomal chromosomes and either a duplicated X or a duplicated Y chro- 
mosome. The two secondary spermatocytes derived from each primary sper- 
matocyte proceed through meiosis II to produce four spermatids, each with a 
haploid number of single chromosomes. The haploid spermatids then undergo 
dramatic morphological changes as they differentiate into sperm, which escape 
into the lumen of the seminiferous tubule. The sperm subsequently pass into 
the epididymis, a coiled tube overlying the testis, where they are stored and 
undergo further maturation. The stored sperm are still not ready to fertilize an 
egg, however; as we discuss later, they undergo further maturation in the female 
genital tract — a process called capacitation. 



Sperm Develop as a Syncytium 

An intriguing feature of spermatogenesis is that once a spermatogonium begins 
to mature, its progeny no longer complete cytoplasmic division (cytokinesis) 
during mitosis and subsequent meiosis. Consequently, large clones of differen- 
tiating daughter cells that have descended from one maturing spermatogonium 
remain connected by cytoplasmic bridges, forming a syncytium (Figure 21-31). 
The cytoplasmic bridges persist until the very end of sperm differentiation, 
when individual sperm are released into the lumen of the seminiferous tubule. 
As a result, mature sperm are produced in synchronous batches in any given 
area of a seminiferous tubule. What is the function of this syncytial arrange- 
ment? 

We saw earlier that oocytes grow and differentiate while containing the 
diploid set of chromosomes in duplicate. Sperm, by contrast, do not grow, and 
they undergo most of their differentiation after their nuclei have completed 
meiosis to become haploid. The presence of cytoplasmic bridges between 
them, however, means that each developing haploid sperm shares a common 
cytoplasm with its neighbors. In this way, it can be supplied with all the gene 



Figure 21-29 Highly simplified drawings 
of a cross section of a seminiferous 
tubule in a mammalian testis. (A) All of 

the stages of spermatogenesis shown 
take place while the developing germ- 
line cells are in intimate association with 
Sertoli cells. Sertoli cells direct sexual 
differentiation along a male pathway. 
They are large cells, extending from the 
basal lamina to the lumen of the 
seminiferous tubule; they are required for 
the survival of the spermatogonia and 
are analogous to follicle cells in the ovary 
(see Figure 21-19). Spermatogenesis also 
depends on testosterone secreted by 
Leydig cells, located between the 
seminiferous tubules. (B) Spermatogonia 
divide by mitosis at the periphery of the 
seminiferous tubule. Some of these cells 
enter meiosis I to become primary 
spermatocytes; they then complete 
meiosis I to become secondary 
spermatocytes. The secondary 
spermatocytes then complete meiosis II 
to become spermatids, which 
differentiate into spermatozoa (sperm) 
and are released into the lumen of the 
tubule (see Figure 21-30). In man, it takes 
a spermatogonium about 24 days from 
the onset of meiosis to emergence as a 
spermatid and another 5 weeks for the 
spermatid to develop into a sperm. 
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products of a complete diploid genome. Developing sperm that carry a Y chro- 
mosome, for example, can be supplied with essential proteins encoded by genes 
on the X chromosome. Thus, the diploid genome directs sperm differentiation 
just as it directs egg differentiation. 

Some of the genes that regulate spermatogenesis have been conserved in 
evolution from flies to humans. The Daz genes, for example, encode RNA-bind- 
ing proteins and are located in a cluster on the human Y chromosome. The clus- 
ter is found to be deleted in a substantial proportion of infertile men, many of 
whom cannot make sperm. A Drosophila gene that is homologous to the human 
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Figure 21-30 The stages of 
spermatogenesis. Spermatogonia 
develop from primordial germ cells 
(PGCs) that migrate into the developing 
gonad early in embryogenesis. When the 
animal becomes sexually mature, the 
spermatogonia begin to proliferate 
rapidly by mitosis. Some retain the 
capacity to divide indefinitely (as stem- 
cell spermatogonia). Others (maturing 
spermatogonia) undergo a limited 
number of mitotic division cycles before 
beginning meiosis to become 
spermatocytes, which eventually become 
haploid spermatids and then sperm. 
Spermatogenesis differs from oogenesis 
(see Figure 21-23) in several ways. 
(1) New cells enter meiosis continually 
from the time of puberty. (2) Each cell 
that begins meiosis gives rise to four 
mature gametes rather than one. 
(3) Mature sperm form by an elaborate 
process of cell differentiation that begins 
after meiosis is complete. (4) About twice 
as many cell divisions occur in the 
production of a sperm as in the 
production of an egg; in a mouse, for 
example, it is estimated that on average 
about 56 divisions occur from zygote to 
mature sperm, and about 27 divisions 
occur from zygote to mature egg. 
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Figure 21-31 Cytoplasmic bridges in 
developing sperm cells and their 
precursors. The progeny of a single 
maturing spermatogonium remain 
connected to one another by cytoplasmic 
bridges throughout their differentiation 
into mature sperm. For the sake of 
simplicity, only two connected maturing 
spermatogonia are shown entering 
meiosis, eventually to form eight 
connected haploid spermatids. In fact, 
the number of connected cells that go 
through meiosis and differentiate 
synchronously is very much larger than 
shown here. Note that in the process of 
differentiating, most of the spermatid 
cytoplasm is discarded as residual bodies, 
which are phagocytosed by Sertoli cells. 



Daz genes is similarly essential for spermatogenesis in the fly: Daz-deficient 
male flies are infertile because they cannot make sperm, but, remarkably, they 
can be cured by a human Daz transgene. RNA-binding proteins are especially 
important in spermatogenesis, because many of the genes expressed in the 
sperm lineage are regulated at the level of RNA translation. 



Summary 

A sperm is usually a small, compact cell, highly specialized for the task of fertilizing an 
egg. Whereas in women a large pool of oocytes is produced before birth, in men sper- 
matogonia only begin to enter meiosis to produce spermatocytes (and sperm) after sex- 
ual maturation, and they continue to do so from then on. Each diploid primary sper- 
matocyte gives rise to four mature haploid sperm. The process of sperm differentiation 
occurs after meiosis is complete, requiring five weeks in humans. Because the matur- 
ing spermatogonia and spermatocytes fail to complete cytokinesis, however, the 
progeny of a single maturing spermatogonium develop as a large syncytium. Thus, the 
products encoded by both parental chromosomes direct sperm differentiation, even 
though each sperm nucleus is haploid. 
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FERTILIZATION 

Once released, egg and sperm alike are destined to die within minutes or hours 
unless they find each other and fuse in the process of fertilization. Through fer- 
tilization, the egg and sperm are saved: the egg is activated to begin its develop- 
mental program, and the haploid nuclei of the two gametes come together to 
form the diploid genome of a new organism. Fertilization was originally studied 
most intensively in marine invertebrates such as sea urchins and starfish, where 
fertilization occurs in seawater after the release of huge numbers of both sperm 
and eggs. Such external fertilization is easier to study than the internal fertiliza- 
tion of mammals, which normally occurs in the female reproductive tract after 
mating. In the late 1950s, however, it became possible to fertilize mammalian 
eggs in vitro, opening the way to an analysis of the cellular and molecular events 
in mammalian fertilization. 

In this section, we focus on mammalian fertilization. We begin by consider- 
ing the maturation of sperm that occurs during their passage through the female 
genital tract. We then discuss the binding of sperm to the egg coat (zona pellu- 
cida), which induces the acrosome reaction, required for the sperm to burrow 
through the zona and fuse with the egg. We next consider the binding of the 
sperm to the egg plasma membrane and its subsequent fusion with this mem- 
brane. After discussing how the fusion of a sperm activates the egg and how the 
haploid nuclei of the two gametes come together in the zygote to complete fer- 
tilization, we briefly consider the burgeoning field of assisted reproductive tech- 
nology, which has revolutionized the treatment of human infertility and opened 
up new ways of manipulating the reproductive process. 

Ejaculated Sperm Become Capacitated in the Female Genital 
Tract 

Of the 300,000,000 or so human sperm ejaculated during coitus, only about 200 
reach the site of fertilization in the oviduct. Once a sperm finds an egg, it must 
first migrate through the layers of granulosa cells that surround the egg and then 
bind to and cross the zona pellucida. Finally, it must bind to and fuse with the 
egg plasma membrane. 

Ejaculated mammalian sperm are initially not competent to accomplish 
any of these tasks. They must first be modified by conditions in the female 
reproductive tract. Because it is required for sperm to acquire the capacity to 
fertilize an egg, the process is called capacitation. Capacitation takes about 5-6 
hours in humans and is completed only when the sperm arrive in the oviduct. 
The sperm undergo extensive biochemical and functional changes, including 
changes in glycoproteins, lipids, and ion channels in the sperm plasma mem- 
brane and a large change in the resting potential of this membrane (the mem- 
brane potential moves to a more negative value so that the membrane becomes 
hyperpolarized). Capacitation is also associated with an increase in cytosolic 
pH, tyrosine phosphorylation of various sperm proteins, and the unmasking of 
cell-surface receptors that help bind the sperm to the zona pellucida. Capaci- 
tation alters two crucial aspects of sperm behavior: it greatly increases the 
motility of the flagellum, and it makes the sperm capable of undergoing the 
acrosome reaction. 

Capacitation can occur in vitro in the appropriate culture medium and is 
usually a required part of in vitro fertilization. Three critical components are 
needed in the medium, all of which are normally in high concentration in the 
female genital tract — albumin, Ca 2+ , and HC03~. The albumin protein helps 
extract cholesterol from the plasma membrane, increasing the ability of this 
membrane to fuse with the acrosome membrane during the acrosome reac- 
tion. The Ca 2+ and HCO3" enter the sperm and directly activate a soluble 
adenylyl cyclase enzyme in the cytosol to produce cyclic AMP (discussed in 
Chapter 15), which helps to initiate many of the changes associated with 
capacitation. 
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Capacitated Sperm Bind to the Zona Pellucida and Undergo an 
Acrosome Reaction 

During ovulation, mammalian eggs are released from the ovary into the peri- 
toneal cavity next to the entrance to the oviduct, into which they are rapidly 
swept. They are covered with several layers of granulosa cells embedded in an 
extracellular matrix that is rich in hyaluronic acid (discussed in Chapter 19). The 
granulosa cells may help the egg get picked up into the oviduct, and they may 
also secrete unidentified chemical signals that attract sperm to the egg. 

On encountering an egg, a capacitated sperm first must penetrate the layers 
of granulosa cells, making use of a hyaluronidase enzyme on the surface of the 
sperm. It can then bind to the zona pellucida (see Figure 21-22). The zona usually 
acts as a barrier to fertilization across species, and removing it often eliminates 
this barrier. Human sperm, for example, will fertilize hamster eggs from which the 
zona has been removed with specific enzymes; not surprisingly, such hybrid 
zygotes fail to develop. Zona-free hamster eggs are sometimes used in infertility 
clinics to assess the fertilizing capacity of human sperm in vitro (Figure 21-32). 

The zona pellucida of most mammalian eggs is composed mainly of three 
glycoproteins, all of which are produced exclusively by the growing oocyte. Two 
of them, ZP2 and ZP3, assemble into long filaments, while the other, ZP1, cross- 
links the filaments into a three-dimensional network. The ZP3 protein is crucial: 
female mice with a disrupted Zp3 gene produce eggs that lack a zona and are 
infertile. O- linked oligosaccharides on ZP3 seem to be at least partly responsible 
for the species-specific binding of sperm to the zona. The binding of sperm to 
the zona is complex, however, and involves both ZP3-dependent and ZP3-inde- 
pendent mechanisms and a variety of proteins on the sperm surface. 

The zona induces sperm to undergo the acrosome reaction, in which the 
contents of the acrosome are released by exocytosis (Figure 21-33). The acro- 
some reaction is required for normal fertilization, as it exposes various 
hydrolytic enzymes that are believed to help the sperm tunnel through the zona 
pellucida, and it alters the sperm surface so that it can bind to and fuse with the 
plasma membrane of the egg, as we discuss below. In vitro, purified ZP3 can trig- 
ger the acrosome reaction, possibly by activating a lectin-like receptor on the 
sperm surface, thought to be a transmembrane form of the enzyme galactosyl- 
transferase. Receptor activation leads to an increase in Ca 2+ in the sperm 
cytosol, which initiates the exocytosis. 
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Figure 21-32 Scanning electron 
micrograph of a human sperm 
contacting a hamster egg. The zona 
pellucida of the egg has been removed, 
exposing the plasma membrane, which 
contains numerous microvilli. The ability 
of an individual's sperm to penetrate 
hamster eggs is used as an assay of male 
fertility; penetration of more than 
10-25% of the eggs is considered to be 
normal. (Courtesy of David M. Phillips.) 



The Mechanism of Sperm-Egg Fusion Is Still Unknown 

After a sperm has undergone the acrosome reaction and penetrated the zona 
pellucida, it binds to the egg plasma membrane overlying the tips of microvilli 
on the egg surface (see Figure 21-32). The sperm binds initially by its tip and 
then by its side (see Figure 21-33). Neighboring microvilli on the egg surface 
rapidly elongate and cluster around the sperm to ensure that it is held firmly so 
that it can fuse with the egg. After fusion, all of the sperm contents are drawn 
into the egg, as the microvilli are resorbed. 

The molecular mechanisms responsible for sperm-egg binding and fusion 
are largely unknown, although, after a number of false starts, two membrane 
proteins have been shown to be required for the fusion. One is a sperm- specific 
transmembrane protein of the immunoglobulin superfamily called Izumo (after 
a Japanese shrine dedicated to marriage). It becomes exposed on the surface of 
mouse and human sperm during the acrosome reaction. Anti-Izumo antibodies 
block the fusion, and Izumo -deficient mouse sperm fail to fuse with normal 
eggs, but it is still unknown how Izumo promotes sperm-egg fusion. The only 
protein on the egg surface demonstrated to be required for fusion with a sperm 
is the CD9 protein, which is a member of the tetraspanin family, so-called 
because these proteins have four membrane-spanning segments. Normal 
sperm fail to fuse with CD9-deficient mouse eggs, indicating that sperm-egg 
fusion depends on CD9, but it is not known how. CD9 does not act alone on the 
egg surface to promote fusion: normal sperm also fail to fuse with eggs treated 
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with an enzyme that removes proteins attached to the plasma membrane by a 
glycosylphosphatidylinositol (GPI) anchor (discussed in Chapter 10), indicating 
that one or more GPI-linked proteins is also required for fusion, although the 
relevant protein or proteins have yet to be identified. 



Sperm Fusion Activates the Egg by Increasing Ca 2+ in the Cytosol 

Fusion with a sperm activates the egg, causing the cortical granules to release 
their contents by exocytosis, a process called the cortical reaction. Meiosis, 
which was arrested in metaphase II, resumes, producing a second polar body, 
and a zygote, which begins to develop. 

An increase in Ca 2+ in the cytosol of the fertilized egg triggers all of these 
events. <AGGA> If the Ca 2+ concentration in the cytosol of an unfertilized egg is 
increased artificially — either directly by an injection of Ca 2+ or indirectly by the 
use of a Ca 2+ -carrying ionophore (discussed in Chapter 11) — the eggs of all ani- 
mals so far tested, including mammals, are activated. Conversely, preventing the 
increase in Ca 2+ by injecting the Ca 2+ chelator EGTA inhibits activation of the egg 
in response to fertilization. 

When the sperm fuses with the egg plasma membrane in the normal way, it 
causes a local increase in cytosolic Ca 2+ , which spreads through the cell in a wave 
(see Figure 15-40). The wave propagates by positive feedback: the increase in 
cytosolic Ca 2+ causes Ca 2+ channels to open, allowing still more Ca 2+ to enter the 
cytosol. The initial wave of Ca 2+ release is usually followed within a few minutes 
by Ca 2+ oscillations (discussed in Chapter 15), which persist for several hours. 

The fused sperm triggers the Ca 2+ wave and oscillations by bringing a factor 
into the egg cytosol. An injection of an intact sperm, a sperm head, or a sperm 
extract into an egg does the same thing. All of these treatments increase the con- 
centration of inositol 1,4,5-trisphosphate (IP3), which releases Ca 2+ from the 
endoplasmic reticulum and initiates the Ca 2+ wave and oscillations (discussed 
in Chapter 15). A strong candidate for the critical factor that mammalian sperm 
introduce into the egg is a sperm- specific form of phospholipase C (PLCQ, 
which directly cleaves phosphoinositol 4,5-bisphosphate (PI(4,5)P2) to produce 
IP3 (and diacyglycerol) (see Figure 15-39). 
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Figure 21-33 The acrosome reaction 
that occurs when a mammalian sperm 
fertilizes an egg. In mice, the zona 
pellucida is about 6 jam thick, and sperm 
tunnel through it at a rate of about 
1 jim/min. 
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Figure 21-34 How the cortical reaction 
in a mouse egg is thought to prevent 
additional sperm from entering the egg. 

<TGAC> The released contents of the 
cortical granules inactivate ZP3 so it can 
no longer bind to the sperm plasma 
membrane. They also partly cleave ZP2, 
hardening the zona pellucida so that 
sperm cannot penetrate it. Together, 
these changes provide a block to 
polyspermy. 
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The Cortical Reaction Helps Ensure That Only One Sperm 
Fertilizes the Egg 

Although many sperm can bind to an egg, normally only one fuses with the egg 
plasma membrane and injects its cytosol, nucleus, and other organelles into the 
egg cytoplasm. If more than one sperm fuses — a condition called polyspermy — 
extra or multipolar mitotic spindles are formed, resulting in faulty segregation of 
chromosomes during the first mitotic cell divisions; aneuploid cells are pro- 
duced, and development usually stops. 

Two mechanisms operate to ensure that only one sperm fertilizes the egg. 
First, a change in the egg plasma membrane caused by the fusion of the first 
sperm prevents other sperm from fusing. In sea urchin eggs, the change is a 
rapid depolarization of the egg membrane; in mammalian eggs, the mechanism 
is not known. The second block to polyspermy is provided by the egg cortical 
reaction, which releases various enzymes that change the structure of the zona 
pellucida so that sperm cannot bind to or penetrate it. Among the changes that 
occur in the mammalian zona is the inactivation of ZP3 so that it can no longer 
bind sperm or induce an acrosome reaction; in addition, ZP2 is cleaved, which 
somehow helps to make the zona impenetrable (Figure 21-34). 
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The Sperm Provides Centrioles as Well as Its Genome to the 
Zygote 

Once fertilized, the egg is called a zygote. Fertilization is not complete, however, 
until the two haploid nuclei (called pronuclei)-one from the egg, the other from 
the sperm — have come together and combined their chromosomes into a single 
diploid nucleus. In fertilized mammalian eggs, the two pronuclei do not fuse 
directly as they do in many other species. They approach each other but remain 
distinct until after the membrane of each pronucleus has broken down in prepa- 
ration for the zygote's first mitotic division (Figure 21-35). 

In most animals, including humans, the sperm contributes more than its 
genome to the zygote. It also donates its centrioles — structures that are lacking 
in unfertilized human eggs. The sperm centrioles enter the egg along with the 
sperm nucleus and tail, and a centrosome forms around them. In humans, the 
centrosome duplicates, and the two resulting centrosomes then help organize 
the assembly of the first mitotic spindle in the zygote (Figure 21-36, and see Fig- 
ure 21-35). This explains why polyspermy, in which several sperm contribute 
their centrioles to the egg, causes extra or multipolar mitotic spindles to form. 



Figure 21-35 The coming together of 
the sperm and egg pronuclei after 
mammalian fertilization. The pronuclei 
migrate toward the center of the egg. 
When they come together, their nuclear 
envelopes interdigitate.The centrosome 
duplicates, the nuclear envelopes break 
down, and the chromosomes of both 
gametes are eventually integrated into a 
single mitotic spindle, which organizes 
the first cleavage division of the zygote. 
(Adapted from drawings and electron 
micrographs provided by Daniel Szollosi.) 



IVF and ICSI Have Revolutionized the Treatment of Human 
Infertility 

About 10% of human couples have reduced fertility, such that the female part- 
ner fails to become pregnant after 12-18 months of unprotected sex. In roughly 
half of these cases it is the male that is the problem, and in half it is the female. 
Although there are numerous reasons for reduced fertility in both males and 
females, in the great majority of cases some form of assisted reproductive tech- 
nology can solve the problem. 

The first major breakthrough in the treatment of infertility occurred in 1978, 
with the birth of Louise Brown, the first child produced by in vitro fertilization 
(IVF). Before this success, there were heated debates about the ethics and safety 
of IVF — remarkably similar to the current ethical debates about the production 
and use of human embryonic stem (ES) cells. IVF is now a routine procedure and 
has produced more than a million children. To begin the process, the woman is 
usually pretreated with hormones to stimulate the simultaneous maturation of 
multiple oocytes. Just before their release by ovulation, the eggs are harvested 
from the ovary (using a long needle inserted through the vagina) and are fertil- 
ized in a culture dish with sperm from the man. After a few days in culture, 2 or 
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Figure 21-36 Immunofluorescence 
micrographs of human sperm and egg 
pronuclei coming together after in vitro 
fertilization. Spindle microtubules are 
stained in green with anti-tubulin 
antibodies, and DNA is labeled in blue 
with a DNA stain. (A) A meiotic spindle in 
a mature unfertilized secondary oocyte. 
(B) A fertilized egg extruding its second 
polar body about 5 hours after fusion 
with a sperm. The sperm head (left) has 
nucleated an array of microtubules. The 
egg and sperm pronuclei are still far 
apart. (C) The two pronuclei have come 
together. (D) By 16 hours after fusion with 
a sperm, the centrosome that entered the 
egg with the sperm has duplicated, and 
the daughter centrosomes have 
organized a bipolar mitotic spindle. The 
chromosomes of both pronuclei are 
aligned at the metaphase plate of the 
spindle. As indicated by the arrows in (C) 
and (D), the sperm tail is still associated 
with one of the centrosomes. (From 
C. Simerly et al., Nat Med. 1 :47-53, 1 995. 
With permission from Macmillan 
Publishers Ltd.) 
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3 of the best-looking early embryos are transferred with a catheter into the 
woman's uterus; the remaining embryos are usually kept frozen in liquid nitro- 
gen, for further implantations, if necessary The main complication of IVF is 
multiple pregnancies, which occur in over 30% of cases, compared with about 
2% in unassisted pregnancies. 

The IVF procedure just described has enabled many previously infertile 
women to produce normal children. It does not, however, solve the problem for 
infertile men who usually produce too few or abnormal sperm. A second break- 
through, which occurred in 1992, provided the solution for most such men. In 
this modification of IVF, called intracytoplasmic sperm injection (ICSI), an egg 
is fertilized by injecting a single sperm into it (Figure 21-37). This strategy elim- 
inates the need for large numbers of motile sperm and bypasses many of the 
hurdles that a sperm normally has to clear to fertilize an egg, including capaci- 
tation, swimming to the egg, undergoing an acrosomal reaction, burrowing 
through the zona pellucida, and fusing with the egg plasma membrane. ICSI has 
a success rate of better than 50% and has produced more than 100,000 children. 

In addition to revolutionizing the treatment of infertility, IVF has opened up 
many new possibilities for manipulating the reproductive process. It has, for 
example, made it possible for parents carrying a defective gene to avoid passing 
the gene on to their children, by screening IVF embryos for the gene before 
implanting them into the uterus. 

As discussed earlier, in vitro techniques for handling mammalian eggs have 
made it possible to produce clones of many types of mammals by transferring 
the nucleus of a somatic cell from the animal to be cloned into an unfertilized 
egg that has had its own nucleus removed or destroyed. It is not an easy proce- 
dure; the success rate is low, and it is still uncertain whether a human could be 
cloned in the same way. Moreover, there are serious ethical arguments about 
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Figure 21-37 Intracytoplasmic sperm 
injection (ICSI). Light micrograph of a 
human secondary oocyte being held 
with a suction pipette (on the left) and 
injected with a single human sperm 
through a glass needle. The zona 
pellucida surrounds the egg and the 
polar body. (Courtesy of Reproductive 
Biology Associates, Atlanta, Georgia.) 
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Figure 21-38 The difference between reproductive cloning and the preparation of "personalized" embryonic stem cells. 

In both cases, one produces a reconstructed embryo by removing (or destroying) the nucleus from an unfertilized egg and 
replacing it with the nucleus of a somatic cell from the animal to be cloned. The reconstructed egg is activated to develop 
by an electric shock. In reproductive cloning, the embryo that develops in culture is transplanted into the uterus of a foster 
mother and develops into a cloned animal. In the preparation of personalized embryonic stem (ES) cells (sometimes called 
therapeutic cloning), by contrast, the embryo is used to produce ES cells in culture, which can then be used to produce 
various specialized cell types for the treatment of the individual that provided the somatic nucleus; because the specialized 
cells produced by these ES cells are genetically the same as the donor of the somatic nucleus, they will not be 
immunologically rejected. 

whether one should ever attempt to clone a human. There is general agreement, 
however, that it should not be attempted with existing technology, as the likeli- 
hood of producing an abnormal child is high; indeed, many countries and 
American states have made the attempt illegal. 

Such reproductive cloning, however, should not be confused with therapeu- 
tic cloning, in which the early embryo produced in vitro from a such a reconsti- 
tuted zygote is not implanted into a uterus to produce a new individual, but is 
instead used to make ES cells that are genetically the same as the donor of the 
somatic nucleus (Figure 21-38). Various types of specialized cells produced 
from such "personalized" ES cells could then be used to treat the donor, avoid- 
ing the problem of immunological rejection associated with using cells derived 
from genetically dissimilar ES cells. Clearly, societies will have to make some dif- 
ficult decisions about how far they are willing to go in exploiting these new tech- 
nologies to manipulate the reproductive process for the potential benefit of 
individuals. Alternatively, it might be possible in the future to produce personal- 
ized ES-like cells in ways that bypass these ethical dilemmas: in recent experi- 
ments, for example, genetic engineering was used to express in cultured mouse 
fibroblasts a number of gene regulatory proteins normally expressed in ES cells; 
when four such transgenes were expressed simultaneously, the fibroblasts 
behaved much like ES cells. 

Fertilization marks the beginning of one of the most remarkable phenom- 
ena in all of biology — the process of embryogenesis, in which the zygote devel- 
ops into a new individual. This is the subject of the next chapter. 

Summary 

Mammalian fertilization normally begins when a sperm, which has undergone 
capacitation in the female genital tract, binds to the zona pellucida surrounding an 
egg in the oviduct. This induces the sperm to undergo an acrosome reaction, releasing 
the contents of the acrosomal vesicle, which are thought to help the sperm to digest its 
way through the zona. The acrosome reaction is also required for the sperm to bind to 
and fuse with the egg plasma membrane. The fusion of the sperm with the egg induces 
a Ca 2+ wave and oscillations in the eggcytosol, which activate the egg. The activation 
includes the egg cortical reaction, in which cortical granules release their contents, 
which alter the zona pellucida so that other sperm cannot bind to or penetrate it. The 
Ca 2+ signal also triggers the development of the zygote, which begins after the two hap- 
loid pronuclei have come together and their chromosomes have aligned on a single 
mitotic spindle, which mediates the first mitotic division of the zygote. Many previ- 
ously infertile couples can now reproduce thanks to IVF and ICSI. 
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Development of Multicellular 
Organisms 




An animal or plant starts its life as a single cell — a fertilized egg. During devel- 
opment, this cell divides repeatedly to produce many different cells in a final 
pattern of spectacular complexity and precision. Ultimately, the genome deter- 
mines the pattern, and the puzzle of developmental biology is to understand 
how it does so. 

The genome is normally identical in every cell; the cells differ not because 
they contain different genetic information, but because they express different 
sets of genes. This selective gene expression controls the four essential processes 
by which the embryo is constructed: (1) cell proliferation, producing many cells 
from one, (2) cell specialization, creating cells with different characteristics at 
different positions, (3) cell interactions, coordinating the behavior of one cell 
with that of its neighbors, and (4) cell movement, rearranging the cells to form 
structured tissues and organs (Figure 22-1). 

In a developing embryo, all these processes are happening at once, in a 
kaleidoscopic variety of different ways in different parts of the organism. To 
understand the basic strategies of development, we have to narrow our focus. In 
particular, we must understand the course of events from the standpoint of the 
individual cell and the way the genome acts within it. There is no commanding 
officer standing above the fray to direct the troops; each of the millions of cells 
in the embryo has to make its own decisions, according to its own copy of the 
genetic instructions and its own particular circumstances. 

The complexity of animals and plants depends on a remarkable feature of 
the genetic control system. Cells have a memory: the genes a cell expresses and 
the way it behaves depend on the cell's past as well as its present environment. 
The cells of your body — the muscle cells, the neurons, the skin cells, the gut 
cells, and so on — maintain their specialized characters not because they contin- 
ually receive the same instructions from their surroundings, but because they 
retain a record of signals their ancestors received in early embryonic develop- 
ment. The molecular mechanisms of cell memory have been introduced in 
Chapter 7. In this chapter we shall encounter its consequences. 



UNIVERSAL MECHANISMS OF ANIMAL 
DEVELOPMENT 

There are about ten million species of animals, and they are fantastically varied. 
One would no more expect the worm, the flea, the eagle and the giant squid all 
to be generated by the same developmental mechanisms, than one would sup- 
pose that the same methods were used to make a shoe and an airplane. Some 
similar abstract principles might be involved, perhaps, but surely not the same 
specific molecules? 

One of the most astonishing revelations of the past 10 or 20 years has been 
that our initial suspicions are wrong. In fact, much of the basic machinery of 
development is essentially the same, not just in all vertebrates but in all the 
major phyla of invertebrates too. Recognizably similar, evolutionarily related 
molecules define our specialized cell types, mark the differences between body 
regions, and help create the body's pattern. Homologous proteins are often 
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functionally interchangeable between very different species. A mouse protein 
produced artificially in a fly can often perform the same function as the fly's own 
version of that protein, and vice versa, successfully controlling the development 
of an eye, for example, or the architecture of the brain (Figure 22-2). Thanks to 
this underlying unity of mechanism, as we shall see, developmental biologists 
are now well on their way toward a coherent understanding of animal develop- 
ment. 

Plants are a separate kingdom: they have evolved their multicellular organi- 
zation independently of animals. For their development too, a unified account 
can be given, but it is different from that for animals. Animals will be our main 
concern in this chapter, but we shall return to plants briefly at the end. 

We begin by reviewing some of the basic general principles of animal devel- 
opment and by introducing the seven animal species that developmental biolo- 
gists have adopted as their chief model organisms. 



Figure 22-1 The four essential processes 
by which a multicellular organism is 
made: cell proliferation, cell 
specialization, cell interaction, and 
cell movement. 
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Figure 22-2 Homologous proteins 
functioning interchangeably in the 
development of mice and flies. 

(A) A fly protein used in a mouse. The 
DNA sequence from Drosophila coding 
for the Engrailed protein (a gene 
regulatory protein) can be substituted for 
the corresponding sequence coding for 
the Engrailed-1 protein of the mouse. 
Loss of Engrailed-1 in the mouse causes a 
defect in its brain (the cerebellum fails to 
develop); the Drosophila protein acts as 
an efficient substitute, rescuing the 
transgenic mouse from this deformity. 

(B) A mollusk protein used in a fly. The 
Eyeless protein controls eye development 
in Drosophila, and when misexpressed 
can cause an eye to develop in an 
abnormal site, such as a leg. The 
homologous protein, Pax6, from a mouse, 
a squid, or practically any animal 
possessing eyes, when similarly 
misexpressed in a transgenic fly, has the 
same effect. The scanning electron 
micrographs show a patch of eye tissue 
on the leg of a fly resulting from 
misexpression of Drosophila Eyeless (top) 
and of squid Pax6 (bottom). The right 
panel shows, at lower magnification, the 
entire eye of a normal Drosophila, for 
comparison. (A, from M.C. Hanks et al., 
Development 125:4521-4530, 1998. With 
permission from The Company of 
Biologists; B, from S.l.Tomarev et al., Proc. 
Natl Acad. Sci. U.S.A. 94:2421-2426, 1997. 
With permission from National Academy 
of Sciences and courtesy of Kevin Moses.) 
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Animals Share Some Basic Anatomical Features 

The similarities between animal species in the genes that control development 
reflect the evolution of animals from a common ancestor in which these genes 
were already present. Although we do not know what it looked like, the common 
ancestor of worms, mollusks, insects, vertebrates, and other complex animals 
must have had many differentiated cell types that would be recognizable to us: 
epidermal cells, for example, forming a protective outer layer; gut cells to absorb 
nutrients from ingested food; muscle cells to move; neurons and sensory cells to 
control the movements. The body must have been organized with a sheet of skin 
covering the exterior, a mouth for feeding and a gut tube to contain and process 
the food — with muscles, nerves and other tissues arranged in the space between 
the external sheet of skin and the internal gut tube. 

These features are common to almost all animals, and they correspond to a 
common basic anatomical scheme of development. The egg cell — a giant store- 
house of materials — divides, or cleaves, to form many smaller cells. <ATTT> 
These cohere to create an epithelial sheet facing the external medium. Much of 
this sheet remains external, constituting the ectoderm — the precursor of the 
epidermis and of the nervous system. A part of the sheet becomes tucked into 
the interior to form endoderm — the precursor of the gut and its appendages, 
such as lung and liver. Another group of cells move into the space between ecto- 
derm and endoderm, and form the mesoderm — the precursor of muscles, con- 
nective tissues, and various other components. This transformation of a simple 
ball or hollow sphere of cells into a structure with a gut is called gastrulation 
(from the Greek word for a belly), and in one form or another it is an almost uni- 
versal feature of animal development. Figure 22-3 illustrates the process as it is 
seen in the sea urchin. 

Evolution has diversified upon the molecular and anatomical fundamentals 
that we describe in this chapter to produce the wonderful variety of present-day 
species. But the underlying conservation of genes and mechanisms means that 
studying the development of one animal very often leads to general insights into 
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Figure 22-3 Sea urchin gastrulation. A fertilized egg divides to produce a blastula — a hollow 
sphere of epithelial cells surrounding a cavity. Then, in the process of gastrulation, some of 
the cells tuck into the interior to form the gut and other internal tissues. (A) Scanning electron 
micrograph showing the initial intucking of the epithelium. (B) Drawing showing how a group 
of cells break loose from the epithelium to become mesoderm. (C) These cells then crawl over 
the inner face of the wall of the blastula. (D) Meanwhile, epithelium is continuing to tuck 
inward to become endoderm. (E and F) The invaginating endoderm extends into a long gut 
tube. (G) The end of the gut tube makes contact with the wall of the blastula at the site of the 
future mouth opening. Here the ectoderm and endoderm will fuse and a hole will form. 
(H) The basic animal body plan, with a sheet of ectoderm on the outside, a tube of endoderm 
on the inside, and mesoderm sandwiched between them. (A, from R.D. Burke et al., Dev. Biol. 
146:542-557, 1991. With permission from Academic Press; B-G, after LWolpert and 
T. Gustafson, Endeavour 26:85-90, 1967. With permission from Elsevier.) 
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the development of many other types of animals. As a result, developmental 
biologists today, like cell biologists, have the luxury of addressing fundamental 
questions in whatever species offers the easiest path to an answer. 

Multicellular Animals Are Enriched in Proteins Mediating Cell 
Interactions and Gene Regulation 

Genome sequencing reveals the extent of molecular similarities between 
species. The nematode worm Caenorhabditis elegans, the fly Drosophila 
melanogaster, and the vertebrate Homo sapiens are the first three animals for 
which a complete genome sequence was obtained. In the family tree of animal 
evolution, they are very distant from one another: the lineage leading to the ver- 
tebrates is thought to have diverged from that leading to the nematodes, insects 
and mollusks more than 600 million years ago. Nevertheless, when the 20,000 
genes of C. elegans, the 14,000 genes of Drosophila, and the 25,000 genes of the 
human are systematically compared with one another, it is found that about 
50% of the genes in each of these species have clearly recognizable homologs in 
one or both of the other two species. In other words, recognizable versions of at 
least 50% of all human genes were already present in the common ancestor of 
worms, flies, and humans. 

Of course, not everything is conserved: there are some genes with key roles 
in vertebrate development that have no homologs in the genome of C. elegans or 
Drosophila, and vice versa. However, a large proportion of the 50% of genes that 
lack identifiable homologs in other phyla may do so simply because their func- 
tions are of minor importance. Although these nonconserved genes are tran- 
scribed and well represented in cDNA libraries, studies of DNA and amino acid 
sequence variability in and between natural populations indicate that these 
genes are unusually free to mutate without seriously harming fitness; when they 
are artificially inactivated, the consequences are not so often severe as for genes 
with homologs in distantly related species. Because they are free to evolve so 
rapidly, a few tens of millions of years may be enough to obliterate any family 
resemblance or to permit loss from the genome. 

The genomes of different classes of animals differ also because, as discussed 
in Chapter 1, there are substantial variations in the extent of gene duplication: 
the amount of gene duplication in the evolution of the vertebrates has been par- 
ticularly large, with the result that a mammal or a fish often has several 
homologs corresponding to a single gene in a worm or a fly. 

Despite such differences, to a first approximation we can say that all these ani- 
mals have a similar set of proteins at their disposal for their key functions. In other 
words, they construct their bodies using roughly the same molecular kit of parts. 

What genes, then, are needed to produce a multicellular animal, beyond 
those necessary for a solitary cell? Comparison of animal genomes with that of 
budding yeast — a unicellular eucaryote — suggests that two classes of proteins are 
especially important for multicellular organization. The first class is that of the 
transmembrane molecules used for cell adhesion and cell signaling. As many as 
2000 C. elegans genes encode cell surface receptors, cell adhesion proteins, and 
ion channels that are either not present in yeast or present in much smaller num- 
bers. The second class is that of gene regulatory proteins: these DNA-binding 
proteins are much more numerous in the C. elegans genome than in yeast. For 
example, the basic helix-loop-helix family has 41 members in C. elegans, 84 in 
Drosophila, 131 in humans and only 7 in yeast, and other families of regulators of 
gene expression are also dramatically overrepresented in animals as compared to 
yeast. Not surprisingly, these two classes of proteins are central to developmental 
biology: as we shall see, the development of multicellular animals is dominated 
by cell-cell interactions and by differential gene expression. 

As discussed in Chapter 7, micro-RNAs also play a significant part in con- 
trolling gene expression during development, but they seem to be of secondary 
importance by comparison with proteins. Thus a mutant zebrafish embryo 
that completely lacks the Dicer enzyme, which is required for production of 
functional miRNAs, will still begin its development almost normally, creating 
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specialized cell types and a more-or-less correctly organized body plan, before 
abnormalities become severe. 



Regulatory DNA Defines the Program of Development 

A worm, a fly, a mollusc and a mammal share many of the same essential cell 
types, and they do all have a mouth, a gut, a nervous system and a skin; but 
beyond a few such basic features they seem radically different in their body 
structure. If the genome determines the structure of the body and these animals 
all have such a similar collection of genes, how can they be so different? 

The proteins encoded in the genome can be viewed as the components of a 
construction kit. Many things can be built with this kit, just as a child's con- 
struction kit can be used to make trucks, houses, bridges, cranes, and so on by 
assembling the components in different combinations. Some components nec- 
essarily go together — nuts with bolts, wheels with tires and axles — but the large- 
scale organization of the final object is not defined by these substructures. 
Rather, it is defined by the instructions that accompany the components and 
prescribe how they are to be assembled. 

To a large extent, the instructions needed to produce a multicellular animal 
are contained in the noncoding, regulatory DNA that is associated with each 
gene. As discussed in Chapter 4, each gene in a multicellular organism is associ- 
ated with thousands or tens of thousands of nucleotides of noncoding DNA. 
This DNA may contain, scattered within it, dozens of separate regulatory ele- 
ments or enhancers — short DNA segments that serve as binding sites for specific 
complexes of gene regulatory proteins. Roughly speaking, as explained in Chap- 
ter 7, the presence of a given regulatory module of this sort leads to expression 
of the gene whenever the complex of proteins recognizing that segment of DNA 
is appropriately assembled in the cell (in some cases, an inhibition or a more 
complicated effect on gene expression is produced instead). If we could deci- 
pher the full set of regulatory modules associated with a gene, we would under- 
stand all the different molecular conditions under which the product of that 
gene is to be made. This regulatory DNA can therefore be said to define the 
sequential program of development: the rules for stepping from one state to the 
next, as the cells proliferate and read their positions in the embryo by reference 
to their surroundings, switching on new sets of genes according to the activities 
of the proteins that they currently contain (Figure 22-4) . Variations in the pro- 
teins themselves do, of course, also contribute to the differences between 
species. But even if the set of proteins encoded in the genome remained com- 
pletely unchanged, the variation in the regulatory DNA would be enough to gen- 
erate radically different tissues and body structures. 

When we compare animal species with similar body plans — different verte- 
brates such as a fish, a bird and a mammal, for example — we find that corre- 
sponding genes usually have similar sets of regulatory modules: the DNA 
sequences of many of the individual modules have been well conserved and are 
recognizably homologous in the different animals. The same is true if we com- 
pare different species of nematode worm, or different species of insect. But 
when we compare vertebrate regulatory regions with those of worm or fly, it is 
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Figure 22-4 How regulatory DNA 
defines the succession of gene 
expression patterns in development. 

The genomes of organisms A and B code 
for the same set of proteins but have 
different regulatory DNA. The two cells in 
the cartoon start in the same state, 
expressing the same proteins at stage 1, 
but step to quite different states at 
stage 2 because of their different 
arrangements of regulatory modules. 
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hard to see any such resemblance. The protein-coding sequences are unmistak- 
ably similar, but the corresponding regulatory DNA sequences appear very dif- 
ferent. This is the expected result if different body plans are produced mainly by 
changing the program embodied in the regulatory DNA, while retaining most of 
the same kit of proteins. 



Manipulation of the Embryo Reveals the Interactions Between 
Its Cells 

Confronted with an adult animal, in all its complexity, how does one begin to 
analyze the process that brought it into being? The first essential step is to 
describe the anatomical changes — the patterns of cell division, growth, and 
movement — that convert the egg into the mature organism. This is the job of 
descriptive embryology, and it is harder than one might think. To explain devel- 
opment in terms of cell behavior, we need to be able to track the individual cells 
through all their divisions, transformations, and migrations in the embryo. The 
foundations of descriptive embryology were laid in the nineteenth century, but 
the fine-grained task of cell lineage tracing continues to tax the ingenuity of 
developmental biologists (Figure 22-5) 

Given a description, how can one go on to discover the causal mechanisms? 
Traditionally, experimental embryologists have tried to understand development 
in terms of the ways in which cells and tissues interact to generate the multicel- 
lular structure. Developmental geneticists, meanwhile, have tried to analyze 
development in terms of the actions of genes. These two approaches are com- 
plementary, and they have converged to produce our present understanding. 

In experimental embryology, cells and tissues from developing animals are 
removed, rearranged, transplanted, or grown in isolation, in order to discover 
how they influence one another. The results are often startling: an early embryo 
cut in half, for example, may yield two complete and perfectly formed animals, 
or a small piece of tissue transplanted to a new site may reorganize the whole 
structure of the developing body (Figure 22-6). Observations of this type can be 




Figure 22-5 Cell lineage tracing in the 
early chick embryo. The pictures in the 
top row are at low magnification and 
show the whole embryo; the pictures 
below are details, showing the distribution 
of labeled cells. The tracing experiment 
reveals complex and dramatic cell 
rearrangements. (A,D) Two tiny dots of 
fluorescent dye, one red, the other green, 
have been used to stain small groups of 
cells in an embryo at 20 hours of 
incubation. Though the embryo still 
appears as an almost featureless sheet of 
tissue, there is already some specialization. 
The dots have been placed on each side of 
a structure called the node. (B,E) Six hours 
later, some of the labeled cells have 
remained at the node (which has moved 
backwards), giving a bright spot of 
fluorescence there, while other cells have 
begun to move forwards relative to the 
node. (C,F) After a further 8 hours, the 
body plan is clearly visible, with a head at 
the front end (top), a central axis, and rows 
of embryonic body segments, called 
somites, on either side of this. The node 
has regressed still further tailwards; some 
of the originally labeled cells have stayed 
in the node, forming a bright spot of 
fluorescence, while others have migrated 
to positions far anterior to this and 
become parts of somites. (Courtesy of 
Raquel Mendes and Leonor Saude.) 
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extended and refined to decipher the underlying cell-cell interactions and rules 
of cell behavior. The experiments are easiest to perform in large embryos that 
are readily accessible for microsurgery Thus, the most widely used species have 
been birds — especially the chick — and amphibians — particularly the African 
frog Xenopus laevis. 



Studies of Mutant Animals Identify the Genes That Control 
Developmental Processes 

Developmental genetics begins with the isolation of mutant animals whose 
development is abnormal. This typically involves a genetic screen, as described 
in Chapter 8. Parent animals are treated with a chemical mutagen or ionizing 
radiation to induce mutations in their germ cells, and large numbers of their 
progeny are examined. Those rare mutant individuals that show some interest- 
ing developmental abnormality — altered development of the eye, for example — 
are picked out for further study. In this way, it is possible to discover genes that 
are required specifically for the normal development of any chosen feature. By 
cloning and sequencing a gene found in this way, it is possible to identify its pro- 
tein product, to investigate how it works, and to begin an analysis of the regula- 
tory DNA that controls its expression. 

The genetic approach is easiest in small animals with short generation times 
that can be grown in the laboratory. The first animal to be studied in this way was 
the fruit fly Drosophila melanogaster, which will be discussed at length below. 
But the same approach has been successful in the nematode worm, Caenorhab- 
ditis elegans, the zebrafish, Danio rerio, and the mouse, Mus musculus. Although 
humans are not intentionally mutagenized, they get screened for abnormalities 
in enormous numbers through the medical care system. Many mutations have 
arisen in humans that cause abnormalities compatible with life, and analyses of 
the affected individuals and of their cells have provided important insights into 
developmental processes. 



Figure 22-6 Some striking results 
obtained by experimental embryology. 

<ATTG> In (A), an early amphibian embryo 
is split almost into two parts with a hair 
loop. In (B), an amphibian embryo at a 
somewhat later stage receives a graft of a 
small cluster of cells from another embryo 
at that stage. The two quite different 
operations both cause a single embryo to 
develop into a pair of conjoined (Siamese) 
twins. It is also possible in experiment (A) 
to split the early embryo into two 
completely separate halves; two entire 
separate well-formed tadpoles are then 
produced. (A, after H. Spemann, Embryonic 
Development and Induction. New Haven: 
Yale University Press, 1938; B, after 
J. Holtfreter and V. Hamburger, in Analysis 
of Development [B.H. Willier, P.A. Weiss and 
V. Hamburger, eds.], pp. 230-296. 
Philadelphia: Saunders, 1 955.) 



A Cell Makes Developmental Decisions Long Before It Shows a 
Visible Change 

By simply watching closely, or with the help of tracer dyes and other cell-mark- 
ing techniques, one can discover what the fate of a given cell in an embryo will 
be if that embryo is left to develop normally The cell may be fated to die, for 
example, or to become a neuron, to form part of an organ such as the foot, or to 
give progeny cells scattered all over the body To know the cell fate, in this sense, 
however, is to know next to nothing about the cell's intrinsic character. At one 
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Figure 22-7 The standard test for cell 
determination. 



extreme, the cell that is fated to become, say, a neuron may be already special- 
ized in a way that guarantees that it will become a neuron no matter how its sur- 
roundings are disturbed; such a cell is said to be determined for its fate. At the 
opposite extreme, the cell may be biochemically identical to other cells destined 
for other fates, the only difference between them being the accident of position, 
which exposes the cells to different future influences. 

A cell's state of determination can be tested by transplanting it to altered 
environments (Figure 22-7). One of the key conclusions of experimental embry- 
ology has been that, thanks to cell memory, a cell can become determined long 
before it shows any obvious outward sign of differentiation. 

Between the extremes of the fully determined and the completely undeter- 
mined cell, there is a whole spectrum of possibilities. A cell may, for example, 
be already somewhat specialized for its normal fate, with a strong tendency to 
develop in that direction, but still able to change and undergo a different fate if 
it is put in a sufficiently coercive environment. (Some developmental biologists 
would describe such a cell as specified or committed, but not yet determined.) 
Or the cell may be determined, say, as a brain cell, but not yet determined as to 
whether it is to be a neuronal or a glial component of the brain. And often, it 
seems, adjacent cells of the same type interact and depend on mutual support 
to maintain their specialized character, so that they will behave as determined 
if kept together in a cluster, but not if taken singly and isolated from their usual 
companions. 



Cells Have Remembered Positional Values That Reflect Their 
Location in the Body 

In many systems, long before cells become committed to differentiating as a 
specific cell type, they become regionally determined: that is, they switch on and 
maintain expression of genes that can best be regarded as markers of position or 
region in the body. This position-specific character of a cell is called its posi- 
tional value, and it shows its effects in the way the cell behaves in subsequent 
steps of pattern formation. 

The development of the chick leg and wing provides a striking example. The 
leg and the wing of the adult both consist of muscle, bone, skin, and so on — 
almost exactly the same range of differentiated tissues. The difference between 
the two limbs lies not in the types of tissues, but in the way in which those tis- 
sues are arranged in space. So how does the difference come about? 

In the chick embryo the leg and the wing originate at about the same time in 
the form of small tongue-shaped buds projecting from the flank. The cells in the 
two pairs of limb buds appear similar and uniformly undifferentiated at first. But 
a simple experiment shows that this appearance of similarity is deceptive. A 
small block of undifferentiated tissue at the base of the leg bud, from the region 
that would normally give rise to part of the thigh, can be cut out and grafted into 
the tip of the wing bud. Remarkably, the graft forms not the appropriate part of 
the wing tip, nor a misplaced piece of thigh tissue, but a toe (Figure 22-8). This 
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Figure 22-8 Prospective thigh tissue 
grafted into the tip of a chick wing bud 
forms toes. (After J.W. Saunders et al., 
Dev. Biol. 1 1281-301, 1 959. With 
permission from Academic Press.) 
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experiment shows that the early leg-bud cells are already determined as leg but 
are not yet irrevocably committed to form a particular part of the leg: they can 
still respond to cues in the wing bud so that they form structures appropriate to 
the tip of the limb rather than the base. The signaling system that controls the 
differences between the parts of the limb is apparently the same for leg and 
wing. The difference between the two limbs results from a difference in the 
internal states of their cells at the outset of limb development. 

The difference of positional value between vertebrate forelimb cells and 
hindlimb cells corresponds to expression of different sets of genes, coding for 
gene regulatory proteins that are thought to make the cells in the two limb buds 
behave differently (Figure 22-9). Later in this chapter we shall explain how the 
next, more detailed level of patterning is set up inside an individual limb bud. 



Figure 22-9 Chick embryos at 6 days of 
incubation, showing the limb buds 
stained by in situ hybridization with 
probes to detect expression of the Tbx4, 
Tbx5, and Pitxl genes, all coding for 
related gene regulatory proteins. The 
cells expressing Tbx5 will form a wing; 
those expressing Tbx4 and Pitxl will form 
a leg. Pitxl, when artificially misexpressed 
in the wing bud, causes the limb to 
develop with leg-like characteristics. 
(Courtesy of Malcolm Logan.) 



Inductive Signals Can Create Orderly Differences Between Initially 
Identical Cells 

At each stage in its development, a cell in an embryo is presented with a limited 
set of options according to the state it has attained: the cell travels along a devel- 
opmental pathway that branches repeatedly. At each branch in the pathway it 
has to make a choice, and its sequence of choices determines its final destiny. In 
this way, a complicated array of different cell types is produced. 

To understand development, we need to know how each choice between 
options is controlled, and how those options depend on the choices made pre- 
viously. To reduce the question to its simplest form: how do two cells with the 
same genome, but separated in space, come to be different? 

The most straightforward way to make cells different is by exposing them to 
different environments, and the most important environmental cues acting on 
cells in an embryo are signals from neighboring cells. Thus, in what is probably 
the commonest mode of pattern formation, a group of cells start out all having 
the same developmental potential, and a signal from cells outside the group 
then drives one or more of the members of the group into a different develop- 
mental pathway, leading to a changed character. This process is called an induc- 
tive interaction. Generally, the signal is limited in time and space so that only a 
subset of the competent cells — those closest to the source of the signal — take on 
the induced character (Figure 22-10). inductive signal 

Some inductive signals are short-range — notably those transmitted via / \ 

cell-cell contacts; others are long-range, mediated by molecules that can diffuse / "^ \ 

through the extracellular medium. The group of initially similar cells competent <^^^h j~ **^^^^ 

to respond to the signal is sometimes called an equivalence group or a morpho- 
genetic field. It can consist of as few as two cells or as many as thousands, and 
any number of the total can be induced depending on the amount and distribu- 
tion of the signal. * 

Sister Cells Can Be Born Different by an Asymmetric Cell Division 22222222 

i i i i 

Cell diversification does not always have to depend on extracellular signals: in ceMs directed to new 

some cases, sister cells are born different as a result of an asymmetric cell divi- developmental pathway 

sion, in which some significant set of molecules is divided unequally between Figure 22-10 Inductive signaling. 
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1. asymmetric division: sister cells born different 
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2. symmetric division: sister cells become different as result of 
influences acting on them after their birth 



the two of them at the time of division. This asymmetrically segregated molecule 
(or set of molecules) then acts as a determinant for one of the cell fates by 
directly or indirectly altering the pattern of gene expression within the daughter 
cell that receives it (Figure 22-11). 

Asymmetric divisions often occur at the beginning of development, when 
the fertilized egg divides to give daughter cells with different fates, but they are 
also encountered at some later stages — in the genesis of nerve cells, for example. 



Positive Feedback Can Create Asymmetry Where There Was None 
Before 

Inductive signaling and asymmetric cell division represent two distinct strate- 
gies for creating differences between cells. Both of them, however, presuppose 
some prior asymmetry in the system: the source of inductive signal must be 
localized so that some cells receive the signal strongly and others do not; or the 
mother cell must already have an internal asymmetry before she divides. Very 
often, the history of the system ensures that some such asymmetry will be pre- 
sent. But what if it is not, or if the initial asymmetry is only very slight? 

The answer lies in positive feedback: through positive feedback, a system 
that starts off homogeneous and symmetrical can pattern itself spontaneously, 
even where there is no organized external signal at all. And where, as very often 
happens, the environment or the starting conditions impose some weak but def- 
inite initial asymmetry, positive feedback provides the means to magnify the 
effect and create a full-blown pattern. 

To illustrate the idea, consider a pair of adjacent cells that start off in a sim- 
ilar state and can exchange signals to influence one another's behavior (Figure 
22-12). The more that either cell produces of some product X, the more it signals 
to its neighbor to inhibit production of X by the neighbor. This type of cell-cell 
interaction is called lateral inhibition, and it gives rise to a positive feedback 
loop that tends to amplify any initial difference between the two cells. Such a dif- 
ference may arise from a bias imposed by some external or prior factor, or it may 
simply originate from spontaneous random fluctuations, or "noise" — an 
inevitable feature of the genetic control circuitry in cells, as discussed in Chap- 
ter 7. In either case, lateral inhibition means that if cell #1 makes a little more of 
X, it will thereby cause cell #2 to make less; and because cell #2 makes less X, it 
delivers less inhibition to cell #1 and so allows the amount of X in cell #1 to rise 
higher still; and so on, until a steady state is reached where cell #1 contains a lot 
of X and cell #2 contains very little. 



Figure 22-1 1 Two ways of making sister 
cells different. 
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POSITIVE FEEDBACK: 
asymmetry is self-amplifying 




BISTABILITY: 
all-or-none alternative outcomes represent a stable memory 

Mathematical analysis shows that this phenomenon depends on the 
strength of the lateral inhibition effect: if it is too weak, fluctuations will fade and 
have no lasting effect; but if it is strong enough and steep enough, they will be 
self- amplifying in a runaway fashion, breaking the initial symmetry between the 
two cells. Lateral inhibition, often mediated by exchange of signals at cell-cell 
contacts via the Notch signaling pathway (as discussed in Chapter 15), is a com- 
mon mechanism for cell diversification in animal tissues, driving neighboring 
cells to specialize in different ways. 



Figure 22-1 2 Genesis of asymmetry 
through positive feedback. In this 
example, two cells interact, each 
producing a substance X that acts on the 
other cell to inhibit its production of X, an 
effect known as lateral inhibition. An 
increase of X in one of the cells leads to a 
positive feedback that tends to increase X 
in that cell still further, while decreasing X 
in its neighbor. This can create a runaway 
instability, making the two cells become 
radically different. Ultimately the system 
comes to rest in one or the other of two 
opposite stable states. The final choice 
of state represents a form of memory: 
the small influence that initially directed 
the choice is no longer required to 
maintain it. 



Positive Feedback Generates Patterns, Creates All-or-none 
Outcomes, and Provides Memory 

Somewhat similar positive feedback processes can operate over larger arrays of 
cells to create many types of spatial patterns. For example, a substance A (a short- 
range activator) may stimulate its own production in the cells that contain it and 
their immediate neighbors, while also causing them to produce a signal H (a long- 
range inhibitor) that diffuses widely and inhibits production of A in the cells at 
larger distances. If the cells all start out on an equal footing, but one group of cells 
gains a slight advantage by making a little more A than the rest, the asymmetry can 
be self- amplifying. Short-range activation combined with long-range inhibition in 
this way may account for the formation of clusters of cells within an initially 
homogeneous tissue that become specialized as localized signaling centers. 

At the opposite end of the size spectrum, positive feedback can also be the 
means by which an individual cell becomes spontaneously polarized and inter- 
nally asymmetrical, through systems of intracellular signals that make a weak 
initial asymmetry self- amplifying. 

Through all these and many other variations on the theme of positive feed- 
back, certain general principles apply. In each of the above examples, the posi- 
tive feedback leads to broken symmetry, and the symmetry-breaking is an all-or- 
none phenomenon. If the feedback is below a certain threshold strength, the 
cells remain essentially the same; if the feedback is above the threshold, they 
become sharply different. Above this threshold, the system is bistable or multi- 
stable — it lurches toward one or other of two or more sharply different out- 
comes, according to which of the cells (or which of the ends of the single cell) 
gains the initial advantage. 

The choice between the alternative outcomes can be dictated by an external 
signal that gives one of the cells a small initial advantage. But once the positive 
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feedback has done its work, this external signal becomes irrelevant. The broken 
symmetry, once established, is very hard to reverse: positive feedback makes 
the chosen asymmetric state self-sustaining, even after the biasing signal has 
disappeared. In this way, positive feedback provides the system with a memory 
of past signals. 

All these effects of positive feedback — symmetry-breaking, all-or-none out- 
comes, bistability, and memory — go hand in hand and are encountered again 
and again in developing organisms. They are fundamental to the production of 
sharply delineated, stable patterns of cells in different states. 



A Small Set of Signaling Pathways, Used Repeatedly, Controls 
Developmental Patterning 

What, then, are the molecules that act as signals to coordinate spatial patterning 
in an embryo, either to create asymmetry de novo, or as inducers from estab- 
lished signaling centers to control the diversification of neighboring cells? In 
principle, any kind of extracellular molecule could serve. In practice, most of the 
known inductive events in animal development are governed by just a handful 
of highly conserved families of signal proteins, which are used over and over 
again in different contexts. The discovery of this limited vocabulary that cells use 
for developmental communications has emerged over the past 10 or 20 years as 
one of the great simplifying discoveries of developmental biology. In Table 22-1, 
we briefly review six major families of signal proteins that serve repeatedly as 
inducers in animal development. Details of the intracellular mechanisms 
through which these molecules act are given in Chapter 15. 

The ultimate result of most inductive events is a change in DNA transcrip- 
tion in the responding cell: some genes are turned on and others are turned off. 
Different signaling molecules activate different kinds of gene regulatory pro- 
teins. Moreover, the effect of activating a given gene regulatory protein will 
depend on which other gene regulatory proteins are also present in the cell, 
since these generally function in combinations. As a result, different types of 
cells will generally respond differently to the same signal, and the same cells will 
often respond differently to the same signal given at a different time. The 
response will depend both on the other gene regulatory proteins that are present 
before the signal arrives — reflecting the cell's memory of signals received previ- 
ously — and on the other signals that the cell is receiving concurrently. 



Morphogens Are Long-Range Inducers That Exert Graded Effects 

Signal molecules often seem to govern a simple yes-no choice: one outcome 
when their concentration is high, another when it is low. Positive feedback can 



Table 22-1 Some Signal Proteins That Are Used Over and Over Again as Inducers in Animal Development 


SIGNALING PATHWAY 


LIGAND FAMIILY 


RECEPTOR FAMILY 


EXTRACELLULAR INHIBITORS/MODULATORS 


Receptor tyrosine kinase (RTK) 


EGF 


EGF receptors 


Argos 




FGF (Branchless) 


FGF receptors (Breathless) 






Ephrins 


Eph receptors 




TGFp superfamily 


TGFp 


TGFp receptors 


chordin (Sog), noggin 




BMP (Dpp) 


BMP receptors 






Nodal 






Wnt 


Wnt (Wingless) 


Frizzled 


Dickkopf, Cerberus 


Hedgehog 


Hedgehog 


Patched, Smoothened 




Notch 


Delta 


Notch 


Fringe 



Only a few representatives of each class of proteins are listed — mainly those mentioned in this chapter. Names peculiar to Drosophila are shown 
in parentheses. Many of the listed components have several homologs distinguished by numbers (FGF1, FGF2, etc.) or by forenames (Sonic 
hedgehog, Lunatic fringe). Other signaling pathways, including the JAK/STAT, nuclear hormone receptor, and G-protein-coupled receptor 
pathways, also play important parts in some developmental processes. 
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make the cellular responses all-or-none, so that one result is obtained when the 
signal is below a certain critical strength, and another result when it is above that 
strength. In many cases, however, responses are more finely graded: a high con- 
centration may, for example, direct target cells into one developmental pathway, 
an intermediate concentration into another, and a low concentration into yet 
another. An important case is that in which the signal molecule diffuses out 
from a localized signaling center, creating a signal concentration gradient. Cells 
at different distances from the source are driven to behave in a variety of differ- 
ent ways, according to the signal concentration that they experience. 

A signal molecule that imposes a pattern on a whole field of cells in this 
way is called a morphogen. Vertebrate limbs provide a striking example: a 
group of cells at one side of the embryonic limb bud become specialized as a 
signaling center and secrete Sonic hedgehog protein — a member of the 
Hedgehog family of signal molecules. This protein spreads out from its source, 
forming a morphogen gradient that controls the characters of the cells along 
the thumb -to -little -finger axis of the limb bud. If an additional group of sig- 
naling cells is grafted into the opposite side of the bud, a mirror duplication of 
the pattern of digits is produced (Figure 22-13). 



Figure 22-13 Sonic hedgehog as a 
morphogen in chick limb development. 

(A) Expression of the Sonic hedgehog 
gene in a 4-day chick embryo, shown by 
in situ hybridization (dorsal view of the 
trunk at the level of the wing buds). The 
gene is expressed in the midline of the 
body and at the posterior border (the 
polarizing region) of each of the two 
wing buds. Sonic hedgehog protein 
spreads out from these sources. 

(B) Normal wing development. (C) A graft 
of tissue from the polarizing region 
causes a mirror-image duplication of the 
pattern of the host wing. The type of digit 
that develops is thought to be dictated 
by the local concentration of Sonic 
hedgehog protein; different types of digit 
(labeled 2, 3, and 4) therefore form 
according to their distance from a source 
of Sonic hedgehog. (A, courtesy of 
Randall S. Johnson and Robert D. Riddle.) 



Extracellular Inhibitors of Signal Molecules Shape the Response 
to the Inducer 

Especially for molecules that can act at a distance, it is important to limit the 
action of the signal, as well as to produce it. Most developmental signal proteins 
have extracellular antagonists that can inhibit their function. These antagonists 
are generally proteins that bind to the signal or its receptor, preventing a pro- 
ductive interaction from taking place. 

A surprisingly large number of developmental decisions are actually regulated 
by inhibitors rather than by the primary signal molecule. The nervous system in a 
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frog embryo arises from a field of cells that is competent to form either neural or 
epidermal tissue. An inducing tissue releases the protein chordin, which favors 
the formation of neural tissue. Chordin does not have its own receptor. Instead 
it is an inhibitor of signal proteins of the BMP/TGF(3 family, which induce epi- 
dermal development and are present throughout the neuroepithelial region 
where neurons and epidermis form. The induction of neural tissue is thus due to 
an inhibitory gradient of an antagonistic signal (Figure 22-14). 



Developmental Signals Can Spread Through Tissue in Several 
Different Ways 

Many developmental signals are thought to spread through tissues by simple 
diffusion through the spaces between cells. If some specialized group of cells 
produces a signal molecule at a steady rate, and this morphogen is then 
degraded as it diffuses away from this source, a smooth gradient will be set up, 
with its maximum at the source. The speed of diffusion and the half-life of the 
morphogen will together determine the steepness of the gradient (Figure 
22-15). 

This simple mechanism can be modified in many ways to adjust the shape 
and steepness of the gradient. Receptors on the surfaces of cells along the way 
may trap the diffusing morphogen and cause it to be endocytosed and 
degraded, shortening its effective halflife. Or it may bind to molecules in the 
extracellular matrix, reducing its effective diffusion rate. In some cases, it seems 



Figure 22-1 4 Two ways to create a 
morphogen gradient. (A) By localized 
production of an inducer — a 
morphogen — that diffuses away from its 
source. (B) By localized production of an 
inhibitor that diffuses away from its 
source and blocks the action of a 
uniformly distributed inducer. 
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Figure 22-15 Setting up a signal 
gradient by diffusion. The graphs show 
successive stages in the build-up of the 
concentration of a signal molecule that is 
produced at a steady rate at the origin, 
with production starting at time O.The 
molecule undergoes degradation as it 
diffuses away from the source, creating a 
concentration gradient with its peak at 
the source. The graphs are calculated on 
the assumption that diffusion is occurring 
along one axis in space, that the 
molecule has a half-life ty 2 of 20 minutes, 
and that it diffuses with a diffusion 
constant D = 0.4 mm 2 hr 1 , typical of a 
small (30 kilodalton) protein molecule in 
water. Note that the gradient is already 
close to its steady-state form within an 
hour, and that the concentration at 
steady state (large times) falls off 
exponentially with distance. 
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that a morphogen is taken up into cells by endocytosis and then disgorged, only 
to be taken up and then disgorged by other cells in turn, so that the signal 
spreads through a largely intracellular route. 

Yet another mechanism for signal distribution depends on long thin filopo- 
dia or cytonemes that extend over several cell diameters from cells in some 
epithelial tissues. A cell may send out cytonemes to make contact with distant 
cells, either to deliver or to receive signals from them. In this way, for example, a 
cell can deliver lateral inhibition via the Notch pathway to an extended set of 
neighbors. 



Programs That Are Intrinsic to a Cell Often Define the 
Time-Course of its Development 

Signals such as those we have just discussed play a large part in controlling the 
timing of events in development, but it would be wrong to imagine that every 
developmental change needs an inductive signal to trigger it. Many of the mech- 
anisms that alter cell character are intrinsic to the cell and require no cue from 
the cell's surroundings: the cell will step through its developmental program 
even when kept in a constant environment. There are numerous cases where 
one might suspect that something of this sort is occurring to control the dura- 
tion of a developmental process. For example, in a mouse, the neural progenitor 
cells in the cerebral cortex of the brain carry on dividing and generating neurons 
for just 11 cell cycles, and in a monkey for approximately 28 cycles, after which 
they stop. Different kinds of neurons are generated at different stages in this pro- 
gram, suggesting that as the progenitor cell ages, it changes the specifications 
that it supplies to the differentiating progeny cells. 

It is difficult to prove in the context of the intact embryo that such a course 
of events is strictly the result of a cell- autonomous timekeeping process, since 
the cell environment is changing. Experiments on cells in culture, however, give 
clear-cut evidence. For example, glial progenitor cells isolated from the optic 
nerve of a 7-day postnatal rat and cultured under constant conditions in an 
appropriate medium will carry on proliferating for a strictly limited time (corre- 
sponding to a maximum of about eight cell division cycles) and then differenti- 
ate into oligodendrocytes (the glial cells that form myelin sheaths around axons 
in the brain), obeying a timetable similar to the one that they would have fol- 
lowed if they had been left in place in the embryo. 

The molecular mechanisms underlying such slow changes in the internal 
states of cells, played out over days, weeks, months or even years, are still 
unknown. One possibility is that they reflect progressive changes in the state of 
the chromatin (discussed in Chapter 4). 

The mechanisms that control the timing of more rapid processes, though 
still poorly understood, are not quite such a mystery. Later, we shall discuss an 
example — the gene expression oscillator, known as the segmentation clock, that 
governs formation of the somites in vertebrate embryos — the rudiments of the 
series of vertebrae, ribs, and associated muscles. 

Initial Patterns Are Established in Small Fields of Cells and 
Refined by Sequential Induction as the Embryo Grows 

The signals that organize the spatial pattern of an embryo generally act over 
short distances and govern relatively simple choices. A morphogen, for example, 
typically acts over a distance of less than 1 mm — an effective range for diffusion 
(see Figure 22-15) — and directs choices between no more than a handful of 
developmental options for the cells on which it acts. But the organs that eventu- 
ally develop are much larger and more complex than this. 

The cell proliferation that follows the initial specification accounts for the 
size increase, while the refinement of the initial pattern is explained by a series 
of local inductions that embroider successive levels of detail on an initially sim- 
ple sketch. As soon as two sorts of cells are present, one of them can produce a 



1 320 Chapter 22: Development of Multicellular Organisms 




C is induced 
by signal from 
B acting on A 




D and E are 

induced by signal 

from C acting 

on A and B, 

respectively 




Figure 22-16 Patterning by sequential 
induction. A series of inductive 
interactions can generate many types of 
cells, starting from only a few. 



factor that induces a subset of the neighboring cells to specialize in a third way. 
The third cell type can in turn signal back to the other two cell types nearby, gen- 
erating a fourth and a fifth cell type, and so on (Figure 22-16). 

This strategy for generating a progressively more complicated pattern is 
called sequential induction. It is chiefly through sequential inductions that the 
body plan of a developing animal, after being first roughed out in miniature, 
becomes elaborated with finer and finer details as development proceeds. 

In the sections that follow, we focus on a small selection of model organisms 
to see how the principles that we have outlined in this first section operate in 
practice. We begin with the nematode worm, Caenorhabditis elegans. 



Summary 

The obvious changes of cell behavior that we see as a multicellular organism develops 
are the outward signs of a complex molecular computation, dependent on cell mem- 
ory, that is taking place inside the cells as they receive and process signals from their 
neighbors and emit signals in return. The final pattern of differentiated cell types is 
thus the outcome of a more hidden program of cell specialization — a program played 
out in the changing patterns of expression of gene regulatory proteins, giving one cell 
different potentialities from another long before terminal differentiation begins. 
Developmental biologists seek to decipher the hidden program and to relate it, 
through genetic and microsurgical experiments, to the signals the cells exchange as 
they proliferate, interact, and move. 

Animals as different as worms, flies, and humans use remarkably similar sets of 
proteins to control their development, so that what we discover in one organism very 
often gives insight into the others. A handful of evolutionary conserved cell-cell sig- 
naling pathways are used repeatedly, in different organisms and at different times, to 
regulate the creation of an organized multicellular pattern. Differences of body plan 
seem to arise to a large extent from differences in the regulatory DNA associated with 
each gene. This DNA has a central role in defining the sequential program of develop- 
ment, calling genes into action at specific times and places according to the pattern of 
gene expression that was present in each cell at the previous developmental stage. 

Differences between cells in an embryo arise in various ways. Positive feedback can 
lead to broken symmetry, creating a radical and permanent difference between cells 
that are initially almost identical. Sister cells can be born different as a result of an 
asymmetric cell division. Or a group of initially similar cells may receive different 
exposures to inductive signals from cells outside the group; long-range inducers with 
graded effects, called morphogens, can organize a complex pattern. Through cell mem- 
ory, such transient signals can have a lasting effect on the internal state of a cell, caus- 
ing it, for example, to become determined for a specific fate. In these ways, sequences of 
simple signals acting at different times and places in growing cell arrays give rise to the 
intricate and varied multicellular organisms that fill the world around us. 



C. ELEGANS: DEVELOPMENT FROM THE PERSPECTIVE OF THE INDIVIDUAL CELL 

CAENORHABDITIS ELEGANS: DEVELOPMENT FROM 
THE PERSPECTIVE OF THE INDIVIDUAL CELL 

The nematode worm Caenorhabditis elegans is a small, relatively simple, and 
precisely structured organism. The anatomy of its development has been 
described in extraordinary detail, and one can map out the exact lineage of every 
cell in the body Its complete genome sequence is also known, and large num- 
bers of mutant phenotypes have been analyzed to determine gene functions. If 
there is any multicellular animal whose development we should be able to 
understand in terms of genetic control, this is it. 

DNA sequence comparisons indicate that, while the lineages leading to 
nematodes, insects, and vertebrates diverged from one another at about the 
same time, the rate of evolutionary change in the nematode lineage has been 
substantially greater: its genes, its body structure, and its developmental strate- 
gies are more divergent from our own than are those of Drosophila. Neverthe- 
less, at a molecular level many of its developmental mechanisms are similar to 
those of insects or vertebrates, and governed by homologous systems of genes. 
If one wants to know how an eye, a limb, or a heart develops, one must look else- 
where: C. elegans lacks these organs. But at a more fundamental level, it is highly 
instructive: it poses the basic general questions of animal development in a rel- 
atively simple form, and it lets us answer them in terms of gene functions and 
the behavior of individual, identified cells. 

Caenorhabditis elegans Is Anatomically Simple 

As an adult, C. elegans consists of only about 1000 somatic cells and 1000-2000 
germ cells (exactly 959 somatic cell nuclei plus about 2000 germ cells in one sex; 
exactly 1031 somatic cell nuclei plus about 1000 germ cells in the other) (Figure 
22-17). The anatomy has been reconstructed, cell by cell, by electron 
microscopy of serial sections. The body plan of the worm is simple: it has a 
roughly bilaterally symmetrical, elongate body composed of the same basic tis- 
sues as in other animals (nerve, muscle, gut, skin), organized with mouth and 
brain at the anterior end and anus at the posterior. The outer body wall is com- 
posed of two layers: the protective epidermis, or "skin," and the underlying mus- 
cular layer. A tube of endodermal cells forms the intestine. A second tube, 
located between the intestine and the body wall, constitutes the gonad; its wall 
is composed of somatic cells, with the germ cells inside it. 

C. elegans has two sexes — a hermaphrodite and a male. The hermaphrodite 
can be viewed most simply as a female that produces a limited number of 
sperm: she can reproduce either by self-fertilization, using her own sperm, or by 
cross-fertilization after transfer of male sperm by mating. Self-fertilization 
allows a single heterozygous worm to produce homozygous progeny. This is an 
important feature that helps to make C. elegans an exceptionally convenient 
organism for genetic studies. 
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Figure 22-1 7 Caenorhabditis elegans. 

A side view of an adult hermaphrodite is 
shown. (From J.E. Sulston and 
H.R. Horvitz, Dev. Biol. 56:1 10-156, 1977. 
With permission from Academic Press.) 



1 322 Chapter 22: Development of Multicellular Organisms 



10 



EGG 

<£> 



£ 



nervous system 

epidermis 

musculature 



isculature 



musculature 
nervous system 
somatic gonad 



_L V-l 



epidermis 
nervous system 



>n 



germ line 



musculature 



hatching 



wmmmmm 



ANTERIOR 



POSTERIOR 




Cell Fates in the Developing Nematode Are Almost Perfectly 
Predictable 

C. elegans begins life as a single cell, the fertilized egg, which gives rise, through 
repeated cell divisions, to 558 cells that form a small worm inside the egg shell. 
After hatching, further divisions result in the growth and sexual maturation of 
the worm as it passes through four successive larval stages separated by molts. 
After the final molt to the adult stage, the hermaphrodite worm begins to pro- 
duce its own eggs. The entire developmental sequence, from egg to egg, takes 
only about three days. 

The lineage of all of the cells from the single-cell egg to the multicellular 
adult was mapped out by direct observation of the developing animal. In the 
nematode, a given precursor cell follows the same pattern of cell divisions in 
every individual, and with very few exceptions the fate of each descendant cell 
can be predicted from its position in the lineage tree (Figure 22-18). 

This degree of stereotyped precision is not seen in the development of larger 
animals. At first sight, it might seem to suggest that each cell lineage in the 
nematode embryo is rigidly and independently programmed to follow a set pat- 
tern of cell division and cell specialization, making the worm a woefully unrep- 
resentative model organism for development. We shall see that this is far from 
true: as in other animals, development depends on cell-cell interactions as well 
as on processes internal to the individual cells. The outcome in the nematode is 
almost perfectly predictable simply because the pattern of cell-cell interactions 
is highly reproducible and is accurately correlated with the sequence of cell divi- 
sions. 

In the developing worm, as in other animals, most cells do not become 
restricted to generate progeny cells of a single differentiated type until quite late 
in development, and cells of a particular type, such as muscle, usually derive 
from several spatially dispersed precursors that also give rise to other types of 
cells. The exceptions, in the worm, are the gut and the gonad, each of which 
forms from a single dedicated founder cell, born at the 8-cell stage of develop- 
ment for the gut-cell lineage and at the 16-cell stage for the germ-cell lineage, or 



Figure 22-1 8 The lineage tree for the 
cells that form the gut (the intestine) of 
C. elegans. Note that although the 
intestinal cells form a single clone (as do 
the germ-line cells), the cells of most 
other tissues do not. Nerve cells (not 
shown in the drawing of the adult at the 
bottom) are mainly clustered in ganglia 
near the anterior and posterior ends of 
the animal and in a ventral nerve cord 
that runs the length of the body. 
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germ line. But in any case, cell diversification starts early, as soon as the egg 
begins to cleave: long before terminal differentiation, the cells begin to step 
through a series of intermediate states of specialization, following different pro- 
grams according to their locations and their interactions with their neighbors. 
How do these early differences between cells arise? 



Products of Maternal-Effect Genes Organize the Asymmetric 
Division of the Egg 

The worm is typical of most animals in the early specification of the cells that 
will eventually give rise to the germ cells (eggs or sperm). The worm's germ line 
is produced by a strict series of asymmetric cell divisions of the fertilized egg. 
The asymmetry originates with a cue from the egg's environment: the sperm 
entry point defines the future posterior pole of the elongated egg. The proteins 
in the egg then interact with one another and organize themselves in relation to 
this point so as to create a more elaborate asymmetry in the interior of the cell. 
The proteins involved are mainly translated from the accumulated mRNA prod- 
ucts of the genes of the mother. Because this RNA is made before the egg is laid, 
it is only the mother's genotype that dictates what happens in the first steps of 
development. Genes acting in this way are called maternal- effect genes. 

A subset of maternal- effect genes are specifically required to organize the 
asymmetric pattern of the nematode egg. These are called Par {Par titioning- 
defective) genes, and at least six have been identified, through genetic screens 
for mutants where this pattern is disrupted. The Par genes have homologs in 
insects and vertebrates, where they play a fundamental part in the organization 
of cell polarity, as discussed in Chapter 19. In fact, one of the keys to our present 
understanding of the general mechanisms of cell polarity was the discovery of 
these genes through studies of early development in C. elegans. 

In the nematode egg, as in other cells both in the nematode and other ani- 
mals, the Par proteins (the products of the Par genes) are themselves asymmetri- 
cally located, some at one end of the cell and some at the other. They serve in the 
egg to bring a set of ribonucleoprotein particles called P granules to the posterior 
pole, so that the posterior daughter cell inherits P granules and the anterior 
daughter cell does not. Throughout the next few cell divisions, the Par proteins 
operate in a similar way, orienting the mitotic spindle and segregating the P gran- 
ules to one daughter cell at each mitosis, until, at the 16-cell stage, there is just 
one cell that contains the P granules (Figure 22-19). This one cell gives rise to the 
germ line. 

The specification of the germ-cell precursors as distinct from somatic-cell 
precursors is a key event in the development of practically every type of animal, 
and the process has common features even in phyla with very different body 
plans. Thus, in Drosophila, particles similar to P granules are also segregated 



Figure 22-19 Asymmetric divisions 
segregating P granules into the founder 
cell of the C elegans germ line. The 

micrographs in the upper row show the 
pattern of cell divisions, with cell nuclei 
stained blue with a DNA-specific 
fluorescent dye; below are the same cells 
stained with an antibody against P 
granules. These small granules (0.5-1 Jim 
in diameter) are distributed randomly 
throughout the cytoplasm in the 
unfertilized egg (not shown). After 
fertilization, at each cell division up to the 
16-cell stage, both they and the 
intracellular machinery that regulates 
their asymmetric localization are 
segregated into a single daughter cell. 
(Courtesy of Susan Strome.) 
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Figure 22-20 The pattern of cell divisions in the early C. elegans embryo, 
indicating the names and fates of the individual cells. Cells that are 
sisters are shown linked by a short black line. (After K. Kemphues, Cell 
101:345-348, 2000. With permission from Elsevier.) 



into one end of the egg, and become incorporated into the germ-line precursor 
cells to determine their fate. Similar phenomena occur in fish and frogs. In all 
these species, one can recognize at least some of the same proteins in the germ- 
cell-determining material, including homologs of an RNA-binding protein 
called Vasa. How Vasa and its associated proteins and RNA molecules act to 
define the germ line is still unknown. 



Progressively More Complex Patterns Are Created by Cell-Cell 
Interactions 

The egg, in C. elegans as in other animals, is an unusually big cell, with room for 
complex internal patterning. In addition to the P granules, other factors become 
distributed in an orderly way along its anteroposterior axis under the control of 
the Par proteins, and thus are allocated to different cells as the egg goes through 
its first few cell- division cycles. These divisions occur without growth (since 
feeding cannot begin until a mouth and a gut have formed) and therefore sub- 
divide the egg into progressively smaller cells. Several of the localized factors are 
gene regulatory proteins, which act directly in the cell that inherits them to 
either drive or block the expression of specific genes, adding to the differences 
between that cell and its neighbors and committing it to a specialized fate. 

While the first few differences between cells along the anteroposterior axis 
of C. elegans result from asymmetric divisions, further patterning, including the 
pattern of cell types along the other axes, depends on interactions between one 
cell and another. The cell lineages in the embryo are so reproducible that indi- 
vidual cells can be assigned names and identified in every animal (Figure 
22-20); the cells at the four-cell stage, for example, are called ABa and ABp (the 
two anterior sister cells), and EMS and P2 (the two posterior sister cells). As a 
result of the asymmetric divisions we have just described, the P2 cell expresses a 
signal protein on its surface — a nematode homolog of the Notch ligand Delta — 
while the ABa and ABp cells express the corresponding transmembrane recep- 
tor — a homolog of Notch. The elongated shape of the eggshell forces these cells 
into an arrangement such that the most anterior cell, ABa, and the most poste- 
rior cell, P2, are no longer in contact with one another. Thus only the ABp cell 
receives the signal from P2, making ABp different from ABa and defining the 
future dorsal-ventral axis of the worm (Figure 22-21). 

At the same time, P2 also expresses another signal molecule, a Wnt protein, 
which acts on a Wnt receptor (a Frizzled protein) in the membrane of the EMS 
cell. This signal polarizes the EMS cell in relation to its site of contact with P 2 , 
controlling the orientation of the mitotic spindle. The EMS cell then divides to 
give two daughters that become committed to different fates as a result of the 
Wnt signal from P 2 . One daughter, the MS cell, will give rise to muscles and var- 
ious other body parts; the other daughter, the E cell, is the founder cell for the 
gut, committed to give rise to all the cells of the gut and to no other tissues (see 
Figure 22-21). 



Figure 22-21 Cell signaling pathways controlling assignment of different 
characters to the cells in a four-cell nematode embryo. The P2 cell uses 
the Notch signaling pathway to send an inductive signal to the ABp cell, 
causing this to adopt a specialized character. The ABa cell has all the 
molecular apparatus to respond in the same way to the same signal, but it 
does not do so because it is out of contact with P 2 . Meanwhile, a Wnt signal 
from the P 2 cell causes the EMS cell to orient its mitotic spindle and 
generate two daughters that become committed to different fates as a 
result of their different exposure to Wnt protein — the MS cell and the E cell 
(the founder cell of the gut). 
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Having sketched the chain of cause and effect in early nematode develop- 
ment, we now examine some of the methods that have been used to decipher it. 

Microsurgery and Genetics Reveal the Logic of Developmental 
Control; Gene Cloning and Sequencing Reveal Its Molecular 
Mechanisms 

To discover the causal mechanisms, we need to know the developmental poten- 
tial of the individual cells in the embryo. At what points in their lives do they 
undergo decisive internal changes that determine them for a particular fate, and 
at what points do they depend on signals from other cells? In the nematode, 
using laser microbeam microsurgery, one can accurately kill one or more of a 
cell's neighbors and then observe directly how the cell behaves in the altered cir- 
cumstances. Alternatively, cells of the early embryo can be pushed around and 
rearranged inside the eggshell using a fine needle. For example, the relative posi- 
tions of ABa and ABp can be flipped at the four-cell stage of development. The 
ABa cell then undergoes what would normally be the fate of the ABp cell, and 
vice versa, showing that the two cells initially have the same developmental 
potential and depend on signals from their neighbors to make them different. A 
third tactic is to remove the eggshell of an early C. elegans embryo by digesting it 
with enzymes, and then to manipulate the cells in culture. The existence of a 
polarizing signal from P2 to EMS was demonstrated in this way. 

Genetic screens were used to identify the genes involved in the P2-EMS cell 
interaction. A search was made for mutant strains of worms in which no gut cells 
were induced (called Mom mutants, because they had more mesoderm — meso- 
derm being the fate of both of the EMS cell daughters when induction fails). 
Cloning and sequencing the Mom genes revealed that one encodes a Wnt sig- 
nal protein that is expressed in the P2 cell, while another encodes a Frizzled 
protein (a Wnt receptor) that is expressed in the EMS cell. A second genetic 
screen was conducted for mutant strains of worms with the opposite pheno- 
type, in which extra gut cells are induced (called Pop mutants, for posterior 
pharynx defect). One of the Pop genes {Pop!) turns out to encode a gene regu- 
latory protein (a LEF1/TCF homolog) whose activity is down-regulated by Wnt 
signaling in C. elegans. When Popl activity is absent, both daughters of the EMS 
cell behave as though they have received the Wnt signal from P2. Similar genetic 
methods were used to identify the genes whose products mediate the Notch- 
dependent signaling from P 2 to ABa. 

Continuing in this way, it is possible to build up a detailed picture of the 
decisive events in nematode development, and of the genetically specified 
machinery that controls them. 

Cells Change Over Time in Their Responsiveness to 
Developmental Signals 

The complexity of the adult nematode body is achieved through repeated use of 
a handful of patterning mechanisms, including those we have just seen in action 
in the early embryo. For example, cell divisions with a molecular asymmetry 
dependent on the Popl gene regulatory proteins occur throughout C. elegans 
development, creating anterior and posterior sister cells with different charac- 
ters. 

As emphasized earlier, while the same few types of signals act repeatedly at 
different times and places, the effects they have are different because the cells 
are programmed to respond differently according to their age and their past his- 
tory. We have seen, for example, that at the four- cell stage of development, one 
cell, ABp, changes its developmental potential because of a signal received via 
the Notch pathway. At the 12-cell stage of development, the granddaughters of 
the ABp cell and the granddaughters of the ABa cell both encounter another 
Notch signal, this time from a daughter of the EMS cell. The ABa granddaughter 
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changes its internal state in response to this signal and begins to form the phar- 
ynx. The ABp granddaughter does no such thing — the earlier exposure to a 
Notch signal has made it unresponsive. Thus, at different times in their history, 
both ABa lineage cells and ABp lineage cells respond to Notch, but the outcomes 
are different. Somehow a Notch signal at the 12-cell stage induces pharynx, but 
a Notch signal at the 4-cell stage has other effects — which include the preven- 
tion of pharynx induction by Notch at a later stage. This phenomenon, in which 
the same signaling mechanism evokes different effects at different stages and in 
different contexts — is seen in the development of all animals, and in all of them 
Notch signaling is used repeatedly in this way. 



Heterochronic Genes Control the Timing of Development 

A cell does not have to receive an external cue in order to change: one set of reg- 
ulatory molecules inside the cell can provoke the production of another, and the 
cell can thus step through a series of different states through its own internal 
mechanisms. These states differ not only in their responsiveness to external sig- 
nals, but also in other aspects of their internal chemistry, including proteins that 
stop or start the cell-division cycle. In this way, the internal mechanisms of the 
cell, together with the past and present signals received, dictate both the 
sequence of biochemical changes in the cell and the timing of its cell divisions. 

The specific molecular details of the mechanisms governing the temporal 
program of development are still mysterious. Remarkably little is known, even in 
the nematode embryo with its rigidly predictable pattern of cell divisions, about 
how the sequence of cell divisions is controlled. However, for the later stages, 
when the larva feeds and grows and moults to become an adult, it has been pos- 
sible to identify some of the genes that control the timing of cellular events. 
Mutations in these genes cause heterochronic phenotypes: the cells in a larva of 
one stage behave as though they belonged to a larva of a different stage, or cells 
in the adult carry on dividing as though they belonged to a larva (Figure 22-22). 

Through genetic analyses, one can determine that the products of the hete- 
rochronic genes act in series, forming regulatory cascades. Curiously, two genes 
at the top of their respective cascades, called Lin4 and Let7, do not code for pro- 
teins but for microRNAs — short untranslated regulatory RNA molecules, 21 or 
22 nucleotides long. These act by binding to complementary sequences in the 
noncoding regions of mRNA molecules transcribed from other heterochronic 
genes, thereby inhibiting their translation and promoting their degradation, as 
discussed in Chapter 7. Increasing levels of Lin4 RNA govern the progression 
from larval stage- 1 cell behavior to larval stage-3 cell behavior; increasing levels 
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Figure 22-22 Heterochronic mutations in 
the Lin 14 gene of C. elegans. The effects 
on only one of the many affected lineages 
are shown. The loss-of-function (recessive) 
mutation in Lin14 causes premature 
occurrence of the pattern of cell division 
and differentiation characteristic of a late 
larva, so that the animal reaches its final 
state prematurely and with an abnormally 
small number of cells. The gain-of-function 
(dominant) mutation has the opposite 
effect, causing cells to reiterate the 
patterns of cell divisions characteristic of 
the first larval stage, continuing through 
as many as five or six molt cycles and 
persisting in the manufacture of an 
immature type of cuticle. The cross 
denotes a programmed cell death. Green 
lines represent cells that contain Lin 14 
protein (which binds to DNA), red lines 
those that do not. In normal development 
the disappearance of Lin14 is triggered by 
the beginning of larval feeding. (After 
V. Ambros and H.R. Horvitz, Science 
226:409-416, 1984, with permission from 
AAAS; and P. Arasu, B. Wightman and 
G. Ruvkun, Growth Dev. Aging 
5:1 825-1 833, 1991, with permission from 
Growth Publishing Co., Inc.) 
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ofLet7RNA govern the progression from late larva to adult. In fact, Lin4 and Let7 
were the first microRNAs to be described in any animal: it was through develop- 
mental genetic studies in C. elegans that the importance of this whole class of 
molecules for gene regulation in animals was discovered. 

RNA molecules that are identical or almost identical to the Let7 RNA are 
found in many other species, including Drosophila, zebrafish, and human. 
Moreover, these RNAs appear to act in a similar way to regulate the level of their 
target mRNA molecules, and the targets themselves are homologous to the tar- 
gets of Let7RNA in the nematode. In Drosophila, this system of molecules seems 
to be involved in the metamorphosis of the larva into a fly, hinting at a conserved 
role in governing the timing of developmental transitions. 



Cells Do Not Count Cell Divisions in Timing Their Internal 
Programs 

Since the steps of cell specialization have to be coordinated with cell divisions, 
it is often suggested that the cell division cycle might serve as a clock to control 
the tempo of other events in development. In this view, changes of internal state 
would be locked to passage through each division cycle: the cell would click to 
the next state as it went through mitosis, so to speak. Although there are indeed 
some cases where changes of cell state are conditional on cell cycle events, this 
is far from being the general rule. Cells in developing embryos, whether they be 
worms, flies, or vertebrates, usually carry on with their standard timetable of 
determination and differentiation even when progress through the cell- division 
cycle is artificially blocked. Necessarily, there are some abnormalities, if only 
because a single undivided cell cannot differentiate in two ways at once. But in 
most cases that have been studied, it seems that the cell changes its state with 
time more or less regardless of cell division, and that this changing state controls 
both the decision to divide and the decision as to when and how to specialize. 



Selected Cells Die by Apoptosis as Part of the Program of 
Development 

The control of cell numbers in development depends on cell death as well as cell 
division. A C. elegans hermaphrodite generates 1030 somatic cell nuclei in the 
course of its development, but 131 of the cells die. These programmed cell 
deaths occur in an absolutely predictable pattern. In C. elegans, they can be 
chronicled in detail, because one can trace the fate of each individual cell and 
see which dies, watching as each suicide victim undergoes apoptosis and is 
rapidly engulfed and digested by neighboring cells (Figure 22-23). In other 
organisms, where close observation is harder, such deaths easily go unnoticed; 
but cell death by apoptosis is probably the fate of a substantial fraction of the 
cells produced in most animals, playing an essential part in generating an indi- 
vidual with the right cell types in the right numbers and places, as discussed in 
Chapter 18. 

Genetic screens in C. elegans have been crucial in identifying the genes that 
bring about apoptosis and in highlighting its importance in development. Three 
genes, called Ced3, Ced4, and Egll (Ced stands for cell death abnormal), are 
found to be required for the 131 normal cell deaths to occur. If these genes are 
inactivated by mutation, cells that are normally fated to die survive instead, dif- 
ferentiating as recognizable cell types such as neurons. Conversely, over-expres- 
sion or misplaced expression of the same genes causes many cells to die that 



Figure 22-23 Apoptotic cell death in C. elegans. Death depends on 
expression of the Ced3 and Ced4 genes in the absence of Ced9 
expression — all in the dying cell itself. The subsequent engulfment and 
disposal of the remains depend on expression of other genes in the 
neighboring cells. 
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would normally survive, and the same effect results from mutations that inacti- 
vate another gene, Ced9, which normally represses the death program. 

All these genes code for conserved components of the cell-death machinery 
As described in Chapter 18, Ced3 codes for a caspase homolog, while Ced4, Ced9, 
and Egll are respectively homologs of Apafl, Bcl2, and Bad. Without the insights 
that came from detailed analysis of the development of the transparent, geneti- 
cally tractable nematode worm, it would have been very much harder to dis- 
cover these genes and understand the cell-death process in vertebrates. 



Summary 

The development of the small, relatively simple, transparent nematode worm 
Caenorhabditis elegans is extraordinarily reproducible and has been chronicled in 
detail, so that a cell at any given position in the body has the same lineage in every 
individual, and this lineage is fully known. Also, the genome has been completely 
sequenced. Thus, powerful genetic and microsurgical approaches can be combined to 
decipher developmental mechanisms. As in other organisms, development depends on 
an interplay of cell-cell interactions and cell- autonomous processes. Development 
begins with an asymmetric division of the fertilized egg, dividing it into two smaller 
cells containing different cell-fate determinants. The daughters of these cells interact 
via the Notch and Wnt cell signaling pathways to create a more diverse array of cell 
states. Meanwhile, through further asymmetric divisions one cell inherits materials 
from the egg that determine it at an early stage as progenitor of the germ line. 

Genetic screens identify the sets of genes responsible for these and later steps in 
development, including, for example, cell-death genes that control the apoptosis of a 
specific subset of cells as part of the normal developmental program. Heterochronic 
genes that govern the timing of developmental events have also been found, although 
in general our understanding of temporal control of development is still very poor. 
There is good evidence, however, that the tempo of development is not set by the count- 
ing of cell divisions. 



DROSOPHILA AND THE MOLECULAR GENETICS OF 
PATTERN FORMATION: GENESIS OF THE BODY 
PLAN 

It is the fly Drosophila melanogaster (Figure 22-24), more than any other organ- 
ism, that has transformed our understanding of how genes govern the patterning 
of the body. The anatomy of Drosophila is more complex than that of C. elegans, 



Figure 22-24 Drosophila melanogaster. 

Dorsal view of a normal adult fly. 
(A) Photograph. (B) Labeled drawing. 
(Photograph courtesy of E.B. Lewis.) 
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with more than 100 times as many cells, and it shows more obvious parallels 
with our own body structure. Surprisingly, the fly has fewer genes than the 
worm — about 14,000 as compared with 20,000. On the other hand, it has almost 
twice as much DNA per gene (about 10,000 nucleotides on average, as compared 
with about 5000), most of this being noncoding sequence. The molecular con- 
struction kit has fewer types of parts, but the assembly instructions — as speci- 
fied by the regulatory sequences in the non- coding DNA — seem to be more 
voluminous. 

Decades of genetic study, culminating in massive systematic genetic 
screens, have yielded a catalogue of the developmental control genes that define 
the spatial pattern of cell types and body structures of the fly, and molecular 
biology has given us the tools to watch these genes in action. By in situ 
hybridization using DNA or RNA probes on whole embryos, or by staining with 
labeled antibodies to reveal the distribution of specific proteins, one can 
observe directly how the internal states of the cells are defined by the sets of reg- 
ulatory genes that they express at different times of development. Moreover, by 
analyzing animals that are a patchwork of mutant and nonmutant cells, one can 
discover how each gene operates as part of a system to specify the organization 
of the body. 

Most of the genes controlling the pattern of the body in Drosophila turn 
out to have close counterparts in higher animals, including ourselves. In fact, 
many of the basic devices for defining the body plan and patterning individual 
organs and tissues are astonishingly similar. Thus, quite surprisingly, the fly 
has provided the key to understanding the molecular genetics of our own 
development. 

Flies, like nematode worms, are ideal for genetic studies: cheap to breed, 
easy to mutagenize, and rapid in their reproductive cycle. But there is a more 
fundamental reason why they have been so important for developmental 
geneticists. As emphasized earlier, as a result of gene duplications, vertebrate 
genomes often contain two or three homologous genes corresponding to a sin- 
gle gene in the fly. A mutation that disrupts one of these genes very often fails to 
reveal the gene's core function, because the other homologs share this function 
and remain active. In the fly, with its more economical gene set, this phe- 
nomenon of genetic redundancy is less prevalent. The phenotype of a single 
mutation in the fly therefore more often directly uncovers the function of the 
mutant gene. 
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The Insect Body Is Constructed as a Series of Segmental Units 

The timetable of Drosophila development, from egg to adult, is summarized in 
Figure 22-25. The period of embryonic development begins at fertilization and 
takes about a day, at the end of which the embryo hatches out of the egg shell to 
become a larva. The larva then passes through three stages, or instars, separated 
by molts in which it sheds its old coat of cuticle and lays down a larger one. At 
the end of the third instar it pupates. Inside the pupa, a radical remodeling of the 
body takes place — a process called metamorphosis. Eventually, about nine days 
after fertilization, an adult fly, or imago, emerges. 

The fly consists of a head, with mouth, eyes, and antennae, followed by three 
thoracic segments (numbered Tl to T3), and eight or nine abdominal segments 
(numbered Al to A9). Each segment, although different from the others, is built 
according to a similar plan. Segment Tl, for example, carries a pair of legs, T2 car- 
ries a pair of legs plus a pair of wings, and T3 carries a pair of legs plus a pair of 
halteres — small knob-shaped balancers important in flight, evolved from the sec- 
ond pair of wings that more primitive insects possess. The quasi- repetitive seg- 
mentation develops in the early embryo during the first few hours after fertiliza- 
tion (Figure 22-26), but it is more obvious in the larva (Figure 22-27), where the 
segments look more similar than in the adult. In the embryo it can be seen that 
the rudiments of the head, or at least the future adult mouth parts, are likewise 
segmental. At the two ends of the animal, however, there are highly specialized 
terminal structures that are not segmentally derived. 
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Figure 22-25 Synopsis of Drosophila 
development from egg to adult fly. 
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The boundaries between segments are traditionally defined by visible 
anatomical markers; but in discussing gene expression patterns it is often con- 
venient to draw a different set of segmental boundaries, defining a series of 
segmental units called parasegments, half a segment out of register with the 
traditional divisions (see Figure 22-27). 



Figure 22-26 The origins of the 
Drosophila body segments during 
embryonic development. <AACG> 
The embryos are seen in side view in 
drawings (A-C) and corresponding 
scanning electron micrographs (D-F). 
(A and D) At 2 hours the embryo is at the 
syncytial blastoderm stage (see Figure 
22-28) and no segmentation is visible, 
although a fate map can be drawn 
showing the future segmented regions 
(color in A). (B and E) At 5-8 hours the 
embryo is at the extended germ band 
stage: gastrulation has occurred, 
segmentation has begun to be visible, 
and the segmented axis of the body has 
lengthened, curving back on itself at the 
tail end so as to fit into the egg shell. 
(C and F) At 10 hours the body axis has 
contracted and become straight again, 
and all the segments are clearly defined. 
The head structures, visible externally at 
this stage, will subsequently become 
tucked into the interior of the larva, to 
emerge again only when the larva goes 
through pupation to become an adult. 
(D and E, courtesy of F.R. Turner and 
A.P. Mahowald, Dev. Biol. 50:95-108, 1976; 
F, from J. P. Petschek, N. Perrimon and 
A.P. Mahowald, Dev. Biol. 1 1 9:1 75-1 89, 
1987. All with permission from 
Academic Press.) 



Drosophila Begins Its Development as a Syncytium 

The egg of Drosophila is about 0.5 mm long and 0.15 mm in diameter, with a 
clearly defined polarity. Like the eggs of other insects, but unlike most verte- 
brates, it begins its development in an unusual way: a series of nuclear divisions 
without cell division creates a syncytium. The early nuclear divisions are syn- 
chronous and extremely rapid, occurring about every 8 minutes. The first nine 
divisions generate a cloud of nuclei, most of which migrate from the middle of 
the egg toward the surface, where they form a monolayer called the syncytial 
blastoderm. After another four rounds of nuclear division, plasma membranes 
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Figure 22-27 The segments of the 
Drosophila larva and their 
correspondence with regions of the 
blastoderm. The parts of the embryo 
that become organized into segments 
are shown in color. The two ends of the 
embryo, shaded gray, are not segmented 
and become tucked into the interior of 
the body to form the internal structures 
of the head and gut. (The future external, 
segmental structures of the adult head 
are also transiently tucked into the 
interior in the larva.) Segmentation in 
Drosophila can be described in terms of 
either segments or parasegments: the 
relationship is shown in the middle part 
of the figure. Parasegments often 
correspond more simply to patterns of 
gene expression. The exact number of 
abdominal segments is debatable: eight 
are clearly defined, and a ninth is 
present vestigially in the larva, but 
absent in the adult. 
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Figure 22-28 Development of the Drosophila egg from fertilization to 
the cellular blastoderm stage. (A) Schematic drawings. (B) Surface view — 
an optical-section photograph of blastoderm nuclei undergoing mitosis at 
the transition from the syncytial to the cellular blastoderm stage. Actin is 
stained green, chromosomes orange. (A, after H.A. Schneiderman, in Insect 
Development [P.A. Lawrence, ed.], pp. 3-34. Oxford, UK: Blackwell, 1976; 
B, courtesy of William Sullivan.) 

grow inward from the egg surface to enclose each nucleus, thereby converting 
the syncytial blastoderm into a cellular blastoderm consisting of about 6000 sep- 
arate cells (Figure 22-28). About 15 of the nuclei populating the extreme poste- 
rior end of the egg are segregated into cells a few cycles earlier; these pole cells 
are the germ-line precursors (primordial germ cells) that will give rise to eggs or 
sperm. 

Up to the cellular blastoderm stage, development depends largely — 
although not exclusively — on stocks of maternal mRNA and protein that accu- 
mulated in the egg before fertilization. The frantic rate of DNA replication and 
nuclear division evidently gives little opportunity for transcription. After cellu- 
larization, cell division continues in a more conventional way, asynchronously 
and at a slower rate, and the rate of transcription increases dramatically. Gastru- 
lation begins a little while before cellularization is complete, when parts of the 
sheet of cells forming the exterior of the embryo start to tuck into the interior to 
form the gut, the musculature, and associated internal tissues. A little later and 
in another region of the embryo, a separate set of cells move from the surface 
epithelium into the interior to form the central nervous system. By marking and 
following the cells through these various movements, one can draw a fate map 
for the monolayer of cells on the surface of the blastoderm (Figure 22-29). 

As gastrulation nears completion, a series of indentations and bulges appear 
in the surface of the embryo, marking the subdivision of the body into segments 
along its anteroposterior axis (see Figure 22-26). Soon a fully segmented larva 
emerges, ready to start eating and growing. Within the body of the larva, small 
groups of cells remain apparently undifferentiated, forming structures called 
imaginal discs. These will grow as the larva grows, and eventually they will give rise 
to most of the structures of the adult body, as we shall see later. 

A head end and a tail end, a ventral (belly) side and a dorsal (back) side, a 
gut, a nervous system, a series of body segments — these are all features of the 
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Figure 22-29 Fate map of a Drosophila 
embryo at the cellular blastoderm stage. 

The embryo is shown in side view and in 
cross section, displaying the relationship 
between the dorsoventral subdivision into 
future major tissue types and the 
anteroposterior pattern of future 
segments. A heavy line encloses the 
region that will form segmental structures. 
During gastrulation the cells along the 
ventral midline invaginate to form 
mesoderm, while the cells fated to form 
the gut invaginate near each end of the 
embryo. (After V. Hartenstein, 
G.M.Technau and J.A. Campos-Ortega, 
Wilhelm Roux' Arch. Dev. Biol. 1 94:21 3-21 6, 
1985. With permission from Elsevier.) 
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basic body plan that Drosophila shares with many other animals, including our- 
selves. We begin our account of the mechanisms of Drosophila development by 
considering how this body plan is set up. 



Genetic Screens Define Groups of Genes Required for Specific 
Aspects of Early Patterning 

By carrying out a series of genetic screens based on saturation mutagenesis (dis- 
cussed in Chapter 8), it has been possible to amass a collection of Drosophila 
mutants that appears to include changes in a large proportion of the genes 
affecting development. Independent mutations in the same gene can be distin- 
guished from mutations in separate genes by a complementation test (see Panel 
8-1, p. 555), leading to a catalog of genes classified according to their mutant 
phenotypes. In such a catalog, a group of genes with very similar mutant phe- 
notypes will often code for a set of proteins that work together to perform a par- 
ticular function. 

Sometimes the developmental functions revealed by mutant phenotypes 
are those that one would expect; sometimes they are a surprise. A large-scale 
genetic screen focusing on early Drosophila development revealed that the key 
genes fall into a relatively small set of functional classes defined by their mutant 
phenotypes. Some — the egg-polarity genes (Figure 22-30) — are required to 
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Figure 22-30 The domains of the 
anterior, posterior, and terminal 
systems of egg-polarity genes. The 

upper diagrams show the fates of the 
different regions of the egg/early embryo 
and indicate (in white) the parts that fail 
to develop if the anterior, posterior, or 
terminal system is defective. The middle 
row shows schematically the appearance 
of a normal larva and of mutant larvae 
that are defective in a gene of the 
anterior system (for example, Bicoid), of 
the posterior system (for example, 
Nanos), or of the terminal system (for 
example, Torso). The bottom row of 
drawings shows the appearances of 
larvae in which none or only one of the 
three gene systems is functional. The 
lettering beneath each larva specifies 
which systems are intact (APT for a 
normal larva, - P T for a larva where the 
anterior system is defective but the 
posterior and terminal systems are intact, 
and so on). Inactivation of a particular 
gene system causes loss of the 
corresponding set of body structures; the 
body parts that form correspond to the 
gene systems that remain functional. 
Note that larvae with a defect in the 
anterior system can still form terminal 
structures at their anterior end, but these 
are of the type normally found at the rear 
end of the body rather than the front of 
the head. (Slightly modified from 
D. St. Johnston and C. Nusslein-Volhard, 
Cell 68:201 -21 9, 1992. With permission 
from Elsevier.) 
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define the anteroposterior and dorsoventral axes of the embryo and mark out its 
two ends for special fates, by mechanisms involving interactions between the 
oocyte and surrounding cells in the ovary Others, the gap genes, are required in 
specific broad regions along the anteroposterior axis of the early embryo to 
allow their proper development. A third category, the pair-rule genes, are 
required, more surprisingly, for development of alternate body segments. A 
fourth category, the segment polarity genes, are responsible for organizing the 
anteroposterior pattern of each individual segment. 

The discovery of these four systems of genes and the subsequent analysis of 
their functions (an enterprise that still continues) was a famous tour-de-force of 
developmental genetics. It had a revolutionary impact on all of developmental 
biology by showing the way toward a systematic, comprehensive account of the 
genetic control of embryonic development. In this section, we shall summarize 
only briefly the conclusions relating to the earliest phases of Drosophila devel- 
opment, because these are insect-specific; we dwell at greater length on the 
parts of the process that illustrate more general principles. 



Interactions of the Oocyte With Its Surroundings Define the Axes 
of the Embryo: the Role of the Egg-Polarity Genes 

Surprisingly, the earliest steps of animal development are among the most vari- 
able, even within a phylum. A frog, a chicken, and a mammal, for example, even 
though they develop in similar ways later, make eggs that differ radically in size 
and structure, and they begin their development with different sequences of cell 
divisions and cell specialization events. 

The style of early development that we have described for C. elegans is typi- 
cal of many classes of animals. In contrast, the early development of Drosophila 
represents a rather extreme variant. The main axes of the future insect body are 
defined before fertilization by a complex exchange of signals between the unfer- 
tilized egg, or oocyte, and the follicle cells that surround it in the ovary (Figure 
22-31). Then, in the syncytial phase following fertilization, an exceptional 
amount of patterning occurs in the array of rapidly dividing nuclei, before the 
first partitioning of the egg into separate cells. Here, there is no need for the 
usual forms of cell-cell communication involving transmembrane signaling; 
neighboring regions of the early Drosophila embryo can communicate by 
means of gene regulatory proteins and mRNA molecules that diffuse or are 
actively transported through the cytoplasm of the giant multinuclear cell. 

In the stages before fertilization, the anteroposterior axis of the future 
embryo becomes defined by three systems of molecules that create landmarks 
in the oocyte (Figure 22-32). Following fertilization, each landmark serves as a 
beacon, providing a signal, in the form of a morphogen gradient, that organizes 
the developmental process in its neighborhood. Two of these signals are gener- 
ated from localized deposits of specific mRNA molecules. The future anterior 
end of the embryo contains a high concentration of mRNA for a gene regulatory 
protein called Bicoid; this mRNA is translated to produce Bicoid protein, which 
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Figure 22-31 A Drosophila oocyte in its 
follicle. The oocyte is derived from a germ 
cell that divides four times to give a family 
of 1 6 cells that remain in communication 
with one another via cytoplasmic bridges 
(gray). One member of the family group 
becomes the oocyte, while the others 
become nurse cells, which make many of 
the components required by the oocyte 
and pass them into it via the cytoplasmic 
bridges. The follicle cells that partially 
surround the oocyte have a separate 
ancestry. As indicated, they are the 
sources of terminal and ventral egg- 
polarizing signals. 
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diffuses away from its source to form a concentration gradient with its maxi- 
mum at the anterior end of the egg. The future posterior end of the embryo con- 
tains a high concentration of mRNA for a regulator of translation called Nanos, 
which sets up a posterior gradient in the same way The third signal is generated 
symmetrically at both ends of the egg, by local activation of a transmembrane 
tyrosine kinase receptor called Torso. The activated receptor exerts its effects 
over a shorter range, marking the sites of specialized terminal structures that will 
form at the head and tail ends of the future larva and also defining the rudiments 
of the future gut. The three sets of genes responsible for these localized deter- 
minants are referred to as the anterior, posterior, and terminal sets of egg- 
polarity genes. 

A fourth landmark defines the dorsoventral axis (see Figure 22-32): a protein 
that is produced by follicle cells underneath the future ventral region of the 
embryo leads to localized activation of another transmembrane receptor, called 
Toll, in the oocyte membrane. The genes required for this function are called 
dorsoventral egg-polarity genes. 

All the egg-polarity genes in these four classes are maternal- effect genes: it 
is the mother's genome, not the zygotic genome, that is critical. Thus, a fly whose 
chromosomes are mutant in both copies of the Bicoid gene but who is born from 
a mother carrying one normal copy of Bicoid develops perfectly normally, with- 
out any defects in the head pattern. However, if that daughter fly is a female no 
functional Bicoid mRNA can be deposited into the anterior part of her own eggs, 
and all of these will develop into headless embryos regardless of the father's 
genotype. 

Each of the four egg-polarity signals — provided by Bicoid, Nanos, Torso, and 
Toll — exerts its effect by regulating (directly or indirectly) the expression of genes 
in the nuclei of the blastoderm. The use of these particular molecules to orga- 
nize the egg is not a general feature of early animal development — indeed, only 
Drosophila and closely related insects possess a Bicoid gene. And Toll has been 
coopted here for dorsoventral patterning; its more ancient and universal func- 
tion is in the innate immune response, as discussed in Chapter 24. 

Nevertheless, the egg-polarity system shows some highly conserved fea- 
tures. For example, the localization of Nanos mRNA at one end of the egg is 
linked to, and dependent on, the localization of germ-cell determinants at that 
site, just as it is in C. elegans. Later in development, as the zygotic genome comes 
into play under the influence of the egg-polarity system, more similarities with 
other animal species become apparent. We shall use the dorsoventral system to 
illustrate this point. 



Figure 22-32 The organization of the four 
egg-polarity gradient systems. The 

receptors Toll and Torso are distributed all 
over the membrane; the coloring in the 
diagrams on the right indicates where they 
become activated by extracellular ligands. 



The Dorsoventral Signaling Genes Create a Gradient of a Nuclear 
Gene Regulatory Protein 

Localized activation of the Toll receptor on the ventral side of the egg controls 
the distribution of Dorsal, a gene regulatory protein inside the egg. The Dorsal 
protein belongs to the same family as the NFkB gene regulatory protein of ver- 
tebrates (discussed in Chapter 15). Its Toll- regulated activity, like that of NFkB, 
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depends on its translocation from the cytoplasm, where it is held in an inactive 
form, to the nucleus, where it regulates gene expression. In the newly laid egg, 
both the Dorsal mRNA (detected by in situ hybridization) and the protein it 
encodes (detected with antibodies) are distributed uniformly in the cytoplasm. 
After the nuclei have migrated to the surface of the embryo to form the blasto- 
derm, however, a remarkable redistribution of the Dorsal protein occurs: dor- 
sally the protein remains in the cytoplasm, but ventrally it is concentrated in the 
nuclei, with a smooth gradient of nuclear localization between these two 
extremes (Figure 22-33). The signal transmitted by the Toll protein controls this 
redistribution of Dorsal through a signaling pathway that is essentially the same 
as the Toll-dependent pathway involved in innate immunity. 

Once inside the nucleus, the Dorsal protein turns on or off the expression of 
different sets of genes depending on its concentration. The expression of each 
responding gene depends on its regulatory DNA — specifically, on the number 
and affinity of the binding sites that this DNA contains for Dorsal and other reg- 
ulatory proteins. In this way, the regulatory DNA can be said to interpret the 
positional signal provided by the Dorsal protein gradient, so as to define a 
dorsoventral series of territories — distinctive bands of cells that run the length of 
the embryo (Figure 22-34A). Most ventrally — where the concentration of Dorsal 
protein is highest — it switches on, for example, the expression of a gene called 
Twist that is specific for mesoderm (Figure 22-35) . Most dorsally, where the con- 
centration of Dorsal protein is lowest, the cells switch on Decapentaplegic (Dpp). 
And in an intermediate region, where the concentration of Dorsal protein is high 
enough to repress Dpp but too low to activate Twist, the cells switch on another 
set of genes, including one called Short gastrulation (Sog). 




100 |um 

Figure 22-33 The concentration 
gradient of Dorsal protein in the nuclei 
of the blastoderm, as revealed by an 
antibody. Dorsally, the protein is present 
in the cytoplasm and absent from the 
nuclei; ventrally, it is depleted in the 
cytoplasm and concentrated in the 
nuclei. (From S. Roth, D. Stein and 
C. Nusslein-Volhard, Cell 59:1 1 89-1 202, 
1989. With permission from Elsevier.) 
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Figure 22-34 Morphogen gradients 
patterning the dorsoventral axis of the 
embryo. (A) The gradient of Dorsal 
protein defines three broad territories of 
gene expression, marked here by the 
expression of three representative 
genes — Dpp, Sog, and Twist. (B) Slightly 
later, the cells expressing Dpp and Sog 
secrete, respectively, the signal proteins 
Dpp (a TGFp family member) and Sog (an 
antagonist of Dpp). These two proteins 
diffuse and interact with one another 
(and with certain other factors) to set up 
a gradient of Dpp activity that guides a 
more detailed patterning process. 
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Dpp and Sog Set Up a Secondary Morphogen Gradient to Refine 
the Pattern of the Dorsal Part of the Embryo 

Products of the genes directly regulated by the Dorsal protein generate in turn 
more local signals that define finer subdivisions of the dorsoventral axis. These 
signals act after cellularization, and take the form of conventional extracellular 
signal molecules. In particular, Dpp codes for the secreted Dpp protein, which 
forms a local morphogen gradient in the dorsal part of the embryo. The gene 
Sog, meanwhile, codes for another secreted protein that is produced in the neu- 
rogenic ectoderm and acts as an antagonist of Dpp. The opposing diffusion gra- 
dients of these two proteins create a steep gradient of Dpp activity. The highest 
Dpp activity levels, in combination with certain other factors, cause develop- 
ment of the most dorsal tissue of all — extraembryonic membrane; intermediate 
levels cause development of dorsal ectoderm; and very low levels allow develop- 
ment of neurogenic ectoderm (Figure 22-34B). 



Figure 22-35 Origin of the mesoderm 
from cells expressing Twist. Embryos 
were fixed at successive stages, cross- 
sectioned, and stained with an antibody 
against the Twist protein, a gene 
regulatory protein of the bHLH family. 
The cells that express Twist move into the 
interior of the embryo to form 
mesoderm. (From M. Leptin, J. Casal, 
B. Grunewald and R. Reuter, Development 
Suppl. 23-31, 1992. With permission from 
The Company of Biologists.) 



The Insect Dorsoventral Axis Corresponds to the Vertebrate 
Ventrodorsal Axis 

Dpp is a member of the TGF(3 superfamily of signal molecules that is also impor- 
tant in vertebrates; Sog is a homolog of the vertebrate protein chordin. It is strik- 
ing that a Dpp homolog, BMP4, and chordin work together in vertebrates in the 
same way as do Dpp and Sog in Drosophila. These two proteins control the 
dorsoventral pattern of the ectoderm, with high levels of chordin defining the 
region that is neurogenic and high levels of BMP4 activity defining the region that 
is not. This, combined with other molecular parallels, strongly suggests that this 
part of the body plan has been conserved between insects and vertebrates. How- 
ever, the axis is inverted, so that dorsal in the fly corresponds to ventral in the ver- 
tebrate (Figure 22-36). At some point in its evolutionary history, it seems, the 
ancestor of one of these classes of animals took to living life upside down. 



Three Classes of Segmentation Genes Refine the 
Anterior-Posterior Maternal Pattern and Subdivide the Embryo 

After the initial gradients of Bicoid and Nanos are created to define the antero- 
posterior axis, the segmentation genes refine the pattern. Mutations in any one 
of the segmentation genes alter the number of segments or their basic internal 
organization without affecting the global polarity of the embryo. Segmentation 
genes are expressed by subsets of cells in the embryo, so their products are the 
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Figure 22-36 The vertebrate body plan 
as a dorsoventral inversion of the insect 
body plan. The mechanism of 
dorsoventral patterning in a vertebrate 
embryo is discussed in more detail later 
in this chapter. Note the correspondence 
with regard to the circulatory system as 
well as the gut and nervous system. In 
insects, the circulatory system is 
represented by a tubular heart and a 
main dorsal blood vessel, which pumps 
blood out into the tissue spaces through 
one set of apertures and receives blood 
back from the tissues through another 
set. In contrast with vertebrates, there is 
no system of capillary vessels to contain 
the blood as it percolates through the 
tissues. Nevertheless, heart development 
depends on homologous genes in 
vertebrates and insects, reinforcing the 
relationship between the two body plans. 
(After E.L. Ferguson, Curr. Opin. Genet. 
Dev. 6:424-431, 1996. With permission 
from Elsevier.) 



DROSOPHILA AND THE MOLECULAR GENETICS OF PATTERN FORMATION 



1337 



GAP GENE (Kruppel) 



PAIR-RULE GENE 
(Even-skipped) 



SEGMENT-POLARITY 
GENE (Gooseberry) 






first components that the embryo's own genome, rather than the maternal 
genome, contributes to embryonic development. They are therefore called 
zygotic-effect genes to distinguish them from the earlier maternal- effect genes. 

The segmentation genes fall into three groups according to their mutant 
phenotypes (Figure 22-37). It is convenient to think of these three groups as act- 
ing in sequence, although in reality their functions overlap in time. First come a 
set of at least six gap genes, whose products mark out coarse subdivisions of the 
embryo. Mutations in a gap gene eliminate one or more groups of adjacent seg- 
ments, and mutations in different gap genes cause different but partially over- 
lapping defects. In the mutant Kruppel, for example, the larva lacks eight seg- 
ments, from Tl to A5 inclusive. 

The next category of segmentation genesis a set of eight pair-rule genes. 
Mutations in these cause a series of deletions affecting alternate segments, leav- 
ing the embryo with only half as many segments as usual. While all the pair- rule 
mutants display this two-segment periodicity, they differ in the precise posi- 
tioning of the deletions relative to the segmental or parasegmental borders. The 
pair-rule mutant Even-skipped (Eve), for example, which is discussed in Chapter 
7, lacks the whole of each odd-numbered parasegment, while the pair-rule 
mutant Fushi tarazu (Ftz) lacks the whole of each even-numbered parasegment, 
and the pair-rule mutant Hairy lacks a series of regions that are of similar width 
but out of register with the parasegmental units. 

Finally, there are at least 10 segment-polarity genes. Mutations in these 
genes produce larvae with a normal number of segments but with a part of each 
segment deleted and replaced by a mirror- image duplicate of all or part of the 
rest of the segment. In Gooseberry mutants, for example, the posterior half of 
each segment (that is, the anterior half of each parasegment) is replaced by an 
approximate mirror image of the adjacent anterior half- segment (see Figure 
22-37). 

We see later that, in parallel with the segmentation process, a further set of 
genes, the homeotic selector genes, serve to define and preserve the differences 
between one segment and the next. 

The phenotypes of the various segmentation mutants suggest that the seg- 
mentation genes form a coordinated system that subdivides the embryo pro- 
gressively into smaller and smaller domains along the anteroposterior axis, dis- 
tinguished by different patterns of gene expression. Molecular genetics has 
helped to reveal how this system works. 



Figure 22-37 Examples of the 
phenotypes of mutations affecting the 
three types of segmentation genes. In 

each case the areas shaded in green on 
the normal larva (left) are deleted in the 
mutant or are replaced by mirror-image 
duplicates of the unaffected regions. 
(Modified from C. Nusslein-Volhard and 
E. Wieschaus, Nature 287:795-801, 1980. 
With permission from Macmillan 
Publishers Ltd.) 



The Localized Expression of Segmentation Genes Is Regulated by 
a Hierarchy of Positional Signals 

About three-quarters of the segmentation genes, including all of the gap genes 
and pair-rule genes, code for gene regulatory proteins. Their actions on one 
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Figure 22-38 The regulatory hierarchy of 
egg-polarity, gap, segmentation, and 
homeotic selector genes. The 

photographs show expression patterns of 
representative examples of genes in each 
category, revealed by staining with 
antibodies against the protein products. 
The homeotic selector genes, discussed 
below, define the lasting differences 
between one segment and the next. 
(Photographs from top (i) from W. Driever 
and C. Nusslein-Volhard, Cell 54:83-1 04, 
1988. With permission from Elsevier; 
(ii) courtesy of Jim Langeland, Steve 
Paddock, Sean Carroll, and the Howard 
Hughes Medical Institute; (iii) from 
P.A. Lawrence, The Making of a Fly. Oxford, 
UK: Blackwell, 1 992; (iv) from C. Hama, 
Z. Ali and T.B. Kornberg, Genes Dev. 
4:1079-1093, 1990. With permission from 
Cold Spring Harbor Laboratory Press; 
(v) courtesy of William McGinnis, adapted 
from D. Kosman et a I., Science 305:846, 
2004. With permission from AAAS.) 



another and on other genes can therefore be observed by comparing gene 
expression in normal and mutant embryos. By using appropriate probes to 
detect the gene transcripts or their protein products, one can, in effect, take 
snapshots as genes switch on and off in changing patterns. Repeating the pro- 
cess with mutants that lack a particular segmentation gene, one can begin to 
dissect the logic of the entire gene control system. 

The products of the egg-polarity genes provide the global positional signals 
in the early embryo. These cause particular gap genes to be expressed in partic- 
ular regions. The products of the gap genes then provide a second tier of posi- 
tional signals that act more locally to regulate finer details of patterning through 
the expression of yet other genes, including the pair-rule genes (Figure 22-38). 
The pair-rule genes in turn collaborate with one another and with the gap genes 
to set up a regular periodic pattern of expression of segment-polarity genes, and 
the segment-polarity genes collaborate with one another to define the internal 
pattern of each individual segment. The strategy, therefore, is one of sequential 
induction (see Figure 22-16). By the end of the process, the global gradients pro- 
duced by the egg-polarity genes have triggered the creation of a fine-grained 
pattern through a hierarchy of sequential, progressively more local, positional 
controls. Because the global positional signals that start the process do not have 
to directly specify fine details, the individual cell nuclei do not have to be gov- 
erned with extreme precision by small differences in the concentration of these 
signals. Instead, at each step in the sequence, new signals come into play, pro- 
viding substantial localized differences of concentration to define new details. 
Sequential induction is thus a robust strategy. It works reliably to produce fly 
embryos that all have the same pattern, despite the essential imprecision of bio- 
logical control systems, and despite variations in conditions such as the tem- 
perature at which the fly develops. 
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The Modular Nature of Regulatory DNA Allows Genes to Have 
Multiple Independently Controlled Functions 

The elaborate patterning process just described depends on the long stretches 
of noncoding DNA sequence that control the expression of each of the genes 
involved. These regulatory regions bind multiple copies of the gene regulatory 
proteins produced by the patterning genes expressed earlier. Like an input-out- 
put logic device, an individual gene is thus turned on and off according to the 
particular combination of proteins bound to its regulatory regions at each stage 
of development. In Chapter 7 we describe one particular segmentation gene — 
the pair-rule gene Even-skipped (Eve) — and discuss how the decision whether to 
transcribe the gene is made on the basis of all these inputs (see Figure 7-55). 
This example can be taken further to illustrate some important principles of 
developmental patterning. 

Individual stripes of Eve expression depend on separate regulatory modules 
in the Eve regulatory DNA. Thus, one regulatory module is responsible for driv- 
ing Eve expression in stripes 1 + 5, another for stripe 2, another for stripes 3 + 7 , 
and yet another for stripes 4 + 6 (Figure 22-39). Each regulatory module defines 
a different set of requirements for gene expression according to the concentra- 
tions of the products of the egg-polarity and gap genes. In this way, the Eve reg- 
ulatory DNA serves to translate the complex nonrepetitive pattern of egg-polar- 
ity and gap proteins into the periodic pattern of expression of a pair- rule gene. 

The modular organization of the Eve regulatory DNA just described is typi- 
cal of gene regulation in multicellular animals and plants, and it has profound 
implications. By stringing together sequences of modules that respond to differ- 
ent combinations of regulatory proteins, it is possible to generate almost any 
pattern of gene expression on the basis of almost any other. Modularity, more- 
over, allows the regulatory DNA to define patterns of gene expression that are 
not merely complex, but whose parts are independently adjustable. A change in 
one of the regulatory modules can alter one part of the expression pattern with- 
out affecting the rest, and without requiring changes in regulatory proteins that 
would have repercussions for the expression of other genes in the genome. As 
described in Chapter 7, it is such regulatory DNA that contains the key to the 
complex organization of multicellular plants and animals, and its properties 
make possible the independent adaptability of each part of an organism's body 
structure in the course of evolution. 

Most of the segmentation genes also have important functions at other 
times and places in the development of Drosophila. The Eve gene, for example, 
is expressed in subsets of neurons, in muscle precursor cells, and in various 
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Figure 22-39 Modular organization of 
the regulatory DNA of the Eve gene. In 

the experiment shown, cloned fragments 
of the regulatory DNA were linked to a 
LacZ reporter (a bacterial gene). 
Transgenic embryos containing these 
constructs were then stained by in situ 
hybridization to reveal the pattern of 
expression of LacZ (blue/black), and 
counterstained with an anti-Eve antibody 
(orange) to show the positions of the 
normal Eve expression stripes. Different 
segments of the Eve regulatory DNA 
(ochre) are thus found to drive gene 
expression in regions corresponding to 
different parts of the normal Eve 
expression pattern. Two segments in 
tandem drive expression in a pattern that 
is the sum of the patterns generated by 
each of them individually. Separate 
regulatory modules are responsible for 
different times of gene expression, as well 
as different locations: the leftmost panel 
shows the action of a module that comes 
into play later than the others illustrated 
and drives expression in a subset of 
neurons. (From M. Fujioka et al., 
Development 126:2527-538, 1999. 
With permission from The Company 
of Biologists.) 
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other sites, under the control of additional enhancers (see Figure 22-39). By 
addition of new modules to its regulatory DNA, any gene can be co-opted dur- 
ing evolution for new purposes at new sites in the body, without detriment to its 
other functions. 



Figure 22-40 The formation of Ftz and 
Eve stripes in the Drosophila 
blastoderm. Ftz and Eve are both pair- 
rule genes. Their expression patterns 
(shown in brown for Ftz and in gray for 
Eve) are at first blurred but rapidly resolve 
into sharply defined stripes. (From 
P.A. Lawrence, The Making of a Fly. 
Oxford, UK: Blackwell, 1992.) 



Egg-Polarity, Gap, and Pair-Rule Genes Create a Transient Pattern 
That Is Remembered by Other Genes 

Within the first few hours after fertilization, the gap genes and the pair-rule 
genes are activated. Their mRNA products appear first in patterns that only 
approximate the final picture; then, within a short time — through a series of 
interactive adjustments — the fuzzy initial distribution of gene products resolves 
itself into a regular, crisply defined system of stripes (Figure 22-40). But this sys- 
tem itself is unstable and transient. As the embryo proceeds through gastrula- 
tion and beyond, the regular segmental pattern of gap and pair-rule gene prod- 
ucts disintegrates. Their actions, however, have stamped a permanent set of 
labels — positional values — on the cells of the blastoderm. These positional 
labels are recorded in the persistent activation of certain of the segment-polar- 
ity genes and of the homeotic selector genes, which serve to maintain the seg- 
mental organization of the larva and adult. The segment-polarity gene Engrailed 
provides a good example. Its RNA transcripts are seen in the cellular blastoderm 
in a series of 14 bands, each approximately one cell wide, corresponding to the 
anteriormost portions of the future parasegments (Figure 22-41). 

The segment-polarity genes are expressed in patterns that repeat from one 
parasegment to the next, and their bands of expression appear in a fixed rela- 
tionship to the bands of expression of the pair-rule genes that help to induce 
them. However, the production of this pattern within each parasegment depends 
on interactions among the segment- polarity genes themselves. These interac- 
tions occur at stages when the blastoderm has already become fully partitioned 
into separate cells, so that cell-cell signaling of the usual sort has to come into 
play. A large subset of the segment-polarity genes code for components of two 
signal transduction pathways, the Wnt pathway and the Hedgehog pathway, 
including the secreted signal proteins Wingless (a Wnt family member) and 
Hedgehog. These are expressed in different bands of cells that serve as signaling 
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Figure 22-41 The pattern of expression of 
Engrailed, a segment-polarity gene. The 

Engrailed pattern is shown in a 
5-hour embryo (at the extended germ- 
band stage), a 1 0-hour embryo, and an 
adult (whose wings have been removed in 
this preparation). The pattern is revealed 
by an antibody (brown) against the 
Engrailed protein (for the 5- and 
1 0-hour embryos) or (for the adult) by 
constructing a strain of Drosophila 
containing the control sequences of the 
Engrailed gene coupled to the coding 
sequence of the reporter LacZ, whose 
product is detected histochemically 
through the blue product of a reaction 
that it catalyzes. Note that the Engrailed 
pattern, once established, is preserved 
throughout the animal's life. (From 
C. Hama, Z. Ali and T.B. Kornberg, Genes 
Dev. 4:1 079-1 093, 1990. With permission 
from Cold Spring Harbor Laboratory Press.) 
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centers within each parasegment, and they act to maintain and refine the 
expression of other segment- polarity genes. Moreover, although their initial 
expression is determined by the pair-rule genes, the two signal proteins regulate 
one another's expression in a mutually supportive way, and they proceed to help 
trigger expression of genes such as Engrailed in precisely the correct sites. 

The Engrailed expression pattern will persist throughout life, long after the 
signals that organized its production have disappeared (see Figure 22-41). This 
example illustrates not only the progressive subdivision of the embryo by means 
of more and more narrowly localized signals, but also the transition between the 
transient signaling events of early development and the later stable mainte- 
nance of developmental information. 

Besides regulating the segment-polarity genes, the products of pair-rule 
genes collaborate with the products of gap genes to cause the precisely localized 
activation of a further set of spatial labels — the homeotic selector genes. It is the 
homeotic selector genes that permanently distinguish one parasegment from 
another. In the next section we examine these selector genes in detail and con- 
sider their role in cell memory. 

Summary 

The fly Drosophila has been the foremost model organism for study of the genetics of 
animal development. Like other insects, it begins its development with a series of 
nuclear divisions generating a syncytium, and a large amount of early patterning 
occurs in this single giant multinucleate cell The pattern originates with asymmetry 
in the egg, organized both by localized deposits ofmRNA inside the egg and by signals 
from the follicle cells around it. Positional information in the multinucleate embryo is 
supplied by four intracellular gradients that are set up by the products of four groups 
of maternal- effect genes called egg-polarity genes. These control four distinctions fun- 
damental to the body plan of animals: dorsal versus ventral, endoderm versus meso- 
derm and ectoderm, germ cells versus somatic cells, and head versus rear. 

The egg-polarity genes operate by setting up graded distributions of gene regula- 
tory proteins in the egg and early embryo. The gradients along the anteroposterior axis 
initiate the orderly expression of gap genes, pair-rule genes, segment-polarity genes, 
and homeotic selector genes. These, through a hierarchy of interactions, become 
expressed in some regions of the embryo and not others, progressively subdividing the 
blastoderm into a regular series of repeating modular units called segments. The com- 
plex patterns of gene expression reflect the modular organization of the regulatory 
DNA, with separate enhancers of an individual gene responsible for separate parts of 
its expression pattern. 

The segment-polarity genes come into play toward the end of the segmentation 
process, soon after the syncytium has become partitioned into separate cells, and they 
control the internal patterning of each segment through cell-cell signaling via the Wnt 
(Wingless) and Hedgehog pathways. This leads to persistent localized activation of 
genes such as Engrailed, giving cells a remembered record of their anteroposterior 
address within the segment. Meanwhile, a new cell-cell signaling gradient is also set 
up along the dorsoventral axis, with the TGFfi family member Decapentaplegic (Dpp) 
and its antagonist, Short gastrulation, acting as the morphogens. This gradient helps 
to refine the assignment of different characters to cells at different dorsoventral levels. 
Homologous proteins are also known to control the patterning of the ventrodorsal axis 
in vertebrates. 



HOMEOTIC SELECTOR GENES AND THE 
PATTERNING OF THE ANTEROPOSTERIOR AXIS 

As development proceeds, the body becomes more and more complex. In all this 
growing complexity there is, however, a simplifying feature that puts an under- 
standing of the whole developmental process within our grasp. Again and again, 
in every species and at every level of organization, we find that complex structures 
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are made by repeating a few basic themes with variations. Thus, a limited num- 
ber of basic differentiated cell types, such as muscle cells or fibroblasts, recur 
with subtle individual variations in different sites. These cell types are organized 
into a limited variety of tissue types, such as muscle or tendon, which again are 
repeated with subtle variations in different regions of the body. From the various 
tissues, organs such as teeth or digits are built — molars and incisors, fingers and 
thumbs and toes — a few basic kinds of structure, repeated with variations. 

Wherever we find this phenomenon of modulated repetition, we can break 
down the developmental biologist's problem into two kinds of question: what is 
the basic construction mechanism common to all the objects of the given class, 
and how is this mechanism modified to give the observed variations? The 
embryo uses a combinatorial strategy to generate its complexity, and we can use 
a combinatorial strategy to understand it. 

The segments of the insect body provide a very clear example. We have 
already sketched the way in which the rudiment of a single typical segment is 
constructed. We must now consider how one segment is caused to be different 
from another. 



The Hox Code Specifies Anterior-Posterior Differences 

The first glimpse of a genetic answer to the question of how each segment 
acquires its individual identity came over 80 years ago, with the discovery of the 
first of a set of mutations in Drosophila that cause bizarre disturbances of the 
organization of the adult fly. In the Antennapedia mutant, for example, legs 
sprout from the head in place of antennae (Figure 22-42), while in the Bithorax 
mutant, portions of an extra pair of wings appear where normally there should 
be the much smaller appendages called halteres. These mutations transform 
parts of the body into structures appropriate to other positions and are called 
homeotic. A whole set of homeotic selector genes determines the anteroposte- 
rior character of the segments of the fly. 

The genes of this set — eight of them in the fly — are all related to one another 
as members of a multigene family, and they all lie in one or the other of two tight 
gene clusters known as the Bithorax complex and the Antennapedia complex. 
The genes in the Bithorax complex control the differences among the abdominal 
and thoracic segments of the body, while those in the Antennapedia complex con- 
trol the differences among thoracic and head segments. Comparisons with other 
species show that the same genes are present in essentially all animals, including 
humans. These comparisons also reveal that the Antennapedia and Bithorax com- 
plexes are the two halves of a single entity, called the Hox complex, that has 
become split in the course of the fly's evolution, and whose members operate in a 
coordinated way to exert their control over the head-to-tail pattern of the body. 




Figure 22-42 A homeotic mutation. The 

fly shown here is an Antennapedia 
mutant. Its antennae are converted into 
leg structures by a mutation in the 
regulatory region of the Antennapedia 
gene that causes it to be expressed in the 
head. Compare with the normal fly 
shown in Figure 22-24. (Courtesy of 
Matthew Scott.) 



Homeotic Selector Genes Code for DNA-Binding Proteins That 
Interact with Other Gene Regulatory Proteins 

To a first approximation each homeotic selector gene is normally expressed in 
just those regions that develop abnormally when the gene is mutated or absent. 
The products of these genes can thus be viewed as molecular address labels pos- 
sessed by the cells of each parasegment: they are the physical embodiment of 
the cells' positional value. If the address labels are changed, the parasegment 
behaves as though it were located somewhere else, and deletion of the entire 
complex results in a larva whose body segments are all alike (Figure 22-43). 

A first problem, therefore, is to understand how the homeotic selector gene 
products act on the basic segment-patterning machinery to give each paraseg- 
ment its individuality. The products of the homeotic selector genes are gene 
regulatory proteins, all related to one another by the possession of a highly con- 
served DNA-binding homeodomain (60 amino acids long), discussed in Chap- 
ter 7. The corresponding segment in the DNA sequence is called a homeobox 
from which, by abbreviation, the Hox complex takes its name. 
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Figure 22-43 The effect of deleting most of the genes of the Bithorax 
complex. (A) A normal Drosophila larva shown in dark-field illumination; 
(B) the mutant larva with the Bithorax complex largely deleted. In the 
mutant the parasegments posterior to P5 all have the appearance of P5. 
(From G. Struhl, Nature 293:36-41 , 1 981 . With permission from Macmillan 
Publishers Ltd.) 

If the products of the homeotic selector genes are similar in their DNA-bind- 
ing regions, how do they exert different effects so as to make one parasegment 
different from the next? The answer seems to lie largely in the parts of the pro- 
teins that do not bind directly to DNA but interact with other proteins in DNA- 
bound complexes. The different partners in these complexes act together with 
the homeotic selector proteins to dictate which DNA-binding sites will be rec- 
ognized and whether the effect on transcription at those sites will be activation 
or repression. In this way, the products of the homeotic selector genes combine 
with other gene regulatory proteins and modulate their actions so as to give 
each parasegment its characteristic features. 



The Homeotic Selector Genes Are Expressed Sequentially 
According to Their Order in the Hox Complex 

To understand how the Hox complex provides cells with positional values, we 
also need to consider how the expression of the Hox genes themselves is regu- 
lated. The coding sequences of the eight homeotic selector genes in the Anten- 
napedia and bithorax complexes are interspersed amid a much larger quan- 
tity—a total of about 650,000 nucleotide pairs— of regulatory DNA. This DNA 
includes binding sites for the products of egg-polarity and segmentation genes. 
The regulatory DNA in the Hox complex acts as an interpreter of the multiple 
items of positional information supplied to it by all these gene regulatory pro- 
teins. In response, a particular set of homeotic selector genes is transcribed, 
appropriate to the location. 

In the pattern of control there is a remarkable regularity. The sequence in 
which the genes are ordered along the chromosome, in both the Antennapedia 
and the Bithorax complexes, corresponds almost exactly to the order in which 
they are expressed along the axis of the body (Figure 22-44). This suggests that 
the genes are activated serially by some process that is graded — in duration or 
intensity — along the axis of the body, and whose action spreads gradually along 
the chromosome. The most "posterior" of the genes expressed in a cell generally 
dominates, driving down expression of the previously activated "anterior" genes 
and dictating the character of the segment. The gene regulatory mechanisms 
underlying these phenomena are still not well understood, but their conse- 
quences are profound. We shall see that the serial organization of gene expres- 
sion in the Hox complex is a fundamental feature that has been highly conserved 
in the course of evolution. 
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Figure 22-44 The patterns of 
expression compared to the 
chromosomal locations of the 
genes of the Hox complex. The 

diagram shows the sequence of 
genes in each of the two 
subdivisions of the chromosomal 
complex. This corresponds, with 
minor deviations, to the spatial 
sequence in which the genes are 
expressed, shown in the photograph 
of an embryo at the extended germ 
band stage, about 5 hours after 
fertilization. The embryo has been 
stained by in situ hybridization with 
differently labeled probes to detect 
the mRNA products of different Hox 
genes in different colors. 
(Photograph courtesy of William 
McGinnis, adapted from D. Kosman 
et al., Science 305:846, 2004. With 
permission from AAAS.) 



1 344 Chapter 22: Development of Multicellular Organisms 



There are hundreds of other homeob ox- containing genes in the genome of 
the fly — and of other animal species — but most of them are scattered and not 
clustered in complexes such as the Hox complex. They have many different gene 
regulatory functions, but a substantial proportion of them have roles akin to that 
of the Hox genes: they control the variations on a basic developmental theme. 
Different classes of neurons, for example, are often distinguished from one 
another by expression of specific genes of this large superfamily 



The Hox Complex Carries a Permanent Record of Positional 
Information 

The spatial pattern of expression of the genes in the Hox complex is set up by sig- 
nals acting early in development, but the consequences are long-lasting. 
Although the pattern of expression undergoes complex adjustments as develop- 
ment proceeds, the Hox complex behaves in each cell as though stamped with a 
permanent record of the anteroposterior position that the cell occupied in the 
early embryo. In this way, the cells of each segment are equipped with a long- 
term memory of their location along the anteroposterior axis of the body — in 
other words, with an anteroposterior positional value. As we shall see in the next 
section, the memory trace imprinted on the Hox complex governs the segment- 
specific identity not only of the larval segments, but also of the structures of the 
adult fly, which are generated at a much later stage from the larval imaginal discs 
and other nests of imaginal precursor cells in the larva. 

The molecular mechanism of the cell memory for this positional informa- 
tion relies on two types of regulatory inputs. One is from the homeotic selector 
genes themselves: many of the Hox proteins autoactivate the transcription of 
their own genes. Another crucial input is from two large complementary sets of 
proteins that control chromatin structure, called the Polycomb group and the 
Trithor ax group. If these regulators are defective, the pattern of expression of the 
homeotic selector genes is set up correctly at first but is not correctly maintained 
as the embryo grows older. 

The two sets of regulators act in opposite ways. Trithorax group proteins are 
needed to maintain the transcription of Hox genes in cells where transcription 
has already been switched on. In contrast, Polycomb group proteins form stable 
complexes that bind to the chromatin of the Hox complex and maintain the 
repressed state in cells where Hox genes have not been activated at the critical 
time (Figure 22-45). The developmental memory involves specific covalent 
modifications of histones in nucleosomes in the neighborhood of the Hox genes, 
leading to changes in the state of the chromatin that can be perpetuated from 
one cell generation to the next, as discussed in Chapters 4 and 7. 



The Anteroposterior Axis Is Controlled by Hox Selector Genes in 
Vertebrates Also 

Homologs of the Drosophila homeotic selector genes have been found in almost 
every animal species studied, from cnidarians (hydroids) and nematodes to 
mollusks and mammals. Remarkably, these genes are often grouped in com- 
plexes similar to the insect Hox complex. In the mouse there are four such com- 
plexes — called the HoxA, HoxB, HoxC, and HoxD complexes — each on a different 
chromosome. Individual genes in each complex can be recognized by their 
sequences as counterparts of specific members of the Drosophila set. Indeed, 
mammalian Hox genes can function in Drosophila as partial replacements for 
the corresponding Drosophila Hox genes. It appears that each of the four mam- 
malian Hox complexes is, roughly speaking, the equivalent of a complete insect 
complex (that is, an Antennapedia complex plus a Bithorax complex) (Figure 
22-46). 

The ordering of the genes within each vertebrate Hox complex is essen- 
tially the same as in the insect Hox complex, suggesting that all four vertebrate 
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Figure 22-45 Action of genes of the 
Polycomb group. (A) Photograph of a 
mutant embryo defective for the gene 
Extra sex combs (Esc) and derived from a 
mother also lacking this gene. The gene 
belongs to the Polycomb group. 
Essentially all segments have been 
transformed to resemble the most 
posterior abdominal segment (compare 
with Figure 22-43). In the mutant the 
pattern of expression of the homeotic 
selector genes, which is roughly normal 
initially, is unstable in such a way that all 
these genes soon become switched on all 
along the body axis. (B) The normal 
pattern of binding of Polycomb protein to 
Drosophila giant chromosomes, visualized 
with an antibody against Polycomb. The 
protein is bound to the Antennapedia 
complex (ANT-C) and the Bithorax 
complex (BX-C) as well as about 60 other 
sites. (A, from G. Struhl, Nature 293:36-41, 
1 981 . With permission from Macmillan 
Publishers Ltd. B, courtesy of B. Zink and 
R. Paro, Trends Genet 6:416-421, 1990. 
With permission from Elsevier.) 



100 urn 



complexes originated by duplications of a single primordial complex and have 
preserved its basic organization. Most tellingly, when the expression patterns of 
the Hox genes are examined in the vertebrate embryo by in situ hybridization, it 
turns out that the members of each complex are expressed in a head-to-tail 
series along the axis of the body, just as they are in Drosophila (Figure 22-47). 
The pattern is most clearly seen in the neural tube, but is also visible in other tis- 
sues, especially the mesoderm. With minor exceptions this anatomical ordering 
matches the chromosomal ordering of the genes in each complex, and corre- 
sponding genes in the four different Hox complexes have almost identical 
anteroposterior domains of expression. 

The gene expression domains define a detailed system of correspondences 
between insect body regions and vertebrate body regions (see Figure 22-46) . The 
parasegments of the fly correspond to a similarly labeled series of segments in 
the anterior part of the vertebrate embryo. These are most clearly demarcated in 
the hindbrain (see Figures 22-46 and 22-47), where they are called rhom- 
bomeres. In the tissues lateral to the hindbrain the segmentation is seen in the 
series of branchial arches, prominent in all vertebrate embryos — the precursors 
of the system of gills in fish and of the jaws and structures of the neck in mam- 
mals; each pair of rhombomeres in the hindbrain corresponds to one branchial 
arch. In the hindbrain, as in Drosophila, the boundaries of the expression 
domains of many of the Hox genes are aligned with the boundaries of the 
anatomical segments. 

The products of the mammalian Hox genes appear to specify positional val- 
ues that control the anteroposterior pattern of parts in the hindbrain, neck, and 
trunk (as well as some other parts of the body). As in Drosophila, when a poste- 
rior Hox gene is artificially expressed in an anterior region, it can convert the 
anterior tissue to a posterior character. Conversely, loss of posterior Hox genes 
allows the posterior tissue where they are normally expressed to adopt an ante- 
rior character (Figure 22-48). The transformations observed in mouse Hox 
mutants are not always so straightforward and are often incomplete, because of 
a redundancy between genes in the four Hox gene clusters. But it seems clear 
that the fly and the mouse use essentially the same molecular machinery to give 
individual characters to successive regions along at least a part of their antero- 
posterior axis. 
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Figure 22-46 The Hox complex of an 
insect and the Hox complexes of a 
mammal compared and related to body 
regions. The genes of the Antennapedia 
and Bithorax complexes of Drosophila are 
shown in their chromosomal order in the 
top line; the corresponding genes of the 
four mammalian Hox complexes are 
shown below, also in chromosomal order. 
The gene expression domains in fly and 
mammal are indicated in a simplified 
form by color in the cartoons of animals 
above and below. However, the details of 
the patterns depend on developmental 
stage and vary somewhat from one 
mammalian Hox complex to another. 
Also, in many cases, genes shown here as 
expressed in an anterior domain are also 
expressed more posteriorly, overlapping 
the domains of more posterior Hox genes 
(see, for example, Figure 22-47). The 
complexes are thought to have evolved 
as follows: first, in some common 
ancestor of worms, flies, and vertebrates, 
a single primordial homeotic selector 
gene underwent repeated duplication to 
form a series of such genes in tandem — 
the ancestral Hox complex. In the 
Drosophila sublineage this single 
complex became split into separate 
Antennapedia and Bithorax complexes. 
Meanwhile, in the lineage leading to the 
mammals the whole complex was 
repeatedly duplicated to give four Hox 
complexes. The parallelism is not perfect 
because apparently some individual 
genes have been duplicated, others lost, 
and still others co-opted for different 
purposes (genes in parentheses in the 
top line) since the complexes diverged. 
(Based on a diagram courtesy of William 
McGinnis.) 
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Figure 22-47 Expression domains of Hox genes in a mouse. The photographs show whole embryos displaying the expression domains of two 
genes of the HoxB complex (blue stain). These domains can be revealed by in situ hybridization or, as in these examples, by constructing 
transgenic mice containing the control sequence of a Hox gene coupled to a LacZ reporter gene, whose product is detected histochemically. 
Each gene is expressed in a long expanse of tissue with a sharply defined anterior limit. The earlier the position of the gene in its chromosomal 
complex, the more anterior the anatomical limit of its expression. Thus, with minor exceptions, the anatomical domains of the successive genes 
form a nested set, ordered according to the ordering of the genes in the chromosomal complex. (Courtesy of Robb Krumlauf.) 
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Summary 

The complexity of the adult body of an animal is built up by modulated repetition of 
a few basic types of structure. Thus, superimposed on the pattern of gene expression 
that repeats itself in every segment, there is a serial pattern of expression ofhomeotic 
selector genes that confer on each segment a different identity. The homeotic selector 
genes code for DNA-binding proteins of the homeodomain family. They are grouped in 
the Drosophila genome in two clusters, called the Antennapedia and bithorax com- 
plexes, believed to be the two parts of a single primordial Hox complex that became 
split during evolution of the fly. In each complex, the genes are arranged in a sequence 
that matches their sequence of expression along the axis of the body. Hox gene expres- 
sion is initiated in the embryo. It is maintained subsequently by the action of DNA- 
binding proteins of the Poly comb and Trithor ax group, which stamp the chromatin of 
the Hox complex with a heritable record of its embryonic state of activation. Hox com- 
plexes homologous to that o/Drosophila are found in virtually every type of animal 
that has been examined, from cnidarians to humans, and they appear to have an evo- 
lutionary conserved role in patterning the anteroposterior axis of the body. Mammals 
have four Hox complexes, each showing a similar relationship between a serial 
arrangement of the genes in the chromosome and their serial pattern of expression 
along the body axis. 



ORGANOGENESIS AND THE PATTERNING OF 
APPENDAGES 

We have seen that the segments of the insect larva are all variations on the same 
basic theme, with segmentation genes defining the basic repetitive module and 
homeotic selector genes giving each segment its individual character. The same 
applies to the major appendages of the adult insect body — legs, wings, anten- 
nae, mouthparts and external genitalia: they too are variations on a common 
basic theme. At a finer level of detail, we encounter the same wonderful simpli- 
fication: the appendages — and many other parts of the body — consist of sub- 
structures that are themselves variations on a small number of basic evolution- 
arily conserved themes. 

In this section we follow the course of development in Drosophila through 
to its end, narrowing our focus at each step to examine one example of the many 
related structures that are developing in parallel. As we go along, we shall point 



Figure 22-48 Control of anteroposterior 
pattern by Hox genes in the mouse. 

(A,B) A normal mouse has about 
65 vertebrae, differing in structure 
according to their position along the 
body axis: 7 cervical (neck), 1 3 thoracic 
(with ribs), 6 lumbar (bracketed by yellow 
asterisks in (B)), 4 sacral (bracketed by red 
asterisks in (B)), and about 35 caudal 
(tail). (A) shows a side view; (B) shows a 
dorsal view; for clarity, the limbs have 
been removed in each picture. (C) The 
HoxAW gene is normally expressed in the 
lumbar region (together with its paralogs 
HoxCIO and HoxDIO); here it has been 
artificially expressed in the developing 
vertebral tissue all along the body axis. 
As a result, the cervical and thoracic 
vertebrae are all converted to a lumbar 
character. (D) Conversely, when HoxAW is 
knocked out along with HoxCIO and 
HoxDIO, vertebrae that should normally 
have a lumbar or sacral character take on 
a thoracic character instead. (A and C, 
from M. Carapu^o et a I., Genes Dev. 
1 9:21 1 6-21 21 , 2005. With permission 
from Cold Spring Harbor Laboratory 
Press; B and D, from D.M. Wellik and 
M.R. Capecchi, Science 301 :363-367, 
2003. With permission from AAAS.) 
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out parallels with vertebrate structures that develop similarly, using not only the 
same general strategies but many of the same specific molecular mechanisms. 
But to avoid interrupting the narrative later, we must first briefly explain some 
key experimental methods, required to cope with a special problem that arises 
when we try to discover how genes control the later stages of development. 



Conditional and Induced Somatic Mutations Make it Possible to 
Analyze Gene Functions Late in Development 

As emphasized earlier, the same gene may be used repeatedly in many different 
situations — in different regions of the body, and at different times. Often, loss- 
of- function mutations disrupt early development so severely that the embryo or 
larva dies, depriving us of the opportunity to see how the mutation would affect 
later processes. 

One way around this problem is to study conditional mutations. If we have, 
for example, a temperature-sensitive mutation in the gene of interest, we can 
maintain the animal during early development at a low temperature, where the 
gene product functions normally, and then disable the gene product whenever 
we please by raising the temperature to discover the late functions. 

Other methods involve actually modifying the DNA in subsets of cells at late 
stages of development — a sort of genetic surgery on individual cells that allows 
mutant groups of cells of a specified genotype to be generated at a chosen time 
in development. This remarkable feat can be achieved by induced somatic 
recombination, and the resulting organism is called a genetic mosaic. By means 
of genetic mosaics, we can not only bypass the problem of lethality when a gene 
function is perturbed in the organism as a whole; we can also explore the func- 
tion of the gene in cell-cell interactions, by juxtaposing mutant and nonmutant 
cells. We can test, for example, whether cells use the gene product to send a sig- 
nal to neighbors, or to receive a signal from them, or neither. And by inducing 
the genetic change at different times, we can find out precisely when the gene 
acts to produce a particular effect. 

A current technique for induced somatic recombination uses transgenic 
flies that have been bred to contain two types of yeast- derived genetic elements: 
the FLP site-specific recombinase gene, and the FLP Recombinase Target (FRT) 
sequence. Typically, the animal is homozygous for an insertion of the FRT 
sequence close to the centromere on a chosen chromosome arm, while a con- 
struct consisting of the Flp gene under a heat-shock promoter is inserted else- 
where in the genome. If such a transgenic embryo or larva is given a heat shock 
(that is, exposed to a high temperature for a few minutes), expression of Flp is 
induced, and this enzyme catalyzes crossing-over and recombination between 
the maternal and paternal chromosomes at the FRT site. If the heat shock is 
adjusted to be sufficiently mild, this event will occur in only one or a few cells, 
scattered at random. As explained in Figure 22-49, if the animal is also het- 
erozygous for a gene of interest in the crossed-over chromosomal region, the 
process can result in a pair of daughter cells that are homozygous, the one 
receiving two copies of the maternal allele of the gene, the other receiving two 
copies of the paternal allele. Each of these daughter cells will normally grow and 
divide to give clonal patches of homozygous progeny. 

The occurrence of the crossover can be detected if the animal is chosen to 
be also heterozygous for a genetic marker that lies on the same chromosome 
arm as the gene of interest and so undergoes crossing over in company with it. 
In this way clearly marked homozygous mutant clones of cells can be created to 
order. Either FLP and FRT, or the analogous Cre and Lox pair of recombination 
elements, can also be used in other configurations to switch expression of a gene 
on or off (see Figure 5-79). With these techniques, one can discover what hap- 
pens, for example, when cells are caused to produce a particular signal molecule 
at an abnormal site, or are deprived of a particular receptor. 

Instead of using a heat-shock promoter to drive expression of the FLP 
recombinase, one can use a copy of the regulatory sequence of a gene in the fly's 
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Figure 22-49 Creation of mutant cells 
by induced somatic recombination. The 

diagrams follow the fate of a single pair 
of homologous chromosomes, one from 
the father (shaded), the other from the 
mother (unshaded). These chromosomes 
have an Frt element (green) inserted close 
to their centromere, and contain a locus 
for a gene of interest — gene X — farther 
out along the same chromosome arm. 
The paternal chromosome (in this 
example) carries the wild-type allele of 
gene X (open red box) while the maternal 
chromosome carries a recessive mutant 
allele (filled red box). Recombination by 
exchange of DNA between the maternal 
and paternal chromosomes, catalyzed by 
the FLP recombinase, can give rise to a 
pair of daughter cells, one containing two 
wild-type copies of gene X, the other 
containing two mutant copies. To help 
identify the cells where recombination 
has occurred, the maternal and paternal 
chromosomes can be chosen to carry 
different genetic markers (not shown 
here), capable of generating a visible 
product, and positioned on the 
chromosome so that recombination 
involving the marker locus — resulting in 
a visible alteration in the appearance of 
the cells — can be taken as a sure sign 
that gene X has also undergone 
recombination. 



normal genome that is expressed at some interesting time and place. The 
recombination event will then be triggered, and mutant cells created, at just the 
sites where that gene is normally expressed. A variant of this technique uses 
transcriptional regulation machinery borrowed from yeast, rather than genetic 
recombination machinery, to switch expression of a chosen fly gene reversibly 
on or off according to the normal pattern of expression of some other chosen fly 
gene (Figure 22-50). 

By switching gene functions off or on at specific times and places in these 
ways, developmental biologists can set about deciphering the system of geneti- 
cally specified signals and responses that control the patterning of any organ of 
the body. 



Body Parts of the Adult Fly Develop From Imaginal Discs 

The external structures of the adult fly are formed largely from rudiments called 
imaginal discs — groups of cells that are set aside, apparently undifferentiated, 
in each segment of the larva. The discs are pouches of epithelium, shaped like 
crumpled and flattened balloons, and continuous with the epidermis (the sur- 
face layer) of the larva. There are 19 of them, arranged as 9 pairs on either side of 
the larva plus 1 disc in the midline (Figure 22-51). They grow and develop their 
internal pattern as the larva grows, until finally, at metamorphosis, they evert 
(turn inside out), extend, and differentiate overtly to form the epidermal layer of 
the adult. The eyes and antennae develop from one pair of discs, the wings and 
part of the thorax from another, the first pair of legs from another, and so on. 
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Figure 22-50 The Gal4/Uas technique for controlled gene misexpression in Drosophila. The method allows one to drive 
expression of a chosen gene G at the places and times where some other Drosophila gene H is normally expressed. 
(A) A transgenic animal is created, with two separate constructs inserted in its genome. One insert consists of a yeast- 
specific regulatory sequence, called the Uas (upstream activating sequence) element, coupled to a copy of the coding 
sequence of gene G. The other insert contains the coding sequence of the yeast Gal4 gene, whose product is a yeast- 
specific gene regulatory protein that binds to the Uas element; this Gal4 insert is placed next to, and controlled by, the 
regulatory region of gene H. Wherever gene H is normally expressed, Gal4 protein is also made and drives transcription of 
gene G. (B) Although one can achieve the same result by linking a copy of the H regulatory sequence directly to the 
G coding sequence, the Gal4/Uas approach allows a strategy that is more efficient in the long run. Two separate "libraries" 
of transgenic flies are constructed, one containing Gal4 inserts driven by a variety of regulatory sequences of different 
genes A, B, C, etc., the other containing Uas inserts driving a variety of different coding sequences X, Y, Z, etc. By mating a 
fly from one library with a fly from the other, any desired coding sequence can be functionally coupled to any desired 
regulatory sequence. To generate the library of flies with Gal4 insertions at useful sites, flies are first produced with Gal4 
insertions at random locations in their genome. These are then mated with flies containing a Uas element linked to a 
reporter gene with an easily detectable product. Expression of the reporter reveals whether Gal4 has been inserted at a site 
that brings its expression under the control of an interesting enhancer; flies showing interesting reporter patterns are kept 
and studied. This is called the enhancer trap technique, because it provides a way to hunt out and characterize interesting 
regulatory sequences in the genome. 
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Figure 22-51 The imaginal discs in the 
Drosophila larva and the adult 
structures they give rise to. (After 
J.W. Fristrom et al., in Problems in 
Biology: RNA in Development 
[E.W. Hanley, ed.], p. 382. Salt Lake City: 
University of Utah Press, 1 969.) 
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Homeotic Selector Genes Are Essential for the Memory of 
Positional Information in Imaginal Disc Cells 

The cells of one imaginal disc look like those of another, but grafting experi- 
ments show that they are in fact already regionally determined and nonequiva- 
lent. If one imaginal disc is transplanted into the position of another in the larva 
and the larva is then left to go through metamorphosis, the grafted disc is found 
to differentiate autonomously into the structure appropriate to its origin: a wing 
disc will give wing structures, a haltere disc, haltere structures, regardless of its 
new site. This shows that the imaginal disc cells are governed by a memory of 
their original position. By a more complex serial grafting procedure that lets the 
imaginal disc cells proliferate for an extended period before differentiating, it 
can be shown that this cell memory is stably heritable (with rare lapses) through 
an indefinitely large number of cell generations. 

The homeotic selector genes are essential components of the memory 
mechanism. If, at any stage in the long period leading up to differentiation at 
metamorphosis, both copies of a homeotic selector gene are eliminated by 
induced somatic recombination from a clone of imaginal disc cells that would 
normally express that gene, those cells will differentiate into incorrect struc- 
tures, as though they belonged to a different segment of the body. These and 
other observations indicate that each cell's memory of positional information 
depends on the continued activity of the homeotic selector genes. This memory, 
furthermore, is expressed in a cell- autonomous fashion — each cell appears to 
maintain its state individually, depending on its own history and genome. 
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Specific Regulatory Genes Define the Cells That Will Form an 
Appendage 

We must now examine how an appendage develops its internal pattern. We shall 
take the insect wing as our example. 

The process begins with the early patterning mechanisms we have already 
discussed. The anteroposterior and dorsoventral systems of signals in the early 
embryo in effect mark out an orthogonal grid in the blastoderm, in the form of 
dorsoventral, anteroposterior, and periodically spaced segmental gene expres- 
sion boundaries. At certain points of intersection of these boundaries, the 
combination of genes expressed is such as to switch a cluster of cells into the 
imaginal disc pathway. 

In molecular terms this corresponds to switching on expression of imaginal - 
disc-defining regulatory genes. In most of the discs, the gene Distal-less is 
switched on. This codes for a gene regulatory protein that is essential for the sus- 
tained growth required to create an elongated appendage such as a leg or an 
antenna with a proximodistal axis. In its absence, such appendages fail to form, 
and when it is artificially expressed at abnormal sites, misplaced appendages 
can be produced. Distal-less is expressed in a similar fashion in the developing 
limbs and other appendages of most species of invertebrates and vertebrates 
that have been examined (Figure 22-52). For the eye disc, another gene, Eyeless 



Figure 22-52 Expression of Distal-less in 
developing legs and related appendages 
of various species. (A) A sea-urchin larva. 
(B) A moth larva. (A, from G. Panganiban 
et al., Proc. Natl Acad. Sci. U.S.A. 
94:5162-5166, 1997. With permission from 
National Academy of Sciences; B, from 
G. Panganiban, L. Nagy and S.B. Carroll, 
Curr. Biol. 4:671-675, 1 994. With 
permission from Elsevier.) 
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Figure 22-53 Gene expression domains in the wing imaginal disc, 
defining quadrants of the future wing. The wing blade itself derives from 
the oval-shaped domain toward the right, and it is divided into four 
quadrants by the expression of Apterous and Engrailed, as shown. 



(together with two closely related genes), performs the corresponding role; it too 
has homologues with homologous functions — the Pax6 genes that drive eye 
development in other species, as discussed in Chapter 7. 



The Insect Wing Disc Is Divided into Compartments 

From the outset, the cluster of cells forming the imaginal disc has the rudiments 
of an internal pattern, inherited from the earlier patterning process. For exam- 
ple, the cells in the posterior half of the wing-disc rudiment (and of most of the 
other imaginal-disc rudiments) express the segment-polarity gene Engrailed, 
while those in the anterior half do not. The initial asymmetries lay the founda- 
tions for a subsequent more detailed patterning, just as in the egg and early 
embryo. 

The sectors of the wing disc defined by these early differences of gene 
expression correspond to specific parts of the future wing. The posterior, 
Engrailed- expressing region will form the posterior half of the wing, while the 
region that does not express Engrailed will form the anterior half. Meanwhile, 
the dorsal part of the wing disc expresses a gene called Apterous, while the ven- 
tral half does not. At metamorphosis, the disc folds along the line separating 
these domains to give a wing whose dorsal sheet of cells is derived from the 
Apterous- expressing region and whose ventral sheet is derived from the region 
that does not express Apterous. The wing margin, where these two epithelial 
sheets are joined, corresponds to the boundary of the Apterous expression 
domain in the disc (Figure 22-53). 

The cells of the disc, having switched on expression of the genes that mark 
them as anterior or posterior, dorsal or ventral, retain this specification as the 
disc grows and develops. Because the cells are sensitive to these differences and 
selective in their choice of neighbors, sharply defined boundaries are formed 
between the four resultant sets of cells, with no mixing at the interfaces. The four 
corresponding quadrants of the disc are called compartments, because there is 
no exchange of cells between them (Figure 22-54). 
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Figure 22-54 Compartments in the adult wing. (A) The shapes of marked 
clones in the Drosophila\N\r\g reveal the existence of a compartment 
boundary. The border of each marked clone is straight where it abuts the 
boundary. Even when a marked clone has been genetically altered so that it 
grows more rapidly than the rest of the wing and is therefore very large, it 
respects the boundary in the same way (drawing on right). Note that the 
compartment boundary does not coincide with the central wing vein. 
(B) The pattern of expression of the Engrailed gene in the wing, revealed by 
the same technique as for the adult fly shown in Figure 22-41 . The 
compartment boundary coincides with the boundary of Engrailed gene 
expression. (A, after F.H.C. Crick and P.A. Lawrence, Science 1 89:340-347, 1 975. 
With permission from AAAS; B, courtesy of Chihiro Hama and Tom Kornberg.) 




500|j,m 



ORGANOGENESIS AND THE PATTERNING OF APPENDAGES 



1353 



Four Familiar Signaling Pathways Combine to Pattern the Wing 
Disc: Wingless, Hedgehog, Dpp, and Notch 

Along each of the compartment boundaries — the anteroposterior boundary 
defined by Engrailed and the dorsoventral boundary defined by Apterous — cells 
in different states confront one another and interact to create narrow bands of 
specialized cells. These boundary cells produce new signals to organize the sub- 
sequent growth and more detailed patterning of the appendage. 

Cells in the posterior wing compartment express the Hedgehog signal pro- 
tein, but cannot respond to it. Cells in the anterior compartment can respond to 
Hedgehog. Because Hedgehog acts only over a short distance, the signal recep- 
tion pathway is activated only in the narrow band of cells just anterior to the 
compartment boundary, where anterior and posterior cells are juxtaposed. 
These boundary cells respond by switching on expression of another signal 
molecule, Dpp — the same protein that we encountered previously, in the 
dorsoventral patterning of the early embryo (Figure 22-55). Dpp acts in its new 
context in much the same way as before: it spreads its effects outward from the 
boundary cells (by diffusion, via cytonemes, or through transfer from cell to cell 
by exocytosis or endocytosis), setting up a morphogen gradient to control the 
subsequent detailed pattern of growth and gene expression. 

Analogous events occur at the dorsoventral compartment boundary (see 
Figure 22-55). Here, at the future wing margin, short-range communication 
mediated by the Notch pathway creates a band of boundary cells that produce 
another morphogen, the Wingless protein — the same signaling factor, belonging 
to the Wnt family, that acted earlier in the anteroposterior patterning of each 
embryonic segment. The Dpp and Wingless gradients, together with other sig- 
nals and with the asymmetries of gene expression that we have discussed, com- 
bine to drive expression of other genes at precisely defined locations within each 
compartment. 

The Size of Each Compartment Is Regulated by Interactions 
Among Its Cells 

One of the most mysterious and ill-understood aspects of animal development 
is the control of growth: why does each part of the body grow to a precisely 
defined size? This problem is exemplified in remarkable way in the imaginal 
discs of Drosophila. By induced somatic recombination, one can, for example, 
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Figure 22-55 Morphogenetic signals 
created at compartment boundaries in 
the wing imaginal disc. (A) Creation of 
the Dpp signaling region at the 
anteroposterior compartment boundary 
through a Hedgehog-mediated 
interaction between the anterior and 
posterior cells. In an analogous way, a 
Notch-mediated interaction between 
dorsal and ventral cells creates a Wingless 
(Wnt) signaling region along the 
dorsoventral boundary. (B) The observed 
expression patterns of Dpp and Wingless. 
Although it seems clear that Dpp and 
Wingless act as morphogens, it is not yet 
certain how they spread out from their 
source. Cells in the imaginal disc have 
been seen to send out long cytonemes 
that may allow them to sense signals at a 
distance. Thus, the receiving cell may send 
its sensors to the source of the signal, 
instead of the signal moving to the 
receiving cell. (B, photographs courtesy of 
Sean Carroll and Scott Weatherbee, from 
S.J. Day and P.A. Lawrence, Development 
1 27:2977-2987, 2000. With permission 
from The Company of Biologists.) 





Wingless 



(B) 



100|iim 



1 354 Chapter 22: Development of Multicellular Organisms 





intercalation 




Figure 22-56 Intercalary regeneration. 

When mismatched portions of the 
growing cockroach leg are grafted 
together, new tissue (green) is 
intercalated (by cell proliferation) to fill in 
the gap in the pattern of leg structures, 
restoring a leg segment of normal size 
and pattern. 



create a clonal patch of cells that proliferate more rapidly than the rest of the 
cells in the developing organ. The clone may grow to occupy almost the whole of 
the compartment in which it lies, and yet it does not overstep the boundary of 
the compartment. Astonishingly, its rapid growth has almost no effect on the 
compartment's final size, its shape, or even the details of its internal pattern (see 
Figure 22-54). Somehow, the cells within the compartment interact with one 
another to determine when their growth should stop, and each compartment 
behaves as a regulatory unit in this respect. 

A first question is whether the size of the compartment is regulated so as to 
contain a set number of cells. Mutations in components of the cell-cycle control 
machinery can be used to speed up or slow down the rate of cell division with- 
out altering the rate of cell or tissue growth. This results in abnormally large 
numbers of abnormally small cells, or the converse, but the size — that is, the 
area — of the compartment is practically unchanged. Thus, the regulatory mech- 
anism seems to depend on signals that indicate the physical distance between 
one part of the compartment and another, and on cellular responses that some- 
how read these signals so as to halt growth only when the spacing between the 
parts has reached its proper value. 

This type of growth regulation is strikingly displayed in the intercalary 
regeneration that occurs when separate parts of a Drosophila imaginal disc or 
of a growing cockroach leg are surgically grafted together. After the graft, the 
cells in the neighborhood of the junction proliferate and fill in the parts of the 
pattern that should normally lie between them, continuing their growth until 
the normal spacing between landmarks is restored (Figure 22-56). The mecha- 
nisms that bring this about are a mystery, but it seems likely that they are simi- 
lar to the mechanisms that regulate growth during normal development. 

What mechanism could ensure that each little piece of the pattern within a 
compartment grows to its appropriate size, despite local disturbances in growth 
rate or starting conditions? The morphogen gradients (of Dpp and Wingless, for 
example) create a pattern by imposing different characters on cells in different 
positions. Could it be that the cells in each region can somehow sense how close 
the spacing of the pattern is — how steep the gradient of change in cell charac- 
ter — and continue their growth until the tissue is spread out to the right degree? 

This idea has been tested by creating clones of cells in the wing disc in which 
downstream components of the Dpp signaling pathway are misexpressed so as 
to drive the level of pathway activation either higher or lower than in the neigh- 
boring cells. From the point of view of the cells, conditions at the boundary of 
the mutant clone are then equivalent to those produced by a very steep gradient 
of Dpp. The result is that cells in this neighborhood are stimulated to divide at 
an increased rate. Conversely, if the level of Dpp signaling is made uniform in 
the middle region of the developing wing disc, where it would normally be 
steeply graded, cell division there is inhibited. It seems that the steepness of the 
gradient does indeed control the rate of proliferation. But if that is so, how do 
cells sense the steepness of the gradient? 

The answer is unknown, but there are strong hints that the mechanism 
depends on signals generated at cell-cell junctions, where cells with different 
levels of pathway activation make contact. As discussed in Chapter 19, muta- 
tions in junctional components such as the scaffold protein Discs-large (Dig) or 
the cadherin superfamily member Fat can cause a dramatic failure of growth 
control, allowing the wing disc to grow far beyond its normal proper size. In the 
case of Fat, a set of other molecules, including protein kinases called Hippo and 
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Warts, have been identified as components of a signaling pathway that leads 
from Fat at the cell membrane to the control of gene expression in the nucleus. 
The products of the target genes include the cell-cycle regulator cyclin E and an 
inhibitor of apoptosis, as well as a microRNA, Bantam, that seems to be an 
essential part of the growth control mechanism. Despite these tantalizing facts, 
the mechanisms controlling organ size are still mysterious. If we can discover 
how they work in Drosophila, we may get some insight into the problem of the 
control of organ size in vertebrates, where our current perplexity on this funda- 
mental question is even more profound. For in other aspects of organ develop- 
ment, as we now discuss, flies and vertebrates are unexpectedly similar at a 
molecular level, suggesting that their mechanisms of growth control may be 
similar also. 



Similar Mechanisms Pattern the Limbs of Vertebrates 



The limbs of vertebrates seem very different from those of insects. The insect 
wing, for example, consists mainly of two elaborately patterned sheets of epithe- 
lium, with very little tissue in between. In contrast, a limb of a vertebrate con- 
sists of an elaborately patterned system of muscles, bones and other connective 
tissues inside a thin and much more simply structured covering of epidermis. 
Moreover, the evolutionary evidence suggests that the last common ancestor of 
insects and vertebrates may have had neither legs, nor arms, nor wings, nor fins 
and that we have evolved these various appendages independently. And yet, 
when we examine the molecular mechanisms that control vertebrate limb 
development, we find a surprising number of similarities with the limbs of 
insects. We have already mentioned some of these resemblances, but there are 
many others: almost all the molecules we have already mentioned in the fly wing 
have their counterparts in the vertebrate limb, although these are expressed in 
different spatial relationships. 

The parallels have been most thoroughly studied in the chick embryo. As we 
saw earlier, each leg or wing of a chick originates from a tongue-shaped limb 
bud, consisting of a mass of embryonic connective tissue cells, called mes- 
enchyme cells, encased in a jacket of epithelium. In this structure, one finds 
expression of homologs of almost all the genes that we have mentioned in our 
account of Drosophila wing patterning, including Distal-less, Wingless, Notch, 
Engrailed, Dpp, and Hedgehog, mostly performing functions that seem more or 
less similar to their functions in the Drosophila wing disc (Figure 22-57). 

The Hox genes likewise make an appearance in the limbs of both insects and 
vertebrates. In the insect appendage, the anterior and posterior compartments are 
distinguished by expression of different genes of the Hox complex — a result of the 
serial expression pattern of these genes along the anteroposterior axis of the body 
as a whole. In the vertebrate limb, genes of two of the vertebrate Hox complexes 



Figure 22-57 Molecules that control 
patterning in a vertebrate limb bud. 

(A) A wing bud of a chick embryo at 4 
days of incubation. The scanning electron 
micrograph shows a dorsal view, with 
somites (the segments of the trunk of the 
embryo) visible to the left. At the distal 
margin of the limb bud a thickened ridge 
can just be seen — the apical ectodermal 
ridge. (B) Expression patterns of key 
signaling proteins and gene regulatory 
factors in the chick limb bud. The 
patterns are depicted schematically in 
two imaginary planes of section through 
the limb bud, one (horizontal) to show 
the dorsoventral system and the other 
(vertical) to show the anteroposterior and 
proximodistal systems. Sonic hedgehog, 
Bmp2, and Lmxl are expressed in the 
mesodermal core of the limb bud; the 
other molecules in the diagram are 
expressed in its epithelial covering. 
Almost all the molecules shown have 
homologs that are involved in patterning 
the Drosophila wing disc. (A, courtesy of 
Paul Martin.) 
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Figure 22-58 The basic structure of a 
mechanosensory bristle. The lineage of 
the four cells of the bristle — all 
descendants of a single sensory mother 
cell — is shown on the left. 



{HoxA and HoxD) are expressed in a regular pattern, obedient to the usual rules 
of serial expression of genes in these complexes. They help, in conjunction with 
other factors such as the Tbx proteins mentioned earlier (see Figure 22-9), to reg- 
ulate differences of cell behavior along the proximodistal limb axis. 

According to one view, these molecular resemblances between developing 
limbs in different phyla reflect descent from a common ancestor that, while 
lacking limbs, had appendages of some sort built on similar principles — anten- 
nae, perhaps, or protruding mouthparts for snatching food. Modern limblike 
appendages, from the wings and legs of the fly to the arms and legs of a human, 
would then have evolved through activation of the genes for appendage forma- 
tion at new sites in the body, as a result of changes in gene regulation. 



Localized Expression of Specific Classes of Gene Regulatory 
Proteins Foreshadows Cell Differentiation 

We now pick up again the thread of development in the Drosophila imaginal disc 
and follow it through to the final step at which cells become terminally differen- 
tiated. Narrowing our focus further, we take as our example the differentiation of 
just one type of small structure that arises in the imaginal disc epithelium: the 
sensory bristle. 

The bristles that cover the body surface of an insect are miniature sense 
organs. Some respond to chemical stimuli, others to mechanical stimuli, but 
they are all constructed in a similar way. The structure is seen at its simplest in 
the mechanosensory bristles. Each of these consists of four cells: a shaft cell, a 
socket cell, a neural sheath cell, and a neuron (Figure 22-58). Movement of the 
shaft of the bristle excites the neuron, which sends a signal to the central ner- 
vous system. 

The cells of the bristle of the adult fly derive from the imaginal disc epithe- 
lium, and all four of them are granddaughters or great-granddaughters (see Fig- 
ure 22-58) of a single sensory mother cell that becomes distinct from the neigh- 
boring prospective epidermal cells during the last larval instar (Figure 22-59). (A 
fifth descendant dies, or in some tissues becomes a glial cell.) To account for the 
pattern of bristle differentiation, we have to explain first how the genesis of sen- 
sory mother cells is controlled and then how the five descendants of each such 
cell become different from one another. 

Two genes, called Achaete and Scute, are crucial in initiating the formation 
of bristles in the imaginal disc epithelium. These genes have similar and over- 
lapping functions and code for closely related gene regulatory proteins of the 
basic helix-loop-helix class (discussed in Chapter 7). As a result of disc-pattern- 
ing mechanisms of the type we have already discussed, Achaete and Scute are 
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Figure 22-59 Sensory mother cells in 
the wing imaginal disc. The sensory 
mother cells [blue here) are easily 
revealed in this special strain of 
Drosophila, which contains an artificial 
LacZ reporter gene that, by chance, has 
inserted itself in the genome next to a 
control region that causes it to be 
expressed selectively in sensory mother 
cells. The purple stain shows the 
expression pattern of the Scute gene; this 
foreshadows the production of sensory 
mother cells and fades as the sensory 
mother cells successively develop. (From 
P. Cubas et al., Genes Dev. 5:996-1 008, 
1 991 . With permission from Cold Spring 
Harbor Laboratory Press.) 
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expressed in the imaginal disc in the regions within which bristles will form. 
Mutations that eliminate the expression of these genes at some of their usual 
sites block development of bristles at just those sites, and mutations that cause 
expression in additional, abnormal sites cause bristles to develop there. But 
expression of Achaete and Scute is transient, and only a minority of the cells ini- 
tially expressing the genes go on to become sensory mother cells; the others 
become ordinary epidermis. The state that is specified by expression of Achaete 
and Scute is called proneural, and Achaete and Scute are called proneural genes. 
The proneural cells are primed to take the neurosensory pathway of differentia- 
tion, but, as we shall see, which of them will actually do so depends on compet- 
itive interactions among them. 



Lateral Inhibition Singles Out Sensory Mother Cells Within 
Proneural Clusters 

Cells expressing the proneural genes occur in groups in the imaginal disc epithe- 
lium — a small, isolated cluster of fewer than 30 cells for a big isolated bristle, a 
broad, continuous patch of hundreds or thousands of cells for a field of small 
bristles. In the former case just one member of the cluster becomes a sensory 
mother cell; in the latter case many cells scattered throughout the proneural 
region do so. In either case, each sensory mother cell becomes surrounded by 
cells that switch off expression of the proneural genes and become condemned 
to differentiate as epidermis instead. Experiments with genetic mosaics show 
that this is because a cell that becomes committed to the sensory-mother-cell 
pathway of differentiation sends a signal to its neighbors not to do the same 
thing: it exerts a lateral inhibition. If a cell that would normally become a sen- 
sory mother is genetically disabled from doing so, a neighboring proneural cell, 
freed from lateral inhibition, will become a sensory mother cell instead. 

The lateral inhibition is mediated by the Notch signaling pathway. The cells 
in the cluster initially all express both the transmembrane receptor Notch and its 
transmembrane ligand Delta. Wherever Delta activates Notch, an inhibitory sig- 
nal is sent into the Notch- expressing cell; consequently, all the cells in the clus- 
ter initially inhibit one another. However, receipt of the signal in a given cell is 
thought to diminish not only that cell's tendency to specialize as a sensory 
mother cell but also its ability to fight back by delivering the inhibitory Delta sig- 
nal in return. This creates a competitive situation, from which a single cell in 
each small region — the future sensory mother cell — emerges as winner, sending 
a strong inhibitory signal to its immediate neighbors but receiving no such sig- 
nal in return (Figure 22-60). The consequences of a failure of this regulatory 
mechanism are shown in Figure 22-61. 



Lateral Inhibition Drives the Progeny of the Sensory Mother Cell 
Toward Different Final Fates 

The same lateral inhibition mechanism dependent on Notch operates repeat- 
edly in the formation of bristles — not only to force the neighbors of sensory 
mother cells to follow a different pathway and become epidermal, and but also 
later to make the daughters, the granddaughters, and finally the great-grand- 
daughters of the sensory mother cell express different genes so as to form the 
different components of the bristle. At each stage, lateral inhibition mediates a 
competitive interaction that forces adjacent cells to behave in contrasting ways. 
Using a temperature -sensitive Notch mutation, it is possible to switch off Notch 
signaling after the sensory mother cell has been singled out but before it has 
divided. The progeny then differentiate alike, giving a cluster of neurons in place 
of the four different cell types of a bristle. 

Like many other competitions, those mediated by lateral inhibition are often 
rigged: one cell starts with an advantage that guarantees it will be the winner. In 
the development of the different cell types of the sensory bristle, a strong initial 
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bias is provided by an asymmetry in each of the cell divisions of the sensory 
mother cell and its progeny A protein called Numb (together with certain other 
proteins) becomes localized at one end of the dividing cell, so that one daughter 
inherits the Numb protein and the other does not (Figure 22-62). Numb blocks 
the activity of Notch. Thus, the Numb -containing cell is deaf to inhibitory sig- 
nals from its neighbors while its sister remains sensitive. Since both cells initially 
express the Notch ligand Delta, the cell that has inherited Numb proceeds to 
become neural, while driving its sister toward a nonneural fate. 



Figure 22-60 Lateral inhibition. (A) The 

basic mechanism of Notch-mediated 
competitive lateral inhibition, illustrated 
for just two interacting cells. In this 
diagram, the absence of color on proteins 
or effector lines indicates inactivity. 
(B) The outcome of the same process 
operating in a larger patch of cells. At 
first, all cells in the patch are equivalent, 
expressing both the transmembrane 
receptor Notch and its transmembrane 
ligand Delta. Each cell has a tendency to 
specialize (as a sensory mother cell), and 
each sends an inhibitory signal to its 
neighbors to discourage them from also 
specializing in that way. This creates a 
competitive situation. As soon as an 
individual cell gains any advantage in the 
competition, that advantage becomes 
magnified. The winning cell, as it 
becomes more strongly committed to 
differentiating as a sensory mother, also 
inhibits its neighbors more strongly. 
Conversely, as these neighbors lose their 
capacity to differentiate as sensory 
mothers they also lose their capacity to 
inhibit other cells from doing so. Lateral 
inhibition thus makes adjacent cells 
follow different fates. Although the 
interaction is thought to be normally 
dependent on cell-cell contacts, the 
future sensory mother cell may be able to 
deliver an inhibitory signal to cells that 
are more than one cell diameter away — 
for example, by sending out long 
protrusions to touch them. 



Planar Polarity of Asymmetric Divisions is Controlled by Signaling 
via the Receptor Frizzled 

For the Numb mechanism to operate, there must be machinery in the dividing 
cell to segregate the determinant to one side of the cell before division. In addi- 
tion, as the cell enters mitosis the mitotic spindle must be aligned with this asym- 
metry so that the determinant is allocated to just one daughter cell, and not 
shared out to both daughters at the time of cell division. In the above case, the 
sensory mother cell, at its first division, regularly divides to give an anterior cell 
that inherits Numb and a posterior cell that does not. As discussed in Chapter 19, 
this type of polarity in the plane of the epithelium is called planar polarity (in 

Figure 22-61 The result of switching off lateral inhibition during the 
singling-out of sensory mother cells. The photograph shows part of the 
thorax of a fly containing a mutant patch in which the neurogenic gene 
Delta has been partially inactivated. The reduction of lateral inhibition has 
caused almost all the cells in the mutant patch (in the center of the picture) 
to develop as sensory mother cells, producing a great excess of sensory 
bristles there. Mutant patches of cells carrying more extreme mutations in 
the Notch pathway, causing a total loss of lateral inhibition, form no visible 
bristles because all of the progeny of the sensory mother cells develop as 
neurons or glial cells instead of diversifying to form both neurons and the 
external parts of the bristle structure. (Courtesy of P. Heitzler and 
P. Simpson, Cell 64:1 083-1 093, 1 991 . With permission from Elsevier.) 
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contradistinction to apico-basal polarity, where the cellular asymmetry is per- 
pendicular to the plane of the epithelium). It is manifested in the uniformly 
backward-pointing orientation of the bristles, giving the fly its wind-swept 
appearance (Figure 22-63). 

The planar polarity in the initial division of the sensory mother cell is con- 
trolled by a signaling pathway similar to the one that we encountered control- 
ling asymmetric divisions in the nematode (see Figure 22-21), depending on the 
receptor Frizzled. Frizzled proteins have been discussed in Chapter 15 as recep- 
tors for Wnt proteins, but in the control of planar polarity — in flies and probably 
in vertebrates too — this pathway functions in a special way: the intracellular 
relay mechanism exerts its main effects on the actin cytoskeleton, rather than on 
gene expression. The intracellular protein Dishevelled, downstream from Friz- 
zled, is common to the gene-regulatory and the actin- regulatory branches of the 
signaling pathway. Separate domains of the Dishevelled molecule can be shown 
to be responsible for the two functions (Figure 22-64). Frizzled and Dishevelled 
both take their names from the unkempt look of flies where bristle polarity is 
disordered (see Figure 19-32). 



Figure 22-62 Numb biases lateral 
inhibition during bristle development. At 

each division of the progeny of the sensory 
mother cell, Numb protein is asymmetrically 
localized, producing daughter cells that 
differ. Note that some of the divisions are 
oriented with the mitotic spindle in the 
plane of the epithelium, others at right 
angles to it; the localization of Numb is 
controlled in different ways at these 
different types of division but plays a critical 
role at each of them in deciding cell fate. 
(Based on data from M. Gho, Y. Bellaiche and 
F. Schweisguth, Development 
126:3573-3584, 1999. With permission from 
The Company of Biologists.) 



Asymmetric Stem-Cell Divisions Generate Additional Neurons in 
the Central Nervous System 

The mechanisms we have described for controlling the genesis of neurons of 
sensory bristles operate also, with variations, in the genesis of virtually all other 
neurons — not only in insects, but also in other phyla. Thus in the embryonic 
central nervous system, both in flies and in vertebrates, neurons are generated 
from regions of expression of proneural genes akin to Achaete and Scute. The 
nascent neurons express Delta and inhibit their immediate neighbors, which 
express Notch, from becoming committed to neural differentiation at the same 
time. When Notch signaling is blocked, inhibition fails, and in the proneural 
regions neurons are generated in huge excess at the expense of non-neuronal 
cells (Figure 22-65). 

In the central nervous system, however, an additional mechanism comes 
into play to help generate the very large numbers of neurons and glial cells that 
are needed: a special class of cells become committed as neural precursors, but 
instead of differentiating directly as neurons or glial cells, these undergo a long 
series of asymmetric divisions through which a succession of additional neu- 
rons and glial cells are added to the population. The mechanism is best under- 
stood in Drosophila, although there are many hints that something similar 
occurs also in vertebrate neurogenesis. 



Figure 22-63 Planar cell polarity manifest in bristle polarity on a fly's 
back: the bristles all point backwards. (Scanning electron micrograph 
courtesy of S. Oldham and E. Hafen, from E. Spana and N. Perrimon, Trends 
Genet. 15:301-302, 1999. With permission from Elsevier.) 
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Figure 22-64 The control of planar cell 

planar cell polarity polarity. (A) The two branches of the 

Wnt/Frizzled signaling pathway. The main 
branch, discussed in Chapter 15, controls 
gene expression via p-catenin; the 
planar-polarity branch controls the actin 
cytoskeleton via Rho GTPases. Different 
domains of the Dishevelled protein are 
responsible for the two effects. It is not 
yet clear which member of the Wnt signal 
protein family, if any, is responsible for 
activating the planar polarity function of 
Frizzled in Drosophila. (B) Cartoon of cells 
displaying planar polarity. In at least 
some systems, planar cell polarity is 
associated with asymmetric localization 
of the receptor Frizzled itself to one side 
of each cell. (See also Chapter 19, 
Figure 19-32.) 
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In the embryonic central nervous system of Drosophila, the nerve-cell pre- 
cursors, or neuroblasts, are initially singled out from the neurogenic ectoderm 
by a typical lateral -inhibition mechanism that depends on Notch. Each neuro- 
blast then divides repeatedly in an asymmetric fashion (Figure 22-66A). At each 
division, one daughter remains as a neuroblast, while the other, which is much 
smaller, becomes specialized as a ganglion mother cell, or GMC. The ganglion 
mother cell will divide only once, giving a pair of neurons, or a neuron plus a 
glial cell, or a pair of glial cells. The neuroblast becomes smaller at each division, 
as it parcels out its substance into one ganglion mother cell after another. Even- 
tually, typically after about 12 cycles, the process halts, presumably because the 
neuroblast becomes too small to pass the cell-size checkpoint in the cell division 
cycle. Later, in the larva, neuroblast divisions resume, and now they are accom- 
panied by cell growth, permitting the process to continue indefinitely, generat- 
ing the much larger numbers of neurons and glial cells required in the adult fly. 

The larval neuroblasts, therefore, are stem cells: while not terminally differ- 
entiated themselves, they behave as a self-renewing and potentially inex- 
haustible source of terminally differentiated cells. In Chapter 23, where we dis- 
cuss stem cells in detail, we shall see that stem cells do not necessarily have to 
divide asymmetrically; but asymmetric division is one possible strategy, and the 
neuroblasts of the fly provide a beautiful example. 
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Figure 22-65 Effects of blocking Notch 
signaling in a Xenopus embryo. In the 

experiment shown, mRNA coding for a 
truncated form of the Notch ligand Delta 
is injected, together with LacZ mRNA as a 
marker, into one cell of an embryo at the 
two-cell stage. The truncated Delta 
protein produced from the mRNA blocks 
Notch signaling in the cells descended 
from the cell that received the injection. 
These cells lie on the left side of the 
embryo and are identifiable because they 
contain LacZ protein {blue stain) as well 
as the truncated Delta protein. The right 
side of the embryo is unaffected and 
serves as a control. The embryo is fixed 
and stained at a stage when the central 
nervous system has not yet rolled up to 
form a neural tube, but is still a more or 
less flat plate of cells — the neural plate — 
exposed on the surface of the embryo. 
The first neurons (stained purple in the 
photograph) have already begun to 
differentiate in elongated bands 
(proneural regions) on each side of the 
midline. On the control (right) side, they 
are a scattered subset of the proneural 
cell population. On the Notch-blocked 
(left) side, virtually all the cells in the 
proneural regions have differentiated as 
neurons, creating a densely stained band 
of neurons without intervening cells. 
Injections of mRNA coding for normal, 
functional Delta have an opposite effect, 
reducing the number of cells that 
differentiate as neurons. (Photograph 
from A. Chitnis et al., Nature 375:761-766, 
1995. With permission from Macmillan 
Publishers Ltd.) 
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Asymmetric Neuroblast Divisions Segregate an Inhibitor of Cell 
Division into Just One of the Daughter Cells 

The divisions of the neuroblast are asymmetric in three respects: (1) physically, 
in that one daughter is smaller than the other; (2) biochemically, in factors con- 
trolling differentiation; and (3) biochemically, in factors controlling prolifera- 
tion. These asymmetries must all be coordinated with one another and with the 
orientation of the axis of the mitotic spindle, if the cleavage plane is to cut the 
cell into the correct parts. How is this achieved? 

The neuroblast has an apico-basal asymmetry that reflects its origin from 
the ectoderm, which, like other epithelia, has a well defined apico-basal polar- 
ity. As we saw in Chapter 19, apico-basal polarity is governed by a complex of 
three proteins — Par3 (also called Bazooka in Drosophila), Par6, and aPKC (atyp- 
ical protein kinase C) — that become localized in the cortex toward the apical end 
of the cell. This localized Par3/Par6/aPKC complex is thought to be the primary 
source of asymmetry in the neuroblast. By recruiting other components, some 
of which exert feedback effects to maintain the localization of the complex, it 
coordinates the whole process of unequal division. 

The Par3/Par6/aPKC complex defines the orientation of the mitotic spindle 
and the unequal partitioning of the cell at cytokinesis through interaction with 
adapter proteins called Inscuteable and Partner of Inscuteable (Pins). These in 
turn recruit the a subunit of a trimeric G protein (discussed in Chapter 15), 
which functions in this context as an intracellular messenger to guide organiza- 
tion of the cytoskeleton. 

At the same time, the Par3/Par6/aPKC complex locally phosphorylates a reg- 
ulator of intracellular architecture called Lgl (Lethal giant larvae) and thereby 
directs another adaptor protein called Miranda to become concentrated in the 
cortex at the opposite (basal) end of the cell (Figure 22-66B). Miranda binds pro- 
teins that control differentiation and proliferation, localizing them to the same 
site. When the neuroblast divides, Miranda and its cargo are segregated into the 
ganglion mother cell. One of the molecules thus carried into the ganglion 
mother cell is a gene regulatory protein called Prospero, which directs differen- 
tiation. Another is a posttranscriptional repressor called Brat (Brain Tumor) . Brat 
acts as an inhibitor of cell proliferation, apparently by preventing production of 
the growth-promoting protein Myc, famous for its role in cancer (discussed in 
Chapter 20) . In mutants where Brat is defective, or where it fails to become local- 
ized correctly, the smaller daughter cell of the asymmetric neuroblast division 
frequently fails to differentiate as a ganglion mother cell, and instead grows and 
divides as a neuroblast. The result is a brain tumor — a mass of neuroblasts that 
grows exponentially and without limit, until the fly is dead. 

Whether vertebrates tissues have stem cells that behave like the fly's neu- 
roblasts is a question of great current interest, especially in relation to cancer. 



Figure 22-66 Neuroblasts and 
asymmetric cell division in the central 
nervous system of a fly embryo. (A) The 

neuroblast originates as a specialized 
ectodermal cell. It is singled out by lateral 
inhibition and emerges from the basal 
(internal) face of the ectoderm. It then 
goes through repeated division cycles, 
dividing asymmetrically to generate a 
series of ganglion mother cells. Each 
ganglion mother cell divides just once to 
give a pair of differentiated daughters 
(typically a neuron plus a glial cell). 
(B) The asymmetric distribution of cell 
fate determinants in an isolated 
neuroblast as it goes through mitosis. The 
mitotic chromosomes are stained blue. 
The Par3/Par6/aPKC complex, shown by 
blue immunostaining for aPKC, is 
concentrated in the apical cortex, and 
causes Miranda {green), Brat {red, giving 
yellow where Brat and Miranda overlap) 
and Prospero (not stained) to become 
localized in the basal cortex. As the cell 
divides, these latter three molecules 
become segregated into the ganglion 
mother cell, forcing it to differentiate and 
leaving the neuroblast free to regenerate 
its asymmetry and divide again in the 
same way. (B, from C.Y. Lee et al., 
Dev. Cell 1 0:441 -449, 2006. With 
permission from Elsevier.) 
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Notch Signaling Regulates the Fine-Grained Pattern of 
Differentiated Cell Types in Many Different Tissues 

Each daughter of a normal ganglion mother cell can become either a neuron or 
a glial cell. This final choice, like the choice of cell fate for the progeny of a sen- 
sory mother cell in the peripheral nervous system, is controlled by Notch sig- 
naling and lateral inhibition. In fact, lateral inhibition mediated by Notch is cru- 
cial for cell diversification and fine-grained patterning in an enormous variety of 
different tissues. In the fly it controls the production not only of neurons but 
also of many other differentiated cell types — for example, in muscle, in the lin- 
ing of the gut, in the excretory system, in the tracheae, and in the eye and other 
sense organs. In vertebrates, homologs of Notch and its ligands are expressed in 
the corresponding tissues and have similar functions: mutations in the Notch 
pathway upset the balance not only of neurons and non-neuronal cells in the 
central nervous system, but also of the different specialized cell types in the lin- 
ing of the gut, of endocrine and exocrine cells in the pancreas, and of sensory 
and supporting cells in sense organs such as the ear, to give only a few examples. 
In all these tissues, a balanced mixture of different cell types is required. 
Notch signaling provides the means to generate the mixture, by enabling indi- 
vidual cells expressing one set of genes to direct their immediate neighbors to 
express another set. 



Some Key Regulatory Genes Define a Cell Type; Others Can 
Activate the Program for Creation of an Entire Organ 

As we mentioned at the beginning of this chapter, there are some genes whose 
products act as triggers for the development of a specific organ, initiating and 
coordinating the whole complex program of gene expression required for this. 
Thus, for example, when the Eyeless gene is artificially expressed in a patch of 
cells in the leg imaginal disc, a patch of well- organized eye tissue, with all its var- 
ious cell types correctly arranged, will develop on the leg (see Figure 22-2). In a 
somewhat similar way, much later, when a cell makes its final choice of a partic- 
ular mode of differentiation in the aftermath of the interactions mediated by 
Notch, it has to follow a complex program involving expression of a whole col- 
lection of genes, and this differentiation program is initiated and coordinated by 
a much smaller set of high-level regulators. Such regulators are sometimes 
called "master regulatory proteins" (though even they can exert their specific 
effect only in combination with the right partners, in a cell that is adequately 
primed). 

An example is the MyoD/myogenin family of gene regulatory proteins. 
These proteins drive cells to differentiate as muscle, expressing muscle-specific 
actins and myosins and all the other cytoskeletal, metabolic and membrane pro- 
teins that a muscle cell needs (see Figure 7-75). The gene regulatory proteins 
that define particular cell types often belong (as do MyoD and its relatives) to the 
basic helix-loop-helix family, encoded by genes homologous to, and in some 
cases apparently identical to, the proneural genes that we have already men- 
tioned. Their expression is often governed by the Notch pathway via compli- 
cated feedback loops. 

Terminal cell differentiation has brought us to the end of our sketch of how 
genes control the making of a fly. Our account has necessarily been simplified. 
Many more genes than we have mentioned are involved in each of the develop- 
mental processes that we have described. Feedback loops, alternative mecha- 
nisms operating in parallel, genetic redundancy, and other phenomena compli- 
cate the full picture. Despite all this, the overriding message of developmental 
genetics is one of an unexpected simplicity. A limited number of genes and 
mechanisms, used repeatedly in different circumstances and combinations, are 
responsible for controlling the main features of the development of all multicel- 
lular animals. 

We next turn to an essential aspect of animal development that we have so 
far neglected: cell movements. 
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Summary 

The external parts of an adult fly develop from epithelial structures called imaginal discs. 
Each imaginal disc is divided at the outset into a small number of domains expressing 
different gene regulatory proteins as a result of early embryonic patterning processes. 
These domains are called compartments, because their cells do not mix. At the compart- 
ment boundaries, cells expressing different genes confront one another and interact, 
inducing localized production ofmorphogens that govern the further growth and inter- 
nal patterning of each compartment. Thus, in the wing disc, dorsal and ventral cells 
interact by the Notch signaling mechanism to create a source of Wingless (Wnt) protein 
along the dorsoventral compartment boundary, while anterior and posterior cells inter- 
act through short-range Hedgehog signaling to create a source ofDpp protein (a TGFfi 
family member) along the anteroposterior compartment boundary. All these signaling 
molecules have homologs that play similar parts in limb patterning in vertebrates. 

Each compartment of an imaginal disc, and each substructure within it, grows to 
a precisely predictable size, even in the face of seemingly drastic disturbances, such as 
mutations that alter the cell division rate. Although the morphogen gradients in the 
disc are clearly involved, the critical regulatory mechanisms that control organ size are 
not understood. 

Within each compartment, the morphogen gradients control the sites of expres- 
sion of further sets of genes, defining patches of cells that interact with one another yet 
again to create the finest details of the ultimate pattern of cell differentiation. Thus, 
proneural gene expression defines the sites where sensory bristles will form, and Notch- 
mediated interactions among the cells of the proneural cluster, together with asym- 
metric cell divisions, force the individual cells of the bristle to follow different paths of 
terminal differentiation. In the central nervous system, neuroblasts are singled out 
from the ectoderm by lateral inhibition in a similar way, but then go through a long fertilized egg 

series of asymmetrical divisions as stem cells to generate neurons and glia. Faults in 
the asymmetric distribution of the molecules that control differentiation and prolifer- 
ation can convert the neuroblast stem cells into tumor cells. 

Many of the same mechanisms are thought to operate in vertebrate tissues also. 




CELL MOVEMENTS AND THE SHAPING OF THE 
VERTEBRATE BODY 

Most cells of the animal body are motile, and in the developing embryo their 
movements are often extensive, dramatic, and surprising. <GAGC> Controlled 
changes of gene expression create ordered arrays of cells in different states; cell 
movements rearrange these cellular building blocks and put them in their 
proper places. The genes that the cells express determine how they move; in this 
sense, the control of gene expression is the primary phenomenon. But the cell 
movements are also crucial, and no less in need of explanation if we want to 
understand how the architecture of the body is created. In this section, we exam- 
ine this topic in the context of vertebrate development. We take as our main 
example the frog Xenopus laevis (Figure 22-67), where cell movements have 
been well studied, though we shall also draw on evidence from chick, zebrafish, 
and mouse. 

Figure 22-67 Synopsis of the development of Xenopus laevis from newly fertilized egg to 
feeding tadpole. The adult frog is shown in the photograph at the top. The developmental 
stages are viewed from the side, except for the 1 0-hour and 1 9-hour embryos, which are viewed 
from below and from above, respectively. All stages except the adult are shown at the same 
scale. (Photograph courtesy of Jonathan Slack; drawings after P.D. Nieuwkoop and J. Faber, 
Normal Table of Xenopus laevis [Daudin]. Amsterdam: North-Holland, 1956.) 
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The Polarity of the Amphibian Embryo Depends on the Polarity 
of the Egg 

The Xenopus egg is a large cell, just over a millimeter in diameter (Figure 
22-68A). The light-colored lower end of the egg is called the vegetal pole; the 
dark-colored upper end is called the animal pole. The animal and vegetal hemi- 
spheres contain different selections of mRNA molecules and other cell compo- 
nents, which become allocated to separate cells as the egg cell divides after fer- 
tilization. Near the vegetal pole, for example, there is an accumulation of mRNAs 
coding for the gene regulatory protein VegT (a DNA-binding protein of the T-box 
family) and for signal proteins of the TGF(3 superfamily, as well as some ready- 
made protein components of the Wnt signaling pathway (Figure 22-68B). As a 
result, the cells that inherit vegetal cytoplasm will produce signals to organize 
the behavior of adjacent cells. They are committed to form the gut — the inner- 
most tissue of the body; the cells that inherit animal cytoplasm will form the 
outer tissues. Thus, crudely speaking, the animal-vegetal axis of the egg corre- 
sponds to the external-to-internal (or skin-to-gut) dimension of the future 
organism. 

Fertilization initiates a series of cell divisions and movements that will even- 
tually tuck the vegetal cells and cells from the equatorial (middle) region of the 
animal-vegetal axis into the interior. In the course of these complex movements, 
the three principal axes of the body become established: anteroposterior, from 
head to tail; dorsoventral, from back to belly; and mediolateral, from the midline 
outward to the left or to the right. The orientation of these axes is determined by 
the asymmetries of the early embryo. The unfertilized egg, has only one axis of 
asymmetry — the animal-vegetal — but fertilization triggers an intracellular 
movement that gives the egg an additional asymmetry defining a second axis at 
right angles to this. Following entry of the sperm, the outer, actin-rich cortex of 
the egg cytoplasm rotates relative to the central core of the egg, so that the ani- 
mal pole of the cortex is slightly shifted to one side. Treatments that block the 
rotation allow cleavage to occur normally but produce an embryo with a central 
gut and no dorsal structures or dorsoventral asymmetry. Thus, the cortical rota- 
tion is required to define the dorso-ventral axis of the future body, and the axis 
of asymmetry created in the egg by the rotation is called the dorso-ventral axis 
of the egg. Note, however, that the subsequent cell movements mean that the 
relationship between the egg axes and the future body axes is more complicated 
than this terminology would suggest. The direction of the cortical rotation is 
biased according to the point of sperm entry, perhaps through the centrosome 
that the sperm brings into the egg, and the movement is associated with a reor- 
ganization of microtubules in the egg cytoplasm. This leads to a microtubule- 
based transport of several components, including mRNA coding for Wntll, a 
member of the Wnt family of signal molecules, toward the future dorsal side (see 
Figure 22-68B). This mRNA is soon translated, producing Wntll protein in the 
dorsal vegetal region. The Wntll secreted from cells that form in that region is 
crucial in triggering the cascade of subsequent events that will organise the 
dorsoventral axis of the body. 



Figure 22-68 The Xenopus egg and its 
asymmetries. (A) Side view of an egg 
photographed just before fertilization. 
(B) The asymmetric distribution of 
molecules inside the egg, and how this 
changes following fertilization so as to 
define a dorsoventral as well as an 
animal-vegetal asymmetry. Fertilization, 
through a reorganization of the 
microtubule cytoskeleton, triggers a 
rotation of the egg cortex (a layer a 
few jim deep) through about 30° relative 
to the core of the egg in a direction 
determined by the site of sperm entry. 
Some components are carried still further 
to the future dorsal side by active 
transport along microtubules. The 
resulting dorsal concentration of Wnt 1 7 
mRNA leads to dorsal production of the 
Wnt1 1 signal protein and defines the 
dorsoventral polarity of the future 
embryo. (A, courtesy of Tony Mills.) 
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Cleavage Produces Many Cells from One 

The cortical rotation is completed in about an hour after fertilization and is fol- 
lowed by cleavage, in which the single large egg cell rapidly subdivides by 
repeated mitosis into many smaller cells, or blastomeres, without any change in 
total mass (Figure 22-69). <ATTT> In this way, the determinants distributed 
asymmetrically in the egg become partitioned into separate cells, with different 
fates (Figure 22-70). 

These first cell divisions in Xenopus have a cycle time of about 30 minutes, 
with a direct alternation of S and M phases, as discussed in Chapter 17. The very 
high rate of DNA replication and mitosis seems to preclude almost all gene tran- 
scription (although protein synthesis occurs), and the cleaving embryo is almost 
entirely dependent on reserves of RNA, protein, membrane, and other materials 
that accumulated in the egg while it developed as an oocyte in the mother. After 
about 12 cycles of cleavage (7 hours), the cell division rate slows down, the cell 
cycles begin to follow the standard pattern with Gi and G2 phases intervening 
between the S and M phases, and widespread transcription of the embryo's 
genome begins. This event is called the mid-blastula transition, and it occurs 
with roughly similar timing in most animal species (mammals being an excep- 
tion) . Studies in zebrafish show that the newly synthesized transcripts include 
micro-RNAs that recognize many of the transcripts deposited in the egg by the 
mother and direct their rapid degradation. The midblastula transition thus 
marks the point at which the embryo's own genome largely takes over control of 
development. 



Figure 22-69 The stages of cleavage in 
Xenopus. The cleavage divisions rapidly 
subdivide the egg into many smaller 
cells. All the cells divide synchronously 
for the first 1 2 cleavages, but the 
divisions are asymmetric, so that the 
lower, vegetal cells, encumbered with 
yolk, are fewer and larger. 



Gastrulation Transforms a Hollow Ball of Cells into a 
Three-Layered Structure with a Primitive Gut 

During the period of cleavage, the frog embryo becomes transformed from a 
solid sphere of cells into something more like a hollow ball, with an internal 
fluid-filled cavity surrounded by cells that cohere to form an epithelial sheet. 
The embryo is now termed a blastula (Figure 22-71). 

Soon after this, the coordinated movements of gastrulation begin. <TCCC> 
This dramatic process transforms the simple hollow ball of cells into a multilay- 
ered structure with a central gut tube and bilateral symmetry: by a more elabo- 
rate version of the process outlined earlier for the sea urchin (see Figure 22-3), 
many of the cells on the outside of the embryo are moved inside it. Subsequent 
development depends on the interactions of the inner, outer, and middle layers 
of cells thus formed: the endoderm on the inside, consisting of the cells that have 
moved into the interior to form the primitive gut; the ectoderm on the outside, 
consisting of cells that have remained external; and the mesoderm between 
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Figure 22-70 The origins of the three 
germ layers can be traced back to 
distinct blastomeres of the embryo in 
its early cleavage stages. The endoderm 
derives from the most vegetal 
blastomeres, the ectoderm from the most 
animal, and the mesoderm from a middle 
set that contribute also to endoderm and 
ectoderm. The coloring in each picture is 
the more intense, the higher the 
proportion of cell progeny that will 
contribute to the given germ layer. (After 
L Dale, Cum Biol. 9:R812-R815, 1999. 
With permission from Elsevier.) 
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them, consisting of cells that detach from the epithelium to form a more loosely 
organized embryonic connective tissue (Figure 22-72). From these three germ 
layers, the tissues of the adult vertebrate body will be generated, preserving the 
basic body plan established through gastrulation. 



Figure 22-71 The blastula. In the 

outermost regions of the embryo, tight 
junctions between the blastomeres begin 
to create an epithelial sheet that isolates 
the interior of the embryo from the 
external medium. Na + is pumped across 
this sheet into the spaces in the interior 
of the embryo, and water follows into 
these spaces because of the resulting 
osmotic pressure gradient. As a result, the 
intercellular crevices inside the embryo 
enlarge to form a single cavity, the 
blastocoel. In Xenopus the wall of the 
blastocoel is several cells thick, and only 
the outermost cells are tightly bound 
together as an epithelium. 



The Movements of Gastrulation Are Precisely Predictable 

The pattern of gastrulation movements that creates the germ layers and estab- 
lishes the body axes is described for Xenopus in Figure 22-73. The details are 
complex, but the principles are simple. 

Cells of the future endoderm are folded into the interior, or involuted, in 
succession. The process begins with a downward movement of cells from the 
animal hemisphere to cover and enclose the yolky vegetal hemisphere, which 
represents the food supply of the embryo. Cells that are in the vanguard of this 
movement, at the vegetal margin of the advancing cell sheet, are the first to 
involute, turning inward and then moving up toward the animal pole to form 
the most anterior part of the gut. As they near the animal pole, these leading 
endoderm cells will signal to the overlying ectoderm to define the anterior 
extremity of the head. The mouth will eventually develop as a hole formed at an 
anterior site where endoderm and ectoderm come into direct contact. Mean- 
while, future mesoderm cells, destined to detach from the epithelial sheet to 
form the sandwich filling between endoderm and ectoderm, tuck into the inte- 
rior along with the endoderm cells, and also move up toward the animal pole. 
The cells that are first to involute go to form parts of the head, and those that 
are last form parts of the tail. In this way, the anteroposterior axis of the final 
embryo is laid down sequentially. 

The anteroposterior movements go hand in hand with movements that 
organize the dorsoventral axis of the body. Gastrulation begins on the side of the 
blastula that has been marked out as dorsal by the cortical rotation. Here, invo- 
lution of cells into the interior starts with a short indentation that rapidly 
extends to form the blastopore — a line of invagination that curves around to 
encircle the vegetal pole. The site where the invagination starts defines the dor- 
sal lip of the blastopore. As we shall see, this tissue plays a leading part in subse- 
quent events and gives rise to the central dorsal structures of the main body axis. 
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Figure 22-72 A cross section through 
the trunk of an amphibian embryo after 
the end of gastrulation, showing the 
arrangement of endodermal, 
mesodermal and ectodermal tissues. 
The endoderm will form the epithelial 
lining of the gut, from the mouth to the 
anus. It gives rise not only to the pharynx, 
esophagus, stomach, and intestines, but 
also to many associated glands. The 
salivary glands, the liver, the pancreas, 
the trachea, and the lungs, for example, 
all develop from extensions of the wall of 
the originally simple digestive tract and 
grow to become systems of branching 
tubes that open into the gut or pharynx. 
The endoderm forms only the epithelial 
components of these structures — the 
lining of the gut and the secretory cells of 
the pancreas, for example. The 
supporting muscular and fibrous 
elements arise from the mesoderm. The 
mesoderm gives rise to the connective 
tissues — at first to the loose, space-filling, 
three-dimensional mesh of cells in the 
embryo known as mesenchyme, and 
ultimately to cartilage, bone, and fibrous 
tissue, including the dermis (the inner 
layer of the skin). The mesoderm also 
forms the muscles, the entire vascular 
system — including the heart, the blood 
vessels, and the blood cells — and the 
tubules, ducts, and supporting tissues of 
the kidneys and gonads. The ectoderm 
will form the epidermis (the outer, 
epithelial layer of the skin) and epidermal 
appendages such as hair, sweat glands, 
and mammary glands. It will also give rise 
to the whole of the nervous system, 
central and peripheral, including not only 
neurons and glia but also the sensory 
cells of the nose, the ear, the eye, and 
other sense organs. (After T. Mohun et 
al., Cell 22:9-1 5, 1 980. With permission 
from Elsevier.) 
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Figure 22-73 Gastrulation in Xenopus. <TCCC> (A) The external views (above) show the embryo as a semitransparent 
object, seen from the side; with the directions of cell movement indicated by red arrows, cross sections (below) are cut in 
the median plane (the plane of the dorsal and ventral midlines). Gastrulation begins when a short indentation, the 
beginning of the blastopore, becomes visible in the exterior of the blastula.This indentation gradually extends, curving 
around to form a complete circle surrounding a plug of very yolky cells (destined to be enclosed in the gut and digested). 
Sheets of cells meanwhile turn in around the lip of the blastopore and move deep into the interior of the embryo. At the 
same time the external epithelium in the region of the animal pole actively spreads to take the place of the cell sheets that 
have turned inward. Eventually, the epithelium of the animal hemisphere spreads in this way to cover the whole external 
surface of the embryo, and, as gastrulation reaches completion, the blastopore circle shrinks almost to a point. (B) A fate 
map for the early Xenopus embryo (viewed from the side) as it begins gastrulation, showing the origins of the cells that will 
come to form the three germ layers as a result of the movements of gastrulation. The various parts of the mesoderm 
(lateral plate, somites, and notochord) derive from deep-lying cells that segregate from the epithelium in the cross-hatched 
region. The other cells, including the more superficial cells in the cross-hatched region, will give rise to ectoderm (blue, 
above) or endoderm (yellow, below). Roughly speaking, the first cells to turn into the interior, or involute, will move forward 
inside the embryo to form the most anterior endodermal and mesodermal structures, while the last to involute will form 
the most posterior structures. (C) Cartoon (not to be taken too literally) showing roughly how the different regions of the 
ectoderm map into the body surface of the adult animal. (After R.E. Keller, J. Exp. Zool. 216:81-101, 1981, with permission 
from John Wiley & Sons, Inc. and Dev. Biol. 42:222-241, 1975, with permission from Academic Press.) 



Chemical Signals Trigger the Mechanical Processes 

The VegT, Wntll, and other mRNA molecules localized in the vegetal cytoplasm 
of the egg produce localized distributions of their protein products. These act in 
and on the cells in the lower and middle part of the embryo to give them spe- 
cialized characters and set them moving, both by direct effects and by stimulat- 
ing the production of other secreted signal molecules, in particular proteins of 
the TGF(3 superfamily If these latter signals are blocked, no mesodermal cell 
types are generated and gastrulation is disrupted. The local activation of the Wnt 
signaling pathway on the dorsal side of the embryo (as a result of the earlier cor- 
tical rotation; see Figure 22-68) modifies the action of the other signals so as to 
induce development of the special cells that form the dorsal lip of the blastopore 
(Figure 22-74). 

The dorsal lip of the blastopore plays a central role in gastrulation not just in 
a geometrical sense, but as a powerful new source of control. If the dorsal lip of 
the blastopore is excised from an embryo at the beginning of gastrulation and 
grafted into another embryo but in a different position, the host embryo initiates 
gastrulation both at the site of its own dorsal lip and at the site of the graft. The 
movements of gastrulation at the second site entail the formation of a second 
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whole set of body structures, and a double embryo (Siamese twins) results (see 
Figure 22-6B). 

Evidently, the dorsal lip of the blastopore is the source of a signal (or signals) 
coordinating both the movements of gastrulation and the pattern of specializa- 
tion of the tissues in its neighborhood. Because of this crucial role in organizing 
the formation of the main body axis, the dorsal lip of the blastopore is known as 
the Organizer (or Spemanris Organizer, after its co- discoverer). It is the oldest 
and most famous example of an embryonic signaling center. 



Active Changes of Cell Packing Provide a Driving Force for 
Gastrulation 

The Organizer controls the dorsoventral pattern of cell differentiation in its 
neighborhood by secreting at least six different signal proteins. These act as dif- 
fusible antagonists of the two main types of signals we have already mentioned, 
coming from the more vegetal cells — that is, of Wnt signals and of TGF(3-like sig- 
nals (specifically BMP proteins). These inhibitors released from the Organizer 
may help to limit the size of the Organizer by preventing neighboring cells from 
also adopting an Organizer character. At the same time, they create a gradient of 
signaling activity — a morphogen gradient, whose local value reflects the dis- 
tance from the Organizer (Figure 22-74C). As the cells move during gastrulation, 
they experience different doses of BMP (and other) signals, delivered with dif- 
ferent timing, evoking different cell behaviors and entailing different ultimate 
fates. But how is the pattern of cell movements organized in mechanical terms, 
and what are the forces that bring it about? 

Gastrulation begins with changes in the shape of the cells at the site of the 
blastopore. In the amphibian these are called bottle cells: they have broad 
bodies and narrow necks that anchor them to the surface of the epithelium 
(Figure 22-75), and they may help to force the epithelium to curve and so to 
tuck inward, producing the initial indentation seen from outside. Once this 
first tuck has formed, cells can continue to pass into the interior as a sheet to 
form the gut and mesoderm. The movement seems to be driven mainly by an 
active repacking of the cells, especially those in the involuting regions around 
the Organizer (see Figure 22-75). Here convergent extension occurs. Small 
square fragments of tissue from these regions, isolated in culture, will sponta- 
neously narrow and elongate through a rearrangement of the cells, just as they 
would in the embryo in the process of converging toward the dorsal midline, 
turning inward around the blastopore lip, and then elongating to form the 
main axis of the body. 



Figure 22-74 A current view of the main 
inductive signals organizing the events 
of gastrulation. (A) The distribution of 
axis-determining molecules in the 
blastula results from inheritance of 
different parts of the cytoplasm of the 
fertilized frog egg. The VegT gene 
regulatory protein in the vegetal 
blastomeres is translated from VegT 
mRNA that was localized at the vegetal 
pole before fertilization. The Wnt1 1 
protein on the future dorsal side is 
translated from mRNA localized there as 
a result of the cortical rotation that 
follows fertilization. (B) VegT drives 
expression of Xnr {Xenopus nodal-related) 
proteins and other members of theTGFp 
superfamily, which induce formation of a 
band of mesoderm in the middle part of 
the embryo, while Wnt11 modifies the 
outcome on the dorsal side, collaborating 
with Xnr to induce formation of the 
Organizer. (C) A morphogen gradient that 
organizes the dorsoventral axis is set up 
by a combination of signals, including 
BMP4 (another TGFp superfamily 
member) secreted by the mesoderm, and 
antagonists of the Wnt and BMP 
pathways, secreted by the Organizer cells 
at the dorsal lip of the blastopore. 
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To bring about this remarkable transformation, the individual cells have to 
crawl over one another in a coordinated way (Figure 22-76). The alignment of 
their movements appears to depend on the same machinery we encountered in 
the worm and the fly controlling planar cell polarity: the Frizzled/ Dishevelled 
polarity-signaling pathway When this pathway is blocked — for example, by a 
dominant- negative form of Dishevelled — convergent extension fails to occur. 



Changing Patterns of Cell Adhesion Molecules Force Cells Into 
New Arrangements 

Patterns of gene expression govern embryonic cell movements in many different 
ways. They regulate cell motility, cell shape, and the production of signals for 
guidance. Very importantly, they also determine the sets of adhesion molecules 
that the cells display on their surfaces. Through changes in its surface molecules, 
a cell can break old attachments and make new ones. Cells in one region may 
develop surface properties that make them cohere with one another and 
become segregated from a neighboring group of cells whose surface chemistry 
is different. 

Experiments done half a century ago on early amphibian embryos showed 
that the effects of selective cell-cell adhesion can be so powerful that they can 
bring about an approximate reconstruction of the normal structure of an early 



Figure 22-75 Cell movements in 
gastrulation. A section through a 
gastrulating Xenopus embryo, cut in the 
same plane as in Figure 22-73, indicating 
the four main types of movement that 
gastrulation involves. The animal pole 
epithelium expands by cell 
rearrangement, becoming thinner as it 
spreads. Migration of mesodermal cells 
over a fibronectin-rich matrix lining the 
roof of the blastocoel may help to pull 
the invaginated tissues forward. But the 
main driving force for gastrulation in 
Xenopus is convergent extension in the 
marginal zone. (After R.E. Keller, J. Exp. 
Zoo/. 216:81-101, 1981. With permission 
from Wiley-Liss.) 
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Figure 22-76 Convergent extension and its cellular basis. (A) The pattern of convergent extension in the marginal zone of a gastrula as 
viewed from the dorsal aspect. Blue arrows represent convergence toward the dorsal midline, red arrows represent extension of the 
anteroposterior axis. The simplified diagram does not attempt to show the accompanying movement of involution, whereby the cells are 
tucking into the interior of the embryo. (B) Schematic diagram of the cell behavior that underlies convergent extension. The cells form 
lamellipodia, with which they attempt to crawl over one another. Alignment of the lamellipodial movements along a common axis leads to 
convergent extension. The process depends on the Frizzled/Dishevelled polarity-signaling pathway and is presumably cooperative because 
cells that are already aligned exert forces that tend to align their neighbors in the same way. (B, after J. Shih and R. Keller, Development 
1 1 6:901 -91 4, 1 992. With permission from The Company of Biologists.) 
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Figure 22-77 Sorting out. Cells from different parts of an early amphibian 
embryo will sort out according to their origins. In the classical experiment 
shown here, mesoderm cells (green), neural plate cells (blue), and 
epidermal cells (red) have been disaggregated and then reaggregated in a 
random mixture. They sort out into an arrangement reminiscent of a 
normal embryo, with a "neural tube" internally, epidermis externally, and 
mesoderm in between. (Modified from P.LTownes and J. Holtfreter,/ Exp. 
Zool. 128:53-120, 1955. With permission from Wiley-Liss.) 

postgastrulation embryo even after the cells have been artificially dissociated. 
When these cells are reaggregated into a random mixture, the cells sort out 
spontaneously according to their original characters (Figure 22-77). As dis- 
cussed in Chapter 19, a central role in such phenomena is played by the cad- 
herins — a large and varied family of evolutionarily related Ca 2+ -dependent 
cell-cell adhesion proteins. These and other cell-cell adhesion molecules are 
differentially expressed in the various tissues of the early embryo, and antibod- 
ies against them interfere with the normal selective adhesion between cells of a 
similar type. 

Changes in the patterns of expression of the various cadherins correlate 
closely with the changing patterns of association among cells during gastrula- 
tion, neurulation, and somite formation (see Figure 19-25). These rearrange- 
ments are likely to be regulated and driven in part by the cadherin pattern. In 
particular, cadherins appear to have a major role in controlling the formation 
and dissolution of epithelial sheets and clusters of cells. They not only glue one 
cell to another but also provide anchorage for intracellular actin filaments at the 
sites of cell-cell adhesion. In this way, the pattern of stresses and movements in 
the developing tissue is regulated according to the pattern of adhesions. 



The Notochord Elongates, While the Neural Plate Rolls Up to Form 
the Neural Tube 

Gastrulation is only the first — though perhaps the most dramatic — of a dizzying 
variety of cell movements that shape the parts of the body. We have space to dis- 
cuss only a few of these. 

In the embryo just after gastrulation, the layer of mesoderm is divided into 
separate slabs on the left and right sides of the body. Defining the central body 
axis, and effecting this separation, is the very early specialization of the meso- 
derm known as the notochord. This slender rod of cells, with ectoderm above it, 
endoderm below it, and mesoderm on either side (see Figure 22-72), derives 
from the cells of the Organizer itself. The notochordal cells are characterized by 
expression of a gene regulatory protein called Brachyury (Greek for "short-tail", 
from the mutant phenotype); this belongs to the same T-box family as the VegT 
protein in the vegetal blastomeres. 

As the notochordal cells pass around the dorsal lip of the blastopore and 
move into the interior of the embryo, they form a column of tissue that elongates 
dramatically by convergent extension. The cells of the notochord also become 
swollen with vacuoles, so that the rod elongates still further and stretches out 
the embryo. The notochord is the defining peculiarity of the chordates — the 
phylum to which the vertebrates belong. It is one of the major vertebrate fea- 
tures that do not have any apparent counterpart in Drosophila. In the most 
primitive chordates, which have no vertebrae, the notochord persists as a prim- 
itive substitute for a vertebral column. In vertebrates it serves as a core around 
which other mesodermal cells will eventually gather to form the vertebrae. 

In the overlying sheet of ectoderm, meanwhile, other movements are occur- 
ring to form the rudiments of the nervous system. In a process known as neuru- 
lation, a broad central region of ectoderm, called the neural plate, thickens, rolls 
up into a tube, and pinches off from the rest of the cell sheet. The tube thus cre- 
ated from the ectoderm is called the neural tube; it will form the brain and the 
spinal cord (Figure 22-78). 

The mechanics of neurulation depend on changes of cell packing and cell 
shape that make the epithelium roll up into a tube (Figure 22-79). Signals initially 
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from the Organizer and later from the underlying notochord and mesoderm 
define the extent of the neural plate, induce the movements that make it roll up, 
and help to organize the internal pattern of the neural tube. The notochord in 
particular secretes Sonic hedgehog protein — a homolog of the Drosophila signal 
protein Hedgehog — and this acts as a morphogen to control gene expression in 
the neighboring tissues (Figure 22-80). 



Figure 22-78 Neural tube formation in 
Xenopus. The external views are from the 
dorsal aspect. The cross sections are cut in 
a plane indicated by the broken lines. (After 
T.E. Schroeder, J. Embryol. Exp. Morphol. 
23:427-462, 1970. With permission from 
The Company of Biologists.) 



A Gene-Expression Oscillator Controls Segmentation of the 
Mesoderm Into Somites 

Genetically regulated changes in cell adhesion underlie one of the most striking 
and characteristic processes in vertebrate development — the formation of the 
segments of the body axis. 

On either side of the newly formed neural tube lies a slab of mesoderm (see 
Figure 22-72). To form the repetitive series of vertebrae, ribs, and segmental 
muscles, this slab breaks up into separate blocks, or somites — cohesive groups 
of cells, separated by clefts. Figure 22-81A shows the process as it occurs in the 
chick embryo. The somites form one after another, starting in the head and end- 
ing in the tail. Depending on the species, the final number of somites ranges 
from less than 50 (in a frog or a bird) to more than 300 (in a snake). The poste- 
rior, most immature part of the mesodermal slab, called the presomitic meso- 
derm, supplies the necessary tissue: as it retreats tailward, extending the 
embryo, it deposits a trail of somites. The special character of the presomitic 
mesoderm is maintained by FGF signaling: Fgf8 mRNA is synthesized at the tail 
end of the embryo and slowly degraded as cells move away from this region. 
Translation of the message results in a gradient of secreted FGF8 protein, with its 
high point at the tail end. 

Formation of the cleft between one somite and the next is foreshadowed by 
an alternating spatial pattern of gene expression in the presomitic mesoderm: 
cells about to form the posterior part of a new somite switch on expression of 
one set of genes, while those destined to form the anterior part of the next 
somite switch on expression of another set. Selective cohesion resulting from 
differential gene expression seems to be the underlying cause of the physical 
segmentation observed. 

The problem then is to understand how the repetitive alternating pattern of 
gene expression is set up. Studies done originally in the chick embryo have pro- 
vided the beginnings of an answer. In the posterior part of the presomitic meso- 
derm, expression of certain genes is found to oscillate in time. The first such 
somite oscillator gene to be discovered was Hesl, a homolog of the Drosophila 
pair-rule gene Hairy and of the E(spl) genes that mediate responses to Notch 
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Figure 22-79 The bending of an 
epithelium through cell shape changes 
mediated by microtubules and actin 
filaments. The diagram is based on 
observations of neurulation in newts and 
salamanders, where the epithelium is only 
one cell layer thick. As the apical ends of 
the cells become narrower, their upper- 
surface membrane becomes puckered. 
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signaling. The length of one complete oscillation cycle of this segmentation 
clock (90 minutes in the chick) equals the time taken to lay down one further 
somite. As cells emerge from the presomitic mesoderm to form somites — in 
other words, as they lose exposure to the FGF8 signal — their oscillation slows 
down and finally comes to a halt. Some become arrested in one state, some in 
another, according to the phase of their oscillation cycle at their time of exit from 
the presomitic mesoderm. Hesl and several of the other oscillating genes code 
for gene regulatory proteins; thus, the cells that drop below the critical level of 
FGF8 when they are at the peak of their oscillation cycle switch on one set of reg- 
ulatory genes, while those passing the threshold at the trough of the cycle switch 
on another (Figure 22-81B). In this way, it is thought, the temporal oscillation of 
gene expression in the presomitic mesoderm leaves its trace in a spatially peri- 
odic pattern of gene expression in the maturing mesoderm, and this in turn dic- 
tates how the tissue will break up into physically separate blocks. 



Figure 22-80 A schematic cross section 
of the spinal cord of a chick embryo, 
showing how cells at different levels 
along the dorsoventral axis express 
different gene regulatory proteins. 
(A) Signals that direct the dorsoventral 
pattern: Sonic hedgehog protein from 
the notochord and the floor plate (the 
ventral midline of the neural tube) and 
BMP proteins from the roof plate (the 
dorsal midline) act as morphogens to 
control gene expression. (B) The resulting 
patterns of gene expression in the ventral 
part of the developing spinal cord. 
Different groups of proliferating neural 
progenitor cells (in the ventricular zone, 
close to the lumen of the neural tube) 
and of differentiating neurons (in the 
mantle zone, further out) express 
different combinations of gene 
regulatory proteins. Those indicated in 
this diagram are almost all members of 
the homeodomain superfamily; various 
other genes of the same superfamily 
(including Islet/Lim proteins) are 
expressed in the differentiating neurons. 
Neurons expressing different gene 
regulatory proteins will form connections 
with different partners and may make 
different combinations of 
neurotransmitters and receptors. 
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Figure 22-81 Somite formation in the 
chick embryo. (A) A chick embryo at 
40 hours of incubation. (B) How the 
temporal oscillation of gene expression 
in the presomitic mesoderm becomes 
converted into a spatial alternating 
pattern of gene expression in the formed 
somites. In the posterior part of the 
presomitic mesoderm, each cell oscillates 
with a cycle time of 90 minutes. As cells 
mature and emerge from the presomitic 
region, their oscillation is gradually 
slowed down and finally brought to a 
halt, leaving them in a state that depends 
on the phase of the cycle they happen to 
be in at the critical moment. In this way, a 
temporal oscillation of gene expression 
traces out an alternating spatial pattern. 
(A, from Y.J. Jiang, L. Smithers and 
J. Lewis, Curr. Biol. 8:R868-R871, 1998. 
With permission from Elsevier.) 
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Delayed Negative Feedback May Generate the Oscillations of the 
Segmentation Clock 

What, then, is the mechanism that generates the temporal oscillation? How does 
the clock work? In the mouse, at least three classes of genes have been found to 
show oscillating expression in the presomitic mesoderm, coding respectively for 
components of the Notch pathway, the Wnt pathway, and the Fgf pathway; but 
most of the mutations that are known to break the clock and disrupt somite seg- 
mentation lie in components of the Notch pathway These include genes (such 
as Hesl and more importantly its relative Hes7) that are regulated by Notch and 
code for inhibitory gene regulatory proteins. Some of these proteins act directly 
on the regulatory DNA of their own gene so as to inhibit their own expression. 
According to one theory, this simple negative feedback loop could be the basic 
generator of the oscillations Figure 22-82: when the gene is transcribed, the 
amount of its protein product builds up until transcription is inhibited and syn- 
thesis of the protein ceases; the protein then decays, permitting transcription to 
begin again; and so on. There is a time-lag from the beginning of a new bout of 
transcription to the first appearance in the nucleus of the resulting regulatory 
protein molecules, because it takes time for the RNA polymerase to traverse the 
gene, for the resulting RNA transcript then to mature, leave the nucleus, and 
direct synthesis of a protein molecule, and for the protein then to enter the 
nucleus to control transcription. This delay in the feedback loop is proposed to 
be the main determinant of the period of oscillation of the clock and thus of the 
size of each somite. 

Most of the cells of each newly formed somite will rapidly differentiate to 
form a block of muscle, corresponding to one muscle segment of the main body 
axis. The embryo can (and does) now begin to wriggle. Separate subsets of the 
somite cells will go to form the vertebrae and other connective tissues such as 
dermis. A further subset detach from the somite and migrate away into the lat- 
eral unsegmented mesoderm, crawling through the spaces between other cells: 
these emigrants will give rise to almost all the other skeletal muscle cells in the 
body, including those of the limbs. 



Figure 22-82 Delayed negative 
feedback giving rise to oscillating gene 
expression. (A) A single gene, coding for 
a gene regulatory protein that inhibits its 
own expression, can behave as an 
oscillator. For oscillation to occur, there 
must be a delay (or several delays) in the 
feedback circuit, and the lifetimes of the 
mRNA and protein must be short 
compared with the total delay. The delay 
determines the period of oscillation. 
According to one theory, a feedback 
circuit like this, based on a gene called 
Her7 in the zebrafish, or Hes7 in the 
mouse (a relative of Hesl), is the 
pacemaker of the segmentation clock 
governing somite formation. (B) The 
predicted oscillation of Her7 mRNA and 
protein, computed using rough estimates 
of the feedback circuit parameters 
appropriate to this gene in the zebrafish. 
Concentrations are measured as numbers 
of molecules per cell. The predicted 
period is close to the observed period, 
which is 30 minutes per somite in the 
zebrafish. 



Embryonic Tissues Are Invaded in a Strictly Controlled Fashion by 
Migratory Cells 

The muscle-cell precursors, or myoblasts, that emigrate from the somites are 
determined but not overtly differentiated. In the tissues that they colonize they 
will mingle with other classes of cells from which they appear practically indis- 
tinguishable; but they will maintain expression of myoblast- specific gene regu- 
latory proteins (such as Pax3 and members of the MyoD family), and when the 
time comes for differentiation, they, and they alone, will turn into muscle cells 
(Figure 22-83). 
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Figure 22-83 The migratory origin of limb muscle cells. The migrations 
can be traced by grafting cells from a quail embryo into a chick embryo; 
the two species are very similar in their development, but the quail cells 
are recognizable by the distinctive appearance of their nucleoli. If quail 
somite cells are substituted for the somite cells of a chick embryo at 2 days 
of incubation and the wing of the chick is sectioned a week later, it is found 
that the muscle cells in the chick wing derive from the transplanted quail 
somites. 



The eventual pattern of muscles — in the limbs, for example — is determined 
by the routes that the migrant cells follow and the selection of sites that they col- 
onize. The embryonic connective tissues form the framework through which the 
myoblasts travel and provide signals that guide their distribution. No matter 
which somite they come from, myoblasts that migrate into a forelimb bud will 
form the pattern of muscles appropriate to a forelimb, and those that migrate 
into a hindlimb bud will form the pattern appropriate to a hindlimb. 

Other classes of migrant cells, meanwhile, select different routes for their 
travels. Along the line where the neural tube pinches off from the future epider- 
mis, a number of ectodermal cells break loose from the epithelium and also 
migrate as individuals out through the mesoderm (Figure 22-84). These are the 
cells of the neural crest; they will give rise to almost all of the neurons and glial 
cells of the peripheral nervous system, as well as the pigment cells of the skin 
and many connective tissues in the head, including bones of the skull and jaws. 
Other important migrants are the precursors of the blood cells, of the germ cells, 
and of many groups of neurons within the central nervous system, as well as the 
endothelial cells that form blood vessels. Each of these classes of travelers will 
colonize a different set of sites. As a result of such invasions, most tissues in the 
vertebrate body are mixtures of cells of different characters derived from widely 
separate parts of the embryo. 

As a migrant cell travels through the embryonic tissues, it repeatedly extends 
projections that probe its immediate surroundings, testing for subtle cues to 
which it is particularly sensitive by virtue of its specific assortment of cell- surface 
receptor proteins. Inside the cell these receptor proteins are connected to the 
cytoskeleton, which moves the cell along. Some extracellular matrix materials, 
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Figure 22-84 The main pathways of neural crest cell migration. A chick embryo is shown in a 
schematic cross section through the middle part of the trunk. Deep-lying neural crest derivatives 
are indicated by yellow text boxes. The cells that take the pathway just beneath the ectoderm will 
form pigment cells of the skin; those that take the deep pathway via the somites will form the 
neurons and glial cells of sensory and sympathetic ganglia, and parts of the adrenal gland. The 
neurons and glial cells of the enteric ganglia, in the wall of the gut, are formed from neural crest 
cells that migrate along the length of the body, originating from either the neck region or the 
sacral region. In Drosophila, neurons in the wall of the gut originate in a similar way, by migration 
from the head end of the embryo. (See also Figure 1 9-23.) 
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such as the protein fibronectin, provide adhesive sites that help the cell to 
advance; others, such as chondroitin sulfate proteoglycan, inhibit locomotion 
and repel immigration. The nonmigrant cells along the pathway may likewise 
have inviting or repellent surfaces, or may even extend filopodia that touch the 
migrant cell and affect its behavior. 

Among this mass of different guiding influences, a few stand out as particu- 
larly important. In particular, cells of many different types are guided by chemo- 
taxis that depends on a receptor called CXCR4. This cell-surface protein belongs 
to the family of G-protein-coupled receptors, and it is activated by an extracel- 
lular ligand called SDF1. Cells expressing CXCR4 can snuffle their way along 
tracks marked out for them by production of SDF (Figure 22-85). Chemotaxis 
towards sources of SDF1 plays a major part in guiding the migrations of lym- 
phocytes and of various other white blood cells; of neurons in the developing 
brain; of muscle progenitor cells entering limb buds; of primordial germ cells as 
they travel toward the gonads; and of cancer cells when they metastasize. 



Figure 22-85 Migration of the lateral 
line primordium in a zebrafish larva, 
guided by SDF1 and CXCR4.The lateral 
line is a row of mechanosensory organs, 
closely similar to the sensory patches in 
the inner ear, which detect the 
movement of water over the surface of a 
fish or amphibian. (A) They originate as 
clusters of cells deposited by a 
primordium that migrates along the the 
flank of the larva, from a site in the head 
all the way down to the tail, as shown in 
this 2-day larva in which the lateral line 
cells are labeled by expression of Green 
Fluorescent Protein. (B) Cells in the 
primordium express the chemotaxis 
receptor CXCR4, shown here by in situ 
hybridization in a 1-day larva. (C) The 
track that they will follow is marked by 
expression of the ligand SDF1, shown by 
in situ hybridization in another 1-day 
specimen. If the ligand is lacking along 
the normal route (as a result of a 
mutation), the primordium departs from 
its proper route to follow an alternative 
more ventral track marked by another 
stripe of SDF1, defining the normal path 
of another migratory structure, the 
pronephros. (A, courtesy of David 
Gilmour; B and C, from N.B. David et al., 
Proc. Natl Acad. Sci. U.S.A. 
99:16297-16302, 2002. With permission 
from National Academy of Sciences.) 



The Distribution of Migrant Cells Depends on Survival Factors as 
Well as Guidance Cues 

The final distribution of migrant cells depends not only on the routes they take, 
but also on whether they survive the journey and thrive in the environment they 
find at the journey's end. Specific sites provide survival factors needed by spe- 
cific types of migrant. For example, the neural crest cells that give rise to the pig- 
ment cells of the skin and the nerve cells of the gut depend on a peptide factor 
called endothelin-3 that is secreted by tissues on the migration pathways; 
mutant mice and humans defective in the gene for this factor or its receptor 
have nonpigmented (albino) patches and potentially lethal gut malformations 
resulting from the lack of gut innervation (a condition called megacolon, 
because the colon becomes hugely distended). 

Germ cells, blood cell precursors, and neural- crest- derived pigment cells all 
appear to share at least one common requirement for survival. This involves a 
transmembrane receptor, called the Kit protein, in the membrane of the migrant 
cells, and a ligand, called the Steel factor, produced by the cells of the tissue 
through which the cells migrate and/ or in which they come to settle. Individu- 
als with mutations in the genes for either of these proteins are deficient in their 
pigmentation, their supply of blood cells, and their production of germ cells 
(Figure 22-86). 



1 376 Chapter 22: Development of Multicellular Organisms 




Figure 22-86 Effect of mutations in the 
Kit gene. Both the baby and the mouse 
are heterozygous for a loss-of-function 
mutation that leaves them with only half 
the normal quantity of Kit gene product. 
In both cases pigmentation is defective 
because pigment cells depend on the Kit 
product as a receptor for a survival factor. 
(Courtesy of R.A. Fleischman, from Proc. 
Natl Acad. Sci. U.S.A. 88:10885-10889, 
1991. With permission from National 
Academy of Sciences.) 



Left-Right Asymmetry of the Vertebrate Body Derives From 
Molecular Asymmetry in the Early Embryo 

Vertebrates may look bilaterally symmetrical from the outside, but many of their 
internal organs — the heart, the stomach, the liver, and so on — are highly asym- 
metric. This asymmetry is quite reproducible: 99.98% of people have their heart 
on the left. We have seen how a vertebrate embryo develops its internal and 
external tissue layers and its anteroposterior and dorsoventral axes. But how 
does the left-right asymmetry arise? 

Genetic studies in mammals show that this problem can be broken down 
into two distinct questions — one concerning the creation of asymmetry and the 
other concerning its orientation. Several mutations are known, in humans and 
in mice, that cause a randomization of the left-right axis: 50% of the mutant 
individuals have their internal organs arranged in the normal way, while the 
other 50% have an inverted anatomy, with the heart on the right. In these indi- 
viduals, it seems, the mechanism that makes the left and right sides different has 
functioned correctly, but the mechanism that decides between the two possible 
orientations of the left-right axis is defective. 

A key to the basis of these phenomena comes from the discovery of molec- 
ular asymmetries that precede the first gross anatomical asymmetries. The ear- 
liest signs are seen in patterns of gene expression in the neighborhood of the 
node — the homolog in mouse and chick of the frog Organizer. In particular, the 
gene Nodal coding for a member of the TGF(3 superfamily is expressed asym- 
metrically in this region (not only in the mouse, but also in chick, frog and 
zebrafish) (Figure 22-87). Asymmetry of Nodal expression in the immediate 
neighborhood of the node is relayed outward to create a broad stripe of Nodal 
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Figure 22-87 Helical beating of cilia at 
the node, and the origins of left-right 
asymmetry. (A) The beating of the cilia 
drives a fluid flow toward one side of the 
node, and this leads to asymmetric gene 
expression in the neighborhood of the 
node. According to one theory, the flow 
exerts this effect by carrying extracellular 
signal proteins to one side. Another 
theory notes that cilia can also function as 
mechanosensors, and proposes that a 
subset of cilia at the node respond to 
deflection due to the fluid flow by 
opening Ca 2+ channels so as to create an 
increased Ca 2+ concentration in the cells 
on one side. (B) The resulting asymmetric 
expression pattern of Nodal, coding for a 
signal protein belonging to theTGF(3 
superfamily, in the neighborhood of the 
node (lower two blue spots) in a mouse 
embryo at 8 days of gestation, as shown 
by in situ hybridization. At this stage, the 
asymmetry has already been relayed 
outward to the lateral plate mesoderm, 
where Nodal is expressed on the left side 
(large elongated blue patch) but not the 
right. (B, courtesy of Elizabeth Robertson.) 
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expression in the mesoderm along the left side — and only the left side — of the 
embryo's body The mechanism that relays the asymmetry from the node and 
localizes Nodal expression is not understood and may vary from one class of ver- 
tebrates to another. In all species, however, it seems to depend on feedback 
loops involving Nodal together with a second set of genes, the Lefty genes. These, 
like Nodal itself, are directly regulated by the Nodal signaling pathway and their 
products, the Lefty proteins, are related to Nodal; but Lefty proteins diffuse more 
widely and act oppositely, as Nodal antagonists. Mice with a knockout mutation 
in the Lefty 1 gene frequently have the right side converted into a mirror image of 
the left, so that left-right asymmetry is lost. 

Another gene that is directly regulated by the Nodal pathway, Pitx2, coding 
for a gene regulatory protein, links the outcome of the Nodal/Lefty interactions 
to subsequent anatomical development. Nodal drives Pitx2 expression on the 
left side of the body and thereby confers asymmetry on the heart and other 
internal organs. 

This leaves us with the puzzle of how the initial asymmetry of Nodal expres- 
sion originates. Whatever the mechanism, the outcome of events at the node in 
a normal animal must be biased so that left-specific genes are regularly 
expressed on the left side: there has to be a link between the mechanism that 
creates asymmetry and the mechanism that orients it. A clue to the orienting 
mechanism first came to light in a Swedish infertility clinic. A small subset of 
infertile men were found to have sperm that were immotile because of a defect 
in the dynein molecules needed for beating of cilia and flagella. These men also 
suffered from chronic bronchitis and sinusitis because the cilia in their respira- 
tory tract were defective. And strikingly, 50% of them had their internal organs 
left-right inverted, with the heart on the right. The findings originally seemed 
completely mysterious; but similar effects are seen in mammals with other 
mutations resulting in defective cilia. This suggests that ciliary beating somehow 
controls which way the left-right axis is oriented. 

Time-lapse videomicroscopy in the living mouse embryo reveals that the 
cells at the node, on its internal face, have cilia that beat in a helical fashion: like 
a screw-thread, they have a definite handedness, and at the node they are set in 
a little hollow that is shaped so that their beating drives a current of fluid towards 
the left side (see Figure 22-87A). According to one theory, signal proteins carried 
in this current toward the left side provide the bias that orients the left-right axis 
of the mouse body. Another theory proposes that cilia in this system, as in cer- 
tain other contexts, act not only as drivers of fluid flow but also as mechanical 
sensors, responding to deflection by generating an asymmetric current of Ca 2+ 
ions across the node to influence adjacent tissue. 

The handedness of the ciliary beating reflects the handedness — the 
left-right asymmetry — of the organic molecules of which all living things are 
made. It seems that this, therefore, is the ultimate director of the left-right asym- 
metry of our anatomy. 

Summary 

Animal development involves dramatic cell movements. Thus, in gastrulation, cells 
from the exterior of the early embryo tuck into the interior to form a gut cavity and 
create the three germ layers — endoderm, mesoderm, and ectoderm— from which 
higher animals are constructed. In vertebrates, the movements of gastrulation are 
organized by signals from the Organizer (the dorsal lip of the amphibian blastopore, 
corresponding to the node in a chick or mouse embryo). These signals specify the 
dorsoventral axis of the body and govern convergent extension, in which the sheet of 
cells moving into the interior of the body lengthens along the head-to-tail axis while 
narrowing at right angles to this axis. The active repacking movements of individual 
cells that drive convergent extension are coordinated through the Frizzled/Dishev- 
elled planar- polarity signaling pathway — a branch of the Wnt signaling pathway 
that regulates the actin cytoskeleton. 

Subsequent development involves many further cell movements. Part of the ecto- 
derm thickens, rolls up, and pinches off to form the neural tube and neural crest. In the 
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midline, a rod of specialized cells called the notochord elongates to form the central 
axis of the embryo. The long slabs of mesoderm on either side of the notochord become 
segmented into somites. Migrant cells, such as those of the neural crest, break loose 
from their original neighbors and travel through the embryo to colonize new sites. Pri- 
mordial germ cells and many other migrants are guided by chemotaxis dependent on 
the receptor CXCR4 and its ligand SDF1. Specific cell adhesion molecules, such as cad- 
herins and integrins, help to guide the migrations and control the selective cohesion of 
cells in new arrangements. 

Ultimately, the pattern of cell movements is directed by the pattern of gene expres- 
sion, which determines cell surface properties and motility. Thus, the formation of 
somites depends on a periodic pattern of gene expression, which is laid down by a bio- 
chemical oscillator — the segmentation clock — in the mesoderm and dictates the way 
the mass of cells will break up into separate blocks. Similarly, the left-right anatomi- 
cal asymmetry of the vertebrate body is foreshadowed by left-right asymmetry in the 
pattern of gene expression in the early embryo. This asymmetry, in mammals at least, 
is thought to be directed ultimately by the handedness of ciliary beating in the neigh- 
borhood of the node. 



THE MOUSE 

The mouse embryo — tiny and inaccessible in its mother's womb — presents a 
hard challenge to developmental biologists. It has, however, two immediate 
attractions. First, the mouse is a mammal, and mammals are the animals that 
we, as humans, care about most. Second, among mammals, it is one of the most 
convenient for genetic studies, because it is small and breeds rapidly. These two 
factors have spurred an enormous research effort, resulting in the development 
of some remarkably powerful experimental tools. In this way, the mouse has 
become the main model organism for experimentation in mammalian genetics 
and the most intensively studied surrogate for humans. It is separated from 
humans by only about 100 million years of evolution. Its genome is the same as 
ours in size, and there is very nearly a one-to-one correspondence between 
mouse and human genes. Our proteins are typically 80-90% identical in amino 
acid sequence, and large blocks of close nucleotide sequence similarity are also 
evident when the regulatory DNA sequences are compared. 

Through ingenuity and perseverance, developmental biologists have now 
found ways to gain access to the early mouse embryo without killing it and to 
generate mice to order with mutations in any chosen gene. Almost any genetic 
modification that can be made in a worm, a fly, or a zebrafish can now also be 
made in the mouse, and in some cases made better. The costs of research in the 
mouse are far greater, but so are the incentives. As a result, the mouse has 
become a rich source of information about all aspects of the molecular genetics 
of development — a key model system not only for mammals, but also for other 
animals. It has provided, for example, much of what we know about Hox genes, 
left-right asymmetry, cell death controls, the role of Notch signaling, and a host 
of other topics. 

We have already drawn repeatedly upon data from the mouse. We shall make 
use of it even more in the next chapter, where we discuss adult tissues and the 
developmental processes that occur in them. In this section, we examine the 
special features of mouse development that have been exploited to make the 
genetic manipulations possible. By way of example, we shall also outline how 
the mouse has been used to illuminate one further important developmental 
process — the creation of organs such as lungs and glands by interactions 
between embryonic connective tissue and epithelium. 



Mammalian Development Begins With a Specialized Preamble 

The mammalian embryo begins its development in an exceptional way. Pro- 
tected within the uterus, it does not have the same need as the embryos of most 
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other species to complete the early stages of development rapidly. Moreover, the 
development of a placenta quickly provides nutrition from the mother, so that 
the egg does not have to contain large stores of raw materials such as yolk. The 
egg of a mouse has a diameter of only about 80 jam and therefore a volume about 
2000 times smaller than that of a typical amphibian egg. Its cleavage divisions 
occur no more quickly than the divisions of many ordinary somatic cells, and 
gene transcription has already begun by the 2-cell stage. Most importantly, while 
the later stages of mammalian development are similar to those of other verte- 
brates such as Xenopus, mammals begin by taking a large developmental detour 
to generate a complicated set of structures — notably the amniotic sac and the 
placenta — that enclose and protect the embryo proper and provide for the 
exchange of metabolites with the mother. These structures, like the rest of the 
body, derive from the fertilized egg but are called extraembryonic because they 
are discarded at birth and form no part of the adult. Similar accessory structures 
are formed in the development of birds and reptiles. 

The early stages of mouse development are summarized in Figure 22-88. 
The fertilized egg divides to generate 16 cells by 3 days after fertilization. At first, 
the cells stick together only loosely, but beginning at the 8-cell stage they 
become more cohesive and undergo compaction to form a solid ball of cells 
called a morula (Latin for "little mulberry") (Figure 22-89). Apical tight junc- 
tions form between the cells, sealing off the interior of the morula from the 
external medium. Soon after this, an internal cavity develops, converting the 
morula into a blastocyst— a hollow sphere. The outer layer of cells, forming the 



Figure 22-88 The early stages of mouse 
development. The zona pellucida is a jelly 
capsule from which the embryo escapes 
after a few days, allowing it to implant in 
the wall of the uterus. (Photographs 
courtesy of Patricia Calarco.) 



Figure 22-89 Scanning electron 

micrographs of the early mouse 

embryo. The zona pellucida has been 

removed. (A) Two-cell stage. 

(B) Four-cell stage (a polar body is visible 

in addition to the four blastomeres — see 

Figure 21-23). (C) Eight-to-sixteen-cell 

morula-compaction occurring. 

(D) Blastocyst. (A-C, courtesy of Patricia 

Calarco; D, from P. Calarco and 

CJ. Epstein, Dev. Biol. 32:208-213, 1973. 

With permission from Academic Press.) 
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wall of the sphere, is called the trophectoderm. It will give rise to extraembryonic 
tissues. An inner clump of cells, called the inner cell mass, is located to one side 
of the cavity. It will give rise to the whole of the embryo proper. 

After the embryo has escaped from its jelly capsule (at about four days), the 
cells of the trophectoderm make close contact with the wall of the uterus, initi- 
ating the process of implantation that will lead on to formation of the placenta. 
Meanwhile the inner cell mass grows and begins to differentiate. Part of it gives 
rise to some further extraembryonic structures, such as the yolk sac, while the 
rest of it goes on to form the embryo proper by processes of gastrulation, neu- 
rulation, and so on, that are fundamentally similar to those seen in other verte- 
brates, although distortions of the geometry make some of the homologies hard 
to discern at first sight. 



The Early Mammalian Embryo Is Highly Regulative 

Localized intracellular determinants play only a small part in early mammalian 
development, and the blastomeres produced by the first few cell divisions are 
remarkably adaptable. If the early embryo is split in two, a pair of identical twins 
can be produced — two complete normal individuals from a single cell. Similarly, 
if one of the cells in a 2-cell mouse embryo is destroyed by pricking it with a nee- 
dle and the resulting "half-embryo" is placed in the uterus of a foster mother to 
develop, in many cases a perfectly normal mouse will emerge. 

Conversely, two 8-cell mouse embryos can be combined to form a single 
giant morula, which then develops into a mouse of normal size and structure 
(Figure 22-90). Such creatures, formed from aggregates of genetically different 
groups of cells, are called chimeras. Chimeras can also be made by injecting cells 
from an early embryo of one genotype into a blastocyst of another genotype. The 
injected cells become incorporated into the inner cell mass of the host blasto- 
cyst, and a chimeric animal develops. A single cell taken from an 8-cell embryo 
or from the inner cell mass of another early blastocyst can give rise in these ways 
to any combination of cell types in the chimera. Wherever the added cell may 
happen to find itself, it responds correctly to cues from its neighbors and follows 
the appropriate developmental pathway. 

These findings have two implications. First, during the early stages, the 
developmental system is self-adjusting, so that a normal structure emerges even 
if the starting conditions are perturbed. Embryos or parts of embryos that have 
this property are said to be regulative. Second, the individual cells of the inner 
cell mass are initially totipotent, or very nearly so: though they cannot form tro- 
phoblast, they can give rise to any part of the adult body, including germ cells. 



Totipotent Embryonic Stem Cells Can Be Obtained From a 
Mammalian Embryo 

If a normal early mouse embryo is grafted into the kidney or testis of an adult, its 
development is disturbed beyond any possibility of proper regulation, but not 
halted. The result is a bizarre tumorous growth known as a teratoma, consisting 
of a disorganized mass of cells containing many varieties of differentiated tis- 
sue — skin, bone, glandular epithelium, and so on — mixed with undifferentiated 
stem cells that continue to divide and generate yet more of these differentiated 
tissues. 

Investigation of the stem cells in teratomas and related types of tumors led 
to the discovery that their behavior reflects a remarkable property of the cells of 
the normal inner cell mass: given a suitable environment, they can be induced 
to proliferate indefinitely while retaining their totipotent character. Cultured 
cells with this property are called embryonic stem cells, or ES cells. They can be 
derived by placing a normal inner cell mass in culture and dispersing the cells as 
soon as they proliferate. Separating the cells from their normal neighbors and 
putting them in the appropriate culture medium evidently arrests the normal 
program of change of cell character with time and so enables the cells to carry 
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on dividing indefinitely without differentiating. Many tissues of the adult body 
also contain stem cells that can divide indefinitely without terminally differenti- 
ating, as we shall see in the next chapter; but these adult stem cells, when 
allowed to differentiate, normally give rise only to a narrowly restricted range of 
differentiated cell types. 

The state in which the ES cells are arrested seems to be equivalent to that of 
normal inner-cell-mass cells. This can be shown by taking ES cells from the cul- 
ture dish and injecting them into a normal blastocyst (Figure 22-91). The 
injected cells become incorporated in the inner cell mass of the blastocyst and 
can contribute to the formation of an apparently normal chimeric mouse. 
Descendants of the injected stem cells can be found in practically any of the tis- 
sues of this mouse, where they differentiate in a well-behaved manner appro- 
priate to their location and can even form viable germ cells. The extraordinarily 
adaptable behavior of ES cells shows that cues from a cell's neighbors not only 
guide choices between different pathways of differentiation, but can also stop or 
start the developmental clock — the processes that drive a cell to progress from 
an embryonic to an adult state. 

On a practical level, ES cells have a twofold importance. First, from a medi- 
cal point of view, they offer the prospect of a versatile source of cells for repair of 
damaged and defective tissues in the adult body, as we shall discuss at the end 
of the next chapter. Second, ES cells make possible the most precisely controlled 
forms of genetic modification, allowing animals to be created with virtually any 
desired alteration introduced into their genome. As discussed in Chapter 8, the 
technique uses genetic recombination to substitute an artificially constructed 
DNA segment for the normal DNA sequence at a chosen site in the genome of 
an ES cell. Although only a rare cell incorporates the DNA construct correctly, 
selection procedures have been devised to find this cell among the thousands of 
cells into which the DNA construct has been transfected. Once selected, the 
genetically modified ES cells can be injected into a blastocyst to make a chimeric 
mouse. This mouse will, with luck, have some ES- derived germ cells, capable of 
acting as founders of a new generation of mice that consist entirely of cells car- 
rying the carefully designed mutation. In this way, an entire mutant mouse can 
be resurrected from the culture dish (see Figure 8-65). 

Interactions Between Epithelium and Mesenchyme Generate 
Branching Tubular Structures 

Vertebrates are comparatively big animals, and they owe much of their bulk to 
connective tissues. For excretion, absorption of nutrients, and gas exchange, 
however, they also require large quantities of various specialized types of epithe- 
lial surfaces. Many of these take the form of tubular structures created by branch- 
ing morphogenesis, in which an epithelium invades embryonic connective tissue 
(mesenchyme) to form a composite organ. The lung is a typical example. It orig- 
inates from the endoderm lining the floor of the foregut. This epithelium buds 
and grows out into the neighboring mesenchyme to form the bronchial tree, a 
system of tubes that branch repeatedly as they extend (Figure 22-92). The same 
mesenchyme is also invaded by endothelial cells — the lining cells of blood ves- 
sels — to create the system of closely apposed airways and blood vessels required 
for gas exchange in the lung (discussed in Chapter 23). 

The whole process depends on exchanges of signals in both directions 
between the growing buds of epithelium and the mesenchyme that they are 
invading. These signals can be analyzed by genetic manipulation in the mouse. 
A central part is played by signal proteins of the fibroblast growth factor (FGF) 
family and the receptor tyrosine kinases on which they act. This signaling path- 
way has various roles in development, but it seems to be especially important in 
the many interactions that occur between epithelium and mesenchyme. 

Mammals have about 20 different Fgf genes, as compared with three in 
Drosophila and two in C. elegans. The Fgf that is most important in the lung is 
Fgf 10. This is expressed in clusters of mesenchyme cells near the tips of the 
growing epithelial tubes, while its receptor is expressed in the epithelial cells 
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Figure 22-91 Making a chimeric mouse 
with ES cells. The cultured ES cells can 
combine with the cells of a normal 
blastocyst to form a healthy chimeric 
mouse, and can contribute to any of its 
tissues, including the germ line. Thus the 
ES cells are totipotent. 
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themselves. FGF10 or its receptor can be knocked out (by the standard tech- 
niques based on recombination in ES cells). In the resulting knock-out mutant 
mouse, the whole process of branching morphogenesis then fails — a primary 
bud of lung epithelium is formed but fails to grow out into the mesenchyme to 
create a bronchial tree. Conversely, a microscopic bead soaked in FGF10 and 
placed near embryonic lung epithelium in culture will induce a bud to form and 
grow out toward it. Evidently, the epithelium invades the mesenchyme only by 
invitation, in response to FGF10. 

But what makes the growing epithelial tubes branch repeatedly as they 
invade? This seems to depend on a Sonic hedgehog signal that is sent in the 
opposite direction, from the epithelial cells at the tips of the buds back to the 
mesenchyme. In mice lacking Sonic hedgehog, the lung epithelium grows and 
differentiates, but forms a sac instead of a branching tree of tubules. Meanwhile, 
FGF10, instead of being restricted to small clusters of mesenchyme cells, with 
each cluster acting as a beacon to direct the outgrowth of a separate epithelial 
bud, is expressed in broad bands of cells immediately adjacent to the epithe- 
lium. This finding suggests that the Sonic hedgehog signal may serve to shut off 
FGF10 expression in the mesenchyme cells closest to the growing tip of a bud, 
splitting the FGFlO-secreting cluster into two separate clusters, which in turn 
cause the bud to branch into two (see Figure 22-92A). 

The branching growth of the epithelium and mesenchyme has to be coordi- 
nated with development of the associated blood vessels, and the whole process 
involves a large number of additional signals. Many aspects of the system are 
still not understood. It is known, however, that Drosophila uses closely related 
mechanisms to govern the branching morphogenesis of its tracheal system — the 
tubules that form the airways of an insect. Again, the process depends on the 
Drosophila FGF protein, encoded by the Branchless gene, and the Drosophila 
FGF receptor, encoded by the Breathless gene, both operating in much the same 
way as in the mouse. Indeed, genetic studies of tracheal development in 
Drosophilahave also identified other components of the control machinery, and 
the Drosophila genes have led us to their vertebrate homologs. Genetic manip- 
ulations in the mouse have given us the means to test whether these genes have 
similar functions in mammals too; and to a remarkable extent they do. 




Figure 22-92 Branching morphogenesis 
of the lung. (A) How FGF10 and Sonic 
hedgehog are thought to induce the 
growth and branching of the buds of the 
bronchial tree. Many other signal 
molecules, such as BMP4, are also 
expressed in this system, and the 
suggested branching mechanism is only 
one of several possibilities. (B) A cast of 
the adult human bronchial tree, prepared 
by injecting resin into the airways; resins 
of different colors have been injected 
into different branches of the tree. 
(B, from R. Warwick and P.L Williams, 
Gray's Anatomy, 35th ed. Edinburgh: 
Longman, 1973.) 



Summary 

The mouse has a central role as model organism for study of the molecular genetics of 
mammalian development. Mouse development is essentially similar to that of other 
vertebrates, but begins with a specialized preamble to form extraembryonic structures 
such as the amnion and placenta. Powerful techniques have been devised for creation 
of gene knockouts and other targeted genetic alterations by exploiting the highly regu- 
lative properties of the cells of the inner cell mass of the mouse embryo. These cells can 
be put into culture and maintained as embryonic stem cells (ES cells). Under the right 
culture conditions, ES cells can proliferate indefinitely without differentiating, while 
retaining the ability to give rise to any part of the body when injected back into an 
early mouse embryo. 
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Figure 22-93 A typical neuron of a 
vertebrate. The arrows indicate the 
direction in which signals are conveyed. 
The neuron shown is from the retina of a 
monkey. The longest and largest neurons 
in a human extend for about 1 million 
jim and have an axon diameter of 15 urn 
(Drawing of neuron from B.B. Boycott, in 
Essays on the Nervous System [R. Bellairs 
and E.G. Gray, eds.]. Oxford, UK: 
Clarendon Press, 1974.) 



Many general developmental processes, including most of those discussed else- 
where in the chapter, have been illuminated by studies in the mouse. As just one exam- 
ple, the mouse has been used to investigate the control of branching morphogenesis. 
This process gives rise to structures such as lungs and glands, and is governed by 
exchanges of signals between mesenchyme cells and an invading epithelium. The 
functions of these signals can be analyzed by gene knockout experiments. 



NEURAL DEVELOPMENT 

Nerve cells, or neurons, are among the most ancient of all specialized animal 
cell types. Their structure is like that of no other class of cells, and the develop- 
ment of the nervous system poses problems that have no real parallel in other 
tissues. A neuron is extraordinary above all for its enormously extended shape, 
with a long axon and branching dendrites connecting it through synapses to 
other cells (Figure 22-93). The central challenge of neural development is to 
explain how the axons and dendrites grow out, find their right partners, and 
synapse with them selectively to create a functional network (Figure 22-94) . The 
problem is formidable: the human brain contains more than 10 11 neurons, each 
of which, on average, has to make connections with a thousand others, accord- 
ing to a regular and predictable wiring plan. The precision required is not so 
great as in a man-made computer, for the brain performs its computations in a 
different way and is more tolerant of vagaries in individual components; but the 
brain nevertheless outstrips all other biological structures in its organized com- 
plexity. 

The components of a typical nervous system — the various classes of neu- 
rons, glial cells, sensory cells, and muscles — originate in a number of widely sep- 
arate locations in the embryo and are initially unconnected. Thus, in the first 
phase of neural development (Figure 22-95), the different parts develop accord- 
ing to their own local programs: neurons are born and assigned specific charac- 
ters according to the place and time of their birth, under the control of inductive 
signals and gene regulatory mechanisms similar to those we have already dis- 
cussed for other tissues of the body. The next phase involves a type of morpho- 
genesis unique to the nervous system: axons and dendrites grow out along spe- 
cific routes, setting up a provisional but orderly network of connections between 
the separate parts of the system. In the third and final phase, which continues 
into adult life, the connections are adjusted and refined through interactions 
among the far-flung components in a way that depends on the electrical signals 
that pass between them. 



Neurons Are Assigned Different Characters According to the Time 
and Place Where They Are Born 

Neurons are almost always produced in association with glial cells, which pro- 
vide a supporting framework and create an enclosed, protected environment in 
which the neurons can perform their functions. Both cell types, in all animals, 




Figure 22-94 The complex organization 
of nerve cell connections. This drawing 
depicts a section through a small part of 
a mammalian brain — the olfactory bulb 
of a dog, stained by the Golgi technique. 
The black objects are neurons; the thin 
lines are axons and dendrites, through 
which the various sets of neurons are 
interconnected according to precise 
rules. (From C. Golgi, Riv. sper. freniat 
Reggio-Emilia 1:405-425, 1875; 
reproduced in M. Jacobson, 
Developmental Neurobiology, 3rd ed. 
New York: Plenum, 1992.) 
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develop from the ectoderm, usually as sister cells or cousins derived from a 
common precursor. Thus, in vertebrates, the neurons and glial cells of the cen- 
tral nervous system (including the spinal cord, the brain, and the retina of the 
eye) derive from the part of the ectoderm that rolls up to form the neural tube, 
while those of the peripheral nervous system derive mainly from the neural crest 
(Figure 22-96). 

The neural tube, with which we shall be mainly concerned, consists initially 
of a single-layered epithelium (Figure 22-97). The epithelial cells are the pro- 
genitors of the neurons and glia. As these cell types are generated, the epithe- 
lium becomes thickened and transformed into a more complex structure. As dis- 
cussed earlier, Delta-Notch signaling controls the differentiation of the progen- 
itor cells into neurons: the nascent neurons express Delta, and thereby inhibit 
their neighbors from differentiating into neurons at the same time. This ensures 
that the progenitors do not all differentiate simultaneously but remain as a 
dividing cell population from which further neurons can be generated. The pro- 
genitor and, later, glial cells also maintain the cohesiveness of the epithelium 
and form a scaffolding that spans its thickness. Along and between these tall 
cells, like animals amid the trees of the forest, the new-born neurons migrate, 
find their resting places, mature, and send out their axons and dendrites (Figure 
22-98). 

Signal proteins secreted from the ventral and dorsal sides of the neural tube 
act as opposing morphogens, causing neurons born at different dorsoventral 
levels to express different gene regulatory proteins (see Figure 22-80). There are 
differences along the head-to-tail axis as well, reflecting the anteroposterior pat- 
tern of expression of Hox genes and the actions of yet other morphogens. More- 
over, just as in Drosophila, neurons continue to be generated in each region of 
the central nervous system over many days, weeks, or even months, and this 
gives rise to still greater diversity, because the cells adopt different characters 



Figure 22-96 Diagram of a 2-day chick embryo, showing the origins of 
the nervous system. The neural tube (light green) has already closed, 
except at the tail end, and lies internally, beneath the ectoderm, of which it 
was originally a part (see Figure 22-78). The neural crest (red) lies dorsally 
just beneath the ectoderm, in or above the roof of the neural tube. In 
addition, thickenings, or placodes (dark green), in the ectoderm of the head 
give rise to some of the sensory transducer cells and neurons of that 
region, including those of the ear and the nose. The cells of the retina of 
the eye, by contrast, originate as part of the neural tube. 



Figure 22-95 The three phases of neural 
development. 
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according to their "birthday" — the time of the terminal mitosis that marks the 
beginning of neuronal differentiation (Figure 22-99). When progenitor cells are 
taken from an embryonic mouse brain and maintained in culture for several 
days, individually isolated from their normal surroundings, they go through 
much the same program as in the intact tissue. That is, they divide repeatedly, 
producing pairs of daughters that frequently adopt different fates, such that one 
remains as a dividing progenitor while the other becomes committed to differ- 
entiate. 

The successive divisions throw off a sequence of different neuronal and glial 
cell types, according to a more-or-less regular timetable. This implies that the 
progenitors themselves must autonomously change their intrinsic character 
from one cell generation to the next. The molecular mechanism of this progres- 
sive change is unknown, just as it is in other cell types where similar slow 
changes occur. 



The Character Assigned to a Neuron at Its Birth Governs the 
Connections It Will Form 

The differences of gene expression modulate the characters of the neurons and 
help to cause them to make connections with different partners. In the spinal 
cord, for example, ventrally located clusters of cells express genes of the Islet/Lim 
homeobox family (coding for gene regulatory proteins) and develop as motor 
neurons, sending out axons to connect with specific subsets of muscles — differ- 
ent muscles according to the particular Islet/Lim family members expressed. If 
the pattern of gene expression is artificially altered, the neurons project to dif- 
ferent target muscles. 

The different destinations reflect different pathway choices that the axons 
make as they grow out from the nerve cell body, as well as their selective 
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Figure 22-97 Formation of the neural 
tube. The scanning electron micrograph 
shows a cross section through the trunk of 
a 2-day chick embryo. The neural tube is 
about to close and pinch off from the 
ectoderm; at this stage it consists (in the 
chick) of an epithelium that is only one cell 
thick. (Courtesy of J.P. Revel and S. Brown.) 
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Figure 22-98 Migration of immature 
neurons. Before sending out axons and 
dendrites, newborn neurons often 
migrate from their birthplace and settle 
in some other location. The diagrams are 
based on reconstructions from sections 
of the cerebral cortex of a monkey (part 
of the neural tube). The neurons go 
through their final cell division close to 
the inner, luminal face of the neural tube 
and then migrate outward by crawling 
along radial glial cells. Each of these cells 
extends from the inner to the outer 
surface of the tube, a distance that may 
be as much as 2 cm in the cerebral cortex 
of the developing brain of a primate. The 
radial glial cells can be considered as 
persisting cells of the original columnar 
epithelium of the neural tube that 
become extraordinarily stretched as the 
wall of the tube thickens. (After P. Rakic, 
J. Comp. Neurol. 145:61-84, 1972. With 
permission from John Wiley & Sons, Inc.) 
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Figure 22-99 Programmed production 
of different types of neurons at different 
last-born times from dividing progenitors in the 
neurons cerebral cortex of the brain of a 

mammal. Close to one face of the cortical 
neuroepithelium, progenitor cells divide 
repeatedly, in stem-cell fashion, to 
produce neurons. The neurons migrate 
out toward the opposite face of the 
epithelium by crawling along the 
surfaces of radial glial cells, as shown in 
first-born Figure 22-98. The first-born neurons 
neurons settle closest to their birthplace, while 

neurons born later crawl past them to 
settle farther out. Successive generations 
of neurons thus occupy different layers in 
the cortex and have different intrinsic 
characters according to their birth dates. 



dividing progenitor cell 



radial glial cell 



recognition of different target cells at the end of the journey. In the dorsal part of 
the spinal cord lie neurons that receive and relay sensory information from sen- 
sory neurons in the periphery of the body. In intermediate positions, there are 
various other classes of interneurons, connecting specific sets of nerve cells to 
one another. Some send their axons dorsally, others ventrally; some up toward 
the head, others down toward the tail, still others across the floor of the neural 
tube to the other side of the body (Figure 22-100). In a timelapse film where the 
developing neurons are stained with a fluorescent dye, one can watch the move- 
ments of the growing tips of the axons as they extend: one is reminded of the 
lights of rush-hour traffic at night, as the cars streak along a network of high- 
ways, turning this way or that at busy junctions, each one making its own choice 
of route. 

How are these complex movements guided? Before attempting an answer, 
we must examine more closely the structure of the growing neuron. 



Each Axon or Dendrite Extends by Means of a Growth Cone at Its 
Tip 

A typical neuron sends out one long axon, projecting toward a distant target to 
which signals are to be delivered, and several shorter dendrites, on which it 
mainly receives incoming signals from axon terminals of other neurons. Each 
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Figure 22-100 Growing axons in the 
developing spinal cord of a 3-day chick 
embryo. The drawing shows a cross 
section stained by the Golgi technique. 
Most of the neurons, apparently, have as 
yet only one elongated process — the 
future axon. An irregularly shaped 
expansion — a growth cone — is seen at 
the growing tip of each axon. The growth 
cones of the motor neurons emerge from 
the spinal cord (to make their way toward 
muscles), those of the sensory neurons 
grow into it from outside (where their cell 
bodies lie), and those of the interneurons 
remain inside the spinal cord. Many of 
the interneurons send their axons down 
toward the floor plate to cross to the 
other side of the spinal cord; these axons 
are called commissural. At this early 
stage, many of the embryonic spinal-cord 
cells (in the regions shaded gray) are still 
proliferating and have not yet begun to 
differentiate as neurons or glial cells. 
(From S. Ramon y Cajal, Histologie du 
Systeme Nerveux de I'Homme et des 
Vertebres, 1 909-1 91 1 . Paris: Maloine; 
reprinted, Madrid: C.S.I.C, 1972.) 
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Figure 22-101 Formation of axon and 
dendrites in culture. A young neuron has 
been isolated from the brain of a 
mammal and put to develop in culture, 
where it sends out processes. One of 
these processes, the future axon, has 
begun to grow out faster than the rest 
(the future dendrites) and has bifurcated. 
(A) A phase-contrast picture; (B) the 
pattern of staining with fluorescent 
phalloidin, which binds to filamentous 
actin. Actin is concentrated in the growth 
cones at the tips of the processes that are 
actively extending and at some other 
sites of lamellipodial activity. (Courtesy of 
Kimberly Goslin.) 



process extends by growth at its tip, where an irregular, spiky enlargement is 
seen. This structure, called the growth cone, crawls through the surrounding tis- 
sue, trailing a slender axon or dendrite behind it (see Figure 22-100). <AAGA> 
The growth cone comprises both the engine that produces the movement and 
the steering apparatus that directs the tip of each process along the proper path 
(see Figure 16-105). 

Much of what we know about the properties of growth cones has come from 
studies in tissue or cell culture. One can watch as a neuron begins to put out its 
processes, all at first alike, until one of the growth cones puts on a sudden turn 
of speed, identifying its process as the axon, with its own axon-specific set of 
proteins (Figure 22-101). The contrast between axon and dendrite established 
at this stage involves polarized intracellular transport of different materials into 
the two types of process. As a result, they will grow out for different distances, 
follow different paths, and play different parts in synapse formation. 

The growth cone at the end of a typical growing nerve cell process — either 
axon or dendrite — moves forward at a speed of about 1 mm per day, continually 
probing the regions that lie ahead and on either side by putting out filopodia 
and lamellipodia. When such a protrusion contacts an unfavorable surface, it 
withdraws; when it contacts a more favorable surface, it persists longer, steering 
the growth cone as a whole to move in that direction. In this way the growth cone 
can be guided by subtle variations in the surface properties of the substrata over 
which it moves. At the same time, it is sensitive to diffusible chemotactic factors 
in the surrounding medium, which can also encourage or hinder its advance. 
These behaviors depend on the cytoskeletal machinery inside the growth cone, 
as discussed in Chapter 16. A multitude of receptors in the growth cone mem- 
brane detect the external signals and, through the agency of intracellular regu- 
lators such as the monomeric GTPases Rho and Rac, control the assembly and 
disassembly of actin filaments and other components of the machinery of cell 
movement. 



The Growth Cone Pilots the Developing Neurite Along a Precisely 
Defined Path In Vivo 

In living animals, growth cones generally travel toward their targets along pre- 
dictable, stereotyped routes, exploiting a multitude of different cues to find their 
way, but always requiring a substratum of extracellular matrix or cell surface to 
crawl over. Often, growth cones take routes that have been pioneered by other 
neurites, which they follow by contact guidance. As a result, nerve fibers in a 
mature animal are usually found grouped together in tight parallel bundles 
(called fascicles or fiber tracts). Such crawling of growth cones along axons is 
thought to be mediated by homophilic cell-cell adhesion molecules — mem- 
brane glycoproteins that help a cell displaying them to stick to any other cell that 
also displays them. As discussed in Chapter 19, two of the most important 
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classes of such molecules are those that belong to the immunoglobulin super- 
family, such as N-CAM, and those of the Ca 2+ - dependent cadherin family, such 
as N-cadherin. Members of both families are generally present on the surfaces 
of growth cones, of axons, and of various other cell types that growth cones crawl 
over, including glial cells in the central nervous system and muscle cells in the 
periphery of the body. The human genome contains more than 100 cadherin 
genes, for example, and most of them are expressed in the brain (see Figure 
19-6). Different sets of cell-cell adhesion molecules, acting in varied combina- 
tions, provide a mechanism for selective neuronal guidance and recognition. 
Growth cones also migrate over components of the extracellular matrix. Some of 
the matrix molecules, such as laminin, favor axon outgrowth, while others, such 
as chondroitin sulfate proteoglycans, discourage it. 

Growth cones are guided by a succession of different cues at different stages 
of their journey, and the stickiness of the substratum is not the only thing that 
matters. Another important part is played by chemotactic factors, secreted from 
cells that act as beacons at strategic points along the path — some attracting, oth- 
ers repelling. The trajectory of commissural axons — those that cross from one 
side of the body to the other — provides a beautiful example of how a combina- 
tion of guidance signals can specify a complex path. Commissural axons are a 
general feature of bilaterally symmetrical animals, because the two sides of the 
body have to be neurally coordinated. Worms, flies and vertebrates use closely 
related mechanisms to guide their outgrowth. 

In the developing spinal cord of a vertebrate, for example, a large number of 
neurons send their axonal growth cones ventrally toward the floor plate — a spe- 
cialized band of cells forming the ventral midline of the neural tube (see Figure 
22-100). The growth cones cross the floor plate and then turn abruptly through 
a right angle to follow a longitudinal path up toward the brain, parallel to the 
floor plate but never again crossing it (Figure 22-102A). The first stage of the 
journey depends on a concentration gradient of the protein netrin, secreted by 
the cells of the floor plate: the commissural growth cones sniff their way toward 
its source. Netrin was purified from chick embryos, by assaying extracts of neu- 
ral tissue for an activity that would attract commissural growth cones in a cul- 
ture dish. Its sequence revealed that it was the vertebrate homolog of a protein 
already known from C. elegans, through genetic screens for mutant worms with 
misguided axons — called Unc mutants because they move in an ^coordinated 
fashion. One of the Unc genes, Unc6, codes for the homolog of netrin. Another, 
Unc40, codes for its transmembrane receptor; and this too has a vertebrate 
homolog called DCC that is expressed in the commissural neurons and mediates 
their response to the netrin gradient. 

Localized activation of DCC by netrin leads to opening of a specialized class 
of ion channels in the plasma membrane. These channels, called TRPC (Tran- 
sient Receptor Potential C) channels, belong to a large family (the TRP family) 



Figure 22-1 02 The guidance of 
commissural axons. (A) The pathway 
taken by commissural axons in the 
embryonic spinal cord of a vertebrate. 
(B) The signals that guide them. The 
growth cones are first attracted to the 
floor plate by netrin, which is secreted by 
the floor-plate cells and acts on the 
receptor DCC in the axonal membrane. 
As they cross the floor plate, the growth 
cones upregulate their expression of 
Roundabout, the receptor for a repellent 
protein, Slit, that is also secreted by the 
floor plate. Slit, binding to Roundabout, 
not only acts as a repellent to keep the 
cells from re-entering the floor plate, but 
also blocks responsiveness to the 
attractant netrin. At the same time, the 
growth cones switch on expression of 
receptors for another repellent protein, 
semaphorin, that is secreted by the cells 
in the side walls of the neural tube. 
Trapped between two repellent 
territories, the growth cones, having 
crossed the midline, travel in a tight 
fascicle up toward the brain. 
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that is responsible for many other sensory transduction processes, from 
mechanosensation to the perception of heat and cold. When open, the TRPC 
channels allow Ca 2+ (and other cations) to enter the cell. The localized rise in 
Ca 2+ then activates the machinery for extension of filopodia and movement of 
the growth cone toward the netrin source. 

The receptors on each growth cone determine the route it will take: non- 
commissural neurons in the neural tube, lacking DCC, are not attracted to the 
floor plate; and neurons expressing a different netrin receptor — called Unc5H in 
vertebrates (with a counterpart Unc5 in the worm) — are actively repelled by the 
floor plate and send their axons instead toward the roof plate. 



Growth Cones Can Change Their Sensibilities as They Travel 

If commissural growth cones are attracted to the floor plate, why do they cross it 
and emerge on the other side, instead of staying in the attractive territory? And 
having crossed it, why do they never veer back onto it again? The likely answer 
lies in another set of molecules, several of which are also conserved between 
vertebrates and invertebrates. Studies of Drosophila mutants with misguided 
commissural axons first identified three of the key proteins: Slit, Roundabout, 
and Commissureless. 

Slit, like netrin, is produced by midline cells of the developing fly, while its 
receptor, Roundabout, is expressed in the commissural neurons. Slit, acting on 
Roundabout, has an effect exactly opposite to that of netrin: it repels the growth 
cones, blocking entry to the midline territory. Commissureless, however, inter- 
feres with the delivery of Roundabout to the cell surface and thereby makes the 
growth cones initially blind to this "keep-out" signal. Commissural growth cones 
in this state advance to the midline; as they cross it, they seem, by some mecha- 
nism that we do not yet understand, to lose their blindfold of Commissureless 
protein and begin to be repelled. Emerging on the far side, they now have func- 
tional Roundabout on their surfaces and are thereby prohibited from re-entry. 

In vertebrates, a similar mechanism operates, involving homologs of Slit and 
Roundabout. Commissural growth cones are at first attracted to the midline, and 
then somehow change their surface receptor proteins as they cross; in this way 
they switch their sensibilities, gaining sensitivity to repulsion by Slit — which is 
expressed in the floor plate — and losing sensitivity to attraction by netrin. Sen- 
sitivity to Slit in the initial approach to the midline is blocked not by any 
homolog of Commissureless but by a divergent member of the Roundabout 
receptor family called Rigl, which sits in the plasma membrane and interferes 
with signal reception by its cousins. The Rigl block is switched off by some 
unknown mechanism once the growth cones have crossed the midline. Repul- 
sion from the midline now prevents them from straying back across it. At the 
same time, the growth cones apparently become sensitive to another set of 
repulsive signals, in the form of proteins called semaphorins, which prevent 
them from traveling back up into the dorsal regions of the spinal cord. Trapped 
between the two sets of repulsive signals, the growth cones have no choice but 
to travel in a narrow track, running parallel to the floor plate but never re-enter- 
ing it (Figure 22-102B). 



Target Tissues Release Neurotrophic Factors That Control Nerve 
Cell Growth and Survival 

Eventually, axonal growth cones reach the target region where they must halt 
and make synapses. The neurons that sent out the axons can now begin to com- 
municate with their target cells. Although synapses generally transmit signals in 
one direction, from axon to either dendrite or muscle, the developmental com- 
munications are a two-way affair. Signals from the target tissue not only regulate 
which growth cones are to synapse where (as we discuss below), but also how 
many of the innervating neurons are to survive. 
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Figure 22-103 NGF effects on neurite outgrowth. Dark-field 
photomicrographs of a sympathetic ganglion cultured for 48 hours with 
(above) or without (below) NGF. Neurites grow out from the sympathetic 
neurons only if NGF is present in the medium. Each culture also contains 
Schwann (glial) cells that have migrated out of the ganglion; these are not 
affected by NGF. Neuronal survival and maintenance of growth cones for 
neurite extension represent two distinct effects of NGF. The effect on 
growth cones is local, direct, rapid, and independent of communication 
with the cell body; when NGF is removed, the deprived growth cones halt 
their movements within a minute or two. The effect of NGF on cell survival 
is less immediate and is associated with uptake of NGF by endocytosis and 
its intracellular transport back to the cell body. (Courtesy of Naomi 
Kleitman.) 



Most types of neurons in the vertebrate central and peripheral nervous sys- 
tem are produced in excess; up to 50% or more of them then die soon after they 
reach their target, even though they appear perfectly normal and healthy up to 
the time of their death. About half of all the motor neurons that send axons to 
skeletal muscle, for example, die within a few days after making contact with 
their target muscle cells. A similar proportion of the sensory neurons that inner- 
vate the skin die after their growth cones have arrived there. 

This large-scale death of neurons is thought to reflect the outcome of a com- 
petition. Each type of target cell releases a limited amount of a specific neu- 
rotrophic factor that the neurons innervating that target require to survive. The 
neurons apparently compete for the factor, and those that do not get enough die 
by programmed cell death. If the amount of target tissue is increased — for exam- 
ple, by grafting an extra limb bud onto the side of the embryo — more limb-inner- 
vating neurons survive; conversely, if the limb bud is cut off, the limb -innervating 
neurons all die. In this way, although individuals may vary in their bodily propor- 
tions, they always retain the right number of motor neurons to innervate all their 
muscles and the right number of sensory neurons to innervate their whole body 
surface. The seemingly wasteful strategy of overproduction followed by death of 
surplus cells operates in almost every region of the nervous system. It provides a 
simple and effective means to adjust each population of innervating neurons 
according to the amount of tissue requiring innervation. 

The first neurotrophic factor to be identified, and still the best characterized, 
is known simply as nerve growth factor, or NGF — the founding member of the 
neurotrophin family of signal proteins. It promotes the survival of specific 
classes of sensory neurons derived from the neural crest and of sympathetic 
neurons (a subclass of peripheral neurons that control contractions of smooth 
muscle and secretion from exocrine glands). NGF is produced by the tissues that 
these neurons innervate. When extra NGF is provided, extra sensory and sym- 
pathetic neurons survive, just as if extra target tissue were present. Conversely, 
in a mouse with a mutation that knocks out the gene for NGF or for its receptor 
(a transmembrane tyrosine kinase called TrkA), almost all sympathetic neurons 
and the NGF- dependent sensory neurons are lost. There are many neurotrophic 
factors, only a few of which belong to the neurotrophin family, and they act in 
different combinations to promote survival of different classes of neurons. 

NGF and its relatives have an additional role: besides acting on the nerve cell 
as a whole to control its survival, they regulate the outgrowth of axons and den- 
drites (Figure 22-103). These can even act locally on just one part of the tree of 
nerve cell processes, promoting or pruning the growth of individual branches: a 
growth cone exposed to NGF shows an immediate increase of motility. Con- 
versely, an axon branch that is deprived of NGF, while the rest of the neuron con- 
tinues to be bathed in the factor, dies back. 

The peripheral action of NGF continues to be important after the phase of 
neuronal death. In the skin, for example, it controls the branching of sensory 
nerve fibers, ensuring not only that the whole body surface becomes innervated 
during development but also that it recovers its innervation after damage. 
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Neuronal Specificity Guides the Formation of Orderly Neural 
Maps 

In many cases, axons originating from neurons of a similar type but located in 
different positions come together for the journey and arrive at the target in a 
tight bundle. There they disperse again, to terminate at different sites in the tar- 
get territory 

The projection from the eye to the brain provides an important example. 
<TACC> The neurons in the retina that convey visual information back to the 
brain are called retinal ganglion cells. There are more than a million of them, 
each one reporting on a different part of the visual field. Their axons converge on 
the optic nerve head at the back of the eye and travel together along the optic 
stalk into the brain. Their main site of termination, in most vertebrates other 
than mammals, is the optic tectum — a broad expanse of cells in the midbrain. In 
connecting with the tectal neurons, the retinal axons distribute themselves in a 
predictable pattern according to the arrangement of their cell bodies in the 
retina: ganglion cells that are neighbors in the retina connect with target cells 
that are neighbors in the tectum. The orderly projection creates a map of visual 
space on the tectum (Figure 22-104). 

Orderly maps of this sort are found in many brain regions. In the auditory 
system, for example, neurons project from the ear to the brain in a tonotopic 
order, creating a map in which brain cells receiving information about sounds of 
different pitch are ordered along a line, like the keys of a piano. And in the 
somatosensory system, neurons conveying information about touch map onto 
the cerebral cortex so as to mark out a "homunculus" — a small, distorted, two- 
dimensional image of the body surface (Figure 22-105). 

The retinotopic map of visual space in the optic tectum is the best charac- 
terized of all these maps. How does it arise? In principle, the growth cones could 
be physically channeled to different destinations as a consequence of their dif- 
ferent starting positions, like drivers on a multilane highway where it is forbid- 
den to change lanes. This possibility was tested in the visual system by a famous 
experiment in the 1940s. If the optic nerve of a frog is cut, it will regenerate. The 
retinal axons grow back to the optic tectum, restoring normal vision. If, in addi- 
tion, the eye is rotated in its socket at the time of cutting of the nerve, so as to put 
originally ventral retinal cells in the position of dorsal retinal cells, vision is still 
restored, but with an awkward flaw: the animal behaves as though it sees the 
world upside down and left-right inverted. This is because the misplaced retinal 
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Figure 22-1 04 The neural map from eye to brain in a young zebrafish. (A) Diagrammatic view, looking down on the top of the head. 
(B) Fluorescence micrograph. Fluorescent tracer dyes have been injected into each eye — red into the anterior part, green into the posterior part. 
The tracer molecules have been taken up by the neurons in the retina and carried along their axons, revealing the paths they take to the optic 
tectum in the brain and the map that they form there. (Courtesy of Chi-Bin Chien, from D.H. Sanes, T.A. Reh and W.A. Harris, Development of the 
Nervous System. San Diego, CA: Academic Press, 2000.) 
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Figure 22-105 A map of the body 
surface in the human brain. The surface 
of the body is mapped onto the 
somatosensory region of the cerebral 
cortex by an orderly system of nerve cell 
connections, such that sensory 
information from neighboring body sites 
is delivered to neighboring sites in the 
brain. This means that the map in the 
brain is largely faithful to the topology of 
the body surface, even though different 
body regions are represented at different 
magnifications according to their density 
of innervation. The homunculus (the 
"little man" in the brain) has big lips, for 
example, because the lips are a 
particularly large and important source of 
sensory information. The map was 
determined by stimulating different 
points in the cortex of conscious patients 
during brain surgery and recording what 
they said they felt. (After W. Penfield and 
T. Rasmussen,The Cerebral Cortex of 
Man. New York: Macmillan, 1950.) 



cells make the connections appropriate to their original, not their actual, posi- 
tions. It seems that the cells have positional values — position-specific biochem- 
ical properties representing records of their original location. As a result, cells on 
opposite sides of the retina are intrinsically different, just as the motor neurons 
in the spinal cord that project to different muscles are intrinsically different. 

Such nonequivalence among neurons is referred to as neuronal specificity. It 
is this intrinsic characteristic that guides the retinal axons to their appropriate 
target sites in the tectum. Those target sites themselves are distinguishable by 
the retinal axons because the tectal cells also carry positional labels. Thus, the 
neuronal map depends on a correspondence between two systems of positional 
markers, one in the retina and the other in the tectum. 



Axons From Different Regions of the Retina Respond Differently 
to a Gradient of Repulsive Molecules in the Tectum 

Axons from the nasal retina (the side closest to the nose) project to the posterior 
tectum, and axons from the temporal retina (the side farthest from the nose) 
project to the anterior tectum, with intermediate regions of retina projecting to 
intermediate regions of tectum. When nasal and temporal axons are allowed to 
grow out over a carpet of anterior or posterior tectal membranes in a culture 
dish, they also show selectivity (Figure 22-106). Temporal axons strongly prefer 
the anterior tectal membranes, as in vivo, whereas nasal axons either prefer pos- 
terior tectal membranes, or show no preference (depending on the species of 
animal). The key difference between anterior and posterior tectum appears to 
be a repulsive factor on the posterior tectum, to which temporal retinal axons 
are sensitive but nasal retinal axons are not: if a temporal retinal growth cone 
touches posterior tectal membrane, it collapses its filopodia and withdraws. 

Assays based on these phenomena in vitro have identified some of the 
molecules responsible. The repulsive factor on posterior tectal membrane 
seems to be partly or entirely comprised of ephrinA proteins, a subset of the 
family of GPI-linked proteins that act as ligands for the EphA family of tyrosine 
kinase receptors. In the mouse, two different ephrins are expressed to form an 
anterior-to-posterior gradient on the tectal cells. Anterior cells have little or no 
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ephrin, cells in the center of the tectum express ephrin A2, and cells at the pos- 
terior edge of the tectum express ephrin A2 and ephrin A5. Thus, there is a gra- 
dient of ephrin expression across the tectum. Meanwhile, the incoming axons 
express Eph receptors, also in a gradient: temporal axons express high Eph lev- 
els, making them sensitive to repulsion by ephrinA, whereas nasal axons express 
low Eph levels. In a similar way, distributed across the other main axis of the tec- 
tum, from medial to lateral, there is graded expression of ephrinB protein and 
also of another type of signal molecule, Wnt3, with correspondingly graded 
expression of EphB receptors and Wnt receptors along the dorso-ventral axis of 
the retina. 

This system of signals and receptors is enough to produce an orderly two- 
dimensional map, if we make one further assumption — an assumption sup- 
ported by experiments in vivo: that the retinal axons somehow interact with one 
another and compete for tectal territory. Thus, temporal axons are restricted to 
anterior tectum, and drive nasal axons off it; nasal axons, consequently, are 
restricted to posterior tectum. Between the extremes, a balance is struck, creat- 
ing a smooth map of the temporo-nasal axis of the retina onto the anteroposte- 
rior axis of the tectum. 



Figure 22-106 Selectivity of retinal 
axons growing over tectal membranes. 

(A) A photograph of the experimental 
observation. (B) A diagram of what is 
happening. The culture substratum has 
been coated with alternating stripes of 
membrane prepared either from 
posterior tectum (P) or from anterior 
tectum (A). In the photograph, the 
anterior tectal stripes are made visible by 
staining them with a fluorescent marker 
in the vertical strips at the sides of the 
picture. Axons of neurons from the 
temporal half of the retina (growing in 
from the left) follow the stripes of 
anterior tectal membrane but avoid the 
posterior tectal membrane, while axons 
of neurons from the nasal half of the 
retina (growing in from the right) do the 
converse. Thus anterior tectum differs 
from posterior tectum and nasal retina 
from temporal retina, and the differences 
guide selective axon outgrowth. These 
experiments were performed with cells 
from the chick embryo. (From Y. von 
Boxberg, S. Deiss and U. Schwarz, Neuron 
10:345-357, 1993. With permission 
from Elsevier.) 



Diffuse Patterns of Synaptic Connections Are Sharpened by 
Activity-Dependent Remodeling 

In a normal animal the retinotectal map is initially fuzzy and imprecise: the sys- 
tem of matching markers we have just described is enough to define the broad 
layout of the map, but not sufficient to specify its fine details. Studies in frogs 
and fish show that each retinal axon at first branches widely in the tectum and 
makes a profusion of synapses, distributed over a large area of tectum that over- 
laps with the territories innervated by other axons. These territories are subse- 
quently trimmed back by selective elimination of synapses and retraction of 
axon branches. This is accompanied by the formation of new sprouts, through 
which each axon develops a denser distribution of synapses in the territory that 
it retains. 

A central part in this remodeling and refinement of the map is played by two 
competition rules that jointly help to create spatial order: (1) axons from sepa- 
rate regions of retina, which tend to be excited at different times, compete to 
dominate the available tectal territory, but (2) axons from neighboring sites in 
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the retina, which tend to be excited at the same time, innervate neighboring ter- 
ritories in the tectum because they collaborate to retain and strengthen their 
synapses on shared tectal cells (Figure 22-107). The mechanism underlying 
both these rules depends on electrical activity and signaling at the synapses that 
are formed. If all action potentials are blocked by a toxin that binds to voltage- 
gated Na + channels, synapse remodeling is inhibited and the map remains fuzzy. 

The phenomenon of activity- dependent synapse elimination is encoun- 
tered in almost every part of the developing vertebrate nervous system. 
Synapses are first formed in abundance and distributed over a broad target field; 
then the system of connections is pruned back and remodeled by competitive 
processes that depend on electrical activity and synaptic signaling. The elimina- 
tion of synapses in this way is distinct from the elimination of surplus neurons 
by cell death, and it occurs after the period of normal neuronal death is over. 

Much of what we know about the cellular mechanisms of synapse formation 
and elimination comes from experiments on the innervation of skeletal muscle 
in vertebrate embryos. A two-way exchange of signals between the nerve axon 
terminals and the muscle cells controls the initial formation of synapses. At sites 
of contact, acetylcholine receptors are clustered in the muscle cell membrane 
and the apparatus for secretion of this neurotransmitter becomes organized in 
the axon terminals (discussed in Chapter 11). Each muscle cell at first receives 
synapses from several neurons; but in the end, through a process that typically 
takes a couple of weeks, it is left innervated by only one. The synapse retraction 
again depends on synaptic communication: if synaptic transmission is blocked 
by a toxin that binds to the acetylcholine receptors in the muscle cell membrane, 
the muscle cell retains its multiple innervation beyond the normal time of elim- 
ination. 

Experiments on the musculoskeletal system, as well as in the retinotectal 
system, suggest that it is not only the amount of electrical activity at a synapse 
that is important for its maintenance, but also its temporal coordination. 
Whether a synapse is strengthened or weakened seems to depend critically on 
whether or not activity in the presynaptic cell is synchronized with activity of the 
other presynaptic cells synapsing on the same target (and thus also synchro- 
nized with activity of the target cell itself). 

These and many other findings have suggested a simple interpretation of the 
competition rules for synapse elimination in the retinotectal system (Figure 
22-108) . Axons from different parts of the retina fire at different times and so com- 
pete. Each time one of them fires, the synapse(s) made by the other on a shared 
tectal target cell are weakened, until one of the axons is left in sole command of 



Figure 22-107 Sharpening of the 
retinotectal map by synapse 
elimination. At first the map is fuzzy 
because each retinal axon branches 
widely to innervate a broad region of 
tectum overlapping the regions 
innervated by other retinal axons. The 
map is then refined by synapse 
elimination. Where axons from separate 
parts of the retina synapse on the same 
tectal cell, competition occurs, 
eliminating the connections made by one 
of the axons. But axons from cells that are 
close neighbors in the retina cooperate, 
maintaining their synapses on shared 
tectal cells. Thus each retinal axon ends 
up innervating a small tectal territory, 
adjacent to and partly overlapping the 
territory innervated by axons from 
neighboring sites in the retina. 
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Figure 22-108 Synapse modification 
and its dependence on electrical 
activity. Experiments in several systems 
indicate that synapses are strengthened 
or weakened by electrical activity 
according to the rule shown in the 
diagram. The underlying principle 
appears to be that each excitation of a 
target cell tends to weaken any synapse 
where the presynaptic axon terminal has 
just been quiet but to strengthen any 
synapse where the presynaptic axon 
terminal has just been active. As a result, 
"neurons that fire together, wire 
together." A synapse that is repeatedly 
weakened and rarely strengthened is 
eventually eliminated altogether. 



that cell. Axons from neighboring retinal cells, on the other hand, tend to fire in 
synchrony with one another: they therefore do not compete but instead main- 
tain synapses on shared tectal cells, creating a precisely ordered map in which 
neighboring cells of the retina project to neighboring sites in the tectum. 



Experience Molds the Pattern of Synaptic Connections in the 
Brain 

The phenomenon that we have just described is summed up in the catch-phrase 
"neurons that fire together, wire together". The same firing rule relating synapse 
maintenance to neural activity helps to organize our developing brains in the 
light of experience. 

In the brain of a mammal, axons relaying inputs from the two eyes are 
brought together in a specific cell layer in the visual region of the cerebral cor- 
tex. Here, they form two overlapping maps of the external visual field, one as 
perceived through the right eye, the other as perceived through the left. 
Although there is some evidence of a tendency for right- and left- eye inputs to 
be segregated even before synaptic communication begins, a large proportion of 
the axons carrying information from the two eyes at early stages synapse 
together on shared cortical target cells. A period of early signaling activity, how- 
ever, occurring spontaneously and independently in each retina even before 
vision begins, leads to a clean segregation of inputs, creating stripes of cells in 
the cortex that are driven by inputs from the right eye alternating with stripes 
that are driven by inputs from the left eye (Figure 22-109). The firing rule sug- 
gests a simple interpretation: a pair of axons bringing information from neigh- 
boring sites in the left eye will frequently fire together, and therefore wire 




2 mm 



Figure 22-109 Ocular dominance 
columns in the visual cortex of a 
monkey's brain, and their sensitivity to 
visual experience. (A) Normally, stripes of 
cortical cells driven by the right eye 
alternate with stripes, of equal width, 
driven by the left eye. The stripes are 
revealed here by injecting a radioactive 
tracer molecule into one eye, allowing 
time for this tracer to be transported to 
the visual cortex, and detecting 
radioactivity there by autoradiography, in 
sections cut parallel to the cortical 
surface. (B) If one eye is kept covered 
during the critical period of 
development, and thus deprived of visual 
experience, its stripes shrink and those of 
the active eye expand. In this way, the 
deprived eye may lose the power of 
vision almost entirely. (From D.H. Hubel, 
T.N. Wiesel and S. Le Vay, Philos. Trans. R. 
Soc. Lond. B. Biol. Sci. 278:377-409, 1977. 
With permission from The Royal Society.) 
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together, as will a pair of axons from neighboring sites in the right eye; but a 
right- eye axon and a left- eye axon will rarely fire together, and will instead com- 
pete. Indeed, if activity from both eyes is silenced using drugs that block action 
potentials or synaptic transmission, the inputs fail to segregate correctly 

Maintenance of the pattern of connections is extraordinarily sensitive to 
experience early in life. If, during a certain critical period (ending at about the 
age of 5 years in humans), one eye is kept covered for a time so as to deprive it 
of visual stimulation, while the other eye is allowed normal stimulation, the 
deprived eye loses its synaptic connections to the cortex and becomes almost 
entirely, and irreversibly, blind. In accordance with what the firing rule would 
predict, a competition has occurred in which synapses in the visual cortex made 
by inactive axons are eliminated while synapses made by active axons are con- 
solidated. In this way cortical territory is allocated to axons that carry informa- 
tion and is not wasted on those that are silent. 

In establishing the nerve connections that enable us to see, it is not only the 
quantity of visual stimulation that is important, but also its temporal coordina- 
tion. For example, the ability to see depth — stereo vision — depends on cells in 
other layers of the visual cortex that receive inputs relayed from both eyes at 
once, conveying information about the same part of the visual field as seen from 
two slightly different angles. These binocularly driven cells allow us to compare 
the view through the right eye with that through the left so as to derive informa- 
tion about the relative distances of objects from us. If, however, the two eyes are 
prevented during the critical period from ever seeing the same scene at the same 
time — for example, by covering first one eye and then the other on alternate 
days, or simply as a consequence of a childhood squint — almost no binocularly 
driven cells are retained in the cortex, and the capacity for stereo perception is 
irretrievably lost. Evidently, in accordance with the firing rule, the inputs from 
each eye to a binocularly driven neuron are maintained only if the two inputs are 
frequently triggered to fire in synchrony, as occurs when the two eyes look 
together at the same scene. 



Adult Memory and Developmental Synapse Remodeling May 
Depend on Similar Mechanisms 

We saw in Chapter 1 1 that synaptic changes underlying memory in at least some 
parts of the adult brain, notably the hippocampus, hinge on the behavior of a 
particular type of receptor for the neurotransmitter glutamate — the NMDA 
receptor. Ca 2+ flooding into the postsynaptic cell through the channels opened 
by this receptor triggers lasting changes in the strengths of the synapses on that 
cell, affecting the presynaptic as well as the postsynaptic structures. The changes 
that are induced by the NMDA- dependent mechanism in the adult brain obey 
rules closely akin to the developmental firing rule: events in the external world 
that cause two neurons to be active at the same time, or in quick succession, 
favor the making or strengthening of synapses between them. This condition, 
called the Hebb rule, has been suggested to be the fundamental principle under- 
lying associative learning. 

Is it possible, then, that adult learning and the more drastic forms of synap- 
tic plasticity seen during development both depend on the same basic machin- 
ery of synapse adjustment? There are many hints that it may be so. For example, 
inhibitors that specifically block activation of the NMDA receptor interfere with 
the refinement and remodeling of synaptic connections in the developing visual 
system. Both in the developing animal and in the adult, the alterations in the 
strength of the synaptic connections correspond to changes in physical struc- 
ture. The scale of these physical changes is, however, very different. In the devel- 
oping organism, electrical activity often regulates the extension and regression 
of large branches of the axonal and dendritic trees. But in the adult brain, the 
structural adjustments occurring in response to activity seem typically to be 
much more finely localized, affecting the sizes of individual dendritic spines — 
the tiny knob-shaped protrusions, no more than a few micrometers long, on 
which dendrites receive individual synapses (Figure 22-110). It seems that Ca 2+ 
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entering a spine through NMDA channels in response to excitation of the 
synapse on that particular spine can cause just that spine to remodel its actin 
cytoskeleton. But we still have a lot to learn about the mechanism of such 
changes and their relationship to learning and memory. The molecular basis of 
the processes of synapse remodeling through which experience molds our 
brains remains one of the central challenges that the nervous system presents to 
cell biology. 
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Figure 22-1 1 Growth of dendritic 
spines in response to synaptic 
stimulation. (A) Neurons in a slice of 
living tissue from the hippocampus of a 
young mouse. The cells are labeled by 
expression of Green Fluorescent Protein 
and observed with a two-photon laser 
scanning microscope, which allows 
individual dendrites to be seen at high 
resolution. The insert shows a processed 
image of a small part of some of the 
dendrites. These are covered with tiny 
dendritic spines, which are the sites of 
synapses. (B) Repeated intense bursts of 
synaptic stimulation, triggered by a 
nearby microelectrode, cause new spines 
to form within 30 minutes. Low- 
frequency stimulation has an opposite 
effect, causing a subset of spines to 
regress. (From U.V. Nagerl, N. Eberhorn, 
S.B. Cambridge and T Bonhoeffer, Neuron 
44:759-767, 2004. With permission 
from Elsevier.) 



Summary 

The development of the nervous system proceeds in three phases: first, nerve cells are 
generated through cell division; then, having ceased dividing, they send out axons and 
dendrites to form profuse synapses with other, remote cells so that communication can 
begin; last, the system of synaptic connections is refined and remodeled according to 
the pattern of electrical activity in the neural network. 

The neurons, and the glial cells that always accompany them, are generated from 
ectodermal precursors, and those born at different times and places express different 
sets of genes, which help to determine the connections they will form. Axons and den- 
drites grow out from the neurons by means of growth cones, which follow specific 
pathways delineated by signals along the way. Structures such as the floor plate of the 
embryonic spinal cord secrete both chemoattractants and chemorepellents, to which 
growth cones from different classes of neurons respond differently. On reaching their 
target area, the axons terminate selectively on a subset of the accessible cells, and in 
many parts of the nervous system neural maps are set up — orderly projections of one 
array of neurons onto another. In the retinotectal system, the map is based on the 
matching of complementary systems of position- specific cell- surface markers — 
ephrins and Eph receptors — possessed by the two sets of cells. 

After the growth cones have reached their targets and initial connections have 
formed, two major sorts of adjustment occur. First, many of the innervating neurons 
die as a result of a competition for survival factors such as NGF (nerve growth factor) 
secreted by the target tissue. This cell death adjusts the quantity of innervation accord- 
ing to the size of the target. Second, individual synapses are pruned away in some 
places and reinforced in others, so as to create a more precisely ordered pattern of con- 
nections. This latter process depends on electrical activity: synapses that are frequently 
active are reinforced, and different neurons contacting the same target cell tend to 
maintain their synapses on the shared target only if they are both frequently active at 
the same time. In this way the structure of the brain can be adjusted to reflect the con- 
nections between events in the external world. The underlying molecular mechanism 
of this synaptic plasticity may be similar to that responsible for the formation of mem- 
ories in adult life. 
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PLANT DEVELOPMENT 

Plants and animals are separated by about 1.5 billion years of evolutionary his- 
tory They have evolved their multicellular organization independently but 
using the same initial tool kit — the set of genes inherited from their common 
unicellular eucaryotic ancestor. Most of the contrasts in their developmental 
strategies spring from two basic peculiarities of plants. First, they get their 
energy from sunlight, not by ingesting other organisms. This dictates a body 
plan different from that of animals. Second, their cells are encased in semirigid 
cell walls and cemented together, preventing them from moving as animal cells 
do. This dictates a different set of mechanisms for shaping the body and differ- 
ent developmental processes to cope with a changeable environment. 

Animal development is largely buffered against environmental changes, and 
the embryo generates the same genetically determined body structure unaf- 
fected by external conditions. The development of most plants, by contrast, is 
dramatically influenced by the environment. Because they cannot match them- 
selves to their environment by moving from place to place, plants adapt instead 
by altering the course of their development. Their strategy is opportunistic. A 
given type of organ — a leaf, a flower, or a root, say — can be produced from the 
fertilized egg by many different paths according to environmental cues. A bego- 
nia leaf pegged to the ground may sprout a root; the root may throw up a shoot; 
the shoot, given sunlight, may grow leaves and flowers. 

The mature plant is typically made of many copies of a small set of stan- 
dardized modules, as described in Figure 22-111. The positions and times at 
which those modules are generated are strongly influenced by the environment, 
causing the overall structure of the plant to vary. The choices between alterna- 
tive modules and their organization into a whole plant depend on external cues 
and long-range hormonal signals that play a much smaller part in the control of 
animal development. 

But although the global structure of a plant — its pattern of roots or 
branches, its numbers of leaves or flowers — can be highly variable, its detailed 
organization on a small scale is not. A leaf, a flower, or indeed an early plant 
embryo, is as precisely specified as any organ of an animal, possessing a deter- 
minate structure, in contrast with the indeterminate pattern of branching and 
sprouting of the plant as a whole. The internal organization of a plant module 
raises essentially the same problems in the genetic control of pattern formation 
as does animal development, and they are solved in analogous ways. In this sec- 
tion we focus on the cellular mechanisms of development in flowering plants. 
We examine both the contrasts and the similarities with animals. 



Arabidopsis Serves as a Model Organism for Plant Molecular 
Genetics 

Flowering plants, despite their amazing variety, are of relatively recent origin. 
The earliest known fossil examples are 130 million years old, as against 350 mil- 
lion years or more for vertebrate animals. Underlying the diversity of form, 
therefore, there is a high degree of similarity in molecular mechanisms. As we 
shall see, a small genetic change can transform a plant's large-scale structure; 
and just as plant physiology allows survival in many different environments, so 
also it allows survival of many differently structured forms. A mutation that gives 
an animal two heads is generally lethal; one that doubles the number of flowers 
or branches on a plant is generally not. 

To identify the genes that govern plant development and to discover how 
they function, plant biologists have selected a small weed, the common wall 
cress Arabidopsis thaliana (Figure 22-112) as their primary model organism. 

Figure 22-1 12 Arabidopsis thaliana. This small plant is a member of the mustard 
(or crucifer) family (see also Figure 1-46). It is a weed of no economic use but of 
great value for genetic studies of plant development. (From M.A. Estelle and 
C.R. Somerville, Trends Genet. 12:89-93, 1986. With permission from Elsevier.) 
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Figure 22-1 1 1 A simple example of the 
modular construction of plants. Each 
module (shown in different shades of 
green) consists of a stem, a leaf, and a 
bud containing a potential growth center, 
or meristem. The bud forms at the branch 
point, or node, where the leaf diverges 
from the stem. Modules arise sequentially 
from the continuing activity of the apical 
meristem. 
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Like Drosophila or Caenorhabditis elegans, it is small, quick to reproduce, and 
convenient for genetics. It can be grown indoors in Petri dishes or tiny plant pots 
in large numbers and produces hundreds of seeds per plant after 8-10 weeks. It 
has, in common with C. elegans, a significant advantage over Drosophila or ver- 
tebrate animals for genetics: like many flowering plants, it can reproduce as a 
hermaphrodite because a single flower produces both eggs and the male 
gametes that can fertilize them. Therefore, when a flower that is heterozygous 
for a recessive lethal mutation is self-fertilized, one-fourth of its seeds will dis- 
play the homozygous embryonic phenotype. This makes it easy to perform 
genetic screens (Figure 22-113) and so to obtain a catalog of the genes required 
for specific developmental processes. 



The Arabidopsis Genome Is Rich in Developmental Control Genes 

Arabidopsis has one of the smallest plant genomes — 125 million nucleotide 
pairs, on a par with C. elegans and Drosophila — and the complete DNA sequence 
is now known. It contains approximately 26,000 genes. This total includes many 
recently generated duplicates, however, so that the number of functionally dis- 
tinct types of protein represented may be considerably less. Cell culture and 
genetic transformation methods have been established, as well as vast libraries 
of seeds carrying mutations produced by random insertions of mobile genetic 
elements, so that plants with mutations in any chosen gene can be obtained to 
order. Powerful tools are thus available to analyze gene functions. Although only 
a small fraction of the total gene set has been characterized experimentally as 
yet, functions can be tentatively assigned to many genes — about 18,000 — on the 
basis of their sequence similarities to well- characterized genes in Arabidopsis 
and other organisms. 

Even more than the genomes of multicellular animals, the Arabidopsis 
genome is rich in genes that code for gene regulatory proteins (Table 22-2). 
Some major families of animal gene regulatory proteins (such as the Myb family 
of DNA-binding proteins) are greatly expanded, while others (such as nuclear 
hormone receptors) seem to be entirely absent, and there are large families of 
gene regulatory proteins in the plant that have no animal homologs. 

Where homologous gene regulatory proteins (such as homeodomain pro- 
teins) can be recognized in both plants and animals, they have little in common 
with regard to the genes they regulate or the types of developmental decisions 
that they control, and there is very little conservation of protein sequence out- 
side the DNA-binding domains. 

Arabidopsis is like multicellular animals in possessing many genes for cell 
communication and signal transduction (1900 genes out of 18,000 classified), 



Figure 22-1 1 3 Production of mutants in 
Arabidopsis. A seed, containing a 
multicellular embryo, is treated with a 
chemical mutagen and left to grow into a 
plant. In general, this plant will be a 
mosaic of clones of cells carrying 
different induced mutations. An 
individual flower produced by this plant 
will usually be composed of cells 
belonging to the same clone, all carrying 
the same mutation, m, in heterozygous 
form (m/+). Self-fertilization of individual 
flowers by their own pollen results in 
seed pods, each of which contains a 
family of embryos of whose members 
half, on average, will be heterozygous 
(m/+), one-quarter will be homozygous 
mutant (m/m), and one-quarter will be 
homozygous wild-type (+/+). Often, the 
mutation will have a recessive lethal 
effect, as indicated here by the lack of a 
root in the m/m seedling. The mutant 
stock is then maintained by breeding 
from the heterozygote: it will produce 
seed pods (F2 generation) that all contain 
a mixture of +/+, m/+, and m/m seeds. 
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Table 22-2 Some Major Families of Gene Regulatory Proteins in Arabidopsis, Drosophila, C elegans, and the Yeast 
Saccharomyces cerevisiae 







NUMBER OF FAMILY MEMBERS PREDICTED FROM GENOI 


VIE ANALYSIS 






Arabidopsis 


Drosophila 


C. elegans 


YEAST 


Myb 




190 


6 


3 


10 


AP2/EREBP (Apetala2/ethylene- 


responsive- 


144 











element binding protein) 












bHLH (basic helix-loop-helix) 




139 


46 


25 


8 


NAC 




109 











C2H2 (Zn finger) 




105 


291 


139 


53 


Homeobox 




89 


103 


84 


9 


MADS box 




82 


2 


2 


4 


bZIP 




81 


21 


25 


21 


WRKY (Zn finger) 




72 











GARP 




56 











C2C2 (Zn finger)/GATA 




104 


6 
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10 


Nuclear hormone receptor 







21 


25 





C6 (Zn finger) 













52 


Estimated total (including many not listed 


1533 


635 


669 


209 


above) 












% of genes in genome 




5.9 


4.5 


3.5 


3.5 



The Table lists only those families that have at least 50 members in at least one organism. (Data from J.L. Riechmann et al., Science 290:21 05-21 1 0, 
2000. With permission from AAAS.) 

but the specific details of these gene sets are very different, as discussed in 
Chapter 15. The Wnt, Hedgehog, Notch, and TGF(3 signaling mechanisms are all 
absent in Arabidopsis. In compensation, other signaling pathways peculiar to 
plants are highly developed. Cell-surface receptors of the tyrosine kinase class 
seem to be entirely absent, although many of the signaling components down- 
stream of these receptors in animals are present. Conversely, receptors of the 
serine /threonine kinase class are very plentiful, but they do not act through the 
same system of intracellular messengers as the receptor serine /threonine 
kinases in animals. Substantial sets of genes are devoted to developmental pro- 
cesses of special importance in plants: more than 1000 for synthesis and remod- 
eling of the plant cell wall, for example, and more than 100 for detecting and 
responding to light. 

We must now examine how the genes of the plant are used to control plant 
development. 



Embryonic Development Starts by Establishing a Root-Shoot 
Axis and Then Halts Inside the Seed 

The basic strategy of sexual reproduction in flowering plants is briefly summa- 
rized in Panel 22-1. The fertilized egg, or zygote, of a higher plant begins by 
dividing asymmetrically to establish the polarity of the future embryo. One 
product of this division is a small cell with dense cytoplasm, which will become 
the embryo proper. The other is a large vacuolated cell that divides further and 
forms a structure called the suspensor, which in some ways is comparable to the 
umbilical cord in mammals. The suspensor attaches the embryo to the adjacent 
nutritive tissue and provides a pathway for the transport of nutrients. 

During the next step in development the diploid embryo cell proliferates to 
form a ball of cells that quickly acquires a polarized structure. This comprises 
two key groups of proliferating cells — one at the suspensor end of the embryo 
that will collaborate with the uppermost suspensor cell to generate a root, and 
one at the opposite end that will generate a shoot (Figure 22-114). The main 
root-shoot axis established in this way is analogous to the head-to-tail axis of 
an animal. At the same time it begins to be possible to distinguish the future 
epidermal cells, forming the outermost layer of the embryo, the future ground 



PANEL 22-1: Features of Early Development in Flowering Plants 






THE FLOWER 

Flowers, which contain the reproductive cells of higher 
plants, arise from vegetative shoot apical meristems, 
where they terminate further vegetative growth. 
Environmental factors, often the rhythms of day 
length and temperature, trigger the switch from 
vegetative to floral development. The germ cells 
thus arise late in plant development from 
somatic cells rather than from a germ-cell line, 
as in animals. 



sepal 



petal 



stamen 



carpel 




Petal: distinctive leaflike structures, usually brightly colored, 
facilitate pollination via, for example, attracted insects. 



Stamen: an organ containing cells that 
undergo meiosis and form haploid pollen 
grains, each of which contains 
two male sperm cells. Pollen 
transferred to a stigma 
germinates, and the pollen tube 
delivers the two nonmotile sperm 
to the ovary. 



Flower structure is both varied and species-specific but 
generally comprises four concentrically arranged sets of 
structures that may each be regarded as modified leaves. 




pollen 

tube 

nucleus 



Carpel: an organ containing one or 
more ovaries, each of which contains 
ovules. Each ovule houses cells that undergo 
meiosis and form an embryo sac containing the female 
egg cell. At fertilization, one sperm cell fuses with the 
egg cell and will form the future diploid embryo, while 
the other fuses with two cells in the embryo sac to 
form the triploid endosperm tissue. 

Sepals: leaflike structures that form a protective 
covering during early flower development. 



THE SEED 

A seed contains a dormant 
embryo, a food store, and 
a seed coat. By the end of 
its development a seed's 
water content can drop 
from 90% to 5%. The seed 
is usually protected in a 
fruit whose tissues are of 
maternal origin. 



embryo 



seed coat 




seed 
leaves 
(food store) 



THE EMBRYO 



fertilized egg 




The fertilized egg within the ovule will grow to 
form an embryo using nutrients transported from 
the endosperm by the suspensor. A complex series 
of cell divisions, illustrated here for the common weed 
called shepherd's purse, produces an embryo with a root 
apical meristem, a shoot apical meristem, and either one 
(monocots) or two (dicots) seed leaves, called cotyledons. 

Development is arrested at this stage, and the ovule, 
containing the embryo, now becomes a seed, 
adapted for dispersal and survival. 




two seed 

leaves 

(cotyledons) 



For the embryo to resume its growth the seed must 
germinate, a process dependent upon both internal 
factors (dormancy) and environmental factors including 
water, temperature, and oxygen. The food reserves for 
the early phase of germination may either be the 
endosperm (maize) or the cotyledons (pea and bean). 

The primary root usually emerges first from the seed 
to ensure an early water supply for the seedling. The 
cotyledon(s) may appear above the ground, as in the 
garden bean shown here, or they may remain in the 
soil, as in peas. In both cases the cotyledons eventually 
wither away. 

The apical meristem can now show its capacity for 
continuous growth, producing a typical pattern of 
nodes, internodes, and buds (see Figure 22-106). 
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Figure 22-1 1 4 Two stages of 
embryogenesis in Arabidopsis thaliana. 

(From G. Jurgens et al., Development 
[SuppL] 1:27-38, 1991. With permission 
from The Company of Biologists.) 
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tissue cells, occupying most of the interior, and the future vascular tissue cells, 
forming the central core (Panel 22-2). These three sets of cells can be compared 
to the three germ layers of an animal embryo. Slightly later in development, the 
rudiment of the shoot begins to produce the embryonic seed leaves, or cotyle- 
dons — one in monocots and two in dicots. Soon after this stage, development 
usually halts and the embryo becomes packaged in a seed (a case formed by tis- 
sues of the mother plant), specialized for dispersal and for survival in harsh con- 
ditions. The embryo in a seed is stabilized by dehydration, and it can remain 
dormant for a very long time — even hundreds of years. When rehydrated, the 
seeds germinate and embryonic development resumes. 

Genetic screens can be used in Arabidopsis, just as in Drosophila or C. ele- 
gans, to identify the genes that govern the organization of the embryo and to 
group these into categories according to their homozygous mutant phenotypes. 
Some are required for formation of the seedling root, some for the seedling stem, 
and some for the seedling apex with its cotyledons. Another class is required for 
formation of the three major tissue types — epidermis, ground tissue, and vascu- 
lar tissue — and yet another class for the organized changes of cell shape that 
give the embryo and seedling their elongated form (Figure 22-115). 





1 mm 



Figure 22-1 15 Mutant Arabidopsis seedlings. A normal seedling (A) compared with four 
types of mutant (B-E) defective in different parts of their apico-basal pattern: (B) has structures 
missing at its apex, (C) has an apex and a root but lacks a stem between them, (D) lacks a root, 
and (E) forms stem tissues but is defective at both ends. The seedlings have been "cleared" so 
as to show the vascular tissue inside them (pale strands). (From U. Mayer et al., Nature 
353:402-407, 1991. With permission from Macmillan Publishers Ltd.) 
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The Parts of a Plant Are Generated Sequentially by Meristems 

Roughly speaking, the embryo of an insect or a vertebrate animal is a rudimen- 
tary miniature scale model of the later organism, and the details of body struc- 
ture are filled in progressively as it enlarges. The plant embryo grows into an 
adult in a quite different way: the parts of the adult plant are created sequentially 
by groups of cells that proliferate to lay down additional structures at the plant's 
periphery These all-important groups of cells are called apical meristems (see 
Figure 22-111). Each meristem consists of a self- renewing population of stem 
cells. As these divide, they leave behind a trail of progeny that become displaced 
from the meristem region, enlarge, and finally differentiate. Although the shoot 
and root apical meristems generate all the basic varieties of cells that are needed 
to build leaves, roots, and stems, many cells outside the apical meristems also 
keep a capacity for further proliferation and retain meristem potential. In this 
way trees and other perennial plants, for example, are able to increase the girth 
of their stems and roots as the years go by and can sprout new shoots from dor- 
mant regions if the plant is damaged. 

The rudiments of the apical meristems of root and shoot are already deter- 
mined in the embryo. As soon as the seed coat ruptures during germination, a 
dramatic enlargement of nonmeristematic cells occurs, driving the emergence 
first of a root, to establish an immediate foothold in the soil, and then of a shoot 
(Figure 22-116). This is accompanied by rapid and continual cell divisions in 
the apical meristems: in the apical meristem of a maize root, for example, cells 
divide every 12 hours, producing 5 x 10 5 cells per day. The rapidly growing root 
and shoot probe the environment — the root increasing the plant's capacity for 
taking up water and minerals from the soil, the shoot increasing its capacity for 
photosynthesis (see Panel 22-1). 



Development of the Seedling Depends on Environmental Signals 

From germination onward, the course of plant development is powerfully influ- 
enced by signals from the environment. The shoot has to push its way rapidly up 
through the soil, and must open its cotyledons and begin photosynthesis only 
after it has reached the light. The timing of this transition from rapid subterranean 
sprouting to illuminated growth cannot be genetically programmed, because the 
depth at which the seed is buried is unpredictable. The developmental switch is 
controlled instead by light, which, among other effects, acts on the seedling by 
inhibiting production of a class of plant growth regulators called brassinosteroids, 
discussed in Chapter 15. Mutations in genes required for production or reception 
of the brassinosteroid signal cause the stem of the seedling to go green, slow its 
elongation, and open its cotyledons prematurely, while it is still in the dark. 
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Figure 22-1 16 A seedling of 
Arabidopsis. The brown objects to the 
right of the young seedling are the two 
halves of the discarded seed coat. 
(Courtesy of Catherine Duckett.) 



Long-Range Hormonal Signals Coordinate Developmental Events 
in Separate Parts of the Plant 

Separate parts of a plant experience different environments and react to them 
individually by changes in their mode of development. The plant, however, must 
continue to function as a whole. This demands that developmental choices and 
events in one part of the plant affect developmental choices elsewhere. There 
must be long-range signals to bring about such coordination. 

As gardeners know, for example, by pinching off the tip of a branch one can 
stimulate side growth: removal of the apical meristem relieves the quiescent axil- 
lary meristems of an inhibition and allows them to form new shoots. In this case 
the long-range signal from the apical meristem, or at least a key component has 
been identified. It is an auxin, a member of one of several classes of plant growth 
regulators (sometimes called plant hormones), all of which have powerful influ- 
ences on plant development. Other known classes include the gibberellins, the 
cytokinins, abscisic acid, the gas ethylene, and the brassinosteroids. As shown in 
Figure 22-117, all are small molecules that readily penetrate cell walls. They are 
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THE THREE TISSUE SYSTEMS 



THE PLANT 



Cell division, growth, and differentiation give rise to 
tissue systems with specialized functions. 

DERMAL TISSUE (^m)\ This is the plant's protective 
outer covering in contact with the environment. It 
facilitates water and ion uptake in roots and regulates | 
gas exchange in leaves and stems. 

VASCULAR TISSUE: Together the phloem (IZZI) and 
the xylem (^H) form a continuous vascular system 
throughout the plant. This tissue conducts water and 
solutes between organs and also provides 
mechanical support. 

GROUND TISSUE (1=1): This packing and supportive 
tissue accounts for much of the bulk of the young 
plant. It also functions in food manufacture and 
storage. 



The young flowering plant shown on the right is constructed 
from three main types of organs: leaves, stems, and roots. 
Each plant organ in turn is made from three tissue systems: 
ground (I I), dermal (^M), and vascular (M^). 

All three tissue systems derive ultimately from the cell 
proliferative activity of the shoot or root apical meristems, 
and each contains a relatively small number of specialized 
cell types. These three common tissue systems, and the 
cells that comprise them, are described in this panel. 
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GROUND TISSUE 



The ground tissue system contains 
three main cell types called parenchyma, 
collenchyma, and sclerenchyma. 



Parenchyma cells are found in all tissue systems. They are living 
cells, generally capable of further division, and have a thin 
primary cell wall. These cells have a variety of functions. The 
apical and lateral meristematic cells of shoots and roots provide 
the new cells required for growth. Food production and storage 
occur in the photosynthetic cells of the leaf and stem (called 
mesophyll cells); storage parenchyma cells form the bulk of most fruits 
and vegetables. Because of their proliferative capacity, parenchyma 
cells also serve as stem cells for wound healing and regeneration. 
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A transfer cell, a specialized form of 
the parenchyma cell, is readily 
identified by elaborate ingrowths of 
the primary cell wall. The increase 
in the area of the plasma membrane 
beneath these walls facilitates the 
rapid transport of solutes to and 
from cells of the vascular system. 



Collenchyma are living cells similar to parenchyma cells 

except that they have much thicker 
cell walls and are usually 
elongated and packed into long 
ropelike fibers. They are capable 
of stretching and provide 
mechanical support in the ground 
tissue system of the elongating 
regions of the plant. Collenchyma 
cells are especially common in 
subepidermal regions of stems. 
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fiber bundle 



Sclerenchyma, like collenchyma, have strengthening and 

supporting functions. However, they 
are usually dead cells with thick, 
lignified secondary cell walls that 
prevent them from stretching as the 
plant grows. Two common types are 
fibers, which often form long bundles, 
and sclereids, which are shorter 
branched cells found in seed coats 
and fruit. 
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DERMAL TISSUE 

The epidermis is the primary outer 
protective covering of the plant body. Cells 
of the epidermis are also modified to form 
stomata and hairs of various kinds. 
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Hairs (or trichomes) are appendages derived 
from epidermal cells. They exist in a variety 
of forms and are commonly found in all 
plant parts. Hairs function in protection, 
absorption, and secretion; for example, 
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The epidermis (usually one layer of cells 
deep) covers the entire stem, leaf, and root 
of the young plant. The cells are living, 
have thick primary cell walls, and are 
covered on their outer surface by a special 
cuticle with an outer waxy layer. The cells 
are tightly interlocked in different patterns. 




upper epidermis 
of a leaf 
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epidermis of 
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Stomata are openings in the epidermis, 
mainly on the lower surface of the leaf, that 
regulate gas exchange in the plant. They are 
formed by two specialized epidermal cells 
called guard cells, which regulate the diameter 
of the pore. Stomata are distributed in a 
distinct species-specific pattern within 
each epidermis. 



Vascular bundles 

Roots usually have a single vascular 
bundle, but stems have several 
bundles. These are arranged with 
strict radial symmetry in dicots, but 
they are more irregularly dispersed 
in monocots. 



sheath of 
sclerenchyma 

phloem 




young, single-celled hairs in the 
epidermis of the cotton seed. When 
these grow, the walls will be 
secondarily thickened with cellulose 
to form cotton fibers. 





The phloem and the xylem together form a 

continuous vascular system throughout the 

plant. In young plants they are usually 
| associated with a variety of other cell types 

in vascular bundles. Both phloem and 
I xylem are complex tissues. Their conducting 

elements are associated with parenchyma 

cells that maintain the elements and 

exchange materials with them. In addition, 

| groups of collenchyma and sclerenchyma cells provide mechanical support. 
Phloem 



xylem 
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a typical vascular bundle from 
the young stem of a buttercup 
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Phloem is involved in the transport of organic solutes in the plant. 
The main conducting cells (elements) are aligned to form tubes 
called sieve tubes. The sieve-tube elements at maturity are living 
cells, interconnected by perforations in their end walls formed 
from enlarged and modified plasmodesmata (sieve plates). These 
cells retain their plasma membrane, but they have lost their nuclei 
and much of their cytoplasm; they therefore rely on associated 
companion cells for their maintenance. These companion cells 
have the additional function of actively transporting soluble food 
molecules into and out of sieve-tube elements through porous sieve 
areas in the wall. 



a multicellular 
secretory hair from single-celled root hairs 
a geranium leaf naV e an important function | 
in water and ion uptake. 



Xylem 

Xylem carries water and dissolved ions 
in the plant. The main conducting cells 
are the vessel elements shown here, 
which are dead cells at maturity that lack 
a plasma membrane. The cell wall has 
been secondarily 
thickened and 
heavily lignified. 
As shown below, 
its end wall is 
largely removed, 
enabling very 
long, continuous 
tubes to be 
formed. 



large, mature 
vessel element 



The vessel elements are 
closely associated with xylem 
parenchyma cells, which 
actively transport selected 
solutes into and out of the 
elements across the 
parenchyma cell plasma 
membrane. 

xylem parenchyma cells 



vessel element 
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Figure 22-1 1 7 Plant growth regulators. The formula of one naturally occurring representative molecule 
from each of six groups of plant growth regulatory molecules is shown. 



all synthesized by most plant cells and can either act locally or be transported to 
influence target cells at a distance. Auxin, for example, is transported from cell to 
cell at a rate of about 1 cm per hour from the tip of a shoot toward its base. Each 
growth regulator has multiple effects, and these are modulated by the other 
growth regulators, as well as by environmental cues and nutritional status. Thus, 
auxin alone can promote root formation, but in conjunction with gibberellin it 
can promote stem elongation, with cytokinin, it can suppress lateral shoot out- 
growth, and with ethylene it can stimulate lateral root growth. Remarkably, as we 
shall see below, auxin also controls the detailed patterns of cell specialization on 
a microscopic scale in the apical meristem. The receptors that recognize some of 
these growth regulators are discussed in Chapter 15. 



The Shaping of Each New Structure Depends on Oriented Cell 
Division and Expansion 

Plant cells, imprisoned within their cell walls, cannot crawl about and cannot be 
shuffled as the plant grows; but they can divide, and they can swell, stretch, and 
bend. The morphogenesis of a developing plant therefore depends on orderly 
cell divisions followed by strictly oriented cell expansions. Most cells produced 
in the root-tip meristem, for example, go through three distinct phases of devel- 
opment — division, growth (elongation), and differentiation. These three steps, 
which overlap in both space and time, give rise to the characteristic architecture 
of a root tip. Although the process of cell differentiation often begins while a cell 
is still enlarging, it is comparatively easy to distinguish in a root tip a zone of cell 
division, a zone of oriented cell elongation (which accounts for the growth in 
length of the root), and a zone of cell differentiation (Figure 22-118). 

In the phase of controlled expansion that generally follows cell division, the 
daughter cells may often increase in volume by a factor of 50 or more. This 
expansion is driven by an osmotically based turgor pressure that presses out- 
ward on the plant cell wall, and its direction is determined by the orientation of 
the cellulose fibrils in the cell wall, which constrain expansion along one axis 
(see Figure 19-73). The orientation of the cellulose in turn is apparently con- 
trolled by the orientation of arrays of microtubules just inside the plasma mem- 
brane, which are thought to guide cellulose deposition (discussed in Chapter 
19). These orientations can be rapidly changed by plant growth regulators, such 
as ethylene and gibberellic acid (Figure 22-1 19), but the molecular mechanisms 
underlying these dramatic cytoskeletal rearrangements are still unknown. 
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Figure 22-1 1 8 A growing root tip. 

(A) The organization of the final 2 mm of 
a growing root tip. The approximate 
zones in which cells can be found 
dividing, elongating, and differentiating 
are indicated. (B) The apical meristem and 
root cap of a corn root tip, showing the 
orderly files of cells produced. (B, from 
R.F. Evert, Biology of Plants, 4th ed. New 
York: Worth, 1986.) 
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Each Plant Module Grows From a Microscopic Set of Primordia in 
a Meristem 

The apical meristems are self-perpetuating: in a perennial plant, they carry on 
with their functions indefinitely, as long as the plant survives, and they are 
responsible for its continuous growth and development. But apical meristems 
also give rise to a second type of outgrowth, whose development is strictly lim- 
ited and culminates in the formation of a structure such as a leaf or a flower, with 
a determinate size and shape and a short lifespan. Thus, as a vegetative (non- 
flowering) shoot elongates, its apical meristem lays down behind itself an 
orderly sequence of nodes, where leaves have grown out, and internodes (seg- 
ments of stem) . In this way the continuous activity of the meristem produces an 
ever increasing number of similar modules, each consisting of a stem, a leaf, and 
a bud (see Figure 22-111). The modules are connected to one another by sup- 
portive and transport tissue, and successive modules are precisely located rela- 
tive to each other, giving rise to a repetitively patterned structure. This iterative 
mode of development is characteristic of plants and is seen in many other struc- 
tures besides the stem-leaf system (Figure 22-120). 

Although the final module may be large, its organization, like that of an ani- 
mal embryo, is mapped out at first on a microscopic scale. At the apex of the 
shoot, within a space of a millimeter or less, one finds a small, low central dome 
surrounded by a set of distinctive swellings in various stages of enlargement 
(Figure 22-121). The central dome is the apical meristem itself; each of the sur- 
rounding swellings is the primordium of a leaf. This small region, therefore, con- 
tains the already distinct rudiments of several entire modules. Through a well- 
Figure 22-1 1 9 The different effects of the plant growth regulators 
ethylene and gibberellic acid. These regulators exert rapid and opposing 
effects on the orientation of the cortical microtubule array in cells of young 
pea shoots. A typical cell in an ethylene-treated plant (B) shows a net 
longitudinal orientation of microtubules, while a typical cell in a 
gibberellic-acid-treated plant (C) shows a net transverse orientation. New 
cellulose microfibrils are deposited parallel to the microtubules. Since this 
influences the direction of cell expansion, gibberellic acid and ethylene 
encourage growth in opposing directions: ethylene-treated seedlings will 
develop short, fat shoots (A), while gibberellic-acid-treated seedlings will 
develop long, thin shoots (D). 
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Figure 22-120 Repetitive patterning in 
plants. Accurate placing of successive 
modules from a single apical meristem 
produces these elaborate but regular 
patterns in leaves (A), flowers (B), and 
fruits (C). (A, from John Sibthorp, Flora 
Graeca. London: R.Taylor, 1806-1840; 

B, from Pierre Joseph Redoute, Les 
Liliacees. Paris: chez I'Auteur, 1807; 

C, from Christopher Jacob Trew, 
Uitgezochte planten. Amsterdam: Jan 
Christiaan Sepp, 1771 — all courtesy of 
the John Innes Foundation.) 



defined program of cell proliferation and cell enlargement, each leaf pri- 
mordium and its adjacent cells will grow to form a leaf, a node, and an intern- 
ode. Meanwhile, the apical meristem itself will give rise to new leaf primordia, so 
as to generate more and more modules in a potentially unending succession. 
The serial organization of the modules of the plant is thus controlled by events 
at the shoot apex. 



Polarized Auxin Transport Controls the Pattern of Primordia in the 
Meristem 

What are the signals that operate in the tiny apical region to determine the 
arrangement of primordia, and how are these signals generated in the appropri- 
ate pattern? A clue comes from mutation of a gene called Pinl, whose loss pre- 
vents formation of leaf primordia but allows the main stem to continue growing, 
producing a long thin bare structure shaped like a pin, with the apical meristem 
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Figure 22-1 21 A shoot apex from a 
young tobacco plant. (A) A scanning 
electron micrograph shows the shoot 
apex with two sequentially emerging leaf 
primordia, seen here as lateral swellings 
on either side of the domed apical 
meristem. (B) A thin section of a similar 
apex shows that the youngest leaf 
primordium arises from a small group of 
cells (about 1 00) in the outer four or five 
layers of cells. (C) A very schematic 
drawing showing that the sequential 
appearance of leaf primordia takes place 
over a small distance and very early in 
shoot development. Growth of the apex 
will eventually form internodes that will 
separate the leaves in order along the 
stem (see Figure 22-1 1 1). (A and B, from 
R.S. Poethig and I.M. Sussex, Planta 
1 65:1 58-1 69, 1 985. With permission from 
Springer-Verlag.) 
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at its head. The Pinl protein is an auxin transporter, driving efflux across the 
plasma membrane into the extracellular space. This suggests that the leaf pri- 
mordia are missing in the mutant because the auxin distribution is wrong. 
Indeed, a microdroplet of auxin applied to one side of a Pinl or similar type of 
mutant apical meristem, on the side of the head of the "pin", will induce a leaf or 
flower primordium to form at the site of the auxin application (Figure 22-122A). 

One can observe the distribution of Pinl transporter protein in the living tis- 
sue by creating a transgenic (but otherwise normal) plant that expresses a form 
of Pinl tagged with Green Fluorescent Protein (Figure 22-122B-D). In the outer- 
most layer of meristem cells, the amount of Pinl varies from region to region in 
a pattern that correlates with the pattern of developing primordia because the 
Pinl gene is upregulated by auxin. Moreover, the Pinl protein is asymmetrically 
distributed in the membranes of the individual cells, so that they pump out 
more auxin on one side than the other, creating local maxima that specify where 
primordia will begin to form. The pumps appear to be concentrated on the side 
facing the neighbors whose own auxin concentration is the highest, suggesting 
that there is a positive feedback in the accumulation of auxin. Computer mod- 
els show that positive feedback of this type can amplify asymmetry and gener- 
ate a pattern of peaks and troughs of auxin concentration of the sort observed. 
Localized transport of auxin in a perpendicular direction, between the outer 
sheet of meristematic cells and the developing strands of vascular tissue below, 
contributes to the asymmetry. As the cells proliferate and the tissue grows, the 
distributions of Pinl protein and of auxin adjust, producing new peaks and new 
lateral primordia in regular succession. 

Variations on this basic repetitive theme can give rise to more complex 
architectures, including structures such as tendrils, leaves, branches, and flow- 
ers. Thus, by switching on different sets of genes at the shoot apex, the plant can 
produce different types of primordia, in different spatial patterns. 



Cell Signaling Maintains the Meristem 

Central to all these phenomena is the question of how the apical meristem 
maintains itself. The meristem cells must continue to proliferate for weeks, 
years, or even centuries as a plant grows, replacing themselves while continu- 
ally generating progeny cells that differentiate. Through all this, the size of the 
cluster of cells that constitute the meristem remains practically constant (about 
100 cells in Arabidopsis, for example). New meristems may arise as the plant 
branches, but they too preserve the same size. 

Genetic screens have identified genes required for meristem maintenance. 
For example, mutations that disrupt the Wuschel gene, which codes for a home- 
odomain protein, convert the apical meristem into non-meristematic tissue, so 
that the seedling fails to sprout. Conversely, mutations in the Clavata group of 
genes, coding for components of a cell-cell signaling pathway (see Figure 
15-83), make the meristem abnormally big. These genes are expressed in dif- 
ferent layers of cells in the meristem region (Figure 22-123A). The two most 



Figure 22-122 Control of patterning in a 
meristem by auxin and Pinl . (A) A 

microdroplet containing auxin (green 
spot) has been applied on one side of a 
mutant meristem, phenotypically similar 
to a Pinl mutant because it lacks a 
protein required for control of auxin 
transport. The auxin has induced 
formation of a lateral flower primordium. 

(B) Distribution of the Pinl auxin 
transporter at a meristem. (B) An apical 
meristem of Arabidopsis is viewed from 
above by fluorescence microscopy, 
revealing the distribution of GFP-tagged 
Pinl protein in the surface layer of cells. 

(C) The same image labeled to show the 
locations of the established primordia (Pi 
being the most recently formed, P4 being 
the most mature) and the predicted 
incipient primordia (h being the next to 
form, U being the one that lies farthest in 
the future). (D) Magnified part of (B), 
showing the asymmetric distribution of 
Pinl in the membranes of the individual 
cells, driving auxin toward the site of an 
incipient primordium. Arrows indicate the 
direction of transport. As primordia 
become established, the amount of Pinl 
in their surface layer declines, in part 
because further changes in the 
distribution of the transport proteins 
cause auxin to be pumped downward 
into the developing vascular tissue 
below. Complex patterns of auxin 
transport control the detailed structure of 
many other developing plant tissues also. 
(A, from D. Reinhardt et al., Nature 
426:255-260, 2003. With permission from 
Macmillan Publishers Ltd; B-D, from 
M.G. Heisler et al., Curr. Biol. 

1 5:1 899-1 911, 2005. With permission 
from Elsevier.) 
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superficial cell layers, called the LI and L2 layers, together with the uppermost 
part of the L3 layer, contain the cells of the meristem proper, that is, the stem 
cells, capable of dividing indefinitely to give rise to future parts of the plant. The 
meristematic cells of the LI and L2 layers express Clavata3, a small secreted sig- 
nal protein. Just beneath, in the L3 layer, lies a cluster of cells expressing 
Clavatal (the receptor for Clavata3). In the center of this Clavatal patch are cells 
that express the Wuschel gene regulatory protein. 

The pattern of cell divisions implies that the cells expressing Wuschel are not 
themselves part of the meristem proper; new Wuschel- expressing cells are 
apparently continually recruited from the meristematic (stem-cell) part of the 
L3 population, just above the Wuschel domain. Nevertheless, the Wuschel- 
expressing cells are at the heart of the mechanism that maintains the meristem. 
A signal that they produce maintains meristematic behavior in the cells above, 
stimulates expression of the Clavata genes, and, presumably, causes new cells 
recruited into the Wuschel domain to switch on Wuschel. Negative feedback 
from the upper meristematic cells, delivered by the Clavata signaling pathway, 
acts back on the regions below to limit the size of the Wuschel domain, thereby 
preventing the meristem from becoming too big (Figure 22-123B). 

This account of the plant meristem, though uncertain in some details and 
certainly oversimplified, provides one of the clearest examples of an important 
general developmental strategy: it shows how a feedback loop involving a short- 
range activating signal (such as that produced by the Wuschel- expressing cells) 
and a long-range inhibitory signal (such as Clavata3) can stably maintain a sig- 
naling center of a well-defined size even when there is continual proliferation 
and turnover of the cells that form that center. As we pointed out at the beginning 
of this chapter, analogous systems of signals are thought to operate in animal 
development to maintain localized signaling centers — such as the Organizer of 
the amphibian gastrula, or the zone of polarizing activity in a limb bud. And just 
as this strategy serves in the mature plant to maintain its meristems, it may also 
serve in adult animal tissues such as the gut lining (discussed in Chapter 23) to 
maintain the all-important clusters of adult animal stem cells. 



Figure 22-1 23 The feedback loops that 
are thought to maintain the shoot 
apical meristem. (A) The arrangement of 
cell layers constituting a shoot apical 
meristem. (B) The pattern of cell-cell 
communication that maintains the 
meristem. Artificial overexpression of 
Wuschel in the L3 region causes an 
increase in the number of cells in the L1 
and L2 layers that behave as meristem 
cells and express Clavata3; artificial 
overexpression of Clavata 3 in the L1 and 
L2 layers causes a reduction of Wuschel 
expression in the L3 region below and a 
decrease in the number of meristem cells. 
Clavata3 codes for a small signal protein, 
while Clavatal codes for its receptor, a 
transmembrane protein kinase. Wuschel, 
which is expressed in the central part of 
the region that expresses the receptor 
Clavatal, codes for a gene regulatory 
protein of the homeodomain class. The 
size of the meristem is thought to be 
controlled by a self-regulating balance 
between a short-range stimulatory signal 
produced by cells expressing Wuschel 
(yellow arrow), and a longer-range 
inhibitory signal delivered by Clavata3 
(red bars). 



Regulatory Mutations Can Transform Plant Topology by Altering 
Cell Behavior in the Meristem 



If a plant stem is to branch, new shoot apical meristems must be created, and 
this too depends on events in the neighborhood of the shoot apex. At each 
developing node, in the acute angle (the axil) between the leaf primordium and 
the stem, a bud is formed (Figure 22-124). This contains a nest of cells, derived 
from the apical meristem, that keep a meristematic character. They have the 
capacity to become the apical meristem of a new branch or the primordium of a 
structure such as a flower; but they also have the alternative option of remaining 
quiescent as axillary buds. The plant's pattern of branching is regulated through 
this choice of fate, and mutations that affect it can transform the structure of the 
plant. Maize provides a beautiful example. 

Maize represents one of mankind's most remarkable feats of genetic engi- 
neering. Native Americans created it by selective breeding, over a period of sev- 
eral centuries or perhaps millennia between 5000 and 10,000 years ago. They 
started from a wild grass known as teosinte, with highly branched leafy stems 
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and tiny ears bearing hard, inedible kernels. Detailed genetic analysis has iden- 
tified a handful of genetic loci — about five — as the sites of the mutations that 
account for most of the difference between this unpromising ancestor and mod- 
ern corn. One of these loci, with a particularly dramatic effect, corresponds to a 
gene called Teosinte branched-1 (Tbl). In maize with loss- of- function mutations 
in Tbl, the usual simple unbranched stem, with a few large leaves at intervals 
along it, is transformed into a dense, branching, leafy mass reminiscent of 
teosinte (Figure 22-125A). The pattern of branching in the mutant implies that 
axillary buds, originating in normal positions, have escaped from an inhibition 
that prevents them, in normal maize, from growing into branches. 

In normal maize, the single stem is crowned with a tassel — a male flower — 
while a few of the axillary buds along the stem develop into female flowers and, 
upon fertilization, form the ears of corn that we eat. In the mutant maize with a 
defective Tbl gene, these fruitful axillary buds are transformed into branches 
bearing tassels. The wild teosinte plant is like the 71?i-defective maize in its 
leafy, highly branched appearance, but unlike this mutant it makes ears on 
many of its side branches, as though Tbl were active. DNA analysis reveals the 
explanation. Both teosinte and normal maize possess a functional Tbl gene, 
with an almost identical coding sequence, but in maize the regulatory region has 
undergone a mutation that boosts the level of gene expression. Thus, in normal 
maize the gene is expressed at a high level in every axillary bud, inhibiting 
branch formation, while in teosinte the expression in many axillary buds is low, 
so that branches are permitted to form (Figure 22-125B). 

This example shows how simple mutations, by switching the behavior 
of meristem cells, can transform plant structure — a principle of enormous 
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bud primordia 



leaf base 



Figure 22-124 Axillary buds in the 
neighborhood of a shoot apex. The 

photograph shows a longitudinal section 
of Coleus blumei, a common houseplant. 
(From P.H. Raven, R.F. Evert and 
S.E. Eichhorn, Biology of Plants, 6th ed. 
New York: Freeman/Worth, 1999, used 
with permission.) 



teosinte 



normal maize 



(B) 



Tb 1 -defective 
mutant of maize 



Figure 22-1 25 Transformation of plant 
architecture by mutation: a comparison 
of teosinte, normal maize, and 
76 7 -defective maize. (A) Photographs of 
the three types of plants. (B) The 
architecture of teosinte, normal maize 
and the Tb /-defective maize compared 
schematically. The Tbl gene product is 
needed for development of ears. It is 
absent in the Tbl mutant; it is present in 
both teosinte and normal maize, but 
these two plants differ because the gene 
is differently regulated. (A. left image, 
from J. Doebley and R.L Wang, Cold 
Spring Harbor Symp. Quant Biol. 
62:361-367, 1997. With permission from 
Cold Spring Harbor Laboratory Press; 
A, middle and right images, from 
J. Doebley, A. Stec and L Hubbard, Nature 
386:485-488, 1997. With permission from 
Macmillan Publishers Ltd.) 
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importance in the breeding of plants for food. More generally, the case of Tbl illus- 
trates how new body plans, whether of plant or animal, can evolve through 
changes in regulatory DNA without change in the characters of the proteins made. 



The Switch to Flowering Depends on Past and Present 
Environmental Cues 

Meristems face other developmental choices besides that between quiescence 
and growth, as we have already seen in our discussion of maize, and these also 
are frequently regulated by the environment. The most important is the decision 
to form a flower (Figure 22-126). 

The switch from meristematic growth to flower formation is triggered by a 
combination of cues. The plant does not merely take account of the current tem- 
perature, light intensity, and nutritional conditions; it bases its decision to flower 
on past conditions as well. One important cue, for many plants, is day length. To 
sense this, the plant uses its circadian clock — an endogenous 24-hour rhythm of 
gene expression — to generate a signal for flowering only when there is light for 
the appropriate part of the day. The clock itself is influenced by light, and the 
plant in effect uses the clock to compare past to present lighting conditions. 
Important parts of the genetic circuitry underlying these phenomena have been 
identified, from the phytochromes and cryptochromes that act as light receptors 
(discussed in Chapter 15) to the Constans gene, whose expression in the leaves 
of the plant represents a signal for flowering. The signal is thought to be relayed 
from the leaves to the meristem via the vasculature by the product of another 
gene, Flowering locus T {Ft), that is regulated by Constans. 

But this signal itself will reach the meristem and trigger flowering only if the 
plant is in a receptive condition, typically depending on its history over a much 
longer period. Many plants will flower only if they have previously spent a long 
time in the cold: they must pass through winter before they will behave as 
though it is spring — a process called vernalization. The prolonged cold brings 
about changes in chromatin structure, dependent on another large collection of 
genes, including homologs of members of the Polycomb group that we men- 
tioned earlier for their role in perpetuating patterns of gene expression in 
Drosophila. These epigenetic changes (discussed in Chapters 4 and 7) result in 
the gradual silencing of the Flowering locus C (Flc) gene. The effect is long-last- 
ing, persisting through many rounds of cell division even as the weather grows 
warmer. Flc codes for an inhibitor of flowering, antagonizing the expression and 
action of Ft. Thus vernalization, by blocking production of the inhibitor, enables 
the meristem to receive the Ft signal and respond to it by switching on the 
expression of a set of floral meristem-identity genes in the apical meristem. 

Mutations affecting the regulation of Flc expression alter the time of flow- 
ering and thus the ability of a plant to flourish in a given climate. The whole 




petal 



carpel 




Figure 22-1 26 The structure of an 
Arabidopsis flower. (A) Photograph. 
(B) Schematic cross-sectional view. The 
basic plan, as shown in (B), is common to 
most flowering dicotyledonous plants. 
(A, courtesy of Leslie Sieburth.) 
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Figure 22-1 27 Arabidopsis flowers 
showing a selection of homeotic 
mutations. (A) In Apetala2, sepals are 
converted into carpels and petals into 
stamens; (B) In Apetala3, petals are 
converted into sepals and stamens into 
carpels; (C) In Agamous, stamens are 
converted into petals and carpels into 
floral meristem. (D) In a triple mutant 
where these three functions are defective, 
all the organs of the flower are converted 
nto leaves. (A-C, courtesy of Leslie 
Sieburth; D, courtesy of Mark Running.) 



control system governing the switch to flowering thus is of vital importance for 
agriculture, especially in an era of rapid climate change. 



Homeotic Selector Genes Specify the Parts of a Flower 

By switching on the floral meristem -identity genes, the apical meristem aban- 
dons its chances of continuing vegetative growth and gambles its future on the 
production of gametes. Its cells embark on a strictly finite program of growth 
and differentiation: by a modification of the ordinary mechanisms for generat- 
ing leaves, a series of whorls of specialized appendages are formed in a precise 
order — typically sepals first, then petals, then stamens carrying anthers con- 
taining pollen, and lastly carpels containing eggs (see Panel 22-1). By the end of 
this process the meristem has disappeared, but among its progeny it has created 
germ cells. 

The series of modified leaves forming a flower can be compared to the series 
of body segments forming a fly. In plants, as in flies, one can find homeotic 
mutations that convert one part of the pattern to the character of another. The 
mutant phenotypes can be grouped into at least four classes, in which different 
but overlapping sets of organs are altered (Figure 22-127). The first or 7V class, 
exemplified by the Apetala2 mutant of Arabidopsis, has its two outermost whorls 
transformed: the sepals are converted into carpels and the petals into stamens. 
The second or 'B' class, exemplified by Apetala3, has its two middle whorls 
transformed: the petals are converted into sepals and the stamens into carpels. 
The third or 'C class, exemplified by Agamous, has its two innermost whorls 
transformed, with a more drastic consequence: the stamens are converted into 
petals, the carpels are missing, and in their place the central cells of the flower 
behave as a floral meristem, which begins the developmental performance all 
over again, generating another abnormal set of sepals and petals nested inside 
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(A) NORMAL FLOWER 



whorl 1 (sepal) 
whorl 2 (petal) 
whorl 3 (stamen) 
whorl 4 (carpel) 








gene A expression 
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gene C expression 
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floral meristem 



normal flower 
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(B) MUTANT FLOWER LACKING GENE B (Apetala3) EXPRESSION 



sepal 



whorls 1 and 2 

whorls 3 and 4 




gene A expression 
(Apetala2) 







sepal 
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Figure 22-128 Homeotic selector gene expression in an Arabidopsis flower. (A) Diagram of the normal expression 
patterns of the three genes whose mutant phenotypes are illustrated in Figure 22-1 27A-C. All three genes code for gene 
regulatory proteins. The colored shading on the flower indicates which organ develops from each whorl of the meristem, 
but does not imply that the homeotic selector genes are still expressed at this stage. (B) The patterns in a mutant where the 
Apetala3 gene is defective. Because the character of the organs in each whorl is defined by the set of homeotic selector 
genes that they express, the stamens and petals are converted into sepals and carpels. The consequence of a deficiency of a 
gene of class A, such as Apetolo2, is slightly more complex: the absence of this class A gene product allows the class C gene 
to be expressed in the outer two whorls as well as the inner two, causing these outer whorls to develop as carpels and 
stamens, respectively. Deficiency of a class C gene prevents the central region from undergoing terminal differentiation as a 
carpel and causes it instead to continue growth as a meristem, generating more and more sepals and petals. 



the first and, potentially, another nested inside that, and so on, indefinitely. A 
fourth class, the Sepallata mutants, has its three inner whorls all transformed 
into sepals. 

These phenotypes identify four classes of homeotic selector genes, which, 
like the homeotic selector genes of Drosophila, all code for gene regulatory pro- 
teins. These are expressed in different domains and define the differences of cell 
state that give the different parts of a normal flower their different characters, as 
shown in Figure 22-128. The gene products collaborate to form protein com- 
plexes that drive expression of the appropriate downstream genes. In a triple 
mutant where the A, B, and C genetic functions are all absent, one obtains in 
place of a flower an indefinite succession of tightly nested leaves (see Figure 
22-127D). Conversely, in a transgenic plant where genes of the A, B, and Sepal- 
lata classes are all expressed together outside their normal domains, leaves are 
transformed into petals. Leaves therefore represent a "ground state" in which 
none of these homeotic selector genes are expressed, while the other types of 
organ result from expressing the genes in different combinations. 

Similar studies have been carried out in other plant species, and a similar set 
of phenotypes and genes have been identified: plants, no less than animals, 
have conserved their homeotic selector gene systems. Gene duplication has 
played a large part in the evolution of these genes: several of them, required in 
different organs of the flower, have clearly homologous sequences. These are not 
of the homeobox class but are members of another family of gene regulatory 
proteins (the so-called MADS family), also found in yeast and in vertebrates. 

Clearly, plants and animals have independently found very similar solutions 
to many of the fundamental problems of multicellular development. 
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Summary 

The development of a flowering plant, like that of an animal, begins with division of a 
fertilized egg to form an embryo with a polarized organization: the apical part of the 
embryo will form the shoot, the basal part, the root, and the middle part, the stem. At 
first, cell division occurs throughout the body of the embryo. As the embryo grows, 
however, addition of new cells becomes restricted to small regions known as meristems. 
Apical meristems, at shoot tips and root tips, will persist throughout the life of the 
plant, enabling it to grow by sequentially adding new body parts at its periphery. Typ- 
ically, the shoot generates a repetitive series of modules, each consisting of a segment of 
stem, a leaf, and an axillary bud. Polarized transport of auxin controls the positioning 
of the primordia of these structures as they arise in the neighborhood of the meristem. 
An axillary bud is a potential new meristem, capable of giving rise to a side branch; the 
environment — and long-range hormonal signals within the plant can control the 
development of the plant by regulating bud activation. Mutations that alter the rules 
for activating axillary buds can have a drastic effect on the shape and structure of the 
plant; a single such mutation — one of about five key genetic alterations — accounts for 
a large part of the dramatic difference between modern maize and its wild ancestor, 
teosinte. 

The small weed Arabidopsis thaliana is widely used as a model organism for 
genetic studies and is the first plant to have had its genome completely sequenced. As 
in animals, genes governing plant development can be identified through genetic 
screens and their functions tested by genetic manipulations. Such studies have begun 
to reveal the molecular mechanisms by which the internal organization of each plant 
module is sketched out on a microscopic scale through cell-cell interactions in the 
neighborhood of the apical meristem. The meristem itself appears to be maintained by 
a local feedback loop, in which cells expressing the gene regulatory protein Wuschel 
provide a positive stimulus, and a negative feedback dependent on the Clavata 
cell-cell signaling pathway keeps the meristem from becoming too big. 

Environmental cues — especially light that is appropriately timed — can cause the 
expression of genes that switch the apical meristem from a leaf-forming to a flower- 
forming mode. The parts of a flower — its sepals, petals, stamens, and carpels — are 
formed by a modification of the mechanism for development of leaves, and the differ- 
ences between these parts are controlled by homeotic selector genes that are closely 
analogous (although not homologous) to those of animals. 
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Specialized Tissues, Stem Cells, 
and Tissue Renewal 




Cells evolved originally as free-living individuals, but the cells that matter most 
to us, as human beings, are specialized members of a multicellular community. 
They have lost features needed for independent survival and acquired peculiar- 
ities that serve the needs of the body as a whole. Although they share the same 
genome, they are spectacularly diverse: there are more than 200 different named 
cell types in the human body (see our web site for a list). These collaborate with 
one another to form many different tissues, arranged into organs performing 
widely varied functions. To understand them, it is not enough to analyze them 
in a culture dish: we need also to know how they live, work, and die in their nat- 
ural habitat, the intact body. 

In Chapters 7 and 21, we saw how the various cell types become different in 
the embryo and how cell memory and signals from their neighbors enable them 
to remain different thereafter. In Chapter 19, we discussed the building technol- 
ogy of multicellular tissues — the devices that bind cells together and the extra- 
cellular materials that give them support. In this chapter, we consider the func- 
tions and lifestyles of the specialized cells in the adult body of a vertebrate. We 
describe how cells work together to perform their tasks, how new specialized 
cells are born, how they live and die, and how the architecture of tissues is pre- 
served despite the constant replacement of old cells by new. We examine in par- 
ticular the role played in many tissues by stem cells — cells that are specialized to 
provide an indefinite supply of fresh differentiated cells where these are lost, dis- 
carded, or needed in greater numbers. 

We discuss these topics through a series of examples — some chosen because 
they illustrate important general principles, others because they highlight 
favorite objects of study, still others because they pose intriguing problems that 
cell biology has yet to solve. Finally, we shall confront the practical question that 
underlies the current storm of interest in stem cells: How can we use our under- 
standing of the processes of cell differentiation and tissue renewal to improve 
upon nature, and make good those injuries and failings of the human body that 
have hitherto seemed to be beyond repair? 
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EPIDERMIS AND ITS RENEWAL BY STEM CELLS 

We begin with a very familiar tissue: the skin. Like almost all tissues, skin is a 
complex of several different cell types. To perform its basic function as a barrier, 
the outer covering of the skin depends on a variety of supporting cells and struc- 
tures, many of which are required in most other tissues also. It needs mechanical 
support, largely provided by a framework of extracellular matrix, mainly secreted 
by fibroblasts. It needs a blood supply to bring nutrients and oxygen and to 
remove waste products and carbon dioxide, and this requires a network of blood 
vessels, lined with endothelial cells. These vessels also provide access routes for 
cells of the immune system to defend against infection: macrophages and den- 
dritic cells, to phagocytose invading pathogens and help activate lymphocytes, 
and lymphocytes themselves, to mediate more sophisticated adaptive immune 
system responses (discussed in Chapter 24). Nerve fibers are needed too, to con- 
vey sensory information from the tissue to the central nervous system, and to 
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deliver signals in the opposite direction for glandular secretion and smooth 
muscle contraction. 

Figure 23-1 illustrates the architecture of the skin and shows how it satis- 
fies all these requirements. An epithelium, the epidermis, forms the outer cov- 
ering, creating a waterproof barrier that is self- repairing and continually 
renewed. Beneath this lies a relatively thick layer of connective tissue, which 
includes the tough collagen-rich dermis (from which leather is made) and the 
underlying fatty subcutaneous layer or hypodermis. In the skin, as elsewhere, the 
connective tissue, with vessels and nerves running through it, provides most of 
the general supportive functions listed above. The epidermis, however, is the 
fundamental, quintessential component of the skin — the tissue that is peculiar 
to this organ, even though not the major part of its bulk. Appendages such as 
hairs, fingernails, sebaceous glands, and sweat glands develop as specializations 
of the epidermis (Figure 23-2). Complex mechanisms regulate the distribution 
of these structures and their distinctive patterns of growth and renewal. The 
regions of less specialized, more or less flat epithelium covering the body surface 
between the hair follicles and other appendages are called inter follicular epider- 
mis. This has a simple organization, and it provides a good introduction to the 
way in which tissues of the adult body are continually renewed. 



Figure 23-2 A hair follicle and its associated sebaceous gland. These structures 
form as specializations of the epidermis. The hair grows upward from the papilla at 
its base. The sebaceous gland contains cells loaded with lipid, which is secreted to 
keep the hair properly oiled. The whole structure undergoes cycles of growth, 
regression (when the hair falls out), and reconstruction. Like the rest of the 
epidermis, it depends on stem cells for its growth and reconstruction in each 
cycle. An important group of stem cells (red), able to give rise to both hair follicle 
and interfollicular epidermis, lie in a region called the bulge, just below the 
sebaceous gland. 
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Figure 23-1 Mammalian skin. (A) These 
diagrams show the cellular architecture 
of thick skin. (B) Micrograph of a cross 
section through the sole of a human foot, 
stained with hematoxylin and eosin.The 
skin can be viewed as a large organ 
composed of two main tissues: the 
epidermis, and the underlying connective 
tissue, which consists of the dermis and 
the hypodermis. Each tissue is composed 
of several different cell types. The dermis 
and hypodermis are richly supplied with 
blood vessels and nerves. Some nerve 
fibers extend into the epidermis. 
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Figure 23-3 The multilayered structure 
of the epidermis, as seen in a mouse. 

The outlines of the keratinized squames 
are revealed by swelling them in a 
solution containing sodium hydroxide. 
The highly ordered hexagonal 
arrangement of interlocking columns of 
cells shown here occurs only in some 
sites where the epidermis is thin. In 
human skin, the stacks of squames are 
usually many times higher and less 
regular, and where the skin is very thick 
mitotic cells are seen not only in the 
basal layer but also in the first few cell 
layers above it. In addition to the cells 
destined for keratinization, the deep 
layers of the epidermis include small 
numbers of different types of cells, as 
indicated in Figure 23-1, including 
dendritic cells, called Langerhans cells, 
derived from bone marrow; melanocytes 
(pigment cells) derived from the neural 
crest; and Merkel cells, which are 
associated with nerve endings in the 
epidermis. 



Epidermal Cells Form a Multilayered Waterproof Barrier 

The interfollicular epidermis is a multilayered (stratified) epithelium composed 
largely of keratinocytes (so named because their characteristic differentiated 
activity is the synthesis of keratin intermediate filament proteins, which give the 
epidermis its toughness) (Figure 23-3). These cells change their appearance 
from one layer to the next. Those in the innermost layer, attached to an under- 
lying basal lamina, are termed basal cells, and it is usually only these that divide. 
Above the basal cells are several layers of larger prickle cells (Figure 23-4), 
whose numerous desmosomes — each a site of anchorage for thick tufts of ker- 
atin filaments — are just visible in the light microscope as tiny prickles around 
the cell surface (hence the name). Beyond the prickle cells lies the thin, darkly 
staining granular cell layer (see Figure 23-3). It is at this level that the cells are 
sealed together to form a waterproof barrier. Mice that fail to form this barrier 
because of a genetic defect die from rapid fluid loss soon after birth, even though 
their skin appears normal in other respects. 

The granular layer, with its barrier to the movement of water and solutes, 
marks the boundary between the inner, metabolically active strata and the outer- 
most layer of the epidermis, consisting of dead cells whose intracellular organelles 
have disappeared. These outermost cells are reduced to flattened scales, or 
squames, filled with densely packed keratin. The plasma membranes of both the 
squames and the outer granular cells are reinforced on their cytoplasmic surface 
by a thin (12 nm), tough, cross-linked layer of proteins, including a cytoplasmic 
protein called involucrin. The squames themselves are normally so compressed 
and thin that their boundaries are hard to make out in the light microscope, but 
soaking in sodium hydroxide solution (or a warm bath tub) makes them swell 
slightly, and their outlines can then be seen (see Figure 23-3). 



Figure 23-4 A prickle cell. This drawing, from an electron micrograph of a 
section of the epidermis, shows the bundles of keratin filaments that 
traverse the cytoplasm and are inserted at the desmosome junctions that 
bind the prickle cell (red) to its neighbors. Nutrients and water diffuse freely 
through the intercellular spaces in the metabolically active layers of the 
epidermis occupied by the prickle cells. Farther out, at the level of the 
granular cells, there is a waterproof barrier that is thought to be created by a 
sealant material that the granular cells secrete. (From R.V. Krstic, 
Ultrastructure of the Mammalian Cell: an Atlas. Berlin: Springer-Verlag, 1979.) 
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Differentiating Epidermal Cells Express a Sequence of Different 
Genes as They Mature 

Let us now set this static picture in motion to see how the epidermis is continu- 
ally renewed. While some basal cells are dividing, adding to the population in 
the basal layer, others (their sisters or cousins) are slipping out of the basal cell 
layer into the prickle cell layer, taking the first step on their outward journey 
When they reach the granular layer, the cells start to lose their nucleus and cyto- 
plasmic organelles, through a degradative mechanism that involves partial acti- 
vation of the machinery of apoptosis; in this way, the cells are transformed into 
the keratinized squames of the keratinized layer. These finally flake off from the 
surface of the skin (and become a main constituent of household dust). The time 
from birth of a cell in the basal layer of the human skin to its loss by shedding 
from the surface is about a month, depending on body region. 

As the new keratinocyte in the basal layer is transformed into the squame in 
the outermost layers (see Figure 23-4), it steps through a succession of different 
states of gene expression, synthesizing a succession of different members of the 
keratin protein family. Meanwhile other characteristic proteins, such as involu- 
crin, also begin to be synthesized as part of a coordinated program of terminal 
cell differentiation — the process in which a precursor cell acquires its final spe- 
cialized characteristics and usually permanently stops dividing. The whole pro- 
gram is initiated in the basal layer. It is here that the fates of the cells are decided. 



Stem Cells in the Basal Layer Provide for Renewal of the 
Epidermis 

Humans renew the outer layers of their epidermis a thousand times over in the 
course of a lifetime. In the basal layer, there have to be cells that can remain 
undifferentiated and carry on dividing for this whole period, continually throw- 
ing off descendants that commit to differentiation, leave the basal layer, and are 
eventually discarded. The process can be maintained only if the basal cell pop- 
ulation is self-renewing. It must therefore contain some cells that generate a 
mixture of progeny, including daughters that remain undifferentiated like their 
parent, as well as daughters that differentiate. Cells with this property are called 
stem cells. They have so important a role in such a variety of tissues that it is use- 
ful to have a formal definition. 

The defining properties of a stem cell are as follows: 

1 . It is not itself terminally differentiated (that is, it is not at the end of a path- 
way of differentiation). 

2. It can divide without limit (or at least for the lifetime of the animal) . 

3. When it divides, each daughter has a choice: it can either remain a stem 
cell, or it can embark on a course that commits it to terminal differentia- 
tion (Figure 23-5) . 

Stem cells are required wherever there is a recurring need to replace differ- 
entiated cells that cannot themselves divide. The stem cell itself has to be able to 
divide — that is part of the definition — but it should be noted that it does not 
necessarily have to divide rapidly; in fact, stem cells usually divide at a relatively 
slow rate. 

The need for stem cells arises in many different tissues. Thus, stem cells are 
of many types, specialized for the genesis of different classes of terminally dif- 
ferentiated cells — epidermal stem cells for epidermis, intestinal stem cells for 
intestinal epithelium, hemopoietic stem cells for blood, and so on. Each stem- 
cell system nevertheless raises similar fundamental questions. What are the dis- 
tinguishing features of the stem cell in molecular terms? What factors determine 

Figure 23-5 The definition of a stem cell. Each daughter produced when a 
stem cell divides can either remain a stem cell or go on to become 
terminally differentiated. In many cases, the daughter that opts for terminal 
differentiation undergoes additional cell divisions before terminal 
differentiation is completed. 
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whether it divides or stays quiescent? What decides whether a given daughter 
cell commits to differentiation or remains a stem cell? And where the stem cell 
can give rise to more than one kind of differentiated cell — as is very often the 
case — what determines which differentiation pathway is followed? 



Figure 23-6 Two ways for a stem cell to 
produce daughters with different fates. 

In the strategy based on environmental 
asymmetry, the daughters of the stem 
cell are initially similar and are directed 
into different pathways according to the 
environmental influences that act on 
them after they are born. The 
environment is shown as colored shading 
around the cell. With this strategy, the 
number of stem cells can be increased or 
reduced to fit the niche available for 
them. In the strategy based on divisional 
asymmetry, the stem cell has an internal 
asymmetry and divides in such a way 
that its two daughters are already 
endowed with different determinants at 
the time of their birth. In some cases, the 
choice between the alternative fates may 
be made at random for each daughter, 
but with a defined probability, like a 
coin-toss, reflecting the intrinsic 
randomness or "noise" in all genetic 
control systems (discussed in Chapter 7). 



The Two Daughters of a Stem Cell Do Not Always Have to Become 
Different 

At steady state, to maintain a stable stem-cell population, precisely 50% of the 
daughters of stem cells in each cell generation must remain as stem cells. In 
principle, this could be achieved in two ways — through environmental asymme- 
try or through divisional asymmetry (Figure 23-6). In the first strategy, the divi- 
sion of a stem cell could generate two initially similar daughters whose fates 
would be governed by their subsequent environment or by some random pro- 
cess with an appropriate environmentally controlled probability; 50% of the 
population of daughters would remain as stem cells, but the two daughters of an 
individual stem cell in the population might often have the same fate. At the 
opposite extreme, the stem cell division could be always strictly asymmetric, 
producing one daughter that inherits the stem- cell character and another that 
inherits factors that force it to embark on differentiation. The neuroblasts of the 
Drosophila central nervous system, discussed in Chapter 22, are an example of 
cells that show this type of divisional asymmetry. This strategy in its strict form 
has a drawback, however: it means that the existing stem cells can never 
increase their numbers, and any loss of stem cells is irreparable, unless by 
recruitment of some other type of cell to become a stem cell. The strategy of con- 
trol by environmental asymmetry is more flexible. 

In fact, if a patch of epidermis is destroyed, the surrounding epidermal cells 
repair the damage by migrating in and proliferating to cover the denuded area. 
In this process, a new self- renewing patch of epidermis is established, implying 
that additional stem cells have been generated to make up for the loss. These 
must have been produced by symmetric divisions in which one stem cell gives 
rise to two. In this way, the stem cell population adjusts its numbers to fit the 
available niche. 

Observations such as these suggest that the maintenance of stem cell char- 
acter in the epidermis might be controlled by contact with the basal lamina, 
with a loss of contact triggering the start of terminal differentiation, and main- 
tenance of contact serving to preserve stem cell potential. This idea contains a 
grain of truth, but it is not the whole truth. As we now explain, not all the cells in 
the basal layer have the capacity to serve as stem cells. 
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The Basal Layer Contains Both Stem Cells and Transit Amplifying 
Cells 

Basal keratinocytes can be dissociated from intact epidermis and can proliferate 
in a culture dish, giving rise to new basal cells and to terminally differentiated 
cells. Even within a population of cultured basal keratinocytes that all seem 
undifferentiated, there is great variation in the ability to proliferate. When human 
keratinocytes are taken singly and tested for their ability to found new colonies, 
some seem unable to divide at all, others go through only a few division cycles 
and then halt, and still others divide enough times to form large colonies. This 
proliferative potential directly correlates with the expression of the (31 subunit of 
integrin, which helps mediate adhesion to the basal lamina. Clusters of cells with 
high levels of this molecule are found in the basal layer of the intact human epi- 
dermis also, and they are thought to contain the stem cells (Figure 23-7) . We still 
do not have definitive markers for the stem cells themselves, and we still do not 
understand in molecular terms what it is that fundamentally defines the stem- 
cell state. This is one of the key problems of stem- cell biology, and we shall say 
more about it in later sections of the chapter. 

Paradoxically, many if not all of the epidermal cells that generate large 
colonies in culture seem to be cells that themselves as a rule divide rarely. One 
line of evidence comes from experiments in which a pulse of the thymidine ana- 
log bromodeoxyuridine (BrdU) is given to a young animal, in which the epider- 
mis is growing rapidly, or to a mature animal following an injury that provokes 
rapid repair. One then waits for many days or weeks before fixing the tissue and 
staining with an antibody that recognizes DNA in which BrdU has been incorpo- 
rated. The BrdU is taken up by any cell that is in S phase of the division cycle at 
the time of the initial pulse. Because the BrdU would be expected then to be 
diluted by half at each subsequent cell division, any cells that remain strongly 
labeled at the time of fixation are assumed to have undergone few or no divisions 
since replicating their DNA at the time of the pulse. Such label- retaining cells can 
be seen scattered among unlabeled or lightly labeled cells in the basal layer of the 
epidermis even after a period of several months, and large numbers are visible in 
hair follicles, in a region called the bulge (see Figure 23-2). Ingenious labeling 
procedures indicate that the label-retaining cells, in the hair follicle at least, are 
in fact stem cells: when a new cycle of hair growth begins after an old hair has 
been shed, the lab el- retaining cells in the bulge at last divide and contribute the 
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Figure 23-7 The distribution of stem cells 
in human epidermis, and the pattern of 
epidermal cell production. The diagram is 
based on specimens in which the location 
of the stem cells was identified by staining 
for pi integrin, and that of the 
differentiating cells by staining for keratin- 
1 0, a marker of keratinocyte 
differentiation; dividing cells were 
identified by labeling with BrdU, a 
thymidine analog that is incorporated into 
cells in S phase of the cell division cycle. 
The stem cells seem to be clustered near 
the tips of the dermal papillae. They divide 
infrequently, giving rise (through a 
sideways movement) to transit amplifying 
cells, which occupy the intervening 
regions. The transit amplifying cells divide 
frequently, but for a limited number of 
division cycles, at the end of which they 
begin to differentiate and slip out of the 
basal layer. The precise distribution of 
stem cells and transit amplifying cells 
varies from one region of epidermis to 
another. (Adapted from S. Lowell et al., 
Curr. Biol. 1 0:491 -500, 2000. With 
permission from Elsevier.) 
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cells that go to form the regenerated hair follicle. Although it is not certain that all 
the stem cells of the hair follicle have this label-retaining character, some clearly 
do, and the same seems to be true of the stem cells in the interfollicular epider- 
mis. Moreover, basal cells expressing (31 integrin at a high level — the cells that can 
give rise to large colonies in culture — are rarely seen dividing. 

Mixed with these cells there are others that divide more frequently — but only 
for a limited number of division cycles, after which they leave the basal layer and 
differentiate. These latter cells are called transit amplifying cells — "transit", 
because they are in transit from a stem-cell character to a differentiated charac- 
ter; and "amplifying", because the division cycles they go through have the effect 
of amplifying the number of differentiated progeny that result from a single stem- 
cell division (Figure 23-8). In this way, a small population of stem cells that divide 
only rarely can generate a plentiful supply of new differentiated cells. 



Figure 23-8 Transit amplifying cells. 

Stem cells in many tissues divide only 
rarely but give rise to transit amplifying 
cells — daughters committed to 
differentiation that go through a limited 
series of more rapid divisions before 
completing the process. In the example 
shown here, each stem cell division gives 
rise in this way to eight terminally 
differentiated progeny. 



Transit Amplifying Divisions Are Part of the Strategy of Growth 
Control 

Transit amplifying cells are a common feature of stem cell systems. This means 
that in most such systems there are few true stem cells and they are mixed with 
a much larger number of progeny cells that have only a limited capacity to 
divide. As discussed in Chapter 20, the same seems to be true not only of normal 
self-renewing tissues but also for many cancers, where only a small minority of 
cells in the tumor cell population are capable of serving as cancer stem cells. 
Why should this be? There are several possible answers, but a part of the expla- 
nation probably lies in the strategy by which large multicellular animals (such as 
mammals) control the sizes of their cell populations. 

The proportions of the parts of the body are mostly determined early, dur- 
ing development, by means of signals that operate over distances of a few hun- 
dred cell diameters at most: for each organ or tissue, a small rudiment or 
founder cell population is delimited in this way. The founder cell populations 
must then grow, but — in mammals at least — only up to a certain definite limit, 
at which point they must stop. 

One way to halt growth at a certain size is by feedback signals that operate 
over much larger distances in the mature organism; we shall see that such sig- 
nals indeed play an important part in controlling the growth of at least some tis- 
sues. Another strategy, however, is to endow each founder cell with an internal 
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program dictating that it shall divide a limited number of times and then stop. 
In this way, short-range signals during development can define the size of large 
final structures (Figure 23-9). But if that is the strategy, how can the adult tissue 
be continually renewed? A solution is to specify the founder cells as stem cells, 
able to continue dividing indefinitely, but producing at each division one daugh- 
ter that remains as a stem cell and one that is programmed to go through a lim- 
ited number of transit amplifying divisions and then stop. 

This is certainly an oversimplified and incomplete account of the control of 
tissue growth and renewal, but it helps to explain why cells that are programmed 
to undergo long sequences of cell divisions and then halt are such a common 
feature of animal development and why tissue renewal by stem cells so often 
involves transit amplifying divisions. 



Figure 23-9 One way to define the size 
of a large organ. In the embryo, short- 
range signals determine small groups of 
cells as founders of the different cell 
populations. Each founder can be 
programmed then to divide a certain 
number of times, giving rise to a large set 
of cells in the adult. If the adult organ is 
to be renewed while maintaining its 
proper size, the founders can be 
programmed to divide as stem cells, 
giving rise at each division, on average, to 
one daughter that remains as a stem cell 
and another that is programmed to go 
through a set number of transit 
amplifying divisions. 



Stem Cells of Some Tissues Selectively Retain Original DNA 
Strands 

Stem cells in many tissues appear to be label-retaining cells. As we have just 
explained, this has generally been assumed to be because, having incorporated 
a tracer such as BrdU into their DNA during a period of BrdU exposure, the stem 
cells then divide rarely, so that the label is only slowly diluted by newly synthe- 
sized DNA. There is, however, another possible interpretation: regardless of 
whether they divide fast or slowly, the stem cells might segregate their DNA 
strands asymmetrically, in such a way that in every division, and for every chro- 
mosome, the specific DNA strand that was originally labeled is retained in the 
daughter cell that remains a stem cell. This original strand would presumably 
have to have acquired some sort of special tag, designating it as a stem-cell 
strand and ensuring that it segregates asymmetrically, into the daughter that 
remains a stem cell (see Figure 23-6), along with all the similarly tagged DNA 
strands of the other chromosomes; in this way, the old labeled strands would be 
retained in the stem cells from cell generation to cell generation. The tag might, 
for example, take the form of some special kinetochore protein that remains 
associated with the old DNA strand at the centromere of each chromosome dur- 
ing DNA replication and then engages with some asymmetry in the mitotic spin- 
dle so as to ensure that the stem- cell daughter receives all the daughter chro- 
mosomes carrying the tag. In each stem-cell generation, the same original 
tagged DNA strands would then serve as templates for production of the new 
sets of DNA strands to be despatched into the transit amplifying cells in the fol- 
lowing generation (Figure 23-10). 

This "immortal strand" hypothesis may seem a lot to swallow, given that no 
mechanism for such tagging and segregation of individual DNA strands has yet 
been identified. Yet there is increasing evidence suggesting that the immortal 
strand hypothesis is correct. Muscle (described later in this chapter) provides an 
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example. When BrdU is supplied during a period of production of muscle stem 
cells and the subsequent fate of the cells is followed as they divide and prolifer- 
ate, it is possible to detect small clones of cells, or pairs of sister cells, within 
which all the BrdU label is concentrated in a single cell, even though all the cells 
share a common origin from a single ancestor cell that took up the label initially. 
Similar observations have been reported in studies of other types of stem cells, 
and, importantly, this behavior has not been seen in cell populations that do not 
contain stem cells. The immortal strand hypothesis would not only explain why 
stem cells retain labeled DNA indefinitely, but would also imply that asymmet- 
ric division is a fundamental stem-cell property, with the corollary that any 
increase in the number of stem cells must require special conditions to confer 
the immortality tag on additional, newly synthesized DNA strands. The immor- 
tal strand hypothesis was originally proposed in the 1970s as a mechanism for 
stem cells to avoid accumulating cancer-promoting mutations during DNA 
replication. Reduction of the risk of cancer could be one of its benefits. 



The Rate of Stem-Cell Division Can Increase Dramatically When 
New Cells Are Needed Urgently 

Whatever the mechanism of stem- cell maintenance may be, the use of transit 
amplifying divisions brings several benefits. First, it means that the number of 
stem cells can be small and their division rate can be low, even when terminally 
differentiated cells have to be produced rapidly in large numbers. This reduces 
the cumulative burden of genetic damage, since most mutations occur in the 
course of DNA replication and mitosis, and mutations occurring in cells that are 
not stem cells are discarded in the course of tissue renewal. The likelihood of 
cancer is thus reduced. If the immortal strand hypothesis is correct, so that stem 
cells always retain the original "immortal" template DNA strands, the risk is still 
further reduced, since most sequence errors introduced during DNA replication 
will be in the newly synthesized strands, which the stem cells ultimately discard. 
Second, and perhaps more important, a low stem-cell division rate in nor- 
mal circumstances allows for dramatic increase when there is an urgent need - 
for example, in wound repair. The stem cells can then be roused to divide 
rapidly, and the additional division cycles can both amplify the stock of stem 
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Figure 23-10 The immortal strand 
hypothesis. (A) Experimental evidence. 
Here, stem cells of skeletal muscle 
(members of the muscle satellite cell 
population, discussed later in this chapter) 
have been placed in culture and allowed to 
divide for 4 days in the presence of BrdU to 
label newly synthesized DNA strands. The 
cells have then been allowed to divide for 
1 day in the absence of BrdU. The 
photographs show a pair of sister cells at 
the end of this procedure: one has 
inherited BrdU, the other has not. This 
implies that daughter chromosomes 
carrying DNA strands synthesized during 
the cell divisions that occurred in the 
presence of BrdU have all been inherited 
by the one cell, while those carrying only 
either pre-existing or subsequently 
synthesized DNA strands have been 
inherited by the other. This phenomenon, 
in which old and new DNA strands are 
asymmetrically allocated to different 
daughter cells, is seen only in cell 
populations that include stem cells. (B) The 
pattern of DNA strand inheritance in stem 
cells according to the immortal strand 
hypothesis. One strand in each 
chromosome in the stem cell is somehow 
tagged as the immortal strand and is 
retained by the stem-cell daughter. (C) This 
original DNA strand remains available 
through all subsequent stem-cell 
generations as a template for production of 
chromosomes of transit amplifying cells. 
(A, from V. Shinin, B. Gayraud-Morel, 
D. Gomes and S.Taj bakhsh, Nat Cell Biol. 
8:677-687, 2006. With permission from 
Macmillan Publishers Ltd.) 
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cells and increase steeply the production of cells committed to terminal differ- 
entiation. Thus, for example, when a patch of hairy skin is cut away, the slowly 
dividing stem cells in the bulge region of surviving hair follicles near the wound 
are switched into rapid proliferation, and some of their progeny move out as 
new stem cells to form fresh interfollicular epidermis to cover the wounded 
patch of body surface. 



Many Interacting Signals Govern Epidermal Renewal 

Cell turnover in the epidermis seems at first glance a simple matter, but the 
simplicity is deceptive. There are many points in the process that have to be 
controlled according to circumstances: the rate of stem-cell division; the prob- 
ability that a stem- cell daughter will remain a stem cell; the number of cell divi- 
sions of the transit amplifying cells; the timing of exit from the basal layer, and 
the time that the cell then takes to complete its differentiation program and be 
sloughed from the surface. Regulation of these steps must enable the epidermis 
to respond to rough usage by becoming thick and callused, and to repair itself 
when wounded. In specialized regions of epidermis, such as those that form 
hair follicles, with their own specialized subtypes of stem cells, yet more con- 
trols are needed to organize the local pattern. 

Each of the control points has its own importance, and a multitude of 
molecular signals is needed to regulate them all, so as to keep the body surface 
always properly covered. As we suggested earlier, one important influence is 
contact with the basal lamina, signaled via integrins in the plasma membrane 
of the cells. If cultured basal keratinocytes are held in suspension, instead of 
being allowed to settle and attach to the bottom of the culture dish, they all stop 
dividing and differentiate. To remain as an epidermal stem cell, it is apparently 
necessary (although not sufficient) to be attached to the basal lamina or other 
extracellular matrix. This requirement helps ensure that the size of the stem cell 
population does not increase without limit. If crowded out of their regular niche 
on the basal lamina, the cells lose their stem cell character. When this rule is bro- 
ken, as in some cancers, the result can be an ever-growing tumor. 

Most of the other cell communication mechanisms described in Chapter 15 
are also implicated in the control of epidermal renewal, either in signaling 
between cells within the epidermis or in signaling between epidermis and der- 
mis. The EGF, FGF, Wnt, Hedgehog, Notch, BMP/TGFp, and integrin signaling 
pathways are all involved (and we shall see that the same is true of most other tis- 
sues). Overactivation of the Hedgehog pathway, for example, can cause epider- 
mal cells to carry on dividing after they have left the basal layer, and mutations in 
components of this pathway are responsible for many epidermal cancers. At the 
same time, Hedgehog signaling helps to guide the choice of differentiation path- 
way: a deficit of Hedgehog signaling leads to loss of sebaceous glands, while an 
excess can cause sebaceous glands to develop in regions where they would never 
normally form. Similarly, loss of Wnt signaling leads to failure of hair follicle 
development, while excessive activation of this pathway causes extra hair follicles 
to form and to grow excessively so that they give rise to tumors. 

Notch signaling, in contrast, seems to restrict the size of the stem cell pop- 
ulation, inhibiting neighbors of stem cells from remaining as stem cells and 
causing them to become transit amplifying cells instead. And TGF(3 has a key 
role in signaling to the dermis during the repair of skin wounds, promoting the 
formation of collagen-rich scar tissue. The precise individual functions of all 
the various signaling mechanisms in the epidermis are only beginning to be 
disentangled. 



The Mammary Gland Undergoes Cycles of Development and 
Regression 

In specialized regions of the body surface, various other types of cells develop 
from the embryonic epidermis. In particular, secretions such as sweat, tears, 
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saliva, and milk are produced by cells segregated in deep-lying glands that orig- 
inate as ingrowths of the epidermis. These epithelial structures have functions 
and patterns of renewal quite different from those of keratinizing regions. 

The mammary glands are the largest and most remarkable of these secre- 
tory organs. They are the defining feature of mammals and an important con- 
cern in many ways: not only for nourishment of babies and attraction of the 
opposite sex, but also as the basis for a large industry — the dairy industry — and 
as the site of some of the commonest forms of cancer. Mammary tissue illus- 
trates most dramatically that developmental processes continue in the adult 
body; and it shows how cell death by apoptosis permits cycles of growth and 
regression. 

Milk production must be switched on when a baby is born and switched off 
when the baby is weaned. During pregnancy, the producer cells of the milk fac- 
tory are generated; at weaning, they are destroyed. A "resting" adult mammary 
gland consists of branching systems of ducts embedded in fatty connective tis- 
sue; this is the future plumbing network that will deliver milk to the nipple. The 
ducts are lined with an epithelium that includes mammary stem cells. These 
stem cells can be identified by a functional test, in which the cells of the mam- 
mary tissue are dissociated, sorted according to the cell surface markers that 
they express, and transplanted back into appropriate host tissue (a mammary 
fat pad). This assay reveals that a small subset of the total epithelial cells have 
stem-cell potential. A single one of these cells, estimated to be about one in 5000 
of the total mammary epithelial population but more concentrated within a 
subpopulation expressing certain markers, can proliferate indefinitely and give 
rise to an entire new mammary gland with all its epithelial cell types. This recon- 
stituted gland is able to go through the full program of differentiation required 
for milk production. As a first step toward milk production, the steroid hor- 
mones that circulate during pregnancy (estrogen and progesterone) cause the 
duct cells to proliferate, increasing their numbers several hundred-fold. In a pro- 
cess that depends on local activation of the Wnt signaling pathway, the terminal 
regions of the ducts grow and branch, forming little dilated outpocketings, or 



Figure 23-1 1 The mammary gland. 

(A) The growth of alveoli from the ducts of 
the mammary gland during pregnancy 
and lactation. Only a small part of the 
gland is shown. The "resting" gland 
contains a small amount of inactive 
glandular tissue embedded in a large 
amount of fatty connective tissue. During 
pregnancy an enormous proliferation of 
the glandular tissue takes place at the 
expense of the fatty connective tissue, 
with the secretory portions of the gland 
developing preferentially to create alveoli. 

(B) One of the milk-secreting alveoli with a 
basket of contractile myoepithelial cells 
(green) embracing it (see also Figure 
23-47E). (C) A single type of secretory 
alveolar cell produces both the milk 
proteins and the milk fat. The proteins are 
secreted in the normal way by exocytosis, 
while the fat is released as droplets 
surrounded by plasma membrane 
detached from the cell. (B, after R. Krstic, 
Die Gewebe des Menschen und der 
Saugetiere. Berlin: Springer-Verlag, 1978; 
C, from D.W. Fawcett, A Textbook of 
Histology, 12th ed. New York: Chapman 
and Hall, 1994.) 
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alveoli, containing secretory cells (Figure 23-11). Milk secretion begins only 
when these cells are stimulated by the different combination of hormones cir- 
culating in the mother after the birth of the baby, especially prolactin from the 
pituitary gland. Prolactin binds to receptors on the surface of the mammary 
epithelial cells and thereby activates a pathway that switches on expression of 
milk protein genes. As in the epidermis, signals from the extracellular matrix, 
mediated by integrins, are also essential: the milk-producing cells can only 
respond to prolactin if they are also in contact with the basal lamina. A further 
tier of hormonal control governs the actual ejection of milk from the breast: the 
stimulus of suckling causes cells of the hypothalamus (in the brain) to release 
the hormone oxytocin, which travels via the bloodstream to act on myoepithelial 
cells. These musclelike cells originate from the same epithelial precursor popu- 
lation as the secretory cells of the breast, and they have long spidery processes 
that embrace the alveoli. In response to oxytocin they contract, thereby squirt- 
ing milk out of the alveoli into the ducts. 

Eventually, when the baby is weaned and suckling stops, the secretory cells 
die by apoptosis, and most of the alveoli disappear. Macrophages rapidly clear 
away the dead cells, matrix metalloproteinases degrade the surplus extracellular 
matrix, and the gland reverts to its resting state. This ending of lactation seems 
to be induced by the accumulation of milk, rather than by a hormonal mecha- 
nism. If one subset of mammary ducts is obstructed so that no milk can be dis- 
charged, the secretory cells that supply it commit mass suicide by apoptosis, 
while other regions of the gland survive and continue to function. The apopto- 
sis is triggered by a complex array of factors that accumulate where milk secre- 
tion is blocked. 

Cell division in the growing mammary gland is regulated not only by hor- 
mones but also by local signals passing between cells within the epithelium and 
between the epithelial cells and the connective tissue, or stroma, in which the 
epithelial cells are embedded. All the signals listed earlier as important in con- 
trolling cell turnover in the epidermis are also implicated in controlling events 
in the mammary gland. Again, signals delivered via integrins play a crucial part: 
deprived of the basal lamina adhesions that activate integrin signaling, the 
epithelial cells fail to respond normally to hormonal signals. Faults in these 
interacting control systems underlie some of the commonest forms of cancer, 
and we need to understand them better. 



Summary 

Skin consists of a tough connective tissue, the dermis, overlaid by a multilayered water- 
proof epithelium, the epidermis. The epidermis is continually renewed from stem cells, 
with a turnover time, in humans, on the order of a month. Stem cells, by definition, are 
not terminally differentiated and have the ability to divide throughout the organism's 
lifetime, yielding some progeny that differentiate and others that remain stem cells. 
The epidermal stem cells lie in the basal layer, attached to the basal lamina; under nor- 
mal conditions, their division rate is low. Progeny that become committed to differen- 
tiation go through several rapid transit amplifying divisions in the basal layer, and 
then stop dividing and move out toward the surface of the skin. They progressively dif- 
ferentiate, switching from expression of one set of keratins to expression of another 
until, eventually, their nuclei degenerate, producing an outer layer of dead keratinized 
cells that are continually shed from the surface. 

The fate of the daughters of a stem cell is controlled by interactions with the basal 
lamina, mediated by integrins and by signals from neighboring cells. Some types of 
stem cells may also be internally programmed to divide asymmetrically so as to cre- 
ate one stem-cell daughter and one daughter committed to eventual differentiation; 
this may involve selective segregation of original "immortal" template DNA strands 
into the stem-cell daughter. Environmental controls, however, allow two stem cells to 
be generated from one during repair processes and can trigger steep increases in the 
rate of stem-cell division. Factors such as Wnt and Hedgehog signal proteins not only 
regulate the rate of cell proliferation according to need, but can also drive specializa- 
tion of epidermal cells to form structures such as hair follicles and sebaceous glands. 



SENSORY EPITHELIA 



These and other organs connected to the epidermis, such as the mammary glands, 
have their own stem cells and their own distinct patterns of cell turnover. In the 
breast, for example, circulating hormones stimulate the cells to proliferate, differenti- 
ate, and make milk; the cessation of suckling triggers the milk- secreting cells to die by 
apoptosis, in response to a combination of factors that build up where milk fails to be 
drained away 
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SENSORY EPITHELIA 

We sense the smells, sounds, and sights of the external world through another 
class of specializations of the epithelium that cover our body surface. The sen- 
sory tissues of the nose, the ears, and the eyes — and, indeed, if we look back to 
origins in the early embryo, the whole of the central nervous system — all arise 
from the same sheet of cells, the ectoderm, that gives rise to the epidermis. These 
structures have several features in common, and related systems of genes gov- 
ern their development (discussed in Chapter 22). They all retain an epithelial 
organization, but it is very different from that of the ordinary epidermis or of the 
glands that derive from it. 

The nose, the ear, and the eye are complex organs, with elaborate devices to 
collect signals from the external world and to deliver them, filtered and concen- 
trated, to the sensory epithelia, where they can act on the nervous system. The 
sensory epithelium in each organ is the key component, although it is small rel- 
ative to all the ancillary apparatus. It is the part that has been most highly con- 
served in evolution, not only from one vertebrate to another, but also between 
vertebrates and invertebrates. 

Within each sensory epithelium lie sensory cells that act as transducers, con- 
verting signals from the outside world into an electrical form that the nervous 
system can interpret. In the nose, the sensory transducers are olfactory sensory 
neurons', in the ear, auditory hair cells', and in the eye, photoreceptors. All of these 
cell types are either neurons or neuron-like. Each carries at its apical end a spe- 
cialized structure that detects the external stimulus and converts it to a change 
in the membrane potential. At its basal end, each makes synapses with neurons 
that relay the sensory information to specific sites in the brain. 



Olfactory Sensory Neurons Are Continually Replaced 

In the olfactory epithelium of the nose (Figure 23-12A), a subset of the epithe- 
lial cells differentiate as olfactory sensory neurons. These cells have modified, 
immotile cilia on their free surfaces (see Figure 15-46), containing odorant 
receptor proteins, and a single axon extending from their basal end toward the 
brain (Figure 23-12B). Supporting cells that span the thickened epithelium and 
have properties similar to those of glial cells in the central nervous system hold 



Figure 23-12 Olfactory epithelium and 
olfactory neurons. (A) Olfactory 
epithelium consists of supporting cells, 
basal cells, and olfactory sensory 
neurons. The basal cells are the stem cells 
for production of the olfactory neurons. 
Six to eight modified cilia project from 
the apex of the olfactory neuron and 
contain the odorant receptors. (B) This 
micrograph shows olfactory neurons in 
the nose of a genetically modified mouse 
in which the LacZ gene has been inserted 
into an odorant receptor locus, so that all 
the cells that would normally express 
that particular receptor now also make 
the enzyme p-galactosidase.The 
(3-galactosidase is detected through the 
blue product of the enzymatic reaction 
that it catalyzes. The cell bodies (dark 
blue) of the marked olfactory neurons, 
lying scattered in the olfactory 
epithelium, send their axons (light blue) 
toward the brain (out of the picture to 
the right). (C) A cross section of the left 
and right olfactory bulbs, stained for 
P-galactosidase. Axons of all the olfactory 
neurons expressing the same odorant 
receptor converge on the same glomeruli 
(red arrows) symmetrically placed within 
the bulbs on the right and left sides of the 
brain. Other glomeruli (unstained) 
receive their inputs from olfactory 
neurons expressing other odorant 
receptors. (B and C, from P. Mombaerts et 
al., Cell 87:675-686, 1 996. With 
permission from Elsevier.) 
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the neurons in place and separate them from one another. The sensory surfaces 
are kept moist and protected by a layer of fluid secreted by cells sequestered in 
glands that communicate with the exposed surface. Even with this protection, 
however, each olfactory neuron survives for only a month or two, and so a third 
class of cells — the basal cells — is present in the epithelium to generate replace- 
ments for the olfactory neurons that are lost. The population of basal cells, lying 
in contact with the basal lamina, includes stem cells for the production of the 
neurons. 

As discussed in Chapter 15, the genome contains a remarkably large num- 
ber of odorant receptor genes — about 1000 in a mouse or a dog, and about 350 
(plus many more that are degenerate and non-functional) in a human. Each 
olfactory neuron most probably expresses only one of these genes, enabling the 
cell to respond to one particular class of odorants (generally small organic 
molecules) sharing some structural feature that the odorant receptor protein 
recognizes. But regardless of the odor, every olfactory neuron responds in the 
same way — it sends a train of action potentials back along its axon to the brain. 
The discriminating sensibility of an individual olfactory neuron is therefore use- 
ful only if its axon delivers its messages to the specific relay station in the brain 
that is dedicated to the particular range of odors that the neuron senses. These 
relay stations are called glomeruli. They are located in structures called the 
olfactory bulbs (one on each side of the brain), with about 1800 glomeruli in 
each bulb (in the mouse) . Olfactory neurons expressing the same odorant recep- 
tor are widely scattered in the olfactory epithelium, but their axons all converge 
on the same glomerulus (Figure 23-12C). As new olfactory neurons are gener- 
ated, replacing those that die, they must in turn send their axons to the correct 
glomerulus. The odorant receptor proteins thus have a second function: guiding 
the growing tips of the new axons along specific paths to the appropriate target 
glomeruli in the olfactory bulbs. If it were not for the continual operation of this 
guidance system, a rose might smell in one month like a lemon, in the next like 
rotting fish. 



Auditory Hair Cells Have to Last a Lifetime 

The sensory epithelium responsible for hearing is the most precisely and 
minutely engineered of all the tissues in the body (Figure 23-13). Its sensory 
cells, the auditory hair cells, are held in a rigid framework of supporting cells 
and overlaid by a mass of extracellular matrix (the tectorial membrane), in a 
structure called the organ ofCorti The hair cells convert mechanical stimuli into 
electrical signals. Each has a characteristic organ-pipe array of giant microvilli 
(called stereocilia) protruding from its surface as rigid rods, filled with cross- 
linked actin filaments, and arranged in ranks of graded height. The dimensions 
of each such array are specified with extraordinary accuracy according to the 
location of the hair cell in the ear and the frequency of sound that it has to 



Figure 23-13 Auditory hair cells. 

(A) A diagrammatic cross section of the 
auditory apparatus (the organ of Corti) in 
the inner ear of a mammal shows the 
auditory hair cells held in an elaborate 
epithelial structure of supporting cells 
and overlaid by a mass of extracellular 
matrix (the tectorial membrane). The 
epithelium containing the hair cells sits 
on the basilar membrane — a thin, 
resilient sheet of tissue that forms a long, 
narrow partition separating two fluid- 
filled channels. Sound creates pressure 
waves in these channels and makes the 
basilar membrane vibrate up and down. 

(B) This scanning electron micrograph 
shows the apical surface of an outer 
auditory hair cell, with the characteristic 
organ-pipe array of giant microvilli 
(stereocilia). The inner hair cells, of which 
there are just 3500 in each human ear, 
are the principal auditory receptors. The 
outer hair cells, roughly four times more 
numerous in humans, are thought to 
form part of a feedback mechanism that 
regulates the mechanical stimulus 
delivered to the inner hair cells. (B, from 
J.D. Pickles, Prog. Neurobiol. 24:1-42, 
1985. With permission from Elsevier.) 
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Figure 23-14 How a relative movement 
of the overlying extracellular matrix 
(the tectorial membrane) tilts the 
stereocilia of auditory hair cells in the 
organ of Corti in the inner ear of a 
mammal. The stereocilia behave as rigid 
rods hinged at the base and bundled 
together at their tips. 
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respond to. Sound vibrations rock the organ of Corti, causing the bundles of 
stereocilia to tilt (Figure 23-14) and mechanically gated ion channels in the 
membranes of the stereocilia to open or close (Figure 23-15). The flow of elec- 
tric charge carried into the cell by the ions alters the membrane potential and 
thereby controls the release of neurotransmitter at the cell's basal end, where the 
cell synapses with a nerve ending. <TCCA> <CATA> 
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Figure 23-1 5 How a sensory hair cell works. (A) The cell functions as a transducer, generating an electrical signal in response to 
sound vibrations that rock the organ of Corti and so cause the stereocilia to tilt. A fine filament runs more or less vertically upward 
from the tip of each shorter stereocilium to attach at a higher point on its adjacent taller neighbor. Tilting the bundle puts tension 
on the filaments, which pull on mechanically gated ion channels in the membrane of the stereocilia. Opening of these channels 
allows an influx of positive charge, depolarizing the hair cell. (B) An electron micrograph of the filaments extending from the tops 
of two stereocilia. Each filament consists, in part at least, of members of the cadherin superfamily of cell-cell adhesion molecules. 
Mutant individuals lacking these specific cadherins lack the filaments and are deaf. 

By extraordinarily delicate mechanical measurements, correlated with electrical recordings from a single hair cell as the 
bundle of stereocilia is deflected by pushing with a flexible glass probe, it is possible to detect an extra "give" of the bundle as the 
mechanically gated channels yield to the applied force and are pulled open. In this way it can be shown that the force required to 
open a single one of the hypothesized channels is about 2 x 1 -13 newtons and that its gate swings through a distance of about 
4 nm as it opens. The mechanism is astonishingly sensitive: the faintest sounds that we can hear have been estimated to stretch 
the filaments by an average of 0.04 nm, which is just under half the diameter of a hydrogen atom. (B, from B. Kachar et al., Proc. 
Natl Acad. Sci. U.S.A. 97:13336-13341, 2000. With permission from National Academy of Sciences.) 
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In humans and other mammals, the auditory hair cells, unlike olfactory neu- 
rons, have to last a lifetime. If they are destroyed by disease, toxins, or excessively 
loud noise, they do not regenerate and the resultant hearing loss is permanent. 
But in other vertebrates, destruction of auditory hair cells triggers the support- 
ing cells to divide and behave as stem cells, generating progeny that can differ- 
entiate as replacements for the hair cells that are lost. With better understanding 
of how this regeneration process is regulated, we may one day be able to induce 
the auditory epithelium to repair itself in humans also. 

So far, one treatment is known that can bring about the partial regeneration 
of auditory hair cells in an adult mammal. The technique uses a virus (an ade- 
novirus) engineered to contain a copy of the Atohl gene, coding for a gene reg- 
ulatory protein that is known to drive the differentiation of hair cells during 
development. Guinea pigs that have been deafened by exposure to a toxin that 
destroys hair cells can be treated by injection of this viral construct into a dam- 
aged ear. Many of the surviving supporting cells then become infected with the 
viral construct and express Atohl. This converts them into functioning hair 
cells, and the animal partially recovers its hearing in the treated ear. 



Most Permanent Cells Renew Their Parts: the Photoreceptor Cells 
of the Retina 

The neural retina is the most complex of the sensory epithelia. It consists of sev- 
eral cell layers organized in a way that seems perverse. The neurons that trans- 
mit signals from the eye to the brain (called retinal ganglion cells) lie closest to 
the external world, so that the light, focused by the lens, must pass through them 
to reach the photoreceptor cells. The photoreceptors, which are classified as 
rods or cones, according to their shape, lie with their photoreceptive ends, or 
outer segments, partly buried in the pigment epithelium (Figure 23-16). Rods 
and cones contain different visual pigments — photosensitive complexes of 
opsin protein with the light- absorbing small molecule retinal Rods, whose 
visual pigment is called rhodopsin, are especially sensitive at low light levels, 
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Figure 23-1 6 The structure of the retina. 

When light stimulates the photoreceptors, 
the resulting electrical signal is relayed via 
interneurons to the ganglion cells, which 
then convey the signal to the brain. A 
population of specialized supporting cells 
(not shown here) occupies the spaces 
between the neurons and photoreceptors 
in the neural retina. (Modified from 
J.E. Dowling and B.B. Boycott, Proc. R. Soc 
Lond. B Biol. Sci. 1 66:80-1 11,1 966. With 
permission from Royal Society.) 
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while cones (of which there are three types in humans, each with a different 
opsin, giving a different spectral response) detect color and fine detail. 

The outer segment of a photoreceptor appears to be a modified cilium with 
a characteristic ciliumlike arrangement of microtubules in the region connect- 
ing the outer segment to the rest of the cell (Figure 23-17). The remainder of the 
outer segment is almost entirely filled with a dense stack of membranes in which 
the photosensitive complexes are embedded; light absorbed here produces an 
electrical response, as discussed in Chapter 15. At their opposite ends, the pho- 
toreceptors form synapses on interneurons, which relay the signal to the retinal 
ganglion cells (see Figure 23-16). 

Photoreceptors in humans, like human auditory hair cells, are permanent 
cells that do not divide and are not replaced if destroyed by disease or by a mis- 
directed laser beam. The photosensitive molecules of visual pigment, however, 
are not permanent but are continually degraded and replaced. In rods (although 
not, curiously, in cones), this turnover is organized in an orderly production line, 
which can be analyzed by following the passage of a cohort of radiolabeled pro- 
tein molecules through the cell after a short pulse of radioactive amino acid 
(Figure 23-18). The radiolabeled proteins can be traced from the Golgi appara- 
tus in the inner segment of the cell to the base of the stack of membranes in the 
outer segment. From here they are gradually displaced toward the tip as new 
material is fed into the base of the stack. Finally (after about 10 days in the rat), 
on reaching the tip of the outer segment, the labeled proteins and the layers of 
membrane in which they are embedded are phagocytosed (chewed off and 
digested) by the cells of the pigment epithelium. 

This example illustrates a general point: even though individual cells of cer- 
tain types persist, very little of the adult body consists of the same molecules 
that were laid down in the embryo. 
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Summary 

Most sensory receptor cells, like epidermal cells and nerve cells, derive from the epithe- 
lium forming the outer surface of the embryo. They transduce external stimuli into elec- 
trical signals, which they relay to neurons via chemical synapses. Olfactory receptor 
cells in the nose are themselves full-fledged neurons, sending their axons to the brain. 
They have a lifetime of only a month or two, and are continually replaced by new cells 
derived from stem cells in the olfactory epithelium. Each olfactory neuron expresses just 



Figure 23-1 7 A rod photoreceptor. 
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Figure 23-1 8 Turnover of membrane 
protein in a rod cell. Following a pulse of 
3 H-leucine, the passage of radiolabeled 
proteins through the cell is followed by 
autoradiography. Red dots indicate sites 
of radioactivity. The method reveals only 
the 3 H-leucine that has been 
incorporated into proteins; the rest is 
washed out during tissue preparation. 
(1 ) The incorporated leucine is first seen 
concentrated in the neighborhood of the 
Golgi apparatus. (2) From there it passes 
to the base of the outer segment into a 
newly synthesized disc of photoreceptive 
membrane. (3-5) New discs are formed at 
a rate of three or four per hour (in a 
mammal), displacing the older discs 
toward the pigment epithelium. 
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one of the hundreds of different olfactory receptor proteins for which genes exist in the 
genome, and the axons from all olfactory neurons expressing the same receptor protein 
navigate to the same glomeruli in the olfactory bulbs of the brain. 

Auditory hair cells — the receptor cells for sound — unlike olfactory receptor cells, 
have to last a lifetime, in mammals at least, although artificial expression of a hair- 
cell differentiation gene, Atohl, can convert surviving supporting cells into hair cells 
where hair cells have been destroyed. Hair cells have no axon but make synaptic con- 
tact with nerve terminals in the auditory epithelium. They take their name from the 
hair-like bundle of stereocilia (giant microvilli) on their outer surface. Sound vibra- 
tions tilt the bundle, pulling mechanically gated ion channels on the stereocilia into 
an open configuration to excite the cell electrically. 

Photoreceptor cells in the retina of the eye absorb photons in molecules of visual 
pigment (opsin protein plus retinal) held in stacks of membrane in the photoreceptor 
outer segments, triggering an electrical excitation by a more indirect intracellular sig- 
naling pathway. Although the photoreceptor cells themselves are permanent and irre- 
placeable, the stacks of opsin-rich membrane that they contain undergo continual 
renewal. 



THE AIRWAYS AND THE GUT 

The examples we have discussed so far represent a small selection of the tissues 
and cell types that derive from the outer layer of the embryo — the ectoderm. 
They are enough, however, to illustrate how diverse these cells can be, in form, 
function, lifestyle, and pattern of replacement. The innermost layer of the 
embryo — the endoderm, forming the primitive gut tube — gives rise to another 
whole zoo of cell types lining the digestive tract and its appendages. We begin 
with the lungs. 



Adjacent Cell Types Collaborate in the Alveoli of the Lungs 

The airways of the lungs are formed by repeated branching of a system of tubes 
that originated in the embryo from an outpocketing of the gut lining, as dis- 
cussed in Chapter 22 (see Figure 22-92). Repeated tiers of branching terminate 
in several hundred million air-filled sacs — the alveoli. Alveoli have thin walls, 
closely apposed to the walls of blood capillaries so as to allow exchange of O2 
and C0 2 with the bloodstream (Figure 23-19). 

To survive, the cells lining the alveoli must remain moist. At the same time, 
they must serve as a gas container that can expand and contract with each 
breath in and out. This creates a problem. When two wet surfaces touch, they 
become stuck together by surface tension in the layer of water between them — 
an effect that operates more powerfully the smaller the scale of the structure. 
There is a risk, therefore, that the alveoli may collapse and be impossible to rein- 
flate. To solve the problem, two types of cells are present in the lining of the alve- 
oli. Type I alveolar cells cover most of the wall: they are thin and flat {squamous) 
to allow gas exchange. Type II alveolar cells are interspersed among them. These 
are plump and secrete surfactant, a phospholipid-rich material that forms a film 
on the free water surfaces and reduces surface tension, making the alveoli easy 
to reinflate even if they collapse. The production of adequate amounts of sur- 
factant in the fetus, starting at about 5 months of pregnancy in humans, marks 
the beginning of the possibility of independent life. Premature babies born 
before this stage are unable to inflate their lungs and breathe; those born after it 
can do so and, with intensive care, can survive. 



Goblet Cells, Ciliated Cells, and Macrophages Collaborate to Keep 
the Airways Clean 

Higher up in the airways we find different combinations of cell types, serving 
different purposes. The air we breathe is full of dust, dirt, and air-borne 
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Figure 23-19 Alveoli in the lung. 

(A) Scanning electron micrograph at low 
magnification, showing the sponge-like 
texture created by the many air-filled 
alveoli. A bronchiole (small tubular airway) 
is seen at the top, communicating with the 
alveoli. (B) Transmission electron 
micrograph of a section through a region 
corresponding to the yellow box in (A) 
showing the alveolar walls, where gas 
exchange occurs. (C) Diagram of the 
cellular architecture of a piece of alveolar 
wall, corresponding to the yellow box in 
(B). (A, from P. Gehr et al., Respir. Physiol. 
44:61-86, 1 981 . With permission from 
Elsevier; B, courtesy of Peter Gehr, from 
D.W. Fawcett, A Textbook of Histology, 1 2th 
ed. New York: Chapman and Hall, 1 994.) 



layer of mucus 
carrying debris 



coordinated 
beating of cilia 
sweeps mucus 
out of lungs 



microorganisms. To keep the lungs clear and healthy, this debris must be con- 
stantly swept out. To perform this task, a relatively thick respiratory epithelium 
lines the larger airways (Figure 23-20). This epithelium consists of three differ- 
entiated cell types: goblet cells (so named because of their shape), which secrete 
mucus; ciliated cells, with cilia that beat; and a small number of endocrine cells, 
secreting serotonin and peptides that act as local mediators. These signal 
molecules affect nerve endings and other neighboring cells in the respiratory 
tract, so as to help regulate the rate of mucus secretion and ciliary beating, the 
contraction of surrounding smooth muscle cells that can constrict the airways, 
and other functions. Basal cells are also present, and serve as stem cells for 
renewal of the epithelium. 

The mucus secreted by the goblet cells forms a viscoelastic blanket about 
5 mjtl thick over the tops of the cilia. The cilia, all beating in the same direction, 
at a rate of about 12 beats per second, sweep the mucus out of the lungs, carry- 
ing with it the debris that has become stuck to it. This conveyor belt for the 
removal of rubbish from the lungs is called the mucociliary escalator. Of course, 
some inhaled particles may reach the alveoli themselves, where there is no esca- 
lator. Here, the unwanted matter is removed by yet another class of specialized 
cells, macrophages, which roam the lungs, engulfing foreign matter and killing 
and digesting bacteria. Many millions of macrophages, loaded with debris, are 
swept out of the lungs every hour on the mucociliary escalator. 

At the upper end of the respiratory tract, the wet mucus-covered respiratory 
epithelium gives way abruptly to stratified squamous epithelium. This cell sheet 
is structured for mechanical strength and protection, and, like epidermis, it con- 
sists of many layers of flattened cells densely packed with keratin. It differs from 
epidermis in that it is kept moist and its cells retain their nucleus even in the out- 
ermost layers. Abrupt boundaries of epithelial cell specialization, such as that 
between the mucous and the stratified squamous epithelium of the respiratory 
tract, are also found in other parts of the body, but very little is known about how 
they are created and maintained. 
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Figure 23-20 Respiratory epithelium. 

The goblet cells secrete mucus, which 
forms a blanket over the tops of the 
ciliated cells. The regular, coordinated 
beating of the cilia sweeps the mucus up 
and out of the airways, carrying any 
debris that is stuck to it. The mechanism 
that coordinates the ciliary beating is a 
mystery, but it seems to reflect an 
intrinsic polarity in the epithelium. If a 
segment of rabbit trachea is surgically 
reversed, it carries on sweeping mucus, 
but in the wrong direction, back down 
toward the lung, in opposition to 
adjacent unreversed portions of trachea. 
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The Lining of the Small Intestine Renews Itself Faster Than Any 
Other Tissue 



Only air-breathing vertebrates have lungs, but all vertebrates, and almost all 
invertebrate animals, have a gut — that is, a digestive tract lined with cells spe- 
cialized for the digestion of food and absorption of the nutrient molecules 
released by the digestion. These two activities are hard to carry on at the same 
time, as the processes that digest food in the lumen of the gut are liable also to 
digest the lining of the gut itself, including the cells that absorb the nutrients. 
The gut uses several strategies to solve the problem. 

The fiercest digestive processes, involving acid hydrolysis as well as enzyme 
action, are conducted in a separate reaction vessel, the stomach. The products 
are then passed on to the small intestine, where the nutrients are absorbed and 
enzymatic digestion continues, but at a neutral pH. The different regions of the 
gut lining consist of correspondingly different mixtures of cell types. The stom- 
ach epithelium includes cells that secrete acid, and other cells that secrete diges- 
tive enzymes that work at acid pH. Conversely, glands (in particular the pan- 
creas) that discharge into the initial segment of the small intestine contain cells 
that secrete bicarbonate to neutralize the acidity, along with other cells that 
secrete digestive enzymes that work at neutral pH. The lining of the intestine, 
downstream from the stomach, contains both absorptive cells and cells special- 
ized for the secretion of mucus, which covers the epithelium with a protective 
coat. In the stomach, too, mucus cells line most exposed surfaces. And, in case 
these measures are not enough, the whole lining of the stomach and intestine is 
continually renewed and replaced by freshly generated cells, with a turnover 
time of a week or less. 

The renewal process has been studied best in the small intestine (Figure 
23-21). The lining of the small intestine (and of most other regions of the gut) is 
a single-layered epithelium. This epithelium covers the surfaces of the villi that 
project into the lumen, and it lines the crypts that descend into the underlying 
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Figure 23-21 Renewal of the gut lining. 

(A) The pattern of cell turnover and the 
proliferation of stem cells in the 
epithelium that forms the lining of the 
small intestine. The colored arrow shows 
the general upward direction of cell 
movement onto the villi, but some cells, 
including a proportion of the goblet and 
enteroendocrine cells, stay behind and 
differentiate while still in the crypts. The 
nondividing differentiated cells (Paneth 
cells) at the very bottom of the crypts 
also have a finite lifetime and are 
continually replaced by progeny of the 
stem cells. (B) Photograph of a section of 
part of the lining of the small intestine, 
showing the villi and crypts. Note how 
mucus-secreting goblet cells (stained red) 
are interspersed among other cell types. 
Enteroendocrine cells are less numerous 
and less easy to identify without special 
stains. See Figure 23-22 for the structure 
of these cells. 



loose 

connective 

tissue 



absorptive 

brush-border 

cells 



direction of 
movement 




nondividing 

differentiated- 

cells 



rapidly dividing 
cells (cycle time 
12 hours) 



slowly dividing stem 
cells (cycle time 
> 24 hours) 



(A) 



nondividing 

differentiated 

Paneth cells 




100 |um 



THE AIRWAYS AND THE GUT 



1437 




absorptive cell 



goblet cell 



enteroendocrine cell 



Paneth cell 



Figure 23-22 The four main differentiated cell types found in the epithelial lining of the small intestine. All of these are 
generated from undifferentiated multipotent stem cells living near the bottoms of the crypts (see Figure 23-21). The 
microvilli on the apical surface of the absorptive (brush-border) cell provide a 30-fold increase of surface area, not only for 
the import of nutrients but also for the anchorage of enzymes that perform the final stages of extracellular digestion, 
breaking down small peptides and disaccharides into monomers that can be transported across the cell membrane. Broad 
yellow arrows indicate direction of secretion or uptake of materials for each type of cell. (After T.L Lentz, Cell Fine Structure. 
Philadelphia: Saunders, 1971; R. Krstic, Illustrated Encyclopedia of Human Histology. Berlin: Springer-Verlag, 1984.) 



connective tissue. Dividing stem cells lie in a protected position in the depths of 
the crypts. These generate four types of differentiated progeny (Figure 23-22): 

1. Absorptive cells (also called brush-border cells or enter ocytes) have densely 
packed microvilli on their exposed surfaces to increase their active surface 
area for the uptake of nutrients. They both absorb nutrients and secrete (or 
carry on their exterior surfaces) hydrolytic enzymes that perform some of 
the final steps of extracellular digestion, breaking down food molecules in 
preparation for transport across the plasma membrane. 

2. Goblet cells (as in respiratory epithelium) secrete mucus. 

3. Paneth cells form part of the innate immune defense system (discussed in 
Chapter 24) and secrete (along with some growth factors) cryptdins — pro- 
teins of the defensin family that kill bacteria (see Figure 24-46). 

4. Enteroendocrine cells, of more than 15 different subtypes, secrete serotonin 
and peptide hormones, such as cholecystokinin (CCK), that act on neurons 
and other cell types in the gut wall and regulate the growth, proliferation, 
and digestive activities of cells of the gut and other tissues. Cholecys- 
tokinin, for example, is released from enteroendocrine cells in response to 
the presence of nutrients in the gut and binds to receptors on nearby sen- 
sory nerve endings, which relay a signal to the brain to stop you feeling 
hungry after you have eaten enough. 

The absorptive, goblet, and enteroendocrine cells travel mainly upward 
from the stem- cell region, by a sliding movement in the plane of the epithelial 
sheet, to cover the surfaces of the villi. In analogy with the epidermis, it is 
thought that the most rapidly proliferating precursor cells in the crypt are in a 
transit amplifying stage, already committed to differentiation but undergoing 
several divisions on their way out of the crypt, before they stop dividing and dif- 
ferentiate terminally. Within 2-5 days (in the mouse) after emerging from the 
crypts, the cells reach the tips of the villi, where they undergo the initial stages of 
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apoptosis and are finally discarded into the gut lumen. The Paneth cells are pro- 
duced in much smaller numbers and have a different migration pattern. They 
live at the bottom of the crypts, where they too are continually replaced, 
although not so rapidly, persisting for about 20 days (in the mouse) before 
undergoing apoptosis and being phagocytosed by their neighbors. The stem 
cells, too, remain at or near the bottoms of the crypts. What keeps them there, 
and what restricts cell division to the crypts? How are the migrations controlled 
so that some cells move up while others stay down? What are the molecular sig- 
nals that organize the whole stem- cell system, and how do they work? 



Figure 23-23 An adenoma in the human 
colon, compared with normal tissue 
from an adjacent region of the same 
person's colon. The specimen is from a 
patient with an inherited mutation in one 
of his two copies of the Ape gene. A 
mutation in the other Ape gene copy, 
occurring in a colon epithelial cell during 
adult life, has given rise to a clone of cells 
that behave as though the Wnt signaling 
pathway is permanently activated. As a 
result, the cells of this clone form an 
adenoma — an enormous, steadily 
expanding mass of giant cryptlike 
structures. 



Wnt Signaling Maintains the Gut Stem-Cell Compartment 

The beginnings of an answer to these questions came from the study of cancer of 
the colon and rectum (the lower end of the gut). As discussed in Chapter 20, some 
people have a hereditary predisposition to this disease and, in advance of the can- 
cer, develop large numbers of small precancerous tumors (adenomas) in the lin- 
ing of their large intestine (Figure 23-23). The appearance of these tumors sug- 
gests that they have arisen from intestinal crypt cells that have failed to halt their 
proliferation in the normal way, and so have given rise to excessively large crypt- 
like structures. The cause can be traced to mutations in the Ape {Adenomatous 
Polyposis Coif) gene: the tumors arise from cells that have lost both gene copies. 
Ape codes for a protein that prevents inappropriate activation of the Wnt signaling 
pathway, so that loss of APC is presumed to mimic the effect of continual exposure 
to a Wnt signal. The suggestion, therefore, is that Wnt signaling normally keeps 



THE AIRWAYS AND THE GUT 



1439 



crypt cells in a proliferative state, and cessation of exposure to Wnt signaling nor- 
mally makes them stop dividing as they leave the crypt. Indeed, mice that are 
homozygous for a knockout mutation in the Tcf4 gene, coding for a gene regula- 
tory protein that is required as an effector of Wnt signaling in the gut, make no 
crypts, fail to renew their gut epithelium, and die soon after birth. 

Experiments with transgenic mice confirm the importance of Wnt signaling 
and reveal other regulators that act together with Wnt to organize the gut- cell 
production line and keep it running correctly. Using the Cre/lox technique with 
an inducible promoter for Cre (as described in Chapter 8, p. 567), it is possible, 
for example, to knock out the Ape gene in gut epithelial cells abruptly, at any 
chosen time in the life of the mouse. Within a few days, the gut structure is trans- 
formed: the crypt-like regions of proliferative cells are greatly enlarged, villi are 
reduced, and the numbers of terminally differentiated cells are drastically 
diminished. Conversely, one can make a transgenic mouse in which the gut 
epithelial cells all secrete a diffusible inhibitor of Wnt signaling. These animals, 
in which Wnt signaling is blocked, form scarcely any crypts and have hardly any 
proliferating cells in their gut epithelium. Instead, almost all the gut lining cells 
are fully differentiated non-dividing absorptive cells; but goblet cells, enteroen- 
docrine cells, and Paneth cells are missing. Thus Wnt signaling not only keeps 
cells in a proliferative state but is also needed to make them competent to give 
rise to the full range of ultimate differentiated cell types. 



Notch Signaling Controls Gut Cell Diversification 

What then causes the cells to diversify as they differentiate? Notch signaling has 
this function in many other systems, where it mediates lateral inhibition — a 
competitive interaction that drives neighboring cells toward different fates (see 
Chapters 15 and 22, Figures 15-75 and 22-60). All the essential components of 
the Notch pathway are expressed in the crypts; it seems that Wnt signaling 
switches on their expression. When Notch signaling is abruptly blocked by 
knocking out one of these essential components, within a few days all the cells 
in the crypts differentiate as goblet cells and cease dividing; conversely, when 
Notch signaling is artificially activated in all the cells, no goblet cells are pro- 
duced and the crypt-like regions of cell proliferation are enlarged. 

From the effects of all these manipulations of Wnt and Notch signaling, we 
arrive at a simple picture of how the two pathways combine to govern the pro- 
duction of differentiated cells from the intestinal stem cells (Figure 23-24) . Wnt 
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Figure 23-24 How Wnt and Notch 
signaling pathways combine to control 
the production of differentiated cells 
from stem cells in the intestine. (A) Wnt 

signaling maintains proliferation in the 
crypt, where the stem cells reside and 
their progeny become committed to 
diverse fates. (B) Wnt signaling in the 
crypt drives expression of the 
components of the Notch signaling 
pathway in that region; Notch signaling is 
thus active in the crypt and, through 
lateral inhibition, forces cells there to 
diversify. Both pathways must be 
activated in the same cell to keep it as a 
stem cell. The progeny of the stem cell 
continue dividing under the influence of 
Wnt even after they become committed 
to a specific differentiated fate, but the 
timing of these transit amplifying 
divisions in relation to commitment is not 
understood in detail. 
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signaling promotes cell proliferation and confers competence for the full range of 
modes of differentiation, while preventing differentiation from occurring imme- 
diately; in this way, it defines the crypt and maintains the stem cells. But Wnt sig- 
naling also, at the same time, activates the expression of Notch pathway compo- 
nents, and Notch signaling within the crypt population mediates lateral inhibi- 
tion, which forces the cells to diversify, in such a way that some become singled 
out to deliver lateral inhibition, while others receive it. Cells of the former class 
express Notch ligands and activate Notch in their neighbors, but escape from 
Notch activation themselves; as a result, they become committed to differentiate 
as secretory cells. Cells of the latter class — the majority — are kept in an opposite 
state, with Notch activated and ligand expression inhibited; as a result, they 
retain competence to differentiate in any of a variety of ways and to engage in lat- 
eral-inhibition competition with their neighbors. Both classes of cells (with the 
exception of some secretory subtypes) continue dividing so long as they are in the 
crypt, under the influence of Wnt. But when cells leave the crypt and lose expo- 
sure to Wnt signaling, the competition halts, division stops, and the cells differ- 
entiate according to their individual states of Notch activation at that time — as 
absorptive cells if Notch is still activated, as secretory cells if it is not. 

This is certainly not the whole story of events in the crypt. It does not explain, 
for example, how the various subclasses of secretory cells (goblet, entero- 
endocrine, and Paneth) become different from one another. Nor does it say any- 
thing about the distinction that many experts believe to exist between true stem 
cells and the more rapidly dividing transit amplifying cells within the crypt. Sev- 
eral different members of each of the families of Wnt and Notch pathway com- 
ponents are expressed in the crypt epithelium and in the connective tissue 
around the base of the crypts, and probably have differing effects. Moreover, 
other signaling pathways also have crucial functions in organizing the system. 



Ephrin-Eph Signaling Controls the Migrations of Gut Epithelial 
Cells 



One of the most remarkable features of the gut stem- cell system is the steady, 
ordered, selective migration of cells from crypt to villus. Differentiating absorp- 
tive, goblet, and enteroendocrine cells stream out of the crypts and up the villi 
(Figure 23-25); stem cells remain deep in the crypts; and Paneth cells migrate 
right down to the crypt bottoms. This pattern of movements, which segregates 
the different groups of cells, depends on yet another cell-cell signaling pathway. 
Wnt signaling stimulates the expression of cell- surface receptors of the EphB 
family (discussed in Chapter 15) in the cells in the crypt; however, as cells differ- 
entiate, they switch off expression of these receptors, and switch on instead 
expression of the ligands, cell- surface proteins of the ephrinB family (Figure 
23-26A) . There is one exception: the Paneth cells retain expression of the EphB 
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Figure 23-25 Migration of cells from 
crypts onto villi. In this mouse intestine, 
a random subset of epithelial cells was 
induced to undergo a mutation during 
late fetal life, causing the mutant cells to 
express a LacZ transgene, coding for an 
enzyme that can be detected by the blue 
product of the reaction that it catalyses. 
By 6 weeks after birth, each crypt has 
become populated by the progeny of a 
single stem cell and thus appears either 
totally blue or totally white, according to 
whether that stem cell was or was not 
genetically marked in this way. Several 
crypts contribute to a single villus, each 
sending a stream of differentiated cells 
outward along it. (A) Low-magnification 
surface view of part of the lining of the 
intestine, showing many villi, each 
receiving streams of cells from several 
crypts. (B) Detail of a single villus and 
adjacent crypts in cross-section. In the 
example shown, the streams from 
different crypts have remained unmixed, 
so that the villus appears blue on one 
side and white on the other; more 
commonly, there is some mixing, giving a 
less orderly result. (From M.H.Wong, 
J.R. Saam,T.S. Stappenbeck, C.H. Rexer 
and J.I. Gordon, Proc. Natl Acad. Sci. U.S.A. 
97:1 2601 -1 2606, 2000. With permission 
from National Academy of Sciences.) 
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Figure 23-26 Ephrin-Eph signaling 
controls cell segregation between crypts 
and villi. (A) Proliferative cells (including 
the stem cells) and Paneth cells express 
EphB proteins, while the differentiated, 
nondividing cells that cover the villi 
express ephrinB proteins. The repulsive 
cell-cell interaction mediated by 
encounters between these two types of 
cell-surface molecules keeps the two 
classes of cells segregated. 

(B) In a normal gut, as a result, Paneth cells 
{brown stain) and dividing cells remain 
confined to the bottoms of the crypts. 

(C) In a mutant where EphB proteins are 
defective, cells that should stay in the 
crypts wander out onto the villi. (Adapted 
from E. Batlle et al., Cell 1 1 1:251-263, 
2002. With permission from Elsevier.) 
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proteins. Thus EphB expression is characteristic of cells that stay in the crypts, 
while ephrinB expression is characteristic of cells moving out onto the villi. In 
various other tissues, cells expressing Eph proteins are repelled by contacts with 
cells expressing ephrins (see Chapter 22, Figure 22-106). It seems that the same 
is true in the gut lining, and that this mechanism serves to keep the cells in their 
proper places. In EphB knockout mutants, the populations become mixed, so 
that, for example, Paneth cells wander out onto the villi (Figure 23-26C). Loss of 
EphB genes in intestinal cancers correlates with the onset of invasive behavior 
by the tumor cells. 



Wnt, Hedgehog, PDGF, and BMP Signaling Pathways Combine to 
Delimit the Stem-Cell Niche 

Clearly, the gut stem cells cannot exist without the special environment that the 
crypt provides for them. This stem cell niche is as essential as the stem cells 
themselves. How is it created and maintained? The mechanism seems to 
depend on a complex interplay of signals between the epithelium and the 
underlying connective tissue. Exchange of Wnt, Hedgehog, and PDGF signals 
between the two tissues, and between different regions of the crypt-villus axis, 
leads to a restriction of Wnt signaling to the neighborhood of the crypts. The 
epithelial cells in the crypts produce both Wnt proteins and the receptors that 
respond to them, creating a positive feedback loop that presumably helps to 
make Wnt pathway activation in this region self-sustaining. At the same time, 
signals exchanged with the connective tissue lead to expression of BMP pro- 
teins in the connective-tissue cells forming the core of the villi (Figure 23-27). 
These cells signal to the adjacent villus epithelium to inhibit the development 
of misplaced crypts: blocking BMP signaling disrupts the whole organization 
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Figure 23-27 Signals defining the intestinal stem-cell niche. (A) Diagram 
of the signaling system. Signal proteins of the Hedgehog and Wnt families 
are expressed by the epithelial cells in the base of each crypt, which also 
express Wnt receptors and experience high levels of Wnt pathway 
activation. The connective-tissue cells underlying the epithelium express 
both Hedgehog receptors and Wnt receptors. The combined effect of the 
signals from the crypt base, perhaps in conjunction with other signals, is to 
provoke the connective-tissue cells that lie in the core of each villus to 
express BMP proteins. The BMP proteins act on the epithelium of the villus, 
preventing its cells from forming crypts. (B) Cross section of a region of 
normal intestinal epithelium. The brown stain marks proliferative cells, 
which are confined to the crypts. (C) Similarly stained section of intestine of 
a transgenic mouse expressing an inhibitor of BMP signaling. Crypts 
containing dividing cells have developed ectopically, along the sides of the 
misshapen villi. (B and C, courtesy of A. Haramis et al., Science 
303:1684-1686, 2004. With permission from AAAS.) 



and causes misplaced crypts to form as invaginations of proliferating epithe- 
lium along the sides of the villi. 



The Liver Functions as an Interface Between the Digestive Tract 
and the Blood 

As we have seen, the functions of the gut are divided between a variety of cell 
types. Some cells are specialized for the secretion of hydrochloric acid, others for 
the secretion of enzymes, others for the absorption of nutrients, and so on. 
Some of these cell types are closely intermingled in the wall of the gut, whereas 
others are segregated in large glands that communicate with the gut and origi- 
nate in the embryo as outgrowths of the gut epithelium. 

The liver is the largest of these glands. It develops at a site where a major vein 
runs close to the wall of the primitive gut tube, and the adult organ retains a spe- 
cial relationship with the blood. Cells in the liver that derive from the primitive 
gut epithelium — the hepatocytes — are arranged in interconnected sheets and 
cords, with blood-filled spaces called sinusoids running between them (Figure 
23-28). The blood is separated from the surface of the hepatocytes by a single 
layer of flattened endothelial cells that covers the exposed faces of the hepato- 
cytes. This structure facilitates the chief functions of the liver, which depend on 
the exchange of metabolites between hepatocytes and the blood. 

The liver is the main site at which nutrients that have been absorbed from 
the gut and then transferred to the blood are processed for use by other cells of 
the body. It receives a major part of its blood supply directly from the intestinal 
tract (via the portal vein). Hepatocytes synthesize, degrade, and store a vast 
number of substances. They play a central part in the carbohydrate and lipid 
metabolism of the body as a whole, and they secrete most of the protein found 
in blood plasma. At the same time, the hepatocytes remain connected with the 
lumen of the gut via a system of minute channels (or canaliculi) and larger ducts 
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(see Figure 23-28B,C) and secrete into the gut by this route both waste products 
of their metabolism and an emulsifying agent, bile, which helps in the absorp- 
tion of fats. Hepatocytes are big cells, and about 50% of them (in an adult 
human) are polyploid, with two, four, eight, or even more times the normal 
diploid quantity of DNA per cell. 

In contrast to the rest of the digestive tract, there seems to be remarkably lit- 
tle division of labor within the population of hepatocytes. Each hepatocyte 
seems able to perform the same broad range of metabolic and secretory tasks. 
These fully differentiated cells can also divide repeatedly, when the need arises, 
as we explain next. 



Liver Cell Loss Stimulates Liver Cell Proliferation 

The liver illustrates in a striking way one of the great unsolved problems of 
developmental and tissue biology: what determines the size of an organ of the 
body, or the quantity of one type of tissue relative to another? For different 
organs, the answers are almost certainly different, but there is scarcely any case 
in which the mechanism is well understood. 

Hepatocytes normally live for a year or more and are renewed at a slow rate. 
Even in a slowly renewing tissue, however, a small but persistent imbalance 
between the rate of cell production and the rate of cell death would lead to dis- 
aster. If 2% of the hepatocytes in a human divided each week but only 1% died, 
the liver would grow to exceed the weight of the rest of the body within 8 years. 
Homeostatic mechanisms must operate to adjust the rate of cell proliferation or 
the rate of cell death, or both, so as to keep the organ at its normal size. This size, 
moreover, needs to be matched to the size of the rest of the body. Indeed, when 
the liver of a small dog is grafted into a large dog, it rapidly grows to almost the 
size appropriate to the host; conversely, when the liver is grafted from a large dog 
into a small one, it shrinks. 

Direct evidence for the homeostatic control of liver cell proliferation comes 
from experiments in which large numbers of hepatocytes are removed surgically 
or are intentionally killed by poisoning with carbon tetrachloride. Within a day 
or so after either sort of damage, a surge of cell division occurs among the sur- 
viving hepatocytes, quickly replacing the lost tissue. (If the hepatocytes them- 
selves are totally eliminated, another class of cells, located in the bile ducts, can 
serve as stem cells for the genesis of new hepatocytes, but usually there is no 
need for this.) If two -thirds of a rat's liver is removed, for example, a liver of 
nearly normal size can regenerate from the remainder by hepatocyte prolifera- 
tion within about 2 weeks. Although many molecules have been implicated in 
the triggering of this reaction, one of the most important is a protein called hep- 
atocyte growth factor. It stimulates hepatocytes to divide in culture, and its pro- 
duction increases steeply (by poorly understood mechanisms) in response to 
liver damage. 
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Figure 23-28 The structure of the liver. 

(A) A scanning electron micrograph of a 
portion of the liver, showing the irregular 
sheets and cords of hepatocytes and the 
many small channels, or sinusoids, for the 
flow of blood. The larger channels are 
vessels that distribute and collect the 
blood that flows through the sinusoids. 

(B) Detail of a sinusoid (enlargement of 
region similar to that marked by yellow 
rectangle at lower right in [A]). 

(C) Schematized diagram of the fine 
structure of the liver. A single thin sheet 
of endothelial cells with interspersed 
macrophagelike Kupffer cells separates 
the hepatocytes from the bloodstream. 
Small holes in the endothelial sheet, 
called fenestrae (Latin for "windows"), 
allow the exchange of molecules and 
small particles between the hepatocytes 
and the bloodstream. Besides 
exchanging materials with the blood, the 
hepatocytes form a system of tiny bile 
canaliculi into which they secrete bile, 
which is ultimately discharged into the 
gut via bile ducts. The real structure is 
less regular than this diagram suggests. 
(A and B, courtesy of Pietro M. Motta, 
University of Rome "La Sapienza.") 
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The balance between cell births and cell deaths in the adult liver (and other 
organs too) does not depend exclusively on the regulation of cell proliferation: 
cell survival controls also play a part. If an adult rat is treated with the drug phe- 
nobarbetal, for example, hepatocytes are stimulated to divide, causing the liver 
to enlarge. When the phenobarbital treatment is stopped, hepatocyte cell death 
greatly increases until the liver returns to its original size, usually within a week 
or so. The mechanism of this type of cell survival control is unknown, but it has 
been suggested that hepatocytes, like most vertebrate cells, depend on signals 
from other cells for their survival and that the normal level of these signals can 
support only a certain standard number of hepatocytes. If the number of hepa- 
tocytes rises above this (as a result of phenobarbital treatment, for example), 
hepatocyte death will automatically increase to bring their number back down. 
It is not known how the appropriate levels of survival factors are maintained. 



Tissue Renewal Does Not Have to Depend on Stem Cells: 
Insulin-Secreting Cells in the Pancreas 

Most of the organs of the respiratory and digestive tract, including the lungs, the 
stomach, and the pancreas, contain a subpopulation of endocrine cells similar 
to the enteroendocrine cells in the intestines and, like them, generated in the 
epithelium under the control of the Notch signaling pathway. The insulin- secret- 
ing cells (P cells) of the pancreas belong in this category. Their mode of renewal 
has a special importance, because it is the loss of these cells (through autoim- 
mune attack) that is responsible for Type I (juvenile -onset) diabetes and a sig- 
nificant factor also in the Type II (adult-onset) form of the disease. In a normal 
pancreas, they are sequestered in cell clusters, called islets ofLangerhans (Figure 
23-29), where they are grouped with related enteroendocrine cells, secreting 
other hormones. The islets contain no obvious subset of cells specialized to act 
as stem cells, yet fresh (3 cells are continually generated within them. Where do 
these new cells come from? 

The question has been answered by study of transgenic mice in which an 
ingenious variant of the Cre-Lox technique (described in Chapter 8) was used to 
produce a marker mutation just in those cells that were expressing the insulin 
gene at the time a drug was given to activate Cre. In this way, the only cells that 
became labeled and transmitted the label to their progeny were those that were 
already differentiated (3 cells at the time of the treatment. When the mice were 
analyzed as much as a year later, all the new (3 cells carried the label, implying 
that they were descendants of already- differentiated (3 cells, and not of some 
undifferentiated stem cell. As in the liver, it seems that the population of differ- 
entiated cells here is renewed and enlarged by simple duplication of existing dif- 
ferentiated cells, and not by means of stem cells. 
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Figure 23-29 An islet of Langerhans in 
the pancreas. The insulin-secreting cells 
(p cells) are stained green by 
immunofluorescence. Cell nuclei are 
stained purple with a DNA dye. The 
surrounding pancreatic exocrine cells 
(secreting digestive enzymes and 
bicarbonate via ducts into the gut) are 
unstained, except for their nuclei. Within 
the islet, close to its surface, there are 
also small numbers of cells (unstained) 
secreting hormones such as glucagon. 
The insulin-secreting cells replace 
themselves by simple duplication, 
without need of specialized stem cells. 
(Adapted from a photograph courtesy of 
Yuval Dor. © 2004 Yuval Dor, The Hebrew 
University, Jerusalem.) 
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Summary 

The lung performs a simple function — gas exchange — hut its housekeeping systems 
are complex. Surfactant- secreting cells help to keep the alveoli from collapsing. 
Macrophages constantly scour the alveoli for dirt and microorganisms. A mucociliary 
escalator formed by mucus-secreting goblet cells and beating ciliated cells sweeps 
debris out of the airways. 

In the gut, where more potentially damaging chemical processes occur, constant 
rapid cell renewal keeps the absorptive epithelium in good repair. In the small intes- 
tine, stem cells in the crypts generate new absorptive, goblet, enteroendocrine, and 
Paneth cells, replacing most of the epithelial lining of the intestine every week. Wnt sig- 
naling in the crypts maintains the stem-cell population, while Notch signaling drives 
diversification of the stem-cell progeny and limits the number that are consigned to a 
secretory fate. Cell-cell interactions within the epithelium mediated by ephrin-Eph 
signaling control the selective migration of cells from the crypts upward onto the villi. 
Interactions between the epithelium and the stroma, involving the Wnt, Hedgehog, 
PDGF, and BMP pathways organize the pattern of crypts and villi, thereby creating the 
niches that stem cells inhabit. 

The liver is a more protected organ, but it too can rapidly adjust its size up or down 
by cell proliferation or cell death when the need arises. Differentiated hepatocytes 
remain able to divide throughout life, showing that a specialized class of stem cells is 
not always needed for tissue renewal. Similarly, the population of insulin- producing 
cells in the pancreas is enlarged and renewed by simple duplication of existing insulin- 
producing cells. 



BLOODVESSELS, LYMPHATICS, AND ENDOTHELIAL 
CELLS 



From the tissues that derive from the embryonic ectoderm and endoderm, we 
turn now to those derived from mesoderm. This middle layer of cells, sand- 
wiched between ectoderm and endoderm, grows and diversifies to provide 
many sorts of supportive functions. It gives rise to the body's connective tissues, 
blood cells, and blood and lymphatic vessels, as well as muscle, kidney, and 
many other structures and cell types. We begin with blood vessels. 

Almost all tissues depend on a blood supply, and the blood supply depends on 
endothelial cells, which form the linings of the blood vessels. Endothelial cells 
have a remarkable capacity to adjust their number and arrangement to suit local 
requirements. They create an adaptable life- support system, extending by cell 
migration into almost every region of the body. If it were not for endothelial cells 
extending and remodeling the network of blood vessels, tissue growth and repair 
would be impossible. Cancerous tissue is as dependent on a blood supply as is 
normal tissue, and this has led to a surge of interest in endothelial cell biology. By 
blocking the formation of new blood vessels through drugs that act on endothelial 
cells, it may be possible to block the growth of tumors (discussed in Chapter 20). 



Endothelial Cells Line All Blood Vessels and Lymphatics 

The largest blood vessels are arteries and veins, which have a thick, tough wall of 
connective tissue and many layers of smooth muscle cells (Figure 23-30). The 
wall is lined by an exceedingly thin single sheet of endothelial cells, the endothe- 
lium, separated from the surrounding outer layers by a basal lamina. The 
amounts of connective tissue and smooth muscle in the vessel wall vary accord- 
ing to the vessel's diameter and function, but the endothelial lining is always 
present. In the finest branches of the vascular tree — the capillaries and sinu- 
soids — the walls consist of nothing but endothelial cells and a basal lamina (Fig- 
ure 23-31), together with a few scattered — but functionally important — peri- 
cytes. These are cells of the connective-tissue family, related to vascular smooth 
muscle cells, that wrap themselves around the small vessels (Figure 23-32). 
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Figure 23-30 Diagram of a small artery 
in cross section. The endothelial cells, 
although inconspicuous, are the 
fundamental component. Compare with 
the capillary in Figure 23-31. 
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Figure 23-31 Capillaries. (A) Electron micrograph of a cross section of a small capillary in the pancreas. The wall is formed 
by a single endothelial cell surrounded by a basal lamina. (B) Scanning electron micrograph of the interior of a capillary in a 
glomerulus of the kidney, where blood filtration occurs to produce urine. Here, as in the liver (see Figure 23-28), the 
endothelial cells are specialized to form a sieve-like structure, with fenestrae, constructed rather like the pores in the 
nuclear envelope of eucaryotic cells, allowing water and most molecules to pass freely out of the bloodstream. (A, from 
R.P. Bolender, J. Cell Biol. 61 :269-287, 1 974. With permission from The Rockefeller University Press; B, courtesy of Steve 
Gschmeissner and David Shima.) 

Less obvious than the blood vessels are the lymphatic vessels. These carry no 
blood and have much thinner and more permeable walls than the blood vessels. 
They provide a drainage system for the fluid (lymph) that seeps out of the blood 
vessels, as well as an exit route for white blood cells that have migrated from 
blood vessels into the tissues. Less happily, they often also provide the path by 
which cancer cells escape from a primary tumor to invade other tissues. The lym- 
phatics form a branching system of tributaries all ultimately discharging into a 
single large lymphatic vessel, the thoracic duct, which opens into a large vein 
close to the heart. Like blood vessels, lymphatics are lined with endothelial cells. 

Thus, endothelial cells line the entire blood and lymphatic vascular system, 
from the heart to the smallest capillary, and control the passage of materials — and 
the transit of white blood cells — into and out of the bloodstream. Arteries, veins, 
and lymphatics all develop from small vessels constructed primarily of endothe- 
lial cells and a basal lamina: connective tissue and smooth muscle are added later 
where required, under the influence of signals from the endothelial cells. 



Endothelial Tip Cells Pioneer Angiogenesis 

To understand how the vascular system comes into being and how it adapts to 
the changing needs of tissues, we have to understand endothelial cells. How do 
they become so widely distributed, and how do they form channels that connect 
in just the right way for blood to circulate through the tissues and for lymph to 
drain back to the bloodstream? 

Endothelial cells originate at specific sites in the early embryo from precur- 
sors that also give rise to blood cells. From these sites the early embryonic 
endothelial cells migrate, proliferate, and differentiate to form the first rudi- 
ments of blood vessels — a process called vasculogenesis. Subsequent growth and 
branching of the vessels throughout the body is mainly by proliferation and 
movement of the endothelial cells of these first vessels, in a process called 
angiogenesis. 

Angiogenesis occurs in a broadly similar way in the young organism as it 
grows and in the adult during tissue repair and remodeling. We can watch the 
behavior of the cells in naturally transparent structures, such as the cornea of the 
eye or the fin of a tadpole, or in tissue culture, or in the embryo. The embryonic 
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Figure 23-32 Pericytes. The scanning 
electron micrograph shows pericytes 
wrapping their processes around a small 
blood vessel (a post-capillary venule) in 
the mammary gland of a cat. Pericytes 
are present also around capillaries, but 
are much more sparsely distributed there. 
(From T. Fujiwara and Y. Uehara, Am. J. 
Anat. 170:39-54, 1984. With permission 
from Wiley-Liss.) 
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Figure 23-33 Angiogenesis. (A) A new 
blood capillary forms by the sprouting of 
an endothelial cell from the wall of an 
existing small vessel. An endothelial tip 
cell, with many filopodia, leads the 
advance of each capillary sprout. The 
endothelial stalk cells trailing behind the 
tip cell become hollowed out to form a 
lumen. (B) Blood capillaries sprouting in 
the retina of an embryonic mouse. 
(C) A similar specimen, but with a red dye 
injected into the bloodstream, revealing 
the capillary lumen opening up behind 
the tip cell. (B and C, from H. Gerhardt et 
al., J. Cell Biol. 161:1 163-1 1 77, 2003. With 
permission from The Rockefeller 
University Press.) 



retina, which blood vessels invade according to a predictable timetable, is a con- 
venient example for experimental study. Each new vessel originates as a capil- 
lary sprout from the side of an existing capillary or small venule (Figure 
23-33A). At the tip of the sprout, leading the way, is an endothelial cell with a 
distinctive character. This tip cellhas a pattern of gene expression somewhat dif- 
ferent from that of the endothelial stalk cells following behind it, and while they 
divide, it does not; but the tip cell's most striking feature is that it puts out many 
long filopodia, resembling those of a neuronal growth cone (Figure 23-33B). 
The stalk cells, meanwhile, become hollowed out to form a lumen (see Figure 
23-33A). One can watch this process in the transparent zebrafish embryo: the 
individual cells develop internal vacuoles that join up with those of their neigh- 
bors to create a continuous multicellular tube. <GTTG> 

The endothelial tip cells that pioneer the growth of normal capillaries not 
only look like neuronal growth cones, but also respond similarly to signals in the 
environment. In fact, many of the same guidance molecules are involved, 
including semaphorins, netrins, slits, and ephrins, along with the corresponding 
receptors, which are expressed in the tip cells and guide the vascular sprouts 
along specific pathways in the embryo, often in parallel with nerves. Perhaps the 
most important of the guidance molecules for endothelial cells, however, is one 
that is specifically dedicated to the control of vascular development: vascular 
endothelial growth factor, or VEGF. We shall have more to say about it below. 



Different Types of Endothelial Cells Form Different Types of Vessel 

To create a new circuit for blood flow, a vascular sprout must continue to grow 
out until it encounters another sprout or vessel with which it can connect. The 
rules of connection presumably have to be selective, to prevent the formation 
of undesirable short circuits and to keep the blood and lymphatic systems 
properly segregated. In fact, endothelial cells of developing arterial, venous, 
and lymphatic vessels express different genes and have different surface prop- 
erties. These differences evidently help guide the various types of vessels along 
different paths, control the selective formation of connections, and govern the 
development of different types of wall as the vessel enlarges. Arterial endothe- 
lial cells, in the embryo at least, express the transmembrane protein ephrinB2, 
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for example, while the venous arterial cells express the corresponding receptor 
protein, EphB4 (discussed in Chapter 15). These molecules mediate signaling at 
sites of cell-cell contact, and they are essential for the development of a properly 
organized network of vessels. 

Expression of the gene regulatory protein Proxl distinguishes the endothe- 
lial cells of lymphatic vessels from arterial and venous endothelial cells. This 
gene switches on in a subset of endothelial cells in the wall of a large vein (the 
cardinal vein) in the embryo, converting them into lymphatic progenitors. From 
these, the whole of the lymphatic vasculature derives by sprouting as described 
above. Proxl causes the lymphatic endothelial cells to express receptors for a 
different member of the VEGF family of guidance molecules, as well as proteins 
that prevent the lymphatic cells from forming connections with blood vessels. 



Tissues Requiring a Blood Supply Release VEGF; Notch Signaling 
Between Endothelial Cells Regulates the Response 

Almost every cell, in almost every tissue of a vertebrate, is located within 50-100 
jam of a blood capillary. What mechanism ensures that the system of blood ves- 
sels branches into every nook and cranny? How is it adjusted so perfectly to the 
local needs of the tissues, not only during normal development but also in 
pathological circumstances? Wounding, for example, induces a burst of capillary 
growth in the neighborhood of the damage, to satisfy the high metabolic 
requirements of the repair process (Figure 23-34). Local irritants and infections 
also cause a proliferation of new capillaries, most of which regress and disap- 
pear when the inflammation subsides. Less benignly, a small sample of tumor 
tissue implanted in the cornea, which normally lacks blood vessels, causes 
blood vessels to grow quickly toward the implant from the vascular margin of the 
cornea; the growth rate of the tumor increases abruptly as soon as the vessels 
reach it. 

In all these cases, the invading endothelial cells respond to signals pro- 
duced by the tissue that they invade. The signals are complex, but a key part is 
played by vascular endothelial growth factor (VEGF), a distant relative of 
platelet- derived growth factor (PDGF). The regulation of blood vessel growth to 
match the needs of the tissue depends on the control of VEGF production, 
through changes in the stability of its mRNA and in its rate of transcription. The 
latter control is relatively well understood. A shortage of oxygen, in practically 
any type of cell, causes an increase in the intracellular concentration of a gene 
regulatory protein called hypoxia- inducible factor la (HIFla). HIFla stimu- 
lates transcription of Vegf (and of other genes whose products are needed when 
oxygen is in short supply). The VEGF protein is secreted, diffuses through the 
tissue (with different isoforms of VEGF diffusing to different extents), and acts 
on nearby endothelial cells, stimulating them to proliferate, to produce pro- 
teases to help them digest their way through the basal lamina of the parent cap- 
illary or venule, and to form sprouts. The tip cells of the sprouts detect the VEGF 
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Figure 23-34 New capillary formation in 
response to wounding. Scanning 
electron micrographs of casts of the 
system of blood vessels surrounding the 
margin of the cornea show the reaction 
to wounding. The casts are made by 
injecting a resin into the vessels and 
letting the resin set; this reveals the 
shape of the lumen, as opposed to the 
shape of the cells. Sixty hours after 
wounding many new capillaries have 
begun to sprout toward the site of injury, 
which is just above the top of the picture. 
Their oriented outgrowth reflects a 
chemotactic response of the endothelial 
cells to an angiogenic factor released at 
the wound. (Courtesy of Peter C. Burger.) 
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gradient and move toward its source. (Other growth factors, including some 
members of the fibroblast growth factor family, can also stimulate angiogenesis, 
mediating reactions to other conditions such as inflammation.) 

As the new vessels form, bringing blood to the tissue, the oxygen concentra- 
tion rises, HIFloc activity declines, VEGF production is shut off, and angiogene- 
sis comes to a halt (Figure 23-35). As in all signaling systems, it is as important 
to switch the signal off correctly as to switch it on. In normal well- oxygenated 
tissue, continual degradation of the HIFloc protein keeps the concentration of 
HIFloc low: in the presence of oxygen, an oxygen- requiring enzyme modifies 
HIFloc so as to target it for degradation. Degradation in turn requires the prod- 
uct of another gene, coding for an E3 ubiquitin ligase subunit, which is defective 
in a rare disorder called von Hippel-Lindau (VHL) syndrome. People with this 
condition are born with only one functional copy of the Vhl gene; mutations 
occurring at random in the body then give rise to cells with two defective gene 
copies. These cells contain large quantities of HIF1 regardless of oxygen avail- 
ability, triggering the continual overproduction of VEGF. The result is develop- 
ment of hemangioblastomas, tumors that contain dense masses of blood ves- 
sels. The mutant cells that produce the VEGF are apparently themselves encour- 
aged to proliferate by the over-rich nourishment provided by the excess blood 
vessels, creating a vicious cycle that promotes tumor growth. Loss of the VHL 
gene product also gives rise to other tumors as well as hemangioblastomas, by 
mechanisms that may be independent of effects on angiogenesis. 

This is not the whole story of how angiogenesis is controlled, however. VEGF 
and related factors from the target tissue are essential to stimulate and guide 
angiogenesis, but interactions between one endothelial cell and another, medi- 
ated by the Notch signaling pathway, also have a critical role. These interactions 
control which cells will be singled out to behave as tip cells, extending filopodia 
and crawling forward to create new vascular sprouts, and they are required to 
bring this motile behavior to a halt when it is time to stop. Thus, when endothe- 
lial sprouts meet and join up to form a vascular circuit, they normally switch off 
to reduce their sprouting activities. The effect depends on a specific Notch lig- 
and, called Delta4, which is expressed in tip cells and activates Notch in their 
neighbors; Notch activation leads to reduced expression of VEGF receptors, 
making the neighbors of the tip cell unresponsive to VEGF. In mutants where 
Notch signaling is defective, sprouting behavior continues inappropriately and 
fails to be confined to tip cells. The result is an excessively dense network of ill- 
organized, dysfunctional vessels that carry little or no blood. 
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Figure 23-35 The regulatory mechanism 
controlling blood vessel growth 
according to a tissue's need for oxygen. 

Lack of oxygen triggers the secretion of 
VEGF, which stimulates angiogenesis. 
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Signals from Endothelial Cells Control Recruitment of Pericytes 
and Smooth Muscle Cells to Form the Vessel Wall 

The vascular network is continually remodeled as it grows and adapts. A newly 
formed vessel may enlarge; or it may sprout side branches; or it may regress. 
Smooth muscle and other connective-tissue cells that pack themselves around 
the endothelium (see Figure 23-32) help to stabilize vessels as they enlarge. This 
process of vessel wall formation begins with recruitment of pericytes. Small 
numbers of these cells travel outward in company with the stalk cells of each 
endothelial sprout. The recruitment and proliferation of pericytes and smooth 
muscle cells to form a vessel wall depend on PDGF-B secreted by the endothe- 
lial cells and on PDGF receptors in the pericytes and smooth muscle cells. In 
mutants lacking this signal protein or its receptor, these vessel wall cells in many 
regions are missing. As a result, the embryonic blood vessels develop microa- 
neurysms — microscopic pathological dilatations — that eventually rupture, as 
well as other abnormalities, reflecting the importance of signals exchanged in 
both directions between the exterior cells of the wall and the endothelial cells. 

Once a vessel has matured, signals from the endothelial cells to the sur- 
rounding connective tissue and smooth muscle continue to regulate the vessel's 
function and structure. For example, the endothelial cells have mechanorecep- 
tors that allow them to sense the shear stress due to flow of blood over their sur- 
face. The cells react by generating and releasing the gas NO, thereby signaling to 
the surrounding cells and inducing changes in the vessel's diameter and wall 
thickness to accommodate the blood flow Endothelial cells also mediate rapid 
responses to neural signals for blood vessel dilation, by releasing NO to make 
smooth muscle relax in the vessel wall, as discussed in Chapter 15. 



Summary 

Endothelial cells are the fundamental elements of the vascular system. They form a 
single cell layer that lines all blood vessels and lymphatics and regulates exchanges 
between the bloodstream and the surrounding tissues. New vessels originate as 
endothelial sprouts from the walls of existing small vessels. A specialized motile 
endothelial tip cell at the leading edge of each sprout puts outfilopodia that respond 
to gradients of guidance molecules in the environment, leading the growth of the 
sprout like the growth cone of a neuron. The endothelial stalk cells following behind 
become hollowed out to form a capillary tube. Endothelial cells of developing arteries, 
veins, and lymphatics express different cell-surface proteins, which may control the 
way in which they link up to create the vascular networks. Signals from endothelial 
cells organize the growth and development of the connective-tissue cells that form the 
surrounding layers of the vessel wall. 

A homeostatic mechanism ensures that blood vessels permeate every region of the 
body. Cells that are short of oxygen increase their concentration of hypoxia- inducible 
factor (HIFla), which stimulates the production of vascular endothelial growth factor 
(VEGF). VEGF acts on endothelial cells, causing them to proliferate and invade the 
hypoxic tissue to supply it with new blood vessels. The endothelial cells also interact 
with one another via the Notch pathway. This exchange of Notch signals is necessary 
to limit the number of cells that behave as tip cells and to halt angiogenic behavior 
when tip cells meet. 



RENEWAL BY MULTI POTENT STEM CELLS: BLOOD 
CELL FORMATION 

Blood contains many types of cells, with functions that range from the transport 
of oxygen to the production of antibodies. Some of these cells stay within the 
vascular system, while others use the vascular system only as a means of trans- 
port and perform their function elsewhere. All blood cells, however, have certain 
similarities in their life history. They all have limited life spans and are produced 
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throughout the life of the animal. Most remarkably, they are all generated ulti- 
mately from a common stem cell in the bone marrow. This hemopoietic (blood- 
forming, also called hematopoietic) stem cell is thus multipotent, giving rise to all 
the types of terminally differentiated blood cells as well as some other types of 
cells, such as osteoclasts in bone, which we discuss later. 

Blood cells can be classified as red or white (Figure 23-36). The red blood 
cells, or erythrocytes, remain within the blood vessels and transport O2 and CO2 
bound to hemoglobin. The white blood cells, or leucocytes, combat infection 
and in some cases phagocytose and digest debris. Leucocytes, unlike erythro- 
cytes, must make their way across the walls of small blood vessels and migrate 
into tissues to perform their tasks. In addition, the blood contains large numbers 
of platelets, which are not entire cells but small, detached cell fragments or 
"minicells" derived from the cortical cytoplasm of large cells called megakary- 
ocytes. Platelets adhere specifically to the endothelial cell lining of damaged 
blood vessels, where they help to repair breaches and aid in blood clotting. 



Figure 23-36 Scanning electron 
micrograph of mammalian blood cells 
caught in a blood clot. The larger, more 
spherical cells with a rough surface are 
white blood cells; the smoother, 
flattened cells are red blood cells. 
(Courtesy of Ray Moss.) 



The Three Main Categories of White Blood Cells Are Granulocytes, 
Monocytes, and Lymphocytes 

All red blood cells belong in a single class, following the same developmental 
trajectory as they mature, and the same is true of platelets; but there are many 
distinct types of white blood cells. White blood cells are traditionally grouped 
into three major categories — granulocytes, monocytes, and lymphocytes — 
based on their appearance in the light microscope. 

Granulocytes contain numerous lysosomes and secretory vesicles (or gran- 
ules) and are subdivided into three classes according to the morphology and 
staining properties of these organelles (Figure 23-37). The differences in stain- 
ing reflect major differences of chemistry and function. Neutrophils (also called 
polymorphonuclear leucocytesbecmxse of their multilobed nucleus) are the most 
common type of granulocyte; they phagocytose and destroy microorganisms, 
especially bacteria, and thus have a key role in innate immunity to bacterial 
infection, as discussed in Chapter 25. Basophils secrete histamine (and, in some 
species, serotonin) to help mediate inflammatory reactions; they are closely 
related to mast cells, which reside in connective tissues but are also generated 
from the hemopoietic stem cells. Eosinophils help to destroy parasites and mod- 
ulate allergic inflammatory responses. 

Once they leave the bloodstream, monocytes (see Figure 23-37D) mature 
into macrophages, which, together with neutrophils, are the main "professional 
phagocytes" in the body. As discussed in Chapter 13, both types of phagocytic 
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cells contain specialized lysosomes that fuse with newly formed phagocytic 
vesicles (phagosomes), exposing phagocytosed microorganisms to a barrage of 
enzymatically produced, highly reactive molecules of superoxide (O2") and 
hypochlorite (HOC1, the active ingredient in bleach), as well as to attack by a 
concentrated mixture of lysosomal hydrolases that become activated in the 
phagosome. Macrophages, however, are much larger and longer-lived than neu- 
trophils. They recognize and remove senescent, dead, and damaged cells in 
many tissues, and they are unique in being able to ingest large microorganisms 
such as protozoa. 

Monocytes also give rise to dendritic cells, such as the Langerhans cells scat- 
tered in the epidermis. Like macrophages, dendritic cells are migratory cells that 
can ingest foreign substances and organisms; but they do not have as active an 
appetite for phagocytosis and are instead specialized as presenters of foreign 
antigens to lymphocytes to trigger an immune response. Langerhans cells, for 
example, ingest foreign antigens in the epidermis and carry these trophies back 
to present to lymphocytes in lymph nodes. 

There are two main classes of lymphocytes, both involved in immune 
responses: B lymphocytes make antibodies, while T lymphocytes kill virus- 
infected cells and regulate the activities of other white blood cells. In addition, 




neutrophil 
platelet 
lymphocyte 
eosinophil 
monocyte 
red blood cell 

(E) 



Figure 23-37 White blood cells. 

(A-D) These electron micrographs show 
(A) a neutrophil, (B) a basophil, (C) an 
eosinophil, and (D) a monocyte. Electron 
micrographs of lymphocytes are shown in 
Figure 25-7. Each of the cell types shown 
here has a different function, which is 
reflected in the distinctive types of 
secretory granules and lysosomes it 
contains. There is only one nucleus per 
cell, but it has an irregular lobed shape, 
and in (A), (B), and (C) the connections 
between the lobes are out of the plane of 
section. (E) A light micrograph of a blood 
smear stained with the Romanowsky stain, 
which colors the white blood cells 
strongly. (A-D, from B.A. Nichols et al., 
1 Cell Biol. 50:498-51 5, 1 971 . With 
permission from The Rockefeller University 
Press; E, courtesy of David Mason.) 
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there are lymphocytelike cells called natural killer (NK) cells, which kill some 
types of tumor cells and virus-infected cells. The production of lymphocytes is a 
specialized topic discussed in detail in Chapter 25. Here we concentrate mainly 
on the development of the other blood cells, often referred to collectively as 
myeloid cells. 

Table 23-1 summarizes the various types of blood cells and their functions. 



The Production of Each Type of Blood Cell in the Bone Marrow Is 
Individually Controlled 

Most white blood cells function in tissues other than the blood; blood simply 
transports them to where they are needed. A local infection or injury in any tis- 
sue rapidly attracts white blood cells into the affected region as part of the 
inflammatory response, which helps fight the infection or heal the wound. 

The inflammatory response is complex and is governed by many different 
signal molecules produced locally by mast cells, nerve endings, platelets, and 
white blood cells, as well as by the activation of complement (discussed in 
Chapters 24 and 25). Some of these signal molecules act on nearby capillaries, 
causing the endothelial cells to adhere less tightly to one another but making 
their surfaces adhesive to passing white blood cells. The white blood cells are 
thus caught like flies on flypaper and then can escape from the vessel by 
squeezing between the endothelial cells and using digestive enzymes to crawl 
across the basal lamina. <ACCG> As discussed in Chapter 19, homing receptors 
called selectins mediate the initial binding to endothelial cells, while integrins 
mediate the stronger binding required for the white blood cells to crawl out of 



Table 23-1 Blood Cells 



TYPE OF CELL 


MAIN FUNCTIONS TYPICAL CONCENTRATION 




IN HUMAN BLOOD 






(CELLS/LITER) 


Red blood cells 


transport O2 and CO2 


5x10 12 


(erythrocytes) 






White blood cells 






(leucocytes) 






Granulocytes 






Neutrophils 


phagocytose and destroy invading 


5x10 9 


(polymorphonuclear 


bacteria 




leucocytes) 






Eosinophils 


destroy larger parasites and modulate 
allergic inflammatory responses 


2x10 8 


Basophils 


release histamine (and in some species 
serotonin) in certain immune reactions 


4x10 7 


Monocytes 


become tissue macrophages, which 
phagocytose and digest invading 
microorganisms and foreign bodies as 
well as damaged senescent cells 


4x10 8 


Lymphocytes 






B cells 


make antibodies 


2x10 9 


T cells 


kill virus-infected cells and regulate 
activities of other leucocytes 


1 x10 9 


Natural killer (NK) cells 


kill virus-infected cells and some 
tumor cells 


1 x10 8 


Platelets 


initiate blood clotting 


3x10 n 


(cell fragments arising 






from megakaryocytes 






in bone marrow) 







Humans contain about 5 liters of blood, accounting for 7% of body weight. Red blood cells 
constitute about 45% of this volume and white blood cells about 1%, the rest being the liquid 
blood plasma. 
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Figure 23-38 The migration of white blood cells out of the bloodstream 
during an inflammatory response. The response is initiated by signal 
molecules produced by cells in the neighborhood (mainly in the 
connective tissue) or by complement activation. Some of these mediators 
act on capillary endothelial cells, causing them to loosen their attachments 
to their neighbors so that the capillaries become more permeable. 
Endothelial cells are also stimulated to express selecting cell-surface 
molecules that recognize specific carbohydrates that are present on the 
surface of leucocytes in the blood and cause them to stick to the 
endothelium. The inflamed tissues and local endothelial cells secrete other 
mediators called chemokines, and the chemokines act as chemoattractants, 
causing the bound leucocytes to crawl between the capillary endothelial 
cells into the tissue. 



the blood vessel (see Figure 19-19). Damaged or inflamed tissues and local 
endothelial cells secrete other molecules called chemokines, which act as 
chemoattractants for specific types of white blood cells, causing them to 
become polarized and crawl toward the source of the attractant. As a result, large 
numbers of white blood cells enter the affected tissue (Figure 23-38). 

Other signal molecules produced during an inflammatory response escape 
into the blood and stimulate the bone marrow to produce more leucocytes and 
release them into the bloodstream. The bone marrow is the key target for such 
regulation because, with the exception of lymphocytes and some macrophages, 
most types of blood cells in adult mammals are generated only in the bone mar- 
row. The regulation tends to be cell-type-specific: some bacterial infections, for 
example, cause a selective increase in neutrophils, while infections with some 
protozoa and other parasites cause a selective increase in eosinophils. (For this 
reason, physicians routinely use differential white blood cell counts to aid in the 
diagnosis of infectious and other inflammatory diseases.) 

In other circumstances erythrocyte production is selectively increased — for 
example, in the process of acclimatization when one goes to live at high altitude, 
where oxygen is scarce. Thus, blood cell formation, or hemopoiesis (also called 
hematopoiesis) , necessarily involves complex controls, which regulate the pro- 
duction of each type of blood cell individually to meet changing needs. It is a 
problem of great medical importance to understand how these controls operate. 

In intact animals, hemopoiesis is more difficult to analyze than is cell 
turnover in a tissue such as the epidermis or the lining of the gut, where a sim- 
ple, regular spatial organization makes it easy to follow the process of renewal 
and to locate the stem cells. The hemopoietic tissues do not appear so orderly. 
However, hemopoietic cells have a nomadic lifestyle that makes them more 
accessible to experimental study in other ways. It is easy to obtain dispersed 
hemopoietic cells and to transfer them, without damage, from one animal to 
another. Moreover, the proliferation and differentiation of individual cells and 
their progeny can be observed and analyzed in culture, and numerous molecu- 
lar markers distinguish the various stages of differentiation. Because of this, 
more is known about the molecules that control blood cell production than 
about those that control cell production in other mammalian tissues. Studies of 
hemopoiesis have strongly influenced current ideas about stem-cell systems in 
general. 
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Bone Marrow Contains Hemopoietic Stem Cells 

Routine staining methods allow us to recognize the different types of blood cells 
and their immediate precursors in the bone marrow (Figure 23-39). Here, these 
cells are intermingled with one another, as well as with fat cells and other stro- 
mal cells (connective- tissue cells), which produce a delicate supporting mesh- 
work of collagen fibers and other extracellular matrix components. In addition, 
the whole tissue is richly supplied with thin-walled blood vessels, called blood 
sinuses, into which the new blood cells are discharged. Megakaryocytes are also 
present; these, unlike other blood cells, remain in the bone marrow when 
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Figure 23-39 Bone marrow. (A) A light micrograph of a stained section. The large empty spaces correspond to fat cells, 
whose fatty contents have been dissolved away during specimen preparation. The giant cell with a lobed nucleus is a 
megakaryocyte. (B) A low-magnification electron micrograph. Bone marrow is the main source of new blood cells (except 
for T lymphocytes, which are produced in the thymus). Note that the immature blood cells of a particular type tend to 
cluster in "family groups." (A, courtesy of David Mason; B, from J.A.G. Rhodin, Histology: A Text and Atlas. New York: Oxford 
University Press, 1974.) 



mature and are one of its most striking features, being extraordinarily large 
(diameter up to 60 jam), with a highly polyploid nucleus. They normally lie close 
beside blood sinuses, and they extend processes through holes in the endothe- 
lial lining of these vessels; platelets pinch off from the processes and are swept 
away into the blood (Figure 23-40). <GCAT> 

Because of the complex arrangement of the cells in bone marrow, it is diffi- 
cult to identify in ordinary tissue sections any but the immediate precursors of 
the mature blood cells. The corresponding cells at still earlier stages of develop- 
ment, before any overt differentiation has begun, look confusingly similar, and 
although the spatial distribution of cell types has some orderly features, there is 
no obvious visible characteristic by which we can recognize the ultimate stem 
cells. To identify and characterize the stem cells, we need a functional assay, 
which involves tracing the progeny of single cells. As we shall see, this can be 
done in vitro simply by examining the colonies that isolated cells produce in cul- 
ture. The hemopoietic system, however, can also be manipulated so that such 
clones of cells can be recognized in vivo in the intact animal. 

When an animal is exposed to a large dose of x-rays, most of the hemopoi- 
etic cells are destroyed and the animal dies within a few days as a result of its 



megakaryocyte process 
budding off platelets 
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Figure 23-40 A megakaryocyte among 
other cells in the bone marrow. Its 

enormous size results from its having a 
highly polyploid nucleus. One 
megakaryocyte produces about 10,000 
platelets, which split off from long 
processes that extend through holes in 
the walls of an adjacent blood sinus. 
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inability to manufacture new blood cells. The animal can be saved, however, by 
a transfusion of cells taken from the bone marrow of a healthy, immunologically 
compatible donor. Among these cells there are some that can colonize the irra- 
diated host and permanently reequip it with hemopoietic tissue (Figure 23-41) . 
Such experiments prove that the marrow contains hemopoietic stem cells. They 
also show how we can assay for the presence of hemopoietic stem cells and 
hence discover the molecular features that distinguish them from other cells. 

For this purpose, cells taken from bone marrow are sorted (using a fluores- 
cence-activated cell sorter) according to the surface antigens that they display, 
and the different fractions are transfused back into irradiated mice. If a fraction 
rescues an irradiated host mouse, it must contain hemopoietic stem cells. In this 
way, it has been possible to show that the hemopoietic stem cells are character- 
ized by a specific combination of cell- surface proteins, and by appropriate sort- 
ing we can obtain virtually pure stem cell preparations. The stem cells turn out 
to be a tiny fraction of the bone marrow population — about 1 cell in 10,000; but 
this is enough. As few as five such cells injected into a host mouse with defective 
hemopoiesis are sufficient to reconstitute its entire hemopoietic system, gener- 
ating a complete set of blood cell types, as well as fresh stem cells. 



A Multipotent Stem Cell Gives Rise to All Classes of Blood Cells 

To see what range of cell types a single hemopoietic stem cell can generate, we 
need a way to trace the fate of its progeny. This can be done by marking individ- 
ual stem cells genetically, so that their progeny can be identified even after they 
have been released into the bloodstream. Although several methods have been 
used for this, a specially engineered retrovirus (a retroviral vector carrying a 
marker gene) serves the purpose particularly well. The marker virus, like other 
retroviruses, can insert its own genome into the chromosomes of the cell it 
infects, but the genes that would enable it to generate new infectious virus par- 
ticles have been removed. The marker is therefore confined to the progeny of the 
cells that were originally infected, and the progeny of one such cell can be dis- 
tinguished from the progeny of another because the chromosomal sites of inser- 
tion of the virus are different. To analyze hemopoietic cell lineages, bone mar- 
row cells are first infected with the retroviral vector in vitro and then are trans- 
ferred into a lethally irradiated recipient; DNA probes can then be used to trace 
the progeny of individual infected cells in the various hemopoietic and lym- 
phoid tissues of the host. These experiments show that the individual hemopoi- 
etic stem cell is multipotent and can give rise to the complete range of blood cell 
types, both myeloid and lymphoid, as well as new stem cells like itself (Figure 
23-42). 

Later in this chapter, we explain how the same methods that were developed 
for experimentation in mice can now be used for treatment of disease in 
humans. 
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Figure 23-41 Rescue of an irradiated 
mouse by a transfusion of bone marrow 
cells. An essentially similar procedure is 
used in the treatment of leukemia in 
human patients by bone marrow 
transplantation. 



Commitment Is a Stepwise Process 

Hemopoietic stem cells do not jump directly from a multipotent state into a com- 
mitment to just one pathway of differentiation; instead, they go through a series of 
progressive restrictions. The first step, usually, is commitment to either a myeloid 
or a lymphoid fate. This is thought to give rise to two kinds of progenitor cells, one 
capable of generating large numbers of all the different types of myeloid cells, or 
perhaps of myeloid cells plus B lymphocytes, and the other giving rise to large 
numbers of all the different types of lymphoid cells, or at least T lymphocytes. Fur- 
ther steps give rise to progenitors committed to the production of just one cell 
type. The steps of commitment correlate with changes in the expression of specific 
gene regulatory proteins, needed for the production of different subsets of blood 
cells. These proteins seem to act in a complicated combinatorial fashion: the 
GATA1 protein, for example, is needed for the maturation of red blood cells, but is 
active also at much earlier steps in the hemopoietic pathway. 
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Figure 23-42 A tentative scheme of hemopoiesis. The multipotent stem cell normally divides infrequently to generate 
either more multipotent stem cells, which are self-renewing, or committed progenitor cells, which are limited in the 
number of times that they can divide before differentiating to form mature blood cells. As they go through their divisions, 
the progenitors become progressively more specialized in the range of cell types that they can give rise to, as indicated by 
the branching of the cell-lineage diagram in the region enclosed in the gray box. Many of the details of this part of the 
lineage diagram are still controversial, however. In adult mammals, all of the cells shown develop mainly in the bone 
marrow — except forT lymphocytes, which develop in the thymus, and macrophages and osteoclasts, which develop from 
blood monocytes. Some dendritic cells may also derive from monocytes. 



Divisions of Committed Progenitor Cells Amplify the Number of 
Specialized Blood Cells 

Hemopoietic progenitor cells generally become committed to a particular path- 
way of differentiation long before they cease proliferating and terminally differ- 
entiate. The committed progenitors go through many rounds of cell division to 
amplify the ultimate number of cells of the given specialized type. In this way, a 
single stem-cell division can lead to the production of thousands of differenti- 
ated progeny, which explains why the number of stem cells is such a small frac- 
tion of the total population of hemopoietic cells. For the same reason, a high rate 
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of blood cell production can be maintained even though the stem-cell division 
rate is low. As noted earlier, infrequent division or quiescence is a common fea- 
ture of stem cells in several tissues. By reducing the number of division cycles 
that the stem cells themselves have to undergo in the course of a lifetime, it low- 
ers the risk of generating stem-cell mutations, which would give rise to persis- 
tent mutant clones of cells in the body. It also has another effect: it reduces the 
rate of replicative senescence (discussed in Chapter 17). In fact, hemopoietic 
stem cells that are forced to keep dividing rapidly (through knockout of a gene 
called Gfil that restricts their proliferation rate, or by other means) fail to sustain 
hemopoiesis for a full normal lifespan. 

The stepwise nature of commitment means that the hemopoietic system can 
be viewed as a hierarchical family tree of cells. Multipotent stem cells give rise to 
committed progenitor cells, which are specified to give rise to only one or a few 
blood cell types. The committed progenitors divide rapidly, but only a limited 
number of times, before they terminally differentiate into cells that divide no fur- 
ther and die after several days or weeks. Many cells normally die at the earlier 
steps in the pathway as well. Studies in culture provide a way to find out how the 
proliferation, differentiation, and death of the hemopoietic cells are regulated. 



Stem Cells Depend on Contact Signals From Stromal Cells 

Hemopoietic cells can survive, proliferate, and differentiate in culture if, and 
only if, they are provided with specific signal proteins or are accompanied by 
cells that produce these proteins. If deprived of such proteins, the cells die. For 
long-term maintenance, contact with appropriate supporting cells also seems to 
be necessary: hemopoiesis can be kept going for months or even years in vitro 
by culturing dispersed bone marrow hemopoietic cells on top of a layer of bone- 
marrow stromal cells, which mimic the environment in intact bone marrow. 
Such cultures can generate all the types of myeloid cells, and their long-term 
continuation implies that stem cells, as well as differentiated progeny, are being 
continually produced. 

In the bone marrow, where they normally live, the hemopoietic stem cells 
are mostly located in close contact with the osteoblasts that line the bony sur- 
faces of the marrow cavity — the cells that produce the bone matrix. Treatments 
and mutations that increase or decrease the number of osteoblasts cause corre- 
sponding changes in the numbers of hemopoietic stem cells. This suggests that 
the osteoblasts provide the signals that the hemopoietic stem cells need to keep 
them in their uncommitted stem-cell state, just as the intestinal crypt provides 
the signals needed to maintain stem cells of the gut epithelium. In both systems, 
stem cells are normally confined to a particular niche, and when they leave this 
niche they tend to lose their stem-cell potential (Figure 23-43). Hemopoietic 
stem cells in the bone marrow and elsewhere are also often associated with a 
specialized class of endothelial cells, which may provide them with an alterna- 
tive niche. 

A key feature of the stem-cell niche in the bone marrow, as in the gut, is that 
it provides stimulation of the Wnt signaling pathway. Artificial activation of this 
pathway in cultured hemopoietic stem cells helps them to survive, proliferate, 
and keep their character as stem cells, while blocking Wnt signaling does the 
opposite. Another interaction that is important for the maintenance of 
hemopoiesis came to light through the analysis of mouse mutants with a curi- 
ous combination of defects: a shortage of red blood cells (anemia), of germ cells 
(sterility), and of pigment cells (white spotting of the skin; see Figure 22-86). As 
discussed in Chapter 22, this syndrome results from mutations in either of two 
genes: one, called Kit, codes for a receptor tyrosine kinase; the other codes for 
its ligand. The cell types affected by the mutations all derive from migratory 
precursors, and it seems that these precursors in each case must express the 
receptor and be provided with the ligand by their environment if they are to sur- 
vive and produce progeny in normal numbers. Studies in mutant mice suggest 
that Kit ligand must be membrane-bound to be fully effective, implying that 
normal hemopoiesis requires direct cell-cell contact between the hemopoietic 
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Figure 23-43 Dependence of 
hemopoietic stem cells on contact with 
stromal cells. The contact-dependent 
interaction between Kit and its ligand is 
one of several signaling mechanisms 
thought to be involved in hemopoietic 
stem-cell maintenance. The real system is 
certainly more complex; the dependence 
of hemopoietic cells on contact with 
stromal cells cannot be absolute, since 
small numbers of the functional stem 
cells can be found free in the circulation. 
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Figure 23-44 A developing red blood cell (erythroblast). The cell is 
shown extruding its nucleus to become an immature erythrocyte (a 
reticulocyte), which then leaves the bone marrow and passes into the 
bloodstream. The reticulocyte will lose its mitochondria and ribosomes 
within a day or two to become a mature erythrocyte. Erythrocyte clones 
develop in the bone marrow on the surface of a macrophage, which 
phagocytoses and digests the nuclei discarded by the erythroblasts. 

cells that express Kit receptor protein, and stromal cells (osteoblasts among 
them) that express Kit ligand. 



Factors That Regulate Hemopoiesis Can Be Analyzed in Culture 

While stem cells depend on contact with stromal cells for long-term mainte- 
nance, their committed progeny do not, or at least not to the same degree. Thus, 
dispersed bone marrow hemopoietic cells can be cultured in a semisolid matrix 
of dilute agar or methylcellulose, and factors derived from other cells can be 
added artificially to the medium. Because cells in the semisolid matrix cannot 
migrate, the progeny of each isolated precursor cell remain together as an easily 
distinguishable colony. A single committed neutrophil progenitor, for example, 
may give rise to a clone of thousands of neutrophils. Such culture systems have 
provided a way to assay for the factors that support hemopoiesis and hence to 
purify them and explore their actions. These substances are glycoproteins and 
are usually called colony-stimulating factors (CSFs). Of the growing number of 
CSFs that have been defined and purified, some circulate in the blood and act as 
hormones, while others act in the bone marrow either as secreted local media- 
tors or, like Kit ligand, as membrane-bound signals that act through cell-cell 
contact. The best understood of the CSFs that act as hormones is the glycopro- 
tein erythropoietin, which is produced in the kidneys and regulates erythro- 
poiesis, the formation of red blood cells. 
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Erythropoiesis Depends on the Hormone Erythropoietin 

The erythrocyte is by far the most common type of cell in the blood (see Table 
23-1). When mature, it is packed full of hemoglobin and contains hardly any of 
the usual cell organelles. In an erythrocyte of an adult mammal, even the 
nucleus, endoplasmic reticulum, mitochondria, and ribosomes are absent, hav- 
ing been extruded from the cell in the course of its development (Figure 23-44) . 
The erythrocyte therefore cannot grow or divide; the only possible way of mak- 
ing more erythrocytes is by means of stem cells. Furthermore, erythrocytes have 
a limited life-span — about 120 days in humans or 55 days in mice. Worn-out ery- 
throcytes are phagocytosed and digested by macrophages in the liver and 
spleen, which remove more than 10 11 senescent erythrocytes in each of us each 
day. Young erythrocytes actively protect themselves from this fate: they have a 
protein on their surface that binds to an inhibitory receptor on macrophages 
and thereby prevents their phagocytosis. 

A lack of oxygen or a shortage of erythrocytes stimulates specialized cells in 
the kidney to synthesize and secrete increased amounts of erythropoietin into 
the bloodstream. The erythropoietin, in turn, stimulates the production of more 
erythrocytes. Since a change in the rate of release of new erythrocytes into the 
bloodstream is observed as early as 1-2 days after an increase in erythropoietin 
levels in the bloodstream, the hormone must act on cells that are very close pre- 
cursors of the mature erythrocytes. 

The cells that respond to erythropoietin can be identified by culturing bone 
marrow cells in a semisolid matrix in the presence of erythropoietin. In a few days, 
colonies of about 60 erythrocytes appear, each founded by a single committed 
erythroid progenitor cell. This progenitor depends on erythropoietin for its sur- 
vival as well as its proliferation. It does not yet contain hemoglobin, and it is 
derived from an earlier type of committed erythroid progenitor that does not 
depend on erythropoietin. 
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Table 23-2 Some Colony-stimulating Factors (CSFs) That Influence Blood Cell Formation 




FACTOR 


TARGET CELLS 


PRODUCING CELLS 


RECEPTORS 


Erythropoietin 


CFC-E 


kidney cells 


cytokine family 


Interleukin 3 (IL3) 


multipotent stem cell, most 
progenitor cells, many terminally 
differentiated cells 


T lymphocytes, epidermal cells 


cytokine family 


Granulocyte/macrophage 
CSF (GMCSF) 


GM progenitor cells 


T lymphocytes, endothelial 
cells, fibroblasts 


cytokine family 


Granulocyte CSF (GCSF) 


GM progenitor cells and 
neutrophils 


macrophages, fibroblasts 


cytokine family 


Macrophage CSF (MCSF) 


GM progenitor cells and 
macrophages 


fibroblasts, macrophages, 
endothelial cells 


receptor tyrosine 
kinase family 


Kit ligand 


hemopoietic stem cells 


stromal cells in bone marrow 
and many other cells 


receptor tyrosine 
kinase family 



A second CSF, called interleukin-3 (IL3), promotes the survival and prolifer- 
ation of the earlier erythroid progenitor cells. In its presence, much larger ery- 
throid colonies, each comprising up to 5000 erythrocytes, develop from cultured 
bone marrow cells in a process requiring a week or 10 days. Evidently the 
descendants of the hemopoietic stem cells, after they have become committed 
to an erythroid fate, have to step their way through a further long program of cell 
divisions, changing their character and their dependence on environmental sig- 
nals as they progress toward the final differentiated state. 



Multiple CSFs Influence Neutrophil and Macrophage Production 

The two classes of cells dedicated to phagocytosis, neutrophils and 
macrophages, develop from a common progenitor cell called a granulocyte/ 
macrophage (GM) progenitor cell. Like the other granulocytes (eosinophils and 
basophils), neutrophils circulate in the blood for only a few hours before migrat- 
ing out of capillaries into the connective tissues or other specific sites, where 
they survive for only a few days. They then die by apoptosis and are phagocy- 
tosed by macrophages. Macrophages, in contrast, can persist for months or per- 
haps even years outside the bloodstream, where they can be activated by local 
signals to resume proliferation. 

At least seven distinct CSFs that stimulate neutrophil and macrophage 
colony formation in culture have been defined, and some or all of these are 
thought to act in different combinations to regulate the selective production of 
these cells in vivo. These CSFs are synthesized by various cell types — including 
endothelial cells, fibroblasts, macrophages, and lymphocytes — and their con- 
centration in the blood typically increases rapidly in response to bacterial infec- 
tion in a tissue, thereby increasing the number of phagocytic cells released from 
the bone marrow into the bloodstream. IL3 is one of the least specific of the fac- 
tors, acting on multipotent stem cells as well as on most classes of committed 
progenitor cells, including GM progenitor cells. Various other factors act more 
selectively on committed GM progenitor cells and their differentiated progeny 
(Table 23-2), although in many cases they act on certain other branches of the 
hemopoietic family tree as well. 

All of these CSFs, like erythropoietin, are glycoproteins that act at low con- 
centrations (about 10~ 12 M) by binding to specific cell-surface receptors, as dis- 
cussed in Chapter 15. A few of these receptors are transmembrane tyrosine 
kinases but most belong to the large cytokine receptor family, whose members 
are usually composed of two or more subunits, one of which is frequently shared 
among several receptor types (Figure 23-45). The CSFs not only operate on the 
precursor cells to promote the production of differentiated progeny, they also 
activate the specialized functions (such as phagocytosis and target-cell killing) 
of the terminally differentiated cells. Proteins produced artificially from the 
cloned genes for these factors are strong stimulators of hemopoiesis in experi- 
mental animals. They are now widely used in human patients to stimulate the 
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Figure 23-45 Sharing of subunits 
among CSF receptors. Human IL3 
receptors and GMCSF receptors have 
different a subunits and a common 
p subunit. Their ligands are thought to 
bind to the free a subunit with low 
affinity, and this triggers the assembly of 
the heterodimer that binds the ligand 
with high affinity. 
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regeneration of hemopoietic tissue and to boost resistance to infection — an 
impressive demonstration of how basic cell biological research and animal 
experiments can lead to better medical treatment. 



The Behavior of a Hemopoietic Cell Depends Partly on Chance 

CSFs are defined as factors that promote the production of colonies of differenti- 
ated blood cells. But precisely what effect does a CSF have on an individual 
hemopoietic cell? The factor might control the rate of cell division or the number 
of division cycles that the progenitor cell undergoes before differentiating; it might 
act late in the hemopoietic lineage to facilitate differentiation; it might act early to 
influence commitment; or it might simply increase the probability of cell survival 
(Figure 23-46). By monitoring the fate of isolated individual hemopoietic cells in 
culture, it has been possible to show that a single CSF, such as GMCSF, can exert all 
these effects, although it is still not clear which are most important in vivo. 
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1. Frequency of stem-cell division 

2. Probability of stem-cell death 
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will become a committed progenitor 
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Figure 23-46 Some of the parameters 
through which the production of blood 
cells of a specific type might be 
regulated. Studies in culture suggest that 
colony-stimulating factors (CSFs) can 
affect all of these aspects of hemopoiesis. 
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Studies in vitro indicate, moreover, that there is a large element of chance in 
the way a hemopoietic cell behaves — a reflection, presumably, of "noise" in the 
genetic control system, as discussed in Chapter 7. At least some of the CSFs 
seem to act by regulating probabilities, not by dictating directly what the cell 
shall do. In hemopoietic cell cultures, even if the cells have been selected to be 
as homogeneous a population as possible, there is a remarkable variability in the 
sizes and often in the characters of the colonies that develop. And if two sister 
cells are taken immediately after a cell division and cultured apart under identi- 
cal conditions, they frequently give rise to colonies that contain different types 
of blood cells or the same types of blood cells in different numbers. Thus, both 
the programming of cell division and the process of commitment to a particular 
path of differentiation seem to involve random events at the level of the individ- 
ual cell, even though the behavior of the multicellular system as a whole is reg- 
ulated in a reliable way. The sequence of cell fate restrictions shown in Figure 
23-42 conveys the impression of a program executed with computer-like logic 
and precision. Individual cells may be more quirky and erratic, and may some- 
times progress by other decision pathways from the stem-cell state toward ter- 
minal differentiation. 

Regulation of Cell Survival Is as Important as Regulation of Cell 
Proliferation 

The default behavior of hemopoietic cells in the absence of CSFs is death by 
apoptosis (discussed in Chapter 18). Thus, in principle, the CSFs could regulate 
the numbers of the various types of blood cells entirely through selective control 
of cell survival in this way. There is evidence that the control of cell survival does 
indeed play a central part in regulating the numbers of blood cells, just as it does 
for hepatocytes and many other cell types, as we have already seen. The amount 
of apoptosis in the vertebrate hemopoietic system is enormous: billions of neu- 
trophils die in this way each day in an adult human, for example. In fact, most 
neutrophils produced in the bone marrow die there without ever functioning. 
This futile cycle of production and destruction presumably serves to maintain a 
reserve supply of cells that can be promptly mobilized to fight infection when- 
ever it flares up, or phagocytosed and digested for recycling when all is quiet. 
Compared with the life of the organism, the lives of cells are cheap. 

Too little cell death can be as dangerous to the health of a multicellular 
organism as too much proliferation. In the hemopoietic system, mutations that 
inhibit cell death by causing excessive production of the intracellular apoptosis 
inhibitor Bcl2 promote the development of cancer in B lymphocytes. Indeed, the 
capacity for unlimited self-renewal is a dangerous property for any cell to pos- 
sess, and many cases of leukemia arise through mutations that confer this 
capacity on committed hemopoietic precursor cells that would normally be 
fated to differentiate and die after a limited number of division cycles. 

Summary 

The many types of blood cells, including erythrocytes, lymphocytes, granulocytes, and 
macrophages, all derive from a common multipotent stem cell. In the adult, hemopoi- 
etic stem cells are found mainly in hone marrow, and they depend on signals from the 
marrow stromal (connective-tissue) cells, especially osteoblasts, to maintain their 
stem-cell character. As in some other stem-cell systems, the Wnt signaling pathway 
appears to be critical for stem- cell maintenance, though it is not the only one involved. 
The stem cells normally divide infrequently to produce more stem cells (self- renewal) 
and various committed progenitor cells (transit amplifying cells), each able to give rise 
to only one or a few types of blood cells. The committed progenitor cells divide exten- 
sively under the influence of various protein signal molecules (colony-stimulating fac- 
tors, or CSFs) and then terminally differentiate into mature blood cells, which usually 
die after several days or weeks. 

Studies of hemopoiesis have been greatly aided by in vitro assays in which stem 
cells or committed progenitor cells form clonal colonies when cultured in a semisolid 
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matrix. The progeny of stem cells seem to make their choices between alternative 
developmental pathways in a partly random manner. Cell death by apoptosis, con- 
trolled by the availability ofCSFs, also plays a central part in regulating the numbers 
of mature differentiated blood cells. 



GENESIS, MODULATION, AND REGENERATION OF 
SKELETAL MUSCLE 

The term "muscle" includes many cell types, all specialized for contraction but 
in other respects dissimilar. As noted in Chapter 16, all eucaryotic cells possess 
a contractile system involving actin and myosin, but muscle cells have devel- 
oped this apparatus to a high degree. Mammals possess four main categories of 
cells specialized for contraction: skeletal muscle cells, heart (cardiac) muscle 
cells, smooth muscle cells, and myoepithelial cells (Figure 23-47). These differ 
in function, structure, and development. Although all of them generate contrac- 
tile forces by using organized filament systems based on actin and myosin, the 
actin and myosin molecules employed have somewhat different amino acid 
sequences, are differently arranged in the cell, and are associated with different 
sets of proteins to control contraction. 

Skeletal muscle cells are responsible for practically all movements that are 
under voluntary control. These cells can be very large (2-3 cm long and 100 jiim 
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Figure 23-47 The four classes of muscle 
cells of a mammal. (A) Schematic 
drawings (to scale). (B-E) Scanning 
electron micrographs, showing (B) 
skeletal muscle from the neck of a 
hamster, (C) heart muscle from a rat, 

(D) smooth muscle from the urinary 
bladder of a guinea pig, and 

(E) myoepithelial cells in a secretory 
alveolus from a lactating rat mammary 
gland. The arrows in (C) point to 
intercalated discs — end-to-end junctions 
between the heart muscle cells; skeletal 
muscle cells in long muscles are joined 
end to end in a similar way. Note that the 
smooth muscle is shown at a lower 
magnification than the others. 

(B, courtesy of Junzo Desaki; C, from 
T. Fujiwara, in Cardiac Muscle in 
Handbook of Microscopic Anatomy 
[E.D. Canal, ed.]. Berlin: Springer-Verlag, 
1 986; D, courtesy of Satoshi Nakasiro; 
E, from T. Nagato et a I., Cell Tiss. Res. 
209:1 -1 0, 1 980. With permission from 
Springer-Verlag.) 
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in diameter in an adult human) and are often called muscle fibers because of 
their highly elongated shape. Each one is a syncytium, containing many nuclei 
within a common cytoplasm. The other types of muscle cells are more conven- 
tional, generally having only a single nucleus. Heart muscle cells resemble 
skeletal muscle fibers in that their actin and myosin filaments are aligned in very 
orderly arrays to form a series of contractile units called sarcomeres, so that the 
cells have a striated (striped) appearance. Smooth muscle cells are so named 
because they do not appear striated. The functions of smooth muscle vary 
greatly, from propelling food along the digestive tract to erecting hairs in 
response to cold or fear. Myoepithelial cells also have no striations, but unlike 
all other muscle cells they lie in epithelia and are derived from the ectoderm. 
They form the dilator muscle of the eye's iris and serve to expel saliva, sweat, and 
milk from the corresponding glands, as discussed earlier (see Figure 23-1 1). The 
four main categories of muscle cells can be further divided into distinctive sub- 
types, each with its own characteristic features. 

The mechanisms of muscle contraction are discussed in Chapter 16. Here 
we consider how muscle tissue is generated and maintained. We focus on the 
skeletal muscle fiber, which has a curious mode of development, a striking abil- 
ity to modulate its differentiated character, and an unusual strategy for repair. 



Myoblasts Fuse to Form New Skeletal Muscle Fibers 

Chapter 22 described how certain cells, originating from the somites of a verte- 
brate embryo at a very early stage, become determined as myoblasts, the pre- 
cursors of skeletal muscle fibers. The commitment to be a myoblast depends on 
gene regulatory proteins of at least two families — a pair of homeodomain pro- 
teins called Pax3 and Pax7, and the MyoD family of basic helix-loop-helix pro- 
teins (discussed in Chapter 7). These act in combination to give the myoblast a 
memory of its committed state, and, eventually, to regulate the expression of 
other genes that give the mature muscle cell its specialized character (see Figure 
7-75). After a period of proliferation, the myoblasts undergo a dramatic change 
of state: they stop dividing, switch on the expression of a whole battery of mus- 
cle-specific genes required for terminal differentiation, and fuse with one 
another to form multinucleate skeletal muscle fibers (Figure 23-48). Fusion 
involves specific cell-cell adhesion molecules that mediate recognition between 
newly differentiating myoblasts and fibers. Once differentiation has occurred, 
the cells do not divide and the nuclei never again replicate their DNA. 



Figure 23-48 Myoblast fusion in culture. 

The culture is stained with a fluorescent 
antibody (green) against skeletal muscle 
myosin, which marks differentiated 
muscle cells, and with a DNA-specific dye 
(blue) to show cell nuclei. (A) A short time 
after a change to a culture medium that 
favors differentiation, just two of the 
many myoblasts in the field of view have 
switched on myosin production and have 
fused to form a muscle cell with two 
nuclei (upper right). (B) Somewhat later, 
almost all the cells have differentiated 
and fused. (C) High-magnification view, 
showing characteristic striations (fine 
transverse stripes) in two of the 
multinucleate muscle cells. (Courtesy of 
Jacqueline Gross and Terence Partridge.) 
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Figure 23-49 Fast and slow muscle fibers. Two consecutive cross sections of 
the same piece of adult mouse leg muscle were stained with different 
antibodies, each specific for a different isoform of myosin heavy chain protein, 
and images of the two sections were overlaid in false color to show the 
pattern of muscle fiber types. Fibers stained with antibodies against "fast" 
myosin (gray) are specialized to produce fast-twitch contractions; fibers 
stained with antibodies against "slow" myosin (pink) are specialized to produce 
slow, sustained contractions. The fast-twitch fibers are known as white muscle 
fibers because they contain relatively little of the colored oxygen-binding 
protein myoglobin. The slow muscle fibers are called red muscle fibers 
because they contain much more of it. (Courtesy of Simon Hughes.) 



Myoblasts that have been kept proliferating in culture for as long as two 
years still retain the ability to differentiate and can fuse to form muscle cells in 
response to a suitable change in culture conditions. Appropriate signal proteins 
such as fibroblast or hepatocyte growth factor (FGF or HGF) in the culture 
medium can maintain myoblasts in the proliferative, undifferentiated state: if 
these soluble factors are removed, the cells rapidly stop dividing, differentiate, 
and fuse. The system of controls is complex, however, and attachment to the 
extracellular matrix is also important for myoblast differentiation. Moreover, the 
process of differentiation is cooperative: differentiating myoblasts secrete fac- 
tors that apparently encourage other myoblasts to differentiate. 
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Muscle Cells Can Vary Their Properties by Changing the Protein 
Isoforms They Contain 

Once formed, a skeletal muscle fiber grows, matures, and modulates its charac- 
ter. The genome contains multiple variant copies of the genes encoding many of 
the characteristic proteins of the skeletal muscle cell, and the RNA transcripts of 
many of these genes can be spliced in several ways. As a result, muscle fibers 
produce many variant forms (isoforms) of the proteins of the contractile appa- 
ratus. As the muscle fiber matures, it synthesizes different isoforms, satisfying 
the changing demands for speed, strength, and endurance in the fetus, the new- 
born, and the adult. Within a single adult muscle, several distinct types of skele- 
tal muscle fibers, each with different sets of protein isoforms and different func- 
tional properties, can be found side by side (Figure 23-49). The characteristics 
of the different fiber types are determined partly before birth by the genetic pro- 
gram of development, partly in later life by activity and training. Different 
classes of motor neurons innervate slow muscle fibers (for sustained contrac- 
tion) and fast muscle fibers (for rapid twitch), and the innervation can regulate 
muscle-fiber gene expression and size through the different patterns of electri- 
cal stimulation that these neurons deliver. 



Skeletal Muscle Fibers Secrete Myostatin to Limit Their Own 
Growth 

A muscle can grow in three ways: its fibers can increase in number, in length, or 
in girth. Because skeletal muscle fibers are unable to divide, more of them can 
be made only by the fusion of myoblasts, and the adult number of multinucle- 
ated skeletal muscle fibers is in fact attained early — before birth, in humans. 
Once formed, a skeletal muscle fiber generally survives for the entire lifetime of 
the animal. However, individual muscle nuclei can be added or lost. The enor- 
mous postnatal increase in muscle bulk is achieved by cell enlargement. Growth 
in length depends on recruitment of more myoblasts into the existing multinu- 
cleated fibers, which increases the number of nuclei in each cell. Growth in 
girth, such as occurs in the muscles of weightlifters, involves both myoblast 
recruitment and an increase in the size and numbers of the contractile myofib- 
rils that each muscle fiber nucleus supports. 
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What, then, are the mechanisms that control muscle cell numbers and mus- 
cle cell size? One part of the answer lies in an extracellular signal protein called 
myostatin. Mice with a loss-of-function mutation in the myostatin gene have 
enormous muscles — two to three times larger than normal (Figure 23-50). Both 
the numbers and the size of the muscle cells seem to be increased. Mutations in 
the same gene are present in so-called "double-muscled" breeds of cattle (see 
Figure 17-69): in selecting for big muscles, cattle breeders have unwittingly 
selected for myostatin deficiency. Myostatin belongs to the TGF(3 superfamily of 
signal proteins. It is normally made and secreted by skeletal muscle cells, and it 
acts powerfully on myoblasts, inhibiting both proliferation and differentiation. 
Its function, evidently, is to provide negative feedback to limit muscle growth, in 
adult life as well as during development. The growth of some other organs is 
similarly controlled by a negative-feedback action of a factor that they them- 
selves produce. We shall encounter another example in a later section. 



Some Myoblasts Persist as Quiescent Stem Cells in the Adult 

Even though humans do not normally generate new skeletal muscle fibers in 
adult life, they still have the capacity to do so, and existing muscle fibers can 
resume growth when the need arises. Cells capable of serving as myoblasts are 
retained as small, flattened, and inactive cells lying in close contact with the 
mature muscle cell and contained within its sheath of basal lamina (Figure 
23-51). If the muscle is damaged or stimulated to grow, these satellite cells are 
activated to proliferate, and their progeny can fuse to repair the damaged mus- 
cle or to allow muscle growth. Like myoblasts, they are regulated by myostatin. 
Satellite cells, or some subset of the satellite cells, are thus the stem cells of 
adult skeletal muscle, normally held in reserve in a quiescent state but available 
when needed as a self- renewing source of terminally differentiated cells. Stud- 
ies of these cells have provided some of the clearest evidence for the immortal 
strand hypothesis of asymmetric stem-cell division, as illustrated earlier in Fig- 
ure 23-10). 

The process of muscle repair by means of satellite cells is, nevertheless, lim- 
ited in what it can achieve. In one form of muscular dystrophy, for example, a 
genetic defect in the cytoskeletal protein dystrophin damages differentiated 
skeletal muscle cells. As a result, satellite cells proliferate to repair the damaged 
muscle fibers. This regenerative response is, however, unable to keep pace with 
the damage, and connective tissue eventually replaces the muscle cells, block- 
ing any further possibility of regeneration. A similar loss of capacity for repair 
seems to contribute to the weakening of muscle in the elderly. 

In muscular dystrophy, where the satellite cells are constantly called upon to 
proliferate, their capacity to divide may become exhausted as a result of pro- 
gressive shortening of their telomeres in the course of each cell cycle (discussed 
in Chapter 17). Stem cells of other tissues seem to be limited in the same way, as 
we noted earlier in the case of hemopoietic stem cells: they normally divide only 
at a slow rate, and mutations or exceptional circumstances that cause them to 
divide more rapidly can lead to premature exhaustion of the stem-cell supply. 




Figure 23-50 Regulation of muscle size 
by myostatin. (A) A normal mouse 
compared with a mutant mouse deficient 
in myostatin. (B) Leg of a normal and 
(C) of a myostatin-deficient mouse, with 
skin removed to show the massive 
enlargement of the musculature in the 
mutant. (From S.J. Lee and 
A.C. McPherron, Curr. Opin. Genet. Devel. 
9:604-607, 1999. With permission from 
Elsevier.) 
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Figure 23-51 A satellite cell on a 
skeletal muscle fiber. The specimen is 
stained with an antibody (red) against a 
muscle cadherin, M-cadherin, which is 
present on both the satellite cell and the 
muscle fiber and is concentrated at the 
site where their membranes are in 
contact. The nuclei of the muscle fiber are 
stained green, and the nucleus of the 
satellite cell is stained blue. (Courtesy of 
Terence Partridge.) 
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Summary 

Skeletal muscle fibers are one of four main categories of vertebrate cells specialized for 
contraction, and they are responsible for all voluntary movement. Each skeletal mus- 
cle fiber is a syncytium and develops by the fusion of many myoblasts. Myoblasts pro- 
liferate extensively, but once they have fused, they can no longer divide. Fusion gener- 
ally follows the onset of myoblast differentiation, in which many genes encoding 
muscle- specific proteins are switched on coordinately. Some myoblasts persist in a 
quiescent state as satellite cells in adult muscle; when a muscle is damaged, these cells 
are reactivated to proliferate and to fuse to replace the muscle cells that have been 
lost. They are the stem cells of skeletal muscle. Muscle bulk is regulated homeostati- 
cally by a negative-feedback mechanism, in which existing muscle secretes myostatin, 
which inhibits further muscle growth. 



FIBROBLASTS AND THEIR TRANSFORMATIONS: 
THE CONNECTIVE-TISSUE CELL FAMILY 

Many of the differentiated cells in the adult body can be grouped into families 
whose members are closely related by origin and by character. An important 
example is the family of connective- tissue cells, whose members are not only 
related but also unusually interconvertible. The family includes fibroblasts, car- 
tilage cells, and bone cells, all of which are specialized for the secretion of col- 
lagenous extracellular matrix and are jointly responsible for the architectural 
framework of the body. The connective-tissue family also includes fat cells and 
smooth muscle cells. Figure 23-52 illustrates these cell types and the intercon- 
versions that are thought to occur between them. Connective-tissue cells con- 
tribute to the support and repair of almost every tissue and organ, and the 
adaptability of their differentiated character is an important feature of the 
responses to many types of damage. 



Fibroblasts Change Their Character in Response to Chemical 
Signals 

Fibroblasts seem to be the least specialized cells in the connective-tissue family. 
They are dispersed in connective tissue throughout the body, where they secrete 
a nonrigid extracellular matrix that is rich in type I or type III collagen, or both, 
as discussed in Chapter 19. When a tissue is injured, the fibroblasts nearby pro- 
liferate, migrate into the wound <TGAT>, and produce large amounts of collage- 
nous matrix, which helps to isolate and repair the damaged tissue. Their ability 
to thrive in the face of injury, together with their solitary lifestyle, may explain 
why fibroblasts are the easiest of cells to grow in culture — a feature that has 
made them a favorite subject for cell biological studies (Figure 23-53). 
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Figure 23-52 The family of connective- 
tissue cells. Arrows show the 
interconversions that are thought to 
occur within the family. For simplicity, the 
fibroblast is shown as a single cell type, 
but it is uncertain how many types of 
fibroblasts exist in fact and whether the 
differentiation potential of different types 
is restricted in different ways. 
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Figure 23-53 The fibroblast. (A) A phase-contrast micrograph of 
fibroblasts in culture. (B) These drawings of a living fibroblastlike cell in the 
transparent tail of a tadpole show the changes in its shape and position on 
successive days. Note that while fibroblasts flatten out in culture, they can 
have more complex, process-bearing morphologies in tissues. See also 
Figure 19-54. (A, from E. Pokorna et al., Cell Motil. Cytoskeleton 28:25-33, 
1994; B, redrawn from E. Clark, /Am. J.Anat. 13:351-379, 1912. Both with 
permission from Wiley-Liss.) 

As indicated in Figure 23-52, fibroblasts also seem to be the most versatile of 
connective-tissue cells, displaying a remarkable capacity to differentiate into 
other members of the family There are uncertainties about their interconver- 
sions, however. Fibroblasts in different parts of the body are intrinsically differ- 
ent, and there may be differences between them even in a single region. 
"Mature" fibroblasts with a lesser capacity for transformation may, for example, 
exist side by side with "immature" fibroblasts (often called mesenchymal cells) 
that can develop into a variety of mature cell types. 

The stromal cells of bone marrow, mentioned earlier, provide a good exam- 
ple of connective-tissue versatility. These cells, which can be regarded as a kind 
of fibroblast, can be isolated from the bone marrow and propagated in culture. 
Large clones of progeny can be generated in this way from single ancestral stro- 
mal cells. According to the signal proteins that are added to the culture medium, 
the members of such a clone can either continue proliferating to produce more 
cells of the same type, or can differentiate as fat cells, cartilage cells, or bone 
cells. Because of their self- renewing, multipotent character, they are referred to 
as mesenchymal stem cells. 

Fibroblasts from the dermal layer of the skin are different. When placed in 
the same culture conditions, they do not show the same plasticity. Yet they, too, 
can be induced to change their character. At a healing wound, for example, they 
change their actin gene expression and take on some of the contractile proper- 
ties of smooth muscle cells, thereby helping to pull the wound margins together; 
such cells are called myofibroblasts. More dramatically, if a preparation of bone 
matrix, made by grinding bone into a fine powder and dissolving away the hard 
mineral component, is implanted in the dermal layer of the skin, some of the 
cells there (probably fibroblasts) become transformed into cartilage cells, and a 
little later, others transform into bone cells, thereby creating a small lump of 
bone. These experiments suggest that components in the extracellular matrix 
can dramatically influence the differentiation of connective-tissue cells. 

We shall see that similar cell transformations occur in the natural repair of 
broken bones. In fact, bone matrix contains high concentrations of several sig- 
nal proteins that can affect the behavior of connective-tissue cells. These 
include members of the TGF(3 superfamily including BMPs and TGF(3 itself. 
These factors regulate growth, differentiation, and matrix synthesis by connec- 
tive-tissue cells, exerting a variety of actions depending on the target cell type 
and the combination of other factors and matrix components that are present. 
When injected into a living animal, they can induce the formation of cartilage, 
bone, or fibrous matrix, according to the site and circumstances of injection. 
TGF(3 is especially important in wound healing, where it stimulates the conver- 
sion of fibroblasts into myofibroblasts and promotes the formation of the colla- 
gen-rich scar tissue that gives a healed wound its strength. 
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The Extracellular Matrix May Influence Connective-Tissue Cell 
Differentiation by Affecting Cell Shape and Attachment 

The extracellular matrix may influence the differentiated state of connective-tis- 
sue cells through physical as well as chemical effects. This has been shown in 
studies on cultured cartilage cells, or chondrocytes. Under appropriate culture 
conditions, these cells proliferate and maintain their differentiated character, 
continuing for many cell generations to synthesize large quantities of highly dis- 
tinctive cartilage matrix, with which they surround themselves. If, however, the 
cells are kept at relatively low density and remain as a monolayer on the culture 



FIBROBLASTS AND THEIR TRANSFORMATIONS: THE CONNECTIVE-TISSUE CELL FAMILY 



1469 



dish, a transformation occurs. They lose their characteristic rounded shape, flat- 
ten down on the substratum, and stop making cartilage matrix: they stop pro- 
ducing type II collagen, which is characteristic of cartilage, and start producing 
type I collagen, which is characteristic of fibroblasts. By the end of a month in 
culture, almost all the cartilage cells have switched their collagen gene expres- 
sion and taken on the appearance of fibroblasts. The biochemical change must 
occur abruptly, since very few cells are observed to make both types of collagen 
simultaneously. 

The biochemical change seems to be induced, at least in part, by the change 
in cell shape and attachment. Cartilage cells that have made the transition to a 
fibroblast-like character, for example, can be gently detached from the culture 
dish and transferred to a dish of agarose. By forming a gel around them, the 
agarose holds the cells suspended without any attachment to a substratum, 
forcing them to adopt a rounded shape. In these circumstances, the cells 
promptly revert to the character of chondrocytes and start making type II colla- 
gen again. Cell shape and anchorage may control gene expression through intra- 
cellular signals generated at focal contacts by integrins acting as matrix recep- 
tors, as discussed in Chapter 19. 

For most types of cells, and especially for a connective-tissue cell, the oppor- 
tunities for anchorage and attachment depend on the surrounding matrix, 
which is usually made by the cell itself. Thus, a cell can create an environment 
that then acts back on the cell to reinforce its differentiated state. Furthermore, 
the extracellular matrix that a cell secretes forms part of the environment for its 
neighbors as well as for the cell itself, and thus tends to make neighboring cells 
differentiate in the same way. A group of chondrocytes forming a nodule of car- 
tilage, for example, either in the developing body or in a culture dish, can be 
seen to enlarge by the conversion of neighboring fibroblasts into chondrocytes. 



Osteoblasts Make Bone Matrix 

Cartilage and bone are tissues of very different character; but they are closely 
related in origin, and the formation of the skeleton depends on an intimate part- 
nership between them. 

Cartilage tissue is structurally simple, consisting of cells of a single type — 
chondrocytes — embedded in a more or less uniform highly hydrated matrix con- 
sisting of proteoglycans and type II collagen, whose remarkable properties we 
have already discussed in Chapter 19. The cartilage matrix is deformable, and the 
tissue grows by expanding as the chondrocytes divide and secrete more matrix 
(Figure 23-54) . Bone, by contrast, is dense and rigid; it grows by apposition — that 
is, by deposition of additional matrix on free surfaces. Like reinforced concrete, 
the bone matrix is predominantly a mixture of tough fibers (type I collagen fib- 
rils), which resist pulling forces, and solid particles (calcium phosphate as 
hydroxylapatite crystals), which resist compression. The collagen fibrils in adult 
bone are arranged in regular plywoodlike layers, with the fibrils in each layer 
lying parallel to one another but at right angles to the fibrils in the layers on either 
side. They occupy a volume nearly equal to that occupied by the calcium phos- 
phate. The bone matrix is secreted by osteoblasts that lie at the surface of the 
existing matrix and deposit fresh layers of bone onto it. Some of the osteoblasts 
remain free at the surface, while others gradually become embedded in their own 
secretion. This freshly formed material (consisting chiefly of type I collagen) is 
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Figure 23-54 The growth of cartilage. 

The tissue expands as the chondrocytes 
divide and make more matrix. The freshly 
synthesized matrix with which each cell 
surrounds itself is shaded dark green. 
Cartilage may also grow by recruiting 
fibroblasts from the surrounding tissue 
and converting them into chondrocytes. 
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called osteoid. It is rapidly converted into hard bone matrix by the deposition of 
calcium phosphate crystals in it. Once imprisoned in hard matrix, the original 
bone-forming cell, now called an osteocyte, has no opportunity to divide, 
although it continues to secrete further matrix in small quantities around itself. 
The osteocyte, like the chondrocyte, occupies a small cavity, or lacuna, in the 
matrix, but unlike the chondrocyte it is not isolated from its fellows. Tiny chan- 
nels, or canaliculi, radiate from each lacuna and contain cell processes from the 
resident osteocyte, enabling it to form gap junctions with adjacent osteocytes 
(Figure 23-55) . Although the networks of osteocytes do not themselves secrete or 
erode substantial quantities of matrix, they probably play a part in controlling the 
activities of the cells that do. Blood vessels and nerves run through the tissue, 
keeping the bone cells alive and reacting when the bone is damaged. 

A mature bone has a complex and beautiful architecture, in which dense 
plates of compact bone tissue enclose spaces spanned by light frameworks of 
trabecular bone — a filigree of delicate shafts and flying buttresses of bone tissue, 
with soft marrow in the interstices (Figure 23-56). The creation, maintenance, 
and repair of this structure depend not only on the cells of the connective-tissue 
family that synthesize matrix, but also on a separate class of cells called osteo- 
clasts that degrade it, as we shall discuss below. 



Figure 23-55 Deposition of bone matrix 
by osteoblasts. Osteoblasts lining the 
surface of bone secrete the organic 
matrix of bone (osteoid) and are 
converted into osteocytes as they 
become embedded in this matrix. The 
matrix calcifies soon after it has been 
deposited. The osteoblasts themselves 
are thought to derive from osteogenic 
stem cells that are closely related to 
fibroblasts. 



Most Bones Are Built Around Cartilage Models 

Most bones, and in particular the long bones of the limbs and trunk, originate 
from minute "scale models" formed out of cartilage in the embryo. Each scale 
model grows, and as new cartilage forms, the older cartilage is replaced by bone. 
The process is known as endochondral bone formation. Cartilage growth and 
erosion and bone deposition are so ingeniously coordinated that the adult bone, 
though it may be half a meter long, is almost the same shape as the initial carti- 
laginous model, which was no more than a few millimeters long. 




Figure 23-56 Trabecular and compact 
bone. (A) Low-magnification scanning 
electron micrograph of trabecular bone 
in a vertebra of an adult man. The soft 
marrow tissue has been dissolved away. 
(B) A slice through the head of the femur, 
with bone marrow and other soft tissue 
likewise dissolved away, reveals the 
compact bone of the shaft and the 
trabecular bone in the interior. Because 
of the way in which bone tissue remodels 
itself in response to mechanical load, the 
trabeculae become oriented along the 
principle axes of stress within the bone. 
(A, courtesy of Alan Boyde; B, from 
J.B. Kerr, Atlas of Functional Histology. 
Mosby, 1999.) 
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The process begins in the embryo with the appearance of hazily defined 
"condensations" — groups of embryonic connective tissue cells that become 
more closely packed than their neighbors and begin to express a characteristic 
set of genes — including, in particular, Sox9 and, after a slight delay, Runx2. These 
two genes code for gene regulatory proteins that are critical for cartilage and 
bone development, respectively Mutant cells lacking Sox9 are unable to differ- 
entiate as cartilage but can form bone (and in some parts of the body will make 
bone where cartilage should be). Conversely, animals lacking functional Runx2 
make no bone and are born with a skeleton consisting solely of cartilage. 

Soon after expression of Sox9 has begun, the cells in the core of the conden- 
sation begin to secrete cartilage matrix, dividing and enlarging individually as 
they do so. In this way, they form an expanding rod of cartilage surrounded by 
more densely packed non-cartilage cells. The cartilage cells in the middle seg- 
ment of the rod become hypertrophied (grossly enlarged) and cease dividing; 
and at the same time, they start to secrete Indian Hedgehog — a signal molecule 
of the Hedgehog family. This in turn provokes increased production of certain 
Wnt proteins, which activate the Wnt pathway in cells surrounding the cartilage 
rod. As a result, they switch off expression of Sox9, maintain expression of 
Runx2, and begin to differentiate as osteoblasts, creating a collar of bone around 
the shaft of the cartilage model. Artificial overactivation of the Wnt pathway tips 
a larger proportion of cells into making bone rather than cartilage; an artificial 
block in the Wnt signaling pathway does the opposite. In this system, therefore, 
Wnt signaling controls the choice between alternative paths of differentiation, 
with Sox9 expression leading the way toward cartilage, and Runx2 expression 
leading the way toward bone. 

The hypertrophied cartilage cells in the shaft of the cartilage model soon die, 
leaving large cavities in the matrix, and the matrix itself becomes mineralized, 
like bone, by the deposition of calcium phosphate crystals. Osteoclasts and 
blood vessels invade the cavities and erode the residual cartilage matrix, creat- 
ing a space for bone marrow, and osteoblasts following in their wake begin to 
deposit trabecular bone in parts of the cavity where strands of cartilage matrix 
remain as a template. The cartilage tissue at the ends of the bone is replaced by 
bone tissue at a much later stage, by a somewhat similar process, as shown in 
Figure 23-57. Continuing elongation of the bone, up to the time of puberty, 
depends on a plate of growing cartilage between the shaft and the head of the 
bone. Defective growth of the cartilage in this plate, as a result of a dominant 
mutation in the gene that codes for an FGF receptor (FGFR3), is responsible for 
the commonest form of dwarfism, known as achondroplasia (Figure 23-58). 

The cartilage growth plate is eventually replaced by bone and disappears. 
The only surviving remnant of cartilage in the adult long bone is a thin but 



Figure 23-57 The development of a 
long bone. Long bones, such as the 
femur or the humerus, develop from a 
miniature cartilage model. Uncalcified 
cartilage is shown in light green, calcified 
cartilage in dark green, bone in black, and 
blood vessels in red. The cartilage is not 
converted to bone but is gradually 
replaced by it through the action of 
osteoclasts and osteoblasts, which invade 
the cartilage in association with blood 
vessels. Osteoclasts erode cartilage and 
bone matrix, while osteoblasts secrete 
bone matrix. The process of ossification 
begins in the embryo and is not 
completed until the end of puberty. The 
resulting bone consists of a thick-walled 
hollow cylinder of compact bone 
enclosing a large central cavity occupied 
by the bone marrow. Note that not all 
bones develop in this way. The 
membrane bones of the skull, for 
example, are formed directly as bony 
plates, not from a prior cartilage model. 
(Adapted from D.W. Fawcett, A Textbook 
of Histology, 12th ed. New York: 
Chapman and Hall, 1994.) 



1 472 Chapter 23: Specialized Tissues, Stem Cells, and Tissue Renewal 



Figure 23-58 Achondroplasia. This type of dwarfism occurs in one of 
1 0,000-1 00,000 births; in more than 99% of cases it results from a mutation 
at an identical site in the genome, corresponding to amino acid 380 in the 
FGF receptor FGFR3 (a glycine in the transmembrane domain). The 
mutation is dominant, and almost all cases are due to new, independently 
occurring mutations, implying an extraordinarily high mutation rate at this 
particular site in the genome. The defect in FGF signaling causes dwarfism 
by interfering with the growth of cartilage in developing long bones. (From 
Velasquez's painting of Sebastian de Morra. © Museo del Prado, Madrid.) 



important layer that forms a smooth, slippery covering on the bone surfaces at 
joints, where one bone articulates with another (see Figure 23-57). Erosion of 
this layer of cartilage, through aging, mechanical damage, or autoimmune 
attack, leads to arthritis, one of the commonest and most painful afflictions of 
old age. 



Bone Is Continually Remodeled by the Cells Within It 

For all its rigidity, bone is by no means a permanent and immutable tissue. Run- 
ning through the hard extracellular matrix are channels and cavities occupied by 
living cells, which account for about 15% of the weight of compact bone. These 
cells are engaged in an unceasing process of remodeling: while osteoblasts 
deposit new bone matrix, osteoclasts demolish old bone matrix. This mecha- 
nism provides for continuous turnover and replacement of the matrix in the 
interior of the bone. 

Osteoclasts (Figure 23-59) are large multinucleated cells that originate, like 
macrophages, from hemopoietic stem cells in the bone marrow. The precursor 
cells are released as monocytes into the bloodstream and collect at sites of bone 
resorption, where they fuse to form the multinucleated osteoclasts, which cling 
to surfaces of the bone matrix and eat it away. Osteoclasts are capable of tun- 
neling deep into the substance of compact bone, forming cavities that are then 
invaded by other cells. A blood capillary grows down the center of such a tunnel, 
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Figure 23-59 Osteoclasts. (A) Drawing of an osteoclast in cross section. This giant, multinucleated cell erodes bone matrix. 
The "ruffled border" is a site of secretion of acids (to dissolve the bone minerals) and hydrolases (to digest the organic 
components of the matrix). Osteoclasts vary in shape, are motile, and often send out processes to resorb bone at multiple 
sites. They develop from monocytes and can be viewed as specialized macrophages. (B) An osteoclast on bone matrix, seen 
by scanning electron microscopy. The osteoclast has been crawling over the matrix, eating it away, and leaving a trail of 
pits where it has done so. (A, from R.V. Krstic, Ultrastructure of the Mammalian Cell: An Atlas. Berlin: Springer-Verlag, 1979; 
B, courtesy of Alan Boyde.) 
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Figure 23-60 The remodeling of 
compact bone. Osteoclasts acting 
together in a small group excavate a 
tunnel through the old bone, advancing 
at a rate of about 50 jam per day. 
Osteoblasts enter the tunnel behind 
them, line its walls, and begin to form 
new bone, depositing layers of matrix at 
a rate of 1 -2 jim per day. At the same 
time, a capillary sprouts down the center 
of the tunnel. The tunnel eventually 
becomes filled with concentric layers of 
new bone, with only a narrow central 
canal remaining. Each such canal, besides 
providing a route of access for osteoclasts 
and osteoblasts, contains one or more 
blood vessels that transport the nutrients 
the bone cells require for survival. 
Typically, about 5-10% of the bone in a 
healthy adult mammal is replaced in this 
way each year. (After Z.F.G. Jaworski, 
B. Duck and G. Sekaly, J. Anal 
1 33:397-405, 1 981 . With permission from 
Blackwell Publishing.) 



and the walls of the tunnel become lined with a layer of osteoblasts (Figure 
23-60). To produce the plywoodlike structure of compact bone, these 
osteoblasts lay down concentric layers of new matrix, which gradually fill the 
cavity, leaving only a narrow canal surrounding the new blood vessel. Many of 
the osteoblasts become trapped in the bone matrix and survive as concentric 
rings of osteocytes. At the same time as some tunnels are filling up with bone, 
others are being bored by osteoclasts, cutting through older concentric systems. 
The consequences of this perpetual remodeling are beautifully displayed in the 
layered patterns of matrix observed in compact bone (Figure 23-61). 



Osteoclasts Are Controlled by Signals From Osteoblasts 

The osteoblasts that make the matrix also produce the signals that recruit and 
activate the osteoclasts to degrade it. Two proteins appear to have this role: one 
is Macrophage-CSF (MCSF), which we already encountered in our account of 
hemopoiesis (see Table 23-2); the other is TNF11, a member of the TNF family 
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Figure 23-61 A transverse section 
through a compact outer portion of a 
long bone. The micrograph shows the 
outlines of tunnels that have been 
formed by osteoclasts and then filled in 
by osteoblasts during successive rounds 
of bone remodeling. The section has 
been prepared by grinding. The hard 
matrix has been preserved, but not the 
cells. Lacunae and canaliculi that were 
occupied by osteocytes are clearly visible, 
however. The alternating bright and dark 
concentric rings correspond to an 
alternating orientation of the collagen 
fibers in the successive layers of bone 
matrix laid down by the osteoblasts that 
lined the wall of the canal during life. 
(This pattern is revealed here by viewing 
the specimen between partly crossed 
polarizing filters.) Note how older 
systems of concentric layers of bone have 
been partly cut through and replaced by 
newer systems. 
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(also called RANKL). The behavior of the osteoblasts in attracting their oppo- 
nents may seem self-defeating, but it has the useful function of localizing osteo- 
clasts in the tissue where they are needed. 

To prevent excessive degradation of matrix, the osteoblasts secrete, along 
with MCSF and TNF11, another protein, osteoprotegerin, that tends to block the 
action of TNF11. The higher the level of Wnt activation in the osteoblasts, the 
more osteoproteregin they secrete and, consequently, the lower the level of 
osteoclast activation and the lower the rate of bone matrix degradation. The Wnt 
signaling pathway thus seems to have two distinct functions in bone formation: 
at early stages, it controls the initial commitment of cells to an osteoblast fate; 
later, it acts in the differentiated osteoblasts to help govern the balance between 
matrix deposition and matrix erosion. 

Disturbance of this balance can lead to osteoporosis, where there is excessive 
erosion of the bone matrix and weakening of the bone, or to the opposite con- 
dition, osteopetrosis, where the bone becomes excessively thick and dense. Hor- 
monal signals, including estrogen, androgens, and the peptide hormone leptin, 
famous for its role in the control of appetite (discussed below), have powerful 
effects on this balance. At least some of these effects are mediated through influ- 
ences on the osteoblasts' production of TNF11 and osteoprotegerin. 

Circulating hormones affect bones throughout the body. No less important 
are local controls that allow bone to be deposited in one place while it is 
resorbed in another. Through such controls over the process of remodeling, 
bones are endowed with a remarkable ability to adjust their structure in 
response to long-term variations in the load imposed on them. It is this that 
makes orthodontics possible, for example: a steady force applied to a tooth with 
a brace will cause it to move gradually, over many months, through the bone of 
the jaw, through remodeling of the bone tissue ahead of it and behind it. The 
adaptive behavior of bone implies that the deposition and erosion of the matrix 
are in some way governed by local mechanical stresses (see Figure 23-56). Some 
evidence suggests that this is because mechanical stress on the bone tissue acti- 
vates the Wnt pathway in the osteoblasts or osteocytes, thereby regulating their 
production of the signals that regulate osteoclast activity. 

Bone can also undergo much more rapid and dramatic reconstruction when 
the need arises. Some cells capable of forming new cartilage persist in the con- 
nective tissue that surrounds a bone. If the bone is broken, the cells in the neigh- 
borhood of the fracture repair it by a sort of recapitulation of the original embry- 
onic process: cartilage is first laid down to bridge the gap and is then replaced by 
bone. The capacity for self-repair, so strikingly illustrated by the tissues of the 
skeleton, is a property of living structures that has no parallel among present- 
day man-made objects. 



Fat Cells Can Develop From Fibroblasts 

Fat cells, or adipocytes, also derive from fibroblastlike cells, both during normal 
mammalian development and in various pathological circumstances. In mus- 
cular dystrophy, for example, where the muscle cells die, they are gradually 
replaced by fatty connective tissue, probably by conversion of local fibroblasts. 
Fat-cell differentiation (whether normal or pathological) begins with the expres- 
sion of two families of gene regulatory proteins: the CEBP (CCAAT/ enhancer 
binding protein) family and the PPAR (peroxisome proliferator- activated recep- 
tor) family, especially PPARy. Like the MyoD and MEF2 families in skeletal mus- 
cle development, the CEBP and PPARy proteins drive and maintain one 
another's expression, through various cross-regulatory and autoregulatory con- 
trol loops. They work together to control the expression of the other genes char- 
acteristic of adipocytes. 

The production of enzymes for import of fatty acids and glucose and for fat 
synthesis leads to an accumulation of fat droplets, consisting mainly of triacyl- 
glycerol (see Figure 2-81). These then coalesce and enlarge until the cell is 
hugely distended (up to 120 jam in diameter), with only a thin rim of cytoplasm 
around the mass of lipid (Figure 23-62 and Figure 23-63). Lipases are also made 
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Figure 23-62 The development of a fat 
cell. A fibroblastlike precursor cell is 
converted into a mature fat cell by the 
accumulation and coalescence of lipid 
droplets. The process is at least partly 
reversible, as indicated by the arrows; the 
dashed arrow indicates uncertainty as to 
whether a differentiated fat cell can ever 
revert to the state of a pluripotent 
fibroblast. The cells in the early and 
intermediate stages can divide, but the 
mature fat cell cannot. 



in the fat cell, giving it the capacity to reverse the process of lipid accumulation, 
by breaking down the triacylglycerols into fatty acids that can be secreted for 
consumption by other cells. The fat cell can change its volume by a factor of a 
thousand as it accumulates and releases lipid. 



Leptin Secreted by Fat Cells Provides Feedback to Regulate Eating 

Almost all animals under natural circumstances have to cope with food supplies 
that are are variable and unpredictable. Fat cells have the vital role of storing 
reserves of nourishment in times of plenty and releasing them in times of 
dearth. It is thus essential to the function of adipose tissue that its quantity 
should be adjustable throughout life, according to the supply of nutrients. For 
our ancestors, this was a blessing; in the well-fed half of the modern world, it has 
become also a curse. In the United States, for example, approximately 30% of the 
population suffers from obesity, defined as a body mass index (weight/ height 2 ) 
more than 30 kg/m 2 , equivalent to about 30% above ideal weight. 

It is not easy to determine to what extent the changes in the quantity of 
adipose tissue depend on changes in the numbers of fat cells, as opposed to 
changes in fat-cell size. Changes in cell size are probably the main factor in 
normal nonobese adults, but in severe obesity, at least, the number of fat cells 
also increases. The factors that drive the recruitment of new fat cells are not 
well understood, although they are thought to include growth hormone and 
IGF1 (insulinlike growth factor- 1). It is clear, however, that the increase or 
decrease of fat cell size is regulated directly by levels of circulating nutrients 
and by hormones, such as insulin, that reflect nutrient levels. The surplus of 
food intake over energy expenditure thus directly governs the accumulation of 
adipose tissue. 

But how are food intake and energy expenditure themselves regulated? Fac- 
tors such as cholecystokinin, secreted by gut cells in response to food in the gut 
lumen as discussed earlier, are responsible for short-term control, over the 
course of a meal or a day. But we also need long-term controls, if we are not to 
get steadily fatter and fatter or thinner and thinner over the course of a lifetime. 
Most important, from an evolutionary point of view and for our ancestors cop- 
ing with food supplies that were often scanty and uncertain, starvation must 
provoke hunger and the pursuit of food. Those who have known real prolonged 
hunger testify to the overwhelming force of this compulsion. The key signal 
appears to be a protein hormone called leptin, which normally circulates in the 
bloodstream when fat reserves are adequate, and disappears, producing chronic 
hunger, when they are not. Mutant mice that lack leptin or the appropriate lep- 
tin receptor are extremely fat (Figure 23-64). Mutations in the same genes 
sometimes occur in humans, although very rarely. The consequences are simi- 
lar: constant hunger, overeating, and crippling obesity. 

Leptin is normally made by fat cells; the bigger they are, the more they make. 
Leptin acts on many tissues, and in particular in the brain, on cells in those 
regions of the hypothalamus that regulate eating behavior. Absence of leptin is a 
signal of starvation, driving the behavior that will restore fat reserves to their 
proper level. Thus, leptin, like myostatin released from muscle cells, provides a 
feedback mechanism to regulate the growth of the tissue that secretes it. 
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Figure 23-63 Fat cells. This low- 
magnification electron micrograph shows 
parts of two fat cells. A neutrophil cell 
that happens to be present in the 
adjacent connective tissue provides a 
sense of scale; each of the fat cells is 
more than 1 times larger than the 
neutrophil in diameter and is almost 
entirely filled with a single large fat 
droplet. The small fat droplets (pale oval 
shapes) in the remaining rim of 
cytoplasm are destined to fuse with the 
central droplet. The nucleus is not visible 
in either of the fat cells in the picture. 
(Courtesy of Don Fawcett, from 
D.W. Fawcett, A Textbook of Histology, 
12th ed. New York: Chapman and 
Hall, 1994.) 



1 476 Chapter 23: Specialized Tissues, Stem Cells, and Tissue Renewal 




Figure 23-64 Effects of leptin 
deficiency. A normal mouse (right) 
compared with a mouse that has a 
mutation in the Obese gene, which codes 
for leptin (left). The leptin-deficient 
mutant fails to limit its eating and 
becomes grotesquely fat (three times the 
weight of a normal mouse). (Courtesy of 
Jeffrey M. Friedman.) 



In most obese people, leptin levels in the bloodstream are persistently high, 
and yet appetite is not suppressed, even though leptin receptors are also present 
and functional. The leptin feedback control evolved, it seems, to save us from 
death by starvation, rather than from obesity through overeating. In the well-fed 
regions of the world, we depend on a complex of other mechanisms, many of 
them still poorly understood, to keep us from getting too fat. 



Summary 

The family of connective-tissue cells includes fibroblasts, cartilage cells, bone cells, fat 
cells, and smooth muscle cells. Some classes of fibroblasts, such as the mesenchymal 
stem cells of bone marrow, seem to be able to transform into any of the other members 
of the family. These transformations of connective-tissue cell type are regulated by the 
composition of the surrounding extracellular matrix, by cell shape, and by hormones 
and growth factors. 

Cartilage and bone both consist of cells and solid matrix that the cells secrete 
around themselves — chondrocytes in cartilage, osteoblasts in bone (osteocytes being 
osteoblasts that have become trapped within the bone matrix). The matrix of cartilage 
is deformable so that the tissue can grow by swelling, whereas bone is rigid and can 
grow only by apposition. The two tissues have related origins and collaborate closely. 
Thus, most long bones develop from miniature cartilage "models," which, as they grow, 
serve as templates for the deposition of bone. Wnt signaling regulates the choice 
between the two pathways of cell differentiation — as chondrocyte (requiring Sox9 
expression) or as osteoblast (requiring Runx2 expression). While osteoblasts secrete 
bone matrix, they also produce signals that recruit monocytes from the circulation to 
become osteoclasts, which degrade bone matrix. Osteoblasts and osteocytes control the 
balance of deposition and degradation of matrix by adjusting the signals they send to 
the osteoclasts. Through the activities of these antagonistic classes of cells, bone under- 
goes perpetual remodeling through which it can adapt to the load it bears and alter its 
density in response to hormonal signals. Moreover, adult bone retains an ability to 
repair itself if fractured, by reactivation of the mechanisms that governed its embry- 
onic development: cells in the neighborhood of the break convert into cartilage, which 
is later replaced by bone. 

While the chief function of most members of the connective-tissue family is to 
secrete extracellular matrix, fat cells serve as storage sites for fat. Feedback control 
keeps the quantity of fat tissue from falling too low: fat cells release a hormone, leptin, 
which acts in the brain, and disappearance of leptin acts as a starvation danger sig- 
nal, driving the behavior that will restore fat reserves to an adequate level. 



STEM-CELL ENGINEERING 

As we have seen, many of the tissues of the body are not only self- renewing but 
also self-repairing, and this is largely thanks to stem cells and the feedback 
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controls that regulate their behavior. But where Nature's own mechanisms fail, 
can we intervene and do better? Can we find ways of getting cells to reconstruct 
living tissues that have been lost or damaged by disease or injury and are inca- 
pable of spontaneous repair? An obvious strategy is to exploit the special devel- 
opmental capabilities of the stem cells or progenitors from which the missing 
tissue components normally derive. But how are such cells to be obtained, and 
how can we put them to use? That is the topic of this final section. 



Hemopoietic Stem Cells Can Be Used to Replace Diseased Blood 
Cells with Healthy Ones 

Earlier in this chapter, we saw how mice can be irradiated to kill off their 
hemopoietic cells, and then rescued by a transfusion of new stem cells, which 
repopulate the bone marrow and restore blood-cell production. In the same 
way, patients with leukemia, for example, can be irradiated or chemically 
treated to destroy their cancerous cells along with the rest of their hemopoietic 
tissue, and then can be rescued by a transfusion of healthy, non-cancerous 
hemopoietic stem cells, which can be harvested from the bone marrow of a 
suitable donor. This creates problems of immune rejection if the bone marrow 
donor and the recipient differ genetically, but careful tissue matching and the 
use of immunosuppressive drugs can reduce these difficulties to a tolerable 
level. In some cases, where the leukemia arises from a mutation in a specialized 
type of blood cell progenitor rather than in the hemopoietic stem cell itself, it is 
possible to rescue the patient with his or her own cells. A sample of bone mar- 
row is taken before the irradiation and sorted to obtain a preparation of 
hemopoietic stem cells that is free from leukemic cells. This purified prepara- 
tion is then transfused back into the patient after the irradiation. 

The same technology also opens the way, in principle, to one form of gene 
therapy: hemopoietic stem cells can be isolated in culture, genetically modified 
by DNA transfection or some other technique to introduce a desired gene, and 
then transfused back into a patient in whom the gene was lacking, to provide a 
self-renewing source of the missing genetic component. A version of this 
approach is under trial for the treatment of AIDS. Hemopoietic stem cells can be 
taken from the patient infected with HIV; genetically modified by transfection 
with genetic material that makes the stem cells and their progeny resistant to 
HIV infection, and transfused back into the same patient. 



Epidermal Stem-Cell Populations Can Be Expanded in Culture for 
Tissue Repair 

Another simple example of the use of stem cells is in the repair of the skin after 
extensive burns. By culturing cells from undamaged regions of the burned 
patient's skin, it is possible to obtain epidermal stem cells quite rapidly in large 
numbers. These can then be used to repopulate the damaged body surface. For 
good results after a third-degree burn, however, it is essential to provide first an 
immediate replacement for the lost dermis. For this, dermis taken from a human 
cadaver can be used, or an artificial dermis substitute. This is still an area of 
active experimentation. In one technique, an artificial matrix of collagen mixed 
with a glycosaminoglycan is formed into a sheet, with a thin membrane of sili- 
cone rubber covering its external surface as a barrier to water loss, and this skin 
substitute (called Integra) is laid on the burned body surface after the damaged 
tissue has been cleaned away. Fibroblasts and blood capillaries from the 
patient's surviving deep tissues migrate into the artificial matrix and gradually 
replace it with new connective tissue. Meanwhile, the epidermal cells are culti- 
vated until there are enough to form a thin sheet of adequate extent. Two or 
more weeks after the original operation, the silicone rubber membrane is care- 
fully removed and replaced with this cultured epidermis, so as to reconstruct a 
complete skin. 
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Neural Stem Cells Can Be Manipulated in Culture 

While the epidermis is one of the simplest and most easily regenerated tissues, 
the central nervous system (the CNS) is the most complex and seems the most 
difficult to reconstruct in adult life. The adult mammalian brain and spinal cord 
have very little capacity for self-repair. Stem cells capable of generating new 
neurons are hard to find in adult mammals — so hard to find, indeed, that until 
recently they were thought to be absent. 

We now know, however, that CNS neural stem cells capable of giving rise to 
both neurons and glial cells do persist in the adult mammalian brain. Moreover, 
in certain parts of the brain they continually produce new neurons to replace 
those that die (Figure 23-65). Neuronal turnover occurs on a more dramatic 
scale in certain songbirds, where large numbers of neurons die each year and are 
replaced by newborn neurons as part of the process by which the bird learns a 
new song in each breeding season. 

The proof that the adult mammalian brain contains neural stem cells came 
from experiments in which pieces of brain tissue were dissociated and used to 
establish cell cultures. In suitable culture conditions, cells derived from an 
appropriate region of the brain will form floating "neurospheres" — clusters con- 
sisting of a mixture of neural stem cells with neurons and glial cells derived from 
the stem cells. These neurospheres can be propagated through many cell gener- 
ations, or their cells can be taken at any time and implanted back into the brain 
of an intact animal. Here they will produce differentiated progeny, in the form of 
neurons and glial cells. 

Using slightly different culture conditions, with the right combination of 
growth factors in the medium, the neural stem cells can be grown as a mono- 
layer and induced to proliferate as an almost pure stem-cell population without 
attendant differentiated progeny. By a further change in the culture conditions, 
these cells can be induced at any time to differentiate to give a mixture of neu- 
rons and glial cells (Figure 23-66), or just one of these two cell types, according 
to the composition of the culture medium. 

The pure cultures of neural stem cells, dividing to produce more neural stem 
cells, are valuable as more than just a source of cells for transplantation. They 
should help in the analysis of the factors that define the stem-cell state and con- 
trol the switch to differentiation. Since the cells can be manipulated genetically 
by DNA transfection and other means, they open up new ways to investigate the 
role of specific genes in these processes and in genetic diseases of the nervous 
system, such as neurodegenerative diseases. They also create opportunities, in 
principle at least, for genetic engineering of neural cells to treat disease. 



Neural Stem Cells Can Repopulate the Central Nervous System 

Neural stem cells grafted into an adult brain show a remarkable ability to adjust 
their behavior to match their new location. Stem cells from the mouse hip- 
pocampus, for example, implanted in the mouse olfactory-bulb -precursor path- 
way (see Figure 23-65) give rise to neurons that become correctly incorporated 
into the olfactory bulb. This capacity of neural stem cells and their progeny to 



Figure 23-65 The continuing production of neurons in an adult mouse 
brain. The brain is viewed from above, in a cut-away section, to show the 
region lining the ventricles of the forebrain where neural stem cells are 
found. These cells continually produce progeny that migrate to the 
olfactory bulb, where they differentiate as neurons. The constant turnover 
of neurons in the olfactory bulb is presumably linked in some way to the 
turnover of the olfactory receptor neurons that project to it from the 
olfactory epithelium, as discussed earlier. There is also a continuing 
turnover of neurons in the adult hippocampus, a region specially concerned 
with learning and memory, where plasticity of adult function seems to be 
associated with turnover of a specific subset of neurons. (Adapted from 
B. Barres, Cell 97:667-670, 1 999. With permission from Elsevier.) 
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adapt to a new environment promises to have important clinical applications in 
the treatment of diseases where neurons degenerate or lose their myelin 
sheaths, and in injuries of the central nervous system. Thus, neural stem cells 
(derived from fetal human tissue) have been grafted into the spinal cord of mice 
that are crippled by a spinal cord injury or by a mutation that leads to defective 
myelination; the mice chosen were of an immuno deficient strain, and so did not 
reject the grafted cells. The grafted cells then gave rise both to neurons that con- 
nected with the host neurons and to oligodendrocytes that formed new myelin 
sheaths around demyelinated host axons. As a result, the host mice recovered 
some of their control over their limbs. 

Such findings hold out the hope that, in spite of the extraordinary complex- 
ity of nerve cell types and neuronal connections, it may be possible to use neu- 
ral stem cells to repair at least some types of damage and disease in the central 
nervous system. 



Stem Cells in the Adult Body Are Tissue-Specific 

When cells are removed from the body and maintained in culture or are trans- 
planted from one site in the body to another, as in the procedures we have just 
described, they generally remain broadly faithful to their origins. Keratinocytes 
continue to behave as keratinocytes, hemopoietic cells as hemopoietic cells, 
neural cells as neural cells, and so on. Placed in an abnormal environment, dif- 
ferentiated cells may, it is true, cease to display the full normal set of differenti- 
ated features, and stem cells may lose their stem-cell character and differentiate; 
but they do not switch to expressing the characteristics of another radically dif- 
ferent cell type. Thus, each type of specialized cell has a memory of its develop- 
mental history and seems fixed in its specialized fate. Some limited transforma- 
tions can certainly occur, as we saw in our account of the connective-tissue cell 
family, and some stem cells can generate a variety of differentiated cell types, 
but the possibilities are restricted. Each type of stem cell serves for the renewal 
of one particular type of tissue. 

Obviously, the practical opportunities would be much greater if stem cells 
were more versatile and not so specialized — if we could take them from one type 
of tissue where they are easily available, and use them to repair a different tissue 



fetal brain or ES cells ► neurospheres (A) ► pure culture of neural stem cells (B) ► mixture (C) of differentiated neurons (red) 
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in medium A in medium B medium C 




Figure 23-66 Neural stem cells. The photographs show the steps leading from fetal brain tissue, via neurospheres (A), to a 
pure culture of neural stem cells (B). These stem cells can be kept proliferating as such indefinitely, or, through a change of 
medium, can be caused to differentiate (C) into neurons (red) and glial cells (green). Neural stem cells with the same 
properties can also be derived, via a similar series of steps, from ES cells. (Micrographs from L Conti et al., PLoS 
3:1 594-1 606, 2005. With permission from Public Library of Science.) 
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Figure 23-67 Newt limb regeneration. 

A time-lapse sequence showing the 
progress of limb regeneration in an 
axolotl from amputation at the level of 
the humerus. The sequence show the 
wound-healing, dedifferentiation, 
blastema, and redifferentiation stages of 
regeneration. Total time shown is 
approximately 20-30 days. (Courtesy of 
Susan Bryant and David Gardiner.) 
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where they are needed. Thus, there has been great excitement in the past decade 
over reports that stem cells of various specialized tissues can, in certain circum- 
stances, show astonishing developmental plasticity, giving rise to cells of radi- 
cally different types — hemopoietic stem cells to neurons, for example, or neural 
stem cells to muscle. The validity of these findings is hotly debated, however, 
and faults have been found in some of the key evidence. For example, many 
apparent cases of such switches of cell fate are now thought to be actually the 
result of cell fusion events, through which nuclei from one type of specialized 
cell are exposed to cytoplasm of another cell type and consequently switch on 
an altered set of genes. In any case, most reports of interconversions between 
radically different adult cell lineages agree that these are rare events. While 
research continues into these extreme forms of stem- cell plasticity, we do not 
yet know how to make such direct interconversions happen on a large enough 
scale or reliably enough, if at all, for practical medical application. 

This is not to say that the radical transformation of cells from one differen- 
tiated character to another is an impossible dream or that efficient ways of 
bringing it about will never be found. In fact, some non-mammalian species can 
regenerate lost tissues and organs by just such interconversions. A newt, for 
example, can regenerate an amputated limb through a process in which differ- 
entiated cells seem to revert to an embryonic character and recapitulate embry- 
onic development. Differentiated multinucleate muscle cells in the remaining 
limb stump reenter the cell cycle, dedifferentiate, and break up into mononu- 
cleated cells; these then proliferate to form a bud similar to the limb bud of an 
embryo, and eventually redifferentiate into the range of cell types needed to 
reconstruct the missing part of the limb (Figure 23-67) . Why a newt can manage 
this — as well as many other extraordinary feats of regeneration — but a mammal 
cannot is still a profound mystery. 



ES Cells Can Make Any Part of the Body 

While stem cells of adult mammalian tissues seem to be quite restricted in what 
they can do, another type of mammalian stem cell is extraordinarily versatile. As 
described in Chapters 8 and 22, it is possible to take an early mouse embryo, at 
the blastocyst stage, and through cell culture to derive from it a class of stem 
cells called embryonic stem cells, or ES cells. ES cells can be kept proliferating 
indefinitely in culture and yet retain an unrestricted developmental potential. If 
ES cells are put back into a blastocyst, they become incorporated into the 
embryo and can give rise to all the tissues and cell types in the body, including 
germ cells, integrating perfectly into whatever site they may come to occupy, 
and adopting the character and behavior that normal cells would show at that 
site. We can think of development in terms of a series of choices presented to 
cells as they follow a road that leads from the fertilized egg to terminal differen- 
tiation. After their long sojourn in culture, the ES cell and its progeny can evi- 
dently still read the signs at each branch in the highway and respond as normal 
embryonic cells would. If ES cells are implanted directly into an embryo at a 
later stage or into an adult tissue, however, they fail to receive the appropriate 
sequence of cues; their differentiation then is not properly controlled, and they 
will often give rise to a tumor. 
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Figure 23-68 Production of differentiated cells from mouse ES cells in culture. ES cells derived from an 
early mouse embryo can be cultured indefinitely as a monolayer, or allowed to form aggregates called 
embryoid bodies, in which the cells begin to specialize. Cells from embryoid bodies, cultured in media with 
different factors added, can then be driven to differentiate in various ways. (Based on E. Fuchs and 
J.A. Segre, Cell 100:143-155, 2000. With permission from Elsevier.) 



Cells with properties similar to those of mouse ES cells can now be derived 
from early human embryos and from human fetal germ cells, creating a poten- 
tially inexhaustible supply of cells that might be used for the replacement and 
repair of mature human tissues that are damaged. Although one may have ethi- 
cal objections to such use of human embryos, it is worth considering the possi- 
bilities that are opened up. Setting aside the dream of growing entire organs 
from ES cells by a recapitulation of embryonic development, experiments in 
mice suggest that it should be possible in the future to use ES cells to replace the 
skeletal muscle fibers that degenerate in victims of muscular dystrophy, the 
nerve cells that die in patients with Parkinson's disease, the insulin- secreting 
cells that are lacking in type I diabetics, the heart muscle cells that die in a heart 
attack, and so on. 

If ES cells are to be used for this sort of tissue repair, they first have to be 
coaxed along the desired pathway of development. ES cells can, in fact, be 
induced to differentiate into a wide variety of cell types in culture (Figure 
23-68), by treatment with appropriate combinations of signal proteins and 
growth factors. <GGAA> They can, for example, be used to generate neurospheres 
and neural stem cells. Neural stem cells derived from mouse ES cells, like those 
derived from brain tissue, can be grafted into the brain of an adult host mouse, 
where they will differentiate to give neurons and glial cells. If the host is deficient 
in myelin-forming oligodendrocytes, a graft of ES-derived oligodendrocyte pre- 
cursors can correct the deficiency and supply myelin sheaths for axons that lack 
them. 



Patient-Specific ES Cells Could Solve the Problem of Immune 
Rejection 

There are many problems to be solved before ES cells can be used effectively for 
tissue repair in human patients. One of the most severe, limiting the use of adult 
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stem cells also, is immune rejection. If ES-derived cells of a given genotype are 
grafted into a genetically different individual, the grafted cells are likely to be 
rejected by the immune system as foreign. Ways of dealing with this problem 
using immunosuppressive drugs have been developed for the transplantation of 
organs such as kidneys and hearts, but they are far from perfect. 

To avoid immunological problems altogether, we need grafted cells that are 
genetically identical to those of the host. How, then, can ES cells be produced to 
order, with the same genotype as an adult human patient who needs a trans- 
plant? As discussed in Chapter 8, one possible route is via somatic cell nuclear 
transfer. In this procedure — not yet achieved with human cells, despite some 
false hopes — the nucleus would be taken from a somatic cell of the patient, and 
injected into an oocyte provided by a donor (in general, a woman other than the 
patient), replacing the original oocyte nucleus. From this hybrid oocyte, a blas- 
tocyst could be obtained, and from the blastocyst, ES cells. These and their 
progeny would contain the nuclear genome of the patient, and should in princi- 
ple be transplantable without risk of immune rejection. But the whole procedure 
involves many difficulties, and is a long way from the stage where it could be 
used for treatment. 

It would be far preferable if we could take cells from the adult patient and 
convert them to an ES-like character by manipulating gene expression more 
directly. A first step along this road is to identify the key determinants of ES cell 
character — the master regulatory proteins that specify that character, if they 
exist. Biochemical comparisons of ES cells with other cell types suggest a set of 
candidates for this role. These candidates can be tested by introducing the 
appropriate DNA expression constructs into differentiated cells, such as fibrob- 
lasts, that can be grown in culture. A combination of such transgenes, coding for 
a set of four gene regulatory proteins (Oct3/4, Sox2, Myc, and Klf4), seems in fact 
to be able to convert fibroblasts into cells with ES-like properties, including the 
ability to differentiate in diverse ways. The conversion rate is low — only a small 
proportion of fibroblasts containing the transgenes make the switch — and the 
converted cells are different from true ES cells in significant respects. Neverthe- 
less, these experiments show a possible way toward the production of cells with 
ES-like versatility from adult somatic cells. 



ES Cells Are Useful for Drug Discovery and Analysis of Disease 

Although transplantation of ES-derived cells for the treatment of human dis- 
eases still seems to be far in the future, there are other ways in which ES cells 
promise to be more immediately valuable. They can be used to generate large 
homogeneous populations of differentiated cells of a specific type in culture; 
and these can serve for testing the effects of large numbers of chemical com- 
pounds in the search for new drugs with useful actions on a given human cell 
type. By techniques such as those we have just described, it may be possible, fur- 
thermore, to create ES-like cells containing the genomes of patients who suffer 
from a given genetic disease, and to use these patient-specific stem cells for the 
discovery of drugs useful in the treatment of that disease. Such cells should be 
valuable also for analysis of the disease mechanism. And at a basic level, manip- 
ulations of ES cells in culture should help us to fathom some of the many 
unsolved mysteries of stem-cell biology. 

Serious ethical issues to need be resolved and enormous technical problems 
overcome before stem-cell technology can yield all the benefits that we dream 
of. But by one route or another, it seems that cell biology is beginning to open up 
new opportunities for improving on Nature's mechanisms of tissue repair, 
remarkable as those mechanisms are. 



Summary 

Stem cells can be manipulated artificially and used both for the treatment of disease 
and for other purposes such as drug discovery. Hemopoietic stem cells, for example, 
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can be transfused into leukemia patients to replace a diseased hemopoietic system, 
and epidermal stem cells taken from undamaged skin of a badly burned patient can 
be rapidly grown in large numbers in culture and grafted back to reconstruct an epi- 
dermis to cover the burns. Neural stem cells can be derived from some regions of the 
fetal or adult brain, and when grafted into a brain that is damaged can differentiate 
into neurons and glial cells that become integrated into the host tissue and may help 
to bring about a partial repair, at least in experimental studies in animals. 

In the normal adult body, each type of stem cell gives rise to a restricted range of 
differentiated cell types. Although there have been many reports of stem-cell plastic- 
ity that violates these restrictions, the evidence is still contentious. Embryonic stem 
cells (ES cells), however, are able to differentiate into any cell type in the body, and 
they can be induced to differentiate into many different cell types in culture. From ES 
cells it is possible, for example, to generate neural stem cell lines that will proliferate 
indefinitely as pure stem-cell cultures but can respond to an appropriate change of 
culture conditions at any time by differentiating into neurons and glia. Methods to 
derive ES-like cells from cells of adult tissues are under development. In principle, 
such ES-like cells, carrying the genome of a specific patient, could be used for tissue 
repair, avoiding the problem of immune rejection. More immediately, they provide an 
in vitro testing ground for the investigation of the physiology and pharmacology of 
cells of any normal or pathological genotype, and for the discovery of drugs with use- 
ful effects on these cells. 
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Pathogens, Infection, and 
Innate Immunity 




Infectious diseases currently cause about one-third of all human deaths in the 
world, more than all forms of cancer combined. In addition to the continuing 
heavy burden of ancient diseases such as tuberculosis and malaria, new infec- 
tious diseases are continually emerging, including the current pandemic (world- 
wide epidemic) of AIDS (acquired immune deficiency syndrome), which has 
already caused more than 25 million deaths worldwide. Moreover, some dis- 
eases long thought to result from other causes are now turning out to be associ- 
ated with infections. Most gastric ulcers, for example, are caused not by stress or 
spicy food, as was once believed, but by a bacterial infection of the stomach 
caused by Helicobacter pylori. 

The burden of infectious diseases is not spread equally across the planet. 
Poorer countries and communities suffer disproportionately. Frequently, the 
prevalence of infectious diseases correlates with poor public sanitation and 
public health systems, which are often further compromised by natural disasters 
or political upheavals. Some infectious diseases, however, occur primarily or 
exclusively among industrialized communities: Legionnaire's disease, com- 
monly spread through air-conditioning systems, is a recent example. 

Humans have long been both troubled and fascinated with infectious dis- 
eases. The earliest written descriptions of how to limit the spread of rabies date 
back more than 3,000 years. Since the mid- 1800s, physicians and scientists have 
struggled to identify the agents that cause infectious diseases, collectively called 
pathogens. More recently, the advent of microbial genetics and molecular cell 
biology has greatly enhanced our understanding of the causes and mechanisms 
of infectious diseases. We now know that pathogens frequently exploit the bio- 
logical attributes of their host's cells in order to infect them. This understanding 
can give us new insights into normal cell biology, as well as strategies for treat- 
ing and preventing infectious diseases. 

In a world teeming with hostile, subtle, and rapidly evolving pathogens, how 
does a fragile and slowly evolving human survive? Like all other multicellular 
organisms, we have evolved several mechanisms to resist infection by 
pathogens. First, physical barriers, such as our tough outer layers of skin, and 
associated chemical defenses, such as acid in the stomach, prevent most 
microorganisms (microbes) from coming into contact with sterile tissues in our 
body. Second, individual human cells possess some intrinsic defensive capabil- 
ities; for example, cells aggressively degrade double-stranded RNA molecules, 
which are a hallmark of certain kinds of viral infections. To combat especially 
powerful pathogens that breach these barricades, vertebrates use two types of 
immune defense, which are carried out by specialized proteins and cells: innate 
immune responses spring into action immediately after an infection begins and 
do not depend on the host's prior exposure to the pathogen, while more power- 
ful adaptive immune responses operate later in an infection and are highly spe- 
cific for the pathogen that induced them. 

In this chapter, we begin with an overview of the different kinds of organ- 
isms that cause disease. We then discuss the cell biology of infection and, finally, 
consider barriers to infection and innate immunity. Adaptive immunity is the 
subject of Chapter 25. 
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INTRODUCTION TO PATHOGENS 

We normally think of pathogens as hostile invaders that attack our bodies. But a 
pathogen, like any other organism, is simply fulfilling its biological imperative to 
live and procreate. Living at the expense of a host organism is a very effective 
strategy, and it is possible that every living organism on earth is subject to some 
type of infection (Figure 24-1). A human host is a nutrient-rich, warm, and 
moist environment, which remains at a uniform temperature and constantly 
renews itself. It is not surprising that many microorganisms have evolved the 
ability to survive and reproduce in this desirable niche. In this section, we dis- 
cuss some of the common features that microorganisms must have in order to 
be infectious. We then explore the wide variety of organisms that are known to 
cause disease in humans. 



Figure 24-1 Parasitism at many levels. 

(A) Scanning electron micrograph of a 
flea. The flea is a common parasite of 
mammals — including dogs, cats, rats, and 
humans. It drinks the blood of its host. 
Flea bites spread bubonic plague by 
passing the pathogenic bacterium 
Yersinia pestis from the bloodstream of 
one infected host to that of another. 

(B) A close-up view of this flea's leg 
reveals that it also has a parasite, a type 
of mite. The mite, in turn, is covered with 
bacteria. It is likely that bacteriophages, 
which are bacterial viruses, parasitize 
these bacteria. 

Jonathan Swift reported a similar 
observation in 1733: 
So, naturalists observe, a flea 
Has smaller fleas that on him prey; 
And these have smaller still to bite 'em; 
And so proceed ad infinitum. 
(A, courtesy of Tina Carvalho/ 
MicroAngela; B, courtesy of Stanley 
Falkow.) 



Pathogens Have Evolved Specific Mechanisms for Interacting 
with Their Hosts 

The human body is a complex and thriving ecosystem. It contains about 10 13 
human cells and also about 10 14 bacterial, fungal, and protozoan cells, which rep- 
resent thousands of microbial species. These commensal microbes, called the 
normal flora, are usually confined to certain areas of the body, including the skin, 
mouth, large intestine, and vagina. The normal flora are not just free-loading 
inhabitants of the ecosystem that is the human body, they can also affect our 
health. The anaerobic bacteria that inhabit our intestines contribute to the diges- 
tion of food, and they are also essential for proper development of the gastro- 
intestinal tract in infants. Normal flora on the skin and elsewhere also help us by 
competing with disease-causing microorganisms. In addition, humans are 
always infected with viruses, the vast majority of which rarely become noticeable. 
If it is normal for us to live in such close intimacy with a wide variety of 
microbes, why are some of them capable of causing us illness or death? As we 
shall see, this question has several answers, and the ability of a particular 
microorganism to cause obvious damage and disease in a host can depend 
greatly on external influences. Primary pathogens, which can cause overt dis- 
ease in most healthy people, are usually distinct from the normal flora. They dif- 
fer from commensal organisms in their abilities to breach barriers and survive in 
host locations where other microorganisms cannot. Our normal microbial 
inhabitants only cause trouble if our immune systems are weakened or if they 
gain access to a normally sterile part of the body, as when a bowel perforation 
enables gut flora to enter the peritoneal cavity of the abdomen, causing peritoni- 
tis; occasionally they cause disease if our immune response to them is inappro- 
priately strong. In contrast, primary pathogens do not require an immunocom- 
promised or injured host. Primary pathogens have evolved highly specialized 
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mechanisms for crossing cellular and biochemical barriers and for eliciting spe- 
cific responses from the host organism that contribute to the pathogen's survival 
and multiplication. For some pathogens, these mechanisms are adapted to a 
unique host species, whereas for others they are sufficiently general that the 
pathogen can invade, survive, and thrive in a wide variety of hosts. 

Some pathogens cause acute epidemic infections and are forced to spread 
rapidly from one sick or dying host to another; historically important examples 
include bubonic plague and smallpox. Others cause persistent infections that 
may last for years in a single individual without necessarily leading to overt dis- 
ease; examples include Epstein-Barr virus (which can cause the severe flu-like 
illness mononucleosis in some people), the bacterium Mycobacterium tubercu- 
losis (which can cause the life-threatening lung infection tuberculosis), and the 
intestinal worm Ascaris. Although each of these pathogens can make some peo- 
ple critically ill, billions of people who are mostly unaware that they are infected 
carry each of them in an asymptomatic way. It is hard to draw a line between 
persistent infection and commensalism. Throughout this chapter, we shall con- 
tinue to acknowledge the diversity of pathogens and infections while focusing 
on the cell biological principles that are common to many of them. 

In order to survive and multiply in a host, a successful pathogen must be 
able to: (1) colonize the host; (2) find a nutritionally compatible niche in the 
host's body; (3) avoid, subvert, or circumvent the host's innate and adaptive 
immune responses; (4) replicate, using host resources; and (5) exit and spread to 
a new host. Under severe selective pressure to induce host responses that help 
to accomplish these tasks, pathogens have evolved mechanisms that maximally 
exploit the biology of their host organisms. Many of them are therefore skillful 
and practical cell biologists, metaphorically speaking, and we can learn a great 
deal of cell biology by observing them. 

At the same time, our constant exposure to pathogens has strongly influ- 
enced human evolution. The development of the exquisitely precise adaptive 
immune system in vertebrates, described in Chapter 25, was an important esca- 
lation in the arms race that has always existed between pathogens and their 
hosts. In modern times, humans have upped the ante by deliberately altering 
our behavior to limit the ability of pathogens to infect us. Improvements in pub- 
lic health measures, including the construction of working sewer systems and 
clean water supplies, have contributed to the gradual decline in the frequency of 
total deaths due to infectious disease over the past several centuries. Societies 
that have dedicated resources to improving childhood nutrition have benefited 
from generally improved health, including greatly reduced death rates from 
early childhood infections. Medical interventions such as vaccinations, anti- 
microbial drugs, and routine testing of blood before using it for transfusion, 
have also substantially reduced the burden of infectious disease for many 
humans. As we learn more about the mechanisms by which pathogens cause 
disease (pathogenesis), our brains will continue to serve as an important exten- 
sion of our immune systems in fighting infectious diseases. 

The Signs and Symptoms of Infection May Be Caused by the 
Pathogen or by the Host's Responses 

Although we can easily understand why infectious microorganisms would 
evolve to reproduce in a host, it is less clear why they would evolve to cause dis- 
ease, that is, to damage their hosts. One explanation may be that, in some cases, 
the pathological responses that microorganisms elicit enhance the efficiency of 
their spread or propagation and hence clearly have a selective advantage for the 
pathogen. The virus- containing lesions on the genitalia caused by herpes simplex 
infection, for example, facilitate direct spread of the virus from an infected host 
to an uninfected partner during sexual contact. Similarly, diarrheal infections are 
efficiently spread from patient to caretaker. In many cases, however, the induc- 
tion of disease has no apparent advantage for the pathogen. Some host responses 
to infection such as lethargy and withdrawal from social interactions would 
instead seem to inhibit pathogen spread. Infected humans may altruistically try 
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to avoid infecting their friends and family members, and healthy humans may 
likewise consciously avoid the sick; similar behavior has also been documented 
in spiny lobsters. These may be adaptive responses that have been selected in 
host populations that tend to live in groups. 

Many of the symptoms and signs that we associate with infectious disease 
are actually direct manifestations of the host's immune responses in action. 
Some hallmark signs at the site of a bacterial infection, including swelling, red- 
ness, and the production of pus (mainly dead white blood cells), result from 
immune system cells attempting to destroy the invading microorganisms. Fever, 
too, is a defensive response, as the increase in body temperature can inhibit the 
proliferation of some microorganisms. In extreme cases, the most severe and 
damaging consequences of an infectious disease are directly caused by an 
overzealous immune response: the massive tissue destruction seen in cases of 
leishmaniasis (an infection caused by eucaryotic pathogens that are members of 
the genus Leishmania) is an example. Thus, understanding the biology of an 
infectious disease requires an appreciation of the contributions of both 
pathogen and host. 

To understand the relative contributions of the infecting microorganism and 
the host in causing the signs and symptoms of disease, it is useful to consider the 
cause and extent of damage done to host tissues during an infection. Each inter- 
action between a particular microorganism and a particular host is unique, and 
the outcome depends on a constantly changing landscape of microbial activity 
and host immune function. The extent of damage caused to the host depends on 
the interplay of these factors. In some cases, a particular microorganism may act 
as a harmless or even beneficial commensal in most people at most times but 
may cause invasive disease in people with weakened immune systems; this is 
true for the common skin inhabitant Staphylococcus epidermidis, for example 
(Figure 24-2A). Other microorganisms, such as the virus that causes mumps, will 
cause severe damage only in the presence of strong immune responses (Figure 
24-2B) . A very interesting category, nicely illustrating the importance of the inter- 
play between host and microbial factors in causing damage, are the many 
pathogens that cause severe disease in people with either very weak or very 
strong immune responses but little if any damage in people with intermediate 
immune responses (Figure 24-2C). An excellent example is tuberculosis, which 
currently infects between 1 and 2 billion people on Earth (usually in their lungs), 
although most are unaware of it because their immune systems have effectively 
contained the infection. When, however, a person with such a latent M. tubercu- 
losis infection becomes immunosupressed, through drug therapy or infection 
with the human immunodeficiency virus (HIV), for example, the delicate balance 
between the bacterium and the immune system is tipped in favor of the bac- 
terium, which now replicates in an uncontrolled manner, leading to serious dis- 
ease, often with painful cough producing bloody sputum. Conversely, when the 
immune response to M. tuberculosis is overzealous, it can destroy an extensive 
amount of lung tissue. 



(A) 



(B) 
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Figure 24-2 Interaction between 
microbes and immune responses in 
microbial pathogenesis. The strength of 
the immune responses increases in the 
direction of the arrows, while the amount 
of host damage is indicated by the 
amount of red shading in the arrows. 
(A) Normal flora such as the bacterium 
Staphylococcus epidermidis and the 
fungus Pneumocystis carinii only cause 
disease when the immune system is 
abnormally weak. (B) For some 
pathogens, such as the paramyxovirus 
that causes mumps and the bacterium 
Chlamydia trachomatis that causes a 
common sexually transmitted disease, it 
is the immune responses rather than the 
pathogen that is primarily responsible for 
the tissue damage. (C) For many 
pathogens, hosts with normal immune 
responses may be colonized 
asymptomatically, but hosts with very 
strong or very weak immune responses 
may suffer severe damage or even death. 
Many microorganisms behave in this way, 
including the bacterium Mycobacterium 
tuberculosis, the fungus Aspergillus, and 
the herpes simplex virus. 



Pathogens Are Phylogenetically Diverse 

Many types of pathogens cause disease in humans. The most familiar are viruses 
and bacteria. Viral infections cause diseases ranging from AIDS and smallpox to 
the common cold. They are essentially fragments of nucleic acid (DNA or RNA) 
encoding a relatively small number of gene products, wrapped in a protective 
shell of proteins and (in some cases) membrane (Figure 24-3A) . They have no 
capacity for independent metabolic activity and therefore depend absolutely on 
metabolic energy supplied by the host. They all use the basic protein synthesis 
machinery of their host cells for their replication, and many rely on host cell 
transcription machinery as well. 

Of all the bacteria we encounter in our lives, only a small minority are pri- 
mary pathogens. Much larger and more complex than viruses, bacteria are usu- 
ally free-living cells, which perform most of their basic metabolic functions 
themselves, relying on the host primarily for nutrition (Figure 24-3B). 
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Some other infectious agents are eucaryotic organisms. These range from 
single-celled fungi and protozoa (Figure 24-3C), through large complex metazoa 
such as parasitic worms. One of the most common infectious diseases on the 
planet, shared by about a billion people at present, is an infestation in the gut by 
the nematode worm Ascaris lumbricoides. Ascaris closely resembles its cousin 
Caenorhabditis elegans, which is widely used as a model organism for genetic 
and developmental biological research (discussed in Chapter 22). C. elegans, 
however, is only about 1 mm in length, whereas Ascaris can reach 30 cm (Figure 
24-3D). 

Some rare neurodegenerative diseases, including "mad cow" disease, are 
caused by an unusual type of infectious particle called a prion, which is made 
only of protein. Although the prion contains no genome, it can nevertheless 
replicate and kill the host. 

Even within each class of pathogen, there is striking diversity. Viruses vary 
tremendously in their size, shape, and content (DNA versus RNA, enveloped or 
not, and so on), and the same is true for the other pathogens. The ability to cause 
disease is an evolutionary niche, not a legacy shared only among close relatives. 

Each individual pathogen causes disease in a different way, and the same 
pathogen may cause different diseases in different hosts, making it challenging 
to understand the basic biology of infection. But, when considering the interac- 
tions of infectious agents with their hosts, some common themes of pathogen- 
esis emerge, which are the focus of this chapter. 

We now introduce the basic features of each of the major types of pathogens, 
before we examine the mechanisms that pathogens use to control their hosts 
and the innate immune responses that hosts use to control pathogens. 



Bacterial Pathogens Carry Specialized Virulence Genes 

Bacteria are small and appear structurally simple. Most can be classified 
broadly by their shape as rods, spheres, or spirals (Figure 24-4A) and by their so- 
called Gram-staining properties (Figure 24-4B and C) . Their relatively small size 
and simple range of shapes belies their extraordinary molecular, metabolic, and 
ecological diversity. At the molecular level, bacteria are far more diverse than 
eucaryotes, and they can successfully occupy ecological niches in extremes of 
temperature, salt, and nutrient limitation that would daunt even the most 




Figure 24-3 Pathogens in many forms. 

(A) The structure of the protein coat, or 
capsid, of poliovirus.This virus was once a 
common cause of paralysis, but the 
disease (poliomyelitis) has been nearly 
eradicated by widespread vaccination. 

(B) The bacterium Vibrio cholerae, the 
causative agent of the epidemic, 
diarrheal disease cholera. (C) The 
protozoan parasite Toxoplasma gondii. 
The definitive hosts for this organism are 
cats, ranging in size from housecats to 
tigers, but it also can cause serious 
infections in the muscles and brains of 
immunocompromised people with AIDS. 
(D) This clump of Ascaris nematodes was 
removed from the obstructed intestine of 
a two-year-old boy. (A, courtesy of Robert 
Grant, Stephan Crainic, and James M. 
Hogle; B, all attempts have been made to 
contact the copyright holder and we 
would be pleased to hear from them; 

C, courtesy of John Boothroyd and David 
Ferguson; D, from J.K. Baird et al., Am. 
J. Trop. Med. Hyg. 35:314-31 8, 1 986. 
Photograph by Daniel H. Connor.) 



1490 Chapter 24: Pathogens, Infection, and Innate Immunity 




Coccus 



Coccobacillus 



Vibrio 



Bacillus 



(A) 



Spirillum 



Spirochete 



-teichoic acid 



lipopolysaccharide (LPS) outer 
leaflet of outer membrane 




straight 
pilus 



CYTOSOL 




GRAM POSITIVE 



GRAM NEGATIVE 



(B) 



(O 



CYTOSOL 



(D) 



s inner 
membrane 



Figure 24-4 Bacterial shapes and cell-surface structures. (A) Bacteria are classified by shape. (B and C) They are also 
classified as Gram-positive or Gram-negative. (B) Bacteria such as Streptococcus and Staphylococcus have a single membrane 
and a thick cell wall made of cross-linked peptidoglycan. They retain the violet dye used in the Gram staining procedure and 
are thus called Gram-positive. (C) Gram-negative bacteria such as Escherichia coli (E. coli) and Salmonella have two 
membranes, separated by a periplasmic space (see Figure 1 1-1 8). The peptidoglycan layer in the cell wall of these 
organisms is located in the periplasmic space and is thinner than in Gram-positive bacteria; they therefore fail to retain the 
dye in the Gram staining procedure. The inner membrane of Gram-negative bacteria is a phospholipid bilayer, and the 
inner leaflet of the outer membrane is also made primarily of phospholipids; the outer leaflet of the outer membrane, 
however, is composed of a unique glycosylated lipid called lipopolysaccharide (LPS) (see Figure 24-47). (D) Cell-surface 
appendages are important for bacterial behavior. Many bacteria swim using the rotation of helical flagella (see Figure 
15-71). The bacterium illustrated has only a single flagellum at one pole; others such as E. coli have multiple flagella. 
Straight pili (also called fimbriae) are used to adhere to various surfaces in the host, as well as to facilitate genetic exchange 
between bacteria. Some kinds of pili can retract to generate force and thereby help bacteria move across surfaces. Both 
flagella and pili are anchored to the cell surface by large multiprotein complexes. 



intrepid eucaryote. Although they lack the elaborate morphological variety of 
eucaryotic cells, bacteria display a surprising array of surface appendages, 
which enable the cells to swim or adhere to desirable surfaces (Figure 24-4D). 
Their genomes are also small, typically between 1,000,000 and 5,000,000 
nucleotide pairs (compared to 12,000,000 for yeast and more than 3,000,000,000 
for humans) . 

As already emphasized, only a minority of bacterial species have the ability 
to cause disease in humans. Some of those that do cause disease can only repli- 
cate inside the body of their host and are called obligate pathogens. Others repli- 
cate in an environmental reservoir such as water or soil and only cause disease 
if they happen to encounter a susceptible host; these are called facultative 
pathogens. Many bacteria are normally harmless but have a latent ability to 
cause disease in an injured or immunocompromised host; these are called 
opportunistic pathogens. As discussed previously, whether or not a particular 
bacterium causes disease in a particular host depends on a wide variety of fac- 
tors, including the overall health of the host; many normal flora, for example, 
can cause severe infections in people with AIDS. 
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Some bacterial pathogens are fastidious in their choice of host and will 
infect only a single species or a group of related species, whereas others are gen- 
eralists. Shigella flexneri, for example, which causes epidemic dysentery (bloody 
diarrhea) in areas of the world lacking a clean water supply, will infect only 
humans and other primates. By contrast, the closely related bacterium 
Salmonella enterica, which is a common cause of food poisoning in humans, 
can also infect many other vertebrates, including chickens and turtles. A cham- 
pion generalist is the opportunistic pathogen Pseudomonas aeruginosa, which is 
capable of causing disease in plants as well as animals. 

A relatively small number of genes causes the significant differences 
between a virulent pathogenic bacterium and its closest nonpathogenic relative. 
Genes that contribute to the ability of an organism to cause disease are called 
virulence genes, and the proteins they encode are called virulence factors. Vir- 
ulence genes are frequently clustered together, either in groups on the bacterial 
chromosome called pathogenicity islands or on extrachromosomal virulence 
plasmids (Figure 24-5). These genes may also be carried on mobile bacterio- 
phages (bacterial viruses). It seems therefore that a new pathogen may arise 
when groups of virulence genes are transferred together into a previously aviru- 
lent bacterium. As more genomes of pathogenic and nonpathogenic bacteria 
are being completely sequenced, it is becoming clear that the acquisition of 
large chunks of DNA and other gross chromosomal changes have contributed to 
bacterial evolution, enabling bacterial species to inhabit new ecological and 
nutritional niches, as well as to cause disease. Even within a single bacterial 
species, the amount of chromosomal variation is astonishing; different strains of 
E. coli may differ by as much as 25% in their genomes. 

Acquisition of genes and gene clusters can drive the rapid evolution of 
pathogens and turn nonpathogens into pathogens. Consider, for example, Vib- 
rio cholerae — the Gram-negative bacterium that causes the epidemic diarrheal 
disease cholera. The genes encoding the two subunits of the toxin that cause the 
diarrhea are carried on a mobile bacteriophage (Figure 24-6A and B). Of the 
hundreds of strains of Vibrio cholerae found in lakes in the wild, the only ones 
that cause pandemic human disease are those infected with this bacterial virus. 
As summarized in Figure 24-6C, there have been eight identified pandemics of 
V cholerae since 1817. The first six were caused by the periodic reemergence of 
similar strains, called Classical strains. Besides the toxins encoded by the bacte- 
riophage and pathogenicity islands, the Classical strains also shared a similar 
primary carbohydrate surface antigen, called 01, which is part of the 
lipopolysaccharide that makes up the outer leaflet of the outer membrane (see 
Figure 25-4C). In 1961, the seventh pandemic began, caused by a new strain 
(named "El Tor"), which was markedly different from the Classical strains and 
appeared to have arisen when an 01 -expressing strain in the wild acquired two 
bacteriophages, as well as at least two new pathogenicity islands not found in 
Classical strains. El Tor eventually displaced the Classical strains all over the 
world. In 1991, an eighth pandemic began, this time with the frightening twist 
that even people who had suffered cholera previously were not immune, as the 
new strain had a different type of O antigen, rendering the anti-01 antibodies 
present in the blood of survivors of previous cholera epidemics ineffective 
against the new strain. In other respects, the new strain was very similar to El 
Tor; it had apparently simply acquired a new cassette for synthesis of a different 
type of O antigen. 

What are the genes that can enable a bacterium to cause disease in a healthy 
host? Many virulence genes encode proteins that interact directly with host cells. 
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Figure 24-5 Genetic differences between 
pathogenic and nonpathogenic bacteria. 

Nonpathogenic E. coli has a single circular 
chromosome. E. coli is very closely related 
to two types of food-borne pathogens — 
Shigella flexneri, which causes dysentery, 
and Salmonella enterica, a common cause 
of food poisoning. If these three 
organisms were being named today based 
on molecular techniques, they would be 
classified in the same genus, if not in the 
same species. The chromosome of 
5. flexneri differs from that of E. coli at only 
a few loci; most of the genes required for 
pathogenesis (virulence genes) are carried 
on an extrachromosomal virulence 
plasmid. The chromosome of S. enterica 
carries two large inserts (pathogenicity 
islands) not found in the E. coli 
chromosome; these inserts each contain 
many virulence genes. 
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Figure 24-6 Genetic organization of Vibrio cholerae. An electron micrograph of Vibrio cholerae {V. cholerae) is shown in 
Figure 24-3B. (A) The bacterium is unusual in having two circular chromosomes rather than one. Each chromosome has its 
own origin of replication (oriCi and oriC2). Three loci in pathogenic strains of V. cholerae are absent in nonpathogenic 
strains and appear to have been acquired relatively recently. CTX(]> on chromosome 1 is an integrated bacteriophage 
genome that carries the genes encoding cholera toxin. The pathogenicity island VPI on chromosome 1 includes genes 
encoding factors required for intestinal colonization. The integron island on chromosome 2 facilitates the insertion of newly 
acquired DNA fragments downstream of a strong transcriptional promoter and thereby promotes the sequential 
acquisition of novel genes. Although this integron island has not yet been shown to be required for virulence in V. cholerae, 
similar integron islands in many other pathogens contain virulence genes, as well as genes involved in antibiotic resistance. 
(B) Map of the CTXcj) locus. The genes encoding the two subunits of cholera toxin are CtxA and CtxB. Other genes in the core 
region (Ace and Zot) are also involved in virulence. The two repeated flanking sequences RS2 and RS1 were involved in the 
insertion of the bacteriophage genome into chromosome 1. (C) Comparative-genomics-based model for the evolution of 
pathogenic V. cholerae strains. Progenitor strains in the wild first acquired the biosynthetic pathway necessary to make the 
01 antigen type of carbohydrate chain on the outer-membrane lipopolysaccharide (see Figures 24-4C and 24-47), and 
perhaps also acquired one or two of the Vibrio pathogenicity islands (VPI1 and VPI2). Incorporation of the CTX(|) 
bacteriophage created the Classical pathogenic strains responsible for the first six worldwide epidemics of cholera 
between 1817 and 1 923. Sometime in the twentieth century, an 01 strain in the environment picked up the CTXcj) 
bacteriophage again, along with an associated bacteriophage RSI cf> and two new pathogenicity islands (VSP1 and VSP2), 
creating the El Tor strain that emerged as the seventh worldwide pandemic in 1 961 . Before 1 991 , an El Tor strain picked up 
a new DNA cassette enabling it to produce the 01 39 antigen type of carbohydrate chain rather than 01 . This altered its 
interaction with the human immune system, without diminishing its virulence, and triggered the start of the eighth 
pandemic; this bacterium also picked up a new pathogenicity island (SXT) and lost most of the VPI2 island. 



Two carried by the Vibrio cholerae phage, for example, encode two subunits of 
cholera toxin (see Figure 24-6B). The B subunit of this secreted, toxic protein 
binds to a glycolipid component of the plasma membrane of the epithelial cells 
in the gut of a person who has consumed Vibrio cholerae in contaminated water. 
The B subunit transfers the A subunit through the plasma membrane into the 
epithelial cell cytoplasm. The A subunit is an enzyme that catalyzes the transfer 
of an ADP-ribose moiety from NAD + to the trimeric G protein G s , which nor- 
mally activates adenylyl cyclase to make cyclic AMP (discussed in Chapter 15). 
ADP-ribosylation of the G protein results in an overaccumulation of cyclic AMP 
and an ion imbalance, leading to the massive watery diarrhea associated with 
cholera. The infection is then spread to new hosts by the fecal-oral route via 
contaminated food and water. 

Some pathogenic bacteria use several independent mechanisms to cause tox- 
icity to the cells of their host. Anthrax, for example, is an acute infectious disease 
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of sheep, cattle, other herbivores, and occasionally humans. It is usually caused 
by contact with spores of the Gram-positive bacterium Bacillus anthracis. 
Unlike cholera, anthrax does not spread directly from person to person. Dor- 
mant spores can survive in soil for long periods and are highly resistant to 
adverse environmental conditions, including heat, ultraviolet and ionizing radi- 
ation, pressure, and chemical agents. After the spores are inhaled, ingested, or 
rubbed into breaks in the skin, the spores germinate, and the bacteria begin to 
replicate. The bacteria secrete two toxins, called lethal toxin and edema toxin, 
either of which is sufficient to cause signs of infection. Like cholera toxin, both 
anthrax toxins are made of two subunits. The B subunit is identical in the two 
anthrax toxins, and it binds to a host cell-surface receptor protein to transfer the 
two different A subunits into host cells (Figure 24-7). The A subunit of edema 
toxin is an adenylyl cyclase that directly converts host-cell ATP into cyclic AMP, 
leading to an ion imbalance that can cause an accumulation of extracellular 
fluid (edema) in the infected skin or lung. The A subunit of lethal toxin is a pro- 
tease that cleaves several members of the MAP kinase kinase family (see Figure 
15-60). Injection of lethal toxin into the bloodstream of an animal causes shock 
(a fall in blood pressure) and death. The molecular mechanisms leading to death 
in anthrax remain uncertain. 

These examples illustrate a common theme among virulence factors. The 
factors are frequently either toxic proteins (toxins) that interact directly with 
important host structural or intracellular signaling proteins to elicit a host cell 
response that is beneficial to pathogen colonization or replication, or they are 
proteins that are needed to deliver such toxins to the host cell targets. One com- 
mon and particularly efficient delivery mechanism found in several Gram-neg- 
ative pathogens, called the type III secretion system, acts like a tiny syringe that 
injects toxic proteins from the cytoplasm of an extracellular bacterium directly 
into the cytoplasm of an adjacent host cell (Figure 24-8). The effector proteins 
that these injection devices deliver into the host cell cytoplasm can elicit a vari- 
ety of host cell responses that enable the bacterium to invade or survive. There 
is a remarkable degree of structural similarity between the type III syringe and 
the base of a bacterial flagellum (see Figure 15-71), and many of the proteins in 
the two structures are homologous. Because flagella are found in a wider range 
of bacteria than are type III secretion systems and the secretion systems appear 
to be adaptations specific for pathogenesis, it seems likely that the type III secre- 
tion systems evolved from flagella rather than the other way around. Other types 
of specialized toxin delivery systems found in pathogens appear to have evolved 
independently. For example, type IV secretion systems, used by several pathogens 



Figure 24-7 Anthrax toxin entry into 
host cells. (A) The B subunits of the toxin 
form a pore to transport the 
enzymatically active A subunit into the 
cytoplasm of host cells. The B subunit 
binds to a receptor protein on the host 
cell surface and is cleaved 
autocatalytically, releasing a soluble small 
fragment and a large fragment that 
remains associated with the receptor. 
Seven copies of the large fragment and 
their associated receptors aggregate on 
the cell surface to form a heptameric 
ring. The toxin A subunit then binds to 
the heptameric ring and is endocytosed 
along with the ring. As the pH drops in 
the endosome, the ring undergoes a 
dramatic conformational change that 
generates a pore in the endosomal 
membrane, and the A subunit is 
delivered into the cytosol of the host cell 
through the pore. (B) X-ray crystal 
structure of the B subunit heptameric 
ring, viewed from the top and from the 
side. (B, from C. Petosa et al., Nature 
385:833-838, 1997, with permission from 
Macmillan Publishers Ltd.) 
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to deliver toxins to the cytoplasm of host cells in a manner analogous to the type 
III systems, are closely related to the conjugative apparatus that many bacteria 
use to exchange genetic material. 



Figure 24-8 Type III secretion systems 
that can deliver virulence factors into 
the cytosol of a host cell. (A) Electron 
micrographs of purified type III secretion 
systems. About two dozen proteins form 
the complete structure, which is seen in 
the three enlarged electron micrographs 
below. (B) The large lower ring is 
embedded in the bacterial plasma 
membrane, and the smaller upper ring is 
embedded in the bacterial outer 
membrane. During infection, contact of 
the hollow tube tip with the plasma 
membrane of a host cell triggers 
secretion into the host cell. Here, the 
plague bacterium, Yersinia pestis, delivers 
toxins to a macrophage. (A, from 
K.Tamano et al., EMBOJ. 19:3876-3887, 
2000. With permission from Macmillan 
Publishers Ltd.) 



Fungal and Protozoan Parasites Have Complex Life Cycles with 
Multiple Forms 

Pathogenic fungi and protozoan parasites are eucaryotes. It is therefore more 
difficult to find drugs that will kill them without killing the host. Consequently, 
antifungal and antiparasitic drugs are often less effective and more toxic than 
antibiotics. A second characteristic of fungal and parasitic infections that makes 
them difficult to treat is the tendency of the infecting organisms to switch 
among several different forms during their life cycles. A drug that is effective at 
killing one form is often ineffective at killing another form, which therefore sur- 
vives the treatment. 

The fungal branch of the eucaryotic kingdom includes both unicellular 
yeasts (such as Saccharomyces cerevisiae and Schizosaccharomyces pombe) and 
filamentous, multicellular molds (like those found on moldy fruit or bread). 
Most of the important pathogenic fungi exhibit dimorphism — the ability to grow 
in either yeast or mold form. The yeast-to-mold or mold-to-yeast transition is 
frequently associated with infection. Histoplasma capsulatum, for example, 
grows as a mold at low temperature in the soil, but it switches to a yeast form 
when inhaled into the lung, where it can cause the disease histoplasmosis (Fig- 
ure 24-9). 

Protozoan parasites are single-celled eucaryotes with more elaborate life 
cycles than fungi, and they frequently require the services of more than one 
host. Malaria is the most common protozoal disease, infecting 200-300 million 
people every year and killing 1-3 million of them. It is caused by four species of 
Plasmodium, which are transmitted to humans by the bite of the female of any 
of 60 species of Anopheles mosquito. Plasmodium falciparum — the most inten- 
sively studied of the malaria- causing parasites — exists in no fewer than eight 
distinct forms, and it requires both the human and mosquito hosts to complete 
its sexual cycle (Figure 24-1 0A). Gametocytes are formed in the bloodstream of 
infected humans, but they can only differentiate into gametes and fuse to form 
a zygote in the gut of the mosquito. Three of the Plasmodium forms are highly 
specialized to invade and replicate in specific tissues — the insect gut lining, the 
human liver, and the human red blood cell. Even within a single host cell type, 
the red blood cell, the Plasmodium parasite undergoes a complex sequence of 
developmental events, reflected in striking morphological changes (Figure 
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Figure 24-9 Dimorphism in the 
pathogenic fungus Histoplasma 
capsulatum. (A) At low temperature in the 
soil, Histoplasma grows as a multicellular 
filamentous mold. (B) After it is inhaled 
into the lung of a mammal, Histoplasma 
undergoes a morphological switch 
triggered by the change in temperature. In 
this unicellular yeast-like form, it closely 
resembles Saccharomyces cerevisiae. 
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Figure 24-1 The complex life cycle of 
malaria parasites. (A) The sexual cycle of 
Plasmodium falciparum requires passage 
between a human host and an insect host. 
(B)-(D) Blood smears from people with 
malaria, showing three different forms of 
the parasite that appear in red blood cells: 
(B) ring stage; (C) schizont; (D) gametocyte. 
(Micrographs courtesy of the Centers for 
Disease Control, Division of Parasitic 
Diseases, DPDx.) 
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24-10B, C, D) as well as in stage-specific regulation of a majority of its transcripts 
(Figure 24-11). 

Because malaria is so widespread and devastating, it has acted as a strong 
selective pressure on human populations in areas of the world that harbor the 
Anopheles mosquito. Sickle-cell anemia, for example, is a recessive genetic disor- 
der caused by a point mutation in the gene that encodes the hemoglobin (3 chain, 
and it is common in areas of Africa with a high incidence of the most serious form 
of malaria (caused by Plasmodium falciparum). The malarial parasites grow 
poorly in red blood cells from either homozygous sickle-cell patients or healthy 
heterozygous carriers, and, as a result, malaria is seldom found among carriers of 
this mutation. For this reason, malaria has served to maintain the otherwise dele- 
terious sickle-cell mutation at high frequency in these regions of Africa. 

Figure 24-1 1 Time-dependent transcriptional program in malaria 
parasites developing in red blood cells. RNA was isolated from red blood 
cells infected with Plasmodium falciparum at 1-hour intervals over a total of 
48 hours. In this image, each horizontal line represents one of the -2700 
genes in which the transcriptional level changed significantly during the 
course of infection. Red indicates an increase in mRNA abundance relative 
to the average, and green indicates a decrease. The genes were arranged in 
order from top to bottom according to the relative phases of their 
transcriptional activation. This regular, orderly progression of gene 
expression parallels the morphological differentiation of the Plasmodium 
parasite through the ring stage, trophozoite stage, schizont stage, and 
merozoite stage, all observed within red blood cells in infected humans 
(see Figure 24-1 OB, C, D). (Adapted from Z. Bozdech et al., PLoS Biol. 1 :E5, 
2003. With permission from Public Library of Science.) 
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All Aspects of Viral Propagation Depend on Host Cell Machinery 

Bacteria, yeast, and protozoan pathogens are cells themselves. Even as intracel- 
lular pathogens, they use their own machinery for DNA replication, transcrip- 
tion, and translation, and they provide their own sources of metabolic energy 
Viruses, by contrast, are the ultimate hitchhikers, carrying little more than infor- 
mation in the form of nucleic acid. The information is largely replicated, pack- 
aged, and preserved by the host cells (Figure 24-12). Viruses have a small 
genome, made up of a single nucleic acid type — either DNA or RNA — which, in 
either case, may be single- stranded or double-stranded. The genome is pack- 
aged in a protein coat, which in some viruses is further enclosed by a lipid enve- 
lope. 

Viruses replicate in various ways. In general, replication involves (1) dis- 
assembly of the infectious virus particle, (2) replication of the viral genome, (3) 
synthesis of the viral proteins by the host cell translation machinery, and (4) 
reassembly of these components into progeny virus particles. A single virus 
particle (a virion) that infects a single host cell can produce thousands of 
progeny in the infected cell. Such prodigious multiplication often kills the host 
cell: the infected cell breaks open (lyses) and thereby allows the progeny virions 
access to nearby host cells. Many of the clinical manifestations of some kinds of 
viral infection reflect this cytolytic effect of the virus. Both the cold sores formed 
by herpes simplex virus and the lesions caused by the smallpox virus, for exam- 
ple, reflect the killing of the epidermal cells in a local area of infected skin. As dis- 
cussed earlier and again later, some host cell death is also caused by the immune 
responses to the virus. 

Virions come in a wide variety of shapes and sizes, and, unlike cellular life 
forms, they cannot be systematically classified by their relatedness into a single 
phylogenetic tree. Because of their tiny sizes, we now have complete genome 
sequences for nearly all clinically important viruses. The virions of poxvirus are 
among the largest, up to 450 nm long, which is about the size of some small bac- 
teria. Their genome of double-stranded DNA consists of about 270,000 
nucleotide pairs. At the other end of the size scale are the virions of parvovirus, 
which are less than 20 nm in diameter and have a single -stranded DNA genome 
of under 5000 nucleotides (Figure 24-13). The genetic information in a virus can 
be carried in a variety of unusual nucleic acid forms (Figure 24-14). 
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Figure 24-1 2 A simple viral life cycle. 

The hypothetical virus shown consists of 
a small double-stranded DNA molecule 
that codes for only a single viral capsid 
protein. No known virus is this simple. 
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Figure 24-13 Examples of viral 
morphology. As shown, both DNA and RNA 
viruses vary greatly in both size and shape. 
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Figure 24-14 Schematic drawings of 
several types of viral genomes. The 

smallest viruses contain only a few genes 
and can have an RNA or a DNA genome. 
The largest viruses contain hundreds of 
genes and have a double-stranded DNA 
genome. The peculiar ends (as well as the 
circular forms) overcome the difficulty of 
replicating the last few nucleotides at the 
end of a DNA strand (discussed in 
Chapter 5). 



The capsid that encloses the viral genome is made of one or several proteins, 
arranged in regularly repeating layers and patterns; the viral genome together 
with the capsid is called a nucleocapsid. In enveloped viruses, the nucleocapsid 
is enclosed by a lipid bilayer membrane that the virus acquires in the process of 
budding from the host cell plasma membrane (Figure 24-15). Whereas nonen- 
veloped viruses usually leave an infected cell by lysing it, an enveloped virus can 
leave the cell by budding, without disrupting the plasma membrane and, there- 
fore, without killing the cell. Enveloped viruses can cause persistent infections 
that may last for years, often without noticeable deleterious effects on the host. 

Despite this variety, all viral genomes encode three types of proteins: pro- 
teins for replicating the genome, proteins for packaging the genome and deliv- 
ering it to more host cells, and proteins that modify the structure or function of 
the host cell to enhance the replication of the virions (Figure 24-16). In the sec- 
ond section of this chapter, we focus primarily on this third class of viral pro- 
teins. Many viral genomes also encode a fourth class of proteins, which modu- 
late or subvert the host's normal immune defense mechanisms. Several of these 
are described in the final section of this chapter. 
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Figure 24-1 5 Acquisition of a viral 
envelope. (A) Electron micrograph of an 
animal cell from which six copies of an 
enveloped virus (Semliki forest virus) are 
budding. (B) Schematic drawing of the 
envelope assembly and budding 
processes. The lipid bilayer that 
surrounds the viral capsid is derived 
directly from the plasma membrane of 
the host cell. In contrast, the proteins in 
this lipid bilayer (shown in green) are 
encoded by the viral genome. 
(A, courtesy of M. Olsen and G. Griffith.) 
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Since the host cell's machinery performs most of the critical steps in viral 
replication, the identification of effective antiviral drugs is problematic. Whereas 
the antibiotic tetracycline specifically poisons bacterial ribosomes, for example, 
it will not be possible to find a drug that specifically poisons viral ribosomes, as 
viruses use the host cell's ribosomes to make their proteins. The best strategy for 
containing viral diseases is to prevent them by vaccinating the potential hosts. 
Highly successful vaccination programs have effectively eliminated smallpox 
from the planet, and the eradication of poliomyelitis may be imminent (Figure 
24-17). 



Prions Are Infectious Proteins 

All information in biological systems is encoded by structure. We are used to 
thinking of biological information in the form of nucleic acid sequences (as in 
our description of viral genomes), but the sequence itself is a shorthand code for 
describing nucleic acid structure. The replication and expression of the infor- 
mation encoded in DNA and RNA depend strictly on the structure of these 
nucleic acids and their interactions with other macromolecules. The propaga- 
tion of genetic information primarily requires that the information be stored in 
a structure that can be duplicated from unstructured precursors. Nucleic acid 
sequences are the simplest and most robust solution that organisms have found 
to the problem of faithful structural replication. 

Nucleic acids are not the only solution, however. Prions are infectious 
agents that replicate in the host by copying an aberrant protein structure. They 
have been found to occur in organisms ranging from yeasts to sea slugs to 
humans, and they cause various neurodegenerative diseases in mammals. The 
most well-known infection caused by prions is bovine spongiform encephalopa- 
thy (BSE, or mad cow disease), which occasionally spreads to humans who eat 
infected parts of the cow (Figure 24-18); it can also be transmitted from human 
to human via blood transfusions. Isolation of the infectious prions that cause the 
disease scrapie in sheep, followed by years of painstaking laboratory characteri- 
zation of scrapie-infected mice, eventually established that the protein itself is 
infectious. 

Intriguingly, the host makes the infectious prion protein, and the prion's 
amino acid sequence is identical to that of a normal host protein. Moreover, the 
prion and normal forms of the protein are indistinguishable in their post-trans- 
lational modifications. The only difference between them appears to be in their 
folded three-dimensional structure. The misfolded prion protein tends to aggre- 
gate to form regular helical fibers called amyloid. The amyloid fibers grow at the 



Figure 24-1 6 A map of the HIV genome. 

This retroviral RNA genome consists of 
about 9000 nucleotides and contains 
nine genes, the locations of which are 
shown in green and red. Three of the 
genes (green) are common to all 
retroviruses: Gag encodes capsid 
proteins, Env encodes envelope proteins, 
and Pol encodes both the reverse 
transcriptase (which copies the RNA into 
DNA) and the integrase (which inserts the 
DNA copy into the host cell genome) 
(discussed in Chapter 5). The HIV genome 
is unusually complex, because in addition 
to the three large genes (green) normally 
required for the retrovirus life cycle, it 
contains six small genes (red). At least 
some of these small genes encode 
proteins that regulate viral gene 
expression (Tat and Rev — see Figure 
7-1 03); others encode proteins that 
modify host cell processes, including 
protein trafficking (Vpu and Nef) and 
progression through the cell cycle (Vpr). 
As indicated by the red lines, RNA splicing 
(using host cell spliceosomes) is required 
to produce the Rev and Tat proteins. 
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Figure 24-1 7 Eradication of a viral 
disease through vaccination. The graph 
shows the number of cases of 
poliomyelitis reported per year in the 
United States. The arrows indicate the 
timing of the introduction of the Salk 
vaccine (inactivated virus given by 
injection) and the Sabin vaccine (live 
attenuated virus given orally). 
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ends, much like the cytoskeletal protein filaments discussed in Chapter 16, 
except that the protein subunits undergo a structural conversion from the nor- 
mal folded form of the protein to the misfolded form as they become part of the 
amyloid polymer (see Figure 6-95) . In other words, the misfolded prion form has 
the remarkable capacity to cause the normal protein to adopt its misfolded 
prion conformation and thereby to become infectious, which is equivalent to 
the prion's having replicated itself in the host. When one of the amyloid fibrils is 
broken into smaller pieces, each one can seed the conversion process in a new 
cell; the prion can therefore propagate as well as replicate. If eaten by another 
susceptible host, these newly misfolded prions can transmit the infection from 
organism to organism. 

It is not known how most normal proteins are able to find the single, correct, 
folded conformation, among the billions of other possibilities, without becom- 
ing stuck in dead-end intermediates (discussed in Chapters 3 and 6). Prions are 
a good example of how protein folding can go dangerously wrong. But why are 
the prion diseases so uncommon? What are the constraints that determine 
whether a misfolded protein will behave like a prion, or simply get refolded or 
degraded by the cell that made it? We do not yet have answers to these questions, 
and the study of prions remains an area of intense research. 



Infectious Disease Agents Are Linked To Cancer, Heart Disease, 
and Other Chronic Illnesses 
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Figure 24-18 Neural degeneration in a 
prion infection. This micrograph shows a 
slice from the brain of a person who died 
of kuru. Kuru is a human prion disease, 
very similar to BSE, that was spread from 
one person to another by ritual mortuary 
practices in New Guinea. The large fluid- 
filled holes are places where neurons 
have died. These characteristic holes are 
why prion-based neurological diseases 
are called spongiform encephalopathies. 
(Courtesy of Gary Baumbach.) 



Thus far, we have considered microorganisms primarily in their roles as 
causative agents of infectious disease. It is clear, however, that in many cases 
viral and bacterial infections can contribute to the pathogenesis of important 
life-threatening illnesses that are not normally classified as infectious diseases. 
One obvious example is cancer. The oncogene concept, which is that certain 
altered genes can trigger cell transformation and tumor development, came ini- 
tially from studies of the Rous sarcoma virus, which causes a form of cancer (sar- 
comas) in chickens. One of the genes encoded by the virus was eventually found 
to encode an overactive homolog of the host tyrosine kinase Src, which has since 
been implicated in many kinds of cancer. 

Although Rous sarcoma virus does not cause cancer in humans, several 
human cancers are now known to have a viral origin. Human papillomavirus, 
for example, which causes genital warts, is also responsible for more than 90% 
of cervical cancers. Worldwide, cervical cancer is the second most common can- 
cer in women and has a mortality rate of -40%. In wealthy countries, widespread 
screening using the Pap smear test has reduced the incidence and severity of 
cervical cancer, but it is still very common in developing countries. The recent 
development of a vaccine against the most abundant cancer-associated strains 
of human papillomavirus raises the hope that this form of cancer can be largely 
prevented worldwide by a simple and cost-effective measure. 

The Epstein-Barr virus (EBV) provides a more complex example of human 
cancer linked to a viral infection. Infection by this DNA virus is so common that 
nearly 90% of adults in the United States over the age of 40 have detectable lev- 
els of anti-EBV antibodies in their blood. EBV prefers to invade B cells of the 
adaptive immune system, especially long-lived memory B cells (discussed in 
Chapter 25). Most people infected as children have few symptoms and are 
unaware that they have been infected, but teenagers and young adults infected 
for the first time often develop mononucleosis (also called glandular fever) , a 
severe flu-like disease that can lead to high fever, painful swelling of lymph 
nodes, and fatigue that can persist for several months. After symptoms subside, 
EBV can remain dormant in the B cells for life, with its genome maintained as an 
extrachromosomal plasmid in the B cell nucleus. Some of the gene products 
encoded by the EBV genome inhibit apoptosis and thereby presumably help to 
prevent the virus from being cleared from the body. Thus, when an infected B 
cell acquires cancer-promoting mutations, the usual mechanism for eliminating 
precancerous cells by apoptosis is inhibited, and a form of B cell cancer called 
Burkitt's lymphoma may develop. 
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In some cases, chronic tissue damage caused by infection can increase the 
likelihood of cancer developing in the infected tissue. The stomach- dwelling 
bacterium Helicobacter pylori has been implicated as a major cause of stomach 
cancers as well as gastric ulcers, and the hepatitis viruses that cause chronic 
infections in the liver (chronic hepatitis) are associated with more than 60% of 
liver cancers. 

Along with cancer, the other major cause of death in wealthy industrialized 
nations is cardiovascular disease, frequently brought on by atherosclerosis, the 
accumulation of fatty deposits in blood vessel walls that can block blood flow 
The resulting ischemia has dire consequences in the heart and brain. A hallmark 
of early atherosclerosis is the appearance in blood vessel walls of clumps of 
strange-looking macrophages, called foam cells because they are loaded with 
engulfed fatty globules. The foam cells secrete cytokines that recruit other white 
blood cells into the forming atherosclerotic plaque, which also accumulates 
extracellular matrix. The continued accumulation of cells and matrix can grad- 
ually block blood flow, or, alternatively, the plaque can rupture, causing an over- 
lying thrombus to form, which acutely blocks blood flow; moreover, pieces of the 
thrombus can break off to form emboli that block smaller blood vessels down- 
stream. Interestingly, foam cells in atherosclerotic plaques often contain the 
bacterial pathogen Chlamydia pneumoniae, which commonly causes pneumo- 
nia in humans (Figure 24-19). Numerous lines of evidence suggest that C. pneu- 
moniae infection is a significant risk factor for atherosclerosis in humans and 
animal models. DNA from other bacterial species has also been found in 
atherosclerotic plaques, including DNA from bacteria usually associated with 
teeth and gums, such as Porphyromonas gingivalis. The connection between 
infectious agents and atherosclerosis is an area of active current research. 

In addition to contributing to the life-threatening conditions of cancer and 
cardiovascular disease, infectious agents are also thought to have a role in many 
chronic illnesses, although it is often hard to tell whether infection causes these 
diseases or is a consequence of the diseases. A clear-cut case of an infectious 
cause for a chronic ailment is Lyme disease, a bacterial infection caused by the 
spirochete Borrelia burgdorferi. The infection is acquired by a tick bite and can 
cause painful chronic arthritis if it is not detected and treated early with anti- 
biotics. Several other bacterial infections, particularly infections by Gram-posi- 
tive cocci and by small bacteria without a wall, called Mycoplasma, can also trig- 
ger immune responses leading to arthritis. In some people, Mycoplasma, 
Chlamydia pneumoniae, or both, are associated with chronic asthma. As we 
learn more about the complex interactions between pathogens and the human 
body, it seems likely that more and more chronic conditions will be found to 
have a link to an infectious agent. As has been the case with peptic ulcers, cur- 
ing the infection may cure the disease, or at least alleviate the painful symptoms. 
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Figure 24-1 9 Chlamydia pneumoniae 
within a foam cell macrophage in an 
atherosclerotic plaque. The labels on this 
electron micrograph indicate: 
EB, elementary body (bacterium); FG, fat 
globule; N, macrophage nucleus. (From 
LA. Campbell and C.C. Kuo, Nat. Rev. 
Microbiol. 2:23-32, 2004. With permission 
from Macmillan Publishers Ltd.) 
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Summary 

Infectious diseases are caused by pathogens, which include bacteria, fungi, protozoa, 
worms, viruses, and even infectious proteins called prions. All pathogens must have 
mechanisms for entering their host and for evading immediate destruction by the host. 
Most bacteria are not pathogenic. Those that are contain specific virulence genes that 
mediate interactions with the host, eliciting responses from the host cells that promote 
the replication and spread of the pathogen. Pathogenic fungi, protozoa, and other 
eucaryotic parasites typically pass through several different forms during the course of 
infection; the ability to switch among these forms is usually required for the parasites 
to survive in a host and cause disease. In some cases, such as malaria, parasites must 
pass sequentially through several host species to complete their life cycles. Unlike bac- 
teria and eucaryotic parasites, viruses have no metabolism of their own and no intrin- 
sic ability to produce the proteins encoded by their DNA or RNA genomes; they rely 
entirely on subverting the machinery of the host cell to produce their proteins and to 
replicate their genomes. Prions, the smallest and simplest infectious agents, contain no 
nucleic acid; instead, they are rare, aberrantly folded proteins that replicate by cat- 
alyzing the misfolding of normal host proteins with the same amino acid sequence as 
the prion. 



CELL BIOLOGY OF INFECTION 

The mechanisms that pathogens use to cause disease are as diverse as the 
pathogens themselves. Nonetheless, all pathogens must carry out certain com- 
mon tasks: they must colonize the host, reach an appropriate niche, avoid host 
defenses, replicate, and exit from the infected host to spread to an uninfected 
one. In this section, we examine the common strategies that many pathogens 
use to accomplish these tasks. 



Pathogens Cross Protective Barriers to Colonize the Host 

The first step in infection is for the pathogen to colonize the host. A thick and 
fairly tough covering of skin protects most parts of the human body from the 
environment. The protective boundaries of some other human tissues (eyes, 
nasal passages, respiratory tract, mouth, digestive tract, urinary tract, and 
female genital tract) are less robust. In the lungs and small intestine, for exam- 
ple, where oxygen and nutrients, respectively, are absorbed from the environ- 
ment, the barrier is just a single monolayer of epithelial cells. 

Skin and many other epithelial barriers are densely populated by normal 
flora. Some pathogens also colonize these surfaces and attempt to outcompete 
the normal flora, but most pathogens avoid such competition by crossing the 
barriers to gain access to unoccupied niches within the host. 

Wounds in barrier epithelia allow pathogens direct access to such niches. 
This avenue of entry requires little in the way of pathogen specialization, and 
many members of the normal flora can cause serious illness if they enter 
through such wounds. Anaerobic bacteria of the genus Bacteroides, for example, 
are carried as harmless flora at very high density in the large intestine, but they 
can cause life-threatening peritonitis if they enter the peritoneal cavity through 
a perforation in the intestinal wall caused by trauma, surgery, or infection. 
Staphylococcus from the skin and nose, or Streptococcus from the throat and 
mouth, are also responsible for many serious infections resulting from breaches 
in epithelial barriers. 

Primary pathogens, however, need not wait for a wound to gain access to 
their host. A particularly efficient way for a pathogen to cross the skin is to catch 
a ride in the saliva of a biting arthropod. Many insects and ticks nourish them- 
selves by sucking mammalian blood, and a diverse group of bacteria, viruses, 
and protozoa have developed the ability to survive in arthropods and then use 
them as vectors to spread from one mammalian host to another. As discussed 
earlier, the Plasmodium protozoan that causes malaria develops through several 
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forms in its life cycle, including some that are specialized for survival in a human 
and some that are specialized for survival in a mosquito (see Figure 24-10). 
Viruses that are spread by insect bites include the causative agents for several 
types of hemorrhagic fever, including yellow fever and Dengue fever, as well as 
the causative agents for many kinds of viral encephalitis (inflammation of the 
brain). All these viruses replicate in both insect cells and mammalian cells, as 
required for their transmission by an insect vector. Bloodborne viruses such as 
HIV that are not capable of replicating in insect cells are rarely, if ever, spread 
from insect to human. 

The efficient spread of a pathogen via an insect vector requires that individ- 
ual insects consume blood meals from numerous mammalian hosts. In a few 
striking cases, the pathogen appears to alter the behavior of the insect so that its 
transmission is more likely. Like most animals, the tsetse fly (whose bite spreads 
the protozoan parasite Trypanosoma brucei, which causes sleeping sickness in 
Africa) stops eating when it is satiated. Tsetse flies carrying trypanosomes, how- 
ever, bite much more frequently and ingest more blood than do uninfected flies. 
The presence of trypanosomes impairs the function of the insect mechano- 
receptors that measure blood flow through the gullet to assess the fullness of the 
stomach, effectively fooling the tsetse fly into thinking that it is still hungry. The 
bacterium Yersinia pestis, which causes bubonic plague, uses a different mech- 
anism to ensure that a flea carrying it bites repeatedly: it multiplies in the flea's 
foregut to form aggregated masses that eventually enlarge and physically block 
the digestive tract. The insect is then unable to feed normally and begins to 
starve. During repeated attempts to alleviate its hunger by feeding, some of the 
bacteria in the foregut are flushed into the bite site, thus transmitting plague to 
a new host (Figure 24-20). 



Pathogens That Colonize Epithelia Must Avoid Clearance by the 
Host 
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Figure 24-20 Plague bacteria within a 
flea. This light micrograph shows the 
digestive tract dissected from a flea that 
had dined about two weeks previously on 
the blood of an animal infected with the 
plague bacterium, Yersinia pestis. The 
bacteria multiplied in the flea gut to 
produce large cohesive aggregates {red 
arrows); the bacterial mass on the left is 
occluding the passage between the 
esophagus and the midgut. This type of 
blockage prevents a flea from digesting its 
blood meals, so that hunger causes it to 
bite repeatedly, disseminating the 
infection. (From B.J. Hinnebusch, 
E.R. Fischer and T.G. Schwann, J. Infect. Dis. 
178:1406-1415, 1998. With permission 
from University of Chicago Press.) 



Hitching a ride through the skin on an insect proboscis is just one strategy that 
pathogens use to pass through host barriers. Whereas many epithelial barriers 
such as the skin and the lining of the mouth and large intestine are densely pop- 
ulated by normal flora, others, including the lining of the lower lung, the small 
intestine, and the bladder, are normally kept nearly sterile, despite the presence 
of a relatively direct route to the outside world. How do these epithelia avoid 
bacterial colonization? As discussed in Chapter 23, a layer of protective mucus 
covers the respiratory epithelium, and the coordinated beating of cilia sweeps 
the mucus and trapped bacteria and debris up and out of the lung. The epithe- 
lia lining the bladder and the upper gastrointestinal tract also have a thick layer 
of mucus, and these organs are periodically flushed by urination and by peri- 
stalsis, respectively, which washes away undesirable microbes. The pathogenic 
bacteria and parasites that infect these epithelial surfaces have specific mecha- 
nisms for overcoming these host- cleaning processes. Those that infect the uri- 
nary tract, for example, resist the washing action of urine by adhering tightly to 
the epithelium lining the tract via specific adhesins, which are proteins or pro- 
tein complexes that recognize and bind to host cell- surface molecules. An 
important group of adhesins in E. coli strains that infect the kidney are compo- 
nents of the P pili that help the bacteria adhere to the kidney epithelial cells. 
These surface projections can be several micrometers long and are thus able to 
span the thickness of the protective mucus layer (see Figure 24-4D). At the tip of 
each pilus is an adhesin protein that binds tightly to a particular glycolipid dis- 
accharide that is found on the surface of kidney cells. Strains of E. coli that infect 
the bladder rather than the kidney express a second kind of pilus that enables 
them to adhere to bladder epithelial cells. It is the adhesion specificity of the 
adhesin proteins on the tips of two types of pili that is responsible for the bacte- 
ria's colonizing the different parts of the urinary tract (Figure 24-21). The speci- 
ficity of the adhesins also restricts the host range for these and many other 
pathogenic bacteria. 
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One of the hardest organs for a microbe to colonize is the stomach. Besides 
the thick layer of mucus and peristaltic washing, the stomach is filled with acid 
(average pH ~2). This extreme environment is lethal to almost all bacteria 
ingested in food. Nonetheless, the stomach is colonized by the hardy and enter- 
prising bacterium Helicobacter pylori, which was recognized only recently as the 
major causative agent of stomach ulcers and some cases of stomach cancer. 
Remarkably, it is able to persist for life as a harmless commensal in most of its 
hosts. Although the older treatments for ulcers (acid-reducing drugs and bland 
diets) are still used to reduce inflammation, a short and relatively cheap course 
of antibiotics can now effectively cure a patient of recurrent stomach ulcers. The 
hypothesis that a persistent bacterial infection of the stomach lining could 
cause stomach ulcers was initially met with great skepticism. The young Aus- 
tralian doctor who made the initial discovery finally proved the point: he drank 
a flask of a pure culture of H. pylori and developed gastritis. One way in which H. 
pylori survives in the stomach is by producing the enzyme urease, which con- 
verts urea to ammonia and carbon dioxide; in this way, the bacterium surrounds 
itself with a layer of ammonia, which neutralizes the acid in its immediate vicin- 
ity. The bacteria also express at least five types of adhesins, which enable them 
to adhere to the stomach epithelium, and they produce several cytotoxins that 
destroy the stomach epithelial cells, creating painful ulcers. The resulting 
chronic inflammation promotes cell proliferation and thus predisposes the 
infected individual to stomach cancer. 

Bordetella pertussis, the bacterium that causes whooping cough, provides 
another remarkable example of active colonization. The first step in a B. pertus- 
sis infection is colonization of the respiratory epithelium. The bacteria circum- 
vent the normal clearance mechanism that clears the respiratory tract (the 
mucociliary escalator described in Chapter 23) by binding tightly to the surface 
of the ciliated cells that line the tract and multiplying on them. B. pertussis 
expresses at least four adhesins that bind tightly to particular glycolipids on the 
ciliated cells. The adherent bacteria produce various toxins that eventually kill 
the ciliated cells, compromising the host's ability to clear the infection. The most 
familiar of these is pertussis toxin, which — like cholera toxin — is an ADP-ribosyl- 
ating enzyme. It ADP-ribosylates the a subunit of the G protein Gi, inhibiting the 
G protein from suppressing the activity of the host cell's adenylyl cyclase, 
thereby increasing the production of cyclic AMP (discussed in Chapter 15). This 
toxin also interferes with the chemotactic pathway that neutrophils use to seek 
out and destroy invading bacteria (see Figure 16-101). Not content with this, B. 
pertussis also produces an adenylyl cyclase of its own, which is active only when 
bound to the eucaryotic Ca 2+ -binding protein calmodulin in the cytoplasm of the 
host cell. Although both B. pertussis and V. cholerae drastically increase cAMP lev- 
els in the host cells to which they adhere, the symptoms of the diseases differ 
because the two bacteria colonize different sites in the host: B. pertussis colonizes 
the respiratory tract and causes paroxysmal coughing, whereas V. cholerae colo- 
nizes the gut and causes watery diarrhea. 



Figure 24-21 Uropathogenic E. coli in 
the infected bladder of a mouse. 

(A) Scanning electron micrograph of 
uropathogenic E. coli, a common cause of 
bladder and kidney infections. The 
bacteria are attached to the surface of 
epithelial cells lining the infected bladder. 

(B) A close-up view of one of the bacteria 
showing the pili on its surface. (A, from 
G.E. Soto and SJ. Hultgren, J. Bad 

1 81 :1 059-1 071, 1 999; B, courtesy of 
D.G. Thanassi and SJ. Hultgren, Meth. 
Comp. Meth. Enzym. 20:1 1 1 -1 26, 2000. 
With permission from Academic Press.) 
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Figure 24-22 Interaction of enteropathogenic E. coli (EPEC) with host cells in the gut. (A) When EPEC contacts an 
epithelial cell in the lining of the human gut, it delivers a bacterial protein called Tir into the host cell through a type III 
secretion system. Tir then inserts into the plasma membrane of the host cell, where it functions as a receptor for the 
bacterial adhesin protein intimin. (B) A host cell protein tyrosine kinase phosphorylates the intracellular domain of Tir on 
tyrosines. Phosphorylated Tir then recruits various host cell proteins in turn, which trigger actin polymerization. 
Consequently, a dense accumulation of actin filaments assembles underneath the bacterium, forming an actin pedestal. 
(C) EPEC on a pedestal. In this fluorescence micrograph, the DNA of the EPEC and host cell are labeled in blue,T\r protein is 
labeled in green, and host cell actin filaments are labeled in red. The inset shows a close up view of the two upper bacteria 
on pedestals. (C, from D. Goosney et al., Annu. Rev. Cell Dev. Biol. 1 6:1 73-1 89, 2000. With permission from Annual Reviews.) 



Not all examples of specific colonization require that the bacterium express 
adhesins that bind to host cell glycolipids or proteins. Enteropathogenic E. coli, 
which causes diarrhea in young children, instead uses a type III secretion system 
(see Figure 24-8) to deliver its own bacterially produced receptor protein (called 
Tir) into its host cell (Figure 24-22A) . After Tir inserts into the host cell's plasma 
membrane, a bacterial surface protein binds to the extracellular domain of Tir, 
triggering a remarkable series of events inside the host cell. First, a host protein 
tyrosine kinase phosphorylates the Tir receptor protein on tyrosines, which is 
unusual because bacteria generally do not phosphorylate tyrosines on proteins. 
The phosphorylated Tir is then thought to recruit a member of the Rho family of 
small GTPases, which promotes actin polymerization through a series of inter- 
mediate steps (discussed in Chapter 16). The polymerized actin then forms a 
unique cell- surface protrusion, called a pedestal, that pushes the tightly adher- 
ent bacteria up about 10 jam from the host cell surface (Figure 24-22B, C). 

These examples of host colonization illustrate the importance of 
host-pathogen communication in both the infection process and its evolution. 
Pathogenic organisms have acquired genes that encode proteins that interact 
specifically with particular molecules of the host cells. In some cases, such as the 
B. pertussis adenylyl cyclase, an ancestor of the pathogen may have acquired the 
cyclase gene from its host, whereas in others, such as Tir, random mutation may 
have produced protein motifs that are recognized by a eucaryotic protein tyro- 
sine kinase. 



Intracellular Pathogens Have Mechanisms for Both Entering and 
Leaving Host Cells 

Many pathogens, including V. cholerae and B. pertussis, infect their host without 
entering host cells; they are referred to as extracellular pathogens. Others, how- 
ever, including all viruses and many bacteria and protozoa, are intracellular 
pathogens. Their preferred niche for replication and survival is within the 
cytosol or intracellular compartments of particular host cells. This strategy has 
several advantages. The pathogens are not accessible to antibodies (discussed in 
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Chapter 25), and they are not easy targets for phagocytic cells (discussed later); 
furthermore, they are bathed in a rich source of many sugars, amino acids, and 
other nutrients present in host cell cytoplasm. This lifestyle, however, requires 
that the pathogen develop mechanisms for entering host cells, for finding a suit- 
able subcellular niche where it can replicate, and for exiting from the infected 
cell to spread the infection. In the remainder of this section, we consider some 
of the myriad ways that individual intracellular pathogens exploit and modify 
host cell biology to satisfy these requirements. 



Virus Particles Bind to Molecules Displayed on the Host Cell 
Surface 

The first step for any intracellular pathogen is to bind to the surface of the host 
target cell. Viruses accomplish this binding through the association of a viral 
surface protein with a specific receptor on the host cell surface. Of course, no 
host cell receptor evolved for the sole purpose of allowing a pathogen to bind to 
it; these receptors all have other functions. The first such "virus receptor" iden- 
tified was the E. coli surface protein that allows the bacteriophage lambda to 
bind to the bacterium. Its normal function is as a transport protein responsible 
for the uptake of maltose. Receptors need not be proteins, however; herpes sim- 
plex virus, for example, binds to heparan sulfate proteoglycans through specific 
viral membrane proteins. 

Virions that infect animal cells generally use cell- surface receptor molecules 
that are either very abundant (such as sialic- acid- containing oligosaccharides, 
which are used by the influenza virus) or uniquely found on those cell types in 
which the virions can replicate (such as the nerve growth factor receptor, the 
nicotinic acetylcholine receptor, or the cell-cell adhesion protein N-CAM, all of 
which are used by rabies virus to infect neurons specifically). Often, many types 
of virus use a single type of receptor, and some viruses can use several different 
receptors. Moreover, different viruses that infect the same cell type may each use 
a different receptor. For example, members of at least six virus families, all of 
which preferentially replicate in liver cells (hepatocytes), cause hepatitis. Recep- 
tors for four of these have been identified, and they all differ. Many virions bind 
to receptors expressed on cells of the immune system. While seemingly para- 
doxic, as we might expect that triggering an immune response does not enhance 
viral survival, invading an immune cell may be a useful way to travel around the 
body and be taken to lymphoid organs, which are filled with other immune sys- 
tem cells. 

Virions often require both a primary receptor and a secondary co-receptor 
for efficient attachment and entry into host cells. An important example is the 
AIDS virus HIV. Its primary receptor is CD4, a cell- surface protein on helper T 
cells and macrophages that is involved in immune recognition (discussed in 
Chapter 25). Viral entry also requires a co-receptor, either CCR5 (a receptor for (3- 
chemokines) or CXCR4 (a receptor for oc-chemokines), depending on the partic- 
ular variant of the virus (Figure 24-23). Macrophages are susceptible only to HIV 
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Figure 24-23 Receptor and co-receptors 
for HIV. All strains of HIV require the CD4 
protein as a primary receptor. Early in an 
infection, most of the viruses use CCR5 as 
a co-receptor, allowing them to infect 
macrophages and their precursors, 
monocytes. As the infection progresses, 
mutant variants arise that now use CXCR4 
as a co-receptor, enabling them to infect 
helper T cells efficiently. The natural 
ligand for the chemokine receptors (Sdfl 
for CXCR4; Rantes, Mip1 a, or Mip1 p for 
CCR5) blocks co-receptor function and 
prevents viral invasion. 
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variants that use CCR5 for entry, whereas helper T cells are most efficiently 
infected by variants that use CXCR4. The viruses that are found within the first 
few months after HIV infection almost invariably require CCR5, which presum- 
ably explains why individuals fortunate enough to carry a defective Ccr5 gene are 
not susceptible to HIV infection. In the later stages of infection, viruses may 
either switch to use the CXCR4 co-receptor or adapt to use both co-receptors; in 
this way, the virus can change the cell types it infects as the disease progresses. 



Virions Enter Host Cells by Membrane Fusion, Pore Formation, or 
Membrane Disruption 

After recognition and attachment to the host cell surface, the typical next steps 
for a virion are to enter the host cell and release its nucleic acid genome from its 
protective protein coat or lipid envelope. In most cases, the liberated nucleic acid 
remains complexed with some viral proteins. Enveloped viruses enter the host 
cell by fusing either with the plasma membrane or with the endosomal mem- 
brane following endocytosis (Figure 24-24A,B). Fusion is thought to proceed via 
a mechanism similar to SNARE -mediated fusion of intracellular vesicles during 
normal vesicular traffic (discussed in Chapter 13). <ATAG> 

The virus regulates fusion both to ensure that its particles fuse only with the 
appropriate host cell membrane and to prevent the particles from fusing with 



Figure 24-24 Four virus uncoating 
strategies. (A) Some enveloped viruses, such 
as HIV, fuse directly with the host cell plasma 
membrane to release their genome (blue) 
and capsid proteins (orange) into the cytosol. 

(B) Other enveloped viruses, such as 
influenza virus, first bind to cell-surface 
receptors, triggering receptor-mediated 
endocytosis. When the endosome acidifies, 
the virus envelope fuses with the endosomal 
membrane, releasing the viral genome (blue) 
and capsid proteins (orange) into the cytosol. 

(C) Poliovirus, a nonenveloped virus, binds to 
a receptor (green) on the host cell surface 
and then forms a pore (not shown) in the 
host cell membrane to extrude its RNA 
genome (blue). (D) Adenovirus, another 
nonenveloped virus, uses a more 
complicated strategy. It induces receptor- 
mediated endocytosis and then disrupts the 
endosomal membrane, releasing part of the 
capsid and its DNA genome into the cytosol. 
The trimmed-down virus eventually docks 
onto a nuclear pore and releases its DNA 
(red) directly into the nucleus. 
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one another. For viruses such as HIV that fuse at neutral pH at the plasma mem- 
brane, binding to receptors or co-receptors usually triggers a conformational 
change in the viral envelope protein to expose a normally buried fusion peptide 
(see Figure 13-16). Other enveloped viruses, such as influenza, only fuse with a 
host cell membrane after endocytosis; in this case, it is frequently the acid envi- 
ronment in the early endosome that triggers the conformational change in a 
viral surface protein that exposes the fusion peptide (see Figure 24-24B). The H + 
pumped into the early endosome enters the influenza particle through an ion 
channel and triggers the uncoating of the viral RNA, which is directly released 
into the cytosol as the virus fuses with the endosomal membrane. For some 
viruses, uncoating occurs after release into the cytosol. In the case of Semliki for- 
est virus, for example, the binding of host ribosomes to the capsid causes the 
capsid proteins to separate from the viral genome. 

It is more difficult to envision how nonenveloped viruses enter host cells, as 
it is not obvious how large assemblies of protein and nucleic acid can cross the 
plasma or endosomal membrane. Where the entry mechanism is understood, 
nonenveloped viruses generally either form a pore in the cell membrane to 
deliver the viral genome into the cytoplasm or they disrupt the endosomal 
membrane after endocytosis. 

Poliovirus uses the first strategy. Binding of poliovirus to its receptor triggers 
both receptor-mediated endocytosis and a conformational change in the viral 
particle. The conformational change exposes a hydrophobic projection on one 
of the capsid proteins, which apparently inserts into the endosomal membrane 
to form a pore. The viral genome then enters the cytoplasm through the pore, 
leaving the capsid either in the endosome on the cell surface, or in both places 
(see Figure 24-24C). 

Adenovirus uses the second strategy. It is initially taken up by receptor-medi- 
ated endocytosis. As the endosome matures and becomes more acidic, the virus 
undergoes multiple uncoating steps that remove structural proteins sequen- 
tially from the capsid. Some of these steps require the action of a viral protease, 
which is inactive in the extracellular virus particle (probably because of intra - 
chain disulfide bonds) but is activated in the reducing environment of the endo- 
some. One of the proteins released from the capsid lyses the endosomal mem- 
brane, releasing the remainder of the virus into the cytosol. This trimmed-down 
virus then docks onto the nuclear pore complex, and the viral DNA genome is 
transported through the pore into the nucleus, where it is transcribed (see Fig- 
ure 24-24D). 

In these various entry strategies, viruses exploit a variety of host cell 
molecules and processes, which can include cell-surface components, receptor- 
mediated endocytosis, endosomal maturation steps, and nuclear transport. 
These strategies illustrate again the sophisticated ways that pathogens have 
evolved to utilize the basic cell biology of their hosts. 

Bacteria Enter Host Cells by Phagocytosis 

Bacteria are much larger than viruses, and they are too large to be taken up 
either through pores or by receptor-mediated endocytosis. Instead, they enter 
host cells by phagocytosis. Phagocytosis of bacteria is a normal function of 
macrophages. They patrol the tissues of the body and ingest and destroy 
unwanted microbes. Some pathogens, however, have acquired the ability to sur- 
vive and replicate within macrophages after they have been phagocytosed. 

Mycobacterium tuberculosis is one such pathogen. As discussed earlier, it 
causes tuberculosis, a serious lung infection that is widespread in some urban 
populations. It is usually acquired by inhalation of the bacterium into the lungs, 
where it is phagocytosed by alveolar macrophages. Although the microbe can 
survive and replicate within macrophages, with the help of the adaptive 
immune system, the macrophages of most healthy individuals contain the 
infection within a lesion called a tubercle. In most cases, the lesion becomes 
walled off by a fibrous capsule that eventually undergoes calcification and can 
then easily be seen on an X-ray of the lungs. Remarkably, M. tuberculosis in such 
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lesions can survive for decades, and, later in life, especially if drugs or disease 
weaken the immune system, the infection may be reactivated and spread in the 
lung and even to other organs. 

Tuberculosis has infected human populations for thousands of years, but 
another bacterium that lives within alveolar macrophages was first recognized 
as a human pathogen only in 1976. Legionella pneumophila is normally a para- 
site of freshwater amoebae, which take it up by phagocytosis. When droplets of 
water containing L. pneumophila or infected amoebae are inhaled into the lung, 
the bacteria can invade and live inside alveolar macrophages (Figure 24-25), 
which, to the bacteria, must seem just like large amoebae. This infection leads to 
the type of pneumonia known as Legionnaire's disease. The pathogen is com- 
monly spread by central air-conditioning systems, as the amoebae that are the 
bacterium's normal host are particularly adept at colonizing air-conditioning 
cooling towers; moreover, these cooling systems produce microdroplets of water 
that are easily inhaled. The incidence of Legionnaire's disease has increased dra- 
matically in recent decades, with outbreaks frequently traced to the air-condi- 
tioning systems in office buildings, hospitals, and hotels. Other forms of modern 
aerosolization are sometimes responsible, including decorative fountains and 
produce sprayers in supermarkets. 

Some bacteria invade cells that are normally nonphagocytic. One way in 
which bacteria can induce such a cell to phagocytose them is by expressing an 
adhesin that binds with high affinity to a host cell adhesion protein that the host 
cell normally uses to adhere to another host cell or to the extracellular matrix 
(discussed in Chapter 19). For example, a bacterium that causes diarrhea, 
Yersinia pseudotuberculosis (a close relative of the plague bacterium Yersinia 
pestis), expresses a protein called invasin that binds to (31 integrins, and a bac- 
terium that causes a rare but serious form of food poisoning, Listeria monocyto- 
genes, expresses a protein that binds to E-cadherin. Binding of the bacterial pro- 
teins to these transmembrane host adhesion proteins fools the host cell into 
attempting to form a cell junction, and it begins moving actin and other cytoskele- 
tal components to the site of bacterial attachment. Since the bacterium is small 
relative to the host cell, the host cell's attempt to spread over the adhesive surface 
of the bacterium results in the phagocytic uptake of the bacterium — a process 
known as the zipper mechanism of invasion (Figure 24-26A). The similarity of this 
form of invasion to the natural process of cell adhesion was revealed when the 
three-dimensional structure of invasin was determined. The bacterial protein has 
an RGD motif with a structure almost identical to the RGD motif of the integrin- 
binding site in the extracellular matrix protein laminin (discussed in Chapter 19). 

A second pathway by which bacteria can invade nonphagocytic cells is 
known as the trigger mechanism (Figure 24-26B). It is used by various pathogens, 
including Salmonella enterica, which causes food poisoning. When the bac- 
terium injects a set of effector molecules into the host cell cytoplasm through a 
type III secretion system, it initiates this dramatic form of invasion. Some of these 
effector molecules activate Rho-family GTPases, which stimulate actin polymer- 
ization (discussed in Chapter 16). Others interact with cytoskeletal elements 
more directly, severing actin filaments and causing the rearrangement of cross- 
linking proteins. The net effect is to cause dramatic localized ruffling on the sur- 
face of the host cell (Figure 24-26C), which throws up large actin-rich protrusions 
that fold over and trap the bacterium within a large endocytic vesicle called a 
macropinosome (Figure 24-26D). The overall appearance of cells being invaded 
by the trigger mechanism is similar to the dramatic ruffling induced by some 
extracellular growth factors, suggesting that similar intracellular signaling path- 
ways are activated in both cases. 
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Figure 24-25 Uptake of Legionella 
pneumophila by a human phagocyte. This 
electron micrograph shows the unusual 
coil structure induced on the surface of 
the phagocyte by the bacterium. Some 
other pathogens, including the bacterium 
Borrelia burgdorferi, which causes Lyme 
disease, the eucaryotic pathogen 
Leishmania, and the yeast Candida 
albicans, may also sometimes invade cells 
using this type of coiling phagocytosis. 
(From M.A. Horwitz, Cell 36:27-33, 1984. 
With permission from Elsevier.) 



Intracellular Eucaryotic Parasites Actively Invade Host Cells 

The host cell supplies the energy required for the uptake of viruses by receptor- 
mediated endocytosis and bacteria by phagocytosis or macropinocytosis. The 
pathogen is a relatively passive participant, usually providing a trigger to initiate 
the invasion process. In contrast, intracellular eucaryotic parasites, which are 
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typically much larger than bacteria, invade host cells through a variety of com- 
plex pathways that usually require significant energy expenditure on the part of 
the parasite. 

Toxoplasma gondii, the cat parasite that also causes occasional serious 
human infections, is an example. When this protozoan contacts a host cell, it 
protrudes an unusual microtubule-based structure called a conoid, which it uses 
to push its way slowly into the host cell. The energy for invasion seems to come 
entirely from the parasite and requires at least one highly unusual myosin (Class 
XIV; see Figure 16-57); depolymerizing the actin cytoskeleton in the parasite, but 
not in the host cell, disrupts this process. As the parasite moves into the host cell, 
a membrane derived from the invaginated host cell plasma membrane sur- 
rounds it. Remarkably, the parasite somehow removes host transmembrane pro- 
teins from the surrounding membrane as it forms, so that the parasite is pro- 
tected in a membrane-enclosed compartment, which does not fuse with lyso- 
somes and does not participate in host cell membrane trafficking processes 
(Figure 24-27). The specialized membrane allows the parasite to take up 
metabolic intermediates and nutrients from the host cell's cytosol but excludes 
larger molecules. Malaria parasites invade red blood cells by using a very similar 
mechanism. 

The protozoan Trypanosoma cruzi, which causes multiorgan Chagas dis- 
ease, mainly in Mexico and Central and South America, uses an entirely differ- 
ent, but no less peculiar, invasion strategy. After attachment to host cell surface 
receptors, this parasite induces a local elevation of Ca 2+ in the host cell's cytosol. 
The Ca 2+ signal recruits lysosomes to the site of parasite attachment, which fuse 
with the host cell's plasma membrane during the internalization process, allow- 
ing the parasites rapid access to the lysosomal compartment (Figure 24-28). As 
we discuss below, most intracellar pathogens go to great lengths to avoid expo- 
sure to the hostile, proteolytic environment of the lysosome, but Trypanosoma 
cruziuses the lysosome as its means of entry. In the lysosomal compartment, the 
parasite secretes an enzyme that removes sialic acid from lysosomal glycopro- 
teins and transfers it to its own surface molecules, thereby coating itself with 
host cell sugars. Next, the parasite secretes a pore-forming toxin that lyses the 
lysosome membrane, releasing the parasite into the host cell's cytosol, where it 
proliferates. 



Figure 24-26 Mechanisms used by 
bacteria to induce phagocytosis by 
nonphagocytic host cells. (A) The zipper 
and (B) trigger mechanisms for 
pathogen-induced phagocytosis both 
require the polymerization of actin at the 
site of bacterial entry. (C) A scanning 
electron micrograph showing a very early 
stage of Salmonella enterica invasion by 
the trigger mechanism. Each of the three 
bacteria (pseudocolored yellow) is 
surrounded by a small membrane ruffle. 
(D) Fluorescence micrograph showing 
that the large ruffles that engulf the 
Salmonella bacteria are actin-rich.The 
bacteria are labeled in green and actin 
filaments in red; because of the color 
overlap, the bacteria appear yellow. (From 
J.E. Galan, Annu. Rev. Cell Dev. Biol. 
1 7:53-86, 2001 . With permission from 
Annual Reviews.) 
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Figure 24-27 The life cycle of the intracellular parasite Toxoplasma gondii. (A) After attachment to a host cell, T. gondii 
uses its conoid to push its way into the host cell. As the host cell's plasma membrane invaginates to surround the parasite, 
the parasite somehow removes the host cell proteins associated with normal endosomes or phagosomes, so that the 
compartment (shown in red) does not fuse with lysosomes. After several rounds of replication, the parasite causes the 
compartment to break down and the host cell to lyse, releasing the progeny into the extracellular space from which they 
can infect other host cells. (B) Light micrograph of T. gondii replicating within a membrane-enclosed compartment in a 
cultured cell. (B, courtesy of Manuel Camps and John Boothroyd.) 

The microsporidiause perhaps the most bizarre active invasion mechanism. 
These tiny, obligate, intracellular, eucaryotic parasites are only about 5 jam long 
and have among the smallest known genomes for a eucaryotic cell, only 
2,900,000 nucleotide pairs. Normally, microsporidia cause disease primarily in 
insects, but they can also cause opportunistic infections in people with AIDS. 
Having adapted over a long period to a parasitic lifestyle, they depend on their 
host cells for some metabolic functions and have lost many of the genes and cell 
structures required for a free-living existence; for example, they no longer have 
mitochondria or peroxisomes. However, they do have a strange extrusion appa- 
ratus, the polar tube, that enables them to invade host cells. In the environmen- 
tally resistant spore stage of its life cycle, the polar tube is wound in a coil around 
the nucleus (Figure 24-29A). On contact with an appropriate host cell, the polar 
tube discharges explosively, uncoiling in less than 2 seconds to form a mature 
structure that can be more than ten times the length of the spore. The tip of the 
discharging polar tube, traveling at a speed of 100 um/sec, penetrates the host 
cell and delivers (apparently by osmotic pressure) the internal contents of the 
spore, including the micro sporidiaris nucleus, into the cytoplasm of the host 
cell, where the parasite replicates to form up to a hundred progeny (Figure 
24-29B and C). Eventually, the progeny mature into spores, and the host cell 



Figure 24-28 Invasion of host cells by 
Trypanosoma cruzi. This parasite recruits 
host cell lysosomes to its site of 
attachment. The lysosomes fuse with the 
invaginating plasma membrane to create 
an intracellular compartment constructed 
almost entirely of lysosomal membrane. 
After a brief stay in the compartment, the 
parasite secretes a pore-forming protein 
that disrupts the surrounding membrane, 
allowing the parasite to escape into the 
host cell cytosol and proliferate. 
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Figure 24-29 Invasion of host cells by microsporidia. (A) The spore form of the parasite is covered with a rigid coat and 
harbors a coiled polar tube that wraps many times around the nucleus. (B) The polar tube extends explosively when the 
spore comes into contact with an appropriate host cell, penetrating the host cell and delivering the nucleus and other 
contents of the spore into the cytoplasm of the host cell. The microsporidian then proliferates in the host cell. 
(C) Immunofluorescence micrograph of a spore of the microsporidian Encephalitozoon cuniculi. The parts of the spore that 
are outside the host cell are stained yellow, and the parts that are inside the host cell are stained green. The discharged 
polar tube changes from yellow to green at the point where it has entered the host cell, which is not visible. Scale bar 
10 um. (C, from C. Franzen, Trends Parasitol. 20:275-279, 2004. With permission from Elsevier.) 

lyses to release them. Microsporidia spores are sufficiently small for 
macrophages to phagocytose them. When this happens, the spores discharge 
their polar tube from within the confines of the phagosome, again delivering 
their contents into the cytosol of the host cell. 



Many Pathogens Alter Membrane Traffic in the Host Cell 

The three examples of intracellular parasites just discussed raise a general prob- 
lem that faces all intracellular pathogens, including viruses, bacteria, and 
eucaryotic parasites. They must deal in some way with membrane traffic in the 
host cell. After endocytosis by a host cell, they usually find themselves in an 
endosomal compartment that normally would fuse with lysosomes to form a 
phagolysosome. They therefore must either modify the compartment to prevent 
its fusion with lysosomes, escape from the compartment before such fusion, 
escape after fusion but before getting digested, or find ways to survive in the 
hostile environment of the phagolysosome (Figure 24-30). 

Most pathogens use the first or second strategy. As we have seen, Try- 
panosoma cruzi uses the escape route, as do essentially all viruses (see Figure 
24-24). The bacterium Listeria monocytogenes also uses this strategy. It is taken 
up into cells via the zipper mechanism discussed earlier and secretes a protein 
called listeriolysin O that forms large pores in the phagosomal membrane, 
releasing the bacteria into the cytosol before they are digested. Once in the 
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Figure 24-30 Choices that an 
intracellular pathogen faces. After entry 
into a host cell, generally through 
endocytosis or phagocytosis into a 
membrane-enclosed compartment, 
intracellular pathogens can use one of 
three strategies to survive and replicate. 
Pathogens that follow strategy (1) include 
all viruses, Trypansoma cruzi, Listeria 
monocytogenes, and Shigella flexneri. 
Those that follow strategy (2) include 
Mycobacterium tuberculosis, Salmonella 
enterica, Legionella pneumophila, and 
Chlamydia trachomatis. Those that follow 
strategy (3) include Coxiella burnetii and 
Leishmania. 
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cytosol, the bacteria continue to secrete listeriolysin O, but it does not destroy 
other cell plasma membranes for two reasons: first, it is 10 times more active at 
the acidic pH found in the phagosome than at neutral pH found in the cytosol; 
second, it is rapidly degraded in the cytosol by host cell proteasomes (see Figure 
6-80), which do not have access to the listeriolysin O in the phagosome (Figure 
24-31). 

If a pathogen is to survive and replicate in a host cell membrane-enclosed 
compartment, it must modify membrane trafficking in the host cell, and it can 
use various methods to do this. It must modify the compartment in at least two 
ways: first, it must prevent lysosomal fusion, and second, it must provide a path- 
way for importing nutrients from the host cytosol. In addition, many pathogens 
(particularly viruses) alter membrane trafficking pathways to prevent presenta- 
tion of their tell-tale foreign antigens on the host cell surface; otherwise, T cells 
could detect their presence and kill the host cell (discussed in Chapter 25). 

Different pathogens have distinct strategies for altering host cell membrane 
traffic (Figure 24-32). As we have seen, Toxoplasma gondii creates a mem- 
brane-enclosed compartment that does not participate in normal host cell 
membrane traffic and yet specifically allows nutrient import. Mycobacterium 
tuberculosis somehow prevents the very early endosome that contains it from 
maturing, so that the endosome never acidifies or acquires the characteristics 
of a late endosome or lysosome. Endosomes containing Salmonella enterica, in 
contrast, do acidify and acquire markers of late endosomes, but they arrest 
their maturation at a stage prior to lysosomal fusion. Other bacteria seem to 
find shelter in intracellular compartments that are completely distinct from the 
usual endocytic pathway. Legionella pneumophila, for example, replicates in 
compartments that are enclosed by layers of rough endoplasmic reticulum 



Figure 24-31 Selective destruction of 
the phagosomal membrane by Listeria 
monocytogenes. L. monocytogenes 
attaches to E-cadherin on the surface of 
epithelial cells and induces its own 
uptake by the zipper mechanism (see 
Figure 24-26A). Within the phagosome, 
the bacterium secretes the hydrophobic 
protein listeriolysin O, which forms 
oligomers in the host cell membrane, 
thereby creating large pores and 
eventually disrupting the membrane. 
Once in the host cell cytosol, the bacteria 
begin to replicate and continue to 
secrete listeriolysin O. Because the 
listeriolysin O in the cytosol is rapidly 
degraded by proteasomes, the host cell's 
plasma membrane remains intact. 
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Figure 24-32 Modifications of host cell 
membrane trafficking by bacterial 
pathogens. Four intracellular bacterial 
pathogens, Mycobacterium tuberculosis, 
Salmonella enterica, Legionella 
pneumophila, and Chlamydia trachomatis, 
all replicate in membrane-enclosed 
compartments, but the four 
compartments differ. M. tuberculosis 
remains in a compartment that has early 
endosomal markers and continues to 
communicate with the plasma 
membrane via transport vesicles. 
S. enterica replicates in a compartment 
that has late endosomal markers and 
does not communicate with the plasma 
membrane. L. pneumophila replicates in 
an unusual compartment that is wrapped 
in several layers of rough endoplasmic 
reticulum (ER) membrane; only one layer 
is shown for simplicity. C trachomatis 
replicates in an exocytic compartment 
that fuses with vesicles coming from the 
trans Golgi network (TGN). 
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(Figure 24-33). Chlamydia trachomatis, a sexually transmitted bacterial 
pathogen that can cause sterility and blindness, replicates in a compartment 
that seems similar to part of the exocytic pathway Some intracellular bacterial 
pathogens seem to be able to manipulate the location of other membrane- 
enclosed organelles not in direct physical contact with their own compartment. 
For example, Salmonella- containing late endosomes are normally found in very 
close apposition to the Golgi apparatus (Figure 24-34). The mechanisms used 
by these organisms to alter their membrane compartments and other aspects of 
membrane traffic in the host cell are still poorly understood. 

Viruses also often alter membrane traffic in the host cell. Enveloped viruses 
must acquire their membrane from host cell phospholipids. In the simplest 
cases, virally encoded proteins are inserted into the ER membrane and follow 
the usual path through the Golgi apparatus to the plasma membrane, undergo- 
ing various post-translational modifications en route. The viral capsid and 
genome then assemble at the plasma membrane and bud off from the cell sur- 
face. This is one mechanism used by HIV Other enveloped viruses interact in 
complex ways with membrane trafficking pathways in the host cell (Figure 
24-35). Even some nonenveloped viruses alter membrane traffic in the host cell 
to suit their own purposes. For example, a membrane-associated, virus-encoded 
RNA polymerase carries out poliovirus replication. The replication proceeds 
more quickly if the surface area of host cell membrane is increased. To accom- 
plish this, the virus induces increased membrane lipid synthesis in the host cell 
and blocks membrane transport from the ER. ER membrane thereby accumu- 
lates, expanding the surface area on which viral RNA replication can occur (Fig- 
ure 24-36). Many viral pathogens and some bacterial pathogens are frequently 
found in association with autophagosomes, which form by autophagy (discussed 



Figure 24-33 Association of host cell 
endoplasmic reticulum (ER) membrane 
with intracellular bacterial pathogens. 

(A) Shortly after infection with Legionella 
pneumophila, the ER (green) of the host 
cell is recruited to surround the 
intracellular bacterium. The DNA of the 
bacterium and of the host cell are labeled 
red. The inset shows the close association 
between the ER and the bacterium. 

(B) Brucella abortus, a Gram-negative 
bacterium that can cause abortions in 
cattle, also replicates within an ER- 
associated compartment. In this electron 
micrograph, the black smudges within the 
ER indicate the presence of the ER-specific 
enzyme glucose-6-phosphatase.The black 
arrows show concentrations of the enzyme 
on the membrane enclosing the 
bacterium, indicating that it has fused 
directly with the ER. (A, from J.C. Kagan 
and C.R. Roy, Nat. Cell Biol. 4:945-954, 
2002. With permission from Macmillan 
Publishers Ltd; B, from J. Celli and 

J.P. Gorvel, Curr. Opin. Microbiol. 7:93-97, 
2004. With permission from Elsevier.) 
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Figure 24-34 Proximity of Golgi stacks to endosomes containing Salmonella enterica. (A) In this 
infected cell, the bacteria are green and the Golgi has been labeled {red) with antibodies against a 
Golgi-specific structural protein. (B) Electron micrograph showing the very close apposition of a 
Golgi stack to a late endosome containing the bacterium. (From S.P. Salcedo and D.W. Holden, 
EMBOJ. 22:5003-5014, 2003. With permission from Macmillan Publishers Ltd.) 
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in Chapter 13). In most cases, it is not clear whether the host cell initiates induc- 
tion of autophagy as a protective response or the invading pathogen triggers it 
to assist in the pathogen's replication. 



Figure 24-35 Complicated strategies for 
viral envelope acquisition. (A) Herpes 
virus nucleocapsids assemble in the 
nucleus and then bud through the inner 
nuclear membrane into the space 
between the inner and outer nuclear 
membranes, acquiring a membrane coat. 
The virus particles then apparently lose 
this coat when they fuse with the outer 
nuclear membrane to escape into the 
cytosol. Subsequently, the nucleocapsids 
bud into the Golgi apparatus and bud 
out again on the other side, acquiring 
two new membrane coats. The virus then 
buds from the cell with a single 
membrane when its outer membrane 
fuses with the plasma membrane. 
(B) Vaccinia virus (which is closely related 
to the virus that causes smallpox and is 
used to vaccinate against smallpox) 
assembles in "replication factories" within 
the cytosol, far away from the plasma 
membrane. The first structure that 
assembles contains two membranes, 
both acquired from the Golgi apparatus 
by a poorly defined wrapping 
mechanism. Some of these viral particles 
are then engulfed by the membranes of a 
second intracellular membrane-enclosed 
compartment; these viral particles have a 
total of four layers of membrane. After 
fusion at the plasma membrane, the virus 
escapes from the cell with three 
membrane layers. 



Viruses and Bacteria Use the Host Cell Cytoskeleton for 
Intracellular Movement 

The cytoplasm of mammalian cells is extremely viscous. It is crowded with 
organelles and supported by networks of cytoskeletal filaments, all of which 
inhibit the diffusion of particles the size of a bacterium or a viral capsid. To reach 
a particular part of the cell to carry out part of its replication cycle, a pathogen 
must actively move there. As with transport of intracellular organelles, generally, 
pathogens use the host cell's cytoskeleton for active movement. 

Several bacteria that replicate in the host cell's cytosol (rather than in mem- 
brane-enclosed compartments) have adopted a remarkable mechanism for 
moving, which depends on actin polymerization. These bacteria, including Lis- 
teria monocytogenes, Shigella flexneri, Rickettsia rickettsii (which causes Rocky 
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Mountain spotted fever), Burkholderia pseudomallei (which causes melioidosis), 
and Mycobacterium marinum (a relative of the bacterium that causes tuberculo- 
sis), induce the nucleation and assembly of host cell actin filaments at one pole of 
the bacterium. The growing filaments generate substantial force and push the 
bacterium through the cytoplasm at rates of up to 1 |Lim/sec. New filaments form 
at the rear of each bacterium and are left behind like a rocket trail as the bacterium 
advances, depolymerizing again within a minute or so as they encounter depoly- 
merizing factors in the cytosol. When a moving bacterium reaches the plasma 
membrane, it continues to move outward, inducing the formation of a long, thin 
host cell protrusion with the bacterium at its tip. A neighboring cell often engulfs 
this projection, allowing the bacterium to enter the neighbor's cytoplasm without 
exposure to the extracellular environment, thereby avoiding recognition by anti- 
bodies produced by the host's adaptive immune system (Figure 24-37). 
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Figure 24-36 Intracellular membrane 
alterations induced by a poliovirus 
protein. Poliovirus, like other single- 
stranded RNA viruses, replicates its RNA 
genome by using a polymerase that 
associates with intracellular membranes. 
Several of the proteins encoded by its 
genome alter the structure or dynamic 
behavior of the membrane-enclosed 
organelles in the host cell. These electron 
micrographs show an untransfected 
monkey cell (left) and the same type of cell 
expressing a transgene encoding the 3A 
protein from poliovirus (right). In the 
transfected cell, the ER is swollen because 
thetransgene-encoded protein inhibits 
traffic from the ER to the Golgi apparatus. 
(From JJ.R. Doedens, T.H. Giddings Jr. and 
K. Kirkegaard, J. Virol. 71:9054-9064, 1997.) 
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Figure 24-37 The actin- 
based movement of 
Listeria monocytogenes 
within and between 
host cells. <GTAT> 

(A) These bacteria 
induce the assembly of 
actin-rich tails in the 
host cell cytoplasm, 
enabling the bacteria to 
move rapidly. Motile 
bacteria spread from cell 
to cell by forming 
membrane-enclosed 
protrusions that are 
engulfed by 
neighboring cells. 

(B) Fluorescence 
micrograph of the 
bacteria moving in a cell 
that has been stained to 
reveal bacteria in red 
and actin filaments in 
green. Note the comet- 
like tail of actin 
filaments behind each 
moving bacterium. 
Regions of overlap of 
red and green 
fluorescence appear 
yellow. (B, courtesy of 
Julie Theriot and Tim 
Mite hi son.) 
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The molecular mechanism of pathogen-induced actin assembly has been 
determined for several microorganisms. The mechanisms are different for the 
different pathogens, suggesting that they evolved independently Although they 
all use the same host cell regulatory pathway that normally controls the nucle- 
ation of actin filaments, they exploit different points in the pathway As dis- 
cussed in Chapter 16, activation of the small GTPase Cdc42 by certain extracel- 
lular signals leads to the activation of a protein called N-WASp, which in turn 
activates an ARP complex, which nucleates the growth of a new actin filament. 
An L. monocytogenes surface protein directly binds to and activates the ARP 
complex to initiate the formation of an actin tail. B. pseudomallei and R. rick- 
ettsii use a similar strategy, although the sequence of the activating proteins dif- 
fers in all three cases. In contrast, another unrelated surface protein on S. 
flexneri binds to and activates N-WASp, which then activates the ARP complex. 
Remarkably, vaccinia virus uses yet another mechanism to move intracellularly 
by inducing actin polymerization, although it exploits the same regulatory path- 
way (Figure 24-38) . 

Other pathogens rely primarily on microtubule-based transport to move 
within the host cell. Viruses that infect neurons illustrate this movement. The 
neurotropic alpha herpes viruses, a group that includes the virus that causes 
chicken pox, provides an important example. The virus enters sensory neurons 
at the tips of their axons and microtubule-based transport carries the nucleo- 
capsids down the axon to the neuronal nucleus, apparently mediated by attach- 
ment of the capsid proteins to the molecular motor protein dynein. After repli- 
cation and assembly in the nucleus, the enveloped virion is transported along 
axonal microtubules away from the neuronal cell body, mediated by attachment 
of a different viral coat component to a kinesin motor protein (Figure 24-39). A 




Figure 24-38 Molecular mechanisms for 
actin nucleation by various pathogens. 

The bacteria Listeria monocytogenes and 
Shigella flexneri and the virus vaccinia all 
move intracellularly using actin 
polymerization. To induce actin 
nucleation, all of these pathogens recruit 
and activate the ARP complex (see Figure 
16-34), although each pathogen uses a 
different recruitment strategy. 
L monocytogenes expresses a surface 
protein, ActA, that directly binds to and 
activates the ARP complex. 5. flexneri 
expresses a surface protein, IcsA 
(unrelated to ActA), that recruits the host 
cell signaling protein N-WASp. N-WASp in 
turn recruits the ARP complex, along with 
other host proteins, including WIP 
(WASp-interacting protein). Vaccinia virus 
expresses an envelope protein that is 
phosphorylated on tyrosine by a host cell 
protein tyrosine kinase. The 
phosphorylated protein then recruits 
Nek, which binds WIP. WIP binds N-WASp, 
which recruits and activates the ARP 
complex. The more complicated 
mechanism that vaccinia uses is thought 
to closely resemble how chemotactic 
factors activate the ARP complex in 
motile eucaryotic cells. Despite the 
different molecular strategies used to 
assemble them, the actin comet tails 
formed by all three pathogens look very 
similar, and the pathogens move at 
similar speeds inside infected cells. 
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Figure 24-39 Fluorescence micrographs 
of herpes virus moving in an axon. This 
nerve cell has been infected with an 
alpha herpes virus that has been 
genetically engineered to express green 
fluorescent protein (GFP) fused to one of 
its capsid proteins. In this segment of the 
axon, several viral particles are visible, 
and two of them (numbered 1 and 3) are 
moving away from the cell body, which is 
to the right and out of the picture. (From 
G.A. Smith, S.P. Gross and LW. Enquist, 
Proc Natl Acad. Sci. U.S.A. 98:3466-3470, 
2001. With permission National Academy 
of Sciences.) 
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Figure 24-40 The association of Wolbachia with microtubules. This 
fluorescence micrograph shows Wolbachia (red) associated with the 
microtubules (green) of four mitotic spindles in a syncytial Drosophila 
embryo. The clumps of bacteria at the spindle poles will segregate along 
with the chromosomes (not visible at the equator of the spindles) when 
the chromosomes segregate. (From H. Kose and T.L. Karr, Mech. Cell Dev. 
51:275-288, 1995. With permission from Elsevier.) 

large number of virions, including those of HIV rabies virus, influenza virus, 
adenovirus, canine parvovirus, and vaccinia virus (the smallpox relative used for 
vaccination), have now been shown to associate with dynein or kinesin motor 
proteins and to undergo directed movement along microtubules at some stage 
in their replication. A primary function of the microtubule highways laid down 
in eucaryotic cells is to serve as oriented tracks for membrane traffic; it is not 
surprising that many viruses have independently evolved the ability to engage 
these transport systems to enhance their own replication. 

One bacterium that is known to associate with microtubules is Wolbachia. 
This fascinating genus includes many species that are parasites or symbionts of 
insects and other invertebrates, living in the cytosol of each cell in the animal. 
The infection is spread vertically from mother to offspring, as Wolbachia are also 
present in eggs. The bacteria ensure their transmission into every cell by bind- 
ing to microtubules, so that the mitotic spindle segregates them simultaneously 
with chromosome segregation when an infected cell divides (Figure 24-40) . As 
we discuss later, Wolbachia infection can significantly alter the reproductive 
behavior of its insect hosts. 
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Viral Infections Take Over the Metabolism of the Host Cell 

Most intracellular bacteria and parasites carry the basic genetic information 
required for their own metabolism and replication, and they rely on their host 
cells only for nutrients. Viruses, in contrast, use the basic host cell machinery for 
most aspects of their reproduction: they all depend on host cell ribosomes to 
produce their proteins, and some also use host cell DNA and RNA polymerases 
for replication and transcription, respectively. 

Many viruses encode proteins that modify the host transcription or transla- 
tion apparatus to favor the synthesis of viral proteins over those of the host cell. 
As a result, the synthetic capability of the host cell is devoted principally to the 
production of new virus particles. Poliovirus, for example, encodes a protease 
that specifically cleaves the TATA-binding factor component of TFIID (see Figure 
6-18), effectively shutting off all host cell transcription via RNA polymerase II. 
Influenza virus produces a protein that blocks both the splicing and the 
polyadenylation of RNA transcripts, which therefore fail to be exported from the 
nucleus (see Figure 6-40). 

Translation initiation of most host cell mRNAs depends on recognition of 
their 5' cap by a group of translation initiation factors (see Figure 6-72). Transla- 
tion initiation of host mRNAs is often inhibited during viral infection, so that the 
host cell ribosomes can be used more efficiently for the synthesis of viral pro- 
teins. Some viral genomes such as that of influenza virus encode endonucleases 
that cleave the 5' cap from host cell mRNAs. Some go even further and then use 
the liberated 5' caps as primers to synthesize viral mRNAs, a process called cap 
snatching. Several other viral RNA genomes encode proteases that cleave certain 
translation initiation factors. These viruses rely on 5' cap-independent transla- 
tion of the viral RNA, using internal ribosome entry sites (IRESs) (see Figure 
7-108). 

A few DNA viruses use host cell DNA polymerase to replicate their genome. 
Unfortunately for the virions, DNA polymerase is expressed at high levels only 
during S phase of the cell cycle, and most cells that these viral particles infect 
spend most of their time in Gi phase. Adenovirus has evolved a mechanism to 
drive the host cell to enter S phase, producing large amounts of active DNA 
polymerase that then replicates the viral genome. The adenovirus genome 
encodes proteins that inactivate both Rb and p53, two key suppressors of cell- 



1518 Chapter 24: Pathogens, Infection, and Innate Immunity 



cycle progression (discussed in Chapter 17). As might be expected for any mech- 
anism that induces unregulated DNA replication, these viruses can promote the 
development of cancer in some circumstances. 



Pathogens Can Alter the Behavior of the Host Organism to 
Facilitate the Spread of the Pathogen 

As we have seen, pathogens often alter the behavior of the host cell in ways that 
benefit the survival and replication of the pathogen. Similarly, pathogens often 
alter the behavior of the whole host organism to facilitate pathogen spread, as 
we saw earlier for Trypanosoma brucei and Yersinia pestis. In some cases, it is dif- 
ficult to tell whether a particular host response is more for the benefit of the host 
or for the pathogen. Pathogens such as Salmonella enterica that cause diarrhea, 
for example, usually produce self-limiting infections because the diarrhea effi- 
ciently washes out the pathogen. The bacteria-laden diarrhea, however, can 
spread the infection to a new host. Similarly, coughing and sneezing help to 
clear pathogens from the respiratory tract, but they also spread the infection to 
new individuals. A person with a common cold may produce 20,000 droplets in 
a single sneeze, all carrying rhinovirus or corona virus. 

A frightening example of a pathogen modifying host behavior is seen in 
rabies, as first described in Egyptian writings over 3000 years ago. Rabies virus 
replicates in neurons and causes infected people or animals to become "rabid": 
they are unusually aggressive and develop a strong desire to bite. The virus is 
shed in the saliva and transmitted through the bite wound into the bloodstream 
of the victim, spreading the infection to a new host. 

But Wolbachia exhibit the most dramatic example of pathogens modifying 
host behavior. These bacteria manipulate the sexual behavior of their host to 
maximize their dissemination. As described earlier, Wolbachia are passed verti- 
cally into offspring through eggs. If they live in a male, however, they hit a dead 
end, as they are excluded from sperm. In some species of Drosophila, Wolbachia 
modify the sperm of their host so that they can fertilize the eggs only of infected 
females. This modification creates a reproductive advantage for infected 
females over uninfected females, so that the overall proportion of Wolbachia 
carriers increases. In other host species, a Wolbachia infection kills males but 
spares females, increasing the number of females in the population and thus the 
number of individuals that can produce eggs to pass on the infection. In a few 
types of wasp, Wolbachia infections enable the females to produce eggs that 
develop parthenogenetically, without the need for fertilization by sperm; in this 
species, males have been completely eliminated. For some of its hosts, Wol- 
bachia has become an indispensable symbiont, and curing the infection causes 
death of the host. In one case, humans are making use of this dependence: the 
filarial nematode that causes African river blindness is difficult to kill with 
antiparasite medications, but when people with river blindness are treated with 
antibiotics that cure the nematode's Wolbachia infection, the nematode infec- 
tion is also arrested. 



Pathogens Evolve Rapidly 

The complexity and specificity of the molecular interactions between pathogens 
and their host cells might suggest that virulence would be difficult to acquire by 
random mutation. Yet, new pathogens are constantly emerging, and old 
pathogens are constantly changing in ways that make familiar infections difficult 
to treat. Pathogens have two great advantages that enable them to evolve rapidly. 
First, they replicate very quickly, providing a great deal of material for the engine 
of natural selection. Whereas humans and chimpanzees have acquired a 2% dif- 
ference in genome sequences over about 8 million years of divergent evolution, 
poliovirus manages a 2% change in its genome in 5 days, about the time it takes 
the virus to pass from the human mouth to the gut. Second, selective pressures 
encourage this rapid genetic variation. The host's adaptive immune system and 
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modern antipathogen drugs, both of which destroy pathogens that fail to 
change, are the source of these selective pressures. 

In many cases, changes in human behavior exacerbate the emergence and 
evolution of new infectious diseases. Crowded and filthy living conditions in 
medieval cities, for example, contributed to the spread of the bacterium Yersinia 
pestis to humans from its natural rodent host to cause plague. The tendency of 
modern humans to live at high population densities in large cities has also cre- 
ated the opportunity for infectious organisms to initiate epidemics, such as 
influenza, tuberculosis, and AIDS, which could not have spread so rapidly or so 
far among sparser human populations. Air travel can, in principle, allow an 
asymptomatic, newly infected host to carry an epidemic to any previously unex- 
posed population within a few hours or days. 



Antigenic Variation in Pathogens Occurs by Multiple Mechanisms 

A small-scale example of the constant battle between infection and immunity is 
the phenomenon of antigenic variation. An important adaptive immune 
response against many pathogens is the host's production of antibodies that rec- 
ognize specific molecules (antigens) on the pathogen surface (discussed in 
Chapter 25). Many pathogens change these antigens during the course of an 
infection, enabling them to evade elimination by antibodies. Some eucaryotic 
parasites, for example, undergo programmed rearrangements of the genes 
encoding their surface antigens. The most striking example occurs in African 
trypanosomes such as Trypanosoma brucei, a protozoan parasite that causes 
sleeping sickness and is spread by an insect vector. {T. brucei is a close relative of 
T. cruzi — see Figure 24-28 — but it replicates extracellularly rather than inside 
cells.) T. brucei is covered with a single type of glycoprotein, called variant-spe- 
cific glycoprotein {VSG), which elicits a protective antibody response in the host 
that rapidly clears most of the parasites. The trypanosome genome, however, 
contains about 1000 Vsg genes, each encoding a VSG with distinct antigenic 
properties. Only one of these genes is expressed at any one time, by being copied 
into an active expression site in the genome. Gene rearrangements that copy 
new alleles into the expression site repeatedly change the Vsggene expressed. In 
this way, a few trypanosomes with an altered VSG escape the antibody-mediated 
clearance, replicate, and cause the disease to recur, leading to a chronic cyclic 
infection (Figure 24-41). Many other eucaryotic parasites, including the proto- 
zoan Plasmodium falciparum, which causes malaria, and the fungus Pneumo- 
cystis carinii, which causes pneumonia in people with AIDS, use very similar 
strategies to evade the host adaptive immune responses. 



Figure 24-41 Antigenic variation in 
trypanosomes. (A) There are about 1 000 
distinct Vsg genes in Trypanosoma brucei, 
but there is only one site for Vsg gene 
expression. An inactive gene is copied 
into the expression site by gene 
conversion, where it is now expressed. 
Each Vsg gene encodes a different surface 
protein (antigen). Rare switching events 
allow the trypanosome to repeatedly 
change the surface antigen it expresses. 
(B) A person infected with trypanosomes 
expressing VSG a mounts a protective 
antibody response, which clears most of 
the parasites expressing this antigen. 
However, a few of the trypanosomes may 
have switched to expression of VSG b , 
which can now proliferate until anti-VSG b 
antibodies clear them. By that time, 
however, some parasites will have 
switched to VSG C , and so the cycle 
repeats, seemingly indefinitely. 
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Bacterial pathogens can also rapidly change their surface antigens. Species 
of the genus Neisseria are champions. These Gram-positive cocci can cause 
meningitis and sexually transmitted diseases. They employ an astonishing vari- 
ety of mechanisms to promote antigenic variation. First, they undergo genetic 
recombination very similar to that just described for eucaryotic pathogens, 
which enables them to vary (over time) the pilin protein they use to make cell- 
surface pili: by recombination of multiple silent copies of variant pilin genes 
into a single expression locus they can express dozens of slightly different ver- 
sions of the protein. Second, many cell-surface proteins, as well as many of the 
biosynthetic enzymes involved in synthesizing cell-surface carbohydrates, have 
their expression levels continually altered by random slippage and repair of tan- 
dem nucleotide repeats in the promoter region or coding sequence of their 
genes, which modulates transcription or translation. Neisseria, for example, 
have about 10 different genes encoding variants of the Opa family of outer mem- 
brane proteins, each of which undergoes random variation of protein expres- 
sion levels in this way, resulting in a plethora of different surface protein com- 
positions to bewilder the host adaptive immune system. Analysis of the genome 
sequence for several Neisseria species has led to the suggestion that over 100 
genes may vary their expression levels using some variation of this mechanism. 
Third, Neisseria are extremely adept at taking up DNA from their environment 
and incorporating it into their genomes, further contributing to their extraordi- 
nary variability. Finally, Neisseria lack several of the DNA repair mechanisms 
present in other bacteria such as E. coli, so that the likelihood of their acquiring 
new mutations through replication error is higher than average. With all these 
mechanisms working together, it is not surprising that we have not yet devel- 
oped an effective vaccine against Neisseria infections. 

Although Neisseria is an extreme example, many other bacterial pathogens 
employ one or more of these techniques to enhance their antigenic variation. 
Moreover, several studies have shown that pathogenic bacteria isolated from 
patients with disease symptoms are much more likely to have defects in DNA 
repair pathways than isolates of the same bacterial species from environmental 
reservoirs. This intriguing finding suggests that the human immune system may 
act to accelerate bacterial evolution. 

Horizontal gene transfer, rather than point mutations, often causes rapid 
evolution in bacteria. The acquisition of plasmids and bacteriophages mediates 
most of this horizontal gene transfer. Bacteria readily pick up pathogenicity 
islands and virulence plasmids (see Figure 24-5) from other bacteria. Once a 
bacterium acquires a new set of virulence-related genes, it may quickly establish 
itself as a new cause of human epidemics. Yersinia pestis, for example, is a bac- 
terium endemic to rats and other rodents; it first appeared in human history in 
542 A.D., when the city of Constantinople was devastated by plague. Sequence 
comparisons of Y pestis with those of its close relative Y pseudotuberculosis, 
which causes a severe diarrheal disease, suggest that Y pestis may have emerged 
as a distinct strain only a few thousand years ago, not long before its devastating 
debut as plague. 



Error-Prone Replication Dominates Viral Evolution 

Error-prone replication mechanisms rather than genomic rearrangements are 
mainly responsible for antigenic variation in viruses. Retroviral genomes, for 
example, acquire on average one point mutation every replication cycle, 
because the viral reverse transcriptase that produces DNA from the viral RNA 
genome cannot correct nucleotide misincorporation errors. A typical, untreated 
HIV infection may eventually produce HIV genomes with every possible point 
mutation. In some ways, the high mutation rate is beneficial for the pathogen. 
By a microevolutionary process of mutation and selection within each host, 
most virions change over time from a form that is most efficient at infecting 
macrophages to one more efficient at infecting T-cells, as described earlier (see 
Figure 24-23). Similarly, once a patient is treated with an antiviral drug, the viral 
genome can quickly mutate and be selected for its resistance to the drug. If the 
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reverse transcriptase error rate were too high, however, deleterious mutations 
might accumulate too rapidly for the virus to survive. Furthermore, a variant 
that is successful in one host does not necessarily spread to others, as a mutated 
virion may not be able to infect a new host. For HIV-1, we can estimate the 
extent of this constraint by examining the sequence diversity among different 
infected individuals. Remarkably, only about one-third of the nucleotide posi- 
tions in the coding sequence of the viral genome are invariant, and nucleotide 
sequences in some parts of the genome, such as the env gene, can differ by as 
much as 30%. This extraordinary genomic plasticity greatly complicates 
attempts to develop vaccines against HIV, and it can also lead to rapid drug resis- 
tance (discussed below). It has also led to the rapid emergence of new HIV 
strains. Sequence comparisons between various strains of HIV and the very sim- 
ilar simian immunodeficiency virus (SIV) from a variety of different monkey 
species suggest that the most virulent type of HIV, HIV-1, may have jumped from 
chimpanzees to humans as recently as 1930 (Figure 24-42). 

Influenza viruses are an important exception to the rule that error-prone 
replication dominates viral evolution. They are unusual in that their genome 
consists of several (usually eight) strands of RNA. When two strains of influenza 
infect the same host, the strands of the two strains can recombine to form a 
novel type of influenza virus. Prior to 1900, the influenza strain that infected 
humans caused a very mild disease; a different influenza strain infected fowl 
such as ducks and chickens, but it only rarely infected humans. In 1918, a par- 
ticularly virulent variant of avian (bird) influenza crossed the species barrier to 
infect humans, triggering the catastrophic epidemic of 1918 called the Spanish 
flu, which killed 20-50 million people worldwide, more than were killed in World 
War I. Subsequent influenza pandemics have been triggered by recombination, 
in which a new DNA segment from an avian form of the virus replaced one or 
more of the viral DNA segments governing human immune response to the virus 
(Figure 24-43). Such recombination events allow the new virus to replicate 
rapidly and spread through an immunologically naive human population. Gen- 
erally, within two or three years, the human population develops immunity to 
the new recombinant strain of virus, and, as a result, the infection rate drops to 
a steady- state level. In normal years, influenza is a mild disease in healthy adults 
but can be life-threatening in the very young and very old. In pandemic years, 
however, especially in the 1918 pandemic, healthy adults seem unusually sus- 
ceptible to lethal influenza infection, perhaps because of the tissue damage 
caused by an overzealous immune response. Because the recombination events 
are unpredictable, it is not possible to know when the next influenza pandemic 
will occur or how severe it might be. 



Drug-Resistant Pathogens Are a Growing Problem 

While human activities such as air travel have promoted the spread of certain 
infectious diseases, advances in public sanitation and in medicine have pre- 
vented or ameliorated the suffering caused by many others. Effective vaccines 
and worldwide vaccination programs have eliminated smallpox and severely 
reduced poliomyelitis, and many deadly childhood infections such as mumps 
and measles are now rarities in wealthy industrialized nations. Nonetheless, 
there are still many widespread and devastating infectious diseases, such as 
malaria, for which no effective vaccines are available. 

The development of drugs that cure rather than prevent infections has also 
had a major impact on human health. Antibiotics, which kill bacteria, comprise 
the most successful class. Penicillin was one of the first antibiotics used to treat 
infections in humans, introduced into clinical use just in time to prevent tens of 
thousands of deaths from infected battlefield wounds in World War II. Because 
bacteria form a kingdom distinct from the eucaryotes they infect, much of their 
basic machinery for DNA replication, transcription, translation, and fundamen- 
tal metabolism differs from that of their host. These differences enable us to find 
antibacterial drugs that specifically inhibit these processes in bacteria, without 
disrupting them in the host. Most of the antibiotics that we use to treat bacterial 
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Figure 24-42 Diversification of HIV-1, 
HIV-2, and related strains of SIV. The 

genetic distance between any two viral 
isolates is found by following the shortest 
path joining them in the tree. HIV-1 is 
divided into two groups, major (M) and 
outlier (O).The HIV-1 M group is 
responsible for the global AIDS epidemic. 
HIV-1 M is further subdivided into various 
subtypes (A through G — not named in 
the figure). Subtype B is dominant in 
America and Europe; B, C, and E are 
dominant in Asia; and all subtypes are 
found in Africa. At least two monkey 
viruses, chimpanzee and mandrill, are 
more closely related to HIV-1 than is 
HIV-2, suggesting that HIV-1 and HIV-2 
arose independently of each other. HIV-1 
and SIV chimpanzee are estimated to 
have diverged in about 1 930. This tree 
was constructed from the nucleotide 
sequences of the Gag gene, using a 
database containing about 16,000 
sequences from virus samples isolated 
around the world. 
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infections are small molecules that inhibit macromolecular synthesis in bacte- 
ria by targeting bacterial enzymes that either are distinct from their eucaryotic 
counterparts or are involved in pathways, such as cell wall biosynthesis, that are 
absent in humans (Figure 24-44 and Table 6-3). 

The rapid evolution of pathogens, however, enables bacteria to develop 
resistance to antibiotics very quickly The typical lag between the introduction of 
an antibiotic into clinical use and the appearance of the first resistant strains is 
only a few years. Similar drug resistance also arises rapidly when treating viral 
infections with antiviral drugs. The virus population in an HIV-infected person 
treated with the reverse transcriptase inhibitor AZT, for example, will acquire 
complete resistance to the drug within a few months. The current protocol for 
treatment of HIV infections involves the simultaneous use of three drugs, which 
helps to minimize the acquisition of resistance. 

There are three general strategies by which a pathogen can develop drug 
resistance: (1) it can alter the molecular target of the drug so that it is no longer 
sensitive to the drug; (2) it can produce an enzyme that destroys the drug; or (3) 
it can prevent access to the target by, for example, actively pumping the drug out 
of the pathogen (Figure 24-45). 

Once a pathogen has chanced upon an effective drug- resistance strategy, 
the newly acquired or mutated genes that confer the resistance are frequently 
spread throughout the pathogen population and may even transfer to 
pathogens of different species that are treated with the same drug. The highly 
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Figure 24-44 Antibiotic targets. Although there are many antibiotics in clinical use, they 
have a narrow range of targets, which are highlighted in yellow. A few representative 
antibiotics in each class are listed. Nearly all antibiotics used to treat human infections fall 
into one of these categories. The vast majority inhibit either bacterial protein synthesis or 
bacterial cell wall synthesis. 



Figure 24-43 Model for the evolution of 
pandemic strains of influenza virus by 
recombination. Influenza A virus is a 
natural pathogen of birds, particularly 
waterfowl, and is always present in wild 
bird populations. Before 1900, influenza 
virus caused sporadic mild disease in 
humans. In 1 91 8, a particularly virulent 
form of the virus crossed the species 
barrier from birds to humans and caused 
a devastating worldwide epidemic. This 
strain was designated H1N1, referring to 
the specific forms of its main antigens, 
hemagglutinin (HA, or H) and 
neuraminidase (NA, or N). Changes in the 
virus, rendering it less virulent, and the 
rise of adaptive immunity in the human 
population, prevented the pandemic 
from continuing in subsequent seasons, 
although H1N1 influenza strains 
continued to cause serious disease every 
year in very young and very old people. 
In 1957, a new pandemic arose when the 
genes encoding HA and NA were 
replaced by equivalent genes from an 
avian virus {green bars); the new strain 
(designated H2N2) was not effectively 
cleared by antibodies in people who had 
previously contracted only H1 N1 forms of 
influenza. In 1 968, another pandemic was 
triggered when the HA gene of H2N2 
(but not the NA gene) was replaced from 
another avian virus; the new virus was 
designated H3N2. In 1977, there was a 
resurgence of HI N1 influenza, which had 
previously been almost completely 
replaced by the N2 strains. Molecular 
sequence information suggests that this 
minor pandemic may have been caused 
by an accidental release of an influenza 
strain that had been held in a laboratory 
since about 1950. As indicated, most 
human influenza today is caused by 
H1N1 and H3N2 strains. 

Poultry farming brings humans into 
very close contact with large numbers of 
birds, which may carry different versions 
of the influenza virus. These avian viruses 
occasionally cause fatal disease in 
humans who have been in direct contact 
with infected birds (so-called bird flu), 
but thus far these viruses have not 
acquired the ability to spread effectively 
from human to human. Monitoring these 
cross-species jumps and attempting to 
predict the origin of the next serious 
influenza epidemic remain significant 
public health challenges. 
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effective and very expensive antibiotic vancomycin, for example, has been used 
as a treatment of last resort for many severe hospital- acquired bacterial infec- 
tions that are already resistant to most other known antibiotics. Vancomycin 
prevents one step in bacterial cell wall biosynthesis, by binding to part of the 
growing peptidoglycan chain and preventing it from being cross-linked to other 
chains (see Figure 24-4). Resistance can arise if the bacterium synthesizes a dif- 
ferent type of cell wall, using different subunits that do not bind vancomycin. 
The most effective form of vancomycin resistance depends on a transposon 
containing seven genes, the products of which work together to sense the pres- 
ence of vancomycin, shut down the normal pathway for bacterial cell wall syn- 
thesis, and generate a different type of cell wall. Although the joining of these 
genes into a single transposon must have been a difficult evolutionary step (it 
took 15 years for vancomycin resistance to develop, rather than the typical year 
or two), the transposon can now be readily transmitted to many other 
pathogenic bacterial species. 

Where do drug-resistance genes come from? Sometimes, when the bacteria 
are under selective pressure due to drug exposure, resistance genes arise by 
spontaneous mutation and expand within a population. In many cases, how- 
ever, they appear in a pathogen's genome as new DNA segments acquired by 
horizontal transfer, frequently carried on transposons or replicative plasmids. 
Unlike eucaryotic cells, bacteria commonly exchange genetic material across 
species boundaries. 

Drug-resistance genes acquired by horizontal transfer frequently seem to 
come from environmental reservoirs, where they play an important part in the 
competition between microorganisms. Nearly all antibiotics used to treat bacte- 
rial infections today are not synthetic creations of chemists; instead, most are 
natural products produced by fungi or bacteria: penicillin, for example, is made 
by the mold Penicillium, and more than 50% of the antibiotics currently used in 
the clinic are made by the Gram-positive genus Streptomyces. It is believed that 
these microorganisms produce antimicrobial compounds as weapons in their 
competition with other microorganisms in the environment. Many of these com- 
pounds have probably existed on Earth for at least hundreds of millions of years, 
which is ample time for other microorganisms, as well as those that produce the 
antibiotics themselves, to have evolved resistance mechanisms. Unbiased sur- 
veys of bacteria taken from soil samples that have never been deliberately 
exposed to antibiotic drugs reveal that the bacteria are typically already resistant 
to about seven or eight of the antibiotics widely used in clinical practice. When 
pathogenic microorganisms are faced with the selective pressure provided by 
antibiotics treatments, they can apparently draw upon this world-wide and 
essentially inexhaustible source of genetic material to acquire resistance. 

Like most other aspects of infectious disease, human behavior has exacer- 
bated the problem of drug resistance. Many patients choose to take antibiotics 



Figure 24-45 Three general mechanisms 
of antibiotic resistance. (A) Non-resistant 
bacterial cells bathed in a drug {red 
triangles) that binds to and inhibits a 
critical enzyme {light green) will be killed 
due to enzyme inhibition. (B) Bacteria 
that have altered the target enzyme so 
that it no longer binds to the drug will 
survive and proliferate. In many cases, a 
single point mutation in the gene 
encoding the target protein can generate 
resistance. (C) Bacteria that express a 
resistance gene (R-factor) that encodes 
an enzyme {dark green) that either 
degrades or covalently modifies the drug 
will survive and proliferate. Some 
resistant bacteria, for example, make 
Beta-lactamase enzymes, which cleave 
penicillin and similar compounds. (D) Yet 
other resistant bacteria express or 
up-regulate an efflux pump that ejects 
the drug from the bacterial cytoplasm 
using energy derived from either ATP 
hydrolysis or the electrochemical 
gradient across the bacterial plasma 
membrane. Some efflux pumps, such as 
theTetR efflux pump that confers 
resistance to tetracycline, are specific for 
a single drug; others, called multi-drug 
resistance (MDR) efflux pumps, are 
capable of exporting a wide variety of 
structurally dissimilar drugs and so can 
render a bacterium resistant to a very 
large number of different antibiotics in a 
single step. 
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for viral conditions such as influenza, colds, sore throats, and earaches that the 
drugs do not help. Persistent and chronic misuse of antibiotics in this way can 
eventually result in antibiotic-resistant normal flora, which can then transfer the 
resistance to pathogens. Several antibiotic-resistant outbreaks of infectious 
diarrhea caused by Shigella flexneri, for example, originated in this way. The 
problem is particularly severe in countries where antibiotics are available with- 
out a physician's prescription, as in Brazil, where more than 80% of the strains of 
S. flexneri found in infected patients are resistant to four or more antibiotics. 
Antibiotics are also misused in agriculture, where they are commonly employed 
as food additives to promote the health of farm animals. An antibiotic closely 
related to vancomycin was commonly added to cattle feed in Europe; the result- 
ing resistance in the normal flora of these animals is widely believed to be one of 
the original sources for vancomycin-resistant bacteria that now threaten the 
lives of hospitalized patients. 

Because the acquisition of drug resistance is almost inevitable, it is crucial 
that we continue to develop innovative novel treatments for infectious diseases. 
We must also try to do better at delaying the onset of drug resistance. 



Summary 

All pathogens share the ability to interact with host cells in various ways that promote 
replication and spread of the pathogen, hut these host-pathogen interactions are 
diverse. Pathogens often colonize the host by adhering to or invading the epithelial 
surfaces that line the respiratory gastrointestinal, and urinary tracts, as well as the 
other surfaces in direct contact with the environment. Intracellular pathogens, includ- 
ing all viruses and many bacteria and protozoa, invade host cells and replicate inside 
them. They invade by one of several mechanisms. Viruses rely largely on receptor-medi- 
ated endocytosis, whereas bacteria exploit cell adhesion and phagocytic pathways; in 
both cases, the host cell provides the machinery and energy. Protozoa, by contrast, 
employ unique invasion strategies that usually require significant metabolic expense 
on the part of the invader. Once inside, intracellular pathogens seek out a cell com- 
partment that is favorable for their replication, frequently altering host membrane 
traffic and exploiting the host cell cytoskeleton for intracellular movement. Besides 
altering the behavior of individual host cells, pathogens frequently alter the behavior 
of the host organism in ways that favor spread to a new host. Pathogens evolve rapidly, 
so that new infectious diseases frequently emerge, and old pathogens acquire new ways 
to evade our attempts at treatment, prevention, and eradication. 



BARRIERS TO INFECTION AND THE INNATE 
IMMUNE SYSTEM 

Humans are exposed to millions of potential pathogens daily, through contact, 
ingestion, and inhalation. Our ability to avoid infection depends in large part on 
our adaptive immune system (discussed in Chapter 25), which remembers pre- 
vious encounters with specific pathogens and specifically destroys or eliminates 
them when they attack again. Adaptive immune responses, however, are slow to 
develop on first exposure to a new pathogen, as specific clones of B and T cells 
that can respond to it have to become activated and proliferate; it can therefore 
take a week or so before the responses are effective. By contrast, a single bac- 
terium with a doubling time of 1 hour can produce almost 20 million progeny, a 
full-blown infection, in a single day. Therefore, during the first critical hours and 
days of exposure to a new pathogen, we rely on our innate immune system to 
protect us from infection. As we discuss in Chapter 25, we also rely on the innate 
immune system to help activate adaptive immune responses. 

Innate immune responses are not specific to a particular pathogen in the 
way that the adaptive immune responses are. Generally, there are three lines of 
innate immune defenses that can prevent an infection or stop it in its tracks 
before the adaptive immune system needs be called into play. The first of these 
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are the physical and chemical barriers that prevent easy access of microorgan- 
isms to the interior of the human body These include the thick layer of dead ker- 
atinized cells that forms the surface of our skin, the tight junctions between 
epithelial cells, the acidic pH of the stomach, and components of the mucus lay- 
ers that inhibit colonization or even kill pathogenic bacteria. The normal flora 
also have a role in protecting body surfaces against invaders by competing for 
the same ecological niche and thereby limiting colonization. 

The second line of innate defenses comprise cell-intrinsic responses, by 
which an individual cell recognizes that it has been infected and takes measures 
to kill or cripple the invader. Most cells that have taken up a bacterium by 
pathogen-induced phagocytosis (see Figure 24-26), for example, will immedi- 
ately direct the phagosome to fuse with a lysosome, exposing the invading 
microorganism to a barrage of digestive enzymes. Another ancient intrinsic 
defense mechanism is the ability of host cells to degrade double-stranded RNA, 
which is a common intermediate in viral replication; the infected cells will even 
degrade any single -stranded RNA that shares sequence identity with the double- 
stranded trigger. This mechanism not only serves as an effective intrinsic 
defense against many viral infections, it also enables cell biologists to manipu- 
late gene expression by using the technique of RNA interference (RNAi). 

The third line of innate immune defenses depends on a specialized set of 
proteins and phagocytic cells that recognize conserved features of pathogens 
and become quickly activated to help destroy invaders. These include profes- 
sional phagocytic cells such as neutrophils and macrophages, natural killer cells, 
and the complement system. Whereas the adaptive immune system arose in 
evolution less than 500 million years ago and is confined to vertebrates, innate 
immune responses operate in both vertebrates and invertebrates, as well as in 
plants, and the basic mechanisms that regulate them are similar in these organ- 
isms. As discussed in Chapter 25, the innate immune responses in vertebrates 
are also required to activate adaptive immune responses by producing extracel- 
lular signal molecules that help call the adaptive immune system into action. 

Epithelial Surfaces and Defensins Help Prevent Infection 

In vertebrates, the skin and other epithelial surfaces, including those lining the 
respiratory, intestinal, and urinary tracts (Figure 24-46), provide a physical bar- 
rier between the inside of the body and the outside world. A mucus layer pro- 
vides additional protection against microbial, mechanical, and chemical insults 
of the interior epithelial surfaces; many amphibians and fish also have a mucus 
layer covering their skin. The slimy mucus coating is made primarily of secreted 
mucin and other glycoproteins, and it physically helps prevent pathogens from 
adhering to the epithelium. It also facilitates the clearance of pathogens by beat- 
ing cilia on the epithelial cells (discussed in Chapter 23). 

The mucus layer also contains substances that either kill pathogens or 
inhibit their proliferation. Among the most abundant of these are antimicrobial 
peptides, called defensins, which are found in all animals and plants. They are 
generally short (12-50 amino acids) and positively charged, and have hydropho- 
bic or amphipathic domains. They constitute a diverse family with a broad spec- 
trum of antimicrobial activities, including the ability to kill or inactivate Gram- 
negative and Gram-positive bacteria, fungi (including yeasts), parasites (includ- 
ing protozoa and nematodes), and even enveloped viruses such as HIV. 
Defensins are also the most abundant proteins in neutrophils (see below), which 
use them to kill phagocytosed pathogens. 

It is still uncertain how defensins kill pathogens. One possibility is that they 
use their hydrophobic or amphipathic domains to insert into the surface mem- 
brane of their victims, thereby disrupting the integrity of the membrane. Some 
of their selectivity for pathogens over host cells may come from their preference 
for membranes that do not contain cholesterol. After disrupting the membrane 
of the pathogen, the positively charged peptides may also interact with various 
negatively charged targets within the microbe, including DNA. Because of the 
relatively nonspecific nature of the interaction between antimicrobial peptides 
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Figure 24-46 Epithelial defenses against 
microbial invasion. (A) Light micrograph 
of a cross section through the wall of the 
human small intestine, showing three villi. 
Goblet cells secreting mucus are stained 
magenta. The protective mucus layer 
covers the exposed surfaces of the villi. At 
the base of the villi lie the crypts, where 
the epithelial cells proliferate. (B) Close-up 
view of a crypt, stained to show the 
granules in the Paneth cells (scarlet). These 
cells secrete large quantities of defensins 
and other antimicrobial peptides into the 
intestinal lumen. (B, courtesy of 
H.G. Burkitt, from P.R. Wheater, Functional 
Histology, 2nd ed. London: 
Churchill-Livingstone, 1987.) 
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and the microbes they kill, it is difficult for pathogens to acquire resistance to 
them. Thus, in principle, defensins and other antimicrobial peptides might be 
useful therapeutic agents to combat infection, either alone or in combination 
with more traditional drugs. 



Human Cells Recognize Conserved Features of Pathogens 

Microorganisms do occasionally breach the epithelial barricades. It is then up to 
the innate and adaptive immune systems to recognize and destroy them, with- 
out harming the host. Consequently, the immune systems must be able to dis- 
tinguish self from nonself. We discuss in Chapter 25 how the adaptive immune 
system does this. The innate immune system relies on the recognition of partic- 
ular types of molecules that are common to many pathogens but are absent in 
the host. These pathogen-associated molecules (called pathogen-associated or 
microbe-associated immunostimulants) trigger two types of innate immune 
responses — inflammatory responses (discussed below) and phagocytosis by 
professional phagocytes (neutrophils and macrophages), and by dendritic cells, 
which activate T cells of the adaptive immune system (discussed in Chapter 25). 
Both the inflammatory and phagocytic responses can occur quickly, even if the 
host has never been previously exposed to a particular pathogen. 

The microbe-associated immunostimulants are of various types. Most are 
not exclusive to pathogens, but are found in many bacteria, benign as well as 
harmful. Bacterial translation initiation differs from eucaryotic translation initi- 
ation in that formylated methionine, rather than regular methionine, is generally 
used as the first amino acid. Therefore, any peptide containing formylmethion- 
ine at the N- terminus must be of bacterial origin. Formylmethionine- containing 
peptides act as very potent chemoattractants for neutrophils, which migrate 
quickly to the source of such peptides and engulf the bacteria producing them 
(see Figure 16-101). 

In addition, molecules that do not occur in multicellular hosts compose the 
outer surface of many microorganisms, and these molecules also act as 
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Figure 24-47 Structure of lipopolysaccharide (LPS). On the left is the three-dimensional structure of a molecule of LPS, 
with the fatty acid tails shown in yellow and the sugars in blue. The molecular structure of the bottom part of LPS is shown 
on the right. The membrane anchor is made up of two linked glucosamine sugars attached to three phosphates and six 
fatty acid tails. Attached to this basic structure is a long chain of sugars, which is usually highly branched. This drawing 
shows the simplest type of LPS that will allow E. coli to live; it has just two sugar molecules in the chain, and both are the 
same (3-deoxy-D-manno-octulosonic acid). At the position marked by the arrow, wild-type Gram-negative bacteria also 
attach a core oligosaccharide made up of 8-12 linked sugars and a long O antigen polysaccharide made up of an 
oligosaccharide unit that is repeated up to 40 times. The sugars making up the core oligosaccharide and O antigen 
polysaccharide vary from one bacterial species to another, and even among different strains of the same species. All forms 
of LPS stimulate strong innate immune responses. 



immunostimulants. They include the peptidoglycan cell wall and flagella of bac- 
teria, as well as lipopolysaccharide (LPS) on Gram-negative bacteria (Figure 
24-47) and teichoic acids on Gram-positive bacteria (see Figure 24-4B). They 
also include molecules in the cell walls of fungi, including mannan, glucan, and 
chitin. Many eucaryotic parasites also contain unique membrane components 
that act as immunostimulants, including glycosylphosphatidylinositol in Plas- 
modium. In order to avoid making unnecessary immune responses, the host 
must be able to distinguish between microbe-associated immunostimulants 
produced by pathogens and very similar or identical molecules released from 
normal flora. In many cases, differences in concentration of the immunostimu- 
lant may be sufficient; chronic low concentrations of the molecule may be mon- 
itored by the immune system but do not provoke a reaction, while sudden 
increases in concentration or the appearance of immunostimulants in normally 
sterile areas of the body will trigger an innate immune response. 

Short sequences in bacterial or viral DNA can also act as immunostimulants. 
The culprit is a "CpG motif," which consists of the unmethylated dinucleotide 
CpG flanked by two 5' purine residues and two 3' pyrimidines. This short 
sequence is at least 20 times less common in vertebrate DNA than in bacterial or 
viral DNA, and it can activate innate immune responses. 

The various classes of microbe-associated immunostimulants often occur 
in repeating patterns and are therefore often called pathogen-associated molec- 
ular patterns (PAMPs). Several types of dedicated receptors in the host, collec- 
tively called pattern recognition receptors, recognize these patterns. These 
receptors include soluble receptors in the blood (components of the comple- 
ment system, which we discuss below) and membrane-bound receptors on or in 
host cells (including members of the Toll-like receptor family, which we consider 
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later). The cell- associated receptors have two functions: they initiate the phago- 
cytosis of the pathogen, and they activate a program of gene expression in the 
host cell responsible for innate immune responses. Some of the complement 
components also aid in phagocytosis and, in some cases, the direct killing of the 
pathogen, as we now discuss. 



Complement Activation Targets Pathogens for Phagocytosis or 
Lysis 

The complement system consists of about 20 interacting soluble proteins that 
are made mainly by the liver and circulate in the blood and extracellular fluid. 
Most are inactive until an infection activates them. They were originally identi- 
fied by their ability to amplify and "complement" the action of antibodies, but 
some complement components are also pattern recognition receptors that 
microbe-associated immunostimulants activate directly. 

The early complement components are activated first. There are three sets of 
these, belonging to three distinct pathways of complement activation — the 
classical pathway, the lectin pathway, and the alternative pathway. The early 
components of all three pathways act locally to activate C3, which is the pivotal 
component of complement (Figure 24-48). Individuals with a C3 deficiency are 
subject to repeated bacterial infections. The early components are proenzymes, 
which are activated sequentially by proteolytic cleavage. The cleavage of each 
proenzyme in the series activates the next component to generate a serine pro- 
tease, which cleaves the next proenzyme in the series, and so on. Since each 
activated enzyme cleaves many molecules of the next proenzyme in the chain, 
the activation of the early components consists of an amplifying, proteolytic 
cascade. 

Many of these cleavages liberate a biologically active small peptide fragment 
that can attract phagocytic cells such as neutrophils, and a membrane-binding 
larger fragment. The binding of the large fragment to a cell membrane, usually 
the surface of a pathogen, helps to carry out the next reaction in the sequence. 
In this way, complement activation is confined largely to the particular cell sur- 
face where it began. The larger fragment of C3, called C3b, binds covalently to 
the surface of the pathogen, where it recruits fragments of cleaved C2 and C3 to 
form a proteolytic complex (C4b, C2b, C3b) that catalyzes the subsequent steps 
in the complement cascade. Specific receptors on phagocytic cells that 
enhance the ability of these cells to phagocytose the pathogen also recognize 
C3b. In addition, receptors on B cells recognize C3b, which is the reason that 
C3b-coated pathogens are especially efficient at stimulating B cells to make 
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Figure 24-48 The principal stages in 
complement activation by the classical, 
lectin, and alternative pathways. In all 

three pathways, the reactions of 
complement activation usually take place 
on the surface of an invading microbe, 
such as a bacterium, and lead to the 
cleavage of C3. C1-C9, mannose-binding 
lectin (MBL), MBL-associated serine 
protease (MASP), and factors B and D are 
the central components of the 
complement system; various other 
components regulate the system. The 
early components are shown within gray 
arrows, while the late components are 
shown within a brown arrow. 
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antibodies (discussed in Chapter 25). The smaller fragment of C3 (called C3a), as 
well as fragments of C4 and C5 (see Figure 24-48), act independently as dif- 
fusible signals to promote an inflammatory response by recruiting phagocytes 
and lymphocytes to the site of infection. 

IgG or IgM antibody molecules (discussed in Chapter 25) bound to the sur- 
face of a microbe activate the classical pathway. Mannan-binding lectin, the 
protein that initiates the second pathway of complement activation, is a serum 
protein that forms clusters of six carbohydrate-binding heads around a central 
collagen-like stalk. This assembly binds specifically to mannose and fucose 
residues in bacterial cell walls that have the correct spacing and orientation to 
match up perfectly with the six carbohydrate -binding sites, providing a good 
example of a pattern recognition receptor. These initial binding events in the 
classical and lectin pathways cause the recruitment and activation of the early 
complement components. Molecules on the surface of pathogens often activate 
the alternative pathway; activation of the classical or lectin pathways also acti- 
vates the alternative pathway, forming a positive feedback loop that amplifies 
the effects of the classical or lectin pathway. 

Host cells produce various proteins and surface modifications that prevent 
the complement reaction from proceeding on their cell surface. The most 
important of these is the carbohydrate moiety sialic acid, a common constituent 
of cell surface glycoproteins and glycolipids. Because pathogens generally lack 
these surface components, they are singled out for destruction, while host cells 
are spared. At least one pathogen, Neisseria gonorrhoeae, the bacterium that 
causes the sexually transmitted disease gonorrhea, has developed the ability to 
exploit this feature of host self-protection. Coating itself with a layer of sialic 
acid, it effectively hides from the complement activation system. 

Membrane-immobilized C3b, produced by any of the three pathways, trig- 
gers a further cascade of reactions that leads to the assembly of the late comple- 
ment components to form membrane attack complexes (Figure 24-49). These 
complexes assemble in the pathogen membrane near the site of C3 activation 
and have a characteristic appearance in negatively stained electron micro- 
graphs, where they are seen to form aqueous pores through the membrane (Fig- 
ure 24-50) . For this reason, and because they perturb the structure of the bilayer 
in their vicinity, they make the membrane leaky and can, in some cases, cause 
the microbial cell to lyse, much like the defensins mentioned earlier. 

The self- amplifying, inflammatory, and destructive properties of the com- 
plement cascade make it essential that key activated components be rapidly 
inactivated after they are generated to ensure that the attack does not spread to 
nearby host cells. Inactivation is achieved in at least two ways. First, specific 
inhibitor proteins in the blood or on the surface of host cells terminate the cas- 
cade, by either binding or cleaving certain components once they have been 
activated by proteolytic cleavage. Second, many of the activated components in 
the cascade are unstable; unless they bind immediately to either another appro- 
priate complement component in the cascade or to a nearby membrane, they 
rapidly become inactive. 



Figure 24-49 Assembly of the late 
complement components to form a 
membrane attack complex. When C3b is 
produced by any of the three 
complement activation pathways, it is 
immobilized on a membrane, where it 
recruits C4b and C2b to form a 
proteolytic complex. This complex then 
cleaves the first of the late components, 
C5, to produce C5a (not shown) and C5b. 
C5b remains loosely bound to C3b (not 
shown) and rapidly assembles with C6 
and C7 to form C567, which then binds 
firmly via C7 to the membrane, as 
illustrated. To this complex is added one 
molecule of C8 to form C5678. The 
binding of a molecule of C9 to C5678 
induces a conformational change in C9 
that exposes a hydrophobic region and 
causes C9 to insert into the lipid bilayer 
of the target cell. This starts a chain 
reaction in which the altered C9 binds a 
second molecule of C9, which can then 
bind another molecule of C9, and so on. 
In this way, a chain of C9 molecules forms 
a large transmembrane channel in the 
membrane. 
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Figure 24-50 Electron micrographs of 
negatively stained complement lesions 
in the plasma membrane of a red blood 
cell. The lesion in (A) is seen en face, while 
that in (B) is seen from the side as an 
apparent transmembrane channel. The 
negative stain fills the channels, which 
therefore look black. This red blood cell 
has been deliberately sensitized to be 
susceptible to lysis by complement. 
(From R. Dourmashkin, Immunology 
35:205-21 2, 1 978. With permission from 
Blackwell Publishing.) 
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Toll-like Proteins and NOD Proteins Are an Ancient Family of 
Pattern Recognition Receptors 

Many of the mammalian pattern recognition receptors responsible for trigger- 
ing innate immune responses to pathogens are members of the Toll-like recep- 
tor (TLR) family. DrosophilaToll is a transmembrane protein with a large extra- 
cellular domain consisting of a series of leucine-rich repeats (see Figure 15-82). 
It was originally identified as a protein involved in the establishment of 
dorsoventral polarity in developing fly embryos (discussed in Chapter 22). It is 
also involved, however, in the adult fly's resistance to fungal infections. The 
intracellular signal transduction pathway activated downstream of Toll when a 
fly is exposed to a pathogenic fungus leads to the translocation of the NFkB pro- 
tein (discussed in Chapter 15) into the nucleus, where it activates the transcrip- 
tion of various genes, including those encoding antifungal defensins. The 
leucine-rich repeats found in Toll and TLRs are versatile structural motifs that 
are useful for binding a wide variety of ligands. Besides their role in pathogen 
recognition in both animals and plants, proteins with leucine-rich repeats have 
roles in signal transduction, DNA repair, and cell-cell and cell-matrix adhesion. 
Humans have at least 10 TLRs, several of which play important parts in rec- 
ognizing microbe-associated immunostimulants made by bacteria, viruses, 
fungi, and parasites. Different ligands activate different TLRs: TLR4, for example, 
recognizes lipopolysaccharide (LPS) from the outer membrane of Gram-nega- 
tive bacteria, TLR9 recognizes CpG DNA, and TLR5 recognizes the protein that 
makes up the bacterial flagellum. Most TLRs are on cell surfaces; they are abun- 
dant, for instance, on the surface of macrophages, dendritic cells, and neu- 
trophils, as well as on the surface of epithelial cells lining the respiratory and 
intestinal tracts. Others, however, are associated with intracellular membranes, 
where they can detect intracellular pathogens. TLRs act as an alarm system to 
alert both the innate and adaptive immune systems that an infection is brewing. 
In mammals, they activate a variety of intracellular signaling pathways, which in 
turn stimulate the transcription of hundreds of genes, especially those that pro- 
mote inflammatory responses (discussed later) and help induce adaptive 
immune responses (Figure 25-51). 

A second family of pattern recognition receptors is exclusively intracellular. 
They are called NOD proteins and also have leucine-rich repeat motifs. They are 
also functionally similar to TLRs but recognize a distinct set of ligands, including 



LPS 



, LPS-binding protein (LBP) 

.Toll-like receptor 4 (TLR4) 



BLOOD 




TRANSCRIPTION OF 
PRO-INFLAMMATORY GENES 



TRANSCRIPTION OF 
INTERFERON-INDUCIBLE GENES 



Figure 24-51 The activation of a 
macrophage by lipopolysaccharide 

(LPS). LPS binds to LPS-binding protein 
(LBP) in the blood, and the complex 
binds to the GPI-anchored protein CD14 
on the macrophage surface. The ternary 
complex then activates Toll-like receptor 
4 (TLR4), which activates multiple, 
downstream, intracellular signaling 
pathways. As a result, at least four gene 
regulatory proteins are activated, 
including NFkB, an AP1 complex of Jun 
and Fos, and two interferon regulatory 
factors, IRF3 and IRF5.This strong, 
multifaceted transcriptional response 
results in the production of interferons 
and pro-inflammatory cytokines, 
including chemokines that recruit various 
white blood cells to the site of 
macrophage activation, reflecting the 
significant danger that a macrophage 
perceives when it encounters a high 
concentration of LPS. 
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Figure 24-52 Microbial disease in a 
plant. The tomato leaves shown here are 
infected with the leaf mold fungus 
Cladosporium fulvum. Resistance to this 
type of infection depends on recognition 
of a fungal protein by host cell receptors 
that are structurally related toTLRs. 
(Courtesy of Jonathan Jones.) 



bacterial cell wall components. The different allelic forms of NODs and TLRs 
that an individual expresses play an important part in influencing their suscep- 
tibility to certain infectious diseases; particular polymorphisms in TLR4 and 
TLR5, for example, correlate with susceptibility to Legionella pneumophila, and 
members of families that express a particular allele of NOD2 have a greatly 
increased chance of suffering from Crohn's disease, a chronic inflammatory dis- 
ease of the small intestine, which is thought to be triggered by bacterial infec- 
tion. 

Proteins related to Toll, TLRs, and NODs are apparently involved in innate 
immunity in all multicellular organisms. In plants, proteins with leucine-rich 
repeats and with domains homologous to the cytosolic portion of TLRs are 
required for resistance to fungal, bacterial, and viral pathogens (Figure 24-52). 
Thus, at least two families of proteins that function in innate immunity — the 
defensins and the TLR/NOD families — seem to be evolutionarily very ancient, 
perhaps predating the split between animals and plants over a billion years ago. 
Their conservation during evolution underlines the importance of innate 
immune responses in the defense against microbial pathogens. 



Phagocytic Cells Seek, Engulf, and Destroy Pathogens 

In all animals, invertebrate as well as vertebrate, the recognition of a microbial 
invader is usually quickly followed by its engulfment by a phagocytic cell. Plants, 
however, lack this type of innate immune response. In vertebrates, macrophages 
are professional phagocytes that reside in tissues throughout the body and are 
especially abundant in areas where infections are likely to arise, including the 
respiratory and intestinal tracts, for example. They are also present in large 
numbers in connective tissues, in the liver and spleen. These long-lived cells 
patrol the tissues and are among the first cells to encounter invading microbes. 
Neutrophils are the second major type of professional phagocytic cells in verte- 
brates. In contrast to macrophages, they are short-lived cells, abundant in blood 
but not present in normal, healthy tissues. They are rapidly recruited to sites 
of infection by activated macrophages, by molecules such as formylmethion- 
ine- containing peptides that the microbes themselves release, and by peptide 
fragments of cleaved complement components. Neutrophils can detect com- 
plement-derived chemoattractants at concentrations as low as 10 -11 M. 

Macrophages and neutrophils display a variety of cell- surface receptors that 
enable them to recognize and engulf pathogens. <TGTA> These include pattern 
recognition receptors such as TLRs, receptors for antibodies produced by the 
adaptive immune system, and receptors for the C3b component of complement. 
Binding to any of these receptors induces actin polymerization at the site of 
pathogen attachment, causing the phagocyte's plasma membrane to surround 
the pathogen and engulf it in a large membrane-enclosed phagosome (Figure 
24-53). Although some bacteria can actually actively induce a host cell such as 
an epithelial cell to phagocytose them as a mechanism for invading the cell (see 
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Figure 24-53 Phagocytosis. This 
scanning electron micrograph shows a 
macrophage in the midst of ingesting 
five red blood cells that have been 
coated with an antibody against a 
surface glycoprotein. (From E.S. Gold et 
al., J. Exp. Med. 1 90:1 849-1 856, 1 999. 
With permission from The Rockefeller 
University Press.) 
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Figure 24-25), phagocytosis by a macrophage or neutrophil generally leads to 
the ingested pathogen's death. Unsurprisingly, some pathogens use specific 
mechanisms to avoid phagocytosis by macrophages or neutrophils. One strat- 
egy is to secrete a thick, slimy layer of polysaccharides, called a capsule, which 
blocks access of complement components to the bacterial surface and also 
makes it physically difficult for the phagocytic cell to bind to and engulf the bac- 
terium. Another strategy, used by Yersinia pestis (the causative bacterium of 
plague), for example, is to deliver a toxin into the macrophage via a type III 
secretory system (see Figure 24-8) that disrupts the assembly of the actin 
cytoskeleton and thereby prevents phagocytosis. 

When a macrophage or neutrophil engulfs a pathogen, it unleashes an 
impressive armory of weapons to kill it. Exposure to both microbe-associated 
immunostimulants and chemical signals produced by the immune response to 
the pathogen enhances the phagocytic and killing power of the phagocytes. This 
exposure is said to "activate" the phagocyte, putting it in a state of high alert, in 
which not only is it more effective at phagocytosing and killing pathogens, but it 
also releases cytokines to attract more white blood cells to the site of infection. 
The location of the phagocyte's weaponry is readily visible in the light or electron 
microscope as dense membrane-enclosed organelles called granules. These 
specialized lysosomal derivatives fuse with the phagosomes, delivering enzymes 
such as lysozyme and acid hydrolases that can degrade the pathogen's cell wall 
and proteins. The granules also contain defensins, the antimicrobial peptides 
that make up about 15% of the total protein in neutrophils. In addition, the 
phagocyte assembles NADPH oxidase complexes on the phagolysosomal mem- 
brane, which catalyze the production of highly toxic oxygen- derived com- 
pounds, including superoxide (O2"), hypochlorite (HOC1, the active ingredient in 
bleach), hydrogen peroxide, and hydroxyl radicals. A transient increase in oxy- 
gen consumption by the phagocytic cells, called the respiratory burst, accompa- 
nies the production of these toxic compounds. It is not only these highly reactive 
oxygen- derived compounds that damage the pathogen trapped within the 
phagolysosome. The action of NADPH oxidase transports electrons into the 
phagolysosome and induces a compensatory movement of K + along with the 
electron, which has the net effect of raising the pH. The high pH in the 
phagolysosome activates a group of potent neutral proteases, which the low pH 
of the lysosomal granule kept inactive prior to fusion with the phagosome. The 
neutral proteases quickly destroy the hapless pathogens trapped in the 
phagolysosome. Whereas macrophages generally survive this killing frenzy and 
live to kill again, neutrophils usually do not. Dead and dying neutrophils are a 
major component of the pus that forms in acutely infected wounds. The dis- 
tinctive greenish tint of pus is due to the abundance in neutrophils of the cop- 
per-containing enzyme myeloperoxidase, which is one of the components 
active in the respiratory burst. 

If a pathogen is too large to be successfully phagocytosed (if it is a large par- 
asite such as a nematode, for example), a group of macrophages, neutrophils, or 
eosinophils (discussed in Chapter 23) will gather around the invader. They 
secrete their defensins and other bactericidal products contained in their gran- 
ules by exocytosis, and they also release the toxic products of the respiratory 
burst (Figure 24-54). This barrage is generally sufficient to destroy the 
pathogen. In some cases, neutrophils have been observed to eject large parts of 
their chromatin along with the contents of their granules. The ejected DNA, with 
its attached histones, form a sticky web that entraps nearby bacteria, preventing 
their escape (Figure 24-55). Because its sole function is to sacrifice itself to kill 
invading pathogens, a neutrophil has no hesitation in using every tool available, 
including its own DNA, to accomplish this task. 



Figure 24-54 Eosinophils attacking a schistosome larva. Phagocytes 
cannot ingest large parasites such as the schistosome larva shown here. 
When the larva is coated with antibody or complement, however, 
eosinophils (and other white blood cells) can recognize it and collectively 
kill it by secreting various toxic molecules. (Courtesy of Anthony 
Butterworth.) 
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Figure 24-55 Neutrophils eject their 
chromatin to trap bacteria in a sticky 
web. (A) Scanning electron micrograph 
showing resting neutrophils. 
(B) Activated neutrophils have multiple 
surface ruffles and are associated with 
fibrous strands (arrow). The strands 
contain DNA and histones and appear to 
be chromatin from the neutrophil nucleus 
that has been ejected during the process 
of activation. (C) The sticky chromatin can 
entrap many different kinds of bacteria, 
including (left to right) Staphylococcus 
aureus, Salmonella enterica, and Shigella 
flexneri. (From V. Brinkmann et al., Science 
303:1 532-1 535, 2004. With permission 
from AAAS.) 
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Activated Macrophages Contribute to the Inflammatory 
Response at Sites of Infection 

When a pathogen invades a tissue, it almost always elicits an inflammatory 
response. Changes in local blood vessels cause a response characterized by 
pain, redness, heat, and swelling at the site of infection (physicians have recog- 
nized these four signs of inflammation, in Latin dolor, rubor, calor, and turgor, 
for thousands of years). The blood vessels dilate and become permeable to fluid 
and proteins, leading to local swelling and an accumulation of blood proteins 
that aid in defense, including components of the complement cascade. At the 
same time, the endothelial cells lining the local blood vessels are stimulated to 
express cell adhesion proteins (discussed in Chapter 19) that facilitate the 
attachment and extravasion of white blood cells, initially neutrophils, followed 
later by lymphocytes and monocytes (the blood-borne precursors of 
macrophages). <CCCC> 

Whereas neutrophils usually die at the site of inflammation, macrophages 
frequently survive their initial encounter with invading pathogens and can 
migrate to other parts of the body. Pathogens that can survive within the 
macrophage, such as the bacterium Salmonella enterica serovar Typhi, for 
example, can use the macrophages as a transport system to spread a localized 
infection to distant sites in the body, converting a minor local invasion event in 
the gut into a severe systemic disease, typhoid fever. 

Various signaling molecules mediate the inflammatory response at the site of 
an infection. Activation of TLRs results in the production of both lipid signaling 
molecules, such as prostaglandins, and protein (or peptide) signaling molecules, 
such as cytokines (discussed in Chapter 15), all of which contribute to the inflam- 
matory response, as do the complement fragments released during complement 
activation. Some of the cytokines produced by activated macrophages are 
chemoattractants (called chemokines). Some of these attract neutrophils, which 
are the first cells recruited in large numbers to the site of a new infection. Other 
cytokines trigger fever, a rise in body temperature. On balance, fever helps fight 
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Figure 24-56 Inflammation of the airways in severe asthma. Light 
micrograph of a section through a bronchus of a patient who died during a 
prolonged and severe asthmatic attack. There is almost total occlusion of 
the airway by a mucus plug. The mucus plug is a dense inflammatory 
infiltrate that includes eosinophils, neutrophils, and lymphocytes. (Courtesy 
of Thomas Krausz.) 



infection, since most bacterial and viral pathogens proliferate better at lower 
temperatures, whereas adaptive immune responses are more potent at higher 
temperatures. Still other proinflammatory signaling molecules stimulate 
endothelial cells to express proteins that trigger blood clotting in local small ves- 
sels. By occluding the vessels and cutting off blood flow, this response can help 
prevent the pathogen from entering the bloodstream and spreading the infec- 
tion to other parts of the body. 

The same inflammatory responses that help control local infections, how- 
ever, can have disastrous consequences when they occur in response to a dis- 
seminated infection in the bloodstream, a condition called sepsis. The systemic 
release of proinflammatory signaling molecules into the blood causes dilation of 
blood vessels, and loss of plasma volume, which, together, cause a large fall in 
blood pressure, or shock; in addition, there is widespread blood clotting. The end 
result, known as septic shock, is often fatal. Inappropriate or overzealous local 
inflammatory responses can also contribute to chronic diseases, such as asthma 
(Figure 24-56) and arthritis. 

Just as some pathogens have developed mechanisms to avoid the lethal con- 
sequences of phagocytosis, so some have acquired mechanisms to either pre- 
vent the inflammatory response or, in some cases, take advantage of it to spread 
the infection. Many viruses, for example, encode potent cytokine antagonists 
that block aspects of the inflammatory response. Some of these antagonists are 
simply modified forms of cytokine receptors, encoded by genes that the virus 
originally acquired from the host genome. They bind the cytokines with high 
affinity and block their activity. Some bacteria, such as Salmonella, induce an 
inflammatory response in the gut at the initial site of infection, thereby recruit- 
ing macrophages and neutrophils that they then invade. In this way, the bacte- 
ria hitch a ride to other tissues in the body. 
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Virus-Infected Cells Take Drastic Measures to Prevent Viral 
Replication 

The microbe-associated immunostimulants on the surface of bacteria and par- 
asites that are so important in eliciting innate immune responses against these 
pathogens are generally not present on the surface of viruses. Host cell ribo- 
somes construct viral proteins, and host cell lipids form the membranes of 
enveloped viruses. The only general way that a host cell can recognize the pres- 
ence of a virus is to detect unusual elements of the viral genome, such as the 
double- stranded RNA (dsRNA) that is an intermediate in the life cycle of many 
viruses. DNA virus genomes frequently contain significant amounts of CpG din- 
uclotide, which can be recognized by the Toll-like receptor TLR9, as discussed 
earlier. 

Mammalian cells are particularly adept at recognizing the presence of 
dsRNA, and they can mobilize a program of intracellular responses to eliminate 
it. The program occurs in two steps. First, the cell degrades the dsRNA into small 
fragments (about 21-25 nucleotide pairs in length), using the enzyme Dicer. 
These double-stranded fragments bind to any single- stranded RNA (ssRNA) in 
the host cell that has the same sequence as either strand of the dsRNA fragment, 
leading to the destruction of the ssRNA. This dsRNA- directed ssRNA destruction 
is the basis of the technique of RNA interference (RNAi) that researchers use to 
destroy specific mRNAs and thereby block specific gene expression (discussed in 
Chapter 8). Second, the dsRNA induces the host cell to produce and secrete two 
cytokines — interferon- a (IFNa) and interferon- P (IFN|$), which act in both an 
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autocrine fashion on the infected cell and a paracrine fashion on uninfected 
neighbors. The binding of the interferons to their cell-surface receptors stimu- 
lates specific gene transcription by the Jak-STAT intracellular signaling pathway 
(see Figure 15-68), leading to the production of more than 300 gene products, 
including a large number of cytokines, reflecting the complexity of the cell's 
acute response to a viral infection. 

The interferon response appears to be a general reaction of a mammalian 
cell to a viral infection, and viral components other than dsRNA can trigger it. 
In addition to their effects on host cell gene transcription, the interferons acti- 
vate a latent ribonuclease, which nonspecifically degrades ssRNA. They also 
indirectly activate a protein kinase that phosphorylates and inactivates the 
protein synthesis initiation factor eIF-2, thereby shutting down most protein 
synthesis in the embattled host cell. Apparently, by destroying most of its RNA 
and transiently halting most of its protein synthesis, the host cell inhibits viral 
replication without killing itself. If these measures fail, the cell takes the even 
more extreme step of killing itself by apoptosis to prevent the virus from repli- 
cating, often with the help of a killer lymphocyte, as we discuss below and in 
Chapter 25. 

Mammalian cells have a special defense mechanism to help them deal with 
retroviruses. These viruses activate a family of proteins called APOBEC (named 
because they are also members of the editing complex that modifies the mRNA 
for the protein ApoB, which is the major protein component of the low-density 
lipoprotein, LDL). These enzymes deaminate cytosines in nascent retroviral 
cDNAs, converting them into uridine and thereby generating large numbers of 
mutations in the viral genome, leading eventually to the termination of viral 
replication. 

Not surprisingly, many viruses have acquired mechanisms to defeat or avoid 
these intracellular defense processes. Influenza virus encodes a protein that 
blocks the recognition of dsRNA by Dicer. HIV encodes a small protein that 
mediates the ubiquitylation and proteasome-mediated degradation of the 
APOBEC proteins. Many viruses, including most of those that are able to cause 
disease in healthy hosts, use various mechanisms to block the activation of the 
interferon pathway. Some viruses also inhibit host cell apoptosis, which can 
have the side-effect of promoting the development of cancer; this is one way in 
which the Epstein-Barr virus occasionally causes Burkitt's lymphoma. 



Natural Killer Cells Induce Virus-Infected Cells to Kill Themselves 

Interferons have other, less direct ways of blocking viral replication. One of these 
is to enhance the activity of natural killer cells (NK cells), which are part of the 
innate immune system. Like cytotoxic T cells of the adaptive immune system 
(discussed in Chapter 25), NK cells destroy virus-infected cells by inducing the 
infected cells to kill themselves by undergoing apoptosis. The ways in which 
cytotoxic T cells and NK cells distinguish virus-infected cells from uninfected 
cells, however, is different. 

Both cytotoxic T cells and NK cells recognize the same special class of cell- 
surface proteins to detect virus-infected host cells, The proteins are called class 
I MHC proteins, <AAGT> because they are encoded by genes in the major histo- 
compatibility complex; almost all vertebrate cells express these genes, and we 
discuss them in detail in Chapter 25. Cytotoxic T cells recognize peptide frag- 
ments of viral proteins bound to these MHC proteins on the surface of virus - 
infected cells. By contrast, NK cells monitor the level of class I MHC proteins on 
the surface of all host cells: high levels inhibit the killing activity of NK cells, so 
that NK cells selectively kill host cells expressing low levels, which are mainly 
virus-infected cells and some cancer cells (Figure 24-57). 

The reason that class I MHC protein levels are often low on virus-infected 
cells is that many viruses have developed mechanisms to inhibit the expression 
of these proteins on the surface of the cells they infect, to avoid detection by 
cytotoxic T lymphocytes. Adenovirus and HIV, for example, encode proteins that 
block class I MHC gene transcription. Herpes simplex virus and cytomegalovirus 
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Figure 24-57 A natural killer (NK) cell 
attacking a cancer cell. The NK cell is the 
smaller cell on the left. This scanning 
electron micrograph was taken shortly 
after the NK cell attached, but before it 
induced the cancer cell to kill itself. 
(Courtesy of J.C. Hiserodt, in Mechanisms 
of Cytotoxicity by Natural Killer Cells [R.B. 
Herberman and D. Callewaert, eds.]. New 
York: Academic Press, 1 995.) 
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block the peptide translocators in the ER membrane that transport proteasome- 
derived peptide fragments of viral proteins and host cell proteins from the 
cytosol into the lumen of the ER; such peptides are required for newly made 
class I MHC proteins to assemble in the ER membrane and be transported 
through the Golgi apparatus to the cell surface as peptide-MHC complexes (see 
Figure 25-59). Cytomegalovirus causes the retro translocation of class I MHC 
proteins from the ER membrane into the cytosol, where they are rapidly 
degraded in proteasomes. Proteins encoded by still other viruses prevent the 
delivery of assembled class I MHC proteins from the ER to the Golgi apparatus, 
or from the Golgi apparatus to the plasma membrane. 

By evading recognition by cytotoxic T cells in these ways, however, a virus 
incurs the wrath of NK cells. The local production of IFN(3 activates the killing 
activity of NK cells and also increases the expression of class I MHC proteins in 
uninfected cells. The cells infected with a virus that blocks class I MHC expres- 
sion are thereby exposed as being different and become the victims of the acti- 
vated NK cells. Thus, it is difficult for viruses to hide from both cytotoxic T cells 
and NK cells simultaneously. Remarkably, however, some large DNA viruses, 
including cytomegalovirus, encode MHC-like proteins that are expressed on the 
surface of the host cells they infect. Like bona fide class I MHC proteins, these 
MHC mimics activate inhibitory receptors on NK cells, and block the killing 
activity of the NK cells. 

Both NK cells and cytotoxic T lymphocytes kill infected target cells by induc- 
ing them to undergo apoptosis before the virus has had a chance to replicate. It 
is not surprising, then, that many viruses have evolved mechanisms to inhibit 
apoptosis, particularly early in infection. As discussed in Chapter 18, apoptosis 
depends on an intracellular proteolytic cascade, which the cytotoxic cells can 
trigger, either through the activation of cell-surface death receptors or by inject- 
ing a proteolytic enzyme into the target cell (see Figure 24-47). Viral proteins can 
interfere with nearly every step in these pathways. 

Dendritic Cells Provide the Link Between the Innate and Adaptive 
Immune Systems 

Dendritic cells are crucially important cells of the innate immune system that 
are widely distributed in the tissues and organs of vertebrates. They display a 
large variety of pattern recognition receptors, including TLRs and NOD proteins, 
that enable the cells to recognize and phagocytose invading pathogens and to 
become activated in the process. The dendritic cells cleave the proteins of the 
pathogens into peptide fragments, which then bind to MHC proteins that carry 
the fragments to the cell surface. The activated dendritic cells now carry the 
pathogen-derived peptides, as complexes with MHC proteins, to a nearby lym- 
phoid organ such as a lymph node, where they activate T cells of the adaptive 
immune system to join in the battle against the specific invader. In addition to 
the complexes of MHC proteins and microbial peptides displayed on their cell 
surface, activated dendritic cells also display special, cell-surface co- stimulatory 
proteins that help activate the T cells. The activated dendritic cells also secrete a 
variety of cytokines that influence the type of response that the T cells make, 
ensuring that it is appropriate to fight the particular pathogen. In these ways, 
dendritic cells serve as crucial links between the innate immune system, which 
provides a rapid first line of defense against invading pathogens, and the adap- 
tive immune system, which provides slower but more powerful and highly spe- 
cific ways of attacking an invader. 

The battle between pathogens and host defenses is remarkably balanced. At 
present, humans seem to be gaining a slight advantage, using public sanitation 
measures, vaccines, and drugs to aid the efforts of our innate and adaptive 
immune systems. However, infectious diseases are still a leading cause of death 
worldwide, and new epidemics such as AIDS continue to emerge. The rapid evo- 
lution of pathogens and the almost infinite variety of ways that they can invade 
the human body and elude immune responses will prevent us from ever win- 
ning the battle completely. 



REFERENCES 



1537 



In the next Chapter, we consider the unique and remarkable strategies that 
our adaptive immune system has evolved to defend us against such wily oppo- 
nents. Amazingly, this immune system can mount pathogen- specific responses 
against pathogens that have never existed before. 



Summary 

Physical barriers preventing infection, cell- intrinsic responses to infection, and innate 
immune responses provide early lines of defense against invading pathogens. All mul- 
ticellular organisms possess these defenses. In vertebrates, innate immune responses 
can also recruit specific and more powerful adaptive immune responses. Innate 
immune responses rely on the body's ability to recognize conserved features of microbial 
molecules that are not made by the host. These microbe- associated immunostimulants 
include many types of molecules on microbial surfaces, as well as the double- stranded 
RNA of some viruses. Many of these microbial molecules are recognized by pattern 
recognition receptors, including the toll-like receptors (TLRs) found in both plants and 
animals. In vertebrates, microbial surface molecules also activate complement, a 
group of blood proteins that are activated in sequence to target the microbe for phago- 
cytosis by macrophages and neutrophils, to disrupt the membrane of the microbe, and 
to produce an inflammatory response. The phagocytes use a combination ofdegrada- 
tive enzymes, antimicrobial peptides, and reactive oxygen species to kill the invading 
microorganism; in addition, they secrete signal molecules that trigger an inflamma- 
tory response. Cells infected with viruses produce interferons, which induce a series of 
cell responses, inhibit viral replication, and activate the killing activities of natural 
killer cells. Dendritic cells of the innate immune system ingest microbes at sites of 
infection and carry them and their products to local lymph nodes, where they activate 
T cells of the adaptive immune system to make specific responses against the microbes. 
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The Adaptive Immune System 




Our adaptive immune system saves us from certain death by infection. An 
infant born with a severely defective adaptive immune system will soon die 
unless extraordinary measures are taken to isolate it from a host of infectious 
agents, including bacteria, viruses, fungi, and parasites. All multicellular 
organisms need to defend themselves against infection by such potentially 
harmful invaders, collectively called pathogens. Invertebrates use relatively 
simple defense strategies that rely chiefly on protective barriers, toxic 
molecules, and phagocytic cells that ingest and destroy invading microorgan- 
isms (microbes) and larger parasites (such as worms). Vertebrates, too, depend 
on such innate immune responses as a first line of defense (discussed in 
Chapter 24), but they can also mount much more sophisticated defenses, 
called adaptive immune responses. In vertebrates, the innate responses call 
the adaptive immune responses into play, and both work together to eliminate 
the pathogens (Figure 25-1). 

Whereas the innate immune responses are general defense reactions, the 
adaptive responses are highly specific to the particular pathogen that induced 
them, and they provide long-lasting protection. A person who recovers from 
measles, for example, is protected for life against measles by the adaptive 
immune system, although not against other common viruses, such as those that 
cause mumps or chickenpox. In this chapter, we focus on adaptive immune 
responses, and, unless we indicate otherwise, we use the term "immune 
responses" to refer to them. 

Adaptive immune responses eliminate or destroy invading pathogens and 
any toxic molecules they produce. Because these responses are destructive, it is 
important that they are directed only against foreign molecules and not against 
molecules of the host itself. The adaptive immune system uses multiple mecha- 
nisms to avoid damaging responses against self molecules. Occasionally, how- 
ever, these mechanisms fail, and the system turns against the host, causing 
autoimmune diseases, which can be fatal. 

Many harmless foreign molecules enter the body, and it would be pointless 
and potentially dangerous to mount adaptive immune responses against them. 
Allergic conditions such as hayfever and allergic asthma are examples of delete- 
rious adaptive immune responses against apparently harmless foreign 
molecules. An individual normally avoids such inappropriate responses 
because the innate immune system only calls adaptive immune responses into 
play when it recognizes conserved patterns of molecules that are specifically 
expressed by invading pathogens. The innate immune system can even distin- 
guish between different classes of pathogens and recruit the most effective form 
of adaptive immune response to eliminate them. 

Any substance capable of eliciting an adaptive immune response is referred 
to as an antigen {antibody generator). Most of what we know about such 
responses has come from studies in which an experimenter tricks the adaptive 
immune system of a laboratory animal (usually a mouse) into responding to a 
harmless foreign molecule, such as a foreign protein. The trick involves injecting 
the harmless molecule together with immunostimulants (usually microbial in 
origin) called adjuvants, which activate the innate immune system. This trick is 
called immunization. If administered in this way, almost any macromolecule, as 
long as it is foreign to the recipient, can induce an adaptive immune response 
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Figure 25-1 Innate and adaptive immune responses. Innate immune 
responses are activated directly by pathogens and defend all multicellular 
organisms against infection. In vertebrates, pathogens, together with the 
innate immune responses they activate, stimulate adaptive immune 
responses, which then work together with innate immune responses to 
help fight the infection. 



that is specific to that macromolecule. Remarkably, the adaptive immune system 
can distinguish between antigens that are very similar — such as between two 
proteins that differ in only a single amino acid, or between two optical isomers of 
the same molecule. Thus, the adaptive immune system recognizes the fine 
molecular details of macromolecules. 

Adaptive immune responses are carried out by white blood cells called lym- 
phocytes. There are two broad classes of such responses — antibody responses 
and T-cell-mediated immune responses — and different classes of lymphocytes, 
called B cells and T cells, respectively, carry them out. In antibody responses, B 
cells are activated to secrete antibodies, which are proteins called immunoglob- 
ulins. The antibodies circulate in the bloodstream and permeate the other body 
fluids, where they bind specifically to the foreign antigen that stimulated their 
production (Figure 25-2) . Binding of antibody inactivates viruses and microbial 
toxins (such as tetanus toxin or diphtheria toxin) by blocking their ability to bind 
to receptors on host cells. Antibody binding also marks invading pathogens for 
destruction, mainly by making it easier for phagocytic cells of the innate 
immune system to ingest them. 

In T-cell-mediated immune responses, the second class of adaptive 
immune responses, activated T cells react directly against a foreign antigen that 
is presented to them on the surface of a host cell, which is therefore referred to 
as an antigen-presenting cell. Remarkably, T cells can detect microbes hiding 
inside host cells and either kill the infected cells or help the infected cells or 
other cells to eliminate the microbes. The T cell, for example, might kill a virus- 
infected host cell that has viral antigens on its surface, thereby eliminating the 
infected cell before the virus has had a chance to replicate (see Figure 25-2). In 
other cases, the T cell produces signal molecules that either activate 
macrophages to destroy the microbes that they have phagocytosed or help acti- 
vate B cells to make antibodies against the microbes. 

We begin this chapter by discussing the general properties of lymphocytes. 
We then consider the functional and structural features of antibodies that 
enable them to recognize and neutralize extracellular microbes and the toxins 
they make. Next, we discuss how B cells can produce a virtually unlimited num- 
ber of different antibody molecules. Finally, we consider the special features of 
T cells and the immune responses they mediate. 



LYMPHOCYTES AND THE CELLULAR BASIS OF 
ADAPTIVE IMMUNITY 

Lymphocytes are responsible for the astonishing specificity of adaptive immune 
responses. They occur in large numbers in the blood and lymph (the colorless 
fluid in the lymphatic vessels that connect the lymph nodes in the body to each 
other and to the bloodstream). They are also concentrated in lymphoid organs, 
such as the thymus, lymph nodes (also called lymph glands), spleen, and 
appendix (Figure 25-3). In this section, we discuss the general properties of lym- 
phocytes that apply to both B cells and T cells. 

Lymphocytes Are Required for Adaptive Immunity 

There are about 2 x 10 12 lymphocytes in the human body, making the immune 
system comparable in cell mass to the liver or the brain. Despite their abundance, 
their central role in adaptive immunity was not definitively demonstrated until 
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Figure 25-2 The two main classes of 
adaptive immune responses. 

Lymphocytes carry out both classes of 
responses. Here, the lymphocytes are 
responding to a viral infection. In one 
class of adaptive response, B cells secrete 
antibodies that neutralize the virus. In the 
other, a T-cell-mediated response, T cells 
kill the virus-infected cells. In both cases, 
innate immune responses help activate 
the adaptive immune responses through 
pathways that are not shown. 
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Figure 25-3 Human lymphoid organs. 

Lymphocytes develop in the thymus and 
bone marrow (yellow), which are 
therefore called central (or primary) 
lymphoid organs. The newly formed 
lymphocytes migrate from these primary 
organs to peripheral (or secondary) 
lymphoid organs, where they can react 
with foreign antigen. Only some of the 
peripheral lymphoid organs (blue) and 
lymphatic vessels (green) are shown; 
many lymphocytes, for example, are 
found in the skin and respiratory tract. As 
we discuss later, the lymphatic vessels 
ultimately empty into the bloodstream 
(not shown). 



the late 1950s. The crucial experiments were performed in mice and rats that 
were heavily irradiated to kill most of their white blood cells, including lympho- 
cytes. This treatment makes the animals unable to mount adaptive immune 
responses. Then, by transferring various types of cells into the animals it was 
possible to determine which cells reversed the deficiency. Lymphocytes were the 
only cell type able to restore the adaptive immune responses of irradiated ani- 
mals, indicating that they are required for these responses (Figure 25-4). 



The Innate and Adaptive Immune Systems Work Together 

As mentioned earlier, lymphocytes usually respond to foreign antigens only if the 
innate immune system is first activated. As discussed in Chapter 24, the rapid 
innate immune responses to an infection depend largely on pattern recognition 
receptors made by cells of the innate immune system. These receptors recognize 
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Figure 25-4 A classic experiment showing that lymphocytes are required for adaptive immune responses to foreign 
antigens. An important requirement of all such cell-transfer experiments is that cells are transferred between animals of 
the same inbred strain. Members of an inbred strain are genetically identical. If lymphocytes are transferred to a genetically 
different animal that has been irradiated, they react against the "foreign" antigens of the host and can kill the animal. In the 
experiment shown, the injection of lymphocytes restores both antibody and T-cell-mediated adaptive immune responses, 
indicating that lymphocytes are required for both types of responses. 
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microbe-associated molecules that are not present in the host organism, called 
microbe-associated immunostimulants. Because they often occur in repeating 
patterns, they are also called pathogen-associated molecular patterns (PAMPs). 
PAMPs include repeated patterns of molecular structure in microbial nucleic 
acids, lipids, polysaccharides, and proteins. 

Some of the pattern recognition receptors are present on the surface of pro- 
fessional phagocytic cells (phagocytes) such as macrophages and neutrophils, 
where they mediate the uptake of pathogens, which are then delivered to lyso- 
somes for destruction. Others are secreted and bind to the surface of pathogens, 
marking them for destruction by either phagocytes or a system of blood proteins 
collectively called the complement system (discussed in Chapter 24). Still others, 
including the Toll-like receptors (TLRs) discussed in Chapter 24, activate intra- 
cellular signaling pathways that lead to the secretion of extracellular signal 
molecules that promote inflammation and help activate adaptive immune 
responses. 

The cells of the vertebrate innate immune system that respond to PAMPs 
and activate adaptive immune responses most efficiently are dendritic cells. 
Present in most tissues, dendritic cells express high levels of TLRs and other pat- 
tern recognition receptors, and they function by presenting microbial antigens 
to T cells in peripheral lymphoid organs. In most cases, they recognize and 
phagocytose invading microbes or their products or fragments of infected cells 
at a site of infection and then migrate with their prey to a nearby lymph node; in 
other cases, they pick up microbes or their products directly in a peripheral lym- 
phoid organ such as the spleen. In either case, the microbial PAMPs activate the 
dendritic cells so that they, in turn, can directly activate the T cells in peripheral 
lymphoid organs to respond to the microbial antigens displayed on the den- 
dritic cell surface. Once activated, some of the T cells then migrate to the site of 
infection, where they help destroy the microbes (Figure 25-5). Other activated T 
cells remain in the lymphoid organ, where they help keep the dendritic cells 
active, help activate other T cells, and help activate B cells to make antibodies 
against the microbial antigens. 

Thus, innate immune responses are activated mainly at sites of infection (or 
injury), whereas adaptive immune responses are activated mainly in peripheral 
lymphoid organs such as lymph nodes and spleen. Both types of responses work 
together to eliminate invading pathogens and foreign macromolecules. 



Figure 25-5 How the innate immune 
system can help activate the adaptive 
immune system. Dendritic cells ingest 
invading microbes or their products at 
the site of an infection. The microbial 
PAMPs activate the dendritic cells to 
express co-stimulatory proteins on their 
surface and to migrate in lymphatic 
vessels to a nearby lymph node. In the 
lymph node, the activated dendritic cells 
activate the small fraction of T cells that 
express a receptor for the microbial 
antigens displayed on the dendritic cell 
surface. These T cells proliferate and 
some then migrate to the site of 
infection, where they help eliminate the 
microbes, by either helping to activate 
macrophages or killing infected cells 
(not shown). 
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B Lymphocytes Develop in the Bone Marrow; T Lymphocytes 
Develop in the Thymus 

T cells and B cells derive their names from the organs in which they develop. T 
cells develop in the thymus, and B cells, in mammals, develop in the bone mar- 
row in adults or the liver in fetuses. 

Both T and B cells are thought to develop from the same common lymphoid 
progenitor cells. The common lymphoid progenitor cells themselves derive from 
multipotential hemopoietic stem cells, which give rise to all of the blood cells, 
including red blood cells, white blood cells, and platelets. These stem cells (dis- 
cussed in Chapter 23) are located primarily in hemopoietic tissues — mainly the 
liver in fetuses and the bone marrow in adults. 

T cells develop in the thymus from common lymphoid progenitor cells that 
migrate there from the hemopoietic tissues via the blood. In most mammals, 
including humans and mice, B cells develop from common lymphoid progeni- 
tor cells in the hemopoietic tissues themselves (Figure 25-6). Because they are 
sites where lymphocytes develop from precursor cells, the thymus and 
hemopoietic tissues are referred to as central (primary) lymphoid organs (see 
Figure 25-3). 

As we discuss later, most lymphocytes die in the central lymphoid organs 
soon after they develop, without ever functioning. Others, however, mature and 
migrate via the blood to the peripheral (secondary) lymphoid organs — mainly, 
the lymph nodes, spleen, and epithelium- associated lymphoid tissues in the 
gastrointestinal tract, respiratory tract, and skin (see Figure 25-3). It is in these 
peripheral lymphoid organs that foreign antigens activate T and B cells (see Fig- 
ure 25-6). 

T and B cells become morphologically distinguishable from each other only 
after they have been activated by antigen. Resting T and B cells look very similar, 
even in an electron microscope. Both are small, only marginally bigger than red 
blood cells, and contain little cytoplasm (Figure 25-7A). After activation by an 
antigen, both proliferate and mature into effector cells. Effector B cells secrete 
antibodies. In their most mature form, called plasma cells, they are filled with an 
extensive rough endoplasmic reticulum that is busily making antibodies (Figure 
25-7B). In contrast, effector T cells (Figure 5-7C) contain very little endoplasmic 
reticulum and do not secrete antibodies; instead, they secrete a variety of signal 
proteins called cytokines, which act as local mediators. 

There are three main classes of T cells — cytotoxic T cells, helper T cells, and 
regulatory (suppressor) T cells. Cytotoxic T cells directly kill infected host cells. 
Helper T cells help activate macrophages, dendritic cells, B cells, and cytotoxic T 
cells by secreting a variety of cytokines and displaying a variety of co- stimulatory 
proteins on their surface. Regulatory T cells are thought to use similar strategies 
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Figure 25-6 The development of T and 
B cells. The central lymphoid organs, 
where lymphocytes develop from 
common lymphoid progenitor cells, are 
labeled in yellow boxes. The common 
lymphoid progenitor cells develop from 
multipotent hemopoietic stem cells in 
the bone marrow. Some of the common 
lymphoid progenitor cells develop locally 
in the bone marrow into immature 
B cells, while others migrate to the 
thymus (via the bloodstream) where they 
develop into thymocytes (developing 
T cells). T cells and B cells are activated by 
foreign antigens mainly in peripheral 
lymphoid organs, such as lymph nodes 
or the spleen. 
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to inhibit the function of helper T cells, cytotoxic T cells, and dendritic cells. 
Thus, whereas B cells can act over long distances by secreting antibodies that are 
widely distributed by the bloodstream, T cells can migrate to distant sites, but, 
once there, they act only locally on neighboring cells. 



The Adaptive Immune System Works by Clonal Selection 

The most remarkable feature of the adaptive immune system is that it can 
respond to millions of different foreign antigens in a highly specific way. Human 
B cells, for example, can make more than 10 12 different antibody molecules that 
react specifically with the antigen that induced their production. How do B cells 
produce such a diversity of specific antibodies? The answer began to emerge in 
the 1950s with the formulation of the clonal selection theory. According to this 
theory, an animal first randomly generates a vast diversity of lymphocytes and 
then selects for activation those lymphocytes that can react against the foreign 
antigens that the animal actually encounters. As each lymphocyte develops in a 
central lymphoid organ, it becomes committed to react with a particular antigen 
before ever being exposed to the antigen. It expresses this commitment in the 
form of cell-surface receptor proteins that specifically bind the antigen. When a 
lymphocyte encounters its antigen in a peripheral lymphoid organ, the binding 
of the antigen to the receptors activates the lymphocyte, causing it to proliferate, 
thereby producing many more cells with the same receptor — a process called 
clonal expansion (as cells derived from a common ancestor cell are referred to as 
a clone). The encounter with antigen also causes the cells to differentiate into 
effector cells. An antigen therefore selectively stimulates those cells that express 
complementary antigen- specific receptors and are thus already committed to 
respond to it (Figure 25-8). This arrangement is what makes adaptive immune 
responses antigen-specific. 

Compelling evidence supports the main tenets of the clonal selection the- 
ory. But how can the adaptive immune system produce lymphocytes that col- 
lectively display such an enormous diversity of receptors, including ones that 
recognize synthetic molecules that never occur in nature? We shall see later that, 
in humans, the antigen- specific receptors on both T and B cells are encoded by 
genes that are assembled from a series of gene segments by a special form of 
genetic recombination that occurs early in a lymphocyte's development, before 
it has encountered antigen. This assembly process generates an enormous 
diversity of receptors and lymphocytes, thereby enabling the immune system to 
respond to an almost unlimited variety of antigens. 



Figure 25-7 Electron micrographs of 
resting and effector lymphocytes. 

(A) This resting lymphocyte could be 
either a T cell or a B cell, as these cells are 
difficult to distinguish morphologically 
until they have been activated to become 
effector cells. (B) An effector B cell 
(a plasma cell). It is filled with an 
extensive rough endoplasmic reticulum 
(ER), which is distended with antibody 
molecules. (C) An effector T cell, which 
has relatively little rough ER but is filled 
with free ribosomes.The three cells are 
shown at the same magnification. 
(A, courtesy of Dorothy Zucker-Franklin; 
B, courtesy of Carlo Grossi; A and B, from 
D. Zucker-Franklin et al., Atlas of Blood 
Cells: Function and Pathology, 2nd ed. 
Milan, Italy: Edi. Ermes, 1 988; C, courtesy 
of Stefanello de Petris.) 
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Figure 25-8 The clonal selection theory. 

An antigen activates only those 
lymphocytes that are already committed 
to respond to it. A cell committed to 
respond to a particular antigen displays 
cell-surface receptors that specifically 
recognize the antigen. The human 
immune system is thought to consist of 
many millions of different lymphocyte 
clones, with cells within a clone 
expressing the same unique receptor. 
Before their first encounter with antigen, 
a clone would usually contain only one or 
a small number of cells. A particular 
antigen may activate hundreds of 
different clones. Although only B cells are 
shown here, T cells operate in a similar 
way. Note that the receptors on B cells 
are antibody molecules and that those on 
the B cells labeled "BP" in this diagram 
bind the same antigen as do the 
antibodies secreted by the effector 
"Bp" cells. 



Most Antigens Activate Many Different Lymphocyte Clones 

Most large molecules, including virtually all proteins and many polysaccharides, 
can act as antigens. Those parts of an antigen that bind to the antigen-binding 
site on either an antibody molecule or a lymphocyte receptor are called anti- 
genic determinants (or epitopes). Most antigens have a variety of antigenic 
determinants that can stimulate the production of antibodies, specific T cell 
responses, or both. Some determinants of an antigen produce a greater 
response than others, so that the reaction to them may dominate the overall 
response. Such determinants are called immunodominant. 

Any antigenic determinant is likely to activate many lymphocyte clones, 
each of which produces an antigen-binding site with its own characteristic affin- 
ity for the determinant. Even a relatively simple structure, like the dinitrophenyl 
(DNP) group in Figure 25-9, can be "looked at" in many ways. When it is cou- 
pled to a protein, as shown in the figure, it usually stimulates the production of 
hundreds of species of anti-DNP antibodies, each made by a different B cell 
clone. Such responses are said to be polyclonal. In oligoclonal responses, only a 
few clones are activated, and in monoclonal responses only a single B or T cell 
clone is activated. Monoclonal antibodies are widely used as tools in biology and 
medicine, but they have to be produced in a special way (see Figure 8-8), as the 
responses to most antigens are polyclonal. 



Immunological Memory Involves Both Clonal Expansion and 
Lymphocyte Differentiation 

The adaptive immune system, like the nervous system, can remember prior 
experiences. This is why we develop lifelong immunity to many common infec- 
tious diseases after our initial exposure to the pathogen, and it is why vaccina- 
tion works. The same phenomenon can be demonstrated in experimental ani- 
mals. If an animal is immunized once with antigen A, an immune response 
(antibody, T-cell-mediated, or both) appears after several days, rises rapidly and 



Figure 25-9 The dinitrophenyl (DNP) group. Although it is too small to 
induce an immune response on its own, when it is coupled covalently to a 
lysine side chain on a protein, as illustrated, DNP stimulates the production 
of hundreds of different species of antibodies that all bind specifically to it. 
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exponentially, and then, more gradually, declines. This is the characteristic 
course of a primary immune response, occurring on an animal's first exposure 
to an antigen. If, after some weeks, months, or even years have elapsed, the ani- 
mal is immunized again with antigen A, it will usually produce a secondary 
immune response that differs from the primary response: the lag period is 
shorter, and the response is greater and more efficient. These differences indi- 
cate that the animal has "remembered" its first exposure to antigen A. If the ani- 
mal is given a different antigen (for example, antigen B) instead of a second 
immunization with antigen A, the response is typical of a primary, and not a sec- 
ondary, immune response. The secondary response must therefore reflect anti- 
gen-specific immunological memory for antigen A (Figure 25-10). 

The clonal selection theory provides a useful conceptual framework for 
understanding the cellular basis of immunological memory. In an adult animal, 
the peripheral lymphoid organs contain a mixture of lymphocytes in at least 
three stages of maturation: naive cells, effector cells, and memory cells. When 
naive cells encounter their antigen for the first time, the antigen stimulates 
some of them to proliferate and differentiate into effector cells, which then 
carry out an immune response (effector B cells secrete antibody, while effector T 
cells either kill infected cells or influence the response of other cells). Some of 
the antigen- stimulated naive cells multiply and differentiate into memory cells, 
which do not themselves carry out immune responses but are more easily and 
more quickly induced to become effector cells by a later encounter with the 
same antigen. When they encounter their antigen, memory cells (like naive 
cells), give rise to either effector cells or more memory cells (Figure 25-11). 

Thus, the primary response generates immunological memory because of 
clonal expansion, whereby the proliferation of antigen- stimulated naive cells cre- 
ates many memory cells, as well as because these memory cells are able to 
respond more sensitively, rapidly, and effectively to the same antigen than do 
naive cells. And, unlike most effector cells, which die within days or weeks, mem- 
ory cells can persist for the lifetime of the animal, even in the absence of their 
specific antigen, thereby providing lifelong immunological memory. 

As we discuss later, memory B cells produce antibodies of different classes 
and of much higher affinity for antigen than those produced by naive B cells. 
This is the main reason that secondary antibody responses are much more effec- 
tive at eliminating pathogens than are primary antibody responses. 

Although most effector T and B cells die after an immune response is over, 
some survive as effector cells and help provide long-term protection against the 

Figure 25-1 1 A model for the cellular basis of immunological memory. 

When stimulated by their specific antigen, naive cells proliferate and 
differentiate. Most become effector cells, which function and then usually 
die, while others become memory cells. During a subsequent exposure to 
the same antigen, the memory cells respond more readily, rapidly, and 
efficiently than did the naive cells: they proliferate and give rise to effector 
cells and to more memory cells. Memory T cells might also develop from 
effector cells (not shown). 



Figure 25-1 Primary and secondary 
antibody responses. The secondary 
response induced by a second exposure to 
antigen A is faster and greater than the 
primary response and is specific for A, 
indicating that the adaptive immune 
system has specifically remembered its 
previous encounter with antigen A. The 
same type of immunological memory is 
observed inT-cell-mediated responses. 
As we discuss later, the types of antibodies 
produced in the secondary response are 
different from those produced in the 
primary response, and these antibodies 
bind the antigen more tightly. 
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pathogen. A small proportion of the plasma cells produced in a primary B cell 
response, for example, can survive for many months in the bone marrow, where 
they continue to secrete their specific antibodies into the bloodstream. 



Immunological Tolerance Ensures That Self Antigens Are Not 
Normally Attacked 

As discussed in Chapter 24, cells of the innate immune system use pattern 
recognition receptors to distinguish pathogens from the normal molecules of 
the host. The adaptive immune system has a far more difficult recognition task: 
it must be able to respond specifically to an almost unlimited number of foreign 
macromolecules, while avoiding responding to the large number of molecules 
made by the host organism itself. How is this possible? It helps that self 
molecules do not induce the innate immune reactions required to activate the 
adaptive immune system. But even when an infection or tissue injury triggers 
innate reactions, the vast excess of self molecules present normally fail to induce 
an adaptive immune response. Why not? 

One answer is that the adaptive immune system has "learned" not to 
respond to self antigens. Transplantation experiments provide one line of evi- 
dence for this learning process. When tissues are transplanted from one individ- 
ual to another (and the two individuals are not identical twins), the immune sys- 
tem of the recipient usually recognizes the donor cells as foreign and destroys 
them. (For reasons we discuss later, the foreign antigens on the donor cells are 
so powerful that they can stimulate adaptive immune responses in the absence 
of infection, injury, or an adjuvant.) If, however, we introduce cells from one 
strain of mouse into a newborn mouse of another strain, some of these cells sur- 
vive for most of the recipient animal's life, and the recipient will now accept a 
graft from the original donor, even though it rejects "third-party" grafts. Appar- 
ently, nonself antigens can, in particular circumstances, induce the immune sys- 
tem to become specifically unresponsive to them. This antigen- specific unre- 
sponsiveness to foreign antigens is known as acquired immunological tolerance 
(Figure 25-12). 

The unresponsiveness of an animal's adaptive immune system to its own 
macromolecules {natural immunological tolerance, or self- tolerance) is 
acquired in the same way. Normal mice, for example, cannot make an immune 
response against one of their own protein components of the complement sys- 
tem called C5 (discussed in Chapter 24). But, mutant mice that lack the gene 
encoding C5 (but are otherwise genetically identical to the normal mice) can 
make a strong immune response to this blood protein when immunized with it. 
Similarly, humans that lack a normal gene that codes for the clotting protein 
Factor VIII (and therefore bleed excessively) make antibodies against the protein 
when it is administered to them to control bleeding. 




Figure 25-12 Acquired immunological 
tolerance. The skin graft seen here was 
transplanted from an adult brown mouse 
to an adult white mouse. It has survived 
for many weeks only because the white 
mouse, at the time of its birth, received 
an injection of bone marrow cells from 
the brown mouse and therefore became 
immunologically tolerant to them. Some 
of the bone marrow cells (and their 
progeny) from the brown mouse persist 
in the adult white mouse and continue to 
induce tolerance in newly formed 
lymphocytes that would otherwise react 
against the brown skin. (Courtesy of 
Leslie Brent, from I. Roitt, Essential 
Immunology, 6th ed. Oxford, UK: 
Blackwell Scientific, 1988.) 
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The natural immunological tolerance for a particular self molecule persists 
only for as long as the molecule remains present in the body. If a self molecule 
such as C5 is experimentally removed from an adult mouse, the animal gains the 
ability to respond to it after a few weeks or months. Thus, the immune system is 
genetically capable of responding to self molecules but learns not to do so. 

Self- tolerance depends on a number of distinct mechanisms: 
1. In receptor editing, developing lymphocytes that recognize self molecules 
(self-reactive lymphocytes) change their antigen receptors so that they no 
longer recognize self antigens. 

In clonal deletion, self-reactive lymphocytes die by apoptosis when they 
bind their self antigen. 

In clonal inactivation (also called clonal anergy), self- reactive lymphocytes 
become functionally inactivated when they encounter their self antigen. 
In clonal suppression, regulatory T cells suppress the activity of self-reac- 
tive lymphocytes. 

Some of these mechanisms — especially the first two, clonal deletion and recep- 
tor editing — operate in central lymphoid organs when newly formed self-reac- 
tive lymphocytes first encounter their self antigens, and they are largely respon- 
sible for the process of central tolerance. Clonal inactivation and clonal suppres- 
sion, by contrast, operate mainly when lymphocytes encounter their self anti- 
gens in peripheral lymphoid organs, and they are responsible for the process of 
peripheral tolerance. Clonal deletion and clonal inactivation, however, are 
known to operate both centrally and peripherally (Figure 25-13). 

Why does the binding of a self antigen lead to tolerance rather than activa- 
tion? The answer is still not completely known. As we discuss later, to activate a 
lymphocyte in a peripheral lymphoid organ, the cell must do more than bind its 
antigen: it must also receive membrane-bound and secreted co- stimulatory sig- 
nals (the secreted signals are various cytokines). Both types of signals are pro- 
vided by a helper T cell in the case of a B lymphocyte and by an activated den- 
dritic cell in the case of a T lymphocyte. Because the production of these signals 
is usually triggered by exposure to a pathogen, a self-reactive lymphocyte nor- 
mally encounters its self antigen in the absence of such signals. Under these 
conditions, a B cell interacting with its antigen or a T cell interacting with its 
antigen on the surface of a nonactivated dendritic cell will not only fail to be 
activated, it will often be rendered tolerant — being either killed, inactivated, or 
actively suppressed by a regulatory T cell (see Figure 25-13). As we discuss later, 
in peripheral lymphoid organs, both T cell tolerance and activation usually 
occur on the surface of a dendritic cell. 



Figure 25-13 Mechanisms of 
immunological tolerance to self 
antigens. When a self-reactive immature 
lymphocyte binds its self antigen in the 
central lymphoid organ where the cell is 
produced, it may alter its antigen receptor 
so that it is no longer self-reactive (cell 1). 
This process is called receptor editing and 
is thought to occur mainly in developing 
B cells. Alternatively, the cell may die by 
apoptosis, a process called clonal deletion 
(cell 2). Because these two forms of 
tolerance (shown on the left) occur in 
central lymphoid organs, they are called 
central tolerance. 

When a self-reactive naive lymphocyte 
escapes tolerance in the central lymphoid 
organ and binds its self antigen in a 
peripheral lymphoid organ (cell 4), it will 
generally not be activated, because the 
binding usually occurs in the absence of 
appropriate co-stimulatory signals; 
instead, the cell may die by apoptosis 
(often after a period of proliferation), be 
inactivated, or be suppressed by 
regulatory T cells (if the self-reactive 
lymphocyte is an effector Tcell). These 
forms of tolerance shown on the right are 
called peripheral tolerance. 
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The tolerance mechanisms sometimes break down, causing T or B cells (or 
both) to react against the organism's own tissue antigens. Myasthenia gravis is an 
example of such an autoimmune disease. Affected individuals make antibodies 
against the acetylcholine receptors on their own skeletal muscle cells. These 
antibodies interfere with the normal functioning of the receptors so that the 
patients become weak and may die because they cannot breathe. Similarly, in 
childhood (type 1) diabetes, immune reactions against insulin- secreting cells in 
the pancreas kill these cells, leading to severe insulin deficiency. 

For the most part, the mechanisms responsible for the breakdown of toler- 
ance to self antigens in autoimmune diseases are unknown. It is thought, how- 
ever, that activation of the innate immune system by infection or tissue injury 
may help trigger anti-self responses in individuals with defects in their self- tol- 
erance mechanisms, leading to autoimmunity. 



Lymphocytes Continuously Circulate Through Peripheral 
Lymphoid Organs 

Pathogens generally enter the body through an epithelial surface, usually 
through the skin, gut, or respiratory tract. To induce an adaptive immune 
response, microbial antigens must travel from these entry points to a peripheral 
lymphoid organ, such as a lymph node or the spleen, the sites where lympho- 
cytes are activated (see Figure 25-5). The route and destination depend on the 
site of entry. Lymphatic vessels (see Figure 25-3) carry antigens that enter 
through the skin or respiratory tract to local lymph nodes; antigens that enter 
through the gut end up in gut- associated peripheral lymphoid organs such as 
Peyer's patches; and the spleen filters out antigens that enter the blood. As dis- 
cussed earlier, in many cases, dendritic cells will carry the antigen from the site 
of infection to the peripheral lymphoid organ, where they play a crucial part in 
activating T cells (see Figure 25-5). 

But only a tiny fraction of the total lymphocyte population can recognize a 
particular microbial antigen in a peripheral lymph organ (estimated to be 
between 1/10,000 and 1/100,000 of each class of lymphocyte). How do these rare 
cells find an antigen-presenting cell displaying their antigen? The answer is that 
the lymphocytes continuously circulate between one peripheral lymphoid 
organ and another via the lymph and blood. In a lymph node, for example, lym- 
phocytes continually leave the bloodstream by squeezing out between special- 
ized endothelial cells lining small veins called postcapillary venules. After perco- 
lating through the node, they accumulate in small lymphatic vessels that leave 
the node and connect with other lymphatic vessels that pass through other 
lymph nodes downstream (see Figure 25-3). Passing into larger and larger ves- 
sels, the lymphocytes eventually enter the main lymphatic vessel (the thoracic 
duct), which carries them back into the blood (Figure 25-14). 

The continuous recirculation between the blood and lymph ends only if a 
lymphocyte is activated by its specific antigen in a peripheral lymphoid organ. 
Now the lymphocyte remains in the peripheral lymphoid organ, where it prolif- 
erates and differentiates into either effector cells or memory cells. Many of the 
effector T cells leave the lymphoid organ via the lymph and migrate through the 



Figure 25-1 4 The path followed by lymphocytes as they continuously 
circulate between the lymph and blood. The circulation through a lymph 
node (yellow) is shown here. Microbial antigens are usually carried into the 
lymph node by dendritic cells (not shown), which enter the node via 
afferent lymphatic vessels draining an infected tissue (green). T and B cells, 
by contrast, enter the lymph node via an artery and migrate out of the 
bloodstream through postcapillary venules. Unless they encounter their 
antigen, the T and B cells leave the lymph node via efferent lymphatic 
vessels, which eventually join the thoracic duct. The thoracic duct empties 
into a large vein carrying blood to the heart to complete the circulation 
process for T and B cells. A typical circulation cycle for these lymphocytes 
takes about 12-24 hours. 
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blood to the site of infection (see Figure 25-5), whereas others stay in the lym- 
phoid organ and help activate B cells or other T cells there. Some effector B cells 
(plasma cells) remain in the peripheral lymphoid organ and secrete antibodies 
into the blood for days until they die; others migrate to the bone marrow, where 
they secrete antibodies into the blood for months or years. The memory T and B 
cells produced join the recirculating pool of lymphocytes. 

Lymphocyte recirculation depends on specific interactions between the 
lymphocyte cell surface and the surface of the endothelial cells lining the blood 
vessels in the peripheral lymphoid organs. Many cell types in the blood come 
into contact with the specialized endothelial cells lining the postcapillary 
venules in lymph nodes, but only lymphocytes adhere and then migrate out of 
the bloodstream into the nodes. The lymphocytes initially adhere to the 
endothelial cells via homing receptors that bind to specific ligands (often called 
counterreceptors) on the endothelial cell surface. Lymphocyte migration into 
lymph nodes depends on a homing receptor protein called L-selectin, a member 
of the selectin family of cell- surface lectins. This protein binds to specific sugar 
groups on a counterreceptor that is expressed exclusively on the surface of the 
specialized endothelial cells lining the postcapillary venules in lymph nodes, 
causing the lymphocytes to adhere weakly to the endothelial cells and to roll 
slowly along their surface. The rolling continues until another, much stronger 
adhesion system is called into play by the chemoattractant proteins (called 
chemokines; see below) secreted by endothelial cells. This strong adhesion is 
mediated by members of the integrin family of cell adhesion molecules, which 
become activated on the lymphocyte surface. Now the lymphocytes stop rolling 
and crawl out of the blood vessel into the lymph node (Figure 25-15). Both 
selectins and integrins are discussed in Chapter 19. 

Chemokines are a type of cytokine. They are small, secreted, positively 
charged proteins that have a crucial role in guiding the migrations of various types 
of cells, including white blood cells. <ACCG> They are all structurally related and 
bind to the surface of endothelial cells, as well as to negatively charged proteogly- 
cans of the extracellular matrix in organs. By binding to G-protein-coupled 
chemokine receptors (discussed in Chapter 15) on the surface of specific blood 
cells, chemokines attract these cells from the bloodstream into an organ, guide 
them to specific locations within the organ, and then help stop migration. (Unfor- 
tunately, the AIDS virus, HIV also binds to certain chemokine receptors, as well as 
to the CD4 co-receptor that we discuss later, and thus allows the virus to infect 
white blood cells.) The T and B cells initially enter the same region of a lymph 
node, but then different chemokines guide them to separate regions of the node — 
T cells to the paracortex and B cells to lymphoid follicles (Figure 25-16). 

Unless they encounter their antigen, both T and B cells soon leave the lymph 
node via efferent lymphatic vessels. If they encounter their antigen, however, they 
are stimulated to display adhesion receptors that trap the cells in the node; the 
cells accumulate at the junction between the T cell and B cell areas, where the rare 
specific T and B cells can interact, leading to their proliferation and differentiation 



basal - 
lamina 



e 


















WEAK ADHESION STRONG ADHESION 
AND ROLLING (mtegrm-dependent) 
anu kullnmu AND EM | GRAT | 0N 

(selectm-dependent) (CD 31 -dependent) 



lymphocyte 



chemokine 



specialized 
endothelial cell 
of postcapillary 
venule 




postcapillary 
venule 



chemokine 



Figure 25-1 5 Migration of a lymphocyte 
out of the bloodstream into a lymph 
node. <CCCC> A circulating lymphocyte 
adheres weakly to the surface of the 
specialized endothelial cells lining a 
postcapillary venule in a lymph node. 
This initial adhesion is mediated by 
L-selectin on the lymphocyte surface. The 
adhesion is sufficiently weak to enable 
the lymphocyte to roll along the surface 
of the endothelial cells, pushed along by 
the flow of blood. Stimulated by 
chemokines secreted by the endothelial 
cells (curved red arrow), the lymphocyte 
rapidly activates a stronger adhesion 
system, mediated by an integrin. This 
strong adhesion enables the cell to stop 
rolling. The lymphocyte then uses an Ig- 
like cell adhesion protein (CD31) to bind 
to the junctions between adjacent 
endothelial cells and migrate out of the 
venule. CD31 is located both on the 
surface of the lymphocyte and at the 
junctions between the endothelial cells. 
The subsequent migration of the 
lymphocytes in the lymph node depends 
on chemokines produced within the 
node (straight red arrow). 

The migration of other white blood cells 
out of the bloodstream into sites of 
infection occurs in a similar way. 
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into either effector cells or memory cells. Many of the effector cells leave the node, 
expressing different chemokine receptors that help guide them to their new desti- 
nations — T cells to sites of infection and B cells to the bone marrow. 



Summary 

Innate immune responses are triggered at sites of infection by pathogen-associated 
molecular patterns (PAMPs), which are recognized by pattern recognition receptors 
made by cells of the innate immune system. In addition to fighting infection directly, 
these innate immune responses help activate adaptive immune responses in periph- 
eral lymphoid organs. Unlike innate immune responses, adaptive responses display 
immunological memory and thereby provide specific and long-lasting protection 
against the particular pathogen that induced them. 

The adaptive immune system is composed of many millions of lymphocyte clones, 
with the cells in each clone sharing a unique cell-surface receptor that enables them to 
bind a particular antigen. The binding of antigen to these receptors, however, is usu- 
ally not sufficient to stimulate a lymphocyte to proliferate and differentiate into an 
effector cell that can help eliminate the pathogen. Membrane-bound co-stimulatory 
signals and a variety of secreted signals (cytokines) provided by another specialized cell 
in a peripheral lymphoid organ are also required. Helper T cells provide such signals 
for B cells, while dendritic cells usually provide them for T cells. Effector B cells secrete 
antibodies, which can act over long distances to help eliminate extracellular 
pathogens and their toxins. Effector T cells, by contrast, act locally to either kill infected 
host cells or help other cells to eliminate the pathogen. As part of the adaptive immune 
response, some lymphocytes proliferate and differentiate into memory cells, which are 
able to respond faster and more efficiently the next time the same pathogen invades. 
Both B and T cells circulate continuously between one peripheral lymphoid organ and 
another via the blood and lymph. Only if they encounter their specific foreign antigen 
in a peripheral lymphoid organ do they stop migrating, proliferate, and differentiate 
into effector cells or memory cells. Lymphocytes that would react against self molecules 
either alter their receptors or are eliminated, inactivated, or suppressed by regulatory 
T cells, so that the adaptive immune system normally avoids attacking the molecules 
and cells of the host. 



Figure 25-1 6 A simplified drawing of a 
human lymph node. B cells are primarily 
clustered in structures called lymphoid 
follicles, whereas T cells are found mainly 
in the paracortex. Chemokines attract 
both types of lymphocytes into the 
lymph node from the blood via 
postcapillary venules (see Figure 25-1 5). 
T and B cells then migrate to their 
respective areas, attracted by different 
chemokines. If they do not encounter 
their specific antigen, both T cells and 
B cells then enter the medullary sinuses 
and leave the node via the efferent 
lymphatic vessel. This vessel ultimately 
empties into the bloodstream, allowing 
the lymphocytes to begin another cycle 
of circulation through a peripheral 
lymphoid organ (see Figure 25-14). 

If they encounter their specific antigen, 
T and B cells are retained in the node and 
are activated to become effector cells or 
memory cells; T cells and B cells 
responding to the same pathogen can 
interact in and around lymphoid follicles. 



B CELLS AND ANTIBODIES 

Vertebrates inevitably die of infection if they are unable to make antibodies. 
Antibodies defend us against infection by binding to viruses and microbial tox- 
ins, thereby inactivating them (see Figure 25-2) . When antibodies bind to invad- 
ing pathogens, they also recruit some of the components of the innate immune 
system, including various types of white blood cells and components of the 
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complement system (discussed in Chapter 24). The white blood cells and a 
ctivated complement components work together to attack the invaders. 

Synthesized exclusively by B cells, antibodies are produced in billions of 
forms, each with a different amino acid sequence. Collectively called 
immunoglobulins (abbreviated as Ig), they are among the most abundant pro- 
tein components in the blood, constituting about 20% of the total protein in 
plasma by weight. Mammals make five classes of antibodies, each of which 
mediates a characteristic biological response following antigen binding. In this 
section, we discuss the structure and function of antibodies and how they inter- 
act with antigen. 

B Cells Make Antibodies as Both Cell-Surface Antigen Receptors 
and Secreted Proteins 

All antibody molecules made by an individual B cell have the same antigen- 
binding site. The first antibodies made by a newly formed B cell are not secreted 
but are instead inserted into the plasma membrane, where they serve as recep- 
tors for antigen. Each B cell has approximately 10 5 such receptors in its plasma 
membrane. As we discuss later, each of these receptors is stably associated with 
a complex of transmembrane proteins that activate intracellular signaling path- 
ways when antigen on the outside of the cell binds to the receptor. 

Each B cell clone produces a single species of antibody, with a unique anti- 
gen-binding site. When an antigen (with the aid of a helper T cell) activates a 
naive or a memory B cell, that B cell proliferates and differentiates into an anti- 
body-secreting effector cell. Such effector cells make and secrete large amounts 
of soluble (rather than membrane-bound) antibody, which has the same unique 
antigen-binding site as the cell-surface antibody that served earlier as the anti- 
gen receptor (Figure 25-17). Effector B cells can begin secreting antibody while 
they are still small lymphocytes, but the end stage of their maturation pathway 
is a large plasma cell (see Figure 25-7B), which continuously secretes antibodies 
at the astonishing rate of about 5000 molecules per second. Although most 
plasma cells die after several days, some survive in the bone marrow for months 
or years and continue to secrete antibodies into the blood, helping to provide 
long-term protection against the pathogen that stimulated their production. 
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Figure 25-1 7 The membrane-bound 
and secreted antibodies made by a B 
cell clone. When an antigen (aided by a 
helperT cell — not shown) binds to and 
thereby activates either a naive or a 
memory B cell, the cell proliferates and 
differentiates into effector cells. The 
effector cells produce and secrete 
antibodies with a unique antigen-binding 
site, which is the same as that of their 
original membrane-bound antibody that 
served as their antigen receptors. 



A Typical Antibody Has Two Identical Antigen-Binding Sites 

The simplest antibodies are Y-shaped molecules with two identical antigen- 
binding sites, one at the tip of each arm of the Y (Figure 25-18). Because of their 
two antigen-binding sites, they are described as bivalent. As long as an antigen 
has three or more antigenic determinants, bivalent antibody molecules can 
cross-link it into a large lattice (Figure 25-19) that macrophages can readily 
phagocytose and degrade. The efficiency of antigen binding and cross-linking is 
greatly increased by the flexible hinge region in most antibodies, which allows 
the distance between the two antigen-binding sites to vary (Figure 25-20). 

The protective effect of antibodies is not due simply to their ability to bind 
and cross-link antigen. The tail of the Y-shaped molecule mediates many other 
activities of antibodies. As we discuss later, antibodies with the same antigen- 
binding sites can have any one of several different tail regions. Each type of tail 
region gives the antibody different functional properties, such as the ability to 
activate the complement system, to bind to phagocytic cells, or to cross the pla- 
centa from mother to fetus. 



An Antibody Molecule Is Composed of Heavy and Light Chains 

The basic structural unit of an antibody molecule consists of four polypeptide 
chains, two identical light (L) chains (each containing about 220 amino acids) 
and two identical heavy (H) chains (each usually containing about 440 amino 
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Figure 25-18 A simple representation of 
an antibody molecule. Note that its two 
antigen-binding sites are identical. 
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Figure 25-19 Antibody-antigen interactions. Because antibodies have two identical 
antigen-binding sites, they can cross-link antigens. The types of antibody-antigen 
complexes that form depend on the number of antigenic determinants on the antigen. 
(A-C) A single species of antibody (a monoclonal antibody) is shown binding to antigens 
containing one, two, or three copies of a single type of antigenic determinant. Antigens with 
two identical antigenic determinants can form small cyclic complexes or linear chains with 
the antibodies, while antigens with three or more identical antigenic determinants can form 
large three-dimensional lattices that readily precipitate out of solution. (D) Most antigens 
have many different antigenic determinants (see Figure 25-29A), and different antibodies 
that recognize different determinants can cooperate in cross-linking the antigen into large 
three-dimensional lattices. 



acids). A combination of noncovalent and covalent (disulfide) bonds holds the 
four chains together. The molecule is composed of two identical halves, each 
with the same antigen-binding site. Both light and heavy chains usually cooper- 
ate to form the antigen-binding surface (Figure 25-21). 



There Are Five Classes of Antibody Heavy Chains, Each with 
Different Biological Properties 

In mammals, there are five classes of antibodies, IgA, IgD, IgE, IgG, and IgM, each 
with its own class of heavy chain — a, 8, e, y, and jll, respectively. IgA molecules 
have a chains, IgG molecules have y chains, and so on. In addition, there are a 
number of subclasses of IgG and IgA immunoglobulins; for example, there are 
four human IgG subclasses (IgGl, IgG2, IgG3, and IgG4), having yi, y 2 , 73, and y 4 
heavy chains, respectively. The various heavy chains give a distinctive confor- 
mation to the hinge and tail regions of antibodies, so that each class (and sub- 
class) has characteristic properties of its own. 

IgM, which has jll heavy chains, is always the first class of antibody that a 
developing B cell makes, although many B cells eventually switch to making 
other classes of antibody when an antigen stimulates them (discussed below). 
The first cells in the B cell lineage that make Ig are pro-B cells, which make only 
|Li chains. They give rise to pre-B cells, in which the jll chains associate with so- 
called surrogate light chains (substituting for genuine light chains) and insert 
into the plasma membrane. Signaling from this pre-B cell receptor is required 
for the cell to progress to the next stage of development, where it makes bona 
fide light chains. The light chains combine with the jli chains, replacing the sur- 
rogate light chains, to form four- chain IgM molecules (each with two jli chains 
and two light chains). These molecules then insert into the plasma membrane, 
where they function as receptors for antigen. At this point, the cell is called an 
immature naive B cell. After leaving the bone marrow, the cell starts to produce 
cell-surface IgD molecules as well, with the same antigen-binding site as the 
IgM molecules. It is now called a mature naive B cell. It is this cell that can 
respond to foreign antigen in peripheral lymphoid organs (Figure 25-22). 

IgM is not only the first class of antibody to appear on the surface of a devel- 
oping B cell. It is also the major class secreted into the blood in the early stages 
of a primary antibody response, on first exposure to an antigen. (Unlike IgM, IgD 
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Figure 25-20 The hinge region of an 
antibody molecule. Because of its 
flexibility, the hinge region improves the 
efficiency of antigen binding and cross- 
linking. 
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Figure 25-21 A schematic drawing of a 
bivalent antibody molecule. It is 

composed of four polypeptide chains — 
two identical heavy chains and two 
identical light chains. The two antigen- 
binding sites are identical, each formed 
by the N-terminal region of a light chain 
and the N-terminal region of a heavy 
chain. The two heavy chains also form 
both the tail and hinge region of the 
antibody. 
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molecules are secreted in only small amounts and seem to function mainly as 
cell-surface receptors for antigen.) In its secreted form, IgM is a pentamer com- 
posed of five four-chain units, giving it a total of 10 antigen-binding sites. Each 
pentamer contains one copy of another polypeptide chain, called a / (joining) 
chain. The J chain is produced by IgM -secreting cells and is covalently inserted 
between two adjacent tail regions (Figure 25-23). 

When an antigen with multiple identical antigenic determinants (see Figure 
25-19) binds to a single secreted pentameric IgM molecule, it alters the structure 
of the pentamer, allowing it to activate the complement system. As discussed in 
Chapter 24, when the antigen is on the surface of an invading pathogen, this 
activation of complement can either mark the pathogen for phagocytosis or kill 
it directly. As we discuss later, complement activation can also greatly increase 
the immune response to an antigen: the binding of an activated complement 
component to an antibody-antigen complex, for example, can increase the abil- 
ity of the antigen to stimulate a B cell response more than a thousand fold (see 
Figure 25-71 A). 

The major class of immunoglobulin in the blood is IgG, which is a four- 
chain monomer (see Figure 25-21) produced in large quantities during sec- 
ondary antibody responses. Besides activating complement, the tail region of an 
IgG molecule binds to specific receptors on macrophages and neutrophils. 
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Figure 25-22 The main stages in B cell development. All of the stages shown occur independently of 
antigen. The pro-B cell makes ji chains, but they remain in the endoplasmic reticulum until surrogate light 
chains are made. Although not shown, all of the cell-surface Ig molecules are associated with 
transmembrane proteins that help convey signals to the cell interior (see Figure 25-70). When they are 
activated by their specific foreign antigen and helper T cells in peripheral lymphoid organs, mature naive 
B cells proliferate and differentiate into either antibody-secreting cells or memory cells (not shown). 
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Figure 25-23 A pentameric IgM 
molecule. Disulfide bonds (red) hold the 
five four-chain units together. A single 
J chain, which has a structure similar to 
that of a single Ig domain (discussed later), 
is covalently attached by disulfide bonds 
to the tails of two ji heavy chains. The 
J chain is required for pentamer formation. 
The addition of each successive four-chain 
IgM unit requires a J chain, which is then 
discarded, except for the last one, which is 
retained. Note that IgM molecules do not 
have hinge regions. 



Ifide bond 



Largely by means of such Fc receptors (so-named because antibody tails are 
called Fc regions), these phagocytic cells bind, ingest, and destroy infecting 
microorganisms that have become coated with the IgG antibodies produced in 
response to the infection (Figure 25-24). 

Some IgG subclasses are the only antibodies that can pass from mother to 
fetus via the placenta. Cells of the placenta that are in contact with maternal 
blood have Fc receptors that bind these blood-borne IgG molecules and direct 
their passage to the fetus. The antibody molecules bound to the receptors are 
first taken into the placental cells by receptor-mediated endocytosis. They are 
then transported across the cell in vesicles and released by exocytosis into the 
fetal blood (a process called transcytosis — see Figure 25-26). Because other 
classes of antibodies do not bind to these particular Fc receptors, they cannot 
pass across the placenta. Later, IgG is secreted into the mother's milk and is then 
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Figure 25-24 Antibody-activated 
phagocytosis. (A) An 
IgG-antibody-coated bacterium is 
efficiently phagocytosed by a 
macrophage or neutrophil, which 
has cell-surface receptors that bind 
the tail (Fc) region of IgG 
molecules. The binding of the 
pseudopod antibody-coated bacterium to 
these Fc receptors activates the 
phagocytic process. The tail of an 
antibody molecule is called an 
Fc region because, when 
antibodies are cleaved with the 
proteolytic enzyme papain, the 
fragments containing the tail 
region readily crystallize. 
(B) Electron micrograph of a 
neutrophil phagocytosing an IgG- 
coated bacterium, which is in the 
process of dividing. The process in 
which antibody (or complement) 
coating of a pathogen increases 
the efficiency with which the 
pathogen is phagocytosed is 
called opsonization. (B, courtesy of 
Dorothy F. Bainton, from 
R.C.Williams, Jr. and 
H.H. Fudenberg, Phagocytic 
Mechanisms in Health and Disease. 
New York: Intercontinental Medical 
Book Corporation, 1971.) 
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Figure 25-25 A highly schematized diagram of a dimeric IgA molecule 
found in secretions. In addition to the two IgA monomers, there is a single 
J chain and an additional polypeptide chain called the secretory 
component, which is derived from the Fc receptor (see Figure 25-26) and 
is thought to protect the IgA molecules from digestion by proteolytic 
enzymes in secretions. 

taken up from the gut of the neonate into the blood by transcytosis, providing 
protection for the baby against infection. 

IgA is the principal class of antibody in secretions, including saliva, tears, 
milk, and respiratory and intestinal secretions. IgA is a four- chain monomer in 
the blood which is assembled into a dimer by the addition of two other polypep- 
tide chains before it is released into secretions (Figure 25-25). It is transported 
through secretory epithelial cells from the extracellular fluid into the secreted 
fluid by transcytosis mediated by another type of Fc receptor that is unique to 
secretory epithelia (Figure 25-26). This Fc receptor can also transport IgM into 
secretions (but less efficiently), which is probably why individuals with a selec- 
tive IgA deficiency, the most common form of antibody deficiency, are only 
mildly affected by the defect. 

The tail region of IgE molecules, which are four- chain monomers, binds with 
unusually high affinity (K& ~ 10 10 liters/mole) to yet another class of Fc receptors. 
These receptors are located on the surface of mast cells in tissues and of basophils 
in the blood. The IgE molecules bound to them function as passively acquired 
receptors for antigen. Antigen binding triggers the mast cell or basophil to secrete 
a variety of cytokines and biologically active amines, especially histamine (Figure 
25-27). The histamine causes blood vessels to dilate and become leaky, which in 
turn helps white blood cells, antibodies, and complement components to enter 
sites where mast cells have been activated. The release of amines from mast cells 
and basophils is largely responsible for the symptoms of such allergic reactions 
as hay fever, asthma, and hives. In addition, mast cells secrete factors that attract 
and activate white blood cells called eosinophils. Eosinophils also have Fc recep- 
tors that bind IgE molecules, and they can kill extracellular parasitic worms, 
especially if the worms are coated with IgE antibodies. 

In addition to the five classes of heavy chains found in antibody molecules, 
higher vertebrates have two types of light chains, k and X, which seem to be 
functionally indistinguishable. Either type of light chain may be associated with 
any of the heavy chains. An individual antibody molecule, however, always con- 
tains identical light chains and identical heavy chains: an IgG molecule, for 
instance, may have either k or X light chains, but not one of each. As a result, an 
antibody's antigen-binding sites are always identical. Such symmetry is crucial 
for the cross-linking function of secreted antibodies (see Figure 25-19). 
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Figure 25-26 The mechanism of transport 
of a dimeric IgA molecule across an 
epithelial cell. The IgA molecule, as a 
J-chain-containing dimer, binds to a 
transmembrane receptor protein on the 
nonluminal surface of a secretory epithelial 
cell. (The J chain has been omitted in this 
diagram for clarity.) The receptor-lgA 
complexes are ingested by receptor- 
mediated endocytosis, transferred across 
the epithelial cell cytoplasm in vesicles, 
and secreted into the lumen on the 
opposite side of the cell by exocytosis. 
When exposed to the lumen, the part of 
the Fc receptor protein that is bound to 
the IgA dimer (the secretory component) is 
cleaved from its transmembrane tail, 
thereby releasing the antibody in the form 
shown in Figure 25-25. 
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Figure 25-27 The role of IgE in histamine secretion by mast cells. A mast cell (or a basophil) binds IgE molecules after 
they are secreted by effector B cells. The soluble IgE antibodies bind to Fc receptor proteins on the mast cell surface that 
specifically recognize the Fc region of these antibodies. The bound IgE molecules serve as cell-surface receptors for 
antigen. Thus, unlike B cells, each mast cell (and basophil) has a set of cell-surface antibodies with a wide variety of 
antigen-binding sites. When an antigen molecule binds to these membrane-bound IgE antibodies so as to cross-link them 
to their neighbors, it signals the mast cell to release its histamine and other local mediators by exocytosis. 

All classes of antibody can be made in a membrane-bound form, as well as 
in a soluble, secreted form. The two forms differ only in the C-terminus of their 
heavy chain. The heavy chains of membrane-bound antibody molecules have a 
transmembrane hydrophobic C-terminus, which anchors them in the lipid 
bilayer of the B cell's plasma membrane. The heavy chains of secreted antibody 
molecules, by contrast, have instead a hydrophilic C-terminus, which allows 
them to escape from the cell. The switch in the character of the antibody 
molecules made occurs because the activation of B cells by antigen (and helper 
T cells) induces a change in the way in which the H-chain RNA transcripts are 
made and processed in the nucleus (see Figure 7-99). 

The properties of the various classes of antibodies in humans are summa- 
rized in Table 25-1. 



The Strength of an Antibody-Antigen Interaction Depends on 
Both the Number and the Affinity of the Antigen-Binding Sites 

The binding of an antigen to an antibody, like the binding of a substrate to an 
enzyme, is reversible. The sum of many relatively weak non-covalent forces, 
including hydrogen bonds, hydrophobic and van der Waals forces, and ionic 
interactions determine the strength of the interaction. These weak forces are 
effective only when the antigen molecule is close enough to allow some of its 
atoms to fit into complementary recesses on the surface of the antibody. The 
complementary regions of a four- chain antibody unit are its two identical anti- 
gen-binding sites; the corresponding region on the antigen is an antigenic deter- 
minant (Figure 25-28). Most antigenic macromolecules have many different 
antigenic determinants and are said to be multivalent; if two or more of the 
determinants are identical (as in a polymer with a repeating structure), the anti- 
gen is said to be polyvalent (Figure 25-29). 



Table 25-1 Properties of the Major Classes of Antibodies in Humans 
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Figure 25-28 Antigen binding to antibody. In this highly schematized 
diagram, an antigenic determinant on a macromolecule is shown 
interacting with one of the antigen-binding sites of two different antibody 
molecules, one of high affinity and one of low affinity. Various weak 
noncovalent forces hold the antigenic determinant in the binding site, and 
the site with the better fit to the antigen has a greater affinity. Note that 
both the light and heavy chains of the antibody molecule usually 
contribute to the antigen-binding site. 

The reversible binding reaction between an antigen with a single antigenic 
determinant (denoted Ag) and a single antigen-binding site (denoted Ab) can be 
expressed as 

Ag + Ab <-> AgAb 

The equilibrium point depends both on the concentrations of Ab and Ag and on 
the strength of their interaction. Clearly, a larger fraction of Ab will become asso- 
ciated with Ag as the concentration of Ag increases. The strength of the interac- 
tion is generally expressed as the affinity constant (K^ (see Figure 3-43), where 

£a=[AgAb]/[Ag][Ab] 

(the square brackets indicate the concentration of each component at equilib- 
rium) . 

One can determine the affinity constant, also known as the association con- 
stant, by measuring the concentration of free Ag required to fill half of the anti- 
gen-binding sites on the antibody. When half the sites are filled, [AgAb] = [Ab] 
and iC a = l/[Ag]. Thus, the reciprocal of the antigen concentration that produces 
half the maximum binding is equal to the affinity constant of the antibody for 
the antigen. Common values range from as low as 5 x 10 4 to as high as 10 11 
liters/mole. 

The affinity of an antibody for an antigenic determinant describes the 
strength of binding of a single copy of the antigenic determinant to a single anti- 
gen-binding site, and it is independent of the number of antigen-binding sites. 
When, however, a polyvalent antigen, carrying multiple copies of the same anti- 
genic determinant, combines with a polyvalent IgM antibody (see Figure 25-23), 
the binding strength is greatly increased because all of the antigen-antibody 
bonds must be broken simultaneously before the antigen and antibody can dis- 
sociate. Even a bivalent IgG molecule can bind at least 100 times more strongly 
to a polyvalent antigen if both antigen-binding sites are engaged than if only one 
site is engaged. The total binding strength of a bivalent or polyvalent antibody 
with a polyvalent antigen is referred to as the avidity of the interaction. 

If the affinity of the antigen-binding sites in an IgG and an IgM molecule is 
the same, the IgM molecule (with 10 binding sites) will have a much greater 
avidity for a polyvalent antigen than an IgG molecule (which has two binding 
sites). This difference in avidity, often 10 4 -fold or more, is important because 
antibodies produced early in an immune response usually have much lower 
affinities than those produced later. Because of its high total avidity, IgM — the 
major Ig class produced early in primary immune responses — can function 
effectively even when each of its binding sites has only a low affinity. 

So far, we have considered the general structure and function of antibodies. 
Next, we look at the details of their structure, as revealed by studies of their 
amino acid sequence and three-dimensional structure. 



Antibody Light and Heavy Chains Consist of Constant and 
Variable Regions 

Comparison of the amino acid sequences of different antibody molecules 
reveals a striking feature with important genetic implications. Both light and 
heavy chains have a variable sequence at their N-terminal ends but a constant 
sequence at their C-terminal ends. Consequently, when we compare the amino 
acid sequences of many different k chains, the C-terminal halves are the same or 
show only minor differences, whereas the N-terminal halves all differ. Light 
chains have a constant region about 110 amino acids long and a variable region 
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Figure 25-29 Molecules with multiple 
antigenic determinants. (A) A globular 
protein is shown with a number of 
different antigenic determinants. Different 
regions of a polypeptide chain usually 
come together in the folded structure to 
form each antigenic determinant on the 
surface of the protein, as shown for three 
of the four determinants. (B) A polymeric 
structure is shown with many identical 
antigenic determinants. 
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Figure 25-30 Constant and variable 
regions of immunoglobulin chains. The 

variable regions of the light and heavy 
chains form the antigen-binding sites, 
while the constant regions of the heavy 
chains determine the other biological 
properties of an antibody. 



of the same size. The variable region of the heavy chains is also about 110 amino 
acids long, but the constant region is about three or four times longer (330 or 440 
amino acids), depending on the class (Figure 25-30). 

It is the N-terminal ends of the light and heavy chains that come together to 
form the antigen-binding site, and the variability of their amino acid sequences 
provides the structural basis for the diversity of antigen-binding sites. The great- 
est diversity occurs in three small hypervariable regions in the variable regions 
of both light and heavy chains; the remaining parts of the variable region, known 
as framework regions, are relatively constant. 

Only about 5-10 amino acids in each hypervariable region form the actual 
antigen-binding site (Figure 25-31). As a result, the size of the antigenic deter- 
minant that an antibody recognizes is generally comparably small. It can consist 
of fewer than 10 amino acids on the surface of a globular protein, for example. 



The Light and Heavy Chains Are Composed of Repeating 
Ig Domains 

Both light and heavy chains are made up of repeating segments — each about 
110 amino acids long and each containing one intrachain disulfide bond. Each 
repeating segment folds independently to form a compact functional unit called 
an immunoglobulin (Ig) domain. As shown in Figure 25-32, a light chain con- 
sists of one variable (Vl) and one constant (Cl) domain (equivalent to the vari- 
able and constant regions shown in the top half of Figure 25-30). Vl pairs with 
the variable (Vh) domain of the heavy chain to form the antigen-binding region. 
Cl pairs with the first constant domain of the heavy chain (CrI), and the 
remaining constant domains of the heavy chains form the Fc region, which 
determines the other biological properties of the antibody. Most heavy chains 
have three constant domains (CrI, Cr2, and Cr3), but those of IgM and IgE anti- 
bodies have four. 

The similarity in their domains suggests that antibody chains arose during 
evolution by a series of gene duplications, beginning with a primordial gene 
coding for a single 110 amino acid domain of unknown function. Each domain 
of the constant region of a heavy chain is encoded by a separate coding 
sequence (exon), which supports this hypothesis (Figure 25-33). 
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An Antigen-Binding Site Is Constructed from Hypervariable Loops 

Many antibody fragments, as well as some intact antibody molecules, have been 
studied by x-ray crystallography From these examples, we can understand the 
way in which billions of different antigen-binding sites are constructed on a 
common structural theme. 

As illustrated in Figure 25-34, each Ig domain has a very similar three- 
dimensional structure consisting of a sandwich of two (3 sheets held together by 
a disulfide bond. As we discuss later, many other proteins on the surface of lym- 
phocytes and other cells, many of which function as cell-cell adhesion 
molecules (discussed in Chapter 19), contain similar domains and hence are 
members of a very large immunoglobulin (Ig) superfamily of proteins. 

The variable domains of antibody molecules are unique in that each has its 
particular set of three hypervariable regions, which are arranged in three hyper- 
variable loops (see Figure 25-34). The hypervariable loops of both the light and 
heavy variable domains cluster together to form the antigen-binding site. 
Because the variable domain of an antibody molecule consists of a highly con- 
served rigid framework, with hypervariable loops attached at one end, changes 
in only the lengths and amino acid sequences of the hypervariable loops can 
generate an enormous diversity of antigen-binding sites. The overall three- 
dimensional structure necessary for antibody function remains constant. 

X-ray analyses of crystals of antibody fragments bound to an antigenic 
determinant reveal exactly how the hypervariable loops of the light and heavy 
variable domains cooperate to form an antigen-binding surface in particular 



Figure 25-32 Immunoglobulin domains. 

The light and heavy chains in an antibody 
molecule are each folded into similar 
repeating domains. The variable domains 
(shaded in blue) of the light and heavy 
chains (V L and V H ) make up the antigen- 
binding sites, while the constant domains 
of the heavy chains (mainly Ch2 and Ch3) 
determine the other biological properties 
of the molecule. The heavy chains of IgM 
and IgE antibodies do not have a hinge 
region and have an extra constant 
domain (Ch4). Hydrophobic interactions 
between domains on adjacent chains 
help hold the chains together in the 
antibody molecule: V|_ binds to Vh, C|_ 
binds to Ch1 , and so on (see Figure 
25-34). All antibodies are glycosylated on 
their Ch2 domains (not shown); the 
attached oligosaccharide chains vary 
from antibody to antibody and influence 
the biological properties of the antibody. 



coding sequences 




intron 



intron intron 

TRANSCRIPTION 



RNA SPLICING 



DNA 



I RNA 



H 2 N 



C H 1 


hinge 


C H 2 C h 3 
TRANSLATION 


C h 1 


hinge 


' C H 2 C h 3 



mRNA 



COOH 



constant region of heavy chain 



Figure 25-33 The organization of the 
DNA sequences that encode the 
constant region of an antibody heavy 
chain, such as that found in IgG. The 

coding sequences (exons) for each 
domain and for the hinge region are 
separated by noncoding sequences 
(introns).The intron sequences are 
removed by splicing the primary RNA 
transcripts to form mRNA. The presence 
of introns in the DNA is thought to have 
facilitated accidental duplications of DNA 
segments that gave rise to the antibody 
genes during evolution (discussed in 
Chapter 4). The DNA and RNA sequences 
that encode the variable region of the 
heavy chain are not shown. 



B CELLS AND ANTIBODIES 



1561 



hypervariable 
loop 




Figure 25-34 The folded structure of an 
IgG antibody molecule, based on x-ray 
crystallography studies. <GCCG> The 
structure of the whole protein is shown 
in the middle, while the structure of a 
constant domain is shown on the left and 
that of a variable domain on the right. 
Both domains consist of two p sheets, 
which are joined by a disulfide bond (not 
shown). Note that all the hypervariable 
regions (red) form loops at the far end of 
the variable domain, where they come 
together to form part of the antigen- 
binding site (see also Figure 3-41). 



cases. The dimensions and shape of each different site vary depending on the 
conformations of the polypeptide chain in the hypervariable loops, which in 
turn are determined by the sequences of the amino acid side chains in the loops. 
The shapes of binding sites vary greatly — from pockets, to grooves, to undulat- 
ing flatter surfaces, and even to protrusions — depending on the antibody (Fig- 
ure 25-35). Smaller ligands tend to bind to deeper pockets, whereas larger ones 
tend to bind to flatter surfaces. In addition, the binding site can alter its shape 
after antigen binding to fit the ligand better. 

Now that we have discussed the structure and functions of antibodies, we 
are ready to consider the crucial question that puzzled immunologists for many 
years — what are the genetic mechanisms that enable each of us to make many 
billions of different antibody molecules? 



Summary 

Antibodies defend vertebrates against infection by inactivating viruses and microbial 
toxins and by recruiting the complement system and various types of white blood cells 
to kill invading pathogens. A typical antibody molecule is Y-shaped, with two identi- 
cal antigen-binding sites at the tips of the Y, plus binding sites for complement com- 
ponents and various cell-surface receptors on the tail of the Y 

Each B cell clone makes antibody molecules with a unique antigen-binding site. 
Initially, during B cell development in the bone marrow, the antibody molecules are 
inserted into the plasma membrane, where they serve as receptors for antigen. In 
peripheral lymphoid organs, antigen binding to these receptors, together with co-stim- 
ulatory signals provided by helper T cells, activates the B cells to proliferate and differ- 
entiate into either memory cells or antibody-secreting effector cells. The effector cells 
secrete large amounts of antibodies with the same unique antigen-binding site as the 
membrane-bound antibodies. 

A typical antibody molecule is composed of four polypeptide chains, two identical 
heavy chains and two identical light chains. Parts of both the heavy and light chains 
usually combine to form the antigen-binding sites. There are five classes of antibodies 
(IgA, IgD, IgE, IgG, and IgM), each with a distinctive heavy chain (a, S, s, y, and /a, 
respectively). The heavy chains also form the tail (Fc region) of the antibody, which 
determines what other proteins will bind to the antibody and therefore what biologi- 
cal properties the antibody class has. Either type of light chain (k or X) can be associ- 
ated with any class of heavy chain; this choice has no effect on the properties of the 
antibody, except for its specificity for antigen. 

Each light and heavy chain is composed of a number oflg domains — fi sheet struc- 
tures constructed from about 110 amino acids. A light chain has one variable (Vl) and 
one constant (Cl) domain, while a heavy chain has one variable (Vh) and either three or 
four constant (Ch) domains. The amino acid sequence variation in the variable domains 
of both light and heavy chains is concentrated in several small hypervariable regions, 
which protrude as loops at one end of these domains to form the antigen-binding site. 




pocket 



groove 



surface 



Figure 25-35 The variety of antigen- 
binding surfaces in antibodies. The 

hypervariable loops of different Vl and Vh 
domains can combine to form a large 
variety of binding surfaces. The antigenic 
determinants and the antigen-binding 
site of the antibodies are shown in red. 
Only one antigen-binding site is shown 
for each antibody. 
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THE GENERATION OF ANTIBODY DIVERSITY 

Even in the absence of antigen stimulation, a human can probably make more 
than 10 12 different antibody molecules — its preimmune, primary antibody 
repertoire. The primary repertoire consists of IgM and IgD antibodies and is 
apparently large enough to ensure that there will be an antigen-binding site to 
fit almost any potential antigenic determinant, albeit with low affinity (The 
antigen-binding sites of many antibodies can cross-react with a variety of 
related but different antigenic determinants, making this primary antibody 
defense force even more formidable.) 

After stimulation by antigen (and helper T cells), B cells can switch from 
making IgM and IgD to making other classes of antibodies — a process called 
class switching. In addition, the affinity of these antibodies for their antigen pro- 
gressively increases over time — a process called affinity maturation. Thus, anti- 
gen stimulation generates a secondary antibody repertoire, with a greatly 
increased diversity of both Ig classes and antigen-binding sites. 

Antibodies are proteins, and proteins are encoded by genes. Antibody diver- 
sity therefore poses a special genetic problem: how can an animal make more 
antibodies than there are genes in its genome? (The human genome, for exam- 
ple, contains only about 25,000 genes.) This problem is not quite as formidable 
as it might first appear. Recall that the variable regions of the light and heavy 
chains of antibodies usually combine to form the antigen- binding site. Thus, an 
animal with 1000 genes encoding light chains and 1000 genes encoding heavy 
chains could, in principle, combine their products in 1000 x 1000 different ways 
to make 10 6 different antigen-binding sites (although, in reality, not every light 
chain can combine with every heavy chain to make an antigen-binding site). 
Nonetheless, unique genetic mechanisms have evolved to enable adaptive 
immune systems to generate an almost unlimited number of different light and 
heavy chains in a remarkably economical way. 

Not all vertebrates use the same genetic mechanisms to diversify antibodies, 
and there are even substantial differences in the mechanisms used by different 
mammals. We discuss the mechanisms used by mice and humans, in which 
antibody diversity is generated in two steps. First, before antigen stimulation, 
developing B cells join together separate gene segments in DNA in order to cre- 
ate the genes that encode the primary repertoire of low-affinity IgM and IgD 
antibodies. Second, after antigen stimulation, the assembled antibody- coding 
genes can undergo two further changes — mutations that can increase the affin- 
ity of the antigen-binding site and DNA rearrangements that switch the class of 
antibody made. Together, these changes produce the secondary repertoire of 
high-affinity IgG, IgA, and IgE antibodies. 

We begin this section by discussing the mechanisms that B cells use to pro- 
duce the primary antibody repertoire and then discuss the mechanisms that 
they use to produce the secondary repertoire. 



Antibody Genes Are Assembled From Separate Gene Segments 
During B Cell Development 

Mice and humans produce their primary antibody repertoire by joining sepa- 
rate antibody gene segments together during B cell development. Each type of 
antibody chain — k light chains, X light chains, and heavy chains — is encoded 
by a separate locus on a separate chromosome. Each locus contains a large 
number of gene segments encoding the V region of an antibody chain, and one 
or more gene segments encoding the C region. During the development of a B 
cell in the bone marrow (or fetal liver), a complete coding sequence for each of 
the two antibody chains to be synthesized is assembled by site-specific genetic 
recombination (discussed in Chapter 5). In addition to bringing together the 
separate gene segments of the antibody gene, these rearrangements also acti- 
vate transcription from the gene promoter through changes in the relative 
positions of the enhancers and silencers acting on the gene. Thus, a complete 
antibody chain can be synthesized only after the DNA has been rearranged. 
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Each light- chain V region is encoded by a DNA sequence assembled from 
two gene segments — a long Vgene segment and a short joining, or /gene seg- 
ment (not to be confused with the protein / chain (see Figure 25-23), which is 
encoded elsewhere in the genome). Figure 25-36 illustrates the sequence of 
events involved in the production of a human k light-chain polypeptide from its 
separate gene segments. Each heavy-chain V region is similarly constructed by 
combining gene segments, but here an additional diversity segment, or D gene 
segment, is also required (Figure 25-37). 

The large number of inherited V,J, and D gene segments available for encod- 
ing antibody chains contributes substantially to antibody diversity, and the 
combinatorial joining of these segments (called combinatorial diversification) 
greatly increases this contribution. Any of the 40 V segments in the human k 
light-chain locus, for example, can be joined to any of the 5 /segments (see Fig- 
ure 25-36), so that this locus can encode at least 200 (40 x 5) different K-chain V 
regions. Similarly, any of the 40 ^segments in the human heavy-chain locus can 
be joined to any of the 25 D segments and to any of the 6 / segments to encode 
at least 6000 (40 x 25 x 6) different heavy-chain V regions. 

The combinatorial diversification resulting from the assembly of different 
combinations of inherited V,J, and D gene segments is an important mechanism 
for diversifying the antigen-binding sites of antibodies. By this mechanism 
alone, called V(D)J recombination, a human can produce 320 different Vl regions 
(200 k and 120 X) and 6000 different Vh regions. In principle, these could then be 
combined to make about 1.9 x 10 6 (320 x 6000) different antigen-binding sites. 
In addition, as we discuss next, the joining mechanism itself greatly increases 
this number of possibilities (probably more than 10 8 -fold), making the primary 
antibody repertoire much larger than the total number of B cells (about 10 12 ) in 
a human. 
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Figure 25-36 The V-J joining process 
involved in making a human k light 
chain. In the "germ-line" DNA (where the 
antibody genes are not rearranged and 
are therefore not being expressed), the 
cluster of five J gene segments is 
separated from the C-region coding 
sequence by a short intron and from the 
40 i/gene segments by thousands of 
nucleotide pairs. During the 
development of a B cell, a randomly 
chosen i/gene segment (V3 in this case) 
is moved to lie precisely next to one of 
the J gene segments (J3 in this case). The 
"extra" J gene segments (J4 and J5) and 
the intron sequence are transcribed 
(along with the joined V3 and J3 gene 
segments and the C-region coding 
sequence) and then removed by RNA 
splicing to generate mRNA molecules 
with contiguous V3,J3, and C sequences, 
as shown. These mRNAs are then 
translated into k light chains. A J gene 
segment encodes the C-terminal 1 5 or so 
amino acids of the V region, and a short 
sequence containing the V-J segment 
junction encodes the third hypervariable 
region of the light chain, which is the 
most variable part of the V region. 
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Figure 25-37 The human heavy-chain 
locus. There are 40 V segments, 
25 D segments, 6 J segments, and an 
ordered cluster of C-region coding 
sequences, each cluster encoding a 
different class of heavy chain. The 
D segment (and part of the J segment) 
encodes amino acids in the third 
hypervariable region, which is the most 
variable part of the heavy-chain V region. 
The genetic mechanisms involved in 
producing a heavy chain are the same as 
those shown in Figure 25-36 for light 
chains, except that two DNA 
rearrangement steps are required instead 
of one. First a D segment joins to a 
J segment, and then a V segment joins to 
the rearranged DJ segment. The figure is 
not drawn to scale and omits detail: for 
example, the total length of the heavy 
chain locus is over 2 megabases. 
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Imprecise Joining of Gene Segments Greatly Increases the 
Diversity of V Regions 

In the process of V(D)J recombination, site-specific recombination joins sepa- 
rate antibody gene segments together to form a functional Vl- or Vn-region cod- 
ing sequence. Conserved recombination signal sequences flank each gene seg- 
ment and serve as recognition sites for the joining process, ensuring that only 
appropriate gene segments recombine. Thus, for example, a light-chain V seg- 
ment will always join to a / segment but not to another V segment. An enzyme 
complex called the V(D)J recombinase mediates joining. This complex contains 
two proteins that are specific to developing lymphocytes, as well as enzymes 
that help repair damaged DNA in all our cells. 

Two closely linked genes called Ragl and Rag2 (Rag = recombination acti- 
vating genes) encode the lymphocyte-specific proteins of the V(D)J recombi- 
nase, RAG1 and RAG2. To mediate V(D) J joining, the two proteins come together 
to form a complex (called RAG), which functions as an endonuclease, introduc- 
ing double-strand breaks precisely between the gene segments to be joined and 
their flanking recombination signal sequences. RAG then initiates the rejoining 
process by recruiting enzymes involved in DNA double-strand repair in all cells 
(Figure 25-38) . Mice or humans deficient in either of the two Rag genes or in 
nonhomologous end joining are highly susceptible to infection because they are 
unable to carry out V(D)J recombination and consequently do not have func- 
tional B or T cells, a condition called severe combined immunodeficiency (SCID). 
(As we discuss later, T cells use the same V(D)J recombinase to assemble the 
gene segments that encode their antigen- specific receptors.) 

During the joining of antibody (and T cell receptor) gene segments, as in 
nonhomologous end-joining (see Figure 5-5 1A), a variable number of 
nucleotides are often lost from the ends of the recombining gene segments, and 
one or more randomly chosen nucleotides may also be inserted. This random 
loss and gain of nucleotides at joining sites is called junctional diversification, 
and it enormously increases the diversity of V- region coding sequences created 
by V(D)J recombination, specifically in the third hypervariable region. This 
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Figure 25-38 The role of 
recombination signal sequences in 
RAG-mediated gene segment 
joining. In the case shown, V1 is 
joined to J1 in a light-chain locus. 
Two types of DNA signal sequences 
are involved in V(D)J recombination, 
and recombination can only occur 
between different types: both have 
the same 7-base-pair (bp) sequence 
at one end and the same 9-bp 
sequence at the other end, but in 
one type the ends are separated by 
a 1 2-bp spacer and in the other they 
are separated by a 23-bp spacer, as 
shown. When one RAG protein binds 
to a 1 2-bp spacer and another to a 
23-bp spacer, and the two RAG 
proteins bind to each other, the two 
different signal sequences are 
juxtaposed. The RAG complex then 
cuts the two signal sequences at 
their 7-bp ends, and DNA repair 
enzymes join the cut V1 and J1 
segments together. The signal 
sequences are also joined together 
and discarded as a small circle of 
DNA that contains all the DNA 
originally located between 
V1 andJ1. 

The same process and signal 
sequences are used to join V, D, and 
J gene segments in a heavy-chain 
locus. The arrangement of signal 
sequences and the "12/23 rule" just 
described ensure that only 
appropriate gene segments 
recombine. 
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increased diversification comes at a price, however. In many cases, it will shift 
the reading frame to produce a nonfunctional gene. Because roughly two in 
every three rearrangements are "nonproductive" in this way, many developing 
B cells never make a functional antibody molecule and consequently die in the 
bone marrow. 

B cells making functional antibody molecules that bind strongly to self anti- 
gens in the bone marrow would be dangerous. Such B cells maintain expression 
of the RAG proteins and can undergo a second round ofV(D)J recombination in 
a light-chain locus (usually a k locus), thereby changing the specificity of the 
cell-surface antibody they make — a process referred to as receptor editing. To 
provide a further layer of protection, clonal deletion eliminates those self- reac- 
tive B cells that fail to change their specificity (see Figure 25-13). 

The Control of V(D)J Recombination Ensures That B Cells Are 
Monospecific 

B cells are monospecific. That is, all the antibodies that any one B cell produces 
have identical antigen-binding sites. This property enables antibodies to cross- 
link antigens into large aggregates, thereby promoting antigen elimination (see 
Figure 25-19). It also means that an activated B cell secretes antibodies with the 
same specificity as that of its membrane-bound antibody receptor, guaranteeing 
the specificity of antibody responses (see Figure 25-17). 

To achieve monospecificity, each B cell must make only one type of Vl region 
and one type of Vh region. Since B cells, like other somatic cells, are diploid, each 
cell has six loci encoding antibody chains: two heavy-chain loci (one from each 
parent) and four light-chain loci (one k and one X from each parent). If DNA 
rearrangements occurred independently in each heavy- chain locus and each 
light-chain locus, a single B cell could make up to eight different antibodies, 
each with a different antigen-binding site. 

In fact, however, each B cell uses only two of the six antibody loci: one of the 
two heavy- chain loci and one of the four light- chain loci. Thus, each B cell must 
choose not only between its k and X light-chain loci, but also between its mater- 
nal and paternal light-chain and heavy-chain loci. This second choice is called 
allelic exclusion. Allelic exclusion also occurs in the expression of some genes 
that encode T cell receptors and genes that encode olfactory receptors in the 
nose (discussed in Chapter 15). However, for most proteins that are encoded by 
autosomal genes, both the maternal and paternal gene copies in a cell are 
expressed about equally. 

Allelic exclusion and k versus X light- chain choice during B cell development 
depend on negative feedback regulation of the V(D)J recombination process. A 
functional rearrangement in one antibody locus suppresses rearrangements in 
all remaining loci that encode the same type of antibody chain (Figure 25-39). In 
B cell clones isolated from transgenic mice expressing a rearranged jll- chain gene, 
for example, the rearrangement of all of the endogenous heavy- chain genes is 
usually suppressed. Comparable results have been obtained for light chains. The 
suppression does not occur if the product of the rearranged gene fails to assem- 
ble into a receptor that inserts into the plasma membrane. It has therefore been 
proposed that either the receptor assembly process itself or extracellular signals 
that act on the receptor suppress further gene rearrangements. 

Although no biological differences between the constant regions of k and X 
light chains have been discovered, there is an advantage in having two separate 
loci encoding light-chain variable regions. Having two separate loci increases 
the chance that a pre-B cell that has successfully assembled a Vn-region coding 
sequence will then successfully assemble a VL-region coding sequence to 
become a B cell. This chance is further increased because, before a developing 
pre-B cell produces ordinary light chains, it makes surrogate light chains (see 
Figure 25-22), which assemble with |li heavy chains. The resulting receptors are 
displayed on the cell surface and allow the cell to proliferate, producing large 
numbers of progeny cells, some of which are likely to succeed in producing bona 
fide light chains. 
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Figure 25-39 Selection of antibody loci during B cell development in the bone 
marrow. To produce antibodies with only one type of antigen-binding site, a 
developing B cell must use only one L-chain locus and one H-chain locus. 
Although the choice between maternal and paternal loci is thought to be random, 
the assembly of V-region coding sequences in a developing B cell proceeds in an 
orderly sequence, one segment at a time, usually beginning with the H-chain 
locus. In this locus, D segments first join to Jh segments on both parental 
chromosomes; then i/nto DJh joining occurs on one of these chromosomes (not 
shown). If this rearrangement produces a functional gene, the resulting 
production of complete |i chains (always the first H chains made) leads to their 
expression on the cell surface in association with surrogate light chains (see 
Figure 25-22). The cell now shuts down all further rearrangements of VVi-region- 
encoding gene segments and initiates Vt rearrangement. Vi rearrangement 
usually occurs first in a k locus, and only if that fails does it occur at the other 
k locus or at a X locus. If, at any point, "in-phase" V L -to-Ji joining leads to the 
production of light chains, these combine with preexisting ji chains to form IgM 
antibody molecules, which insert into the plasma membrane. The IgM cell-surface 
receptors are thought to enable the newly formed B cell to receive extracellular 
signals that shut down all further V(D)J recombination, by turning off the 
expression of the Ragl and Rag2 genes. 

If a developing B cell makes a receptor with high affinity for a self antigen, Rag 
gene expression is maintained and the cell undergoes another round of V(D)J 
recombination in a light-chain locus (called receptor editing — see Figure 25-13), 
thereby changing the specificity of its receptor (not shown). If a cell fails to 
assemble both a functional Vn-region and a functional VVregion coding sequence, 
it is unable to make antibody molecules and dies by apoptosis (not shown). 

The production of a functional B cell is a complex and highly selective pro- 
cess: in the end, all B cells that fail to produce intact antibody molecules die by 
apoptosis. 

We turn now from the mechanisms responsible for generating the primary 
antibody repertoire before antigen stimulation to those responsible for generat- 
ing the secondary antibody repertoire after antigen stimulation. We begin with 
the remarkable Darwinian-like mechanism responsible for increasing the affin- 
ity of the antigen-binding sites of antibodies for their specific antigen. 



COMMON LYMPHOID 
PROGENITOR CELL 




chosen 

heavy-chain 

locus 



Antigen-Driven Somatic Hypermutation Fine-Tunes Antibody 
Responses 

As mentioned earlier, with the passage of time after immunization, there is usu- 
ally a progressive increase in the affinity of the antibodies produced against the 
immunizing antigen. This phenomenon, known as affinity maturation, is due 
to the accumulation of point mutations in both heavy-chain and light-chain V- 
region coding sequences. The mutations occur long after the coding regions 
have been assembled. After B cells have been stimulated by antigen and helper 
T cells in a peripheral lymphoid organ, some of the activated B cells proliferate 
rapidly in the lymphoid follicles (see Figure 25-16) and form structures called 
germinal centers. Here, the B cells mutate at the rate of about one mutation per 
V-region coding sequence per cell generation. Because this is about a million 
times greater than the spontaneous mutation rate in other genes and occurs in 
somatic cells rather than germ cells (discussed in Chapter 21), the process is 
called somatic hypermutation. 

Very few of the altered antibodies generated by hypermutation will have an 
increased affinity for the antigen. Because the same antibody genes produce the 
antigen receptors on the B cell surface, the antigen will stimulate preferentially 
those few B cells that do make such antibodies with increased affinity for the 
antigen. Clones of these altered B cells will preferentially survive and proliferate, 
especially as the amount of antigen decreases to very low levels late in the 
response. Most other B cells in the germinal center will die by apoptosis. Thus, 
as a result of repeated cycles of somatic hypermutation, followed by antigen- 
driven proliferation of selected clones of effector and memory B cells, antibod- 
ies of increasingly higher affinity become abundant during an immune 
response, providing progressively better protection against the pathogen. (In 
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some mammals, including sheep and cows, a similar somatic hypermutation 
also plays a major part in diversifying the primary antibody repertoire before B 
cells encounter their antigen.) 

A breakthrough in understanding the molecular mechanism of somatic 
hypermutation came with the identification of an enzyme that is required for 
the process. It is called activation-induced deaminase (AID) because it is 
expressed specifically in activated B cells and deaminates cytosine (C) to uracil 
(U) in transcribed V-region coding DNA. The deamination produces U:G mis- 
matches in the DNA double helix, and the repair of these mismatches produces 
various types of mutations, depending on the repair pathway used (Figure 
25-40). Somatic hypermutation affects only actively transcribed V-region cod- 
ing sequences, possibly because the AID enzyme is specifically loaded onto RNA 
transcripts (discussed in Chapter 7). AID is also required when activated B cells 
switch from IgM production to the production of other classes of antibody, as we 
now discuss. 



B Cells Can Switch the Class of Antibody They Make 

As discussed earlier, all B cells begin their antibody- synthesizing lives by making 
IgM molecules and inserting them into the plasma membrane as receptors for 
antigen. After the B cells leave the bone marrow, but before they interact with 
antigen, they begin making both IgM and IgD molecules as membrane-bound 
antigen receptors, both with the same antigen-binding sites (see Figure 25-22). 
Stimulation by antigen and helper T cells activates many of these cells to 
become IgM-secreting effector cells, so that IgM antibodies dominate the pri- 
mary antibody response. Later in the immune response, however, when acti- 
vated B cells are undergoing somatic hypermutation, the combination of anti- 
gen and help er-T- cell- derived cytokines stimulates many of the B cells to switch 
from making membrane-bound IgM and IgD to making IgG, IgA, or IgE anti- 
bodies — a process called class switching. Some of these cells become memory 
cells that express the corresponding class of antibody molecules on their sur- 
face, while others become effector cells that secrete the antibodies. The IgG, IgA, 
and IgE molecules are collectively referred to as secondary classes of antibodies, 
because they are produced only after antigen stimulation, dominate secondary 
antibody responses, and make up the secondary antibody repertoire. As we saw 
earlier, each different class of antibody is specialized to attack pathogens in dif- 
ferent ways and in different sites. 

The constant region of an antibody heavy chain determines the class of the 
antibody. Thus, the ability of B cells to switch the class of antibody they make 
without changing the antigen-binding site implies that the same assembled Vh- 
region coding sequence (which specifies the antigen-binding part of the heavy 
chain) can sequentially associate with different Cn-coding sequences. This has 
important functional implications. It means that, in an individual animal, a 
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Figure 25-40 Some ways in which AID can 
cause mutations during somatic 
hypermutation. AID deaminates some 
cytosines to uracil in transcribed 
V-region coding DNA, causing U:G 
mismatches, which lead to mutations in 
various ways. Some mutations occur when 
DNA containing unprocessed U:G 
mismatches are replicated (see Figure 
5-49A). Others occur when the uracil is 
removed by uracil-DNA glycosylase before 
the DNA is replicated, as this generates a 
position on one of the DNA template 
strands that lacks a base for DNA 
polymerase to copy. Still others (not shown) 
occur when the area around the U:G 
mismatch is excised by the mismatch repair 
system (discussed in Chapter 5), producing 
a gap that can be repaired by error-prone 
DNA polymerases, thereby generating 
mutations at A:T as well as C:G pairs. 
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particular antigen-binding site that has been selected by environmental anti- 
gens can be distributed among the various classes of antibodies, thereby acquir- 
ing the different biological properties of each class. 

When a B cell switches from making IgM and IgD to one of the secondary 
classes of antibody, an irreversible change at the DNA level occurs — a process 
called class-switch recombination. It entails the deletion of all the Cn-coding 
sequences between the assembled VDJ-coding sequence and the particular Cn- 
coding sequence that the cell is destined to express. Class-switch recombination 
differs from V(D)J recombination in several ways. (1) It happens after antigen 
stimulation, mainly in germinal centers, and depends on helper T cells. (2) It 
uses different recombination signal sequences, called switch sequences, which 
are composed of short motifs tandemly repeated over several kilobases. (3) It 
involves cutting and joining the switch sequences, which are noncoding 
sequences, and so the coding sequence is unaffected (Figure 25-41). (4) Most 
importantly, the molecular mechanism is different. It depends on AID, which is 
also involved in somatic hypermutation, rather than on RAG, which is responsi- 
ble for V(D)J recombination. 

The cytokines that activate class switching induce the production of tran- 
scription factors that activate transcription from the relevant switch sequences, 
allowing AID to bind to these sequences. Once bound, AID initiates switch 
recombination by deaminating some cytosines to uracil in the vicinity of these 
switch sequences. Excision of these uracils by uracil-DNA glycosylase (see Fig- 
ure 25-40) is thought to lead somehow to double-strand breaks in the partici- 
pating switch regions, which are then joined by a form of nonhomologous end- 
joining (discussed in Chapter 5). 

Thus, whereas the primary antibody repertoire in mice and humans is gen- 
erated by V(D)J joining mediated by RAG, the secondary antibody repertoire is 
generated by somatic hypermutation and class-switch recombination, both of 
which are mediated by AID. Figure 25-42 summarizes the main mechanisms 
involved in diversifying antibodies that we have discussed in this chapter. 



Summary 

Antibodies are encoded by three loci on separate chromosomes, each of which produces 
a different polypeptide chain. One encodes k light chains, one encodes X light chains, 
and one encodes heavy chains. Each antibody locus contains separate gene segments 
that code for different parts of the variable region of the particular antibody chain. 
Each light-chain locus contains one or more constant- (C-) region coding sequences 
and sets of variable (V) and joining (J) gene segments. The heavy-chain locus contains 
sets ofC-region coding sequences and sets ofV, diversity (D), and J gene segments. 



Figure 25-41 An example of the DNA 
rearrangement that occurs in class switch 
recombination. A B cell making an IgM 
antibody from an assembled VDJ DNA 
sequence is stimulated to switch to 
making an IgA antibody. In the process, it 
deletes the DNA between the VDJ 
sequence and the C a -coding sequence. 
Specific DNA sequences (switch sequences) 
located upstream of each Cn-coding 
sequence (except C§) recombine with each 
other, with the deletion of the intervening 
DNA. As discussed in the text, the 
recombination process depends on AID 
and uracil-DNA glycosylase (UDG), the 
same enzymes that are involved in 
somatic hypermutation (see Figure 25-40). 
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Figure 25-42 The main mechanisms of 
antibody diversification in mice and 
humans. Those shaded in green occur 
during B cell development in the bone 
marrow (or fetal liver), whereas the 
mechanisms shaded in red occur when 
B cells are stimulated by foreign antigen 
and helperT cells in peripheral lymphoid 
organs, either late in a primary response 
or in a secondary response. 
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During B cell development in the bone marrow (or fetal liver), before antigen stim- 
ulation, separate gene segments are brought together by site- specific recombination 
that depends on the RAG complex. A Vi gene segment recombines with a Jl gene seg- 
ment to produce a DNA sequence coding for the V region of a light chain, and a Vngene 
segment recombines with a D and aJngene segment to produce a DNA sequence cod- 
ing for the V region of a heavy chain. Each of the assembled V-region coding sequences 
is then co-transcribed with the appropriate C-region sequence to produce an RNA 
molecule that codes for the complete polypeptide chain. Once a B cell makes func- 
tional heavy and light chains that form antigen-binding sites (that do not bind a self 
antigen with high affinity), it turns off the V(D)J recombination process, thereby ensur- 
ing that the cell makes only one species of antigen-binding site. 

By randomly combining inherited gene segments that code for Vl and Vh regions 
during B cell development, humans can make hundreds of different light chains and 
thousands of different heavy chains. Because the antigen-binding site is formed where 
the hypervariable loops of the Vl and Vh come together in the final antibody, the heavy 
and light chains can potentially pair to form antibodies with millions of different 
antigen-binding sites. The loss and gain of nucleotides at the site of gene-segment join- 
ing increases this number enormously. These antibodies, made by RAG-dependent 
V(D)J recombination before antigen stimulation, are low-affinity IgM and IgD anti- 
bodies, and they constitute the primary antibody repertoire. 

Antibodies are further diversified following antigen stimulation in peripheral lym- 
phoid organs by the AID- and helper-T- cell- dependent processes of somatic hypermu- 
tation and class-switch recombination, which produce the high-affinity IgG, IgA, and 
IgE antibodies that constitute the secondary antibody repertoire. Class switching allows 
the same antigen-binding site to be incorporated into antibodies that have different 
biological properties. 



T CELLS AND MHC PROTEINS 

Like antibody responses, T- cell-mediated immune responses are exquisitely 
antigen-specific, and they are at least as important as antibodies in defending 
vertebrates against infection. Indeed, most adaptive immune responses, includ- 
ing most antibody responses, require helper T cells for their initiation. Most 
importantly, unlike B cells, T cells can help eliminate pathogens that would oth- 
erwise be invisible inside host cells. Much of the rest of this chapter is concerned 
with how T cells accomplish this feat. 

T cell responses differ from B cell responses in at least two crucial ways. 
First, T cells are activated by foreign antigen to proliferate and differentiate into 
effector cells only when the antigen is displayed on the surface of antigen-pre- 
senting cells, usually dendritic cells in peripheral lymphoid organs. T cells 
require antigen-presenting cells for activation because the form of antigen they 
recognize is different from that recognized by B cells. Whereas B cells recognize 
intact protein antigens, for example, T cells recognize fragments of protein anti- 
gens that have been partly degraded inside the antigen-presenting cell. Special 
proteins, called MHC proteins (introduced in Chapter 24), bind to the peptide 
fragments and carry them to the surface of the antigen-presenting cell, where T 
cells can recognize them. 

The second difference is that, once activated, effector T cells act only at short 
range, either within a secondary lymphoid organ or after they have migrated 
into a site of infection. Effector B cells, by contrast, secrete antibodies that can 
act far away. Effector T cells interact directly with another host cell in the body, 
which they either kill (as in the case of an infected host cell, for example) or sig- 
nal in some way (as in the case of a B cell or macrophage, for example) . We shall 
refer to such host cells as target cells. However, because these traget cells must 
display an antigen bound to an MHC protein on their surface for a T cell to rec- 
ognize them, they are also antigen-presenting cells. 

There are three main classes of T cells — cytotoxic T cells, helper T cells, and 
regulatory (suppressor) T cells. Effector cytotoxic T cells directly kill cells that are 
infected with a virus or some other intracellular pathogen. Effector helper T cells 
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help stimulate the responses of other cells — mainly macrophages, dendritic 
cells, B cells, and cytotoxic T cells. Effector regulatory T cells suppress the activ- 
ity of other cells, especially of self- reactive effector T cells. 

In this section, we describe these three main classes of T cells and their 
respective functions. We discuss how they recognize foreign antigens on the sur- 
face of antigen-presenting cells and target cells and consider the crucial part 
played by MHC proteins in the recognition process. Finally, we describe how T 
cells are selected during their development in the thymus to ensure that only cells 
with potentially useful receptors survive and mature. We begin by considering the 
nature of the cell-surface receptors that T cells use to recognize antigen. 



T Cell Receptors (TCRs) Are Antibodylike Heterodimers 

Because T cell responses depend on direct contact with an antigen-presenting 
cell or a target cell, T cell receptors (TCRs), unlike antibodies made by B cells, 
exist only in membrane-bound form and are not secreted. For this reason, TCRs 
were difficult to isolate, and it was not until the 1980s that researchers identified 
their molecular structure. TCRs resemble antibodies. They are composed of two 
disulfide-linked polypeptide chains, each of which contains two Ig-like 
domains, one variable and one constant (Figure 25-43A). Moreover, the three- 
dimensional structure of the extracellular part of aTCR has been determined by 
x-ray diffraction, and it looks very much like one arm of a Y-shaped antibody 
molecule (Figure 25-43B). 

On most T cells, the TCRs have one a chain and one (3 chain. The genetic 
loci that encode the a and (3 chains are located on different chromosomes. Like 
an antibody heavy-chain locus, the TCR loci contain separate V, D, and /gene 
segments (or just V and / gene segments in the case of the a chain locus), 
which are brought together by site-specific recombination during T cell devel- 
opment in the thymus. With one exception, T cells use the same mechanisms 
to generate TCR diversity as B cells use to generate antibody diversity. Indeed, 
they use the same V(D)J recombinase, including the RAG proteins discussed 
earlier. The mechanism that does not operate in TCR diversification is antigen- 
driven somatic hypermutation. Thus, the affinity of the receptors tends to be 
low (iC a ~ 10 5 -10 7 liters/mole), although T cells with the highest affinities are 
preferentially selected by antigen to persist as memory cells. T cells can partly 
compensate for their low affinity by increased avidity, which results when mul- 
tiple TCRs bind simultaneously to multiple membrane-bound ligands (the 
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Figure 25-43 A T cell receptor (TCR) 
heterodimer. (A) Schematic drawing 
showing that the receptor is composed 
of an a and a p polypeptide chain. Each 
chain is about 280 amino acids long and 
has a large extracellular part that is 
folded into two Ig-like domains — one 
variable (V) and one constant (C).The 
antigen-binding site is formed by a V a 
and a Vp domain (shaded in blue). Unlike 
antibodies, which have two binding sites 
for antigen, TCRs have only one. The 
ap heterodimer is noncovalently 
associated with a large set of invariant 
membrane-bound proteins (not shown), 
which help activate the T cell when the 
TCRs bind to antigen. A typical T cell has 
about 30,000 such receptor complexes 
on its surface. (B) The three-dimensional 
structure of the extracellular part of a 
TCR. The antigen-binding site is formed 
by the hypervariable loops of both the 
V a and Vp domains (red), and it is similar 
in its overall dimensions and geometry to 
the antigen-binding site of an antibody 
molecule. (B, based on K.C. Garcia et al., 
Science 274:209-21 9, 1 996. With 
permission from AAAS.) 
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peptide-MHC complexes that we describe later). Moreover, various co-recep- 
tors and cell-cell adhesion proteins greatly strengthen the binding of a T cell 
to an antigen-presenting cell or a target cell. 

A minority of T cells, instead of making a and (3 chains, make a different but 
related type of receptor heterodimer, composed of y chains and 8 chains. 
Although these cells normally make up 5-10% of the T cells in human blood, 
they can be the dominant T cell population in epithelia (in the skin and gut, for 
example). The functions of these cells are less well understood than those of T 
cells expressing a/ (3 TCRs, and we will not discuss them further. 

As with antigen receptors on B cells, the TCRs are tightly associated in the 
plasma membrane with a number of invariant membrane-bound proteins that 
are involved in passing the signal from an antigen- activated receptor to the cell 
interior (see Figure 25-66). We will discuss these proteins in more detail later. 
First, we must consider the special ways in which T cells recognize foreign anti- 
gen on the surface of an antigen-presenting cell. 
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Antigen Presentation by Dendritic Cells Can Either Activate or 
TolerizeT Cells 

Naive cytotoxic or helper T cells must be activated to proliferate and differenti- 
ate into effector cells before they can kill or help their target cells, respectively. 
This activation occurs in peripheral lymphoid organs on the surface of activated 
dendritic cells (Figure 25-44) that display foreign antigen complexed with MHC 
proteins on their surface, along with co- stimulatory proteins. By contrast, mem- 
ory T cells can be activated by other types of antigen-presenting cells, including 
macrophages and B cells, as well as by dendritic cells. 

Various kinds of dendritic cells interact with T cells, but they all have a sin- 
gle known function, which is to present antigens that either activate or sup- 
press the T cells. Dendritic cells are located in tissues throughout the body, 
including the central and peripheral lymphoid organs. Wherever they 
encounter invading microbes, they endocytose the pathogens or their prod- 
ucts. If the encounter is not in a lymphoid organ, the dendritic cells carry the 
foreign antigens via the lymph to local lymph nodes or gut-associated lym- 
phoid organs. The encounter with a pathogen activates pattern recognition 
receptors of the dendritic cell, which is thereby induced to mature from an 
antigen-capturing cell to an activated antigen-presenting cell that can activate 
T cells (see Figure 25-5). Dendritic cells have to be activated in order to acti- 
vate naive T cells, and they can also be activated by tissue injury or by effector 
helper T cells. Tissue injury is thought to activate dendritic cells by the release 
of heat shock proteins and uric acid crystals when cells die by necrosis rather 
than by apoptosis (discussed in Chapter 18). 

Activated dendritic cells display three types of protein molecules on their 
surface that have a role in activating a T cell to become an effector cell or a mem- 
ory cell (Figure 25-45): (1) MHC proteins, which present foreign antigen to the 
TCR, (2) co-stimulatory proteins, which bind to complementary receptors on the 
T cell surface, and (3) cell-cell adhesion molecules, which enable a T cell to bind 
to the antigen-presenting cell for long enough to become activated, which is 
usually hours. In addition, activated dendritic cells secrete a variety of cytokines 
that can influence the type of effector helper T cell that develops (discussed 
later), as well as where the T cell migrates after it has been stimulated. T cells 
activated by dendritic cells isolated from gut- associated Peyer's patches (see Fig- 
ure 25-3), for example, but not by those isolated from lymph nodes, migrate to 
the small intestine where their antigens are likely to be located. 

Nonactivated dendritic cells also have important roles. They help induce 
self-reactive T cells to become tolerant, both in the thymus and in other organs; 
such dendritic cells present self antigens in the absence of the co- stimulatory 
molecules required to activate naive T cells. They induce tolerance in at least two 
ways: they can stimulate abortive responses in the T cell that lead to either inac- 
tivation or apoptosis, and they can activate regulatory T cells to suppress the 
activity of another T cell. 
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Figure 25-44 Immunofluorescence 
micrograph of a dendritic cell in culture. 

These antigen-presenting cells derive 
their name from their long processes, or 
"dendrites." The cell has been labeled with 
a monoclonal antibody that recognizes a 
surface antigen on these cells. (Courtesy 
of David Katz.) 
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Before discussing the role of MHC proteins in presenting antigen to T cells, 
we consider the functions of the three major classes of T cells. 

Effector Cytotoxic T Cells Induce Infected Target Cells to Kill 
Themselves 

Cytotoxic T cells protect vertebrates against intracellular pathogens such as 
viruses and some bacteria and parasites that multiply in the host-cell cytoplasm, 
where they are sheltered from antibody-mediated attack. Cytotoxic T cells do 
this by killing the infected cell before the microbes can proliferate and escape 
from the infected cell to infect neighboring cells. As we discuss later, the intra- 
cellular microbes can be recognized by T cells because vertebrate cells have 
mechanisms for displaying fragments of their intracellular proteins on the cell 
surface, where they are bound to MHC proteins. 

Once a cytotoxic T cell has been activated by an infected antigen-presenting 
cell to become an effector cell, it can kill any target cell harboring the same 
pathogen. Using its TCR, the effector cytotoxic T cell first recognizes a microbial 
antigen bound to an MHC protein on the surface of an infected target cell. This 
causes the T cell to reorganize its cytoskeleton and focus its killing apparatus on 
the target (Figure 25-46). Focus is achieved when the TCRs actively aggregate — 




Figure 25-45 Three types of proteins on 
the surface of an activated dendritic cell 
involved in activating a T cell. The 

invariant polypeptide chains that are 
always stably associated with the T cell 
receptor (TCR) are not shown. 



Figure 25-46 Effector cytotoxic T cells 
killing target cells in culture. <GTCA> 
(A) Electron micrograph showing an effector 
cytotoxic T cell binding to a target cell. The 
cytotoxic T cells were obtained from mice 
immunized with the target cells, which are 
foreign tumor cells. (B) Electron micrograph 
showing a cytotoxic T cell and a tumor cell 
that theT cell has killed. In an animal, as 
opposed to in a culture dish, the killed 
target cell would be phagocytosed by 
neighboring cells long before it 
disintegrated in the way that it has here. 
(C) Immuno-fluorescence micrograph of a 
T cell and tumor cell after staining with 
anti-tubulin antibodies. Note that the 
centrosome in theT cell is located at the 
point of cell-cell contact with the target 
cell — an immunological synapse. The 
secretory granules (not visible) in the T cell 
are initially transported along microtubules 
to the centrosome, which then moves to the 
synapse, delivering the granules to where 
they can release their contents. See also 
Figure 1 6-1 03. (A and B, from D. Zagury et 
al., Eur. J. Immunol. 5:81 8-822, 1 975. With 
permission from John Wiley & Sons, Inc. 
C, reproduced from B. Geiger, D. Rosen and 
G. Berke, J. Cell Biol. 95:137-143, 1982. 
With permission from The Rockefeller 
University Press.) 
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along with various co-receptors, adhesion molecules, and signaling proteins — at 
the T cell/target cell interface, forming an immunological synapse. A similar 
synapse forms when an effector helper T cell interacts with its target cell. In this 
way, effector T cells avoid delivering their signals to neighboring cells. 

Once bound to its target cell, an effector cytotoxic T cell can employ one of 
two strategies to kill the target, both of which operate by inducing the target cell 
to kill itself by undergoing apoptosis (discussed in Chapter 18). In killing an 
infected target cell, the cytotoxic T cell usually releases a pore-forming protein 
called perforin, which is homologous to the complement component C9 (see 
Figure 24-49). The cytotoxic T cell stores perforin in secretory vesicles and 
releases it by local exocytosis at the point of contact with the target cell. Perforin 
then polymerizes in the target cell plasma membrane to form transmembrane 
channels. The secretory vesicles also contain serine proteases, which are 
thought to enter the target cell cytosol through the perforin channels. One of the 
proteases, called granzyme B, activates a pro-apoptotic Bcl2 protein called Bid 
by producing a truncated form of the protein called tBid; tBid then releases 
cytochrome c from mitochondria, triggering a proteolytic caspase cascade that 
kills the cell by apoptosis (discussed in Chapter 18) (Figure 25-47A). Mice with 
an inactivated perforin gene cannot generate pathogen-specific cytotoxic T 
cells, and they show an increased susceptibility to certain viral and intracellular 
bacterial infections. 

In the second killing strategy, the cytotoxic T cell activates a death-inducing 
caspase cascade in the target cell less directly. A homotrimeric protein on the 
cytotoxic T cell surface called Fas ligand binds to transmembrane receptor pro- 
teins on the target cell called Fas. The binding alters the Fas proteins so that their 
clustered cytosolic tails recruit procaspase-8 into a complex via an adaptor pro- 
tein. The recruited procaspase-8 molecules thereby become activated and initi- 
ate a caspase cascade that leads to apoptosis (Figure 25-47B). 



Figure 25-47 Two strategies by which 
effector cytotoxic T cells kill their target 
cells. In both cases, the T cell has to 
contact the target cell to kill it, and a 
single cytotoxic T cell can kill multiple 
target cells in sequence. (A) The cytotoxic 
T cell (Tc) releases perforin and 
proteolytic enzymes onto the surface of 
an infected target cell by localized 
exocytosis. The high concentration of 
Ca 2+ in the extracellular fluid causes the 
perforin to assemble into transmembrane 
channels in the target cell plasma 
membrane. The channels are thought to 
allow the proteolytic enzymes to enter 
the target cell cytosol. One of the 
enzymes, granzyme B, cleaves the Bid 
protein to produce the truncated form 
tBid, which releases cytochrome c from 
mitochondria to initiate a caspase 
cascade leading to apoptosis. (B)The 
homotrimeric Fas ligand on the surface of 
the cytotoxic T cell binds to and activates 
the Fas protein on the surface of a target 
cell. The cytosolic tail of Fas contains a 
death domain, which, when activated, 
binds to an adaptor protein, which in 
turn recruits a specific procaspase 
(procaspase-8). Clustered procaspase-8 
molecules become activated and initiate 
a proteolytic caspase cascade leading to 
apoptosis (see Figure 18-6). 



Effector Helper T Cells Help Activate Other Cells of the Innate and 
Adaptive Immune Systems 

In contrast to cytotoxic T cells, helper T cells are crucial for defense against both 
extracellular and intracellular pathogens. They help stimulate B cells to make 
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Figure 25-48 Differentiation of naive 
helper T cells into either ThI or"l"H2 
effector helper cells in a peripheral 
lymphoid organ. The nature of the 
activated dendritic cell and the 
characteristics of the pathogen that 
activated it mainly determine which type 
of effector helper cell develops. 



antibodies that help inactivate or eliminate extracellular pathogens and their 
toxic products. They also activate macrophages to destroy any intracellular 
pathogens multiplying within the macrophage's phagosomes, and they help 
activate cytotoxic T cells to kill infected target cells. They can also stimulate a 
dendritic cell to maintain it in an activated state. 

Once an antigen-presenting cell activates a helper T cell to become an effec- 
tor cell, the helper cell can then help activate other cells. It does this both by 
secreting a variety of co- stimulatory cytokines and by displaying co- stimulatory 
proteins on its surface. When activated by its binding to an antigen on a dendritic 
cell, a naive helper T cell usually differentiates into either of two distinct types of 
effector helper cell, called ThI and Th2. ThI cells are mainly involved in immunity 
to intracellular microbes and help activate macrophages, cytotoxic T cells, and B 
cells. Th2 cells are mainly involved in immunity to extracellular pathogens, espe- 
cially multicellular parasites, and they help activate B cells to make antibodies 
against the pathogen (Figure 25-48). As we discuss later, the nature of the invad- 
ing pathogen and the types of innate immune responses it elicits largely deter- 
mine which type of helper T cell develops. This, in turn, determines the nature of 
the adaptive immune responses mobilized to fight the invaders. 

In some cases, a naive helper T cell that encounters its antigen in a periph- 
eral lymphoid organ develops into an effector cell that suppresses rather than 
helps an immune response. Such regulatory T cells, however, mostly develop in 
the thymus, as a distinct class of T cell, as we now discuss. 



Regulatory T Cells Suppress the Activity of Other T Cells 

Regulatory T cells have been difficult to study and characterize, largely because, 
until recently, there were no good markers to identify them. Indeed, for many 
years immunologists questioned whether such cells existed or not. They were 
originally identified by their ability to suppress the activity of other lymphocytes 
and were therefore called suppressor T cells. When markers became available, they 
were renamed regulatory T cells and shown to suppress the activity of effector 
helper and cytotoxic T cells and of dendritic cells. Although they make up less than 
10% of the T cells in the blood and peripheral lymphoid organs, regulatory T cells 
play a crucial part in immunological self tolerance by suppressing the activity of 
self- reactive effector helper and cytotoxic T cells. They also help prevent excessive 
T cell responses to microbial antigens in chronic infections. In both these ways, 
they help prevent adaptive immune responses from damaging host tissues. 

A breakthrough in understanding regulatory T cells was the discovery that 
they alone express the transcription factor Foxp3, which serves as both an 
unambiguous marker of these cells and a master controller of their develop- 
ment. When the gene encoding this protein is inactivated in mice or humans, for 
example, the individuals specifically fail to produce regulatory T cells and 



T CELLS AND MHC PROTEINS 



1575 



develop an early and fatal autoimmune disease involving multiple organs. It is 
still uncertain how regulatory T cells suppress the action of effector T cells or 
dendritic cells, but one pathway is thought to involve the secretion of the 
inhibitory cytokines TGF/5 and interleukin 10 (IL10). 

We now turn to the crucial role of MHC proteins in presenting antigen to T 
cells. 



T Cells Recognize Foreign Peptides Bound to MHC Proteins 

As discussed earlier, both cytotoxic T cells and helper T cells are initially activated 
in peripheral lymphoid organs by recognizing foreign antigen on the surface of an 
antigen-presenting cell, which is a dendritic cell in the case of a naive T cell. The 
antigen is usually in the form of peptide fragments that are generated by the 
degradation of foreign protein antigens inside the antigen-presenting cell. The 
recognition process depends on the presence of MHC proteins in the antigen-pre- 
senting cell. These bind fragments, carry them to the cell surface, and present 
them there — along with co- stimulatory signals — to the T cells. Once activated, 
effector T cells then recognize the same peptide-MHC complex on the surface of 
the target cell they influence, which may be a B cell, a cytotoxic T cell, or an 
infected macrophage in the case of a helper T cell, or any infected host cell in the 
case of a cytotoxic T cell; for helper T cells, it may also be the dendritic cell itself. 

A large complex of genes called the major histocompatibility complex 
(MHC) encodes MHC proteins. There are two main classes of MHC proteins, and 
they are both structurally and functionally distinct. Class I MHC proteins mainly 
present foreign peptides to cytotoxic T cells, and class II MHC proteins mainly 
present foreign peptides to helper and regulatory T cells (Figure 25-49). 

Before examining the mechanisms by which protein antigens are processed 
for presentation to T cells, we must look more closely at the MHC proteins them- 
selves, which have such a critical role in T cell function. 



MHC Proteins Were Identified in Transplantation Reactions Before 
Their Functions Were Known 

MHC proteins were initially identified as the main antigens recognized in trans- 
plantation reactions. When organ grafts are exchanged between adult individu- 
als, either of the same species (allografts) or of different species (xenografts), they 
are usually rejected. In the 1950s, skin grafting experiments between different 
strains of mice demonstrated that graft rejection is an adaptive immune response 
to the foreign antigens on the surface of the grafted cells. Rejection is mediated 
mainly by T cells, which react against genetically "foreign" versions of cell- surface 
proteins called histocompatibility molecules (from the Greek word histos, mean- 
ing "tissue"). The MHC proteins encoded by the clustered genes of the major his- 
tocompatibility complex (MHC) are by far the most important of these. MHC 
proteins are expressed on the cells of all higher vertebrates. They were first 
demonstrated in mice, where they are called H-2 antigens (Tzisto compatibility- 2 



CYTOTOXIC T CELL 




HELPER OR REGULATORY T CELL 



TCR 



fragment 
of foreign 
protein 



dendritic cell 
or target cell 




fragment 
of foreign 
protein 



Figure 25-49 Recognition by T cells of 
foreign peptides bound to MHC 
proteins. Cytotoxic T cells recognize 
foreign peptides in association with class 
I MHC proteins, whereas helperT cells 
and regulatory T cells recognize foreign 
peptides in association with class II MHC 
proteins. In both cases, the T cell 
recognizes the peptide-MHC complexes 
on the surface of a dendritic cell or a 
target cell. 
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antigens). In humans they are called HLA antigens (Tzuman-Zeucocyte- associ- 
ated antigens) because they were first demonstrated on leucocytes (white blood 
cells). 

Three remarkable properties of MHC proteins baffled immunologists for 
many years. First, MHC proteins are overwhelmingly the antigens that are recog- 
nized in T-cell-mediated transplantation reactions. Second, an unusually large 
fraction of T cells are able to recognize foreign MHC proteins: whereas fewer than 
0.001% of an individual's naive T cells respond to a typical viral antigen, up to 10% 
of them respond to the foreign MHC proteins of another individual. Third, some 
of the genes that code for MHC proteins are the most polymorphic known in 
higher vertebrates. That is, within a species, there are an extraordinarily large 
number of alleles (alternative forms of the same gene) present (in some cases 
more than 400), without any one allele predominating. As each individual has at 
least 12 genes encoding MHC proteins (as discussed later), it is very rare for two 
unrelated individuals to have an identical set of MHC proteins. These differences 
make it very difficult to match donor and recipient for organ transplantation 
unless they are closely related. 

Of course, a vertebrate does not need to protect itself against invasion by 
foreign vertebrate cells. So the apparent obsession of its T cells with foreign 
MHC proteins and the extreme polymorphism of these molecules were a great 
puzzle. The puzzle was at least partly solved when researchers discovered that 
(1) MHC proteins bind fragments of foreign proteins and display them on the 
surface of host cells for T cells to recognize, and (2) T cells respond to foreign 
MHC proteins in the same way they respond to self MHC proteins that have for- 
eign antigen bound to them. 



Class I and Class II MHC Proteins Are Structurally Similar 
Heterodimers 

Class I and class II MHC proteins have very similar overall structures. They are 
both transmembrane heterodimers with extracellular N-terminal domains that 
bind antigen for presentation to T cells. 

Class I MHC proteins consist of a transmembrane a chain, which is encoded 
by a class I MHC gene, and a small extracellular protein called ^-microglobulin 
(Figure 25-50A). The p2- microglobulin does not span the membrane and is 
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Figure 25-50 Class I and class II MHC 
proteins. (A) The a chain of the class I 
molecule has three extracellular domains, 
oci, 0C2, and 0:3, encoded by separate 
exons. It is noncovalently associated with 
a smaller polypeptide chain, 
p 2 -microglobulin, which is not encoded 
within the MHC. The 0C3 domain and 
P2-microglobulin are Ig-like. While 
p 2 -microglobulin is invariant, the 
a chain is extremely polymorphic, mainly 
in the oci and 0C2 domains. (B) In class II 
MHC proteins, both chains are 
polymorphic, mainly in the oci and Pi 
domains; the 0C2 and P2 domains are 
Ig-like. Thus, there are striking similarities 
between class I and class II MHC proteins. 
In both, the two outermost domains 
(shaded in blue) are polymorphic and 
interact to form a groove that binds 
peptide fragments of foreign proteins 
and presents them to T cells. 
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encoded by a gene that does not lie in the MHC gene cluster. The a chain is 
folded into three extracellular globular domains (ai, 0C2, and as), and the 0C3 
domain and the p2 -microglobulin, which are closest to the membrane, are both 
similar to an Ig domain. The two N-terminal domains of the a chain, which are 
farthest from the membrane, contain the polymorphic (variable) amino acids 
that T cells recognize in transplantation reactions. These domains bind a pep- 
tide and present it to cytotoxic T cells. 

Like class I MHC proteins, class II MHC proteins are heterodimers with two 
conserved Ig-like domains close to the membrane, and two polymorphic (vari- 
able) N-terminal domains farthest from the membrane. In these proteins, how- 
ever, both chains (a and (3) are encoded by genes within the MHC, and both span 
the membrane (Figure 25-50B). The two polymorphic domains bind a peptide 
and present it to helper or regulatory T cells. 

The presence of Ig-like domains in class I and class II proteins suggests that 
MHC proteins and antibodies have a common evolutionary history. The loca- 
tions of the genes that encode class I and class II MHC proteins in humans are 
shown in Figure 25-51, where we illustrate how an individual can make six types 
of class I MHC proteins and more than six types of class II proteins. 

In addition to the classic class I MHC proteins, there are many class-I-MHC- 
like proteins, which form dimers with P2- microglobulin. These proteins are 
encoded by genes outside the MHC and are much less polymorphic than MHC 
proteins, but some of them present specific microbial antigens, including some 
lipids and glycolipids, to T cells. Although the functions of most of them are 
unknown, some have a role in brain development. 



An MHC Protein Binds a Peptide and Interacts with a T Cell 
Receptor 

Any individual can make only a small number of different classic MHC proteins, 
which together must be able to present peptide fragments from almost any foreign 
protein to T cells. Thus, unlike an antibody molecule, each MHC protein has to be 
able to bind a very large number of different peptides. X-ray crystallographic anal- 
yses of MHC proteins have revealed the structural basis for this versatility. 

As shown in Figure 25-52A, a class I MHC protein has a single peptide-bind- 
ing site located at one end of the molecule, facing away from the plasma mem- 
brane. This site consists of a deep groove between two long a helices; the groove 
narrows at both ends so that it is only large enough to accommodate an 
extended peptide about 8-10 amino acids long. In fact, when a class I MHC pro- 
tein was first analyzed by x-ray crystallography, this groove contained bound 
peptides that had co- crystallized with the MHC protein (Figure 25-52B), sug- 
gesting that once a peptide binds to this site it does not normally dissociate. 

A typical peptide binds in the groove of a class I MHC protein in an extended 
conformation, with its terminal amino group bound to invariant amino acids of 
the MHC protein at one end of the groove and its terminal carboxyl group bound 
to invariant amino acids at the other end of the groove (Figure 25-53). Some 
amino acid side chains of the peptide bind to variable (polymorphic) amino 
acids of the MHC protein distributed along the groove, while other side chains 
point outward, in a position to be recognized by TCRs on cytotoxic T cells. 
Because the invariant amino acids of the MHC protein at the ends of the groove 
recognize features of the peptide backbone that are common to all peptides, 
each allelic form of a class I MHC protein can bind a large variety of peptides of 
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Figure 25-51 Human MHC genes. This 

simplified schematic drawing shows the 
location of the genes that encode the 
transmembrane subunits of class I (light 
green) and class II (dark green) MHC 
proteins. The genes shown encode three 
types of class I proteins (HLA-A, HLA-B, 
and HLA-C) and three types of class II 
MHC proteins (HLA-DP, HLA-DQ, and 
HLA-DR). An individual can therefore 
make six types of class I MHC proteins 
(three encoded by maternal genes and 
three by paternal genes) and more than 
six types of class II MHC proteins. The 
number of class II MHC proteins that can 
be made is greater than six because there 
are two DR p genes and because 
maternally encoded and paternally 
encoded polypeptide chains can 
sometimes pair. 
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Figure 25-52 The three-dimensional structure of a human class I 
MHC protein as determined by x-ray diffraction analysis of crystals 
of the extracellular part of the molecule. The extracellular part of 
the protein was cleaved from the transmembrane segment by a 
proteolytic enzyme (papain) before crystallization. (A) Each of the two 
domains closest to the plasma membrane (a 3 and (^-microglobulin) 
resembles a typical Ig domain (see Figure 25-34), while the two 
domains farthest from the membrane (ai and 0C2) are very similar to 
each other and together form a peptide-binding groove at the top of 
the molecule. (B) The peptide-binding groove viewed from above. 
The small peptides that co-purified with the MHC protein are shown 
schematically. (After P.J. Bjorkman et al., Nature 329:506-512, 1987. 
With permission from Macmillan Publishers Ltd.) 
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diverse sequence. At the same time, the polymorphic MHC amino acids along 
the groove, which bind specific side chains of the peptide, ensure that each 
allelic form binds and presents a distinct characteristic set of peptides. Thus, the 
six types of class I MHC proteins in an individual can present a broad range of 
foreign peptides to the cytotoxic T cells, but in each individual they do so in 
slightly different ways. 

The three-dimensional structure of class II MHC proteins is very similar to 
that of class I proteins, but the antigen-binding groove does not narrow at the 
ends, so it can accommodate longer peptides, which are usually 12-20 amino 
acids long. Moreover, the peptide is not bound at its ends but is instead held by 
interactions with invariant amino acids of the MHC protein distributed along 
the length of the groove (Figure 25-54). As in the case of class I MHC proteins, 
side chains of other amino acids in the peptide either bind to polymorphic MHC 
amino acids along the groove or point upward to be recognized by TCRs on 
helper or regulatory T cells. A class II MHC groove can accommodate a more het- 
erogeneous set of peptides than can a class I MHC groove. Thus, although an 
individual makes only a small number of types of class II proteins, each with its 
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sheet 



own unique peptide-binding groove, together these proteins can bind and pre- 
sent an enormous variety of foreign peptides to helper T cells, which have a cru- 
cial role in almost all adaptive immune responses. 

X-ray crystallographic analyses of complexes formed between a soluble TCR 
and a soluble MHC protein with peptide in its binding groove revealed the way 
in which the TCR recognizes a peptide-MHC complex. Recombinant DNA tech- 
nology produced the soluble proteins for these experiments. In each case stud- 
ied, the TCR fits diagonally across the peptide-binding groove and binds 
through its V a and Vp hypervariable loops to both the walls of the groove and the 
peptide (Figure 25-55). Soluble peptide-MHC complexes are now widely used 
to detect T cells with a particular specificity; they are usually cross-linked into 
tetramers so that they can bind to four TCRs on the T cell surface with strong 
avidity. 



Figure 25-53 A peptide bound in the 
groove of a class I MHC protein. <AAGT> 
Schematic drawing of a top view of the 
groove. A ribbon drawing of the MHC 
groove is shown in gray; it is formed by 
the ai and a 2 domains of the protein 
(see Figures 25-50A and 25-52A). The 
backbone of the bound peptide is shown 
in yellow, with the carbon atoms in black, 
oxygen atoms in red, and nitrogen atoms 
in blue; the amino terminus of the 
peptide is to the left. Note that the 
terminal amino and carboxyl groups of 
the peptide backbone bind via hydrogen 
and ionic bonds (shown as dotted blue 
lines) to the side chains of invariant MHC 
amino acids of the MHC protein towards 
the ends of the groove (shown in full). 
Although not illustrated in the drawing, 
the side chains of some amino acids of 
the peptide bind to variable 
(polymorphic) amino acids of the groove, 
while others point upward and can be 
recognized by TCRs on a cytotoxic T cell. 
(Courtesy of Paul Travers.) 



MHC Proteins Help Direct T Cells to Their Appropriate Targets 

Virtually all nucleated vertebrate cells express class I MHC proteins. This is pre- 
sumably because effector cytotoxic T cells must be able to focus on and kill any 
cell in the body that becomes infected with an intracellular microbe such as a 
virus. Class II proteins, by contrast, are normally confined largely to cells that 
take up foreign antigens from the extracellular fluid and interact with helper T 
cells. These cells also express class I MHC proteins. They include dendritic cells, 
which initially activate naive helper T cells, as well as the targets of effector 
helper T cells, such as macrophages and B cells. 

It is important that effector cytotoxic T cells focus their attack mainly on 
cells that make the foreign antigens (such as viral proteins), while helper T cells 
focus their help mainly on cells that have taken up foreign antigens from the 
extracellular fluid. Since the former type of target cell is always a menace, while 




peptide- 
binding groove 



Figure 25-54 A peptide bound in the 
groove of a class II MHC protein. 

<GAAA> Schematic drawing similar to 
that shown in Figure 25-53. The groove is 
formed by the amino terminal domains 
of the a and p chains (ai and Pi — see 
Figure 25-50B). Note that the peptide 
extends beyond the ends of the groove 
and that its backbone binds through 
hydrogen bonds distributed along the 
length of the peptide to invariant amino 
acid side chains in the groove. (Courtesy 
of Paul Travers.) 
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the latter type is essential for the body's adaptive immune defenses, it is vitally 
important that T cells do not confuse the two target cells and misdirect their 
cytotoxic and helper functions. Therefore, in addition to the antigen receptor 
that recognizes a peptide-MHC complex, each of the three major classes of T 
cells also expresses a co-receptor that recognizes a separate, invariant part of the 
appropriate class of MHC protein. These two co-receptors, called CD4 and CD8, 
help direct helper (and regulatory) T cells and cytotoxic T cells, respectively, to 
their appropriate targets. Table 25-2 reviews the properties of class I and class II 
MHC proteins. 



Figure 25-55 The interaction of a T cell 
receptor with a viral peptide bound to a 
class I MHC protein. (A) Schematic view of 
the hypervariable loops of the V a and Vp 
domains of the T cell receptor interacting 
with the peptide and the walls of the 
peptide-binding groove of the MHC 
protein. Note that the third hypervariable 
loops, which are the most variable, 
primarily interact with the peptide, 
whereas the other hypervariable loops 
primarily bind to the walls of the peptide- 
binding groove. (B) Drawing of the 
"footprint" of the V domains (light blue) and 
hypervariable loops (dark blue) of the 
receptor over the peptide-binding groove, 
as determined by x-ray diffraction. The V a 
domain covers the amino half of the 
peptide, while the Vp domain covers the 
carboxyl half. Note that the receptor is 
oriented diagonally across the peptide- 
binding groove. (B, adapted from 
D.N. Garboczi et al., Nature 384:134-141, 
1996. With permission from Macmillan 
Publishers Ltd.) 



CD4 and CD8 Co-Receptors Bind to Invariant Parts of MHC 
Proteins 

The affinity of TCRs for peptide-MHC complexes on an antigen-presenting cell 
or target cell is usually too low by itself to mediate a functional interaction 
between the two cells. T cells normally require accessory receptors to help stabi- 
lize the interaction by increasing the overall strength of the cell-cell adhesion. 
Unlike TCRs or MHC proteins, the accessory receptors do not bind foreign anti- 
gens and are invariant. 

When accessory receptors have a direct role in activating the T cell by gen- 
erating their own intracellular signals, they are called co-receptors. The most 
important and best understood of the co-receptors on T cells are the CD4 and 
CD8 proteins, both of which are single-pass transmembrane proteins with extra- 
cellular Ig-like domains. Like TCRs, they recognize MHC proteins, but, unlike 
TCRs, they bind to invariant parts of the protein, far away from the peptide- 
binding groove. CD4 is expressed on both helper T cells and regulatory T cells 
and binds to class II MHC proteins, whereas CD8 is expressed on cytotoxic T 
cells and binds to class I MHC proteins (Figure 25-56). Thus, CD4 and CD8 con- 
tribute to T cell recognition by helping the T cell to focus on particular MHC pro- 
teins, and thereby on particular types of target cells: the recognition of class I 
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Class I and Class II MHC Proteins 






CLASS 1 


CLASS II 


Genetic loci 


HLA-A, HLA-B, HLA-C 


DP, DQ, DR 


Chain structure 


a chain + p2-microglobulin 


a chain + p chain 


Cell distribution 


most nucleated cells 


dendritic cells, B cells, macrophages, thymus epithelial 
cells, some others 


Presents antigen to 


cytotoxic T cells 


helper T cells, regulatory T cells 


Source of peptide fragments 


mainly proteins made in cytoplasm 


mainly endocytosed plasma membrane and 
extracellular proteins 


Polymorphic domains 


OCi + 0C2 


oci+pi 


Recognition by co-receptor 


CD8 


CD4 
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Figure 25-56 CD4 and CD8 co-receptors on the surface of T cells. 

Cytotoxic T cells (T c ) express CD8, which recognizes class I MHC proteins, 
whereas helper T cells (Th) and regulatory T cells (not shown) express CD4, 
which recognizes class II MHC proteins. Note that the co-receptors bind to 
the same MHC protein that theTCR has engaged, so that they are brought 
together with TCRs during the antigen recognition process. Whereas the 
TCR binds to the variable (polymorphic) parts of the MHC protein that form 
the peptide-binding groove, the co-receptor binds to the invariant part, 
well away from the groove. 



MHC proteins allows cytotoxic T cells to focus on any host cell, while the recog- 
nition of class II MHC proteins allows helper T cells to focus on a small subset of 
cells — most notably dendritic cells, macrophages, and B cells. The cytoplasmic 
tail of the CD4 and CD8 proteins is associated with a member of the Src family 
of cytoplasmic tyrosine kinases called Lck, which phosphorylates various intra- 
cellular proteins on tyrosines and thereby participates in the activation of the T 
cell (discussed in Chapter 15). Antibodies to CD4 and CD8 are widely used as 
tools to help distinguish between the main classes of T cells, in both humans and 
experimental animals: whereas only cytotoxic T cells express CD8, both helper 
and regulatory T cells express CD4. 

Ironically, the AIDS virus (HIV) uses CD4 molecules (as well as chemokine 
receptors) to enter helper T cells. AIDS patients are susceptible to infection by 
microbes that are not normally dangerous because HIV depletes helper T cells. 
As a result, most AIDS patients die of infection within several years of the onset 
of symptoms, unless they are treated with a combination of powerful anti-HIV 
drugs. HIV also uses CD4 and chemokine receptors to enter macrophages, 
which also have both of these receptors on their surface. 

Before a T cell can recognize a foreign protein, the protein has to be pro- 
cessed inside an antigen-presenting cell or target cell so that it can be displayed 
as peptide-MHC complexes on the cell surface. We now consider how a virus- 
infected antigen-presenting cell or target cell processes viral proteins for pre- 
sentation to a cytotoxic T cell. We then discuss how ingested foreign proteins are 
processed for presentation to a helper T cell. 



Cytotoxic T Cells Respond to Fragments of Foreign Cytosolic 
Proteins in Association with Class I MHC Proteins 

An experiment performed in the 1970s provided dramatic evidence that class I 
MHC proteins are involved in the recognition of viral antigens by cytotoxic T 
cells. Researchers found that effector cytotoxic T cells from a virus -infected 
mouse could kill cultured cells infected with the same virus only if these target 
cells expressed some of the same class I MHC proteins as the infected mouse. 
This experiment demonstrated that the cytotoxic T cells of any individual can 
recognize a specific foreign antigen on a target cell only when the target cell 
expresses at least some of the allelic forms of class I MHC proteins expressed by 
that individual, a phenomenon known as MHC restriction (Figure 25-57) . 



virus A 




strain-X mouse 



cytotoxic 

T cells 
added to 



stram-X mouse / ^x 

fibroblasts p\Ch 

infected with 

virus B ^<^y 

stram-X mouse ^-7 
fibroblasts 
infected with 
virus A 

strain-Y mouse 
fibroblasts 
infected with 
virus A 



T cells kill 
target cells 

NO 



m> 



YES 



NO 



T C 





class 1 MHC 


tein 


protein 

/ 


3^ 


fe 


"7 


dendritic cell 


TCR 


or target cell 


variable 


invariant 


part 


part 




CD4 protein 

class II MHC protein 



dendritic cell 
or target cell 



Figure 25-57 The classic experiment 
showing that an effector cytotoxic T cell 
recognizes some aspect of the surface 
of the host target cell in addition to a 
viral antigen. Mice of strain X are 
infected with virus A. Seven days later, 
the spleens of these mice contain 
effector cytotoxic T cells able to kill virus- 
infected, strain-X fibroblasts in cell 
culture. As expected, they kill only 
fibroblasts infected with virus A and not 
those infected with virus B. Thus, the 
cytotoxic T cells are virus-specific. The 
same T cells, however, are also unable to 
kill fibroblasts from strain-Y mice infected 
with the same virus A, indicating that the 
cytotoxic T cells recognize a genetic 
difference between the two kinds of 
fibroblasts and not just the virus. Pinning 
down the difference required the use of 
special strains of mice (known as 
congenic strains) that either were 
genetically identical except for the alleles 
at their class I MHC loci or were 
genetically different except for these 
alleles. In this way, it was found that the 
killing of infected target cells required 
that they express at least one of the same 
class I MHC alleles as expressed by the 
original infected mouse. This result 
suggested that class I MHC proteins are 
somehow necessary to present cell- 
surface-bound viral antigens to effector 
cytotoxic T cells and that they do so in a 
highly specific manner. 
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Figure 25-58 The peptide-transport problem. How do peptide fragments 
get from the cytosol, where they are produced, into the ER lumen, where 
the peptide-binding grooves of class I MHC proteins are located? A special 
transport process is required. 



Subsequent evidence indicated that, in killing a virus -infected cell, a cyto- 
toxic T cell recognizes degraded fragments of internal proteins of the virus that 
are bound to class I MHC proteins on the infected cell surface. Because a T cell 
can recognize tiny amounts of foreign antigen (as few as 1-10 peptide-MHC 
complexes for T cells with the highest affinity receptors), only a tiny fraction of 
the fragments generated from viral proteins have to bind to class I MHC proteins 
and get to the cell surface to trigger an effector cytotoxic T cell to attack. 

The internal viral proteins are synthesized in the cytosol of the infected cell. 
As discussed in Chapter 3, proteolytic degradation in the cytosol is mainly medi- 
ated by an ATP- and ubiquitin- dependent mechanism that operates in protea- 
somes — large proteolytic enzyme complexes constructed from many different 
protein subunits. All proteasomes are probably able to generate peptide frag- 
ments of appropriate length to fit in the groove of class I MHC proteins, as even 
bacterial proteasomes can do so. This suggests that the MHC groove evolved to 
fit peptides of around this length. Nonetheless, some proteasomes are appar- 
ently specialized for producing peptides for class I MHC proteins, as they con- 
tain two subunits that are encoded by genes located within the MHC chromo- 
somal region. 

How do peptides generated in the cytosol make contact with the peptide- 
binding groove of class I MHC proteins in the lumen of the endoplasmic reticu- 
lum (Figure 25-58)? The answer initially came through observations of mutant 
cells in which class I MHC proteins are not expressed at the cell surface but are 
instead degraded within the cell. The mutant genes in these cells proved to 
encode subunits of a protein belonging to the family of ABC transporters, which 
we discuss in Chapter 11. This transporter is located in the ER membrane and 
uses the energy of ATP hydrolysis to pump peptides from the cytosol into the ER 
lumen. The genes encoding its two subunits are located in the MHC chromoso- 
mal region, and if either gene is inactivated by mutation, cells are unable to sup- 
ply peptides to class I MHC proteins. It is striking that the class I MHC proteins 
in such mutant cells are degraded inside the cell without reaching the cell sur- 
face. This occurs because peptide binding is normally required for the proper 
folding of these proteins: until it binds a peptide, a class I MHC protein remains 
in the ER, tethered to an ABC transporter by a chaperone protein; without a pep- 
tide bound, the trapped MHC proteins in the mutant cells eventually undergo 
proteolysis (Figure 25-59) . 

In all cells, peptide fragments come from the cells' own cytosolic and 
nuclear proteins that are degraded in the processes of normal protein turnover 
and quality control mechanisms. (Surprisingly, more than 30% of the proteins 
made by mammalian cells are apparently faulty and are degraded in protea- 
somes soon after they are synthesized.) These peptides are constantly being 
pumped into the ER and are carried to the cell surface by class I MHC proteins. 
They are not antigenic, however, because the cytotoxic T cells that could recog- 
nize them have either been eliminated, inactivated, or suppressed by regulatory 
T cells in the process of self- tolerance (see Figure 25-13). 

When an antigen activates cytotoxic T cells or TrI helper T cells to become 
effector cells, the effector cells secrete the cytokine interferon- Y(IFNy), which 
greatly enhances anti-viral responses. The IFNyacts on virus-infected host cells 
in two ways. It blocks viral replication, and it increases the expression of many 
genes within the MHC chromosomal region. These genes include those that 
encode class I MHC proteins, the two specialized proteasome subunits, and the 
two subunits of the peptide transporter located in the ER (Figure 25-60). Thus, 
the machinery in a host cell that is required for presenting viral antigens to cyto- 
toxic T cells is coordinately called into action by IFNy, creating a positive feed- 
back loop that amplifies the immune response and culminates in the death of 
the infected cell. 
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Helper T Cells Respond to Fragments of Endocytosed Foreign 
Protein Associated with Class II MHC Proteins 

Unlike cytotoxic T cells, helper T cells do not act directly to kill infected cells. 
Instead, they stimulate macrophages to be more effective in destroying intracel- 
lular microorganisms, and they help B cells and cytotoxic T cells to respond to 
microbial antigens. 

Like the viral proteins presented to cytotoxic T cells, the proteins presented 
to helper T cells on dendritic cells or target cells are degraded fragments of for- 
eign proteins. The fragments are bound to class II MHC proteins in much the 
same way that virus-derived peptides are bound to class I MHC proteins. But 
both the source of the peptide fragments presented and the route they take to 
find the MHC proteins generally differ. 

Rather than being derived from foreign protein synthesized in the cytosol of 
a cell, the foreign peptides presented to helper T cells are derived from endo- 
somes. Some come from extracellular microbes or their products that the antigen- 
presenting cell has endocytosed and degraded in the acidic environment of its 
endosomes. Others come from microbes growing within the endocytic compart- 
ment of the antigen-presenting cell. In either case, the peptides do not have to be 
pumped across a membrane because they are generated in a compartment that is 
topologically equivalent to the extracellular space. Instead of entering the lumen 
of the ER, where the class II MHC proteins are synthesized and assembled, they 
bind to preassembled class II heterodimers in an endosomal compartment. When 
the peptide binds, the class II MHC protein alters its conformation, trapping the 



Figure 25-59 The processing of a viral 
protein for presentation to cytotoxic 
T cells. An effector cytotoxic T cell kills a 
virus-infected cell when it recognizes 
fragments of an internal viral protein 
bound to class I MHC proteins on the 
surface of the infected cell. Not all viruses 
enter the cell in the way that this 
enveloped RNA virus does, but fragments 
of internal viral proteins always follow the 
pathway shown. Only a small proportion 
of the viral proteins synthesized in the 
cytosol are degraded and transported to 
the cell surface, but this is sufficient to 
attract an attack by a cytotoxic T cell. 
Several chaperone proteins (only one of 
which is shown) in the ER lumen aid the 
folding and assembly of class I MHC 
proteins. The chaperones bind to the 
class I MHC a chain and act sequentially. 
The last one binds the MHC protein to 
the ABC transporter, as shown. The 
assembly of class I MHC proteins and 
their transport to the cell surface requires 
the binding of peptide. 
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Figure 25-60 Some effects of interferon-y 
(IFNy) on virus-infected cells. The activated 
IFNy receptors signal to the nucleus, 
altering gene transcription, which leads to 
the effects indicated. The effects shaded in 
yellow tend to make the infected cell a 
better target for killing by an effector 
cytotoxic T cell. 
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Figure 25-61 The processing of an extracellular protein antigen by a dendritic 
cell for presentation to a helper T cell. The drawing shows a simplified view of 
how peptide-class-ll-MHC complexes are formed in endosomes and delivered to 
the cell surface. Note that the release of the invariant-chain fragment from the 
binding groove of the class II MHC protein in the endosome is catalyzed by a class- 
ll-MHC-like protein called HLA-DM. Viral glycoproteins can also be processed by 
this pathway for presentation to helper T cells. These proteins of the viral envelope 
are normally made in the ER and then insert into the plasma membrane. Although 
most of these viral glycoproteins will be incorporated into the envelope of budding 
viral particles, some will enter endosomes after they are endocytosed and thereby 
enter the class II MHC pathway. 



peptide in the binding groove and transferring it to the cell surface for presenta- 
tion to helper T cells. 

A newly synthesized class II MHC protein must avoid clogging its binding 
groove prematurely in the ER lumen with peptides pumped in from the cytosol. 
A special polypeptide, called the invariant chain, ensures this by associating 
with newly synthesized class II MHC heterodimers in the ER. Part of its polypep- 
tide chain lies within the peptide-binding groove of the MHC protein, thereby 
blocking the groove from binding peptides in the lumen of the ER. The invariant 
chain also directs class II MHC proteins from the trans Golgi network to a late 
endosomal compartment. Here, proteases cleave the invariant chain, leaving 
only a short fragment bound in the peptide-binding groove of the MHC protein. 
This fragment is then released, freeing the MHC protein to bind peptides 
derived from endocytosed proteins (Figure 25-61). In this way, the functional 
differences between class I and class II MHC proteins are ensured — the former 
mainly presenting molecules that come from the cytosol, the latter mainly pre- 
senting molecules that come from the endocytic compartment. 

This distinction between antigen presentation to cytotoxic T cells and to 
helper T cells, however, is not absolute. Dendritic cells, for example, need to be 
able to activate cytotoxic T cells to kill virus-infected cells even when the virus 
does not infect dendritic cells themselves. To do so, dendritic cells use a process 
called cross-presentation, which begins when they phagocytose fragments of 
virus-infected cells. They then actively transport viral proteins out of the phago- 
some into the cytosol, where they are degraded in proteasomes; resulting frag- 
ments of the viral proteins are then transported into the ER lumen, where they 
load onto assembling class I MHC proteins. Cross-presentation in dendritic cells 
also operates to activate cytotoxic T cells against tumor antigens of cancer cells 
and against the MHC proteins of foreign organ grafts. 

Most of the class I and class II MHC proteins on the surface of a target cell 
have peptides derived from self proteins in their binding groove. For class I pro- 
teins, the fragments mainly derive from degraded cytosolic and nuclear proteins. 
For class II proteins, they mainly derive from degraded proteins that originate in 
the plasma membrane or extracellular fluid and are endocytosed. Only a small 
fraction of the 10 5 or so class II MHC proteins on the surface of an antigen-pre- 
senting cell will have foreign peptides bound to them. This is sufficient, however: 
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even a single copy of such a peptide-MHC complex on a dendritic cell can acti- 
vate a helper T cell that has a TCR that binds the complex with a high enough 
affinity. 
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Potentially Useful T Cells Are Positively Selected in the Thymus 

We have seen that T cells recognize antigen in association with self MHC pro- 
teins but not in association with foreign MHC proteins (see Figure 25-57): that 
is, T cells show MHC restriction. This restriction results from a process of posi- 
tive selection during T cell development in the thymus. In this process, those 
immature T cells (thymocytes) that will be capable of recognizing foreign pep- 
tides presented by self MHC proteins are selected to survive and mature, while 
most of the remainder, which would be of no use to the animal, undergo apop- 
tosis. Thus, MHC restriction is an acquired property of the immune system that 
emerges as T cells develop in the thymus. 

The most direct way to study the selection process is to follow the fate of a 
set of developing T cells of known specificity. This can be done by using trans- 
genic mice that express a specific pair of rearranged a and (3 TCR genes derived 
from a T cell clone of known antigen and MHC specificity. Such experiments 
show that the transgenic T cells mature in the thymus and populate the periph- 
eral lymphoid organs only if the transgenic mouse also expresses the same 
allelic form of MHC protein as is recognized by the transgenic TCR. If the mouse 
does not express the appropriate MHC protein, the transgenic T cells die in the 
thymus. Thus, the survival and maturation of a developing T cell depend on a 
match between its TCR and the MHC proteins expressed in the thymus (which 
have self peptides derived from the body's own proteins bound to them). Simi- 
lar experiments using transgenic mice in which MHC expression is confined to 
specific cell types in the thymus indicate that it is MHC proteins on epithelial 
cells in the cortex of the thymus that are responsible for this positive selection 
(Figure 25-62). 

After positively selected T cells leave the thymus, their continued survival as 
naive T cells depends on their continual stimulation by self-peptide-MHC com- 
plexes (and the cytokine IL 7); this stimulation is enough to promote cell survival 
but not enough to activate the T cells to proliferate or become effector or mem- 
ory cells. 

As part of the positive selection process in the thymus, developing T cells 
that express TCRs recognizing class I MHC proteins are selected to become 



Figure 25-62 The cellular organization 
of the human thymus. (A) A light 
micrograph of a stained section of one 
thymus lobule showing the outer cortex 
and inner medulla. (B) A schematic 
drawing of the lobule showing the 
cellular composition. The cortex contains 
immature thymocytes, and the medulla 
contains mature thymocytes. The 
thymocytes, macrophages, and dendritic 
cells develop from cells that migrate in 
from the bone marrow. The functions of 
these different regions and cell types will 
be discussed later, when we consider 
how developing thymocytes are selected 
for survival. Because of these selection 
processes, more than 95% of the 
thymocytes produced in the thymus die 
by apoptosis. The dead cells are rapidly 
phagocytosed and digested by the 
macrophages. (Adapted from K. Murphy 
et al., Janeway's Immunobiology, 7th ed. 
New York: Garland Science, 2008.) 
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cytotoxic cells, while T cells that express TCRs recognizing class II MHC pro- 
teins are selected to become either helper cells or regulatory cells. Thus, genet- 
ically engineered mice that lack cell- surface class I MHC proteins lack cytotoxic 
T cells specifically, whereas mice that lack class II MHC proteins lack both 
helper and regulatory T cells specifically The development of regulatory T cells 
depends on special groups of epithelial cells in the thymus medulla called Has- 
sall's corpuscles. 

The cells that are undergoing positive selection initially express both CD4 
and CD8 co-receptors, and these are required for the selection process: without 
CD4, helper and regulatory T cells fail to develop, and without CD8, cytotoxic T 
cells fail to develop. Once they develop, cytotoxic T cells lose CD4, and helper 
and regulatory T cells lose CD8. 

Positive selection still leaves a large problem to be solved. If developing cyto- 
toxic and helper T cells with receptors that recognize self peptides associated 
with self MHC proteins were to mature in the thymus and migrate to peripheral 
lymphoid tissues, they might wreak havoc. A second, negative selection process 
in the thymus is required to help avoid this potential disaster. 



Most Developing Cytotoxic and Helper T Cells That Could Be 
Activated by Self-Peptide-MHC Complexes Are Eliminated in the 
Thymus 

As discussed previously, a fundamental feature of the adaptive immune system 
is that it can learn to distinguish self from nonself and normally does not react 
against self molecules. An important mechanism in achieving this state of 
immunological self tolerance is the deletion in the thymus of developing self- 
reactive cytotoxic and helper T cells — that is, T cells whose TCRs bind strongly 
enough to the complex of a self peptide and a self MHC protein to become acti- 
vated. Because, as we discuss later, most B cells require helper T cells to respond 
to antigen, the elimination of self- reactive helper T cells also helps ensure that 
any self- reactive B cells that escape from the mechanisms responsible for B cell 
tolerance induction are harmless (see Figure 25-13). 

Before discussing the negative selection process that removes self- reactive T 
cells in the thymus, it is useful to speculate on the logic behind a two-step sys- 
tem that ends up selecting for the small fraction of developing T cells that 
express a TCR that binds weakly, but not strongly, to a self MHC protein carrying 
a self-peptide. As illustrated in Figure 25-63, it is thought that the production of 
a large repertoire of such T cells guarantees that at least a few of the T cells will 
be able to bind strongly to the complex of a foreign peptide with the same MHC 
protein, thus triggering an adaptive immune response. However, it is of course 
not enough for the thymus to select /or T cells that recognize self MHC proteins; 
it must also select against cytotoxic and helper T cells that could become acti- 
vated by self MHC proteins complexed with self peptides in peripheral lymphoid 
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Figure 25-63 Schematic drawing 
showing how a TCR selected in the 
thymus because it binds weakly to a self 
MHC protein complexed with a self 
peptide can bind strongly to the same 
MHC protein complexed with a foreign 
peptide. Because the self peptide 
complex gives weak binding and the 
foreign peptide is not present in the 
thymus, a T cell expressing this TCR in the 
thymus would be positively selected and 
also avoid negative selection. 
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Figure 25-64 The result of positive and 
negative selection in the thymus. Cells 
with TCRs that would enable them to 
respond to foreign peptides in 
association with self MHC proteins (see 
Figure 25-63) are positively selected: 
they survive, mature, and migrate to 
peripheral lymphoid organs. All of the 
other cells undergo apoptosis — either 
because they do not express TCRs that 
recognize self MHC proteins with self 
peptides bound or because they 
recognize such complexes too well and 
undergo negative selection. 

Although not shown, the cells 
undergoing positive selection initially 
express both CD4 and CD8 co-receptors. 
During the process of positive selection, 
helper T cells (Th), cytotoxic T cells (Tc), 
and regulatory T cells (T reg ) diverge by a 
poorly understood mechanism. In this 
process, helper cells and regulatory cells 
develop that express CD4 but not CD8 
and recognize peptides in association 
with class II MHC proteins, while 
cytotoxic cells develop that express CD8 
but not CD4 and recognize peptides in 
association with class I MHC proteins. 



organs. The overall goal can be achieved by (1) ensuring the death of cytotoxic 
and helper T cells that bind strongly to the self-peptide-MHC complexes in the 
thymus while (2) promoting the survival of those that bind weakly and (3) per- 
mitting the death of those that do not bind at all. Process 2 constitutes a major 
part of the positive selection we have just discussed. Process 1 is called negative 
selection, or clonal deletion in the thymus (see Figure 25-13). In both process 1 
and process 3, the cells die by apoptosis (Figure 25-64). 

The most convincing evidence for negative selection in the thymus derives 
once again from experiments with transgenic mice. The introduction of TCR 
transgenes encoding a receptor that recognizes a male-specific peptide antigen, 
for example, results in large numbers of mature T cells expressing the transgenic 
receptor in both the thymus and peripheral lymphoid organs of female mice, 
which do not express the peptide. Very few, however, are found in male mice, in 
which the cells die in the thymus before they have a chance to mature. Like pos- 
itive selection, negative selection requires the interaction of a TCR and a CD4 or 
CD8 co-receptor with an appropriate MHC protein. Unlike positive selection of 
developing helper and cytotoxic T cells, however, which occurs mainly on the 
surface of epithelial cells in the thymus cortex, negative selection of these cells 
occur in the thymus medulla, mainly on the surface of dendritic cells that are 
descendants of cells that have migrated into the thymus from the bone marrow. 



Some Organ-specific Proteins Are Ectopically Expressed in the 
Thymus Medulla 

After the discovery of negative selection of developing T cells in the thymus, 
immunologists wondered how T cells avoid responses against self proteins that 
are not present in the thymus. One explanation is that some potentially self- 
reactive T cells are deleted or functionally inactivated after they leave the thy- 
mus. This occurs when the cells recognize self peptides bound to MHC proteins 
on the surface of dendritic cells that have not been activated by pathogens and 
therefore do not provide appropriate activating signals. It can also occur via reg- 
ulatory T cells in the periphery that suppress the activity of some self- reactive 
effector T cells. These two mechanisms are examples of peripheral tolerance, 
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because, unlike T cell deletion in the thymus (central tolerance), they occur after 
T cells leave the thymus (see Figure 25-13). 

Recently, a third explanation has been discovered. A special class of epithe- 
lial cells in the thymus medulla ectopically expresses proteins previously 
thought to be expressed only outside the thymus in specific organs; insulin, for 
example, which is made by (3 cells in the pancreas, is also made by a small sub- 
set of medullary thymic epithelial cells. The ectopic expression of many of these 
proteins, including insulin, depends on a nuclear protein called autimmune reg- 
ulator (AIRE), which is specifically expressed in the same medullary thymic 
epithelial cells. Inactivation of the gene encoding AIRE in mice or humans 
results in a severe multi- organ autoimmune disease, indicating the importance of 
AIRE-dependent central tolerance to at least some "organ- specific" self proteins. It 
remains a mystery how AIRE promotes this ectopic expression of genes in the thy- 
mus medulla. 



The Function of MHC Proteins Helps Explain Their Polymorphism 

The role of MHC proteins in binding foreign peptides and presenting them to T 
cells helps explain the extensive polymorphism of these proteins. In the evolu- 
tionary war between pathogens and the adaptive immune system, pathogens 
tend to change their antigens to avoid associating with MHC proteins. When a 
pathogen succeeds, it is able to sweep through a population as an epidemic. In 
such circumstances, the few individuals that produce a new MHC protein that 
can associate with an antigen of the altered pathogen have a large selective 
advantage. In addition, individuals with two different alleles at any given MHC 
locus (heterozygotes) have a better chance of resisting infection than those with 
identical alleles at the locus, as they have a greater capacity to present peptides 
from a wide range of pathogens. Thus, this type of selection will tend to promote 
and maintain a large diversity of MHC proteins in the population. Support for 
the idea that infectious diseases have been a driving force for MHC polymor- 
phism has come from studies in West Africa. In this region, individuals with a 
specific MHC allele have a reduced susceptibility to a severe form of malaria. 
Although the allele is rare elsewhere, it is found in 25% of the West African pop- 
ulation where this form of malaria is common. 

If greater MHC diversity means greater resistance to infection, why do we 
each have so few MHC genes encoding these molecules? Why have we not 
evolved strategies for increasing the diversity of MHC proteins — by alternative 
RNA splicing, for example, or by the genetic recombination mechanisms used to 
diversify antibodies and TCRs? One reason for the restricted diversity of MHC 
proteins in an individual may be that each time a new MHC protein is added to 
the repertoire, the T cells with TCRs that bind strongly to self peptides bound to 
the new MHC protein must be eliminated to maintain self tolerance. The elimi- 
nation of these T cells would counteract the advantage of adding the new MHC 
protein. Thus, the number of MHC proteins we express is thought to represent a 
balance between the advantages of presenting a wide diversity of foreign pep- 
tides to T cells against the disadvantages of severely restricting the T cell reper- 
toire during negative selection in the thymus. Computer modeling studies sup- 
port this explanation. 



Summary 

There are three main functionally distinct classes of T cells. Cytotoxic T cells kill infected 
cells directly by inducing them to undergo apoptosis. Helper T cells help activate B cells 
to make antibody responses, cytotoxic T cells to kill their target cells, dendritic cells to 
stimulate T cell responses, and macrophages to destroy microorganisms that either 
invaded the macrophage or were ingested by it. Finally, regulatory T cells suppress the 
activity of effector T cells and dendritic cells and are crucial for self tolerance. 

All T cells express cell-surface, antibodylike receptors (TCRs), which are encoded 
by genes that are assembled from multiple gene segments during T cell development 
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in the thymus. TCRs recognize fragments of foreign proteins that are displayed on the 
surface of host cells in association with MHC proteins. T cells are activated in 
peripheral lymphoid organs by antigen-presenting cells, which express 
peptide-MHC complexes, co-stimulatory proteins, and various cell-cell adhesion 
molecules on their cell surface. The most potent of these antigen-presenting cells are 
dendritic cells, which are specialized for antigen presentation and are required for 
the activation of naive T cells. 

Class land class II MHC proteins have crucial roles in presenting foreign protein 
antigens to T cells: class I MHC proteins present antigens to cytotoxic T cells, while class 
II MHC proteins present antigens to helper and regulatory T cells. Whereas class I pro- 
teins are expressed on almost all vertebrate cells, class II proteins are normally 
restricted to those cell types that interact with helper T cells, such as dendritic cells, 
macrophages, and B lymphocytes. 

Both classes of MHC proteins have a single peptide-binding groove, which binds 
small peptide fragments derived from proteins. Each MHC protein can bind a large 
set of peptides, which are constantly being produced intracellularly by normal pro- 
tein degradation processes. However, class I MHC proteins mainly bind fragments 
produced in the cytosol, while class II MHC proteins mainly bind fragments pro- 
duced in endocytic compartments. After they have formed inside the target cell, the 
peptide-MHC complexes are transported to the cell surface. Complexes that contain 
a peptide derived from a foreign protein are recognized by TCRs, which interact with 
both the peptide and the walls of the peptide-binding groove of the MHC protein. T 
cells also express CD4 or CD8 co-receptors, which simultaneously recognize non- 
polymorphic regions of MHC proteins on the antigen-presenting cell or target cell: 
helper cells and regulatory cells express CD4, which recognizes class II MHC proteins, 
while cytotoxic T cells express CD8, which recognizes class I MHC proteins. 

A combination of positive and negative selection processes operates during T cell 
development in the thymus to shape the TCR repertoire. These processes help to ensure 
that only T cells with potentially useful receptors survive and mature, while all of the 
others die by apoptosis. First, T cells that can respond to peptides complexed with self 
MHC proteins are positively selected; subsequently, the T cells in this group that can 
react strongly with self peptides complexed with self MHC proteins are eliminated. 
Helper and cytotoxic T cells that leave the thymus with receptors that could react with 
self antigens are eliminated, functionally inactivated, or actively suppressed when they 
recognize self antigens on nonactivated dendritic cells. 



HELPER T CELLS AND LYMPHOCYTE ACTIVATION 

Helper T cells are arguably the most important cells in adaptive immunity, as 
they are required for almost all adaptive immune responses. They help activate 
B cells to secrete antibodies, and they help macrophages to destroy ingested 
pathogens. They also help activate cytotoxic T cells to kill infected target cells, at 
least partly by stimulating dendritic cells to activate naive cytotoxic cells more 
efficiently. As dramatically demonstrated in AIDS patients, without helper T cells 
we cannot even defend ourselves against many microbes that are normally 
harmless. 

Helper T cells themselves, however, can function only when activated to 
become effector cells. Naive helper cells are activated on the surface of dendritic 
cells, which become activated during the innate immune responses triggered by 
an infection. The innate responses, mainly via activated dendritic cells, also dic- 
tate what kind of effector cell a helper T cell will develop into, and they thereby 
determine the nature of the adaptive immune responses elicited. 

In this final section, we discuss the multiple signals that help activate aT cell 
and how a helper T cell, once activated to become an effector cell, helps activate 
other cells. We also consider how innate immune responses determine the 
nature of adaptive responses by stimulating helper T cells to differentiate into 
different types of effector cells. Finally, we discuss the probable evolutionary ori- 
gins of the Ig superfamily of proteins, which includes the MHC proteins, anti- 
bodies, and TCRs. 
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Figure 25-65 The TCR and its associated CD3 complex. All of the 

CD3 polypeptide chains (shown in green), except for the £ (zeta) chains, 
have extracellular Ig-like domains and are therefore members of the Ig 
superfamily. All of the four types of CD3 polypeptide chains form 
heterodimers or homodimers (as shown) and are rapidly phosphorylated 
on tyrosines in their intracellular domains following TCR activation (not 
shown). Some of these phosphorylated tyrosines then serve as docking 
sites for intracellular signaling proteins, as shown in Figure 25-66. 



Activated Dendritic Cells Use Multiple Mechanisms to Activate 
T Cells 

When a dendritic cell is activated during an infection, it changes its shape and 
migratory behavior, increases the amount of MHC proteins displayed on its sur- 
face, activates its antigen-processing pathways, and starts producing both cell- 
surface-bound co-stimulatory proteins and secreted cytokines (including 
chemokines). The dramatic changes also enable the dendritic cell to migrate to 
a peripheral lymhoid organ and activate naive T cells to become effector cells. 

The dendritic cell initially signals to the T cell through the T cell receptors 
(TCRs), which bind to a foreign peptide complexed with a class II MHC protein 
on an opposing dendritic cell surface. The TCR, however, does not act on its own 
to transmit the signal into the T cell. It is helped by a complex of invariant trans- 
membrane proteins called CD3, with which the TCR is associated (Figure 25-65) . 
Moreover, the CD4 co-receptor on a helper or regulatory T cell and the CD8 co- 
receptor on a cytotoxic T cell bind to the same MHC protein as the TCR and also 
play a crucial part in transmitting the signal, as illustrated in Figure 25-66. 

In addition to signaling through the TCR and its associated proteins and co- 
receptors, co-stimulatory proteins on the dendritic cell surface bind to other 
receptors on the T cell surface to provide further signals required to activate the 
T cell. Among these co- stimulatory proteins on the activated dendritic cell are 
the B7 proteins, which are recognized by the co-receptor protein CD28 on the 
surface of the T cell. Once activated, a helper T cell itself expresses a co-stimula- 
tory protein called CD40 ligand, which acts back on CD40 receptors on the den- 
dritic cell surface to increase and sustain the activation of the dendritic cell, cre- 
ating a positive feedback loop that amplifies the T cell response. 

Once bound to the surface of a dendritic cell, a T cell increases the strength 
of the binding by activating an integrin adhesion protein, which then binds 
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Figure 25-66 Signaling events initiated 
by the binding of peptide-MHC 
complexes to TCRs. When TCRs (and 
CD3) are clustered by binding to 
peptide-MHC complexes on an activated 
dendritic cell, CD4 molecules on helper 
cells or CD8 molecules on cytotoxic 
T cells are clustered with them, binding 
to invariant parts of the same class II or 
class I MHC proteins, respectively, on the 
dendritic cell. This brings the Src-like 
cytoplasmic tyrosine kinase Lck into the 
signaling complex and activates it. Lck 
activation also depends on a 
transmembrane protein tyrosine 
phosphatase on the T cell surface called 
CD45, which removes inhibitory 
phosphates from Lck (not shown). 

Once activated, Lck initiates a tyrosine 
phosphorylation cascade by 
phosphorylating tyrosines on all of the 
chains of the CD3 complex. The 
phosphotyrosines on the CD3 C, chain 
now serve as docking sites for yet 
another cytoplasmic tyrosine kinase 
called ZAP70. Lck phosphorylates, and 
thereby activates, ZAP70. Although not 
shown, ZAP70 then phosphorylates 
tyrosines on the tail of another 
transmembrane protein (called LAT), 
which then serve as docking sites for a 
variety of adaptor proteins and enzymes. 
These proteins then help relay the signal 
to the nucleus and other parts of the cell 
by activating the inositol phospholipid 
and MAP kinase signaling pathways 
(discussed in Chapter 15), as well as a Rho 
family GTPase that regulates the actin 
cytoskeleton (discussed in Chapter 16). 
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more strongly to its Ig-like ligand on the surface of the dendritic cell. This 
increased adhesion enables the T cell to remain bound to the antigen-present- 
ing cell long enough to become activated. 

This initial signaling through the TCR and associated proteins triggers the 
active assembly of an immunological synapse at the interface between the T 
cell and the dendritic cell. In these bull's- eye -like structures, TCRs and their 
associated CD3 subunits, co-receptors, and intracellular signaling proteins are 
clustered at the center, with cell-cell adhesion proteins forming a peripheral 
ring. Similar structures form when an effector helper or cytotoxic cell interacts 
with its target cell. Not all of the TCRs in the synapse are bound to a foreign pep- 
tide complexed with an MHC protein; some are bound to a self peptide bound 
to an MHC protein, and these TCRs also contribute to the activation of the T cell 
(recall that all T cells are initially positively selected in the thymus for their weak 
recognition of such self-peptide-MHC complexes). 

The combined actions of the various signals just discussed stimulate helper 
T cells to proliferate and to begin to differentiate into effector cells by a curiously 
indirect mechanism. The signals cause the T cells to help stimulate their own 
proliferation and differentiation by inducing the cells to secrete a cytokine called 
interleukin-2 (IL2) and simultaneously to synthesize high- affinity cell- surface 
receptors that bind it. The binding of IL2 to the IL2 receptors activates intracel- 
lular signaling pathways that turn on genes that help the T cells to proliferate 
and differentiate into effector cells (Figure 25-67). Although some T cells do not 
make IL2, as long as they have been activated by their antigen and therefore 
express IL2 receptors they can be helped to proliferate and differentiate by IL2 
made by neighboring activated T cells. IL2 also plays an important part in the 
development of regulatory T cells in the thymus, because without it these cells 
fail to develop. 

Dendritic cells are not only important for activating T cells, they are also 
important for inactivating or eliminating self-reactive T cells. When T cells rec- 
ognize self-peptide-MHC complexes on the surface of dendritic cells that have 
not been activated by a pathogen, they are either inactivated, so that they do not 
respond to the same peptide-MHC complexes even on activated dendritic cells, 
or they proliferate briefly and then die by apoptosis. Both of these mecha- 
nisms — clonal inactivation and clonal deletion — contribute to peripheral self 
tolerance. Dendritic cells also contribute to peripheral self tolerance by activa- 
ting regulatory T cells, which then suppress the activity of self- reactive effector T 
cells, although the details of how dendritic cells selectively activate regulatory T 
cells are poorly understood (see Figure 25-13). 
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During the multi-step activation of a T cell, the cell starts to express a cell- sur- 
face protein called CTLA4, which inhibits intracellular signaling. This protein 
resembles CD28, and, like CD28, it binds to B7 proteins on the surface of the 
activating dendritic cell (see Figure 25-67). CTLA4 binds B7 with much higher 
affinity than does CD28 and, in doing so, it blocks the activating activity of CD28, 
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Figure 25-67 The stimulation of 
T cells by IL2. This model can apply to 
both helper and cytotoxic T cells, at 
least in culture. The combination of 
peptide-MHC complexes and a 
co-stimulatory B7 protein (either B7-1 
or B7-2; also called CD80 and CD86, 
respectively) on the surface of an 
activated dendritic cell helps stimulate 
a resting T cell to make high affinity IL2 
receptors and to secrete IL2.The 
binding of IL2 to its receptors then 
helps stimulate the T cells to proliferate 
and differentiate into effector cells. The 
various proteins associated with the 
TCRs (see Figure 25-65) are not shown. 
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Table 25-3 Some Accessory Proteins on the Surface of T Cells 
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*CD stands for cluster of differentiation, as each of the CD proteins was originally defined as a blood cell "differentiation antigen" recognized by 
multiple monoclonal antibodies. Their identification depended on large-scale collaborative studies in which hundreds of such antibodies, 
generated in many laboratories, were compared and found to consist of relatively few groups (or "clusters"), each recognizing a single cell-surface 
protein. Since these initial studies, however, more than 240 CD proteins have been identified. 

thereby providing negative feedback that holds the activation process in check. 
Thus, mice with a disrupted Ctla4 gene die from a massive accumulation of acti- 
vated T cells. 

Table 25-3 summarizes some of the co-receptors and other accessory pro- 
teins found on the surface of T cells that we have discussed in this chapter. 

Most of the T (and B) effector cells produced during an immune response 
must be eliminated after they have done their job. Although most of the cells die 
by apoptosis, the extracellular mechanisms responsible for their elimination are 
poorly understood. One possibility is that, as antigen levels fall and the response 
subsides, effector T cells are deprived of the antigen and cytokine stimulation 
that they need to survive, so that only memory cells and some long-lived effec- 
tor cells survive. The death of effector T cells, however, does not occur solely 
through a lack of survival signals. In the case of effector cytotoxic T cells, for 
example, the cytokine interferon-y {IFNfi plays an important part in inducing 
the cell death; as effector cytotoxic T cells make IFNy (see Figure 25-60), this is 
another form of negative feedback. 

Before considering how effector helper T cells help activate macrophages 
and B cells, we need to discuss the two main functionally distinct subclasses of 
effector helper T cells, TrI and Tr2 cells, and how they are generated. 



The Subclass of Effector Helper T Cell Determines the Nature of 
the Adaptive Immune Response 

When an activated dendritic cell activates a naive helper T cell in a peripheral 
lymphoid organ, the T cell usually differentiates into a TrI or Tr2 effector helper 
cell. The outcome depends on the affinity of the TCR for the peptide-MHC com- 
plex, on the density of the complex on the dendritic cell surface, and on the 
nature of the dendritic cell. 

The two main subclasses of effector helper T cells can be distinguished by 
the cytokines they secrete. TrI cells mainly secrete IFNy end tumor necrosis fac- 
tor-a (TNFa), which activate macrophages to kill microbes located within the 
macrophages' phagosomes. They also help activate cytotoxic T cells to kill 
infected cells. Thus, TrI cells mainly defend an animal against intracellular 
microbes. They also, however, stimulate B cells to secrete specific subclasses of 
IgG antibodies that can coat extracellular microbes and activate complement, 
thereby helping to eliminate some extracellular microbes as well. 

Tr2 cells, by contrast, mainly defend an animal against extracellular 
pathogens, including microbes and multicellular parasites. They secrete a variety 
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of cytokines, including interleukins 4 and 10 (IL4 and IL10), and they help stim- 
ulate B cells to make most classes of antibodies, including IgM, IgA, IgE, and 
some subclasses of IgG. Some of these antibodies bind to mast cells, basophils, 
and eosinophils; when activated by antigen binding, these cells release local 
mediators that cause sneezing, coughing, or diarrhea and thereby help expel 
extracellular microbes and larger parasites from epithelial surfaces of the body 

Thus, the decision of naive helper T cells to differentiate into ThI or Th2 
effector cells influences the type of adaptive immune response that an animal 
mounts against the pathogen — whether it will be dominated by macrophage 
activation or by antibody production. The specific cytokines produced during 
the process of helper T cell activation have an important influence on the type 
of effector helper cell produced. Some intracellular bacteria, for example, stim- 
ulate dendritic cells to produce IL12, which induces TrI development, and 
thereby macrophage activation. As expected, mice that are deficient in either 
IL12 or its receptor are much more susceptible to these bacterial infections than 
are normal mice. Many parasitic protozoa and worms, by contrast, stimulate a 
dendritic cell to express the Jagged protein on its surface. Jagged is an activating 
ligand for Notch receptors (discussed in Chapters 15 and 22) on the T cell surface, 
and the resulting Notch signaling helps induce Tr2 development and the pro- 
duction of IL4. The Tr2 cells and IL4 help stimulate antibody production and 
eosinophil activation, leading to parasite expulsion (Figure 25-68). The IL4 also 
generates a positive feedback loop, as it is a potent inducer of Tr2 development. 

Once aTnl or Tr2 effector cell develops, it inhibits the differentiation of the 
other type of helper T cell. IFNy produced by ThI cells inhibits the development 
of Tr2 cells, while IL4 and IL10 produced by Tr2 cells inhibit the development 
of TrI cells. Thus, as the response proceeds, it reinforces the initial choice 
through its effect on the response of other T cells nearby. 

Individuals infected with Mycobacterium leprae, the bacterium that causes 
leprosy, illustrate the importance of the Th1/Th2 decision. This bacterium repli- 
cates mainly within macrophages and causes either of two forms of disease, 
depending mainly on the genetic make-up of the infected individual. In some 
patients, the tuberculoid form of the disease occurs. Here, ThI cells develop and 
stimulate the infected macrophages to kill the bacteria. This produces a local 
inflammatory response, which damages skin and nerves. The result is a chronic 



Figure 25-68 The activation of ThI and 
Th2 cells. The differentiation of naive 
helperT cells into either Th1 orTn2 
effector cells determines the nature of 
the subsequent adaptive immune 
responses. Whether a naive helperT cell 
becomes a Th1 orTn2 cell depends 
mainly on the signal proteins present 
when an activated dendritic cell in a 
peripheral lymphoid organ stimulates the 
helperT cell. The types of signal proteins 
produced depend on the local 
environment and the nature of the 
pathogen that activated the dendritic cell 
at the site of infection. IL1 2 secreted by 
activated dendritic cells promotes Th1 
cell development. By contrast, both the 
transmembrane Notch ligand Jagged on 
the surface of activated dendritic cells 
and the IL4 made by basophils, mast 
cells, and Th2 cells promote Th2 cell 
development. 

In this figure, the effector T H 1 cell 
produced in the peripheral lymphoid 
organ migrates to the site of infection 
and helps a macrophage kill the 
microbes it has phagocytosed.The 
effector Th2 cell remains in the lymphoid 
organ and helps activate a B cell to 
produce antibodies against the parasite. 
In addition to binding to the parasites, 
the antibodies bind to Fc receptors on 
mast cells, basophils, and eosinophils 
(see Figure 25-27), which then can help 
expel the parasite from the gut. Although 
not shown here,Tn1 cells also help 
activate B cells to make antibodies. 
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disease that progresses slowly but does not kill the host. In other patients, by con- 
trast, the lepromatous form of the disease occurs. Here, Th2 cells develop and 
stimulate the production of antibodies. As the antibodies cannot get through the 
plasma membrane to attack the intracellular bacteria, the bacteria proliferate 
unchecked and eventually kill the host. For unknown reasons, there is also a gen- 
eral depression of T- cell-mediated immunity to most antigens in the lepromatous 
form of the disease. 

Naive helper T cells can also develop into a recently described third type of 
effector cell called ThI 7 cells because they secrete the pro-inflammatory inter- 
leukin IL1 7. Th17 cells help defend against extracellular pathogens, but they also 
play an important part in many autoimmune diseases. They develop when some 
naive helper T cells are activated by antigen in the presence of TGF(3 and the 
interleukin IL6. 



ThI Cells Activate Infected Macrophages and Stimulate An 
Inflammatory Response 

Macrophages and dendritic cells ingest pathogens and their products at sites of 
infection. The dendritic cells become activated and carry microbial antigens to 
a peripheral lymphoid organ, where they preferentially induce the development 
of ThI cells. The TrI cells then migrate to the site of infection to help activate 
infected macrophages (see Figure 25-68). 

TrI effector cells use two signals to activate the specific macrophages they 
recognize. They secrete IFNy, which binds to IFNy receptors on the macrophage 
surface, and they display the co- stimulatory protein CD40 ligand, which binds to 
CD40 on the macrophage (Figure 25-69). (We see later that CD40 ligand is also 
used by helper T cells to activate B cells.) Once activated, the macrophage can kill 
the microbes in their phagosomes: lysosomes can now fuse more readily with the 
phagosomes, unleashing a hydrolytic attack, and the activated macrophage 
makes oxygen radicals and nitric oxide, both of which are highly toxic to the 
microbes (discussed in Chapter 24). Because dendritic cells also express CD40, 
the TrI cells at sites of infection can also help activate them. As a result, the den- 
dritic cells increase their production of class II MHC proteins, co- stimulatory 
proteins, and various cytokines, especially IL12. This makes them more effective 
at stimulating naive helper T cells to differentiate into ThI effector cells in periph- 
eral lymphoid organs, providing a positive feedback loop that increases the pro- 
duction of ThI cells and, thereby, the activation of macrophages. 

ThI effector cells also stimulate an inflammatory response (discussed in 
Chapter 24) by recruiting more phagocytic cells into the infected site. They do so 
in three ways: 

1. They secrete cytokines that act on the bone marrow to increase the pro- 
duction of monocytes (macrophage precursors that circulate in the blood) 
and neutrophils. 

2. They secrete other cytokines that activate endothelial cells lining local 
blood vessels to express cell adhesion molecules that cause monocytes and 
neutrophils in the blood to adhere there. 

3. They secrete chemokines that direct the migration of the adherent mono- 
cytes and neutrophils out of the bloodstream into the site of infection. 

A ThI cell can also help activate a cytotoxic T cell in a peripheral lymphoid 
organ by producing chemokines that attract the cytotoxic cells to the site of the 
ThI -cell-dendritic-cell interaction, while at the same time stimulating the den- 
dritic cell to produce more co- stimulatory proteins. In addition, ThI cells can 
help effector cytotoxic T cells kill virus -infected target cells by secreting IFNy, 
which increases the efficiency with which target cells process viral antigens for 
presentation to cytotoxic T cells (see Figure 25-60). An effector ThI cell can also 
directly kill some cells itself, including effector lymphocytes: by expressing Fas 
ligand on its surface, it can induce effector T or B cells that express cell-surface 
Fas to undergo apoptosis (see 25-47B). 

Both ThI and Th2 cells can also help stimulate B cells to proliferate and dif- 
ferentiate and to switch the class of antibody they make, from IgM and IgD to 
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Figure 25-69 The differentiation of T H 1 cells and their activation of macrophages. (A) An activated dendritic cell that has 
ingested bacteria at a site of infection and migrated to a peripheral lymphoid organ activates a naive helper T cell to 
differentiate into a Th1 effector cell. The dendritic cell uses both cell-surface co-stimulatory proteins such as the B7 proteins 
and secreted IL1 2 to induce T H 1 cell differentiation. (B) The activated T H 1 effector cell then migrates from the peripheral 
lymphoid organ to the infected site, where it helps activate macrophages to kill the bacteria harbored within the 
macrophages' phagosomes. As indicated, it does this by means of secreted IFNy and membrane-bound CD40 ligand, which 
binds to CD40 on the macrophage. 



one of the secondary classes of antibody. Before considering how helper T cells 
do this, we need to discuss the role of the B cell antigen receptor in the activa- 
tion of B cells. 



Antigen Binding to B Cell Receptors (BCRs) Is Only One Step 
in B Cell Activation 

Like T cells, most B cells require multiple extracellular signals to become acti- 
vated. One signal is provided by antigen binding to the B cell receptor (BCR) for 
antigen, which, as discussed previously, is a membrane-bound antibody 
molecule. A helper T cell usually provides the other required signals. If a B cell 
receives the first signal only, it may be eliminated or functionally inactivated, 
which is one way in which B cells become tolerant to self antigens. 

Signaling through the BCR works in much the same way as signaling 
through the TCR (see Figure 25-66). The receptor is associated with two invari- 
ant protein chains, Iga and Ig(3, which help convert antigen binding to BCRs into 
intracellular signals. When antigen cross-links BCRs on the surface of a B cell, it 
causes them and their associated invariant chains to cluster into small aggre- 
gates. This aggregation leads to the assembly of an intracellular signaling com- 
plex at the site of the clustered receptors and to the initiation of a tyrosine phos- 
phorylation cascade (Figure 25-70). 

A co-receptor complex that binds complement proteins greatly enhances 
the efficiency of signaling through the BCR and its associated invariant chains. 
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Figure 25-70 Early signaling events in B cells activated by the binding of antigen to BCRs. The antigen cross-links 
adjacent BCRs, which are transmembrane antibody molecules. The cross-linking causes the BCRs and their associated 
invariant chains (heterodimers of Iga and Igp) to cluster. A Src-like cytoplasmic tyrosine kinase, which can be Fyn, Blk, or 
Lyn, is associated with the cytosolic tail of Igp. It joins the cluster and phosphorylates Iga and Igp (for simplicity, only the 
phosphorylation on IgP is shown). As in the case of TCR activation, the protein tyrosine phosphatase CD45 is also required 
to activate these Src-like kinases (not shown). The resulting phosphotyrosines on Iga and IgP serve as docking sites for 
another Src-like tyrosine kinase called Syk, which is homologous to ZAP70 in T cells (see Figure 25-66). Like ZAP70, Syk 
becomes phosphorylated and thereby activated, and it then relays the signal downstream. 



If a microbe directly activates the complement system (discussed in Chapter 24), 
complement proteins are often deposited on the microbe surface, greatly 
increasing the B cell response to the microbe. Now, when the microbe clusters 
BCRs on a B cell, the complement-binding co-receptor complexes are brought 
into the cluster, increasing the strength of signaling by activating PI 3 -kinase 
(discussed in Chapter 15) (Figure 25-71A). As expected, antibody responses are 
greatly reduced in mice lacking either one of the required complement compo- 
nents or one of the complement-binding co-receptor subunits on B cells. 

Later in the immune response, by contrast, when IgG antibodies are present 
on the surface of the microbe, a different co-receptor dampens down the B cell 
response. These are Fc receptors, which bind the tails of the IgG antibodies. They 
recruit both lipid and protein phosphatase enzymes into the signaling complex 
that decrease the strength of signaling (Figure 25-71B). In this way, the Fc recep- 
tors on B cells act as inhibitory co-receptors, just as the CTLA4 proteins do on T 
cells. Thus, the co-receptors on a T cell or B cell allow the cell to gain additional 
information about the antigen bound to its receptors and thereby make a more 
informed decision as to how to respond. 
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Figure 25-71 The influence of B cell 
co-receptors on the effectiveness of 
signaling through BCRs. (A) The binding 
of microbe-complement complexes to 
BCRs cross-links the BCRs to complement- 
binding, co-receptor complexes. The 
cytosolic tail of one component of the 
co-receptor complex becomes 
phosphorylated on tyrosines, which then 
serve as docking sites for PI 3-kinase. As 
discussed in Chapter 15, PI 3-kinase is 
activated to phosphorylate specific 
inositol phospholipids in the plasma 
membrane, which then act as docking 
sites to recruit intracellular signaling 
proteins (not shown). These signaling 
proteins act together with the signals 
generated by the Syk kinase to amplify 
the response. (B) When IgG antibodies 
bind to foreign antigen, usually late in a 
response, the Fc regions of the antibodies 
bind to Fc receptors on the B cell surface 
and are thus recruited into the signaling 
complex. The Fc receptors become 
phosphorylated on tyrosines; these then 
serve as docking sites for two types of 
phosphatase enzymes: (1) an inositol 
phospholipid phosphatase, which 
dephosphorylates the inositol 
phospholipid docking sites in the plasma 
membrane generated by PI 3-kinase, 
thereby reversing the activating effects of 
PI 3-kinase; (2) protein tyrosine 
phosphatases, which inhibit signaling by 
the activated tyrosine kinases. 
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Antigen-Specific Helper T Cells Are Essential for Activating Most 
B Cells 



Whereas antigen-presenting cells such as dendritic cells and macrophages are 
omnivorous and ingest and present antigens on their MHC proteins nonspecifi- 
cally, a B cell generally presents only peptides derived from an antigen that it 
specifically recognizes using its BCRs. Thus, BCRs do more than just bind antigen 
to begin the process of B cell activation; they also play a crucial part in recruiting 
T cell help. Through endocytosis, they deliver their bound protein antigen to an 
endosomal compartment, where the antigen is degraded into peptides; many of 
these peptides are returned to the B cell surface bound to class II MHC proteins 
(see Figure 25-61). These peptide-class-II-MHC complexes are recognized by 
antigen-specific helper T cells, which then deliver further signals to the B cell that 
are required for its proliferation and antibody secretion (Figure 25-72). 

How do the antigen- specific T cells required for B cell activation originate? 
As discussed previously, during primary antibody responses, naive helper T cells 
are activated in a peripheral lymphoid organ by binding to a foreign peptide 
bound to a class II MHC protein on the surface of an activated dendritic cell. The 
effector helper T cells that results from this activation can then activate a B cell 
that displays the same complex of foreign peptide and class II MHC protein on 
its surface. Thus, the helper T cell activates only those B cells with BCRs that 
specifically recognize the antigen that initially activated the T cell, even though 
the TCRs and BCRs usually recognize distinct antigenic determinants on the 
antigen (see Figure 25-72). This requirement for linked recognition of antigen by 
a T cell and a B cell helps avoid autoimmune B cell responses, which would 
require the simultaneous presence of both helper T cells and B cells that recog- 
nize the same self antigen. 

In secondary antibody responses, memory B cells themselves can act as 
antigen-presenting cells and activate helper T cells, as well as being the subse- 
quent targets of the effector helper T cells. The mutually reinforcing actions of 
helper T cells and B cells lead to an antibody response that is both intense and 
highly specific. 

Once a helper T cell has been activated to become an effector cell and con- 
tacts a B cell, the contact initiates an internal rearrangement of the helper cell's 
cytoplasm. The T cell orients its centrosome and Golgi apparatus toward the B 
cell, as described previously for an effector cytotoxic T cell contacting its target 
cell (see Figure 25-46). In this case, however, the orientation is thought to enable 
the effector helper T cell to direct both membrane-bound and secreted 
cytokines onto the B cell surface (see Figure 25-72). One crucial membrane- 
bound signal molecule is CD40 ligand, which we encountered earlier. It is 
expressed on the surface of effector helper T cells, but not on nonactivated naive 
or memory helper T cells, and it is recognized by the CD40 protein on the B cell 
surface. This interaction between CD40 ligand and CD40 is required for helper T 
cells to activate B cells to proliferate and differentiate into memory or antibody- 



Figure 25-72 The activation of a B cell 
by a protein antigen and an effector 
helper T cell. Note that the B cell and 
T cells recognize different antigenic 
determinants on the antigen and that the 
effector helper T cell uses both secreted 
and membrane-bound co-stimulatory 
molecules to help activate the B cell. 
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Figure 25-73 Comparison of the signals 
required to activate a helper T cell and 
a B cell to the same protein antigen. 

Note that in both cases secreted and 
membrane-bound molecules cooperate 
in the activation process. The red arrow 
indicates the endocytosis of the protein 
antigen. Although not shown, CD40 
ligand is also used by effector helper T 
cells to increase and maintain the 
activation of dendritic cells, which 
express CD40, thereby creating a positive 
feedback loop. 

The antigenic determinant recognized 
by the helper T cell is presented on the 
surface of both the dendritic cell and the 
B cell as a peptide fragment of the 
protein antigen bound to a class II MHC 
protein. By contrast, the B cell recognizes 
a different antigenic determinant on the 
surface of the folded (native) protein. 



activated dendritic cell 



effector helper Tcell 



secreting effector cells. Individuals that lack CD40 ligand are severely immun- 
odeficient. They are susceptible to the same infections that affect AIDS patients 
whose helper T cells have been destroyed. 

Helper T cells also secrete cytokines to help B cells proliferate and differen- 
tiate and, in some cases, to switch the class of antibody they produce. The 
cytokines include the interleukins IL2 and IL4. IL4, for example, is produced by 
Tr2 cells and collaborates with CD40 ligand in stimulating B cell proliferation 
and differentiation; it also promotes switching to IgE antibody production. Mice 
deficient in IL4 production are severely impaired in their ability to make IgE. 

Figure 25-73 compares the signals required for T and B cell activation; Table 
25-4 lists some of the cytokines discussed in this chapter. 



A Special Class of B Cells Recognize T-cell-independent Antigens 

Some antigens can stimulate B cells to proliferate and differentiate into anti- 
body-secreting effector cells without help from T cells. Most of these T-cell-inde- 
pendent antigens are microbial polysaccharides that do not activate helper T 
cells. Some activate B cells directly by providing both the antigen signal and the 
accessory signals normally provided by helper T cells. Others are large polymers 
with repeating, identical antigenic determinants (see Figure 25-29B); their mul- 
tipoint binding to BCRs can generate a strong enough signal on its own to acti- 
vate the B cell directly, without additional signals. 



Table 25- 


-4 Properties of Some Cytokines 




CYTOKINE SOME SOURCES 


SOME ACTIONS 


IL2 


all helper T cells; some cytotoxic T cells 


stimulates proliferation and differentiation of activated T cells; 
required for regulatory T cell development in thymus 


IL4 


Th2 cells, basophils, and mast cells 


stimulates B cell proliferation, differentiation, and class switching to IgE 
and IgGI ; promotes Th2 and inhibits ThI cell development 


IL7 


many non-T cells 


promotes memory T cell survival 


IL10 


Th2 cells, macrophages, and dendritic cells 


inhibits macrophages and ThI cell development 


IL12 


B cells, macrophages, dendritic cells, and 
granulocytes 


induces T H 2 cell development and inhibits T H 1 cell development 


IL15 


many non-T cells 


promotes memory Tcell survival 


IL17 


some effector helper T cells 


stimulates inflammatory responses 


IFNy 


Th1 cells and cytotoxic T cells 


activates macrophages; increases MHC expression in many cell types 


TGFp 


regulatory T cells 


suppresses effector T cell activity, dendritic cells, and macrophages 


TNFoc 


ThI cells and macrophages 


activates endothelial cells and macrophages 
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Because T-cell-independent antigens do not activate helper T cells, they fail 
to induce B cell memory, affinity maturation, or class switching, all of which 
require help from T cells. They therefore mainly stimulate the production of low- 
affinity (but high-avidity) IgM antibodies. Most B cells that make antibodies 
without T cell help belong to a distinct B cell lineage. They are called Bl cells to 
distinguish them from B2 cells, which require T cell help. Bl cells seem to be 
especially important in defense against intestinal pathogens. 



Immune Recognition Molecules Belong to the Ancient Ig 
Superfamily 

Most of the proteins that mediate cell-cell recognition or antigen recognition in 
the immune system contain Ig or Ig-like domains, suggesting that they have a 
common evolutionary history. Included in this Ig superfamily are antibodies, 
TCRs, MHC proteins, the CD4, CD8, and CD28 co-receptors, the B7 co-stimula- 
tory proteins, and most of the invariant polypeptide chains associated with 
TCRs and BCRs, as well as the various Fc receptors on lymphocytes and other 
white blood cells. All of these proteins contain one or more Ig or Ig-like domains. 
In fact, about 15% of the 250 or so proteins that have been characterized on the 
surface of white blood cells belong to this superfamily. Many of these molecules 
are dimers or higher oligomers in which Ig or Ig-like domains of one chain inter- 
act with those in another (Figure 25-74). 

A separate exon usually encodes the amino acids in each Ig-like domain. It 
seems likely that the entire gene superfamily evolved from a gene coding for a 
single Ig-like domain — similar to that encoding p2-microglobulin (see Figures 
25-50A and 25-52) or the Thy- 1 protein (see Figure 25-74) — that may have medi- 
ated cell-cell interactions. There is evidence that such a primordial gene arose 
before vertebrates diverged from their invertebrate ancestors about 400 million 
years ago. New family members presumably arose by exon and gene duplica- 
tions. 
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Figure 25-74 Some of the cell-surface 
proteins discussed in this chapter that 
belong to the Ig superfamily. The Ig and 

Ig-like domains are shaded in gray, except 
for the antigen-binding domains (not all of 
which are Ig domains), which are shaded in 
blue.Jhe function of Thy1 is unknown, but 
it is held in the plasma membrane by a 
glycosylphosphatidylinositol (GPI) anchor 
and is widely used to identify T cells in 
mice. The Ig superfamily also includes 
many cell-surface proteins involved in 
cell-cell interactions outside the immune 
system, such as the neural cell adhesion 
molecule (NCAM) discussed in Chapter 19 
and the receptors for various protein 
growth factors discussed in Chapter 15 (not 
shown). There are more than 750 members 
of the Ig superfamily in humans. 
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The multiple gene segments that encode antibodies and TCRs may have 
arisen when a transposable element, or transposon (discussed in Chapter 5), 
inserted into an exon of a gene encoding an Ig family member in an ancestral 
lymphocyte-like cell. The transposon may have contained the ancestors of the 
Rag genes, which, as discussed earlier, encode the proteins that initiate V(D)J 
recombination; the finding that the RAG proteins can act as transposons in a test 
tube strongly supports this view Once the transposon had inserted into the 
exon, the gene could be expressed only if the transposon was excised by the RAG 
proteins and the two ends of the exon were rejoined, much as occurs when the 
Vand/gene segments of an Ig light chain gene are assembled (see Figure 25-38). 
A second insertion of the transposon into the same exon may then have divided 
the gene into three segments, equivalent to the present-day V, D, and /gene seg- 
ments. Subsequent duplication of either the individual gene segments or the 
entire split gene may then have generated the arrangements of gene segments 
that characterize the adaptive immune systems of present-day vertebrates. 



Summary 

The production of an effector helper T cell from of a naive helper T cell requires multi- 
ple signals from an activated dendritic cell MHC-peptide complexes on the dendritic 
cell surface provide one signal, by binding to both TCRs and a CD4 co-receptor on the 
T cell. Co-stimulatory proteins on the dendritic cell surface, including CD28, and 
secreted cytokines are the other signals. When naive helper T cells are initially activated 
on a dendritic cell, most differentiate into either ThI or Th2 effector cells, depending 
mainly on the signal proteins in their environment. ThI cells activate macrophages, 
cytotoxic T cells, and B cells, while Th2 cells activate mainly B cells. In both cases, the 
effector helper T cells recognize the same complex of foreign peptide and class IIMHC 
protein on the target cell surface as they initially recognized on the dendritic cell that 
activated them. They activate their target cells by a combination of membrane-bound 
and secreted co- stimulatory proteins. One membrane-bound signal protein used by 
both ThI and Th2 cells is CD40 ligand. 

Like T cells, B cells require multiple signals for activation. Antigen binding to the 
B cell antigen receptors (BCRs) provides one signal, while antigen-specific effector 
helper T cells provide the other signals. The requirement for multiple signals to acti- 
vate either a T cell or aB cell helps to prevent inappropriate and dangerous activation 
of lymphocytes, including self -reactive lymphocytes. 

Most of the proteins involved in cell-cell recognition and antigen recognition in 
the immune system, including antibodies, TCRs, and MHC proteins, as well as the var- 
ious co-receptors discussed in this chapter, belong to the ancient Ig superfamily. This 
superfamily is thought to have evolved from a primordial gene encoding a single Ig- 
like domain. The mechanisms for diversifying antibodies and T cell receptors by 
recombining gene segments may have arisen when a transposon inserted into an exon 
of a gene encoding an Ig family member. 
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ABC transporter protein 

A large family of membrane transport proteins that use the 
energy of ATP hydrolysis to transfer peptides or small mole- 
cules across membranes. (Figure 11-7) 

acetyl (-COCH3) 

Chemical group derived from acetic acid (CH3COOH). Acetyl 
groups are important in metabolism and are also added cova- 
lently to some proteins as a post-translational modification. 

acetyl CoA 

Small water-soluble activated carrier molecule. Consists of 
an acetyl group linked to coenzyme A (CoA) by an easily 
hydrolyzable thioester bond. (Figure 2-62) 

acetylcholine (ACh) 

Neurotransmitter that functions at cholinergic synapses. 
Found in the brain and peripheral nervous system. The neu- 
rotransmitter at vertebrate neuromuscular junctions. (Figure 
15-9) 

acetylcholine receptor (AChR) 

Membrane protein that responds to binding of acetylcholine 
(ACh). The nicotinic AChR is a transmitter- gated ion channel 
that opens in response to ACh. The muscarinic AChR is not an 
ion channel, but a G-protein-coupled cell-surface receptor. 

acid 

A proton donor. Substance that releases protons (H + ) when 
dissolved in water, forming hydronium ions (H3CH and low- 
ering the pH. (Panel 2-2, pp. 108-109) 

acrosomal vesicle 

Region at the head end of a sperm cell that contains a sac of 
hydrolytic enzymes used to digest the protective coating of 
the egg. When a sperm starts to enter an egg, the contents of 
the vesicle are released (the acrosome reaction), helping the 
sperm penetrate the zona pellucida. (Figures 21-33 and 
21-27) 

actin 

Abundant protein that forms actin filaments in all eucaryotic 
cells. The monomeric form is sometimes called globular or 
G- actin; the polymeric form is filamentous or F- actin. (Panel 
16-1, p. 968, and Figure 16-12) 

actin-binding protein 

Protein that associates with either actin monomers or actin 
filaments in cells and modifies their properties. Examples 
include myosin, a-actinin, and profilin. (Panel 16-3, 
pp. 994-995) 

actin filament (microfilament) 

Helical protein filament formed by polymerization of globu- 
lar actin molecules. A major constituent of the cytoskeleton 
of all eucaryotic cells and part of the contractile apparatus of 
skeletal muscle. (Panel 16-1, p. 968) 

action potential 

Rapid, transient, self-propagating electrical excitation in the 
plasma membrane of a cell such as a neuron or muscle cell. 



Action potentials, or nerve impulses, make possible long- 
distance signaling in the nervous system. (Figure 1 1-30) 

activated carrier 

Small diffusible molecule that stores easily exchangeable 
energy in the form of one or more energy-rich covalent 
bonds. Examples are ATI> acetyl CoA, FADH 2 , NADH, and 
NADPH. (Figure 2-55) 

activation energy 

Extra energy that must be acquired by atoms or molecules in 
addition to their ground-state energy in order to reach a 
transition state and undergo a particular chemical reaction. 
(Figure 2-44) 

activator (gene activator protein, transcriptional activator) 

Gene regulatory protein that when bound to its regulatory 
sequence in DNA activates transcription. 

active site 

Region of an enzyme surface to which a substrate molecule 
binds in order to undergo a catalyzed reaction. (Figure 1-7) 

active transport 

Movement of a molecule across a membrane or other barrier 
driven by energy other than that stored in the electrochemi- 
cal or concentration gradient of the transported molecule. 



acyl group (-CO-R) 

Functional group derived from a carboxylic acid (R-C * 



,0 , 
OH 



adaptation 

(1) adaptation (desensitization): Adjustment of sensitivity 
following repeated stimulation. The mechanism that allows a 
cell to react to small changes in stimuli even against a high 
background level of stimulation. (2) evolutionary adapta- 
tion: an evolved trait. 

adaptive immune response 

Response of the vertebrate adaptive immune system to a 
specific antigen that typically generates immunological 
memory. (Figures 25-1 and 25-2) 

adaptor protein 

General term for a protein that functions solely to link two or 
more different proteins together in an intracellular signaling 
pathway or protein complex. (Figure 15-22) 

adenomatous polyposis coli (APC) protein 

Tumor suppressor protein that forms part of a protein com- 
plex in the canonical Wnt signaling pathway that recruits free 
cytoplasmic p-catenin and degrades it. 

adenosine triphosphate — sec ATP 

adenylyl cyclase (adenylate cyclase) 

Membrane -bound enzyme that catalyzes the formation of 
cyclic AMP from ATP An important component of some 
intracellular signaling pathways. 

adherens junction 

Cell junction in which the cytoplasmic face of the plasma 
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membrane is attached to actin filaments. Examples include 
adhesion belts linking adjacent epithelial cells and focal con- 
tacts on the lower surface of cultured fibroblasts. 

ADP (adenosine 5'-diphosphate) 

Nucleotide produced by hydrolysis of the terminal phos- 
phate of ATP Regenerates ATP when phosphorylated by an 
energy-generating process such as oxidative phosphoryla- 
tion. (Figure 2-57) 

adrenaline (epinephrine) 

Hormone released by adrenal gland chromaffin cells, espe- 
cially in response to stress, that binds to specific GPCRs. It 
can initiate and coordinate a "fight or flight" response, which 
includes an increase in heart rate and blood sugar levels. It is 
also a catecholamine neurotransmitter. 

aerobic 

Occurring in, or requiring, the presence of molecular oxygen 
(0 2 ). 

affinity 

The strength of binding of a molecule to its ligand at a single 
binding site. 

affinity chromatography 

Type of chromatography in which the protein mixture to be 
purified is passed over a matrix to which specific ligands for 
the required protein are attached, so that the protein is 
retained on the matrix. (Figure 8-13) 

affinity constant (association constant) (Ay 

Measure of the strength of binding of the components in a 
complex. For components A and B and a binding equilib- 
rium A + B ^ AB, the association constant is given by 
[AB]/[A][B], and is larger the tighter the binding between A 
and B. (Figure 3-43) 

affinity maturation 

Progressive increase in the affinity of antibodies for the 
immunizing antigen with the passage of time after immu- 
nization. 

Akt (protein kinase B, PKB) 

Serine/threonine protein kinase that acts in the PI 3- 
kinase/Akt intracellular signaling pathway involved espe- 
cially in signaling cells to grow and survive. Also called pro- 
tein kinase B (PKB). (Figure 15-64) 






O 



aldehyde 

Organic compound that contains the [— C\„ ] group. Ex- 
ample: glyceraldehyde. Can be oxidized to an acid or 
reduced to an alcohol. (Panel 2-1, p. 107) 

alga (plural algae) 

Informal term used to describe a wide range of simple uni- 
cellular and multicellular eucaryotic photosynthetic organ- 
isms. Examples include Nitella,Volvox, and Fucus. 

alkaline — see basic 

alkyl group (C„H 2 n+i) 

General term for a group of covalently linked carbon and 
hydrogen atoms such as methyl (-CH 3 ) or ethyl (-CH2CH3) 
groups. Usually exist as part of larger organic molecules. On 
their own they form extremely reactive free radicals. 

allele 

One of several alternative forms of a gene. In a diploid cell 
each gene will typically have two alleles, occupying the cor- 
responding position (locus) on homologous chromosomes. 

allelic exclusion 

The expression of a protein from only one of the two alleles 
of the gene encoding the protein in the cell, as occurs, for 
example, in the expression of an immunoglobulin or T cell 
receptor chain or an olfactory receptor. 

allostery (adjective allosteric) 

Change in a protein's conformation brought about by the 
binding of a regulatory ligand (at a site other than the pro- 
tein's catalytic site), or by covalent modification. The change 



in conformation alters the activity of the protein and can 
form the basis of directed movement. (Figures 3-58 and 
16-61) 

alpha helix (a helix) 

Common folding pattern in proteins, in which a linear 
sequence of amino acids folds into a right-handed helix sta- 
bilized by internal hydrogen bonding between backbone 
atoms. (Figure 3-7) 

alternative RNA splicing 

Production of different RNAs from the same gene by splicing 
the transcript in different ways. (Figure 7-94) 

alveoli (singular alveolus) 

Small dilated outpocketings of an epithelium, especially the 
epithelium of the lung, where they form millions of air-filled 
sacs. Similar structures are found in the milk- secreting glan- 
dular epithelium of the breast. 

amide 

Molecule containing a carbonyl group linked to an amine. 

(Panel 2-1, p. 107) 

amine 

Chemical group containing nitrogen and hydrogen. 
Becomes positively charged in water. (Panel 2-1, p. 107) 

amino acid 

Organic molecule containing both an amino group and a 
carboxyl group. Those that serve as building blocks of pro- 
teins are alpha amino acids, having both the amino and 
carboxyl groups linked to the same carbon atom. 
(NH2CHRCOOH, Panel 3-1, pp. 128-129) 

aminoacyl-tRNA synthetase 

Enzyme that attaches the correct amino acid to a tRNA mol- 
ecule to form an aminoacyl-tRNA. (Figure 6-57) 

amino group (-NH2) 

Weakly basic functional group derived from ammonia (NH3) 
in which one or more hydrogen atoms are replaced by 
another atom. In aqueous solution it can accept a proton 
and carry a positive charge (-NH3 + ). 

amino terminus — see N terminus 

amoeba (plural amoebae) 

Carnivorous unicellular protozoan that crawls using 
pseudopodia. 

AMP (adenosine 5'-monophosphate) 

One of the four nucleotides in an RNA molecule. Two phos- 
phates are added to AMP to form ATP. (Panel 2-6, pp. 
116-117) 

amphipathic 

Having both hydrophobic and hydrophilic regions, as in a 
phospholipid or a detergent molecule. 

anabolism (biosynthesis) 

Formation of complex molecules from simple substances by 
living cells. (Figure 2-36) 

anaerobic 

Requiring, or occurring in, the absence of molecular oxygen 
(0 2 ). 

anaphase 

(1) Stage of mitosis during which sister chromatids separate 
and move away from each other. Composed of anaphase A 
(chromosomes move toward the two spindle poles) and 
anaphase B (spindle poles move apart). (2) Anaphase I and 
II: stages of meiosis during which chromosome homolog 
pairs separate (I), and then sister chromatids separate (II). 
(Panel 17-1, pp. 1072-1073) 

anaphase-promoting complex (APC/C) (cyclosome) 

Ubiquitin ligase that catalyzes the ubiquitylation and 
destruction of securin and M- and S-cyclins, initiating the 
separation of sister chromatids in the metaphase-to- 
anaphase transition during mitosis. 



GLOSSARY 



G:3 



anchorage dependence 

Dependence of cell growth, proliferation, and survival on 
attachment to a substratum. 

anchoring junction 

Cell junction that attaches cells to neighboring cells or to the 
extracellular matrix. (Figure 19-2, and Table 19-1, p. 1133) 

angiogenesis 

Growth of new blood vessels by sprouting from existing ones. 

Angstrom (A) 

Unit of length used to measure atoms and molecules. Equal 
to 10~ 10 meter or 0.1 nanometer (nm). 

animal pole 

In yolky eggs, the end opposite the yolk. Cells derived from 
the animal region will envelop those derived from the yolky 
(vegetal) region. (Figure 22-68) 

anion 

Negatively charged ion. 

antenna complex 

Part of a photosystem that captures light energy and chan- 
nels it into the photochemical reaction center. It consists of 
protein complexes that bind large numbers of chlorophyll 
molecules and other pigments. 

antibiotic 

Substance such as penicillin or streptomycin that is toxic to 
microorganisms. Often a product of a particular microorgan- 
ism or plant. 

antibody (immunoglobulin, Ig) 

Protein produced by B cells in response to a foreign molecule 
or invading microorganism. Binds tightly to the foreign mol- 
ecule or cell, inactivating it or marking it for destruction by 
phagocytosis or complement-induced lysis. 

anticodon 

Sequence of three nucleotides in a transfer RNA (tRNA) mol- 
ecule that is complementary to a three -nucleotide codon in 
a messenger RNA (mRNA) molecule. 

antigen 

A molecule that can induce an adaptive immune response or 
that can bind to an antibody or T cell receptor. 

antigenic determinant (epitope) 

Specific region of an antigen that binds to an antibody or a T 
cell receptor. 

antigenic variation 

Ability to change the antigens displayed on the cell surface; a 
property of some pathogenic microorganisms that enables 
them to evade attack by the adaptive immune system. 

antigen-presenting cell 

Cell that displays foreign antigen complexed with an MHC 
protein on the cell surface for presentation to T lymphocytes. 

antiparallel 

Describes the relative orientation of the two strands in a 
DNA double helix or two paired regions of a polypeptide 
chain; the polarity of one strand is oriented in the opposite 
direction to that of the other. 

antiporter 

Carrier protein that transports two different ions or small 
molecules across a membrane in opposite directions, either 
simultaneously or in sequence. (Figure 1 1-8) 

antisense RNA 

RNA complementary to an RNA transcript of a gene. Can 
hybridize to the specific RNA and block its function. 

APC — see adenomatous polyposis coli 

APC/C — see anaphase-promoting complex 

apical 

Referring to the tip of a cell, a structure, or an organ. The apical 



surface of an epithelial cell is the exposed free surface, oppo- 
site to the basal surface. The basal surface rests on the basal 
lamina that separates the epithelium from other tissue. 

apical meristem 

The growing tip of a plant shoot or root, composed of divid- 
ing undifferentiated cells. (Panel 22-1, p. 1401) 

apoptosis 

Form of programmed cell death, in which a "suicide" pro- 
gram is activated within an animal cell, leading to rapid cell 
death mediated by intracellular proteolytic enzymes called 

caspases. 

aqueous 

Pertaining to water, as in an aqueous solution. 

Arabidopsis thaliana (common Thale cress) 

Small flowering weed related to mustard. Model organism 
for flowering plants and the primary model for studies of 
plant molecular genetics. 

archaeon (plural arch [a] ea) (archaebacterium). 

Single-celled organism without a nucleus, superficially simi- 
lar to bacteria. At a molecular level, more closely related to 
bacteria in metabolic machinery, but more similar to eucary- 
otes in genetic machinery. Archaea and Bacteria together 
make up the Procaryotes. (Figure 1-21) 

ARF (ADP-ribosylation factor, ARF protein) 

Monomeric GTPase in the Ras superfamily responsible for 
regulating both COPI coat assembly and clathrin coat 
assembly at Golgi membranes. (Table 15-5, p. 926) 

aromatic 

Molecule that contains carbon atoms in a ring drawn as hav- 
ing alternating single and double bonds. Often a molecule 
related to benzene. 

ARP (actin-related protein) complex (ARP2/3 complex) 

Complex of proteins that nucleates actin filament growth 
from the minus end. 

ARS— see autonomously replicating sequence 

asexual reproduction 

Any type of reproduction (such as budding in Hydra, binary 
fission in bacteria, or mitotic division in eucaryotic microor- 
ganisms) that does not involve the mixing of two different 
genomes. Produces individuals that are genetically identical 
to the parent. 

association constant — see affinity constant 

aster 

Star- shaped system of microtubules emanating from a cen- 
trosome or from a pole of a mitotic spindle. 

astral microtubule 

In the mitotic spindle, any of the microtubules radiating 
from the aster which are not attached to a kinetochore of a 
chromosome. 

ATM (ataxia telangiectasia mutated protein) 

Protein kinase activated by double-strand DNA breaks. If 
breaks are not repaired, ATM initiates a signal cascade that 
culminates in cell cycle arrest. Related to ATR. 

ATP (adenosine 5'-triphosphate) 

Nucleoside triphosphate composed of adenine, ribose, and 
three phosphate groups. The principal carrier of chemical 
energy in cells. The terminal phosphate groups are highly 
reactive in the sense that their hydrolysis, or transfer to 
another molecule, takes place with the release of a large 
amount of free energy. (Figure 2-26) 

ATPase 

Enzyme that catalyzes the hydrolysis of ATP. Many proteins 
have ATPase activity. 

ATP synthase (F Fi ATPase) 

Transmembrane enzyme complex in the inner membrane of 
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mitochondria and the thylakoid membrane of chloroplasts. 
Catalyzes the formation of ATP from ADP and inorganic 
phosphate during oxidative phosphorylation and photosyn- 
thesis, respectively Also present in the plasma membrane of 
bacteria. 

ATR (ataxia telangiectasia and Rad3 related protein) 

Protein kinase activated by DNA damage. If damage remains 
unrepaired, ATR helps initiate a signal cascade that culmi- 
nates in cell cycle arrest. Related to ATM. 

atypical protein kinase (aPKC) 

An atypical form of protein kinase C (PKC) that does not 
require both Ca 2a and phosphatidylserine for activation. One 
such aPKC is involved in the specification of polarity in some 
individual animal cells. 

auditory hair cell (sensory hair cell) 

Sensory cells in the inner ear, responsible for detecting 
sound by converting a mechanical stimulus (the vibrations 
caused by sound waves) into a release of neurotransmitter. 

(Figures 23-13 to 23-15) 

autocrine signaling 

Where a cell secretes signal molecules that act back on itself. 

autoimmune disease, autoimmune response 

Pathological state in which the body mounts a disabling 
adaptive immune response against one or more of its own 
molecules. 

autonomously replicating sequence (ARS) 
Origin of replication in yeast DNA. 

autophagy 

Digestion of worn-out organelles by the cell's own lysosomes. 

autoradiography 

Technique in which a radioactive object produces an image 
of itself on a photographic film or emulsion. 

autosome 

Any chromosome other than a sex chromosome. 

auxin 

Plant hormone, commonly indole-3-acetic acid, with 
numerous roles in plant growth and development. 

avidity 

Total binding strength of a polyvalent antibody with a poly- 
valent antigen. 

axon 

Long nerve cell projection that can rapidly conduct nerve 
impulses over long distances so as to deliver signals to other 
cells. 

axonal transport 

Directed intracellular transport of organelles and molecules 
along a nerve cell axon. Can be anterograde (outward from 
the cell body) or retrograde (back toward the cell body). 

axoneme 

Bundle of microtubules and associated proteins that forms 
the core of a cilium or a flagellum in eucaryotic cells and is 
responsible for their movements. 

BAC — see bacterial artificial chromosome 

bacterium (plural bacteria) (eubacterium) 

Member of the domain Bacteria, one of the three main 
branches of the tree of life (Archaea, Bacteria, and 
Eucaryotes). Bacteria and Archaea both lack a distinct 
nuclear compartment, and together comprise the 
Procaryotes. (Figure 1-21) 

bacterial artificial chromosome (BAC) 

Cloning vector that can accommodate large pieces of DNA 
up to 1 million base pairs. 

bacteriophage (phage) 

Any virus that infects bacteria. Phages were the first organ- 



isms used to analyse the molecular basis of genetics, and are 
now widely used as cloning vectors. See also bacteriophage 
lambda. 

bacteriophage lambda (bacteriophage X, lambda) 

Virus that infects E. coll Widely used as a DNA cloning vector. 

bacteriorhodopsin 

Pigmented protein found in the plasma membrane of a salt- 
loving archaean, Halobacterium salinarium (Halobacterium 
halobium). Pumps protons out of the cell in response to 
light. (Figure 10-33) 

basal 

Situated near the base. Opposite the apical surface. 

basal body 

Short cylindrical array of microtubules and their associated 
proteins found at the base of a eucaryotic cell cilium or fla- 
gellum. Serves as a nucleation site for growth of the 
axoneme. Closely similar in structure to a centriole. 

basal lamina (plural basal laminae) 

Thin mat of extracellular matrix that separates epithelial 
sheets, and many other types of cells such as muscle or fat 
cells, from connective tissue. Sometimes called basement 
membrane. (Figure 19-40) 

base 

A substance that can reduce the number of protons in solu- 
tion, either by accepting H + ions directly, or by releasing OH" 
ions, which then combine with H + to form H2O. The purines 
and pyrimidines in DNA and RNA are organic nitrogenous 
bases and are often referred to simply as bases. (Panel 2-2, 
pp. 108-109) 

base excision repair 

DNA repair pathway in which single faulty bases are 
removed from the DNA helix and replaced. Compare 
nucleotide excision repair. (Figure 5-48) 

basement membrane — see basal lamina 

base pair 

Two nucleotides in an RNA or DNA molecule that are held 
together by hydrogen bonds— for example, G paired with C, 
and A paired with T or U. 

basic (alkaline) 

Having the properties of a base. 

B cell (B lymphocyte) 

Type of lymphocyte that makes antibodies. 

Bcl2 family 

Family of intracellular proteins that either promote or inhibit 
apoptosis by regulating the release of cytochrome c and 
other mitochondrial proteins from the intermembrane 
space into the cytosol. 

benign 

Of tumors: self-limiting in growth, and noninvasive. 

beta-catenin (P-catenin) 

Multifunctional cytoplasmic protein involved in cadherin- 
mediated cell-cell adhesion, linking cadherins to the actin 
cytoskeleton. Can also act independently as a gene regula- 
tory protein. Has an important role in animal development 
as part of a Wnt signaling pathway. 

beta sheet (P sheet) 

Common structural motif in proteins in which different sec- 
tions of the polypeptide chain run alongside each other, 
joined together by hydrogen bonding between atoms of the 
polypeptide backbone. Also known as a (3-pleated sheet. 
(Figure 3-7) 

binding site 

Region on the surface of one molecule (usually a protein or 
nucleic acid) that can interact with another molecule 
through noncovalent bonding. 
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bi-orientation 

The attachment of sister chromatids to opposite poles of the 
mitotic spindle, so that they move to opposite ends of the cell 
when they separate in anaphase. 

biosphere 

All of the living organisms on Earth. 

biosynthesis — see anabolism 

biotin 

Low-molecular-weight compound used as a coenzyme. Also 
useful technically as a covalent label for proteins, allowing 
them to be detected by the egg protein avidin, which binds 
extremely tightly to biotin. (Figure 2-63) 

bivalent 

A four-chromatid structure formed during meiosis, consist- 
ing of a duplicated chromosome tightly paired with its 
homologous duplicated chromosome. 

blastomere 

One of the many cells formed by the cleavage of a fertilized 
egg. (Figure 22-69) 

blastula 

Early stage of an animal embryo, usually consisting of a hol- 
low ball of epithelial cells surrounding a fluid-filled cavity, 
before gastrulation begins. 

blotting 

Biochemical technique in which macromolecules separated 
on a gel are transferred to a nylon membrane or sheet of 
paper, thereby immobilizing them for further analysis. [See 
Northern, Southern, and Western (immuno-) blotting.] 
(Figure 8-38) 

B lymphocyte — see B cell 

bond energy 

Strength of the chemical linkage between two atoms, meas- 
ured by the energy in kilocalories or kilojoules needed to 
break it. 

bright- field microscope 

Normal light microscope in which the image is obtained by 
simple transmission of light through the object being viewed. 

brush border 

Dense covering of microvilli on the apical surface of epithe- 
lial cells in the intestine and kidney. 

budding yeast 

Common name given to the baker's yeast Saccharomyces 
cerevisiae, a model experimental organism, which divides by 
budding off a smaller cell. 

buffer 

Solution of weak acid or weak base that resists pH change 
when small quantities of acid or base are added, or when 
solution is diluted. 

Ca 2+ /calmodulin- dependent protein kinase — see (CaM -kinase) 

cadherin 

Member of the large cadherin superfamily of transmem- 
brane adhesion proteins. Mediates homophilic Ca 2+ - 
dependent cell-cell adhesion in animal tissues. (Figure 19-4, 
and Table 19-2, p. 1135) 

Caenorhabditis elegans 

A small (-lmm) nematode worm used extensively in molec- 
ular and developmental biology as a model organism. 

caged molecule 

Organic molecule designed to change into an active form 
when irradiated with light of a specific wavelength. Example: 
caged ATP. 

CAK — see Cdk-activating kinase 

calcium pump — see Ca 2+ pump 



calmodulin 

Ubiquitous intracellular Ca 2+ -binding protein that under- 
goes a large conformation change when it binds Ca 2+ , allow- 
ing it to regulate the activity of many target proteins. In its 
activated (Ca 2+ -bound) form, it is called Ca 2+ /calmodulin. 

(Figure 15-43) 

calorie 

Unit of heat energy, equal to 4.2 joules. One calorie (small 
V) is the amount of heat needed to raise the temperature of 
1 gram of water by 1°C. A kilocalorie (1000 calories) is the 
unit used to describe the energy content of foods. 

Calvin cycle — see carbon-fixation cycle 

CAM (cell adhesion molecule) 

Protein on the surface of an animal cell that mediates 
cell-cell binding or cell-matrix binding. 

CaM -kinase 

Serine /threonine protein kinase that is activated by 
Ca 2+ /calmodulin. Indirectly mediates the effects of an 
increase in cytosolic Ca 2+ by phosphorylating specific target 

proteins. (Figure 15-43) 

CaM-kinase II 

Multifunctional Ca 2+ /calmodulin-dependent protein kinase 
that phosphorylates itself and various target proteins when 
activated. Found in most animal cells but is especially abun- 
dant at synapses in the brain, and is involved in some forms 
of synaptic plasticity in vertebrates. (Figure 15-44) 

cAMP — see cyclic AMP 

cAMP-dependent protein kinase — see protein kinase A 

cancer 

Disease featuring abnormal and improperly controlled cell 
division resulting in invasive growths, or tumors, that may 
spread throughout the body. (Figure 20-37) 

capsid 

Protein coat of a virus, formed by the self-assembly of one or 
more types of protein subunit into a geometrically regular 
structure. (Figure 3-30) 

Ca 2+ pump (calcium pump, Ca 2+ ATPase) 

Transport protein in the membrane of sarcoplasmic reticu- 
lum of muscle cells (and elsewhere). Pumps Ca 2+ out of the 
cytoplasm into the sarcoplasmic reticulum using the energy 

of ATP hydrolysis. 

carbohydrate 

General term for sugars and related compounds containing 
carbon, hydrogen, and oxygen, usually with the empirical 
formula (CH 2 0)„. 

carbon fixation reaction 

Process by which inorganic carbon (as atmospheric C0 2 ) is 
incorporated into organic molecules. The second stage of 
photosynthesis. (Figure 14-39) 

carbon-fixation cycle (Calvin cycle) 

Major metabolic pathway in photosynthetic organisms by 
which C0 2 and H 2 are converted into carbohydrates. 
Requires both ATP and NADPH. (Figure 14-40) 

carbonyl group (C=0) 

Carbon atom linked to an oxygen atom by a double bond. 
(Panel 2-1, p. 107) 

carboxyl group (-COOH) 

Carbon atom linked both to an oxygen atom by a double 
bond and to a hydroxyl group. Molecules containing a car- 
boxyl group are weak acids — carboxylic acids. (Panel 2-1, 
p. 107) 

carboxyl terminus— see C terminus 

carcinogen 

Any agent, such as a chemical or a form of radiation, that 
causes cancer. 
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carcinoma 

Cancer of epithelial cells. The most common form of human 
cancer. 

cardiac muscle 

Specialized form of striated muscle found in the heart, con- 
sisting of individual heart muscle cells linked together by cell 

junctions. 

carrier protein — see transporter 

cartilage 

Form of connective tissue composed of cells (chondrocytes) 
embedded in a matrix rich in type II collagen and chon- 
droitin sulfate proteoglycan. 

cascade — see signaling cascade 

caspase 

Intracellular protease that is involved in mediating the intra- 
cellular events of apoptosis. 

catabolism 

General term for the enzyme-catalyzed reactions in a cell by 
which complex molecules are degraded to simpler ones with 
release of energy. (Figure 2-36) 

catalyst 

Substance that can lower the activation energy of a reaction 
(thus increasing its rate), without itself being consumed by 
the reaction. 

catastrophe factor 

Protein that destabilizes microtubule arrays by increasing 
the frequency of rapid disassembly of tubulin subunits from 
one end (catastrophe). (Figure 16-16) 

P-catenin — see beta catenin 

cation 

Positively- charged ion. 

caveola (plural caveolae) 

Invaginations at the cell surface that bud off internally to 
form pinocytic vesicles. Thought to form from lipid rafts, 
regions of membrane rich in certain lipids. 



CD4 



Co-receptor protein found on helper T cells, regulatory T cells, 
and macrophages. It binds to class II MHC proteins (on anti- 
gen presenting cells) outside the peptide-binding groove. 



CD8 



Co-receptor protein found on cytotoxic T cells. It binds to 
class I MHC proteins (on antigen-presenting cell) outside the 
peptide-binding groove. 



CD28 



Co-receptor protein on T cells that binds a co- stimulatory B7 
protein on dendritic cells, providing an additional signal 
required for the activation of a naive T cell by antigen. 

Cdc6 

Protein essential in the preparation of DNA for replication. 
With Cdtl it binds to an origin recognition complex on chro- 
mosomal DNA and helps load the Mem proteins onto the 
complex to form the prereplicative complex. 

Cdc20 

Activating subunit of the anaphase-promoting complex 
(APC/C). 

Cdc25 

Protein phosphatase that dephosphorylates Cdks and 
increases their activity. 

Cdc gene (cell-division-cycle gene) 

Gene whose product (a Cdc protein) controls a specific step 
or set of steps in the eucaryotic cell cycle. Originally identi- 
fied in yeasts. 



Cdk-activating kinase (CAK) 

Protein kinase that phosphorylates Cdks in cyclin-Cdk com- 
plexes, activating the Cdk. 

Cdk inhibitor protein (CKI) 

Protein that binds to and inhibits cyclin-Cdk complexes, pri- 
marily involved in the control of Gl and S phases. 



cDNA 



Cdk — see cyclin-dependent kinase 



DNA molecule made as a copy of mRNA and therefore lack- 
ing the introns that are present in genomic DNA. 

Cdtl 

Protein essential in the preparation of DNA for replication. 
With Cdc6 it binds to origin recognition complexes on chro- 
mosomes and helps load the Mem proteins on to the com- 
plex, forming the prereplicative complex. 

cell adhesion molecule — see CAM 

cell cortex 

Specialized layer of cytoplasm on the inner face of the 
plasma membrane. In animal cells it is an actin-rich layer 
responsible for movements of the cell surface. 

cell cycle (cell-division cycle) 

Reproductive cycle of a cell: the orderly sequence of events 
by which a cell duplicates its chromosomes and, usually, the 
other cell contents, and divides into two. (Figure 17-4) 

cell-cycle control system 

Network of regulatory proteins that governs progression of a 
eucaryotic cell through the cell cycle. 

cell division 

Separation of a cell into two daughter cells. In eucaryotic 
cells it entails division of the nucleus (mitosis) closely fol- 
lowed by division of the cytoplasm (cytokinesis). 

cell-division-cycle gene — see Cdc gene 

cell fate 

In developmental biology, describes what a particular cell at 
a given stage of development will normally give rise to. 

cell-free system 

Fractionated cell homogenate that retains a particular bio- 
logical function of the intact cell, and in which biochemical 
reactions and cell processes can be more easily studied. 

cell line 

Population of cells of plant or animal origin capable of divid- 
ing indefinitely in culture. 

cell memory 

Retention by cells and their descendants of persistently 
altered patterns of gene expression, without any change in 
DNA sequence. See also epigenetic inheritance. 

cell plate 

Flattened membrane-bounded structure that forms by fus- 
ing vesicles in the cytoplasm of a dividing plant cell and is 
the precursor of the new cell wall. 

cell senescence — see replicative cell senescence 

cell signaling 

The processes in which cells are stimulated or inhibited by 
extracellular signals, usually chemical signals produced by 
other cells. 

cell transformation — see transformation 

cellularization 

The formation of cells around each nucleus in a multinucle- 
ate cytoplasm, transforming it into a multicellular structure. 

cellulose 

Structural polysaccharide consisting of long chains of cova- 
lently linked glucose units. Provides tensile strength in plant 
cell walls. (Figures 19-78 and 19-79) 
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cell wall 

Mechanically strong extracellular matrix deposited by a cell 
outside its plasma membrane. Prominent in most plants, 
bacteria, archaea, algae, and fungi. Not present in most ani- 
mal cells. 

central lymphoid organ (primary lymphoid organ) 

Organ in which lymphocytes are produced from precursor 
cells. In adult mammals, these are the thymus and bone 
marrow. 

centriole 

Short cylindrical array of microtubules, closely similar in 
structure to a basal body. A pair of centrioles is usually found 
at the center of a centrosome in animal cells. (Figure 16-31) 

centromere 

Constricted region of a mitotic chromosome that holds sister 
chromatids together. Also the site on the DNA where the 
kinetochore forms that captures microtubules from the 
mitotic spindle. (Figure 4-50) 

centrosome 

Centrally located organelle of animal cells that is the primary 
microtubule-organizing center (MTOC) and acts as the spin- 
dle pole during mitosis. In most animal cells it contains a 
pair of centrioles. (Figures 16-30 and 17-29) 

CG island 

Region of DNA with a greater than average density of CG 
sequences; these regions generally remain unmethylated. 

CGN — see cis Golgi network 

channel (membrane channel) 

Transmembrane protein complex that allows inorganic ions 
or other small molecules to diffuse passively across the lipid 
bilayer. (Figures 1 1-3 and 1 1-4) 

channel-forming junction 

Cell- cell junction that links the cytoplasm of adjacent cells 
and provides a passageway for small molecules and ions to 
pass from cell to cell. In animal tissues, composed of connexin 
or innexin proteins. In plants, a similar function is performed 
by plasmodesmata. (Figure 19-2, and Table 19-1, p. 1133) 

channel protein 

Membrane transport protein that forms an aqueous pore in 
the membrane through which a specific solute, usually an 
ion, can pass. Compare transporter. 

chaperone (molecular chaperone) 

Protein that helps guide the proper folding of other proteins, 
or helps them avoid misfolding. Includes heat shock proteins 

(Hsp). 

checkpoint 

Point in the eucaryotic cell-division cycle where progress 
through the cycle can be halted until conditions are suitable 
for the cell to proceed. 

chelate 

To combine reversibly, usually with high affinity, with a metal 
ion such as iron, calcium, or magnesium. 

chemical biology 

Name given to a strategy that uses large-scale screening of 
hundreds of thousands of small molecules in biological 
assays to identify chemicals that affect a particular biological 
process and that can then be used to study it. 

chemical bond 

Chemical affinity between two atoms that holds them 
together. Types found in living cells include covalent bonds 
and noncovalent bonds. {See also ionic bond, hydrogen bond.) 

chemiosmotic coupling (chemiosmosis) 

Mechanism in which a gradient of hydrogen ions (a pH gra- 
dient, or proton gradient) across a membrane is used to drive 
an energy-requiring process, such as ATP production or the 
rotation of bacterial flagella. 



chemokine 

Chemotactic cytokine. Small secreted protein that attracts 
cells, such as white blood cells, to move towards its source. 
Important in the functioning of the immune system. 

chemotaxis 

Directed movement of a cell or organism towards or away 
from a diffusible chemical. 

chiasma (plural chiasmata) 

X-shaped connection visible between paired homologous 
chromosomes during meiosis. Represents a site of chromo- 
somal crossing-over, a form of genetic recombination. 

chimera 

Whole organism formed from an aggregate of two or more 
genetically different populations of cells (two or more geno- 
types), originating from different zygotes. Compare mosaic. 

ChIP — see chromatin immunoprecipitation 

chitin 

Abundant organic polymeric polysaccharide of Af-acetylglu- 
cosamine. A major component of insect exoskeletons and 
the cell walls of fungi. 

chlorophyll 

Light-absorbing green pigment that plays a central part in 
photosynthesis in bacteria, plants, and algae. 

chloroplast 

Organelle in green algae and plants that contains chlorophyll 
and carries out photosynthesis. A specialized form of plastid. 

cholesterol 

An abundant lipid molecule with a characteristic four-ring 
steroid structure. An important component of the plasma 
membranes of animal cells. (Figure 10-4) 

chondrocyte (cartilage cell) 

Connective-tissue cell that secretes the matrix of cartilage. 

chromatid 

One of the two copies of a duplicated chromosome formed 
by DNA replication during S phase. The two chromatids, 
called sister chromatids, are joined at the centromere. 

chromatin 

Complex of DNA, histones, and nonhistone proteins found 
in the nucleus of a eucaryotic cell. The material of which 
chromosomes are made. 

chromatin immunoprecipitation (ChIP) 

Technique by which chromosomal DNA bound by a particular 
protein can be isolated and identified by precipitating it by 
means of an antibody against the bound protein. (Figure 7-32) 

chromatin remodeling complex 

Enzyme complex that alters histone-DNA configurations in 
eucaryotic chromosomes, changing the accessibility of the 
DNA to other proteins, notably those involved in transcription. 

chromatography 

Broad class of biochemical techniques in which a mixture of 
substances is separated by charge, size, hydrophobiciry, non- 
covalent binding affinities, or some other property by allow- 
ing the mixture to partition between a moving phase and a 
stationary phase. Used to separate mixtures of proteins or 
nucleic acids. See also affinity-, DNA affinity-, and high-per- 
formance liquid chromatography. (Figures 8-13 and 8-14) 

chromosome 

Structure composed of a very long DNA molecule and asso- 
ciated proteins that carries part (or all) of the hereditary 
information of an organism. Especially evident in plant and 
animal cells undergoing mitosis or meiosis, during which 
each chromosome becomes condensed into a compact rod- 
like structure visible in the light microscope. 

ciliate 

Single-celled eucaryotic organism (protozoan) characterized 
by numerous cilia on its surface. 
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cilium (plural cilia) 

Hairlike extension of a eucaryotic cell containing a core 
bundle of microtubules. Many cells contain a single non- 
motile cilium, while others contain large numbers that per- 
form repeated beating movements. Compare flagellum. 

circadian clock (circadian rhythm) 

Internal cyclical process that produces a particular change in 
a cell or organism with a period of around 24 hours, for 
example the sleep -wakefulness cycle in humans. 

cis 

On the same or near side. 

cisterna (plural cisternae) 

Flattened membrane-bounded compartment, as found in 
the endoplasmic reticulum or Golgi apparatus. (Figures 13-3 
and 13-25) 

citric acid cycle (tricarboxylic acid (TCA) cycle, Krebs cycle) 

Central metabolic pathway found in aerobic organisms. 
Oxidizes acetyl groups derived from food molecules, gener- 
ating the activated carriers NADH and FADH 2 , some GTI> 
and waste CO2. In eucaryotic cells it occurs in the mitochon- 
dria. (Panel 2-9, pp. 122-123) 

CKI — see Cdk inhibitor protein 

class I MHC protein 

One of two classes of MHC protein. Found on the surface of 
almost all vertebrate cell types, where it can present peptides 
derived from an infecting intracellular microbe (such as a 
virus) to cytotoxic T cells. (Figure 25-50) 

class II MHC protein 

One of two classes of MHC protein. Found on the surface of 
various antigen-presenting cells, where it presents foreign 
peptides to helper T cells. (Figure 25-50) 

class switch 

Change from making one class of immunoglobulin (for 
example, IgM) to making another class (for example, IgG) 
that many B cells undergo during the course of an adaptive 
immune response. Involves DNA rearrangement called 
class-switch recombination. (Figure 25-41) 

clathrin 

Protein that assembles into a polyhedral cage on the cytoso- 
lic side of a membrane so as to form a clathrin-coated pit, 
which buds off by endocytosis to form an intracellular 
clathrin-coated vesicle. (Figure 13-6) 

cleavage 

(1) Physical splitting of a cell into two. (2) Specialized type of 
cell division seen in many early embryos whereby a large cell 
becomes subdivided into many smaller cells without growth. 

clonal selection theory 

Theory that explains how the adaptive immune system can 
respond to millions of different antigens in a highly specific 
way. From a population of lymphocytes with a vast repertoire 
of randomly generated antigen-specific receptors, a given 
foreign antigen activates (selects) only those lymphocyte 
clones that display a receptor that fits the antigen. (Figure 
25-8) 

clone 

Population of identical individuals (cells or organisms) 
formed by repeated (asexual) division from a common 
ancestor. Also used as a verb: "to clone a gene" means to cre- 
ate multiple copies of a gene by growing a clone of carrier 
cells (such as E. coll) into which the gene has been intro- 
duced, and from which it can be recovered, by recombinant 
DNA techniques. 

cloning vector 

Small DNA molecule, usually derived from a bacteriophage 
or plasmid, which is used to carry the fragment of DNA to be 
cloned into the recipient cell, and which enables the DNA 
fragment to be replicated. (Figure 8-39) 



coactivator 

Protein that does not itself bind DNA but assembles on other 
DNA-bound gene regulatory proteins to activate transcrip- 
tion of a gene. (Figure 7-51) 

coated vesicle 

Small membrane-bounded organelle with a cage of proteins 
(the coat) on its cytosolic surface. Formed by the pinching off 
of a coated region of membrane (coated pit). Some coats are 
made of clathrin, others are made from other proteins. 

codon 

Sequence of three nucleotides in a DNA or mRNA molecule 
that represents the instruction for incorporation of a specific 
amino acid into a growing polypeptide chain. 

coenzyme 

Small molecule tightly associated with an enzyme that par- 
ticipates in the reaction that the enzyme catalyzes, often by 
forming a covalent bond to the substrate. Examples include 
biotin, NAD + , and coenzyme A. 

coenzyme A (CoA) 

Small molecule used in the enzymatic transfer of acyl 
groups. See also acetyl CoA, and Figure 2-62. 

cofactor 

Inorganic ion or coenzyme required for an enzyme's activity. 

cohesin, cohesin complex 

Complex of proteins that holds sister chomatids together 
along their length before their separation. (Figure 17-24) 

coiled-coil 

Especially stable rodlike protein structure formed by two or 
more alpha helices coiled around each other. (Figure 3-9) 

co-immunoprecipitation (co-IP) 

Method of isolating proteins that form a complex with each 
other by using an antibody specific for one of the partners. 

collagen 

Fibrous protein rich in glycine and proline that is a major 
component of the extracellular matrix in animals, conferring 
tensile strength. Exists in many forms: type I, the most com- 
mon, is found in skin, tendon, and bone; type II is found in 
cartilage; type IV is present in basal laminae. (Figures 3-23 
and 19-66) 

colony-stimulating factor (CSF) 

General name for numerous signal molecules that control 
differentiation of blood cells. 

complement system 

System of serum proteins activated by antibody-antigen 
complexes or by microorganisms. Helps eliminate patho- 
genic microorganisms by directly causing their lysis or by 
promoting their phagocytosis. 

complementary 

(1) Of nucleic acid sequences: capable of forming a perfect 
base-paired duplex with each other. (Figure 4-4) (2) Of other 
interacting molecules, such as an enzyme and its substrate: 
having biochemical or structural features that marry up, so 
that noncovalent bonding is facilitated. (Figure 2-16) 

complementary DNA — see cDNA 

complementation (genetic complementation) 

Phenomenon in which the mating of two individuals, each 
showing an abnormal phenotype, results in offspring in 
which the normal (wild-type) phenotype has been restored. 
Basis of a test of whether two mutations are in the same or 
different genes. (Panel 8-1, pp. 554-555) 

complex trait 

Heritable characteristic whose transmission to progeny does 
not obey simple Mendelian laws. Such traits are due to the 
interaction of multiple genes and/ or gene-environment 

interactions. 
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condensation reaction (dehydration reaction) 

Chemical reaction in which two molecules are covalently 
linked through -OH groups with the removal of a molecule of 
water. 

condensin (condensin complex) 

Complex of proteins involved in chromosome condensation 
prior to mitosis. Target for M-Cdk. (Figure 17-27) 

conditional mutation 

Mutation that changes a protein or RNA molecule so that its 
function is altered only under some conditions, such as at an 
unusually high or unusually low temperature. (Panel 8-1, pp. 

554-555) 

confocal microscope 

Type of light microscope that produces a clear image of a 
given plane within a solid object. It uses a laser beam as a 
pinpoint source of illumination and scans across the plane 
to produce a two-dimensional "optical section." 

connective tissue 

Any supporting tissue that lies between other tissues and 
consists of cells embedded in a relatively large amount of 
extracellular matrix. Includes bone, cartilage, and loose con- 
nective tissue. 

connexin 

Protein component of gap junctions, a four-pass transmem- 
brane protein. Six connexins assemble in the plasma mem- 
brane to form a connexon, or 'hemichannel.' (Figure 19-34) 

connexon 

Water-filled pore in the plasma membrane formed by a ring 
of six connexin protein subunits. Half of a gap junction: con- 
nexons from two adjoining cells join to form a continuous 
channel through which ions and small molecules can pass. 
(Figure 19-34) 

consensus sequence 

Average or most typical form of a sequence that is reproduced 
with minor variations in a group of related DNA, RNA, or pro- 
tein sequences. Indicates the nucleotide or amino acid most 
often found at each position. Preservation of a sequence 
implies that it is functionally important. (Figure 6-12) 

constitutive 

Occurring steadily regardless of circumstances; opposite of 
regulated. 

constitutive secretory pathway (default pathway) 

Pathway present in all cells by which molecules such as 
plasma membrane proteins are continually delivered to the 
plasma membrane from the Golgi apparatus in vesicles that 
fuse with the plasma membrane. The default route to the 
plasma membrane if no other sorting signals are present. 
(Figure 13-63) 

contractile ring 

Ring containing actin and myosin that forms under the sur- 
face of animal cells undergoing cell division. Contracts to 
pinch the two daughter cells apart. (Figure 17-50) 

convergent extension 

Rearrangement of cells within a tissue that causes it to extend 
in one dimension and shrink in another. (Figure 22-76) 

cooperativity 

Phenomenon in which the binding of one ligand molecule to 
a target molecule promotes the binding of successive ligand 
molecules. Seen in the assembly of large complexes, as well 
as in enzymes and receptors composed of multiple allosteric 
subunits, where it sharpens the response to a ligand. (Figure 
15-25) 

co-receptor 

In immunology: receptor on B cells or T cells that does not 
bind antigen but binds to other molecules and helps the 
antigen-binding receptors activate the lymphocyte. (Figure 
25-57) More generally: a receptor that collaborates with 



another conventional cell-surface receptor, helping the cell 
respond to secreted signal proteins. Examples are LRP (in the 
Wnt/p-catenin signaling pathway) and cell-surface proteo- 
glycans. (Figure 15-77) 

co-repressor 

Protein that does not itself bind DNA but assembles on other 
DNA-bound gene regulatory proteins to inhibit the expres- 
sion of a gene. (Figure 7-51) 

cortical granule 

Specialized secretory vesicle present under the plasma 
membrane of unfertilized eggs, including those of mammals. 
The contents of the cortical granules, released by exocytosis 
after fertilization, alter the egg coat so as to prevent the entry 
of further sperm. 

co-translational 

Occurring as translation proceeds. Examples include the 
import of a protein into the endoplasmic reticulum before 
the polypeptide chain is completely synthesized (co-transla- 
tional translocation, Figure 12-35), and the folding of a nas- 
cent protein into its secondary and tertiary structure as it 
emerges from a ribosome. (Figure 6-84) 

co-transport (coupled transport) 

Membrane transport process in which the transfer of one 
molecule depends on the simultaneous or sequential trans- 
fer of a second molecule. (Figure 11-8) 

coupled reaction 

Linked pair of chemical reactions in which the free energy 
released by one serves to drive the other. (Figure 2-51) 

covalent bond 

Stable chemical link between two atoms produced by shar- 
ing one or more pairs of electrons. (Figure 2-5 and Panel 2-1, 
pp. 106-107) 

Cre/lox 

Site-specific recombination system used to produce condi- 
tional mutants in which the target gene can be excised in a 
specific tissue or at a specific time. A site-specific recombi- 
nase (Cre) is introduced under the control of a promoter that 
will activate it as required. The gene to be disrupted is 
flanked by introduced lox sequences, at which the activated 
Cre operates to excise the gene. (Figure 5-79) 

crista (plural cristae) 

Invagination of the inner mitochondrial membrane. 

critical concentration 

Concentration of a protein monomer, such as actin or tubu- 
lin, that is in equilibrium with the assembled form of the pro- 
tein (i.e., assembled into actin filaments or microtubules, 
respectively). (Panel 16-2, pp. 978-979) 

crossover (chiasma) 

In meiotic recombination, a site on the paired chromosomes 
where a segment of a maternal chromatid is exchanged for a 
corresponding segment of a homologous paternal chro- 
matid (Figures 21-6 and 21-10) 

cross-strand exchange — see Holliday junction 

cryptochrome 

Flavoprotein responsive to blue light, found in both plants 
and animals. In animals, it is involved in circadian rhythms. 

C terminus (carboxyl terminus) 

The end of a polypeptide chain that carries a free carboxyl 
(-COOH) group. (Figure 3-1) 

cut-and-paste transposition 

Type of movement of a transposable element in which the 
element is cut out of the DNA and inserted into a new site by 
a special transposase enzyme. (Figure 5-69) 

cyclic AMP (cAMP) 

Nucleotide that is generated from ATP by adenylyl cyclase in 
response to various extracellular signals. It acts as small 
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intracellular signaling molecule, mainly by activating cAMP- 
dependent protein kinase (PKA). It is hydrolyzed to AMP by a 
phosphodiesterase. (Figure 15-34) 

cyclic AMP-dependent protein kinase (protein kinase A, PKA) 

Enzyme that phosphorylates target proteins in response to a 

rise in intracellular cyclic AMP (Figure 15-35) 

cyclic GMP (cGMP) 

Nucleotide that is generated from GTP by guanylyl cyclase in DAG— see diacylglycerol 

response to various extracellular signals. 



cytosol 

Contents of the main compartment of the cytoplasm, 
excluding membrane-bounded organelles such as endoplas- 
mic reticulum and mitochondria. 

cytotoxic T cell (killer T cell) 

Type of T cell responsible for killing host cells infected with a 
virus or another type of intracellular pathogen. 



cyclin 

Protein that periodically rises and falls in concentration in 
step with the eucaryotic cell cycle. Cyclins activate crucial 
protein kinases (called cyclin-dependent protein kinases, or 
Cdks) and thereby help control progression from one stage 
of the cell cycle to the next. 

cyclin-Cdk complex 

Protein complex formed periodically during the eucaryotic 
cell cycle as the level of a particular cyclin increases. A cyclin- 
dependent kinase (Cdk) then becomes partially activated. 
(Figures 17-15 and 17-16, and Table 17-1, p. 1063) 

cyclin-dependent kinase (Cdk) 

Protein kinase that has to be complexed with a cyclin protein 
in order to act. Different Cdk-cyclin complexes trigger differ- 
ent steps in the cell-division cycle by phosphorylating spe- 
cific target proteins. (Figure 17-15) 

cyclosome — see anaphase-promoting complex 

cytochrome 

Colored heme-containing protein that transfers electrons 
during respiration and photosynthesis. 

cytochrome b-c\ complex 

Second of the three electron-driven proton pumps in the 
respiratory chain. Accepts electrons from ubiquinone. 
(Figure 14-26) 

cytochrome c 

Soluble component of the mitochondrial electron-transport 
chain. Its release into the cytosol from the mitochondrial 
intermembrane space also initiates apoptosis. (Figure 14-26) 

cytochrome oxidase complex 

Third of the three electron-driven proton pumps in the res- 
piratory chain. It accepts electrons from cytochrome c and 
generates water using molecular oxygen as an electron 
acceptor. (Figure 14-26) 

cytokine 

Extracellular signal protein or peptide that acts as a local 
mediator in cell-cell communication. 

cytokine receptor 

Cell-surface receptor that binds a specific cytokine or hor- 
mone and acts through the lak-STAT signaling pathway. 
(Figure 15-68) 

cytokinesis 

Division of the cytoplasm of a plant or animal cell into two, 
as distinct from the associated division of its nucleus (which 
is mitosis). Part of M phase. (Panel 17-1, pp. 1072-1073) 

cytoplasm 

Contents of a cell that are contained within its plasma mem- 
brane but, in the case of eucaryotic cells, outside the nucleus. 

cytoplasmic tyrosine kinase 

Enzymes activated by certain cell-surface receptors (tyro- 
sine-kinase-associated receptors) that transmit the receptor 
signal onwards by phosphorylating target cytoplasmic pro- 
teins on tyrosine side chains. 

cytoskeleton 

System of protein filaments in the cytoplasm of a eucaryotic 
cell that gives the cell shape and the capacity for directed 
movement. Its most abundant components are actin fila- 
ments, microtubules, and intermediate filaments. 



dalton 

Unit of molecular mass. Approximately equal to the mass of 
a hydrogen atom (1.66 x 10 -24 g). 

death receptor 

Transmembrane receptor protein that can signal the cell to 
undergo apoptosis when it binds its extracellular ligand. 
(Figure 18-6) 

default secretory pathway— see constitutive secretory pathway 

degenerate 

Not a moral judgment but an adjective describing multiple 
states that amount to the same thing: different triplet combi- 
nations of nucleotide bases (codons) that code for the same 
amino acid, for example. 

dehydration reaction — see condensation reaction 

dehydrogenase 

Enzyme that removes a hydride ion (H - ), equivalent to a pro- 
ton plus two electrons from a substrate molecule. 

delta G — see free-energy change 

delta G° — see standard free- energy change 

denaturation 

Dramatic change in conformation of a protein or nucleic 
acid caused by heating or by exposure to chemicals. Usually 
results in the loss of biological function. 

dendrite 

Extension of a nerve cell, often elaborately branched, that 
receives stimuli from other nerve cells. 

dendritic cell 

The most potent type of antigen-presenting cell, which takes 
up antigen and processes it for presentation to T cells. 

deoxyribonucleic acid — see DNA 

deoxyribose 

The five-carbon monosaccharide component of DNA. 
Differs from ribose in having H at the 2-carbon position 
rather than OH. C5H10O4. Compare ribose. 

depolarization 

Shift in a cell's membrane potential to a less negative value 
inside. 

dermis 

Thick underlying layer of connective tissue in the skin, 
beneath the epidermis. Rich in collagen. 

desensitization — see adaptation 

desmosome 

Anchoring cell-cell junction, usually formed between two 
epithelial cells. Characterized by dense plaques of protein 
into which intermediate filaments in the two adjoining cells 
insert. (Figure 19-3) 

determination 

In developmental biology, an embryonic cell is said to be 
determined if it has become committed to a particular spe- 
cialized path of development. Determination reflects a change 
in the internal character of the cell, and it precedes the much 
more readily detected process of cell differentiation. 

diacylglycerol (DAG) 

Lipid produced by the cleavage of inositol phospholipids in 
response to extracellular signals. Composed of two fatty acid 
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chains linked to glycerol, it serves as a small signaling mole- 
cule to help activate protein kinase C (PKC). (Figure 15-38) 

dideoxy method 

The standard enzymatic method of DNA sequencing. (Figure 
8-50) 

differential-interference-contrast microscope 

Type of light microscope that exploits the interference effects 
that occur when light passes through parts of a cell of differ- 
ent refractive indexes. Used to view unstained living cells. 

differentiation 

Process by which a cell undergoes a change to an overtly spe- 
cialized cell type. 

diffusion 

Net drift of molecules in the direction of lower concentration 
due to random thermal movement. 

diploid 

Containing a double genome (two sets of homologous chro- 
mosomes and hence two copies of each gene or genetic 
locus). Compare haploid. 

dissociation constant (Ad) 

Measure of the tendency of a complex to dissociate. For com- 
ponents A and B and the binding equilibrium A + B ^ AB, the 
dissociation constant is given by [A] [B] / [AB] , and it is smaller 
the tighter the binding between A and B. The dissociation 
constant {K£ is the reciprocal of K a . See also affinity con- 
stant, equilibrium constant. (Figure 3-43) 

disulfide bond (-S-S-) 

Covalent linkage formed between two sulfhydryl groups on 
cysteines. For extracellular proteins, a common way of join- 
ing two proteins together or linking different parts of the 
same protein. Formed in the endoplasmic reticulum of 
eucaryotic cells. (Panel 2-1, p. 107, and Figure 3-28) 

DNA (deoxyribonucleic acid) 

Polynucleotide formed from covalently linked deoxyribonu- 
cleotide units. The store of hereditary information within a 
cell and the carrier of this information from generation to 
generation. (Figure 4-3 and Panel 2-6, pp. 116-117) 

DNA affinity chromatography 

Technique for purifying sequence-specific DNA-binding 
proteins by their binding to a matrix to which the appropri- 
ate DNA fragments are attached. (Figure 7-28) 

DNA footprinting 

Technique for determining the DNA sequence to which a 
DNA-binding protein binds. (Figure 7-29) 

DNA helicase 

Enzyme that is involved in opening the DNA helix into its 
single strands for DNA replication. 

DNA library 

Collection of cloned DNA molecules, representing either an 
entire genome (genomic library) or complementary DNA 
copies of the mRNA produced by a cell (cDNA library). 

DNA ligase 

Enzyme that joins the ends of two strands of DNA together 
with a covalent bond to make a continuous DNA strand. 

DNA methylation 

Addition of methyl groups to DNA. Extensive methylation of 
the cytosine base in CG sequences is used in vertebrates to 
keep genes in an inactive state. 

DNA microarray 

A large array of short DNA molecules (each of known 
sequence) bound to a glass microscope slide or other suit- 
able support. Used to monitor expression of thousands of 
genes simultaneously: mRNA isolated from test cells is con- 
verted to cDNA, which in turn is hybridized to the microar- 
ray. (Figure 8-73) 



DNA-only transposon 

Transposable element that exists as DNA throughout its life 
cycle. Many move by cut- and -paste transposition. See also 
transposon. 

DNA polymerase 

Enzyme that synthesizes DNA by joining nucleotides 
together using a DNA template as a guide. 

DNA primase 

Enzyme that synthesizes a short strand of RNA on a DNA tem- 
plate, producing a primer for DNA synthesis. (Figure 5-11) 

DNA replication 

Process by which a copy of a DNA molecule is made. 

DNA tumor virus 

General term for a variety of different DNA viruses that can 
cause tumors. (Figure 20-43) 

domain (protein domain) 

Portion of a protein that has a tertiary structure of its own. 
Larger proteins are generally composed of several domains, 
each connected to the next by short flexible regions of 
polypeptide chain. Homologous domains are recognized in 
many different proteins. 

dominant 

In genetics, the member of a pair of alleles that is expressed 
in the phenotype of an organism while the other allele is not, 
even though both alleles are present. Opposite of recessive. 
(Panel 8-1, pp. 554-555) 

dominant negative mutation 

Mutation that dominantly affects the phenotype, blocking 
gene activity and causing a loss-of-function phenotype even 
in the presence of a normal copy of the gene. (Panel 8-1, pp. 
554-555) 

double helix 

The three-dimensional structure of DNA, in which two anti- 
parallel DNA chains, held together by hydrogen bonding 
between the bases, are wound into a helix. (Figure 4-5) 

Drosophila melanogaster 

Species of small fly, commonly called a fruit fly A model 
organism in molecular genetics. 

duplex DNA 

Double-stranded DNA. 

dynamic instability 

Sudden conversion from growth to shrinkage, and vice versa, 
in a protein filament such as a microtubule or actin filament. 
(Panel 16-2, pp. 978-979) 

dynamin 

Cytosolic GTPase that binds to the neck of a clathrin-coated 
vesicle in the process of budding from the membrane, and 
which is involved in completing vesicle formation. 

dynein 

Large motor protein that undergoes ATP-dependent move- 
ment along microtubules. 

E. coli — see Escherichia coli 

ectoderm 

Embryonic epithelial tissue that is the precursor of the epi- 
dermis and nervous system. 

effector cell 

Cell that carries out the final response or function in a par- 
ticular process. The main effector cells of the immune sys- 
tem, for example, are activated lymphocytes and phagocytes 
that help eliminate pathogens. 

E2F protein 

Gene regulatory protein that switches on many genes that 
encode proteins required for entry into the S-phase of the 
cell cycle. 
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elF — see eucaryotic initiation factor 

elastic fiber 

Extensible fiber formed by the protein elastin in many ani- 
mal connective tissues, such as in skin, blood vessels, and 
lungs, which gives them their stretchability and resilience. 

elastin 

Extracellular protein that forms extensible fibers (elastic 
fibers) in connective tissues. 

electrochemical gradient 

Combined influence of a difference in the concentration of 
an ion on two sides of a membrane and the electrical charge 
difference across the membrane (membrane potential). Ions 
or charged molecules can move passively only down their 
electrochemical gradient. 

electron 

Negatively charged subatomic particle that orbits the 
nucleus in an atom. (Figure 2-1) 

electron acceptor 

Atom or molecule that takes up electrons readily, thereby 
gaining an electron and becoming reduced. 

electron carrier 

Molecule such as cytochrome c, NADH, NADPH, and FADH 2 
which carries electrons and transfers them from donor mol- 
ecules to acceptor molecules. See also electron transport 
chain, and Figure 2-60. 

electron donor 

Molecule that easily gives up an electron, becoming oxidized 
in the process. 

electron microscope 

Microscope that uses a beam of electrons to create the 
image. 

electron-microscope tomography (EM tomography) 

Technique for viewing three-dimensional specimens in the 
electron microscope in which multiple views are taken from 
different directions by tilting the specimen holder. The views 
are combined computationally to give a three-dimensional 
image. 

electron- transport chain 

Series of reactions in which electron carrier molecules pass 
electrons 'down the chain' from a higher to successively 
lower energy levels, to a final acceptor molecule. The energy 
released during electron movement can be used to power 
various processes. Electron-transport chains present in the 
inner mitochondrial membrane (called the respiratory 
chain) and in the thylakoid membrane of chloroplasts gener- 
ate a proton gradient across the membrane that is used to 
drive ATP synthesis. See especially Figures 14-3 and 14-10. 

electrophoresis 

Technique for separating molecules (typically proteins or 
nucleic acids) on the basis of their speed of migration 
through a porous medium when subjected to a strong elec- 
tric field. 

electroporation 

Method for introducing DNA into cells, especially bacteria, 
in which a brief electric shock makes the cell membrane 
temporarily permeable to the foreign DNA. 

elongation factor (EF) 

Nomenclature used in both transcription and translation. In 
transcription, elongation factors associate with RNA poly- 
merase and allow it to transcribe long stretches of DNA with- 
out dissociating. In translation, elongation factors bind to 
the ribosome and, by hybridizing GTP, drive the addition of 
amino acids to the growing polypeptide chain. 

EM tomography — see electron-microscope tomography 

embryonic stem cell (ES cell) 

Cell derived from the inner cell mass of the early mammalian 



embryo. Capable of giving rise to all the cells in the body. Can 
be grown in culture, genetically modified, and inserted into a 
blastocyst to develop a transgenic animal. 

endocrine 

Relating to hormones or the glands that secrete them. 

endocrine cell 

Specialized animal cell that secretes a hormone into the 
blood. Usually part of a gland, such as the thyroid or pituitary 
gland. 

endocrine signaling 

Signaling via hormones released by endocrine glands into 
the bloodstream and carried to distant target cells that have 
receptors that bind the specific hormone. (Figures 15-4 and 
15-5) 

endocytosis 

Uptake of material into a cell by an invagination of the 
plasma membrane and its internalization in a membrane- 
bounded vesicle. See also pinocytosis and phagocytosis. 

endoderm 

Embryonic tissue that is the precursor of the gut and associ- 
ated organs. 

endonuclease 

Enzyme that cleaves nucleic acids within the polynucleotide 
chain. Compare exonuclease. 

endoplasmic reticulum (ER) 

Labyrinthine membrane-bounded compartment in the 
cytoplasm of eucaryotic cells, where lipids are synthesized 
and membrane-bound proteins and secretory proteins are 
made. (Figure 12-36) 

endosome 

Membrane-bounded organelle in animal cells that carries 
materials newly ingested by endocytosis and passes many of 
them on to lyso somes for degradation. 

endothelial cell 

Flattened cell type that forms a sheet (the endothelium) lin- 
ing all blood and lymphatic vessels. 

enhancer 

Regulatory DNA sequence to which gene regulatory proteins 
bind, increasing the rate of transcription of a structural gene 
that can be many thousands of base pairs away. 

entropy (5) 

Thermodynamic quantity that measures the degree of disor- 
der or randomness in a system; the higher the entropy, the 
greater the disorder. (Panel 2-7, pp. 118-119) 

enveloped virus 

Virus with a capsid surrounded by a lipid bilayer membrane 
(the envelope), which is derived from the host cell plasma 
membrane when the virus buds from the cell. (Figure 24-15) 

enzyme 

Protein that catalyzes a specific chemical reaction. 

enzyme-coupled receptor 

A major type of cell-surface receptor that has a cytoplasmic 
domain that either has enzymatic activity or is associated 
with an intracellular enzyme. In either case, the enzymatic 
activity is stimulated by an extracellular ligand binding to the 
receptor. (Figure 15-16) 

Eph receptor 

The most numerous type of receptor tyrosine kinase (RTK) 
that recognizes Ephrins. (Figure 15-52) 

Ephrin 

One of a family of membrane-bound protein ligands for the 
Eph receptor tyrosine kinases (RTKs) that, among many 
other functions, stimulate repulsion or attraction responses 
that guide the migration of cells and nerve cell axons during 
animal development. 
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epidermis 

Epithelial layer covering the outer surface of the body. Has 
different structures in different animal groups. The outer 
layer of plant tissue is also called the epidermis. 

epigenetic inheritance 

Inheritance of phenotypic changes in a cell or organism that 
do not result from changes in the nucleotide sequence of 
DNA. Can be due to positive feedback loops of gene regula- 
tory proteins or to heritable modifications in chromatin such 
as DNA methylation or histone modifications causing hete- 
rochromatin formation. (Figures 4-35 and 7-86) 

epinephrine — see adrenaline 

epistatic 

Describes a mutation in one gene that masks the effect of a 
mutation in another gene when both mutations are present 
in the same organism or cell. 

epithelium (plural epithelia) 

Sheet of cells covering the outer surface of a structure or lin- 
ing a cavity. 

epitope — see antigenic determinant 

equilibrium constant (K) 

Ratio of forward and reverse rate constants for a reaction. 
Equal to the association or affinity constant {K a ) for a simple 
binding reaction (A + B ^ AB). See also affinity constant, dis- 
sociation constant. (Figure 3-43) 

ER — see endoplasmic reticulum 

ER lumen 

Space enclosed by the membrane of the endoplasmic 
reticulum (ER). 

ER resident protein 

Protein that remains in the endoplasmic reticulum (ER) or its 
membranes and carries out its function there, as opposed to 
proteins that are present in the ER only in transit. 

ER retention signal 

Short amino acid sequence on a protein that prevents it from 
moving out of the endoplasmic reticulum (ER). Found on 
proteins that are resident in the ER and function there. 

ER signal sequence 

N-terminal signal sequence that directs proteins to enter the 
endoplasmic reticulum (ER). Cleaved off by signal peptidase 
after entry. 

erythrocyte (red blood cell) 

Small hemoglobin- containing blood cell of vertebrates that 
transports oxygen to, and carbon dioxide from, tissues. 
(Figure 11^10) 

erythropoietin 

A hormone produced by the kidney that stimulates the pro- 
duction of red blood cells in bone marrow. 

ES cell — see embryonic stem cell 

Escherichia coli (E. coli) 

Rodlike bacterium normally found in the colon of humans 
and other mammals and widely used in biomedical research. 

ester 

Molecule formed by the condensation reaction of an alcohol 
group with an acidic group. Phosphate groups usually form 
esters when linked to a second molecule. (Panel 2-1, pp. 
106-107) 

eubacteria 

True bacteria, in contradistinction to archaea (archaebacte- 
ria). (Figure 1-21) 

eucaryote (eukaryote) 

Organism composed of one or more cells that have a distinct 
nucleus. Member of one of the three main divisions of the 
living world, the other two being Bacteria and Archaea. 
(Figure 1-21) 



eucaryotic initiation factor (elF) 

Protein that helps load initiator tRNA on to the ribosome, 
thus initiating translation. 

euchromatin 

Region of an interphase chromosome that stains diffusely; 
"normal" chromatin, as opposed to the more condensed het- 
erochromatin. 

excision repair — see base excision repair 

exocytosis 

Process by which most molecules are secreted from a 
eucaryotic cell. These molecules are packaged into mem- 
brane-bounded vesicles that fuse with the plasma mem- 
brane and release their contents to the outside. 

exon 

Segment of a eucaryotic gene that consists of a sequence of 
nucleotides that will be represented in mRNA or in the final 
transfer, ribosomal, or other mature RNA molecule. In pro- 
tein-coding genes, exons encode the amino acids in the pro- 
tein. An exon is usually adjacent to a noncoding DNA seg- 
ment called an intron. (Figure 4-15) 

exonuclease 

Enzyme that cleaves nucleotides one at a time from the ends 
of polynucleotides. Compare endonuclease. 

expression vector 

A virus or plasmid that carries a DNA sequence into a suit- 
able host cell and there directs the synthesis of the protein 
encoded by the sequence. (Figure 8-48) 

extracellular matrix 

Complex network of polysaccharides (such as glycosamino- 
glycans or cellulose) and proteins (such as collagen) secreted 
by cells, and in which the cells are embedded. 

extracellular signal molecule 

Any secreted or cell-surface chemical signal that binds to 
receptors and regulates the activity of the cell expressing the 
receptor. 

facilitated diffusion — see passive transport 

FAD/FADH2 (flavin adenine dinucleotide/reduced flavin 
adenine dinucleotide) 

Electron carrier system that functions in the citric acid cycle. 
One molecule of FAD gains two electrons plus two protons in 
becoming the activated carrier FADH2. (Figure 2-83) 

FAK — see focal adhesion kinase 

Fas (Fas protein, Fas death receptor) 

Transmembrane death receptor that initiates apoptosis when 
it binds its extracellular ligand (Fas ligand). (Figure 18-6) 

fat 

Energy-storage lipid in cells. Composed of triglycerides — 
fatty acids esterified with glycerol. 

fatty acid 

Carboxylic acid with a long hydrocarbon tail. A major source 
of energy during metabolism and a starting point for the syn- 
thesis of phospholipids. (Panel 2-5, pp. 114-115) 

Fc receptor 

One of a family of cell-surface receptors that bind the tail 
region (Fc region) of an antibody (immunoglobulin) mole- 
cule. Different Fc receptors are specific for different classes 
of antibodies such as IgG, IgA, or IgE. 

feedback inhibition 

The process in which a product of a reaction feeds back to 
inhibit a previous reaction in the same pathway. (Figures 
3-56 and 3-57) 

fermentation 

Anaerobic energy-yielding metabolic pathway. In anaerobic 
glycolysis, for instance, pyruvate is converted into lactate or 
ethanol, with the conversion of NADH to NAD + . (Figure 2-71) 
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fertilization 

Fusion of a male and a female gamete (both haploid) to form 
a diploid zygote, which develops into a new individual. 

F Fi ATPase— see ATP synthase 

fibroblast 

Common cell type found in connective tissue. Secretes an 
extracellular matrix rich in collagen and other extracellular 
matrix macromolecules. Migrates and proliferates readily in 
wounded tissue and in tissue culture. 

fibronectin 

Extracellular matrix protein involved in adhesion of cells to 
the matrix and guidance of migrating cells during embryoge- 
nesis. Integrins on the cell surface are receptors for 
fibronectin. 

filopodium (plural filopodia) (microspike) 

Thin, spike-like protrusion with an actin filament core, gen- 
erated on the leading edge of a crawling animal cell. (Figure 
16-47) 

FISH — see fluorescence in situ hybridization 

fission yeast 

Common name for the yeast model organism Schizosaccharo- 
myces pombe. It divides to give two equal-sized cells. 

fixative 

Chemical reagent such as formaldehyde, glutaraldehyde, or 
osmium tetroxide used to preserve cells for microscropy. 
Samples treated with these reagents are said to be 'fixed/ and 
the process is called fixation. 

flagellum (plural flagella) 

Long, whiplike protrusion whose undulations drive a cell 
through a fluid medium. Eucaryotic flagella are longer versions 
of cilia. Bacterial flagella are smaller and completely different 
in construction and mechanism of action. Compare cilium. 

flavin adenine dinucleotide — see FAD/FADH2 

fluorescence 

Light emission exhibited by some substances (fluo- 
rochromes) as their electrons, having been excited by 
absorption of light, return to their normal lower energy state. 
The emitted radiation is always at a lower energy (longer 
wavelength) than the absorbed radiation. 

fluorescence-activated cell sorter (FACS) 

Machine that sorts cells according to their fluorescence. 
(Figure 8-2) 

fluorescence in situ hybridization (FISH) 

Technique in which fluorescently labeled nucleic acid 
probes hybridize to specific DNA or RNA sequences in situ. 

fluorescence microscope 

Microscope designed to view material stained with fluores- 
cent dyes. Similar to a light microscope but the illuminating 
light is passed through one set of filters before the specimen, 
to select those wavelengths that excite the dye, and through 
another set of filters before it reaches the eye, to select only 
those wavelengths emitted when the dye fluoresces. 

fluorescence recovery after photobleaching (FRAP) 

Technique for monitoring the kinetic parameters of a protein 
by analyzing how fluorescent protein molecules move into 
an area of the cell bleached by a beam of laser light. 

fluorescence resonance energy transfer (FRET) 

Technique for monitoring the closeness of two fluorescently 
labeled molecules (and thus their interaction) in cells. 
(Figure 8-26) 

focal adhesion, (focal contact, adhesion plaque) 

Anchoring cell junction, forming a small region on the sur- 
face of a fibroblast or other cell that is anchored to the extra- 
cellular matrix. Attachment is mediated by transmembrane 
proteins such as integrins, which are linked, through other 
proteins, to actin filaments in the cytoplasm. 



focal adhesion kinase (FAK) 

Cytoplasmic tyrosine kinase present at cell-matrix junctions 
(focal adhesions) in association with the cytoplasmic tails of 
integrins. 

follicle cell 

One of the cell types that surround a developing oocyte or 
egg. (Figure 21-24) 

footprinting — see DNA footprinting 

FRAP — see fluorescence recovery after photobleaching 

free energy (G) (Gibbs free energy) 

The energy that can be extracted from a system to drive reac- 
tions. Takes into account changes in both energy and 
entropy. (Panel 2-7, pp. 118-119) 

free-energy change (AG) 

Change in the free energy during a reaction: the free energy of 
the product molecules minus the free energy of the starting 
molecules. A large negative value of AG indicates that the reac- 
tion has a strong tendency to occur. (Panel 2-7, pp. 118-119) 

free radical 

Atom or molecule which is extremely reactive by virtue of its 
having at least one unpaired electron. Responsible for much 
intracellular DNA damage. 

free ribosome 

Ribosome that is free in the cytosol, unattached to any mem- 
brane. 

freeze-fracture electron microscopy 

Technique for studying membrane structure, in which the 
membrane of a frozen cell is fractured along the interior of 
the bilayer, separating it into the two monolayers with the 
interior faces exposed. 

FRET — see fluorescence resonance energy transfer 

fungus (plural fungi) 

Kingdom of eucaryotic organisms that includes the yeasts, 
molds, and mushrooms. Many plant diseases and a relatively 
small number of animal diseases are caused by fungi. 

fusion protein 

Engineered protein that combines two or more normally 
separate polypeptides. Produced from a recombinant gene. 

G — see free energy 

AG — see free energy change 

AG — see standard free energy change 

G{ — see inhibitory G protein 

G q 

Class of G protein that couples GPCRs to phospholipase C-(3 
to activate the inositol phospholipid signaling pathway. 

G s — see stimulatory G protein 

GAG — see glycosaminoglycan 

gain-of-function mutation 

Mutation that increases the activity of a gene, or makes it 
active in inappropriate circumstances. Usually dominant. 
(Panel 8-1, pp. 554-555) 

gamete 

Specialized haploid cell (a sperm or egg) formed from pri- 
mordial germ cells by meiosis and specialized for sexual 
reproduction. See also germ cell. 

gamma-tubulin ring complex (YTuRC) 

Protein complex containing y-tubulin and other proteins 
that is an efficient nucleator of microtubules and caps their 
minus ends. 

ganglion (plural ganglia) 

Cluster consisting mainly of neuronal cell bodies and associ- 
ated glial cells, located outside the central nervous system. 
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ganglioside 

Any glycolipid having one or more sialic acid residues in its 
structure. Found in the plasma membrane of eucaryotic cells 
and especially abundant in nerve cells. (Figure 10-18) 

GAP — see GTPase- activating protein 

gap gene 

In Drosophila development, a gene that is expressed in spe- 
cific broad regions along the anteroposterior axis of the early 
embryo, and which helps designate the main divisions of the 
insect body. (Figure 22-37) 

gap junction 

Communicating channel-forming cell-cell junction present 
in most animal tissues that allows ions and small molecules 
to pass from the cytoplasm of one cell to the cytoplasm of the 
next. 

gastrulation 

Stage in animal embryogenesis during which the embryo is 
transformed from a ball of cells to a structure with a gut (a 

gastrula). (Figure 22-3) 

d-Cdk 

Cyclin-Cdk complex formed in vertebrate cells by a Gi-cyclin 
and the corresponding cyclin-dependent kinase (Cdk). 
(Table 17-1, p. 1063) 

Gi-cyclin 

Cyclin present in the Gi phase of the eucaryotic cell cycle. 
Forms complexes with Cdks that help govern the activity of 
the Gi/S-cyclins, which control progression to S-phase. 

GEF — see guanine nucleotide exchange factor 

gel-mobility shift assay 

Technique for detecting proteins bound to a specific DNA 
sequence by the fact that the bound protein slows down the 
migration of the DNA fragment through a gel during gel elec- 
trophoresis. (Figure 7-27) 

gel-transfer hybridization — see blotting 

geminin 

Protein that prevents the formation of new prereplicative 
complexes during S phase and mitosis, thus ensuring that 
the chromosomes are replicated only once in each cell cycle. 

gene 



Region of DNA that is transcribed as a single unit and carries 
information for a discrete hereditary characteristic, usually 
corresponding to (1) a single protein (or set of related pro- 
teins generated by variant post-transcriptional processing), 
or (2) a single RNA (or set of closely related RNAs). 

gene activator protein — see activator 

gene control region 

The set of linked DNA sequences regulating expression of a 
particular gene. Includes promoter and regulatory 
sequences required to initiate transcription of the gene and 
control the rate of initiation. [Figures 7-37 (procaryotes) and 

7-44 (eucaryotes)] 

gene conversion 

Process by which DNA sequence information can be trans- 
ferred from one DNA helix (which remains unchanged) to 
another DNA helix whose sequence is altered. It often 
accompanies general recombination events. (Figure 5-66) 

gene expression 

Production of an observable molecular product (RNA or pro- 
tein) by a gene. 

general recombination, general genetic recombination — see 
homologous recombination 

general transcription factor 

Any of the proteins whose assembly at a promoter is required 
for the binding and activation of RNA polymerase and the 
initiation of transcription. (Table 6-3, p. 341) 



gene regulatory protein 

General name for any protein that binds to a specific DNA 
sequence to influence the transcription of a gene. 

gene repressor protein — see repressor 

genetic code 

Set of rules specifying the correspondence between 
nucleotide triplets (codons) in DNA or RNA and amino acids 
in proteins. (Figure 6-50) 

genetic engineering (recombinant DNA technology) 

Collection of techniques by which DNA segments from dif- 
ferent sources are combined to make a new DNA, often 
called a recombinant DNA. Recombinant DNAs are widely 
used in the cloning of genes, in the genetic modification of 
organisms, and in molecular biology generally. 

genetic instability 

Abnormally increased spontaneous mutation rate, such as 
occurs in cancer cells. 

genetic mosaic — see mosaic 

genetic recombination — see recombination 

genetic redundancy 

The presence of two or more similar genes with overlapping 
functions. 

genetics 

The study of the genes of an organism on the basis of hered- 
ity and variation. 

genetic screen 

Procedure for discovery of genes affecting specific aspects of 
the phenotype by surveying large numbers of mutagenized 
individuals. 

genome 

The totality of genetic information belonging to a cell or an 
organism; in particular, the DNA that carries this information. 

genomic DNA 

DNA constituting the genome of a cell or an organism. Often 
used in contrast to cDNA (DNA prepared by reverse tran- 
scription from mRNA). Genomic DNA clones represent DNA 
cloned directly from chromosomal DNA, and a collection of 
such clones from a given genome is a genomic DNA library. 

genomic imprinting 

Phenomenon in which a gene is either expressed or not 
expressed in the offspring depending on which parent it is 
inherited from. (Figure 7-82) 

genomics 

Study of the DNA sequences and properties of entire 
genomes. 

genotype 

Genetic constitution of an individual cell or organism. The 
particular combination of alleles found in a specific individ- 
ual. (Panel 8-1, pp. 554-555) 

germ cell 

A cell in the germ line of an organism, which includes the 
haploid gametes and their specified diploid precursor cells. 
Germ cells contribute to the formation of a new generation 
of organisms and are distinct from somatic cells, which form 
the body and leave no descendants. 

germ layer 

One of the three primary tissue layers (endoderm, meso- 
derm, and ectoderm) of an animal embryo. (Figure 22-70) 

germ line 

The cell lineage that consists of the haploid gametes and 
their specified diploid precursor cells. 

GFP— see green fluorescent protein 

Gibbs free energy— see free energy 
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giga- 

Prefix denoting one billion (10 y ). 

glial cell 

Supporting nonneural cell of the nervous system. Includes 
oligodendrocytes and astrocytes in the vertebrate central 
nervous system and Schwann cells in the peripheral nervous 

system. 

glucagon 

Hormone produced in the pancreas that stimulates liver 
cells to convert stored glycogen to glucose, leading to a rise 
in blood sugar levels. Effect is opposite to that of insulin. 

glucose 

Six-carbon sugar that plays a major role in the metabolism of 
living cells. Stored in polymeric form as glycogen in animal 
cells and as starch in plant cells. (Panel 2-4, pp. 112-113) 

glycerol 

Small organic molecule that is the parent compound of 
many small molecules in the cell, including phospholipids. 
(Panel 2-5, pp. 114-115) 

glycogen 

Polysaccharide composed exclusively of glucose units. Used 
to store energy in animal cells. Large granules of glycogen are 
especially abundant in liver and muscle cells. (Figure 2-75 
and Panel 2-4, pp. 112-113) 

glycolipid 

Lipid molecule with a sugar residue or oligosaccharide 
attached. (Panel 2-5, pp. 114-115) 

glycolysis 

Ubiquitous metabolic pathway in the cytosol in which sugars 
are incompletely degraded with production of ATP. Literally 
"sugar splitting." (Figure 2-70 and Panel 2-8 , pp. 120-121) 

glycoprotein 

Any protein with one or more oligosaccharide chains cova- 
lently linked to amino -acid side chains. Most secreted pro- 
teins and most proteins exposed on the outer surface of the 
plasma membrane are glycoproteins. 

glycosaminoglycan (GAG) 

Long, linear, highly charged polysaccharide composed of a 
repeating pair of sugars, one of which is always an amino 
sugar. Mainly found covalently linked to a protein core in 
extracellular matrix proteoglycans. Examples include chon- 
droitin sulfate, hyaluronan, and heparin. (Figure 19-55) 

glycosidic bond 

Chemical bond formed by a condensation reaction between 
a hydroxyl group on the sugar carbon atom that carries the 
aldehyde or the ketone and another molecule, often another 
sugar. Glycosidic bonds are found in numerous structures 
including polysaccharides, nucleic acids, and glycoproteins. 
See O-linked and AT-linked oligosaccharides. (Panel 2-4, 
pp. 112-113) 

glycosylase — see DNA glycosylase 

glycosylation 

Addition of one or more sugars to a protein or lipid molecule. 

glycosylphosphatidylinositol anchor — see GPI anchor 

G2/M checkpoint 

Point in the eucaryotic cell cycle at which the cell checks for 
completion of DNA replication before triggering the early 
mitotic events that lead to chromosome alignment on the 
spindle. (Figure 17-14) 

Golgi apparatus (Golgi complex) 

Complex organelle in eucaryotic cells, centered on a stack of 
flattened membrane-bounded spaces, in which proteins and 
lipids transferred from the endoplasmic reticulum are mod- 
ified and sorted. It is the site of synthesis of many cell wall 
polysaccharides in plants and extracellular matrix gly- 
cosaminoglycans in animal cells. (Figure 13-25) 



Go phase 

State of withdrawal from the eucaroytic cell-division cycle by 
entry into a quiescent digression from the Gi phase. A com- 
mon, sometimes permanent, state for differentiated cells. 

Gi phase 

Gap 1 phase of the eucaryotic cell-division cycle, between the 
end of mitosis and the start of DNA synthesis. (Figure 17-4) 

G2 phase 

Gap 2 phase of the eucaryotic cell-division cycle, between 
the end of DNA synthesis and the beginning of mitosis. 
(Figure 17-4) 

GPI anchor (glycosylphosphatidylinositol anchor) 

Lipid linkage by which some membrane proteins are bound 
to the membrane. The protein is joined, via an oligosaccha- 
ride linker, to a phosphatidylinositol anchor during its travel 
through the endoplasmic reticulum. (Figure 10-19(6)) 

G protein (trimeric GTP-binding protein) 

A trimeric GTP-binding protein with intrinsic GTPase activ- 
ity that couples GPCRs to enzymes or ion channels in the 
plasma membrane. (Table 15-3, p. 919) 

G-protein-coupled receptor (GPCR) 

A seven-pass cell-surface receptor that, when activated by its 
extracellular ligand, activates a G protein, which in turn, acti- 
vates either an enzyme or ion channel in the plasma mem- 
brane. (Figures 15-16 and 15-30) 

grana (singular granum) 

Stacked membrane discs (thylakoids) in chloroplasts that 
contain chlorophyll and are the site of the light-trapping 
reactions of photosynthesis. (Figures 14-35 and 14-36) 

granulocyte 

Category of white blood cell distinguished by conspicuous 
cytoplasmic granules. Includes neutrophils, basophils, and 
eosinophils. Arises from a granulocyte /macrophage (GM) 
progenitor cell. (Figure 23-37) 

green fluorescent protein (GFP) 

Fluorescent protein isolated from a jellyfish. Widely used as a 
marker in cell biology. 

growth cone 

Migrating motile tip of a growing nerve cell axon or dendrite. 
(Figure 16-105) 

growth factor 

Extracellular signal protein that can stimulate a cell to grow. 
They often have other functions as well, including stimulat- 
ing cells to survive or proliferate. Examples include epider- 
mal growth factor (EGF) and platelet-derived growth factor 
(PDGF). 

d/S-Cdk 

Cyclin-Cdk complex formed in vertebrate cells by a Gi/S- 
cyclin and the corresponding cyclin-dependent kinase 
(Cdk). (Figure 17-16 and Table 17-1, p. 1063) 

Gi/S-cyclin 

Cyclin that activates Cdks in late Gi of the eucaryotic cell 
cycle and thereby helps trigger progression through Start, 
resulting in a commitment to cell- cycle entry. Its level falls at 
the start of S phase. (Figure 17-16) 

GTP (guanosine 5 -triphosphate) 

Nucleoside triphosphate produced by the phosphorylation 
of GDP (guanosine diphosphate). Like ATP, it releases a large 
amount of free energy on hydrolysis of its terminal phos- 
phate group. Has a special role in microtubule assembly, 
protein synthesis, and cell signaling. (Figure 2-83) 

GTPase 

An enzyme that converts GTP to GDR GTPases fall into two 
large families. Large trimeric G proteins are composed of 
three different subunits and mainly couple GPCRs to 
enzymes or ion channels in the plasma membrane. Small 
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monomeric GTP-binding proteins (also called monomeric 
GTPases) consist of a single subunit and help relay signals 
from many types of cell-surface receptors and have roles in 
intracellular signaling pathways, regulating intracellular 
vesicle trafficking, and signaling to the cytoskeleton. Both 
trimeric G proteins and monomeric GTPases cycle between 
an active GTP-bound form and an inactive GDP-bound form 
and frequently act as molecular switches in intracellular sig- 
naling pathways. (Figure 15-19) 

GTPase-activating protein (GAP) 

Protein that binds to a GTPase and inhibits it by stimulating 
its GTPase activity, causing the enzyme to hydrolyzes its 
bound GTP to GDP. (Figure 3-71) 



GTP-binding protein — see GTPase 

guanine nucleotide exchange factor (GEF) 

Protein that binds to a GTPase and activates it by stimulating 
it to release its tightly bound GDP, thereby allowing it to bind 
GTP in its place. (Figure 3-73) 

guanosine triphosphate — see GTP 

H + — see proton 

H~ — see hydride ion 

hair cell — see auditory hair cell 

haploid 

Having only a single copy of the genome (one set of chromo- 
somes), as in a sperm cell, unfertilized egg, or bacterium. 
Compare diploid. 

haplotype 

Haploid genotype. 

haplotype block 

Combination of alleles and DNA markers that has been 
inherited in a large, linked block on one chromosome of a 
homologous pair — undisturbed by genetic recombination — 
across many generations. 

haplotype map (hapmap) 

Human genome map based on haplotype blocks. Intended 
to help identify and catalog human genetic variation. 

H chain — see heavy chain 

heart muscle cell — see muscle cell 

heat shock protein (Hsp, stress-response protein) 

One of a large family of highly conserved molecular chaper- 
one proteins, so named because they are synthesized in 
increased amounts in response to an elevated temperature 
or other stressful treatment. Hsps have important roles in 
aiding correct protein folding or refolding. Prominent exam- 
ples are Hsp60 and Hsp70. 

heavy chain (H chain) 

The larger of the two types of polypeptide chain in an 
immunoglobulin molecule. 

Hedgehog protein 

Secreted extracellular signal molecule that has many differ- 
ent roles controlling cell differentiation and gene expression 
in animal embryos and adult tissues. Excessive Hedgehog 
signaling can lead to cancer. 

HeLa cells 

'Immortal' line of human epithelial cells that grows vigor- 
ously in culture. Derived in 1951 from a human cervical car- 
cinoma. 

helicase — see DNA helicase 

a helix — see alpha helix 

helix-loop-helix (HLH) 

DNA-binding structural motif present in many gene regula- 
tory proteins, consisting of a short alpha helix connected by 



a flexible loop to a second, longer alpha helix. Its structure 
enables two HLH -containing proteins to dimerize, forming a 
complex that binds to DNA. Distinct from the helix-turn- 
helix motif. (Figure 7-23) 

helix-turn-helix 

DNA-binding structural motif present in many gene regula- 
tory proteins, consisting of two alpha helices held at a fixed 
angle and connected by a short chain of amino adds, consti- 
tuting the ttm. Proteins containing this motif frequently 
form symmetric dimers and bind to DMA sequences that are 
the in selves similar and arranged symmetrically. Distinct 
from the helix-loop-helix motif. (Figures 7-10, 7-11, 7-12) 

helper T cell 

Type of T cell that helps stimulate B cells to make antibodies 
and macrophages to kill ingested microorganisms. Also 
helps activate dendritic cells and cytotoxic T cells. 



heme 

Cyclic organic molecule containing an iron atom that carries 
oxygen in hemoglobin and carries an electron in 
cytochromes. (Figure 14-22) 

hemidesmosome 

Specialized anchoring cell junction between an epithelial 
cell and the underlying basal lamina. 

hemopoiesis (hematopoiesis) 

Generation of blood cells, mainly in the bone marrow. 

(Figure 23-42) 

hemopoietic stem cell 

Self- renewing bone marrow cell that gives rise to all the var- 
ious types of blood cells, as well as some other cell types. 

hepatocyte 

The major cell type in the liver. 

heterocaryon 

Cell with two or more genetically different nuclei; produced 
by the fusion of two or more different cells. 

heterochromatin 

Region of a chromosome that remains in the form of unusu- 
ally condensed chromatin; generally transcriptionally inac- 
tive. Compare euchromatin. 

heterodimer 

Protein complex composed of two different polypeptide 
chains. 

heterophilic binding 

Binding between molecules of different kinds, especially 
those involved in cell-cell adhesion. (Figure 19-8) 

heterozygote 

Diploid cell or individual having two different alleles of one 
or more specified genes. 

high-energy bond 

Covalent bond whose hydrolysis releases an unusually large 
amount of free energy under the conditions existing in a cell. 
A group linked to a molecule by such a bond is readily trans- 
ferred from one molecule to another. Examples include the 
phosphodiester bonds in ATP and the thioester linkage in 
acetyl CoA. 

high-performance liquid chromatography (HPLC) 

Type of chromatography that uses columns packed with tiny 
beads of matrix; the solution to be separated is pushed 
through under high pressure. 

histidine-kinase-associated receptor 

Transmembrane receptor found in the plasma membrane of 
bacteria, yeast, and plant cells, and involved, for example, in 
sensing stimuli that cause bacterial chemotaxis. Associated 
with a histidine protein kinase on its cytoplasmic side. 

histone 

One of a group of small abundant proteins, rich in arginine 
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and lysine. Histones form the nucleosome cores around 
which DNA is wrapped in eucaryotic chromosomes. (Figure 
4-25) 

histone chaperone (chromatin assembly factor) 

Protein that binds free histones, releasing them once they 
have been incorporated into newly replicated chromatin. 
(Figure 4-30) 

histone code 

Combinations of chemical modifications to histones (e.g., 
actylation, methylation) that are thought to determine how 
and when the DNA packaged in nucleosomes can be 
accessed (e.g., for replication or transcription). (Figure 4-44) 

histone HI 

'Linker' (as opposed to 'core') histone protein that binds to 
DNA where it exits from a nucleosome and helps package 
nucleosomes into the 30 nm chromatin fiber. (Figure 4-34) 



HIV 

Human immunodeficiency virus, the retrovirus that is the 
cause of AIDS (acquired immune deficiency syndrome). 
(Figures 13-19 and 24-16) 

HLH — see helix-loop-helix 

hnRNP protein (heterogeneous nuclear ribonuclear protein) 

Any of a group of proteins that assemble on newly synthe- 
sized RNA, organizing it into a more compact form. (Figure 
6-33) 

Holliday junction (cross-strand exchange) 

X-shaped structure observed in DNA undergoing recombi- 
nation, in which the two DNA molecules are held together at 
the site of crossing-over, also called a cross-strand exchange. 

(Figure 5-61) 

homeobox 

Short (180 base pairs long) conserved DNA sequence that 
encodes a DNA-binding protein motif (homeodomain) 
famous for its presence in genes that are involved in orches- 
trating development in a wide range of organisms. 

homeodomain 

DNA-binding domain that defines a class of gene regulatory 
proteins important in animal development. (Figures 3-13 
and 7-13) 

homeotic mutation 

Mutation that causes cells in one region of the body to 
behave as though they were located in another, causing a 
bizarre disturbance of the body plan. (Figures 22-42 and 
22-127) 

homeotic selector gene 

In Drosophila development, a gene that defines and pre- 
serves the differences between body segments. 

homolog 

One of two or more genes that are similar in sequence as a 
result of derivation from the same ancestral gene. The term 
covers both orthologs and paralogs. (Figure 1-25) See homol- 
ogous chromosomes. 

homologous 

Genes, proteins, or body structures that are similar as a result 
of a shared evolutionary origin. 

homologous chromosomes (homologs) 

The maternal and paternal copies of a particular chromo- 
some in a diploid cell. 

homologous recombination (general recombination) 

Genetic exchange between a pair of identical or very similar 
DNA sequences, typically those located on two copies of the 
same chromosome. (Figures 5-51, 5-53, 5-59, and 5-64) 

homophilic binding 

Binding between molecules of the same kind, especially 
those involved in cell-cell adhesion. (Figure 19-8) 



homozygote 

Diploid cell or organism having two identical alleles of a 
specified gene or set of genes. 

hormone 

Signal molecule secreted by an endocrine cell into the blood- 
stream, which can then carry it to distant target cells. 

housekeeping gene 

Gene serving a function required in all the cell types of an 
organism, regardless of their specialized role. 

Hox gene complex 

Cluster of genes coding for gene regulatory factors, each 
gene containing a homeodomain, and specifying body- 
region differences. Hox mutations typically cause homeotic 
transformations. 

HPLC — see high-performance liquid chromatography 

hyaluronan (hyaluronic acid) 

Type of nonsulfated glycoso amino glyc an with a regular 
repeating sequence of up to 25,000 identical disaccharide 
units, not linked to a core protein. Found in the fluid lubricat- 
ing joints and in many other tissues. (Figures 19-56 and 19-57) 

hybridization 

In molecular biology, the process whereby two complemen- 
tary nucleic acid strands form a base-paired duplex DNA- 
DNA, DNA- RNA, or RNA- RNA molecule. Forms the basis of a 
powerful technique for detecting specific nucleotide 
sequences. (Figures 5-54 and 8-36) 

hybridoma 

Cell line used in the production of monoclonal antibodies. 
Obtained by fusing antibody-secreting B cells with cells of a 
B lymphocyte tumor. (Figure 8-8) 

hydride ion (H~) 

A proton (H + ) plus two electrons (2e~). Equivalent to a hydro- 
gen atom with one extra electron. 

hydrocarbon 

Compound that has only carbon and hydrogen atoms. (Panel 
2-1, pp. 106-107) 

hydrogen bond 

Noncovalent bond in which an electropositive hydrogen 
atom is partially shared by two electronegative atoms. (Panel 
2-3, pp. 110-111) 

hydrogen ion 

A proton (H + ) in an aqueous solution. The basis of acidity. 
Since the proton readily combines with a water molecule to 
form H 3 + , it is more accurate to call it a hydronium ion. 

hydrolase 

General term for enzyme that catalyses a hydrolytic cleavage 
reaction. Includes nucleases and proteases. 

hydrolysis (adjective hydrolytic) 

Cleavage of a covalent bond with accompanying addition of 
water. General formula: AB + H 2 -> AOH + BH. 

hydronium ion (H 3 + ) 

Water molecule associated with an additional proton. The 
form generally taken by protons in aqueous solution. 

hydrophilic 

Dissolving readily in water. Literally, "water loving." 

hydrophobic (lipophilic) 

Not dissolving readily in water. Literally, "water hating." 

hydrophobic force 

Force exerted by the hydrogen-bonded network of water 
molecules that brings two nonpolar surfaces together by 
excluding water between them. (Panel 2-3 , pp. 110-111) 

hydroxyl (-OH) 

Chemical group consisting of a hydrogen atom linked to an 
oxygen, as in an alcohol. (Panel 2-1, pp. 106-107) 
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hypertonic 

Having a sufficiently high concentration of solutes to cause 
water to move out of a cell by osmosis. 

hypervariable region 

Any of three small regions within the variable region of an 
immunoglobulin or T cell receptor chain that shows the 
highest variability from molecule to molecule. These regions 
determine the specificity of the antigen -binding site. (Figure 
25-31) 

hypotonic 

Having a sufficiently low concentration of solutes to cause 
water to move into a cell by osmosis. 

IAP (inhibitor of apoptosis) family 

Intracellular protein inhibitors of apoptosis. 

IF — see initiation factor 
Ig — see antibody 

Ig superfamily 

Large and diverse family of proteins that contain 
immunoglobulin domains or immunoglobulin-like 
domains. Most are involved in cell-cell interactions or anti- 
gen recognition. (Figure 25-74) 

imaginal disc 

Group of cells that are set aside, apparently undifferentiated, 
in the Drosophila embryo and which will develop into an 
adult structure, e.g., eye, leg, wing. Overt differentiation 
occurrs at metamorphosis. (Figure 22-51) 

immortalization 

Production of a cell line capable of an apparently unlimited 
number of cell divisions. Can be the result of mutations or 
viral transformation or of fusion of the original cells with 
cells of a tumor line. 

immune response 

Response made by the immune system when a foreign sub- 
stance or microorganism enters the body; usually refers to 
an adaptive immune response. See also innate, adaptive, pri- 
mary, and secondary immune responses. 

immune system 

System of lymphocytes and other cells in the body that pro- 
vides defense against infection. There are two types of 
immune systems in vertebrates — innate and adaptive. 

immunoblotting — see blotting 

immunoglobulin (Ig) 

An antibody molecule. Higher vertebrates have five classes of 
immunoglobulin— IgA, IgD, IgE, IgG, and IgM— each with a 
different role in adaptive immune responses. 

immunoglobulin domain (Ig domain) 

Characteristic protein domain of about 100 amino acids that 
is found in immunoglobulin light and heavy chains. Similar 
domains, known as immunoglobulin-like (Ig-like) domains, 
are present in many other proteins involved in cell-cell inter- 
actions and antigen recognition and define the Ig superfam- 
ily. (Figure 25-32) 

immunological memory 

Long-lived property of the adaptive immune system that fol- 
lows a primary immune response to many antigens, in which 
subsequent encounter with the same antigen will provoke a 
more rapid and stronger secondary immune response. 
(Figure 25-10) 

immunoprecipitation (IP) 

Use of a specific antibody to draw the corresponding protein 
antigen out of solution. The technique can identify com- 
plexes of interacting proteins in cell extracts by using an anti- 
body specific for one of the proteins to precipitate the com- 
plex. See also chromatin immunoprecipitation and co- 
immunoprecipitation. 

imprinting — see genomic imprinting 



inducible promoter 

A regulatory DNA sequence that allows expression of an 
associated gene to be switched on by a particular molecular 
or physical stimulus (e.g., heat shock). (Figure 22-49) 

induction (inductive interaction) 

In developmental biology, a change in the developmental 
fate of one tissue caused by an interaction with another tis- 
sue. Such an effect is called an inductive interaction. (Figures 
22-10 and 22-16) 

inflammatory response 

Local response of a tissue to injury or infection — character- 
ized by tissue redness, swelling, heat, and pain. Caused by 
invasion of white blood cells, which release various local 
mediators such as histamine. 

inhibitor of apoptosis family — see IAP family 

inhibitory G protein (Gi) 

Trimeric G protein that can regulate ion channels and inhibit 
the enzyme adenylyl cyclase in the plasma membrane. See 
also G protein. (Table 15-3, p. 919) 

inhibitory neurotransmitter 

Neurotransmitter that opens transmitter-gated CI" or K + 
channels in the post-synaptic membrane of a nerve or mus- 
cle cell and thus tends to inhibit the generation of an action 
potential. 

initiation factor (IF and elF) 

Protein that promotes the proper association of ribosomes 
with mRNA and is required for the initiation of protein syn- 
thesis. Abbreviated elF in eucaryotes, IF in procaryotes. elFs 
help load Met-tRNAi on to the ribosome, thus initiating 
translation. (Figure 6-72) 

initiator tRNA 

Special tRNA that intiates translation. It always carries the 
amino acid methionine, forming the complex Met-tRNAi. 
(Figure 6-72) 

innate immune response 

Immune response (of both animals and plants) to a pathogen 
that involves the pre-existing defenses of the body (the innate 
immune system), which include antimicrobial molecules and 
phagocytic cells. Such a response is not specific for the 
pathogen, in contrast to an adaptive immune response. 

inner cell mass 

Cluster of undifferentiated cells in the early mammalian 
embryo from which the whole of the adult body is derived. 
(Figure 22-88) 

inositol 

Ring-shaped sugar molecule forming part of inositol phos- 
pholipids. 

inositol phospholipid (phosphoinositide) 

A lipid containing a phosphorylated inositol derivative. 
Minor component of the plasma membrane, but important 
in demarking different membranes and for intracellular sig- 
nal transduction in eucaryotic cells. (Figure 15-37) 

inositol phospholipid signaling pathway 

Intracellular signaling pathway that starts with the activation 
of phospholipase C and the generation of IP3 and diacylglyc- 
erol (DAG) from inositol phospholipids in the plasma mem- 
brane. The DAG helps to activate protein kinase C. (Figures 
15-38 and 15-39) 

inositol 1,4,5-trisphosphate (IP3) 

Small intracellular signaling molecule produced during acti- 
vation of the inositol phospholipid signaling pathway. Acts to 
release Ca 2+ from the endoplasmic reticulum. (Figures 15-38 
and 15-39) 

in situ hybridization 

Technique in which a single-stranded RNA or DNA probe is 
used to locate a gene or mRNA molecule in a cell or tissue by 
hybridization. 
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insulator element 

DNA sequence that prevents a gene regulatory protein 
bound to DNA in the control region of one gene from influ- 
encing the transcription of adjacent genes. (Figure 7-62) 

insulin 

Polypeptide hormone that is secreted by P cells in the pan- 
creas to help regulate glucose metabolism in animals. 
(Figure 3-35) 

integral membrane protein 

Protein that is retained in a membrane by virtue of one or 
more domains that span or are embedded in the lipid 
bilayer. (Figure 10-19) 

integrin 

Transmembrane adhesion protein that is involved in the 
attachment of cells to the extracellular matrix and to each 
other. (Figure 19-4, and Table 19-2, p. 1135) 

intercalary regeneration 

Type of regeneration that fills in the missing tissues when the 
cells in two mismatched parts of a structure are grafted 
together. (Figure 22-56) 

interferon (IFN) 

Member of a class of cytokines secreted by virus -infected 
cells and certain types of activated T cells. Interferons induce 
antiviral responses, inhibiting viral replication and stimulat- 
ing macrophages and natural killer cells to kill virus-infected 
cells. (Figure 25-60) 

interleukin (IL) 

Secreted cytokine that mainly mediates local interactions 
between white blood cells (leucocytes) during inflammation 
and immune responses. (Table 25-4, p. 1598) 

intermediate filament 

Fibrous protein filament (-10 nm diameter) that forms rope- 
like networks in animal cells. One of the three most prominent 
types of cyto skeletal filaments. (Panel 16-1, pp. 968-969) 

internal ribosome entry site (IRES) 

Specific site in a eucaryotic mRNA, other than at the 5' end, 
at which translation can be initiated. (Figure 7-108) 

interphase 

Long period of the cell cycle between one mitosis and the 
next. Includes Gi phase, S phase, and G2 phase. (Figure 17-4) 

interpolar microtubule 

In the mitotic or meiotic spindle, a microtubule interdigitat- 
ing at the equator with the microtubules emanating from the 
other pole. (Figure 16-85) 

intracellular signaling protein 

Protein involved in a signaling pathway inside the cell. It 
usually activates the next protein in the pathway or gener- 
ates a small intracellular mediator. (Figure 15-1) 

intron 

Noncoding region of a eucaryotic gene that is transcribed 
into an RNA molecule but is then excised by RNA splicing 
during production of the mRNA or other functional RNA. 
(Figure 4-15) 

inversion 

Type of mutation in which a segment of chromosome is 
inverted. (Panel 8-1, pp. 554-555) 

in vitro 

Taking place in an isolated cell-free extract, as opposed to in 
a living cell; also sometimes used to distinguish studies in 
cell cultures from studies in intact organisms. (Latin for "in 
glass.") 

in vitro fertilization (IVF) 

An infertility treatment in which eggs are fertilized with 
sperm outside the mother's body; successfully fertilized eggs 
are cultured for a few days, and the early embryos are then 
transferred into the mother's uterus. 



in vivo 

In an intact cell or organism. Latin for "in life." 

ion 

An atom that has either gained or lost electrons to acquire a 
charge; for example Na + and CI". 

ion channel 

Transmembrane protein complex that forms a water-filled 
channel across the lipid bilayer through which specific inor- 
ganic ions can diffuse down their electrochemical gradients. 
(Figure 11-21) 

ion-channel-coupled receptor— see transmitter- gated ion 
channel 

ionic bond (ionic interaction) 

Cohesion due to electrostatic attraction between two atoms, 
one with a positive charge, the other with a negative charge. 
One type of noncovalent bond. (Figure 2-5, and Panel 2-3, 
pp. 110-111) 

ionophore 

Small hydrophobic molecule that dissolves in lipid bilayers 
and increases their permeability to specific inorganic ions. 

IP3 — see inositol 1 ,4,5-trisphosphate 

IP 3 receptor (IP 3 -gated Ca 2+ -release channel) 

Gated Ca 2+ channel in the ER membrane that opens on bind- 
ing cytosolic IP 3 , releasing stored Ca 2+ into the cytosol. 
(Figure 15-39) 

IRES — see internal ribosome entry site 

iron-sulfur center 

Electron-transporting group consisting of either two or four 
iron atoms bound to an equal number of sulfur atoms, found 
in a class of electron-transport proteins. (Figure 14-23) 

isoelectric point (pi) 

The pH at which a molecule in solution has no net electric 
charge and therefore does not move in an electric field. 
(Figure 8-22) 

isoform 

One of a set of variant forms of a protein, derived either by 
alternative splicing of a common transcript or as products of 
different members of a set of closely homologous genes. 

isomer 

Molecule formed from the same atoms and linkages as 
another but having a different three-dimensional conforma- 
tion. (Panel 2-4, pp. 112-113) 

isomerase 

Enzyme that catalyzes the rearrangement of bonds within a 
single molecule. See also topoisomerase. 

isoprenoid (polyisoprenoid) 

Lipid molecule with a carbon skeleton based on multiple 
five-carbon isoprene units. Examples include retinoic acid 
and dolichol. (Panel 2-5 , pp. 114 -115) 

isotope 

One of a number of forms of an atom that differ in atomic 
weight but have the same number of protons and electrons, 
and therefore the same chemistry. May be either stable or 
radioactive. 

JAK-STAT signaling pathway 

Signaling pathway activated by cytokines and some hor- 
mones, providing a rapid route from the plasma membrane 
to the nucleus to alter gene transcription. Involves cytoplas- 
mic Janus kinases (JAKs), and signal transducers and activa- 
tors of transcription (STATs). 



joule 



Standard unit of energy in the meter-kilogram-second sys- 
tem. One joule is the energy delivered in one second by a 1- 
watt power source. Approximately equal to 0.24 calories. 
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K—see equilibrium constant 

Ka — see affinity constant 

Kd — see dissociation constant 

The Michaelis-Menten constant. Equal to the concentration 
of substrate at which an enzyme works at half its maximum 
rate. Large values of K M usually indicate that the enzyme 
binds to its substrate with relatively low affinity. (Panel 3-3) 

karyotype 

Display of the full set of chromosomes of a cell, arranged 
with respect to size, shape, and number. 

keratin 

Protein that forms keratin intermediate filaments, mainly in 
epithelial cells. Specialized keratins are found in hair, nails, 
and feathers. 

ketone 

Organic molecule containing a carbonyl group linked to two 
alkyl groups. (Panel 2-1, p 107) 

killer cell 

Any eucaryotic cell capable of directly killing another 
eucaryotic cell. 

kinase 

Enzyme that catalyzes the addition of phosphate groups to 
molecules. See also protein kinase. 

kinesin 

Member of one of the two main classes of motor proteins 
that use the energy of ATP hydrolysis to move along micro- 
tubules. (Figure 16-58) 

kinetochore 

Complex structure formed from proteins on a mitotic chro- 
mosome to which microtubules attach. Plays an active part 
in the movement of chromosomes to the poles. Forms on the 
chromosome centromere. (Figure 17-36) 

kinetochore microtubule 

In the mitotic or meiotic spindle, a microtubule that con- 
nects the spindle pole to the kinetochore of a chromosome. 

K + leak channel 

K + -transporting ion channel in the plasma membrane of ani- 
mal cells that remains open even in a "resting" cell. (Panel 
11-3, p. 679) 

knockout 

An engineered deletion or inactivating mutation of a gene. 

Krebs cycle — see citric acid cycle 

lagging strand 

One of the two newly synthesized strands of DNA found at a 
replication fork. The lagging strand is made in discontinuous 
lengths that are later joined covalently. (Figure 5-7) 

lambda — see bacteriophage lambda 

lamellipodium (plural lamellipodia) 

Flattened, sheetlike protrusion supported by a meshwork of 
actin filaments, which is extended at the leading edge of a 
crawling animal cell. (Figures 16-86 and 16-87) 

lain in — see nuclear lamin 

laminin 

Extracellular matrix fibrous protein found in basal laminae, 
where it forms a sheetlike network. (Figures 19-42 and 19-43) 

lampbrush chromosome 

Huge paired chromosome in meiosis in immature amphib- 
ian eggs, in which the chromatin forms large stiff loops 
extending out from the linear axis of the chromosome. 
(Figure 4-54) 

L chain — see light chain 



leading strand 

One of the two newly synthesized strands of DNA found at a 
replication fork. The leading strand is made by continuous 
synthesis in the 5'-to-3' direction. (Figure 5-7) 

lecithin — see phosphatidylcholine 



lectin 



Protein that binds tightly to a specific sugar. Abundant 
lectins from plant seeds are used as affinity reagents to purify 
glycoproteins or to detect them on the surface of cells. 

leptin 

Peptide hormone secreted by fat cells. Helps regulate the 
desire to eat by suppressing appetite. 

lethal mutation 

Mutation that causes the death of the cell or the organism 
that contains it. (Panel 8-1, pp. 554-555) 

leucine-rich repeat protein (LRR protein) 

Common type of receptor serine/threonine kinase in plants 
that contains a tandem array of leucine-rich repeat 
sequences in its extracellular portion. 

leucine zipper 

Structural motif seen in many DNA-binding proteins in 
which two alpha helices from separate proteins are joined 
together in a coiled-coil (rather like a zipper), forming a pro- 
tein dimer. (Figure 7-19) 

leucocyte — see white blood cell 

leukemia 

Cancer of white blood cells. 

ligand 

Any molecule that binds to a specific site on a protein or 
other molecule. From Latin ligare, to bind. 

ligase 

Enzyme that joins together (ligates) two molecules in an 
energy- dependent process. DNA ligase, for example, joins 
two DNA molecules together end-to-end through phospho- 
diester bonds. 

light chain (L chain) 

One of the smaller polypeptides of a multisubunit protein 
such as myosin (Figure 16-72) or immunoglobulin. (Figure 

25-21) 

lignin 

Network of cross-linked phenolic compounds that forms a 
supporting network throughout the cell walls of xylem and 
woody tissue in plants. 

limit of resolution 

In microscopy, the smallest distance apart at which two 
point objects can be resolved as separate. Just under 0.2 mm 
for conventional light microscopy, a limit determined by the 
wavelength of light. 

lipase 

Enzyme that catalyzes the cleavage of fatty acids from the 
glycerol moiety of a triglyceride. 

lipid 

Organic molecule that is insoluble in water but tends to dis- 
solve in nonpolar organic solvents. A special class, the phos- 
pholipids, forms the structural basis of biological mem- 
branes. (Panel 2-5, pp. 114-115) 

lipid bilayer (phospholipid bilayer) 

Thin double sheet of phospholipid molecules that forms the 
core structure of all cell membranes. The two layers of lipid 
molecules are packed with their hydrophobic tails pointing 
inward and their hydrophilic heads outward, exposed to 
water. (Figure 2-22 and Panel 2-5, pp. 114-1 15) 

lipid raft 

Small region of the plasma membrane enriched in sphin- 
golipids and cholesterol. (Figure 10-14) 
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lipophilic — see hydrophobic 

liposome 

Artificial phospholipid bilayer vesicle formed from an aque- 
ous suspension of phospholipid molecules. (Figure 10-9) 

local mediator 

Secreted signal molecule that acts locally on nearby cells. 
(Figure 15-4) 

locus 

In genetics, the position on a chromosome. For example, in a 
diploid cell different alleles of the same gene occupy the 
same locus. 

long-term potentiation 

Long-lasting increase (days to weeks) in the sensitivity of 
certain synapses in the brain, induced by a short burst of 
repetitive firing in the presynaptic neurons. (Figure 11-42) 

loss-of-function mutation 

Mutation which reduces or abolishes the activity of a gene. 
Usually recessive. (Panel 8-1, pp. 554-555) 

low-density lipoprotein (LDL) 

Large complex composed of a single protein molecule and 
many esterified cholesterol molecules, together with other 
lipids. The form in which cholesterol is transported in the 
blood and taken up into cells. (Figure 13-50) 

LTP — see long-term potentiation 

lymph 

Colorless fluid containing lymphocytes found in lymphatic 
vessels. (Figure 25-3) 

lymphocyte 

White blood cell responsible for the specificity of adaptive 
immune responses. Two main types: B cells, which produce 
antibody, and T cells, which interact directly with other effec- 
tor cells of the immune system and with infected cells. T cells 
develop in the thymus and are responsible for cell-mediated 
immunity. B cells develop in the bone marrow in mammals 
and are responsible for the production of circulating anti- 
bodies. (Figure 25-7) 

lymphoid organ 

Organ involved in the production or function of lympho- 
cytes. Lymphocytes are produced in primary lymphoid 
organs and respond to antigen in peripheral lymphoid 
organs. (Figure 25-3) 

lymphoma 

Cancer of lymphocytes, in which the cancer cells are mainly 
found in lymphoid organs (rather than in the blood, as in 
leukemias). 

lysis 

Rupture of a cell's plasma membrane, leading to the release 
of cytoplasm and the death of the cell. 

lysosome 

Membrane -bounded organelle in eucaryotic cells containing 
digestive enzymes, which are typically most active at the acid 
pH found in the lumen of lysosomes. (Figures 12-2 and 
13-37) 

lysozyme 

Enzyme that catalyzes the cutting of polysaccharide chains 
in the cell walls of bacteria. 

M — see M phase 

macrophage 

Phagocytic cell derived from blood monocytes, resident in 
most tissues but able to roam. It has both scavenger and anti- 
gen-presenting functions in immune responses. (Figure 24-53) 

major histocompatibility complex — see MHC complex 

malignant 

Of tumors and tumor cells: invasive and/or able to undergo 
metastasis. A malignant tumor is a cancer. (Figure 20-3) 



mannose 6-phosphate (M6P) 

Unique marker attached to the oligosaccharides on some 
glycoproteins destined for lysosomes. (Figure 13-43) 

MAP — see microtubule-associated protein 

MAP kinase (mitogen-activated protein kinase) 

Protein kinase at the end of a three-component signaling 
module involved in relaying signals from the plasma mem- 
brane to the nucleus. 

MAP-kinase module (mitogen-activated protein kinase 
module) 

An intracellular signaling module composed of three protein 
kinases, acting in sequence, with MAP kinase as the third. 
Typically activated by a Ras protein in response to extracel- 
lular signals. (Figure 15-78) 

mass spectrometry (MS) 

Technique for identifying compounds on the basis of their 
precise mass -to -charge ratio. Powerful tool for identifying 
proteins and sequencing polypeptides. (Figure 8-21) 

maternal-effect gene 

Gene that acts in the mother to specify maternal mRNAs and 
proteins in the egg. Maternal-effect mutations affect the 
development of the embryo even if the embryo itself has not 
inherited the mutant gene. 

mating- type locus (Mat locus) 

In budding yeast, the locus that determines the mating type 
(a or a) of the haploid yeast cell. (Figure 7-65) 

matrix 

Space or supporting medium within which something is 
formed. (1) Large internal compartment of the mitochon- 
drion. (2) The corresponding compartment in a chloro- 
plast, more commonly known as the stroma. (3) 
Extracellular matrix. The extracellular composite of 
secreted proteins and polysaccharides in which cells are 
embedded. (Figure 14-37) 

matrix metal loprotease 

Ca 2+ - or Zn 2+ -dependent proteolytic enzyme present in the 
extracellular matrix that degrades matrix proteins. Includes 
the collagenases. 

M-Cdk (M-phase Cdk) 

Cyclin-Cdk complex formed in vertebrate cells by an M- 
cyclin and the corresponding cyclin-dependent kinase 
(Cdk). (Figure 17-16 and Table 17-1, p. 1063) 

Mem proteins 

Proteins in the eucaryotic cell needed for the initiation of 
DNA replication; thought to form the helicase present at 
moving DNA replication forks. 

M-cyclin 

A cyclin found in all eucaryotic cells that promotes the 

events of mitosis. (Figure 17-16) 

MDR protein — see multidrug resistance protein 

megakaryocyte 

Large myeloid cell with a multilobed nucleus that remains in 
the bone marrow when mature. Buds off platelets from long 
cytoplasmic processes. (Figures 23-40 and 23-42) 

meiosis 

Special type of cell division that occurs in sexual reproduc- 
tion. It involves two successive nuclear divisions with only 
one round of DNA replication, thereby producing haploid 
cells from a diploid cell. (Figure 21-5) 

melanocyte 

Cell that produces the dark pigment melanin. Responsible 
for the pigmentation of skin and hair. (Figure 23-1) 

membrane 

The lipid bilayer plus associated proteins that encloses all cells 
and, in eucaryotic cells, many organelles as well. (Figure 10-1) 
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membrane-bound ribosome 

Ribosome attached to the cytosolic face of the endoplasmic 
reticulum. The site of synthesis of proteins that enter the 
endoplasmic reticulum. (Figure 12-41) 

membrane channel — see channel 

membrane potential 

Voltage difference across a membrane due to a slight excess 
of positive ions on one side and of negative ions on the other. 
A typical membrane potential for an animal cell plasma 
membrane is -60 mV (inside negative relative to the sur- 
rounding fluid). (Figure 11-22) 

membrane transport 

Movement of molecules across a membrane, mediated by a 
membrane transport protein. (Figures 11-3 and 11-4) 

membrane transport protein 

Membrane protein that mediates the passage of ions or mol- 
ecules across a membrane. The two main classes are trans- 
porters (also called carriers or permeases) and channels. 

(Figures 11-3 and 11-4) 

meristem 

Organized group of dividing cells whose derivatives give rise 
to the tissues and organs of a flowering plant. Examples are 
the apical meristems at the tips of shoots and roots. (Panel 
22-2, pp. 1404-1405, and Figure 22-118) 

mesenchyme 

Immature, unspecialized form of connective tissue in ani- 
mals, consisting of cells embedded in a thin extracellular 

matrix. 

mesoderm 

Embryonic tissue that is the precursor to muscle, connective 
tissue, skeleton, and many of the internal organs. (Figures 
22-3 and 22-70) 

messenger RNA (mRNA) 

RNA molecule that specifies the amino acid sequence of a 
protein. Produced in eucaryotes by processing of an RNA 
molecule made by RNA polymerase as a complementary 
copy of DNA. It is translated into protein in a process cat- 
alyzed by ribosomes. (Figures 6-21 and 6-22) 

metabolism 

The sum total of the chemical processes that take place in 
living cells. All of catabolism plus anabolism. (Figure 2-36) 

metaphase 

Stage of mitosis at which chromosomes are firmly attached 
to the mitotic spindle at its equator but have not yet segre- 
gated toward opposite poles. (Panel 17-1, pp. 1072-1073) 

metaphase plate 

Imaginary plane at right angles to the mitotic spindle and 
midway between the spindle poles; the plane in which chro- 
mosomes are positioned at metaphase. (Panel 17-1, pp. 
1072-1073) 

metaphase-to-anaphase transition 

Checkpoint in the eucaryotic cell cycle preceding sister- 
chromatid separation at anaphase. If the cell is not ready to 
proceed to anaphase, the cell cycle is halted at this point. 
(Figure 17-14, and Panel 17-1, pp. 1072-1073) 

metastasis 

Spread of cancer cells from their site of origin to other sites 
in the body. (Figures 20-1 and 20-17) 

methyl group (-CH3) 

Hydrophobic chemical group derived from methane (CH4). 

MHC complex (major histocompatibility complex) 

Complex of genes in vertebrates coding for a large family of 
cell-surface glycoproteins (MHC proteins). (Figure 25-51) 

MHC protein 

One of a large family of vertebrate cell-surface glycoproteins, 



which are members of the Ig superfamily. MHC proteins 
bind peptide fragments of foreign antigens and present them 
to T cells to induce an adaptive immune response. See also 
class I MHC protein, class II MHC protein. (Figures 25-49 
and 25-50) 

Michaelis-Menten constant — see K M 

microarray — see DNA microarray 

microfilament — see actin filament 

micron {\im or micrometer) 

Unit of measurement equal to 10~ 6 meter or 10~ 3 millimeter. 

micro RNA (miRNA) 

Short (21-26 nucleotide) eucaryotic RNAs, produced by the 
processing of specialized RNA transcripts coded in the 
genome, that regulate gene expression through complemen- 
tary base-pairing with mRNA. Depending on the extent of 
base pairing, mRNAs can lead either to destruction of the 
mRNA or to a block in its translation. (Figures 7-1 12) 

microsome 

Small vesicle derived from endoplasmic reticulum that is 
produced by fragmentation when cells are homogenized. 
(Figure 12-37) 

microtubule 

Long hollow cylindrical structure composed of the protein 
tubulin. It is one of the three major classes of filaments of the 
cytoskeleton. (Panel 16-1, p. 968) 

microtubule-associated protein (MAP) 

Any protein that binds to microtubules and modifies their 
properties. Many different kinds have been found, including 
structural proteins, such as MAP2, and motor proteins, such 
as dynein. [Not to be confused with the 'MAP' (mitogen-acti- 
vated protein kinase) of 'MAP kinase.'] 

microtubule flux 

Movement of individual tubulin molecules in the microtubules 
of the spindle towards the poles by loss of tubulin at their 
minus ends. Helps to generate the poleward movement of sis- 
ter chromatids after they separate in anaphase. (Figure 17-41) 

microtubule-organizing center (MTOC) 

Region in a cell, such as a centrosome or a basal body, from 
which microtubules grow. 

microvillus (plural microvilli) 

Thin cylindrical membrane-covered projection on the sur- 
face of an animal cell containing a core bundle of actin fila- 
ments. Present in especially large numbers on the absorptive 
surface of intestinal epithelial cells. (Figure 16-50) 

midbody 

Structure formed at the end of cleavage that can persist for 
some time as a tether between the two daughter cells in ani- 
mals. (Figure 17-51) 

minus end 

End of a microtubule or actin filament at which the addition 
of monomers occurs least readily; the "slow- growing" end of 
the microtubule or actin filament. The minus end of an actin 
filament is also known as the pointed end. (Panel 16-2, pp. 
978-979) 

miRNA — see micro RNA 

mismatch repair 

DNA repair process that corrects mismatched nucleotides 
inserted during DNA replication. A short stretch of newly 
synthesized DNA including the mismatched nucleotide is 
removed and replaced with the correct sequence with refer- 
ence to the template strand. (Figure 5-20) 

mitochondrion (plural mitochondria) 

Membrane-bounded organelle, about the size of a bac- 
terium, that carries out oxidative phosphorylation and pro- 
duces most of the ATP in eucaryotic cells. (Figure 1-33) 
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mitogen 

Extracellular signal molecule that stimulates cells to prolif- 
erate. 

mitogen-activated protein kinase — see MAP kinase 

mitosis 

Division of the nucleus of a eucaryotic cell, involving con- 
densation of the DNA into visible chromosomes, and separa- 
tion of the duplicated chromosomes to form two identical 
sets. From Greek mitos, a thread, referring to the threadlike 
appearance of the condensed chromosomes. (Panel 17-1, 
pp. 1072-1073) 

mitotic chromosome 

Highly condensed duplicated chromosome with the two new 
chromosomes still held together at the centromere as sister 
chromatids. 

mitotic spindle 

Array of microtubules and associated molecules that forms 
between the opposite poles of a eucaryotic cell during mito- 
sis and serves to move the duplicated chromosomes apart. 
(Figure 17-28 and Panel 17-1, pp. 1072-1073) 

mobile genetic element 

Genetic element (DNA segment) that can move or copy itself 
to another position in the genome. 

model organism 

A species that has been studied intensively over a long period 
and thus serves as a "model" for deriving fundamental bio- 
logical principles. 

molarity (M) 

Number of moles of a solute per liter of solution. A 'one 
molar' (1M) solution contains 1 mole of solute dissolved in 1 
liter of solution. 

mole 

X grams of a substance, where X is its relative molecular mass 
(molecular weight). A mole consists of 6.02 x 10 23 molecules 
of the substance. 

molecular chaperone — see chaperone 

molecular weight (relative molecular mass) 

Mass of a molecule relative to the mass of a hydrogen atom 
(strictiy, relative to 1/12 of the mass of an atom of 12 C, i.e., 
one dalton). 

monoclonal antibody 

Antibody secreted by a hybridoma cell line. Because the 
hybridoma is generated by the fusion of a single B cell with a 
single tumor cell, each hybridoma produces antibodies that 
are all identical. (Figure 8-8) 

monocyte 

Type of white blood cell that leaves the bloodstream and 
matures into a macrophage in tissues. (Figure 23-37) 

monomer 

Small molecular building block that can serve as a subunit, 
being linked to others of the same type to form a larger mol- 
ecule (a polymer). 

monomeric GTPase — see GTPase 

monosaccharide 

Simple sugar with the general formula (CH2O) n, where n = 3 
to 8. (Panel 2-4, pp. 112-113) 

morphogen 

Signal molecule that can impose a pattern on a field of cells 
by causing cells in different places to adopt different fates. 
(Figure 22-13) 

mosaic (genetic mosaic) 

In developmental biology, an individual organism made of a 
mixture of cells with different genotypes, but developed from 
a single zygote. Mosaics can arise naturally, as a result of a 
mutation in cells that give rise to new tissues, or can be made 
deliberately to aid genetic analysis. Compare chimera. 



motif 

Element of structure or pattern that recurs in many contexts. 
Specifically, a small structural domain that can be recog- 
nized in a variety of proteins. 

motor protein 

Protein that uses energy derived from nucleoside triphos- 
phate hydrolysis to propel itself along a linear track (protein 
filament or other polymeric molecule). 

M6P — see mannose 6-phosphate 

M phase 

Period of the eucaryotic cell cycle during which chromo- 
somes are condensed and the nucleus and cytoplasm divide. 
(Figure 17-4, and Panel 17-1, pp. 1072-1073) 

M-phase Cdk — see M-Cdk 

mRNA— see messenger RNA 

MTOC — see microtubule-organizing center 

mTOR (mammalian target of rapamycin) 

Mammalian serine /theronine protein kinase that is a down- 
stream component of the PI 3-kinase/Akt signaling pathway. 
mTOR (the yeast homolog is called TOR) is part of two dis- 
tinct protein complexes, one of which is sensitive to the drug 
rapamycin and stimulates cell growth, while the other helps 
activate the protein kinase Akt. (Figures 15-64 and 15-65) 

multidrug resistance protein (MDR protein) 

Type of ABC transporter protein that can pump hydrophobic 
drugs (such as some anti-cancer drugs) out of the cytoplasm 
of eucaryotic cells. 

multimeric 

Of proteins: containing more than one subunit. 

multipass transmembrane protein 

Membrane protein in which the polypeptide chain crosses 
the lipid bilayer more than once. (Figure 10-19) 

muscle cell 

Cell type specialized for contraction. The three main classes 
are skeletal, heart, and smooth muscle cells. (Figure 23-47) 

mutation 

Heritable change in the nucleotide sequence of a chromo- 
some. (Panel 8-1, pp. 554-555) 

Myc 

Gene regulatory protein that is activated when a cell is stim- 
ulated to grow and divide by extracellular signals. It activates 
the transcription of many genes, including those that stimu- 
late cell growth. (Figure 17-62) 

myelin sheath 

Insulating layer of specialized cell membrane wrapped 
around vertebrate axons. Produced by oligodendrocytes in 
the central nervous system and by Schwann cells in the 
peripheral nervous system. (Figure 1 1-32) 

myeloid cell 

Any white blood cell other than a lymphocyte. (Figure 23-42) 

myoblast 

Mononucleated, undifferentiated muscle precursor cell. A 
skeletal muscle cell is formed by the fusion of multiple 
myoblasts. (Figure 23-48) 

myoepithelial cell 

Type of unstriated muscle cell found in epithelia, e.g. in the 
iris of the eye and in glandular tissue. (Figure 23-47) 

myofibril 

Long, highly organized bundle of actin, myosin, and other 
proteins in the cytoplasm of muscle cells that contracts by a 
sliding filament mechanism. 

myosin 

Any of a large class of motor proteins that move along actin 
filaments. (Figure 16-57) 
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NADVNADH (nicotinamide adenine dinucleotide/reduced 
nicotinamide adenine dinucleotide) 

Electron carrier system that participates in oxidation-reduc- 
tion reactions, such as the oxidation of food molecules. 
NAD + accepts the equivalent of a hydride ion (H~, a proton 
plus two electrons) to become the activated carrier NADH. 
The NADH formed donates its high-energy electrons to the 
ATP-generating process of oxidative phosphorylation. 
(Figure 2-86) 

NADH dehydrogenase complex 

First of the three electron-driven proton pumps in the mito- 
chondrial respiratory chain. It accepts electrons from NADH. 
(Figure 14-26) 

NADP+/NADPH (nicotinamide adenine dinucleotide 
phosphate/reduced nicotinamide adenine dinucleotide 
phosphate) 

Electron carrier system closely related to NAD+/NADH, but 
used almost exclusively in reductive biosynthetic, rather 
than catabolic, pathways. (Figure 2-60) 

Na + -K + pump (Na + -K + ATPase) 

Transmembrane carrier protein found in the plasma mem- 
brane of most animal cells that pumps Na + out of and K + 
into the cell, using energy derived from ATP hydrolysis. 

(Figure 11-14) 

nanometer (nm) 

Unit of length commonly used to measure molecules and 
cell organelles. 1 nm = 10~ 3 micrometer (|im) = 10~ 9 meter. 

natural killer cell (NK cell) 

Cytotoxic cell of the innate immune system that can kill 
virus -infected cells and some cancer cells. (Figure 24-57) 

negative feedback 

Control mechanism whereby the output of a reaction or 
pathway inhibits an earlier step in the same pathway 

Nernst equation 

Quantitative expression that relates the equilibrium ratio of 
concentrations of an ion on either side of a permeable mem- 
brane to the voltage difference across the membrane. (Panel 
11-2, p. 670) 

nerve cell — see neuron 

nerve impulse — see action potential 

neural cell adhesion molecule (NCAM) 

Cell adhesion molecule of the immunoglobulin superfamily, 
expressed by many cell types including most nerve cells. A 
mediator of Ca 2+ -independent cell-cell attachment in verte- 
brates. (Figure 19-20) 

neural crest 

Collection of cells located along the line where the neural 
tube pinches off from the surrounding epidermis in the ver- 
tebrate embryo. Neural crest cells migrate to give rise to a 
variety of tissues, including neurons and glia of the periph- 
eral nervous system, pigment cells of the skin, and the bones 
of the face and jaws. (Figures 19-11, 22-84, and 22-97) 

neural tube 

Tube of ectoderm that will form the brain and spinal cord in 
a vertebrate embryo. (Figure 22-78) 

neurite 

Long process growing from a nerve cell in culture. A generic 
term that does not specify whether the process is an axon or 
a dendrite. (Figure 16-105) 

neuroblast 

Embryonic nerve cell precursor. (Figure 22-66) 

neurofilament 

Type of intermediate filament found in nerve cells. (Figure 16-22) 

neuromuscular junction 

Specialized chemical synapse between an axon terminal of a 



motor neuron and a skeletal muscle cell. (Figures 11-36 and 
11-39) 

neuron (nerve cell) 

Impulse-conducting cell of the nervous system, with exten- 
sive processes specialized to receive, conduct, and transmit 
signals. (Figures 22-93 and 22-94) 

neuropeptide 

Peptide secreted by neurons as a signal molecule either at 
synapses or elsewhere. 

neurotransmitter 

Small signal molecule secreted by the presynaptic nerve cell 
at a chemical synapse to relay the signal to the postsynaptic 
cell. Examples include acetylcholine, glutamate, GABA, 
glycine, and many neuropeptides. 

neutron 

Uncharged heavy subatomic particle that forms part of an 
atomic nucleus. (Figure 2-1) 

neutrophil 

White blood cell that is specialized for the uptake of particu- 
late material by phagocytosis. Enters tissues that become 
infected or inflamed. (Figures 24-55 and 25-24) 

NFkB protein 

Latent gene regulatory protein that is activated by various 
intracellular signaling pathways when cells are stimulated 
during immune, inflammatory, or stress responses. Also has 
important roles in animal development. (Figure 15-79) 

nicotinamide adenine dinucleotide — see NAD+/NADH 

nicotinamide adenine dinucleotide phosphate — see 
NADP + /NADPH 

nitric oxide (NO) 

Gaseous signal molecule that is widely used in cell-cell com- 
munication in both animals and plants. (Figure 15-12) 

nitrogen fixation 

Biochemical process carried out by certain bacteria that 
reduces atmospheric nitrogen (N2) to ammonia, leading 
eventually to various nitrogen-containing metabolites. 

NK cell — see natural killer cell 

iV-linked oligosaccharide 

Chain of sugars attached to a protein through the NH2 group 
of the side chain of an asparagine residue. Compare O- linked 
oligosaccharide. (Figures 12-50 and 12-51) 

nm — see nanometer 

NMR (nuclear magnetic resonance, NMR spectroscopy) 

NMR is the resonant absorption of electromagnetic radia- 
tion at a specific frequency by atomic nuclei in a magnetic 
field, due to flipping of the orientation of their magnetic 
dipole moments. The NMR spectrum provides information 
about the chemical environment of the nuclei. NMR is used 
widely to determine the three-dimensional structure of 
small proteins and other small molecules. The principles of 
NMR are also used for medical diagnostic purposes in mag- 
netic resonance imaging (MRI). (Figure 8-29) 

NO — see nitric oxide 

noncovalent bond (noncovalent attraction, noncovalent 
interaction) 

Chemical bond in which, in contrast to a covalent bond, no 
electrons are shared. Noncovalent bonds are relatively weak, 
but they can sum together to produce strong, highly specific 
interactions between molecules. (Panel 2-3 , pp. 110 -111) 

noncyclic photophosphorylation 

Photosynthetic process that produces both ATP and NADPH 
in plants and cyanobacteria. (Figure 14-49) 

nondisjunction 

Event occurring occasionally during meiosis in which a pair 
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of homologous chromosomes fails to separate so that the 
resulting germ cell has either too many or too few chromo- 
somes. 

nonenveloped virus 

Virus consisting of a nucleic acid core and a protein capsid 
only. (Figure 24-24C and D) 

nonpolar (apolar) 

Lacking any asymmetric accumulation of positive and nega- 
tive charge. Nonpolar molecules are generally insoluble in 
water. (Panels 2-2 and 2-3, pp. 108 -111) 

nonretroviral retrotransposon — see retrotransposon 

nonsense-mediated mRNA decay 

Mechanism for degrading aberrant mRNAs containing in- 
frame internal stop codons before they can be translated 
into protein. (Figure 6-80) 

Northern blotting 

Technique in which RNA fragments separated by elec- 
trophoresis are immobilized on a paper sheet, and a specific 
RNA is then detected by hybridization with a labeled nucleic 
acid probe. 

NO synthase (NOS) 

Enzyme that synthesizes nitric oxide (NO) by the deamina- 
tion of arginine. (Figure 15-12B) 

Notch 

Transmembrane receptor protein (and latent gene regula- 
tory protein) involved in many cell-fate choices in animal 
development, for example in the specification of nerve cells 
from ectodermal epithelium. Its ligands are cell-surface pro- 
teins such as Delta and Serrate. (Figure 15-76) 

notochord 

Stiff rod of cells defining the central axis of all chordate 
embryos. In vertebrates becomes incorporated into the ver- 
tebral column. (Figure 22-97) 

NPC — see nuclear pore complex 

NSF (N-ethylmaleimide sensitive factor) 

Protein with ATPase activity that disassembles a complex of 
a v-SNARE and a t-SNARE. (Figures 13-18 and 13-22) 

N terminus (amino terminus) 

The end of a polypeptide chain that carries a free a-amino 
group. (Figure 3-1) 

nuclear envelope (nuclear membrane) 

Double membrane (two bilayers) surrounding the nucleus. 
Consists of an outer and inner membrane and is perforated 
by nuclear pores. The outer membrane is continuous with 
the endoplasmic reticulum. (Figures 4-9 and 12-8) 

nuclear export signal 

Sorting signal contained in the structure of molecules and 
complexes, such as RNAs and new ribosomal subunits, that 
are transported from the nucleus to the cytosol through 
nuclear pore complexes. (Figure 12-15) 

nuclear lamin 

Protein subunit of the intermediate filaments that form the 
nuclear lamina. 

nuclear lamina 

Fibrous meshwork of proteins on the inner surface of the 
inner nuclear membrane. It is made up of a network of inter- 
mediate filaments formed from nuclear lamins. 

nuclear localization signal (NLS) 

Signal sequence or signal patch found in proteins destined 
for the nucleus that enables their selective transport into the 
nucleus from the cytosol through the nuclear pore com- 
plexes. (Figures 12-11 and 12-15) 

nuclear magnetic resonance — see NMR 

nuclear pore complex (NPC) 

Large multiprotein structure forming an aqueous channel 



(the nuclear pore) through the nuclear envelope that allows 
selected molecules to move between nucleus and cytoplasm. 
(Figure 12-9) 

nuclear receptor superfamily 

Intracellular receptors for hydrophobic signal molecules 
such as steroid and thyroid hormones and retinoic acid. The 
receptor-ligand complex acts as a transcription factor in the 
nucleus. (Figure 15-14) 

nuclear transplantation 

Transfer of a nucleus from one cell to another by microinjec- 
tion. (Figure 8-6) 

nuclear transport receptor (karyopherin) 

Protein that escorts macromolecules either into or out of the 
nucleus: nuclear import receptor or nuclear export receptor. 

(Figure 12-15) 

nuclease 

Enzyme that splits nucleic acids by hydrolyzing bonds 
between nucleotides. See also endonuclease and exonucle- 
ase. 

nucleation 

Critical stage in the assembly of a polymeric structure, such 
as a microtubule, in which a small cluster of monomers 
aggregates in the correct arrangement to initiate rapid poly- 
merization. (Panel 16-2, pp. 978-979) More generally, the 
rate-limiting step in an assembly process. 

nucleic acid 

RNA or DNA, a macromolecule consisting of a chain of 
nucleotides joined together by phosphodiester bonds. 

nucleic acid hybridization — see hybridization 

nucleocapsid 

Viral nucleic acid plus its surrounding protein capsid. If a 
viral envelope is present, the envelope surrounds the nucle- 
ocapsid. (Figure 13-19) 

nucleolar organizer 

Region of a chromosome containing a cluster of rRNA genes 
that gives rise to part of a nucleolus. (Figure 6-47) 

nucleolus 

Structure in the nucleus where rRNA is transcribed and ribo- 
somal subunits are assembled. (Figure 4-9) 

nucleoporin 

Any of a number of different proteins that make up nuclear 
pore complexes. 

nucleoside 

Purine or pyrimidine base covalently linked to a ribose or 
deoxyribose sugar. (Panel 2-6, pp. 116-117) 

nucleosome 

Beadlike structure in eucaryotic chromatin, composed of a 
short length of DNA wrapped around an octameric core of 
histone proteins. The fundamental structural unit of chro- 
matin. (Figures 4-23 and 4-24) 

nucleotide 

Nucleoside with one or more phosphate groups joined in 
ester linkages to the sugar moiety. DNA and RNA are poly- 
mers of nucleotides. (Panel 2-6, pp. 116-117) 

nucleotide excision repair 

Type of DNA repair that corrects damage of the DNA double 
helix, such as those caused by chemical or UV damage, by 
cutting out the damaged region on one strand and resynthe- 
sizing it using the undamaged strand as template. Compare 
base excision repair. (Figure 5-48) 

nucleus 

Prominent membrane-bounded organelle in eucaryotic 
cells, containing DNA organized into chromosomes. 

null mutation 

Loss-of-function mutation that completely abolishes the 
activity of a gene. (Panel 8-1, pp. 554-555) 
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nurse cell 

Cell in the invertebrate ovary that is connected by cytoplas- 
mic bridges to a developing oocyte and thereby supplies the 
oocyte with ribosomes, mRNAs, and proteins needed for the 
development of the early embryo. (Figure 21-24) 

occluding junction 

Type of cell junction that seals cells together in an epithe- 
lium, forming a barrier through which even small molecules 
cannot pass — making the cell sheet an impermeable (or 
selectively permeable) barrier. (Figure 19-2, and Table 19-1, 
p. 1133) 

Okazaki fragments 

Short lengths of DNA produced on the lagging strand during 
DNA replication. Rapidly joined by DNA ligase to form a con- 
tinuous DNA strand. (Figure 5-7) 

olfactory sensory neuron 

The sensory cell in the nasal olfactory epithelium responsi- 
ble for detecting odors. 

oligodendrocyte 

Glial cell in the vertebrate central nervous system that forms 
a myelin sheath around axons. Compare Schwann cell. 

oligomer 

Short polymer. 

oligosaccharide 

Short linear or branched chain of covalently linked sugars. 
(Panel 2-4, pp. 112-113) 

O- linked oligosaccharide 

Chain of sugars attached to a protein through the OH group 
of serine or threonine residues. Compare Af-linked oligosac- 
charide. (Figure 13-32) 

oncogene 

An altered gene whose product can act in a dominant fash- 
ion to help make a cell cancerous. Typically, an oncogene is a 
mutant form of a normal gene (proto-oncogene) involved in 
the control of cell growth or division. (Figure 20-27) 

oocyte 

Developing egg, before it has completed meiosis. (Figures 
21-25 and 21-26) 

oogenesis 

Formation and maturation of oocytes in the ovary. (Figure 
21-23) 

open reading frame (ORF) 

A continuous nucleotide sequence free from stop codons in 
at least one of the three reading frames (and thus with the 
potential to code for protein). 

operator 

Short region of DNA in a bacterial chromosome that controls 
the transcription of an adjacent gene. (Figure 7-34) 

operon 

In a bacterial chromosome, a group of contiguous genes that 
are transcribed into a single mRNA molecule. (Figure 7-34) 

ORC — see origin recognition complex 

ORF — see open reading frame 

organelle 

Subcellular compartment or large macromolecular complex, 
often membrane-enclosed, that has a distinct structure, 
composition, and function. Examples are nucleus, nucleo- 
lus, mitochondrion, Golgi apparatus, and centrosomes. 
(Figure 1-30) 

Organizer (Spemann's Organizer) 

Specialized tissue at the dorsal lip of the blastopore in an 
amphibian embryo; a source of signals that help to orches- 
trate formation of the embryonic body axis. (Named after H. 
Spemann and H. Mangold, co -discoverers) (Figure 22-74) 



origin of replication 

Site in a chromosome where DNA replication starts. 

origin recognition complex (ORC) 

Large protein complex that is bound to the DNA at origins of 
replication in eucaryotic chromosomes throughout the cell 

cycle. (Figure 5-36) 

orthologs 

Genes or proteins from different species that are similar in 
sequence because they are descendants of the same gene in 
the last common ancestor of those species. Compare par- 
alogs. (Figure 1-25) 

osmosis 

Net movement of water molecules across a semipermeable 
membrane driven by a difference in concentration of solute 
on either side. The membrane must be permeable to water 
but not to the solute molecules. (Panel 11-1, p. 664) 

osteoblast 

Cell that secretes matrix of bone. (Figure 23-55) 

osteoclast 

Macrophage-like cell that erodes bone, enabling it to be 
remodeled during growth and in response to stresses 
throughout life. (Figure 23-59) 

osteocyte 

Nondividing cell in bone that develops from an osteoblast 
and is embedded in bone matrix. (Figure 23-55) 

ovulation 

Release of an egg from the ovary. (Figure 21-26) 

ovum 

Mature egg. 

oxidase 

Enzyme that catalyzes an oxidation reaction, especially one 
in which molecular oxygen is the electron acceptor. 

oxidation (verb oxidize) 

Loss of electrons from an atom, as occurs during the addi- 
tion of oxygen to a molecule or when a hydrogen is removed. 
Opposite of reduction. (Figure 2-43) 

oxidative phosphorylation 

Process in bacteria and mitochondria in which ATP forma- 
tion is driven by the transfer of electrons through the elec- 
tron transport chain to molecular oxygen. Involves the inter- 
mediate generation of a proton gradient (pH gradient) across 
a membrane and a chemiosmotic coupling of that gradient 
to the ATP synthase. (Figures 14-10 and 14-14) 

p53 

Tumor suppressor gene found mutated in about half of 
human cancers. Encodes a gene regulatory protein that is 
activated by damage to DNA and is involved in blocking fur- 
ther progression through the cell cycle. (Figures 20-37 and 

20-40) 

pairing (homolog pairing) 

In meiosis, the lining up of the two homologous chromo- 
somes along their length. (Figure 21-6) 

pair-rule gene 

In Drosophila development, a gene expressed in a series of 
regular transverse stripes along the body of the embryo and 
which helps to determine its different segments. (Figure 

22-37) 

palindromic sequence 

Nucleotide sequence that is identical to its complementary 
strand when each is read in the same chemical direction — 

e.g., GATC. (Figure 8-31) 

Par3, Par6 

Scaffold proteins involved in the specification of polarity in 
individual animal cells; Par3 and Par6 form a complex with 
atypical protein kinase C (aPKC). (Figure 19-31) 
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paracrine signaling 

Short-range cell-cell communication via secreted signal 
molecules that act on neighboring cells. (Figure 15-4) 

paralogs 

Genes or proteins that are similar in sequence because they 
are the result of a gene duplication event occurring in an 
ancestral organism. Compare orthologs. (Figure 1-25) 

parthenogenesis 

Production of a new individual from an egg cell in the 
absence of fertilization by a sperm. 

passive transport (facilitated diffusion) 

Transport of a solute across a membrane down its concen- 
tration gradient or its electrochemical gradient, using only 
the energy stored in the gradient. (Figure 11-4) 

patch-clamp recording 

Electrophysiological technique in which a tiny electrode tip 
is sealed onto a patch of cell membrane, thereby making it 
possible to record the flow of current through individual ion 
channels in the patch. (Figure 11-33) 

pathogen (adjective pathogenic) 

An organism, cell, virus, or prion that causes disease. 

pattern recognition receptor 

Receptor present on or in cells of the innate immune system 
that recognizes and responds to pathogen-associated molec- 
ular patterns (PAMPs)— such as surface carbohydrates on 
bacteria and viruses and unmethylated GC sequences in 
bacterial DNA. 

PCR (polymerase chain reaction) 

Technique for amplifying specific regions of DNA by the use 
of sequence-specific primers and multiple cycles of DNA 
synthesis, each cycle being followed by a brief heat treatment 
to separate complementary strands. (Figure 8-45) 

PDZ domain 

Protein-binding domain present in many scaffold proteins, 
and often used as a docking site for intracellular tails of 
transmembrane proteins. (Figure 19-21) 

pectin 

Mixture of polysaccharides rich in galacturonic acid which 
forms a highly hydrated matrix in which cellulose is embed- 
ded in plant cell walls. (Figure 19-79) 

pentose 

Five-carbon sugar. 

peptide 

Short polymer of amino acids. 

peptide bond 

Chemical bond between the carbonyl group of one amino 
acid and the amino group of a second amino acid — a special 
form of amide linkage. Peptide bonds link amino acids 
together in proteins. (Panel 3-1, pp. 128-129) 

peripheral lymphoid organ (secondary lymphoid organ) 

Lymphoid organ in which T cells and B cells interact with for- 
eign antigens. Examples are spleen, lymph nodes, and 
mucosal-associated lymphoid tissue. (Figure 25-3) 

peripheral membrane protein 

Protein that is attached to one face of a membrane only by 
noncovalent interactions with other membrane proteins, 
and which can be removed by relatively gentle treatments 
that leave the lipid bilayer intact. (Figure 10-19) 

permease — see transporter 

permissive (nonrestrictive) conditions 

Circumstances (such as temperature or nutrient availability) 
in which the phenotypic effect of a conditional mutation will 
be absent: that is, the phenotype will be normal. (Figure 8-55 
and Panel 8-1, pp. 554-555) 



peroxisome 

Small membrane-bounded organelle that uses molecular 
oxygen to oxidize organic molecules. Contains some 
enzymes that produce and others that degrade hydrogen 
peroxide (H 2 2 ). (Figure 12-30) 

pH 

Common measure of the acidity of a solution: "p" refers to 
power of 10, "H" to hydrogen. Defined as the negative loga- 
rithm of the hydrogen ion concentration in moles per liter 
(M). pH = -log [H + ]. Thus a solution of pH 3 will contain 10" 3 
M hydrogen ions. pH less than 7 is acidic and pH greater 
than 7 is alkaline. 

phage — see bacteriophage 

phagocyte 

General term for a professional phagocytic cell — that is, a 
cell such as a macrophage or neutrophil that is specialized to 
take up particles and microorganisms by phagocytosis. 
(Figures 13-46 and 13-47) 

phagocytosis 

Process by which unwanted cells, debris, and other bulky 
particulate material is endocytosed ("eaten") by a cell. 
Prominent in carnivorous cells, such as Amoeba proteus, and 
in vertebrate macrophages and neutrophils. From Greek 
phagein, to eat. (Figure 24-53) 

phagosome 

Large intracellular membrane-bounded vesicle that is 
formed as a result of phagocytosis. Contains ingested extra- 
cellular material. (Figure 24-30) 

phase-contrast microscope 

Type of light microscope that exploits the interference effects 
that occur when light passes through material of different 
refractive indexes. Used to view living cells. (Figures 9-7 and 
22-101) 

PH domain — see pleckstrin homology domain 

phenotype 

The observable character (including both physical appear- 
ance and behavior) of a cell or organism. (Panel 8-1, pp. 

554-555) 

phosphatase 

Enzyme that catalyzes the hydrolytic removal of phosphate 
groups from a molecule. 

phosphatidylcholine (lecithin) 

Common phospholipid present in abundance in most bio- 
logical mebranes. (Figure 10-3) 

phosphatidylinositol 

An inositol phospholipid. (Figure 15-37) 

phosphatidylinositol 4,5-bisphosphate (PI(4,5)P 2 , PIP2) 

Membrane inositol phospholipid (a phosphoinositide) that 
is cleaved by phospholipase C into IP 3 and diacylglycerol at 
the beginning of the inositol phospholipid signaling path- 
way. It can also be phosphorylated by PI 3-kinase to produce 
PIP3 docking sites for signaling poteins in the PI 3-kinase/Akt 
signaling pathway. (Figures 15-38 and 15-64) 

phosphoanhydride bond 

High-energy bond linking phosphate groups in, for instance, 
ATP and GTP (Panel 2-6, pp. 116-117) 

phosphodiester bond 

A covalent chemical bond formed when two hydroxyl groups 
form ester linkages to the same phosphate group, such as 
between adjacent nucleotides in RNA or DNA. (Figure 2-28) 

phosphoglyceride 

Phospholipid derived from glycerol, abundant in biomem- 
branes. (Figures 10-2 and 10-3) 

phosphoinositide — see inositol phospholipid 
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phosphoinositide 3-kinase (PI 3-kinase) 

Membrane -bound enzyme that is a component of the PI 3- 
kinase/Akt intracellular signaling pathway. It phosphory- 
lates phosphatidylinositol 4,5-bisphosphate at the 3 posi- 
tion on the inositol ring to produce PIP3 docking sites in the 
membrane for other intracellular signaling proteins. 
(Figure 15-64) 

phospholipase C (PLC) 

Membrane-bound enzyme that cleaves inositol phospho- 
lipids to produce IP 3 and diacylglycerol in the inositol phos- 
pholipid signaling pathway. PLC|3 is activated by GPCRs via 
specific G proteins, while PLCyis activated by RTKs. (Figures 
25-39 and 15-55) 

phospholipid 

The main category of lipids used to construct biomem- 
branes. Generally composed of two fatty acids linked 
through glycerol (or sphingosine) phosphate to one of a vari- 
ety of polar groups. (Figure 10-3, andPanel2-5, pp. 114-115) 

phosphorylation 

Reaction in which a phosphate group is covalently coupled 
to another molecule. 

phosphorylation cascade 

Series of sequential protein phosphorylations mediated by a 
series of protein kinases, each of which phosphorylates and 
activates the next kinase in the chain. Such cascades are 
common in intracellular signaling pathways. (Figure 15-60) 

photoactivation 

Technique for studying intracellular processes in which an 
inactive form of a molecule of interest is introduced into the 
cell, and is then activated by a focused beam of light at a pre- 
cise spot in the cell. (Figure 9-30) 

photochemical reaction center 

The part of a photosystem that converts light energy into 
chemical energy in photosynthesis. (Figure 14-43) 

photoreceptor 

Cell or molecule that is sensitive to light. 

photorespiration 

Wasteful metabolic process that occurs in plants in condi- 
tions of low CO2 in which O2 is used up and CO2 liberated 
without the production of carbohydrate for storage. 

photosynthesis 

Process by which plants, algae and some bacteria use the 
energy of sunlight to drive the synthesis of organic molecules 
from carbon dioxide and water. (Figures 2-40 and 14-38) 

photosynthetic electron transfer 

Light-driven reactions in photosynthesis in which electrons 
move along an electron-transport chain in a membrane, 
generating ATP and NADPH. (Figure 14-38) 

photosystem 

Multiprotein complex involved in photosynthesis that cap- 
tures the energy of sunlight and converts it to useful forms of 
energy. (Figure 14-43) 

phragmoplast 

Structure made of microtubules and actin filaments that 
forms in the prospective plane of division of a plant cell and 
guides formation of the cell plate. (Figure 17-57) 

phylogeny 

Evolutionary history of an organism or group of organisms, 
often presented in chart form as a phylogenetic tree. (Figures 
4-75 and 14-72) 

pi — see isoelectric point 

PI 3-kinase — see phosphoinositide 3-kinase 

pinocytosis 

Literally, "cell drinking." Type of endocytosis in which soluble 
materials are continually taken up from the environment in 
small vesicles and moved into endosomes along with the 



membrane-bound molecules. Compare phagocytosis. 
(Figure 13-48) 

PKA— see cyclic-AMP-dependent protein kinase 

PKB— seeAkt 

PKC — see protein kinase C 

planar cell polarity 

Type of cellular asymmetry seen in some epithelia, such that 
each cell has a polarity vector oriented in the plane of the 
epithelium. (Figure 19-32) 

plant growth regulator (plant hormone) 

Signal molecule that helps coordinate growth and develop- 
ment. Examples are ethylene, auxins, gibberellins, 
cytokinins, abscisic acid, and the brassinosteroids. 

plasma membrane 

The membrane that surrounds a living cell. (Figure 10-1) 

plasmid 

Small circular extrachromosomal DNA molecule that repli- 
cates independently of the genome. Modified plasmids are 
used extensively as vectors for DNA cloning. 

plasmodesma (plural plasmodesmata) 

Plant equivalent of a gap junction. Communicating cell-cell 
junction in plants in which a channel of cytoplasm lined by 
plasma membrane connects two adjacent cells through a 
small pore in their cell walls. 

plastid 

Cytoplasmic organelle in plants, bounded by a double mem- 
brane, that carries its own DNA and is often pigmented. 
Chloroplasts are plastids. 

platelet 

Cell fragment, lacking a nucleus, that breaks off from a 
megakaryocyte in the bone marrow and is found in large 
numbers in the bloodstream. Helps initiate blood clotting 
when blood vessels are injured. 

PLC — see phospholipase C 

pleckstrin homology domain (PH domain) 

Protein domain found in some intracellular signaling pro- 
teins. Some PH domains in intracellular signaling proteins 
bind to phosphatidylinositol 3,4,5-trisphosphate produced 
by PI 3-kinase, bringing the signaling protein to the plasma 
membrane when PI 3-kinase is activated. 

ploidy 

The number of complete homologous sets of chromosomes 
in a genome. Diploid organisms have two sets in their 
somatic cells; haploid organisms have a single set; polyploid 
organisms have more than two. 

plus end 

End of a microtubule or actin filament at which addition of 
monomers occurs most readily; the "fast-growing" end of a 
microtubule or actin filament. The plus end of an actin fila- 
ment is also known as the barbed end. (Panel 16-2, pp. 
978-979) 

point mutation 

Change of a single nucleotide pair, or a very small part of a 
single gene, in DNA. (Panel 8-1, pp. 554-555) 

polar 

In the electrical sense, describes a structure (for example, a 
chemical bond, chemical group, or molecule) with positive 
charge concentrated toward one end and negative charge 
toward the other as a result of an uneven distribution of elec- 
trons. Polar molecules are likely to be soluble in water. 
(Figure 2-10 and Panel 2-2, pp. 108-109) 

polar body 

Smaller of two daughter cells produced during meiosis by 
asymmetric division of a primary or secondary oocyte, the 
other (large) daughter being the oocyte or ovum itself. The 
polar bodies eventually degenerate. (Figure 21-23) 
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polyadenylation 

Addition of a long sequence of A nucleotides (the poly-A tail) 
to the 3' end of a nascent mRNA molecule. (Figures 6-21 and 
6-38) 

poly-A tail — see polyadenylation 

polycistronic mRNA 

Individual mRNA that encodes several different proteins— 
commonly found in bacteria but not in eucaryotes. (Figure 
6-73) 

polygenic trait 

Heritable characteristic that is influenced by multiple genes, 
each of which makes a small contribution to the phenotype. 

polymer 

Large molecule made by covalently linking multiple identical 
or similar units (monomers) together. 

polymerase 

Enzyme that catalyzes polymerization reactions such as the 
synthesis of DNA and RNA. See also DNA polymerase, RNA 
polymerase. 

polymerase chain reaction — see PCR 

polymorphic 

Describes a gene with two or more alleles that coexist at high 
frequency in a population. 

polypeptide 

Linear polymer of amino acids. Proteins are large polypep- 
tides, and the two terms can be used interchangeably. (Panel 
3-1, pp. 128-129) 

polyploid 

Cell or organism that contains more than two sets of homol- 
ogous chromosomes. 

polyribosome (polysome) 

Messenger RNA molecule to which are attached a number of 
ribosomes engaged in protein synthesis. (Figure 6-76) 

polysaccharide 

Linear or branched polymer of monosaccharides. Examples 
include glycogen, starch, hyaluronic acid, and cellulose. 
(Panel 2-4, p. 113) 

polytene chromosome 

Giant chromosome in which the DNA has undergone 
repeated replication and the many copies have stayed 
together. (Figures 4-58 and 4-59) 

porin 

Channel-forming proteins of the outer membranes of bacte- 
ria, mitochondria, and chloroplasts. 

position effect 

Difference in gene expression that depends on the position 
of the gene on the chromosome and probably reflects differ- 
ences in the state of the chromatin along the chromosome. 
When an active gene is placed next to heterochromatin, the 
inactivating influence of the heterochromatin can spread to 
affect the gene to a variable degree, giving rise to position 
effect variegation. (Figure 4-36) 

positional information 

Information supplied to or possessed by cells according to 
their position in a multicellular organism. A cell's internal 
record of its positional information is called its positional 
value. 

positive feedback 

Control mechanism whereby the end product of a reaction 
or pathway stimulates its own production or activation. 

post-transcriptional control 

Any control on gene expression that is exerted at a stage after 
transcription has begun. (Figure 7-92) 

post -( ran slational modification 

An enzyme-catalyzed change to a protein made after it is 



synthesized. Examples are acetylation, cleavage, glycosyla- 
tion, methylation, phosphorylation, and prenylation. 

pre-B cell 

Immediate precursor of a B cell. (Figure 25-22) 

preinitiation complex 

A multiprotein complex that is assembled on the origin of 
replication at the onset of the S phase of the eucaryotic cell 
cycle. Initiates DNA synthesis by unwinding the DNA helix 
and loading DNA polymerases and other replication 
enzymes onto the DNA strands. (Figure 17-23) 

pre- mRNA 

Precursor to messenger RNA. In eucaryotes, includes all 
intermediate stages of RNA processing. 

prenylation 

Covalent attachment of an isoprenoid lipid group to a pro- 
tein. (Figure 10-20) 

preprophase band 

Circumferential band of microtubules and actin filaments 
that forms around a plant cell under the plasma membrane 
prior to mitosis and cell division. (Figure 17-57) 

prereplicative complex (pre-RC) 

Multiprotein complex that is assembled at origins of replica- 
tion during late mitosis and early Gi phases of the cell cycle; 
a prerequisite to license the assembly of a preinitiation com- 
plex, and the subsequent initiation of DNA replication. 

(Figures 17-22 and 17-23) 

primary cell wall 

The first cell wall produced by a developing plant cell; it is 
thin and flexible, allowing room for cell growth. (Figure 
19-79) 

primary cilium 

Short, single, nonmotile, cilium lacking dynein that arises 
from a centriole and projects from the surface of many ani- 
mal cell types. Some signaling proteins are concentrated in 
the primary cilium. (Figure 15-48) 

primary immune response 

Adaptive immune response to an antigen that is made on 
first encounter with that antigen. (Figure 25-10) 

primary structure 

Linear sequence of monomer units in a polymer, such as the 
amino acid sequence of a protein. 

primary transcript (primary RNA transcript) 

Freshly synthesized transcript, before it has undergone splic- 
ing or other modifications. (Figure 6-21) 

primary tumor 

Tumor at the original site at which a cancer first arose. 
Secondary tumors develop elsewhere by metastasis. 

primase — see DNA primase 

primer 

Oligonucleotide that pairs with a template DNA or RNA 
strand and promotes the synthesis of a new complementary 
strand by a polymerase. 

primordial germ cell 

Cell in an embryo that is a precursor to cells that give rise to 
gametes. (Figures 21-17, 21-23, and 21-30) 

prion 

An infectious, abnormally folded form of a protein that is 
replicated in the host by inducing normal proteins of the same 
type to adopt the aberrant structure. (Figures 6-95 and 6-96) 

prion disease 

Transmissible spongiform encephalopathy — such as Kuru 
and Creutzfeld-Jakob disease (CJD) in humans, scrapie in 
sheep, and bovine spongiform encephalopathy (BSE, or 'mad 
cow disease') in cows — that is caused and transmitted by an 
infectious, abnormally folded protein (prion). (Figure 24-18) 
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probe 

Defined fragment of RNA or DNA, radioactively or chemi- 
cally labeled, used to locate specific nucleic acid sequences 
by hybridization. 

procaryote (prokaryote) 

Single-celled microorganism whose cells lack a well-defined, 
membrane-enclosed nucleus. Either a bacterium or an 
archaeon. (Figure 1-21) 

procaspase 

Inactive precursor of a caspase, a proteolytic enzyme usually 
involved in apoptosis. (Figure 18-5) 

processive 

Of an enzyme: able to proceed along a polymer chain cat- 
alyzing the same reaction repeatedly without detaching from 
the chain. 

programmed cell death 

A form of cell death in which a cell kills itself by activating an 
intracellular death program. 

prometaphase 

Phase of mitosis preceding metaphase in which the nuclear 
envelope breaks down and chromosomes first attach to the 
spindle. (Panel 17-1, pp. 1072-1073) 

promoter 

Nucleotide sequence in DNA to which RNA polymerase 
binds to begin transcription. See also inducible promoter. 
(Figure 7-44) 

proneural gene 

Gene whose expression defines cells with the potential to 
develop as neural tissue. 

proofreading 

Process by which potential errors in DNA replication, tran- 
scription, and translation are detected and corrected. 

prophase 

First stage of mitosis, during which the chromosomes are 
condensed but not yet attached to a mitotic spindle. (Panel 
17-1, pp. 1072-1073) 

protease (proteinase, proteolytic enzyme) 

Enzyme that degrades proteins by hydrolyzing some of the 
peptide bonds between amino acids. 

proteasome 

Large protein complex in the cytosol with proteolytic activity 
that is responsible for degrading proteins that have been 
marked for destruction by ubiquitylation or by some other 
means. (Figures 6-89 and 6-90) 

protein 

The major macromolecular constituent of cells. A linear 
polymer of amino acids linked together by peptide bonds in 
a specific sequence. (Figure 3-1) 

protein activity control 

The selective activation, inactivation, degradation, or com- 
partmentalization of specific proteins after they have been 
made. One of the means by which a cell controls which pro- 
teins are active at a given time or location in the cell. 

protein domain — see domain 

protein kinase 

Enzyme that transfers the terminal phosphate group of ATP 
to one or more specific amino acids (serine, threonine, or 
tyrosine) of a target protein. 

protein kinase A (PKA) — see cyclic-AMP-dependent protein 
kinase 

protein kinase B — see Akt 

protein kinase C (PKC) 

Ca 2+ -dependent protein kinase that, when activated by dia- 
cylglycerol and an increase in the concentration of cytosolic 
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Ca 2+ , phosphorylates target proteins on specific serine and 
threonine residues. (Figure 15-39) 

protein subunit 

An individual protein chain in a protein composed of 
than one chain. 

protein translocator 

Membrane-bound protein that mediates the transport of 
another protein across a membrane. (Figure 12-23) 

protein tyrosine phosphatase 

Enzyme that removes phosphate groups from phosphory- 
lated tyrosine residues on proteins. (Figure 25-71) 

proteoglycan 

Molecule consisting of one or more glycosaminoglycan 
chains attached to a core protein. (Figure 19-58) 

proteolysis 

Degradation of a protein by hydrolysis at one or more of its 
peptide bonds. 

proteolytic enzyme — see protease 

proteomics 

Study of all the proteins, including all the covalently modi- 
fied forms of each, produced by a cell, tissue, or organism. 
Proteomics often investigates changes in this larger set of 
proteins in 'the proteome' — caused by changes in the envi- 
ronment or by extracellular signals. 

protist 

Single-celled eucaryote. Includes protozoa, algae, yeasts. 
(Figure 1^1) 

proton 

Positively charged subatomic particle that forms part of an 
atomic nucleus. Hydrogen has a nucleus composed of a sin- 
gle proton (H + ). (Figure 2-1) 

proton-motive force 

The force exerted by the electrochemical proton gradient 
that moves protons across a membrane. (Figure 14-13) 

pro to- oncogene 

Normal gene, usually concerned with the regulation of cell 
proliferation, that can be converted into a cancer-promoting 
oncogene by mutation. (Figure 20-34) 

protozoa 

Free-living or parasitic, nonphotosynthetic, single-celled, 
motile eucaryotic organisms, such as Paramecium and 
Amoeba. Free-living protozoa feed on bacteria or other 
microorganisms. (Figure 1-41) 

pseudogene 

Nucleotide sequence of DNA that has accumulated multiple 
mutations that have rendered an ancestral gene inactive and 
nonfunctional. 

pseudopodium (plural pseudopodia) 

Large, thick cell-surface protrusion formed by amoeboid 
cells as they crawl. More generally, any similarly shaped 
dynamic actin-rich extension of the surface of an animal cell. 
Compare filopodium, lamellipodium. (Figure 16-94) 

pump 

Transmembrane protein that drives the active transport of 
ions or small molecules across the lipid bilayer. 

purifying selection 

Natural selection operating to retard divergence in gene 
sequences within a population in the course of evolution by 
eliminating individuals carrying deleterious mutations. 

purine 

Nitrogen-containing ring compound found in DNA and 
RNA: adenine or guanine. (Panel 2-6, pp. 116-117) 

pyrimidine 

Nitrogen-containing ring compound found in DNA and 
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RNA: cytosine, thymine, or uracil. (Panel 2-6, pp. 116-117) 

pyruvate (CH3COCOO) 

End-product of the glycotytic pathway Enters mitochondria 
and feeds into the citric acid cycle and other biosynthetic 

pathways. 

quaternary structure 

Three-dimensional relationship of the different polypeptide 
chains in a multisubunit protein or protein complex. 

quinone (Q) 

Small, lipid-soluble mobile electron carrier molecule found 
in the respiratory and photosynthetic electron-transport 
chains. (Figure 14-24) 

Rab (Rab protein) 

Monomeric GTPase in the Ras superfamily present in the 
plasma membrane and organelle membranes. Involved in 
conferring specificity on vesicle docking. (Table 15-5, p. 926) 

Ran (Ran protein) 

Monomeric GTPase in the Ras superfamily present in both 
cytosol and nucleus. Required for the active transport of 
macromolecules into and out of the nucleus through nuclear 
pore complexes. (Table 15-5, p. 926) 

Ras (Ras protein) 

Monomeric GTPase of the Ras superfamily that helps to relay 
signals from cell-surface RTK receptors to the nucleus, fre- 
quently in response to signals that stimulate cell division. 
Named for the ras gene, first identified in viruses that cause 
rat sarcomas. (Figure 3-72) 

Ras superfamily 

Large superfamily of monomeric GTPases (also called small 
GTP-binding proteins) of which Ras is the prototypical mem- 
ber. (Table 15-5, p. 926) 

Rb — see retinoblastoma protein 

reading frame 

Phase in which nucleotides are read in sets of three to 
encode a protein. A mRNA molecule can be read in any one 
of three reading frames, only one of which will give the 
required protein. (Figure 6-51) 

RecA (RecA protein) 

Prototype for a class of DNA-binding proteins that catalyze 
synapsis of DNA strands during genetic recombination. 

(Figure 5-56) 

receptor 

Any protein that binds a specific signal molecule (ligand) 
and initiates a response in the cell. Some are on the cell sur- 
face, while others are inside the cell. (Figure 15-3) 

receptor-mediated endocytosis 

Internalization of receptor-ligand complexes from the 
plasma membrane by endocytosis. (Figure 13-53) 

receptor serine/threonine kinase 

Cell-surface receptor with an extracellular ligand-binding 
domain and an intracellular kinase domain that phosphory- 
lates signaling proteins on serine or threonine residues in 
response to ligand binding. The TGF(3 receptor is an exam- 
ple. (Figure 15-69) 

receptor tyrosine kinase (RTK) 

Cell-surface receptor with an extracellular ligand-binding 
domain and an intracellular kinase domain that phosphory- 
lates signaling proteins on tyrosine residues in response to 
ligand binding. (Figure 15-52 and Table 15-4, p. 923) 

recessive 

In genetics, the member of a pair of alleles that fails to be 
expressed in the phenotype of the organism when the domi- 
nant allele is present. Also refers to the phenotype of an indi- 
vidual that has only the recessive allele. (Panel 8-1, pp. 
554-555) 



recombinant DNA 

Any DNA molecule formed by joining DNA segments from 
different sources. 

recombinant DNA technology — see genetic engineering 

recombination (genetic recombination) 

Process in which DNA molecules are broken and the frag- 
ments are rejoined in new combinations. Can occur natu- 
rally in the living cell — for example, through crossing- over 
during meiosis — or in vitro using purified DNA and enzymes 
that break and ligate DNA strands. Three broad classes are 
homologous (general), conservative site-specific, and trans- 
positional recombination. 

recombination complex 

In meiosis, a protein complex that assembles at a DNA dou- 
ble-strand break and helps mediate homologous recombina- 
tion. 

recycling endosome 

Large intracellular membrane-bounded vesicle formed from 
a fragment of an endosome; an intermediate stage on the 
passage of recycled receptors back to the cell membrane. 
(Figure 13-60) 

red blood cell — see erythrocyte 

redox pair 

Pair of molecules in which one acts as an electron donor and 
one as an electron acceptor in an oxidation-reduction reac- 
tion: for example, NADH (electron donor) and NAD + (elec- 
tron acceptor). (Panel 14-1, p. 830) 

redox potential 

The affinity of a redox pair for electrons, generally measured 
as the voltage difference between an equimolar mixture of 
the pair and a standard reference. NADH/NAD + has a low 
redox potential and O2/H2 has a high redox potential (high 
affinity for electrons). (Panel 14-1, p. 830) 

redox reaction 

Reaction in which one component becomes oxidized and 
the other reduced; an oxidation-reduction reaction. (Panel 
14-1, p. 830) 

reduction (verb reduce) 

Addition of electrons to an atom, as occurs during the addi- 
tion of hydrogen to a biological molecule or the removal of 
oxygen from it. Opposite of oxidation. (Figure 2-43) 

regulative 

Of embryos or parts of embryos: self-adjusting, so that a nor- 
mal structure emerges even if the starting conditions are per- 
turbed. 

regulator of G protein signaling (RGS) 

A GAP protein that binds to a trimeric G protein and 
enhances its GTPase activity thus helping to limit G-protein- 
mediated signaling. (Figure 15-19) 

regulatory sequence 

DNA sequence to which a gene regulatory protein binds to 
control the rate of assembly of the transcriptional complex at 
the promoter. (Figure 7-44) 

release factor 

Protein that enables release of a newly synthesized protein 
from the ribosome by binding to the ribosome in the place of 
tRNA (whose structure it mimics). (Figure 16-74) 

replication — see DNA replication 

replication fork 

Y-shaped region of a replicating DNA molecule at which the 
two strands of the DNA are being separated and the daugh- 
ter strands are being formed. (Figures 5-7 and 5-19) 

replication origin 

Location on a DNA molecule at which duplication of the 

DNA begins. (Figures 4-21 and 5-25) 
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replicative cell senescence — see also senescence 

Phenomenon observed in primary cell cultures in which cell 
proliferation slows down and finally irreversibly halts. 

reporter gene 

Genetic construct, usually artificial, in which a copy of the 
regulatory DNA of a gene of interest is linked to a sequence 
coding for an easily- detectable product. The presence or 
absence of this product (the 'reporter protein') in a cell con- 
taining the construct indicates whether the gene of interest 
is active or inactive. (Figure 8-70) 

repressor (gene repressor protein, transcriptional repressor) 

Protein that binds to a specific region of DNA to prevent 
transcription of an adjacent gene. 

respiration 

General term for an energy- generating process in cells that 
involves the oxidative breakdown of sugars or other organic 
molecules and requires the uptake of O2 while producing 
C0 2 and H 2 as waste products. (Figure 2-41) 

respiratory chain — see electron transport chain 

respiratory enzyme complex 

Any of the major protein complexes of the mitochondrial 
respiratory chain that act as electron-driven proton pumps 
to generate the proton gradient across the inner membrane. 
(Figures 14-14 and 14-26) 

resting membrane potential 

Membrane potential in equilibrium conditions in which 
there is no net flow of ions across the plasma membrane. See 
also membrane potential. 

restriction fragment 

Fragment of DNA generated by the action of restriction 

enzyme (s). 

restriction map 

Diagrammatic representation of a DNA molecule indicating 
the sites of cleavage by various restriction enzymes. 

restriction nuclease (restriction enzyme) 

One of a large number of nucleases that can cleave a DNA 
molecule at any site where a specific short sequence of 
nucleotides occurs. Extensively used in recombinant DNA 
technology. (Figures 8-31 and 8-32) 

restriction point — see Start 

restrictive (nonpermissive) conditions 

Circumstances (such as temperature or nutrient availability) 
in which the phenotypic effect of a conditional mutation will 
be evident. (Figure 8-55, and Panel 8-1, pp. 554-555) 

retinoblastoma protein (Rb) 

Tumor suppressor protein involved in the regulation of cell 
division. Mutated in the cancer retinoblastoma, as well as in 
many other tumors. Its normal activity is to regulate the 
eucaryotic cell cycle by binding to and inhibiting the E2F 
proteins, thus blocking progression to DNA replication and 
cell division. (Figure 17-62) 

retrotransposon 

Type of transposable element that moves by being first tran- 
scribed into an RNA copy that is then reconverted to DNA by 
reverse transcriptase and inserted ('retro-transposed') else- 
where in the genome. There are two types: retroviral -like 
retrotransposons and nonretroviral retrotransposons. (Table 
5-3, p. 318) 

retrovirus 

RNA- containing virus that replicates in a cell by first making 
an RNA-DNA intermediate and then a double-stranded DNA 
molecule that becomes integrated into the cell's DNA. 
(Figure 5-71) 

reverse genetics 

Approach to discovering gene function that starts from the 
DNA (gene) and its protein product and then creates 
mutants to analyze the gene's function. 



reverse transcriptase 

Enzyme first discovered in retroviruses that makes a double- 
stranded DNA copy from a single-stranded RNA template 
molecule. 

reverse transcription 

Transcription from RNA to DNA. This is in the opposite direc- 
tion to that prescribed by central dogma, which holds that DNA 
is transcribed into RNA and RNA is translated into protein. 

RGD sequence 

Tripeptide sequence of arginine-glycine-aspartic acid that 
forms a binding site for integrins; present in fibronectin and 
some other extracellular proteins. (Figure 19-72C) 

RGS — see regulator of G protein signaling 

Rho (Rho protein family) 

Family of monomeric GTPases within the Ras superfamily 
involved in signaling the rearrangement of the cytoskeleton. 
Includes Rho, Rac, and Cdc42. (Table 15-5, p. 926) 

rhodopsin 

Seven-span membrane protein of the GPCR family that acts 
as a light sensor in rod photoreceptor cells in the vertebrate 
retina. Contains the light-sensitive prosthetic group retinol. 
(Figure 15-49) 

ribonuclease 

Enzyme that cuts an RNA molecule by hydrolyzing one or 
more of its phosphodiester bonds. 

ribonucleic acid — see RNA 

ribose 

The five-carbon monosaccharide component of RNA. 
C5H10O5. Compare deoxyribose. 

ribosomal RNA (rRNA) 

Any one of a number of specific RNA molecules that form 
part of the structure of a ribosome and participate in the syn- 
thesis of proteins. Often distinguished by their sedimenta- 
tion coefficient (e.g., 28S rRNA, 5S rRNA). 

ribosome 

Particle composed of rRNAs and ribosomal proteins that cat- 
alyzes the synthesis of protein using information provided by 
mRNA. (Figure 1-10) 

ribozyme 

RNA with catalytic activity. 

RNA (ribonucleic acid) 

Polymer formed from covalently linked ribonucleotide 
monomers. See also messenger RNA, ribosomal RNA, trans- 
fer RNA. (Table 6-1, p. 336, and Panel 2-6, pp. 116-117) 

RNA editing 

Type of RNA processing that alters the nucleotide sequence 
of a pre-mRNA transcript after it is synthesized by inserting, 
deleting, or altering individual nucleotides. 

RNA interference (RNAi) 

As originally described, mechanism by which an experi- 
mentally introduced double-stranded RNA induces 
sequence-specific destruction of complementary mRNAs. 
The mechanism, which is highly conserved in eucaryotes, 
proceeds through short double-stranded small interfering 
RNAs (siRNAs) produced by endonucleolytic cleavage. The 
term RNAi is often used broadly to also include the inhibi- 
tion of gene expression by microRNAs (miRNAs), which are 
encoded in the cells own genome. RNA interference is 
widely used experimentally to study the effects of inactivat- 
ing specific genes. (Figure 7-115) 

RNA polymerase 

Enzyme that catalyzes the synthesis of an RNA molecule on 
a DNA template from ribonucleoside triphosphate precur- 
sors. (Figure 6-8) 

RNA primer 

Short stretch of RNA synthesized on a DNA template. It is 
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required by DNA polymerases to start their DNA synthesis. 

RNA processing 

Broad term for the various modifications an RNA transcript 
undergoes as it reaches its mature form. May include 5' cap- 
ping, 3' polyadenylation, 3' cleavage, splicing, and editing. 

RNA splicing 

Process in which intron sequences are excised from RNA 
transcripts in the nucleus during formation of messenger 
and other RNAs. 

rod photoreceptor (rod) 

Photoreceptor cell in the vertebrate retina that is responsible 
for noncolor vision in dim light. (Figure 23-17) 

rough endoplasmic reticulum (rough ER) 

Endoplasmic reticulum with ribosomes on its cytosolic sur- 
face. Involved in the synthesis of secreted and membrane- 
bound proteins. 

rRNA — see ribosomal RNA 

rRNA gene 

Gene that specifies a ribosomal RNA (rRNA). 

RTK — see receptor tyrosine kinase 

RT-PCR (reverse transcription-polymerase chain reaction) 

Technique in which a population of mRNAs is converted into 
cDNAs via reverse transcription, and the cDNAs are then 
amplified by PCR. 

S — see S phase 

saccharide 

Sugar. 

Saccharomyces 

Genus of yeasts that reproduce asexually by budding or sex- 
ually by conjugation. Economically important in brewing 
and baking. Saccharomyces cerevisiae is widely used as a sim- 
ple model organism in the study of eucaryotic cell biology. 
See also Schizosaccharomyces. 

sarcoma 

Cancer of connective tissue. 

sarcomere 

Repeating unit of a myofibril in a muscle cell, composed of 
an array of overlapping thick (myosin) and thin (actin) fila- 
ments between two adjacent Z discs. (Figure 16-74) 

sarcoplasmic reticulum 

Specialized type of endoplasmic reticulum in the cytoplasm 
of muscle cells that contains high concentrations of 
sequestered Ca 2+ that is released into the cytosol during 
muscle excitation. (Figure 16-77) 

satellite DNA 

Region of highly repetitive DNA from a eucaryotic chromo- 
some, identifiable by its unusual nucleotide composition. 
Typically present at centromeres (as well as other sites) in 
higher eucaryotes, and thought to play a part in centromere 
function. (Figure 4-49) 

scaffold protein 

Protein that binds groups of intracelluar signaling proteins 
into a signaling complex, often anchoring the complex at a 
specific location in the cell. (Figure 15-17) 

scanning electron microscope 

Type of electron microscope that produces an image of the 
surface of an object. 

S-Cdk 

Cyclin-Cdk complex formed in vertebrate cells by an S-cyclin 
and the corresponding cyclin-dependent kinase (Cdk). 
(Figure 17-16 and Table 17-1, p. 1063) 

S. cerevisiae — see Saccharomyces 



Schizosaccharomyces 

Genus of rod-shaped yeasts that reproduce by binary fission. 
S. pombe, along with the budding yeast Saccharomyces cere- 
visiae, is a model organism used in many different studies. 

SCF (SCF protein) 

Family of ubiquitin ligases formed as a complex of several 
different proteins. One is involved in regulating the eucary- 
otic cell cycle, directing the destruction of inhibitors of S- 
Cdks in late Gi and thus promoting the activation of S-Cdks 
and DNA replication. (Figures 3-79 and 17-20) 

Schwann cell 

Glial cell responsible for forming myelin sheaths in the 
peripheral nervous system. Compare oligodendrocyte. 
(Figure 11-32) 

S-cyclin 

Member of a class of cyclins that accumulate during late Gi 
phase and bind Cdks soon after progression through Start; 
they help stimulate DNA replication and chromosome dupli- 
cation. Levels remain high until late mitosis, after which 
these cyclins are destroyed. (Figure 17-16) 

SDS-PAGE (sodium dodecyl sulfate-polyacrylamide gel 
electrophoresis) 

Type of electrophoresis used to separate proteins by size. The 
protein mixture to be separated is first treated with a power- 
ful negatively charged detergent (SDS) and with a reducing 
agent (p mercaptoethanol), before being run through a poly- 
acrylamide gel. The detergent and reducing agent unfold the 
proteins, free them from association with other molecules, 
and separate the polypeptide subunits. See also elec- 
trophoresis. 

secondary cell wall 

Permanent rigid cell wall that is laid down underneath the 
thin primary cell wall in certain plant cells that have com- 
pleted their growth. (Figure 19-77C) 

secondary immune response 

Adaptive immune response to an antigen that is made on a 
second or subsequent encounter with a given antigen. More 
rapid in onset and stronger than the primary immune 
response. (Figure 25-10) 

secondary structure 

Regular local folding pattern of a polymeric molecule; in pro- 
teins, a-helices and p-sheets. 

second messenger (small intracellular mediator) 

Small intracellular signaling molecule that is formed or 
released for action in response to an extracellular signal and 
helps to relay the signal within the cell. Examples include 
cyclic AMP, cyclic GMI* IP3, Ca 2+ , and diacylglycerol. (Figure 
15-17) 

secretory vesicle 

Membrane-bounded organelle in which molecules destined 
for secretion are stored prior to release. Sometimes called 
secretory granule because darkly staining contents make the 
organelle visible as a small solid object. (Figures 13-63 and 
13-66) 

securin 

Protein that binds to the protease separase and thereby pre- 
vents its cleavage of the protein linkages that hold sister 
chromatids together in early mitosis. Securin is destroyed at 
the metaphase-to-anaphase transition. (Figure 17-44) 

seed 

In plants, the structure containing the dormant embryo, 
along with a food store, enclosed in a hard protective coat. 
(Panel 22-1, p. 1401) 

segment-polarity gene 

In Drosophila development, a gene involved in specifying 
the anteroposterior organization of each body segment. 

(Figures 22-37 and 22-41) 
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selectable marker gene 

Gene included in a DNA construct to signal presence of that 
construct in a cell, and making it possible to select cells 
according to whether they contain the construct. 

selectin 

Member of a family of cell-surface carbohydrate-binding 
proteins that mediate transient, Ca 2+ -dependent cell-cell 
adhesion in the bloodstream — for example between white 
blood cells and the endothelium of the blood vessel wall. 
(Figure 19-19) 

selectivity filter 

The part of an ion channel structure that determines which 
ions it can transport. (Figures 11-23 and 11-24) 

senescence 

(1) aging of an organism. (2) replicative cell senescence: phe- 
nomenon observed in primary cell cultures in which cell pro- 
liferation slows down and finally halts irreversibly. 

sensory hair cell — see auditory hair cell 

separase 

Protease that cleaves the cohesin protein linkages that hold 
sister chromatids together. Acts at anaphase, enabling chro- 
matid separation and segregation. (Figure 17-44) 

septate junction 

Main type of occluding cell junction in invertebrates; its 
structure is distinct from that of vertebrate tight junctions. 
(Figure 19-28) 

sequencing 

Determination of the order of nucleotides or amino acids in 
a nucleic acid or protein molecule. (Figure 8-50) 

serine protease 

Type of protease that has a reactive serine in the active site. 
(Figures 3-12 and 3-38) 

serine/threonine kinase 

Enzyme that phosphorylates specific proteins on serine or 
threonines. (Figure 15-70) 

sex chromosome 

Chromosome that may be present or absent, or present in a 
variable number of copies, determining the sex of the indi- 
vidual; in mammals, the X and Y chromosomes. 

SH2 domain 

Src homology region 2, a protein domain present in many 
signaling proteins. Binds a short amino acid sequence con- 
taining a phosphotyrosine. (Panel 3-2, pp. 132-133) 

p- sheet — sec beta sheet 

side chain 

The part of an amino acid that differs between amino acid 
types. The side chains give each type of amino acid its 
unique physical and chemical properties. (Panel 3-1, pp. 
128-129) 

signaling cascade 

Sequence of linked intracellular reactions, typically involving 
multiple amplification steps in a relay chain, triggered by an 
activated cell-surface receptor. 

signal molecule 

Extracellular chemical produced by a cell that signals to 
other cells in the organism to alter the cells' behavior. (Figure 
15-1) 

signal patch 

Protein-sorting signal that consists of a specific three- 
dimensional arrangement of atoms on the folded protein's 
surface. (Figure 13-45) 

signal peptidase 

Enzyme that removes a terminal signal sequence from a pro- 
tein once the sorting process is complete. (Figure 12-25) 



signal-recognition particle (SRP) 

Ribonucleoprotein particle that binds an ER signal sequence 
on a partially synthesized polypeptide chain and directs the 
polypeptide and its attached ribosome to the endoplasmic 
reticulum. (Figure 12-39) 

signal-relaying junction 

Complex type of cell-cell junction that alows signals to be 
relayed from one cell to another across their plasma mem- 
branes at sites of cell-to-cell contact. Typically includes 
anchorage proteins as well as proteins mediating signal 
transduction. (Figure 19-2, and Table 19-1, p. 1133) 

signal sequence 

Short continuous sequence of amino acids that determines 
the eventual location of a protein in the cell. An example is 
the N-terminal sequence of 20 or so amino acids that directs 
nascent secretory and transmembrane proteins to the endo- 
plasmic reticulum. (Table 12-3, p. 702) 

signal transduction 

Conversion of a signal from one physical or chemical form to 
another (e.g., conversion of light to a chemical signal or of 
extracellular signals to intracellular ones). 

single- nucleotide polymorphism (SNP) 

Variation between individuals in a population at a specific 
nucleotide in their DNA sequence. 

single-pass transmembrane protein 

Membrane protein in which the polypeptide chain crosses 
the lipid bilayer only once. (Figure 10-19) 

single-strand DNA-binding protein 

Protein that binds to the single strands of the opened-up 
DNA double helix, preventing helical structures from 
reforming while the DNA is being replicated. (Figure 5-16) 

siRNA — see small interfering RNA 

sister chromatids 

Tightly linked pair of chromosomes that arise from chromo- 
some duplication during S phase. They separate during M 
phase and segregate into different daughter cells. (Figure 
17-26) 

site-directed mutagenesis 

Technique by which a mutation can be made at a particular 
site in DNA. (Figure 8-63) 

site-specific recombination 

Type of recombination that occurs at specific DNA 
sequences and is carried out by specific proteins that recog- 
nize these sequences. Can occur between two different DNA 
molecules or within a single DNA molecule. 

skeletal muscle cell — see muscle cell 

sliding clamp 

Protein complex that holds the DNA polymerase on DNA 
during DNA replication. (Figure 5-18) 

Smad protein 

Latent gene regulatory protein that is phosphorylated and 
activated by receptor serine/threonine kinases and carries 
the signal from the cell surface to the nucleus. (Figure 
15-69) 

small interfering RNA (siRNA) 

Short (21-26 nucleotides) double-stranded RNAs that inhibit 
gene expression by directing destruction of complementary 
mRNAs. Production of siRNAs is triggered by exogenously 
introduced double-stranded RNA. (Figure 7-115) 

small intracellular mediator— see second messenger 

small nuclear ribonucleoprotein (snRNP) 

Complex of an snRNA with proteins that forms part of a 

spliceosome. (Figure 6-29) 

small nuclear RNA (snRNA) 

Small RNA molecules that are complexed with proteins to 
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form the ribonucleoprotein particles (snRNPs) involved in 
RNA splicing. (Figures 6-29 and 6-30) 

small nucleolar RNA (snoRNA) 

Small RNAs found in the nucleolus, with various functions, 
including guiding the modifications of precursor rRNA. 

(Table 6-1, p. 336, and Figure 6-43) 

smooth endoplasmic reticulum (smooth ER) 

Region of the endoplasmic reticulum not associated with 
ribosomes. Involved in lipid synthesis. (Figure 12-36) 

smooth muscle cell — see muscle cell 

SNARE 

Member of a large family of transmembrane proteins present 
in organelle membranes and the vesicles derived from them. 
SNAREs catalyze the many membrane fusion events in cells. 
They exist in pairs — a v-SNARE in the vesicle membrane that 
binds specifically to a complementary t-SNARE in the target 
membrane. 

SNP — see single- nucleotide polymorphism 

snRNA — see small nuclear RNA 

solute 

Any molecule that is dissolved in a liquid. The liquid is called 
a solvent. 

somatic cell 

Any cell of a plant or animal other than cells of the germ line. 
From Greek soma, body. 

somatic hypermutation 

Accumulation of point mutations in the assembled variable- 
region-coding sequences of immunoglobulin genes that 
occurs when B cells are activated to form memory cells. 
Results in the production of antibodies with altered antigen- 
binding sites. 

somite 

One of a series of paired blocks of mesoderm that form dur- 
ing early development and lie on either side of the notochord 
in a vertebrate embryo. They give rise to the segments of the 
body axis, including the vertebrae, muscles, and associated 
connective tissue. (Figure 22-81) 

sorting signal 

Amino acid sequence that directs the delivery of a protein to 
a specific location, such as a particular intracellular com- 
partment. 

Southern blotting 

Technique in which DNA fragments separated by elec- 
trophoresis are immobilized on a paper sheet. Specific frag- 
ments are then detected with a labeled nucleic acid probe. 
(Named after E.M. Southern, inventor of the technique.) 

spectrin 

Abundant protein associated with the cytosolic side of the 
plasma membrane in red blood cells, forming a network that 
supports the membrane. Also present in other cells. (Figure 

10-41) 

Spemann's Organizer— see Organizer 

sperm (spermatozoon, plural spermatozoa) 

Mature male gamete in animals. Motile and usually small 
compared with the egg. (Figure 21-27) 

spermatogenesis 

Development of sperm in the testes. (Figure 21-30) 

S phase 

Period of a eucaryotic cell cycle in which DNA is synthesized. 

(Figure 17-4) 

sphingolipid 

Phospholipid derived from sphingosine. (Figure 10-3) 



spindle assembly checkpoint (metaphase-to-anaphase 
transition checkpoint) 

Checkpoint that operates during mitosis to ensure that all 
chromosomes are properly attached to the spindle before 
sister- chromatid separation starts. (Figure 17-14, and Panel 
17-1, pp. 1072-1073) 

spliceosome 

Large assembly of RNA and protein molecules that performs 
pre-mRNA splicing in eucaryotic cells. (Figures 6-29 and 6-30) 

splicing 

Removal of introns from a pre-mRNA transcript by splicing 
together the exons that lie on either side of each intron. See 
also alternative RNA splicing, and trans- splicing. 

S. pombe — see Schizosaccharomyces 

Src (Src protein family) 

Family of cytoplasmic tyrosine kinases (pronounced "sark") 
that associate with the cytoplasmic domains of some 
enzyme-linked cell-surface receptors (for example, the T cell 
antigen receptor) that lack intrinsic tyrosine kinase activity. 
They transmit a signal onwards by phosphorylating the 
receptor itself and specific intracellular signaling proteins on 
tyrosines. (Figures 3-10 and 15-70) 

SRP — see signal-recognition particle 

standard free-energy change (AG°) 

Free -energy change of two reacting molecules at standard 
temperature and pressure when all components are present 
at a concentration of 1 mole per liter. (Table 2-4, p. 77, and 
Figure 14-18) 

starch 

Polysaccharide composed exclusively of glucose units, used 
as an energy storage material in plant cells. (Figure 2-75) 

Start (Start checkpoint, restriction point) 

Important checkpoint at the end of Gi in the eucaryotic cell 
cycle. Passage through Start commits the cell to enter S 
phase. The term was originally used for this checkpoint in 
the yeast cell cycle only; the equivalent point in the mam- 
malian cell cycle was called the restriction point. In this book 
we use Start for both. (Figure 17-14) 

start-transfer signal 

Short amino acid sequence that enables a polypeptide chain 
to start being translocated across the endoplasmic reticulum 
membrane through a protein translocator. Multipass mem- 
brane proteins have both N-terminal (signal sequence) and 
internal start- transfer signals. (Figures 12-45-12-48) 

STAT (signal transducer and activator of transcription) 

Latent gene regulatory protein that is activated by phospho- 
rylation by JAK kinases and enters the nucleus in response to 
signaling from receptors of the cytokine receptor family. 

(Figure 15-68) 

stem cell 

Undifferentiated cell that can continue dividing indefinitely, 
throwing off daughter cells that can either commit to differ- 
entiation or remain a stem cell (in the process of self- 
renewal). (Figure 23-5) 

stem-cell niche 

The specialized microenvironment in a tissue in which self- 
renewing stem cells can be maintained. (Figure 23-27) 

stereocilium 

A large, rigid microvillus found in "organ pipe" arrays on the 
apical surface of hair cells in the ear. A stereocilium contains 
a bundle of actin filaments, rather than microtubules, and is 
thus not a true cilium. (Figures 23-13 and 23-15) 

steroid 

Hydrophobic lipid molecule with a characteristic four- 
ringed structure; derived from cholesterol. Many important 
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hormones, including Cortisol, estrogen, and testosterone, are 
steroids that activate intracellular nuclear receptors. (Panel 
2-5, pp. 114-115) 

stimulatory G protein (G s ) 

G protein that, when activated, activates the enzyme adeny- 
lyl cyclase and thus stimulates the production of cyclic AMP. 
{See also G protein, and Table 15-3, p. 919.) 

stochastic 

Random. Involving chance, probability, or random variables. 

stop-transfer signal 

Hydrophobic amino acid sequence that halts translocation 
of a polypeptide chain through the endoplasmic reticulum 
membrane, thus anchoring the protein chain in the mem- 
brane. (Figure 12-48 and 12-49) 

striated muscle 

Muscle composed of transversely striped (striated) myofib- 
rils. Skeletal and heart muscle of vertebrates are examples. 
(Figure 16-74) 

stroma 

(1) 'Bedding': the connective tissue in which a glandular or 
other epithelium is embedded. Stromal cells provide the 
environment necessary for the development of other cells 
within the tissue. (Figure 20-19) (2) The large interior space 
of a chloroplast, containing enzymes that incorporate CO2 
into sugars. (Figure 12-21) 

structural gene 

Region of DNA that codes for a protein or for an RNA mole- 
cule that forms part of a structure or has an enzymatic func- 
tion. Distinguished from regions of DNA that regulate gene 
expression. 

substrate 

Molecule on which an enzyme acts. 

substratum 

Solid surface to which a cell adheres. 

sucrose 

Disaccharide composed of one glucose unit and one fruc- 
tose unit. The major form in which glucose is transported 
between plant cells. (Panel 2-4, pp.1 12-1 13) 

sugar 

Small carbohydrate with a monomer unit of general formula 
(CH2O) n. Examples are the monosaccharides glucose, fruc- 
tose and mannose, and the disaccharide sucrose (glucose 
and fructose linked). (Panel 2-4, pp.1 12-1 13) 

sulfhydryl (thiol, -SH) 

Chemical group containing sulfur and hydrogen; found in 
the amino acid cysteine and other molecules. Two 
sulfhydryls can join to produce a disulfide bond. (Panel 2-1, 
pp. 106-107, and Figure 3-28) 

suppressor mutation 

Mutation that suppresses the phenotypic effect of another 
mutation, so that the double mutant individual seems nor- 
mal. (Panel 8-1, pp. 554-555) 

surface plasmon resonance (SPR) 

Technique used to characterize molecular interactions, such 
as antibody-antigen binding, ligand-receptor coupling, and 
the binding of proteins to DNA. Binding interactions are 
detected by monitoring the reflection of a beam of light off 
the interface between an aqueous solution of potential bind- 
ing molecules and a biosensor surface carrying the immobi- 
lized bait protein. 

survival factor 

Extracellular signal that promotes cell survival by inhibiting 
apoptosis. (Figure 18-14) 

symbiosis 

Intimate association between two organisms of different 
species from which both derive a long-term selective advan- 
tage. (Figure 1-16) 



symporter 

Carrier protein that transports two types of solute across the 
membrane in the same direction. (Figure 1 1-8) 

synapse 

Communicating cell-cell junction that allows signals to pass 
from a nerve cell to another cell. In a chemical synapse, the 
signal is carried by a diffusible neurotransmitter. (Figure 
19-22) In an electrical synapse, a direct connection is made 
between the cytoplasms of the two cells via gap junctions. 
(Figure 19-34) 

synapsis 

(1) In genetic recombination, the initial formation of base 
pairs between complementary DNA strands in different DNA 
molecules that occurs at sites of crossing-over between chro- 
mosomes. (Figure 5-56) (2) In meiosis, the formation of a 
synaptonemal complex between two tightly aligned homol- 
ogous chromosomes. (Figure 21-9) 

synaptic vesicle 

Small neurotransmitter-filled secretory vesicle found at the 
axon terminals of nerve cells. Its contents are released into 
the synaptic cleft by exocytosis when an action potential 
reaches the axon terminal. 

synaptonemal complex 

Structure that holds paired homologous chromosomes 
tightly together in pachytene of prophase I in meiosis and 
promotes the final steps of crossing over. (Figures 21-8 and 

21-9) 

syncytium 

Mass of cytoplasm containing many nuclei enclosed by a 
single plasma membrane. Typically the result either of cell 
fusion or of a series of incomplete division cycles in which 
the nuclei divide but the cell does not. 

synteny 

The presence, in different species, of regions of chromo- 
somes with the same genes in the same order. 

synthetic lethality 

An interaction between two mutant genes in which the two 
mutant genes together result in cell death, whereas either 
single mutation alone does not. 

tap-tagging (tandem affinity purification tagging) 

Highly efficient method for protein purification, based on 
construction of a fusion protein in which the protein of 
interest is linked to two protein domains in tandem that act 
as tags for affinity chromatography purification. Two rounds 
of affinity purification, first using one tag and then the other, 
result in a very pure preparation. 

TATA box 

Sequence in the promoter region of many eucaryotic genes 
that binds a general transcription factor and hence specifies 
the position at which transcription is initiated. (Figures 6-16 
and 6-17) 

TCA (tricarboxylic acid) cycle — see citric acid cycle 

T cell (T lymphocyte) 

Type of lymphocyte responsible forT-cell-mediated adaptive 
immune responses; the class includes cytotoxic T cells, 
helper T cells, and regulatory T cells. 

T-cell-mediated immune response 

Any adaptive immune response mediated by antigen-spe- 
cific T cells. 

telomerase 

Enzyme that elongates telomere sequences in DNA, which 
occur at the ends of eucaryotic chromosomes. 

telomere 

End of a chromosome, associated with a characteristic DNA 
sequence that is replicated in a special way. Counteracts the 
tendency of the chromosome otherwise to shorten with each 
round of replication. From Greek telos, end. 
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telophase 

Final stage of mitosis in which the two sets of separated 
chromosomes decondense and become enclosed by nuclear 
envelopes, (Panel 17-1, pp. 1072-1073) 

temperature-sensitive (ts) mutant 

Organism or cell carrying a mutation that shows its pheno- 
typic effect in one temperature range (usually high tempera- 
ture) but not at other (usually low) temperatures. (Panel 8-1, 

pp. 554-555, and Figure 8-55) 

template 

Single strand of DNA or RNA whose nucleotide sequence 
acts as a guide for the synthesis of a complementary strand. 
(Figure 1-3) 

terminator 

Signal in bacterial DNA that halts transcription. 

tertiary structure 

Complex three-dimensional form of a folded polymer chain, 
especially a protein or RNA molecule. 

TGFp superfamily (transforming growth factor-P superfamily) 

Large family of structurally related secreted proteins that act 
as hormones and local mediators to control a wide range of 
functions in animals, including during development. It 
includes the TGFp/activin and bone morphogenetic protein 
(BMP) subfamilies. (Figure 15-69) 

TGN — see trans Golgi network 

thioester bond 

High-energy bond formed by a condensation reaction 
between an acid (acyl) group and a thiol group (-SH). Seen, 
for example, in acetyl CoA and in many enzyme- substrate 
complexes. (Figure 2-62) 

thiol — see sulfhydryl 

thylakoid 

Flattened sac of membrane in a chloroplast that contains 
chlorophyll and other pigments and carries out the light- 
trapping reactions of photosynthesis. Stacks of thylakoids 
form the grana of chloroplasts. (Figures 14-35 and 14-36) 

tight junction 

Cell-cell junction that seals adjacent epithelial cells together, 
preventing the passage of most dissolved molecules from 
one side of the epithelial sheet to the other. (Figures 19-3 and 
19-26) 

TIM complexes 

Protein translocators in the mitochondrial inner membrane. 
The TIM23 complex mediates the transport of proteins into 
the matrix and the insertion of some proteins into the inner 
membrane; the TIM22 complex mediates the insertion of a 
subgroup of proteins into the inner membrane. (Figure 
12-23) 

T lymphocyte — seeT cell 

Toll-like receptor family (TLR) 

Important family of mammalian pattern recognition recep- 
tors abundant on or in cells of the innate immune system. 
They recognize pathogen-associated immunostimulants 
such as lipopolysacharide and peptidoglycan. (Figure 24-51) 

TOM complex 

Multisubunit protein complex that transports proteins 
across the mitochondrial outer membrane. (Figure 12-23) 

topoisomerase (DNA topoisomerase) 

Enzyme that binds to DNA and reversibly breaks a phospho- 
diester bond in one or both strands. Topoisomerase I creates 
transient single-strand breaks, allowing the double helix to 
swivel and relieving superhelical tension. Topoisomerase II 
creates transient double-strand breaks, allowing one double 
helix to pass through another and thus resolving tangles. 
(Figures 5-22 and 5-23) 



totipotent 

Describes a cell that is able to give rise to all the different cell 
types in an organism. 

trans 

On the other (far) side. 

transcellular transport 

Transport of solutes, such as nutrients, across an epithelium, 
by means of membrane transport proteins in the apical and 
basal faces of the epithelial cells. (Figure 11-11) 

transcript 

RNA product of DNA transcription. (Figure 6-21) 

transcription (DNA transcription) 

Copying of one strand of DNA into a complementary RNA 
sequence by the enzyme RNA polymerase. (Figure 6-21) 

transcriptional activator — see activator 

transcriptional repressor — see repressor 

transcription attenuation 

Inhibition of gene expression by the premature termination 
of transcription. 

transcription factor 

Term loosely applied to any protein required to initiate or 
regulate transcription in eucaryotes. Includes gene regula- 
tory proteins, the general transcription factors, coactivators, 
co-repressors, histone-modifying enzymes, and chromatin 
remodeling complexes. (Figures 6-19 and 7-44) 

transcytosis 

Uptake of material at one face of a cell by endocytosis, its 
transfer across a cell in vesicles, and discharge from another 
face by exocytosis. (Figure 13-60) 

transfection 

Introduction of a foreign DNA molecule into a cell. Usually 
followed by expression of one or more genes in the newly 
introduced DNA. 

transfer RNA (tRNA) 

Set of small RNA molecules used in protein synthesis as an 
interface (adaptor) between mRNA and amino acids. Each 
type of tRNA molecule is covalently linked to a particular 
amino acid. (Figures 1-9 and 6-52) 

transformation 

(1) Insertion of new DNA (e.g., a plasmid) into a cell or organ- 
ism, such as into competent ExolL (2) Conversion of a nor- 
mal cell into one that behaves in many ways like a cancer cell 
(i.e., unregulated proliferation, anchorage-independent 
growth in culture). 

transforming growth factor-P superfamily— see TGFP 
superfamily 

transgenic organism 

Plant or animal that has stably incorporated one or more 
genes from another cell or organism (through insertion, 
deletion, and/or replacement) and can pass them on to suc- 
cessive generations. The gene that has been added is called a 
transgene. (Figures 8-64 and 8-65) 

transit amplifying cell 

Cell derived from a stem cell that divides a limited number of 
cycles before terminally differentiating. (Figure 23-7) 

transition state 

Structure that forms transiently in the course of a chemical 
reaction and has the highest free energy of any reaction 
intermediate. Its formation is a rate-limiting step in the reac- 
tion. (Figure 3-46) 

translation (RNA translation) 

Process by which the sequence of nucleotides in a mRNA 
molecule directs the incorporation of amino acids into pro- 
tein. Occurs on a ribosome. (Figures 6-66 and 6-67) 
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translocation 

(1) Type of mutation in which a portion of one chromosome is 
broken off and attached to another. (Panel 8-1 and Figure 20-5) 

(2) The process of transferring a protein across a membrane. 

transmembrane protein 

Membrane protein that extends through the lipid bilayer, with 
part of its mass on either side of the membrane. (Figure 10-19) 

transmitter-gated ion channel (ion-channel-coupled receptor, 
ionotropic receptor) 

Ion channel found at chemical synapses in the postsynaptic 
plasma membranes of nerve and muscle cells. Opens only in 
response to the binding of a specific extracellular neuro- 
transmitter. The resulting inflow of ions leads to the genera- 
tion of a local electrical signal in the postsynaptic cell. 
(Figures 15-16 and 11-35) 

transporter (carrier protein, permease) 

Membrane transport protein that binds to a solute and 
transports it across the membrane by undergoing a series of 
conformational changes. Transporters can transport ions or 
molecules passively down an electrochemical gradient or 
can link the conformational changes to a source of metabolic 
energy such as ATP hydrolysis to drive active transport. 
Compare channel protein. See also membrane transport pro- 
tein. (Figure 11-3) 

transposable element (transposon) 

Segment of DNA that can move from one genome position to 
another by transposition. (Table 5-3, p. 318) 

transposase 

Enzyme that cuts a transposon sequence in a chromosome 
and causes the DNA sequence to be inserted into a new tar- 
get site. The transposase is usually encoded by the transpo- 
son that it acts upon. (Figure 5-69) 

transposition (transpositional recombination) 

Movement of a DNA sequence from one genome site to 
another. (Table 5-3, p. 318) 

transposon — see transposable element 

trans-splicing 

Type of RNA splicing present in a few eucaryotic organisms 
in which exons from two separate RNA transcripts are joined 
together to form an mRNA. (Figure 6-34) 

treadmilling 

Process by which a polymeric protein filament is maintained 
at constant length by addition of protein subunits at one end 
and loss of subunits at the other. (Panel 16-2, pp. 978-979) 

triacylglycerol (triglyceride) 

Molecule composed of three fatty acids esterified to glycerol. 
The main constituent of fat droplets in animal tissues (where 
the fatty acids are saturated) and of vegetable oils (where the 
fatty acids are mainly unsaturated). (Panel 2-5, pp. 114-115) 

tricarboxylic acid (TCA) cycle — see citric acid cycle 

trimeric G protein (trimeric GTP-binding protein) — see 
G protein 

triglyceride — see triacylglycerol 

tRNA— see transfer RNA 

ts mutant — see temperature-sensitive mutant 

t-SNARE— see SNARE 

tubulin 

The protein subunit of microtubules. (Panel 16-1, p. 968, and 
Figure 16-11) 

y-tubulin ring complex (yiuRC) — see gamma tubulin ring 
complex 

tumor 

Abnormal mass of cells resulting from a defect in cell-prolif- 



eration control. A tumor can be noninvasive (benign) or 
invasive (cancerous, malignant) . (Figure 20-3) 

tumor necrosis factor (TNFa) 

Cytokine that is especially important in inducing inflamma- 
tory responses. (Figure 15-79) 

tumor progression 

Process by which an initial mildly disordered cell behavior 
gradually evolves into a full-blown cancer. (Figures 20-9 and 
20-11) 

tumor suppressor gene 

Gene that appears to help prevent formation of a cancer. 
Loss- of- function mutations in such genes favor the develop- 
ment of cancer. (Figure 20-27) 

tumor virus 

Virus that can make the cell it infects cancerous. 

turgor pressure 

Large hydrostatic pressure developed inside a plant cell as 
the result of the intake of water by osmosis; it is the force 
driving cell expansion in plant growth and it maintains the 
rigidity of plant stems and leaves. (Panel 11-1, p. 664) 

two-hybrid system — see yeast two-hybrid system 

type HI secretion system 

Bacterial system for delivering toxic proteins into the cells of 
their host. (Figure 24-8) 

tyrosine kinase 

Enzyme that phosphorylates specific proteins on tyrosines. 
See also cytoplasmic tyrosine kinase. (Figure 15-70) 

ubiquitin 

Small, highly conserved protein present in all eucaryotic 
cells that becomes covalently attached to lysines of other 
proteins. Attachment of a short chain of ubiquitins to such a 
lysine can tag a protein for intracellular proteolytic destruc- 
tion by a proteasome. (Figure 6-92) 

ubiquitin ligase 

Any one of a large number of enzymes that attach ubiquitin 
to a protein, often marking it for destruction in a protea- 
some. The process catalyzed by a ubiquitin ligase is called 
ubiquitylation. (Figure 3-79) 

unfolded protein response 

Cellular response triggered by an accumulation of misfolded 
proteins in the endoplasmic reticulum. Involves increased 
transcription of ER chaperones and degradative enzymes. 
(Figure 12-55) 

uniporter 

Carrier protein that transports a single solute from one side 
of the membrane to the other. (Figure 11-8) 

UTR (untranslated region) 

Noncoding region of an mRNA molecule. The 3' UTR extends 
from the stop codon that terminates protein synthesis to the 
start of the poly-A tail. (Figure 7-105) The 5' UTR extends from 
the 5' cap to the start codon that initiates protein synthesis. 

"MMAX 

Maximum rate of an enzymatic reaction. (Figure 3-45 and 
Panel 3-3, pp. 162-163) 

vacuole 

Very large fluid-filled vesicle found in most plant and fungal 
cells, typically occupying more than a third of the cell vol- 
ume. (Figure 13-39) 

valence 

Number of electrons that an atom must gain or lose (either 
by sharing or by transfer) to fill its outer shell. The valence of 
an atom is equal to the number of single bonds that the atom 
can form. 

van der Waals attraction 

Type of (individually weak) noncovalent bond that is formed 
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at close range between nonpolar atoms. (Table 2-1, 
Panel 2-3, pp. 110-111) 



p. 53 and 



variable region 

Region of an immunoglobulin light or heavy chain that dif- 
fers from molecule to molecule and forms the antigen-bind- 
ing site. (Figures 25-30 and 25-31) 

vascular endothelial growth factor (VEGF) 

Secreted protein that stimulates the growth of blood vessels. 
(Table 15-4, p. 923, and Figure 23-35) 

V(D)J recombination 

Somatic recombination process by which gene segments are 
brought together to form a functional gene for a polypeptide 
chain of an immunoglobulin or T cell receptor. (Figures 
25-36, 25-37, and 25-38) 

vector 

In cell biology, the DNA of an agent (virus or plasmid) used 
to transmit genetic material to a cell or organism. See also 
cloning vector, expression vector. (Figure 8-39) 

vegetal pole 

The end at which most of the yolk is located in an animal egg. 
The end opposite the animal pole. (Figure 22-68) 

VEGF— see vascular endothelial growth factor 

vesicle 

Small, membrane-bounded, organelle in the cytoplasm of a 
eucaryotic cell; often spherical. (Figure 11-35) 

vesicular transport 

Transport of proteins from one cell compartment to another 
by means of membrane-bounded intermediaries such as 
vesicles or organelle fragments. 

V gene segment 

Gene segment encoding most of the variable region of an 
immunoglobulin or T cell receptor polypeptide chain. 

viral envelope 

Phospholipid bilayer, derived from host-cell plasma mem- 
brane, covering an enveloped virus. Encloses the nucleocap- 
sid. (Figure 24-15) 

virion 

A single virus particle. (Figure 24-13) 

virulence gene 

Gene that contributes to an organism's ability to cause dis- 
ease. 

virus 

Particle consisting of nucleic acid (RNA or DNA) enclosed in 
a protein coat and capable of replicating within a host cell 
and spreading from cell to cell. (Figure 24-13) 

voltage-gated cation channel 

Type of ion channel found in the membranes of electrically 
excitable cells (such as nerve, endocrine, egg, and muscle 
cells). Opens in response to a shift in membrane potential 
past a threshold value. 

v-SNARE— see SNARE 

Weel 

Protein kinase that inhibits Cdk activity by phosphorylating 
amino acids in the Cdk active site. Important in regulating 
entry into M phase of the cell cycle. 

Western blotting (immunoblotting) 

Technique by which proteins are separated by electrophore- 
sis and immobilized on a paper sheet and then analyzed, 
usually by means of a labeled antibody. 

white blood cell (leucocyte) 

General name for all the nucleated blood cells lacking hemo- 
globin. Includes lymphocytes, granulocytes, and monocytes. 



wild- type 

Normal, nonmutant form of an organism; the form found in 
nature (in the wild) . 

Wnt (Wnt protein) 

Member of a family of secreted signal proteins that have 
many different roles in controlling cell differentiation, prolif- 
eration, and gene expression in animal embryos and adult 
tissues. 

Wnt signaling pathway 

Signaling pathway activated by binding of a Wnt protein to 
its cell-surface receptors. The pathway has several branches. 
In the major (canonical) branch, activation causes increased 
amounts of p-catenin to enter the nucleus, where it regulates 
the transcription of genes controlling cell differentiation and 
proiferation. Overactivation of the Wnt/(3-catenin pathway 
can lead to cancer. (Figure 15-77) 

X chromosome 

One of the two sex chromosomes of mammals. The cells of 
women contain two X chromosomes, while those of men 
contain only one. 

Xenopus laevis (South African clawed toad) 

Species of frog (not toad) frequently used in studies of early 
vertebrate development. 

XIC — see X-inactivation center 

X-inactivation 

Inactivation of one copy of the X chromosome in the somatic 
cells of female mammals. 

X-inactivation center (XIC) 

Site in an X chromosome at which inactivation is initiated 
and spreads outwards. 

X-ray crystallography (X-ray diffraction) 

Technique for determining the three-dimensional arrange- 
ment of atoms in a molecule based on the diffraction pattern 
of X-rays passing through a crystal of the molecule. (Figure 
8-28) 

Y chromosome 

One of the two sex chromosomes of mammals. The cells of 
men contain one Y and one X chromosome. 

yeast 

Common name for several families of unicellular fungi. 
Includes species used for brewing and bread-making, as well 
as pathogenic species. Among the simplest of eucaryotes. 

yeast two-hybrid system 

Molecular genetic technique for identifying protein-protein 
and protein-DNA interactions. (Figure 8-24) 

yolk 

Nutritional reserves rich in lipids, proteins and polysaccha- 
rides, present in the eggs of many animals. 

Z disc (Z line) 

Platelike region of a muscle sarcomere to which the plus 
ends of actin filaments are attached. Seen as a dark trans- 
verse line in micrographs. 

zinc finger 

DNA-binding structural motif present in many gene regula- 
tory proteins. All zinc finger motifs incorporate one or more 
zinc atoms that help hold the protein conformation together. 

zona pellucida 

Glycoprotein layer on the surface of the unfertilized egg. It is 
often a barrier to fertilization across species. (Figure 21-22) 

zonula adherens — see adhesion belt 

zygote 

Diploid cell produced by fusion of a male and female 

gamete. A fertilized egg. 
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Activity-dependent synapse elimination, 1394 
Adaptation 

bacterial chemotaxis, 944 

calcium ion activated potassium ion channel, 
690 

photoreceptors, 917-918 

receptor down-regulation, 903, 903F 

receptor inactivation, 903F, 920 

receptor sequestration, 903F, 920 

visual transduction, 919 
Adaptin, clathrin-coated vesicles, 756F 
Adaptive immune response see Immune 
response/system, adaptive 
Adaptor proteins, signaling pathways, 899, 927 
ADARs see Adenosine deaminase(s) (ADARs) 
Adenine, 11 6F, 197, 301 F 

base-pairing, 198F 

deamination, 301 F 

RNA structure, 332 
Adenocarcinoma, definition, 1206 
Adenoma, definition, 1206 
Adenomatous polyposis coli see APC (adenomatous 

polyposis coli) gene/protein 
Adenosine deaminase(s) (ADARs), 484, 484F 

reaction rates, 161 F 
Adenosine diphosphate see ADP 
Adenosine triphosphate see ATP 
Adenovirus(es), 1496F 

entry and uncoating, 1506F, 1507 

microtubule-based movement in axons, 1517 
Adherens junction(s), 1135T 

adhesion belt (zona adherens), 1 1 42, 1 1 43F 

cadherins see Cadherin(s) 
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catenin interactions, 1 142 

columnar epithelium, 1 1 34, 1 1 34F 

coordination of cell motility, 1142 

occurrence, 1142 

see also Cell adhesion 
Adhesins 

bacterial adhesion to epithelium, 1 502 

phagocytosis induction by bacteria, 1508, 1509F 
Adhesion belt (zona adherens), 1 142, 1 143F 
Adhesion receptors, lymphocyte recirculation, 

1550-1551 
Adipocytes see Fat cells 
Adjuvant(s), 1539 
ADP (adenosine diphosphate) 

ATP:ADP ratio, 823-824 

D-form Actin, 980 

production from ATP, 80, 80F 
Adrenaline (epinephrine), 907T 

glycogen breakdown, 900 

G-protein-linked receptors, 904 

smooth muscle control, 1030 
Adrenocorticotrophic hormone (ACTH), 803F, 907T 
Aequoria victoria, 592 
Aequorin, intracellular ion concentration, 596-597, 

597F 
Aerobic metabolism, 97-98 

bacterial, 839, 839F 

evolution, 12,27,873-874 
Aeropyrum pernix, genome size, 18T 
Affinity, antibody for antigenic determinant, 1558 
Affinity chromatography, 512,514F 

DNA-binding proteins, 428-429, 429F 

matrices, 534F 

principles, 428-429 

protein-protein interactions, 523 

protein separation/purification, 428-429, 429F, 
513-514, 515F 
see also Protein tags 

protein tagging, 51 5F 
Affinity constant (K a ), 158F 

antigen-antibody binding, 1558 
Affinity maturation, antibody production, 1 562, 

1566,1566 
AflatoxinBI, 1225, 1226F, 1229 
African sleeping sickness, 354 
Agamous Arabidopsis mutant, 1413, 1413F 
Agamous gene, 1 41 4F 
Agarose gel electrophoresis, 534 
Age/aging 

premature, DNA repair defects, 295T 

relation to telomere shortening, 293-294 

see also Cancer 
Aggrecan, 1 1 81 , 1 1 82, 1 1 82F, 1 1 84T 
Agrin,NMJ, 1168 

AIDS (acquired immunodeficiency syndrome), 1477, 
1581 

see also HIV 
AIDS virus see HIV 
Airways, histology, 1434-1436 
Akt (protein kinase B), 934 

in cancer, 1244 
Alanine, structure, 129F 
Albinism (hypopigmentation), 786 
Albumin, gene characterization, 539-540 
Alcohol dehydrogenase, structure, 144F 
Alcohol, production by fermentation, 90, 90F 
Aldehydes, oxidation in glycolysis, 92F 
Aldolase, 120F 
Aldoses, 11 2F 
Alexa dyes, 587, 587F 
Algae, 32 

evolution, 874, 875F 
Allele(s),314,554F,558 

frequency, haplotype blocks, 562F 

of MHC proteins, 1576 

natural selection effects, 561-562, 562F 
Allelic exclusion, 1565 
Allergy/allergic reactions, 1556, 1557, 1557F 
Allografts), 1575 
All-or-none response, 900 
Allosteric cooperative transitions see Allosteric 

regulation 
Allosteric proteins, 171,1 72-1 73 
Allosteric regulation, 171-176 

conformational changes, 171,1 72F 
phosphorylation-mediated, 175 
proton pumps, 835-836, 837F 
ribozymes,404 

cooperative transitions, 173-174, 173F, 174F, 
175F 



see also Cooperativity (protein interactions) 
integrins, 1171 
linkage, 171-172, 172F 

see also Feedback regulation 
Alport syndrome, 1167 
Alternative splicing, 348 
advantages, 348, 479 
Dscam gene, 479F 

gene definition modification, 480-481 
internal splice site, 479F 
intron sequence ambiguity, 480 
mutually exclusive exons, 479F 
neural cell adhesion molecule (NCAM), 

1146-1147 
optional exon, 479F 
optional intron, 479F 
patterns, 479-480, 479F 
post-transcriptional gene regulation, 479-480, 

479F 
regulation, 480, 480F 
splicing signals, 349, 349F 
a-tropomyosin gene, 349F, 480 
Alu elements, 322, 323F 
Alveolar cells, types l/ll, 1434 
Alveoli 

lung, 1434, 1435F 

mammary gland, 1427-1428, 1427F 
Alzheimer's disease, protein aggregates, 397, 1001, 

1021 
a-Amanitin, 385T 
Ames test, 1225, 1225F 
Amide bond, hydrolysis, 161 F 
Amine groups, 107F 
Amino acid(s), 5, 59-60, 125 
aminoacylation, 370, 371 F 

see also Transfer RNA 
analysis, mutation rate data, 264 
charge (pK values), 60F 
coding for, 1 99-200, 200F 

see also Genetic code 
energy sources, 99, 101 

see also Citric acid cycle 
essential, 101, 101F 
families, 128 

acidic, 128, 128F, 129F 
basic, 128, 128F 
nonpolar, 127F, 129F 
polar (uncharged), 127F, 129F 
nitrogen cycle, 100-101 
optical isomers, 128F 
protein folding role, 130, 130F 
protein functional role, 1 52 

side-chain effects, 154-155, 154F 
protein structure role, 125-137 
protein subunits, 59, 126F 
structure, 59-60, 59FJ28F 

side chains, 60F, 125-127 
see also Protein(s); individual amino acids 
Aminoacyl-tRNA-ribosome binding, 375, 375F 
Aminoacyl-tRNA synthetases, 370, 370-371 
accuracy, 371-372, 377-378 
catalytic reaction, 371, 371 F 
proofreading, 370, 372, 372F 
structure, 371-372, 373F 
synthetic ribozymes, 407 
see also Transfer RNA 
Amniotic sac, 1379 
Amoebae, 33F 
cell crawling, 1036 
osmosis regulation, 663 
spread of Legionnaire's disease, 1 508 
AMP (adenosine monophosphate) 
cyclic see Cyclic AMP 
production from ATP, 86, 86F 
AMPA receptor(s), hippocampus, 691 
Amphibians 
embryos 

gastrulation, 1364-1369 
polarity, 1364 

see also Xenopus laevis development 
genome size, 30-31 
lampbrush chromosomes, 234F, 235F 
see also Xenopus; individual species 
Amphipathic molecules 

evolutionary significance, 404-406 
phospholipids, 58-59 
Amphiphilic molecules, 618 
Ampicillin resistance, mobile genetic elements, 

318F 
Amyloid protein, aggregation, 396-397 



Amyloplasts, 841, 841 F 

Amyotrophic lateral sclerosis (ALS), neurofilament 

assembly, 987 
Anabaena, 15F 
Anabolism see Biosynthesis 
Anaerobic organisms, 839, 839F 

evolutionary significance, 12, 870-872 
formic acid oxidation, 871 , 871 F 
glycolysis, 839 

see also Glycolysis 
pathogenic, 1501 
Anaphase 
meiosis 

anaphase 1, 1091, 1091F 
anaphase II, 1091 F 
mitosis, 1054, 1073F, 1089F 
anaphase A, 1089-1090 
anaphase B, 1089-1090 
spindle assembly checkpoint and, 1 088 
Anaphase promoting complex (APC/C), 1064, 1066T 
chromatid separation and completion of mitosis, 

1066, 1087, 1087F,1088F 
Gi phase and, 1100-1101, 11 OOF 
regulated proteolysis, 395, 396F 
regulation, 1064, 1066T, 1087, 1100-1 101 
targets/mechanism of action, 1065F, 1071, 1100 
DNA replication initiation and, 1068 
geminin destruction, 1068-1069 
M-cyclins, 1064 
S-cyclins, 1064 
securin destruction, 1064 
Ancestor tracing, 247, 248F 
Anchorage dependence, 1 1 75-1 1 76, 1 1 75F 
Anchor amino acid, peptide binding to MHC 

protein, 1578-1579, 1579F 
Anchoring junction(s), 1 1 32, 1 1 32F, 1 1 33-1 1 50, 
1133T, 1135T 
see also Cadherin(s); Integrin(s); specific types 
Anchor proteins, cytoskeletal, 1 142 
Anemia, 452, 646 
Aneuploidy 

meiotic nondisjunction, 1278-1279 
polyspermy, 1300 
Angiogenesis, 1446, 1448-1449 
capillary sprouting, 1447, 1447F 
endothelial cells role, 1446, 1447F 
inhibitors, in cancer therapy, 1262 
process, 1446-1447, 1449F 
response to wounding and damage, 1448, 1448F 
tumor growth and, 1220-1221, 1448 
Anion, definition, 47 
Anisomycin, 385T 
Antenna complex, 848, 848F 
Anteroposterior axis development 

Drosophila f 1331-1334, 1331 F, 1332F, 1334F 
role of homeotic selector genes, 1 341 -1 347, 
1342F, 1343F 
vertebrates 

amphibian embryo, 1366 
role of homeotic selector genes, 1 344-1 347, 
1346F, 1347F 
Anthrax, 1493, 1493F 
Antibiotics, 1521-1522 
misuse, 1524 
resistance, 1521-1524 

active pumping, 1 522, 1 523F 
environmental reservoirs, 1523-1524 
enzyme alteration, 1522, 1523F 
horizontal gene transfer, 22 
mobile genetic elements, 318F 
R-gene product, 1 522, 1 523F 
selective pressure, 1 523 
transfer, 1522-1523 
targets/mechanisms of action, 1494F, 1522F 
protein synthesis inhibitors, 384, 384F, 385T 
Antibodies see Immunoglobulin(s) 
Anticodon(s), 6, 7F, 368-369, 368F, 369F, 377 

wobble, 369, 369F 
Antigen(s), 156, 1539, 1557 

monoclonal specificity, 508-509 
Antigen-antibody binding, 156-157, 1553F, 1558F 
affinity constant (K a ), 1558 
determinant number, 1553F 
Antigen-binding site, 1552, 1552F, 1554F, 1559F, 
1561F 
structure, 156, 157F, 1560-1561 
Antigenic determinants (epitopes), 1545, 1558F 
Antigenic variation, 1519-1520, 1519F 
Antigen presentation, 1590-1591 
B7 proteins, 1590 
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CD28, 1590 

cross presentation, 1584 
cytotoxic vs. helperT-cells, 1584-1585 
Antigen-presenting cells, 1 540, 1 542, 1 542F, 
1571-1572, 1571F 
costimulatory molecules, 1 572F 
see also Macrophage(s) 
Antimicrobial drugs, resistance see Antibiotics, 

resistance 
Anti-Mullerian hormone, 1285, 1286F 
Antiporters, 656 
mechanism, 656F 
sodium-calcium exchanger, 660 
sodium-driven chloride-hydrogen carbonate 

exchanger, 657-658 
sodium independent chloride-hydrogen 
carbonate exchanger, 657-658 
sodium-potassium pump (ATPase), 661-663, 

839-840 
sodium-proton exchanger, 657 
see also specific types 
Antirrhinum (snapdragon), mutation transforming 

flower to leafy shoot, 32 
Antisense RNA, dominant negative mutations, 564, 

564F 
Antiserum, 589 
Antral follicles, 1292 

AP2/EREBP gene regulatory proteins, MOOT 
Apaf1,apoptosis, 11 21,11 22F 
APC/C see Anaphase promoting complex (APC/C) 
APC (adenomatous polyposis coli) gene/protein, 
949,950,1252 
colorectal cancer, 1252T, 1253 
Ape(s), evolutionary relationships, 247, 247F, 248F 
AP endonuclease, base excision repair (BER), 298, 

299F 
Apetala2 Arabidopsis mutant, 1413, 1413F 
Apetala2/ethylene-responstve-element binding 

protein, MOOT 
Apetala2 gene, 1414F 
Apetala3 Arabidopsis mutant, 1413, 141 3F 
Apetala3 gene, 1414F 

Apical meristem, 1403, 1408F, 1409-1410, 141 OF 
maintenance by Wuschel and Clavata signaling, 

1409-1410, 1410F 
root, 1163, 1407F 

shoot, 1163, 1401F, 1408FF, 1409-1410, 141 1F 
Aplysia, cell movement, 1040 
APOBEC, 1535 

Apoptosis (programmed cell death), 1115-11 29, 
1116F 
activation/induction 

abnormal mitogenic stimulation, 1107-1108, 

1108F 
Apaf-1,1121,1122F 
DNA damage, 1106 
Fas ligand/Fas receptor, 1120, 1571, 1573F, 

1594 
procaspases, 1 1 1 8, 1 1 1 9F, 1 1 20F, 1 1 21 
associated diseases, 1 1 27 

cancer and, 1127, 1215-1216, 1216F, 1245-1246 
cell elimination, 1115-1117 
cell recognition, biochemical, 1117-1118 
cell signaling, 884, 885F 
cellular changes, 1115 

plasma membrane, 1117 
development, 1126 
C.elegans, 1327 
mouse paws, 1 1 1 6, 1 1 1 6F 
tadpole metamorphosis, 1 1 1 6, 1 1 1 7F 
DNA damage, 1117, 1118F 
extrinsic pathway, 1120-1121 
hemopoiesis, 1462 

intrinsic pathway, 1121, 1124F, 1125F 
IRES-mediated production of cell-death proteins, 

491 
macrophage scavenging, 787 
in mammary gland, 1428 
membrane potential, 1118 
mitochondrial proteins, 856 
neurons, 1389-1390 
phagocytosis, 787, 1115 
phosphatidylserine, 627 
proteolytic cascade, caspases see Caspase(s) 
regulation 

anti-apoptotic proteins, 1 1 22-1 1 23 
Bcl2 proteins, 1121-1124 
IAP proteins, 1124-1125 
pro-apoptotic proteins, 1 1 22-1 1 23 
Appendages, development, 1347-1363 



fine-grained patterning, 1356-1358, 1362 
insects, 1347-1355, 1356-1362 
vertebrates, 1355-1356 
see also specific appendages 
Apterous gene, 1352, 1353 
Apurinic sites, 299F 
Apyrimidinic sites, 299F 
Aquaporin channel(s), 673-675, 673F 
mode of action, 674, 675 
structure, 633F, 674, 674F 
Aquifex 

genome size, 18T 
in tree of life, 16F 
Arabidopsis thaliana (wali cress), 36f, 39F, 1 398FF 
embryogenesis, 1400, 1402, 1402F 
flowers, 141 2F 

homeotic mutations, 1 41 3F 
gene regulatory proteins in, MOOT 
genetic screen, 1399F, 1402 
genome, 142, 147, 569, 1399-1400 

size, 18T 
homeotic selector gene expression in flower, 

1414F 
model plant, 36, 37-39, 568-569, 1398-1399 
mutant seedlings, 1402F 
transgenic, 569 
Archaea (archaebacteria), 16, 16F, 17, 18T 
bacteriorhodopsin, 640-642 
membrane composition, 624 
Archaeogiobus fulgidus, genome size, 18T 
Architectural protein(s), 445-446, 447F 

see also DNA bending 
ARFprotein(s), 759, 1107 
Arginine, structure, 128F 
Argonaute protein, 494, 494F, 496 
ARP (actin-related protein), 994F, 997, 1022 
ARP complex (ARP 2/3 complex), 997, 997F 
in lamellipodia, 1038, 1038F 
pathogen movement, 1516 
Arrestin, 919, 920-921, 920F 
ARS (autonomously replicating sequence), 286, 

287F 
Artery, 1445F 

Arthritis, chronic infections, 1500 
Arthropods 

ECM polysaccharides, 1 180 
infections transmitted by, 1501-1502 
see also individual species 
Artificial chromosomes, DNA cloning vectors, 541, 

542F 
Ascaris, 1487, 1489F 

Ascaris lumbricoides, 1 489 
Asexual reproduction, 1269, 1269F 
Ash1 gene regulatory protein, 1023, 1023F 
A-site (ribosome binding), 375F, 376 
Asparagine-linked oligosaccharide(s), protein 

glycosylation, 737, 737F, 747F 
Asparagine, structure, 129F 
Aspartate (aspartic acid), structure, 129F 
Aspartate transcarbamoylase 

allosteric cooperative transitions, 1 73-1 74, 1 74F 
catalytic mechanism, 173-174, 175F 
structure, 144F, 173, 174F 
Aspartic acid, structure, 129F 
Association constant (K ), 158F 
Association rate, 158F, 526 
Associative learning, 1396-1397 
Asthma, 1500, 1534F 
Astral microtubules, 1035F, 1076, 1076F 
Astral relaxation model, 1096-1097, 1096F 
Astral stimulation model, 1096, 1096F 
Asymmetrical division, oocytes, 1 289, 1 289F 
Asymmetric cell division, 1099, 1099F 
oocytes, 1289-1290, 1289F 
stem cells, 1421, 1421F 
AT-AC spliceosome, 353-354, 354F 
Ataxia telangiectasia (AT), 295T, 304 

mutation, 295T, 304, 1106 
Atherosclerosis, 791, 1500 
Atherosclerotic plaques, 791, 1500 
ATM protein kinase 
defects, 295T, 304 
DNA damage and, 1105 
mutation, 1106 
Atohl gene, 1432 

Atomic force microscopy (AFM), 600, 601 F 
Atomic modeling, proteins, 527-531 
Atomic nuclei, NMR, 529 
Atomic radius, 51 
Atomic weight, definition, 45 



Atoms, 45-48, 46, 47F, 48F 
atomic interactions, 46-48 
atomic radius, 51 
atomic structure, 45-46, 46F 

electrons see Electron(s) 
space-filling models, 51 
valences, 47 
ATP (adenosine triphosphate), 8-9 
abundance, 81 
biosynthetic role, 81,81 F 
chemical structure, 61 F, 1 1 7F 
energy carrier, 61, 62F, 78, 80-81 
historical aspects, 90 
hydrolysis see ATP hydrolysis 
radiolabeling, 603F 
synthesis see ATP synthesis 
transport, 822 

utilization measurement, 168 
ATP:ADP ratio, 823-824 
ATPases (ion transport), 659-660, 665 

see also specific pumps/proteins 
ATP caps, actin filaments, 979, 979F 
ATP hydrolysis, 80F 
alternative route, 86F 
aminoacyl-tRNA synthetases, 371 
ATP synthase, 826, 826F 
carbon fixation, 845, 845F 
chromosome condensation, 243 
coupled to DNA replication, 273, 273F 
coupling to biosynthetic reactions, 81, 81 F, 
824-825 
nucleic acids, 86, 87F 
energetics, 81, 86, 86F, 824-825, 825F 
glutamine synthase, 81 F 
mitochondrial protein import, 716-717 
nucleosome sliding, 215-216 
sperm motility, 1293 
spliceosomes, 351-352 
ATP-mediated phosphorylation, 84F 
ATP synthase, 660, 813, 81 4F 
bacterial, 839 

chloroplast localization, 855F 
mechanism of action, 822-823 
mitochondrial localization, 821, 855F 
oxidative phosphorylation, 821-822, 821 F 
reversibility, 826-827, 826F 
bacteria, 839 
energetics, 826-827 
sodium-driven, 839-840 
structure, 822F 

see also ATP hydrolysis; ATP synthesis 
ATP synthesis, 80 

electron transport-driven, 1 0OF, 817-819, 81 9F 
proton gradients, 821-823, 821 F 
uncoupling, 836 

see also Electrochemical proton gradients; 
Electron transport chain(s) 
energetics, 80 
evolution, 870-876 
anaerobes, 871-872 
cyanobacteria, 872-875 
fermentation, 870-871 
glycolytic pathway, 91 , 92F, 93F, 1 21 F 

see also Glycolysis 
mitochondrial see Oxidative phosphorylation 
photosynthetic, 94-95, 95F, 850-853 
cyclic photophosphorylation, 853 
noncyclic photophosphorylation, 850-853, 
852FF 
see also ATP synthase; Glycolysis 
ATR protein kinase, DNA damage and, 1 1 05 
Attenuation, bacterial gene regulation, 477-478 
Atypical protein kinase C (aPKC), epithelial apico- 

basal polarity, 1156 
Auditory hair cells, 1429, 1430-1432, 1431 F, 

1431FF 
Aurora B kinase, 1084 
Aurora kinases, 1074 
Autimmune regulator (AIRE) protein, 

immunological tolerance, 1588 
Autocatalysis, origin of life, 7F, 401 
Autocrine signaling, 881 
Autoimmune disease, 1539, 1549 
Automated DNA sequencing, 550, 550F 
Autonomously replicating sequence (ARS), 286, 

287F 
Autophagosome(s), 782-783, 784F 
Autophagy, 782-783, 783F, 784F 

apoptosis, 1126 
Autophosphorylation, 915, 91 5F, 922 
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Autoradiography, 601, 602-603, 603F 

electron microscopy, 603F 

eucaryotic replication forks, 282-283, 284F 

2D-PAGE, 522F 

3 H-thymidine, 282-283, 284F 

3 H-uridine,603 
Autosome(s), 473, 554F, 1271 
Auxin, 959, 1403-1404, 1408-1409 

indole-3-acetic acid, 959F, 1406F 

transport in plants, 959-961 
Avogadro's number, 45-46 
Axil, 1410 

Axillary bud, 1410, 1411F 
Axin, 949 
Axon(s),680F, 1047 

action potential propagation, 677F 
see also Action potential(s) 

cytoskeleton, 1047-1050 
see also Axonal transport 

development, 1386-1387, 1387F 
differentiation from dendrites, 1386 
guidance mechanisms, 1140, 1147, 1177, 

1387-1389, 1388F 
retinotectal projection, 1391-1393 

microtubule orientation, 1047 

myelin sheath, 678, 680F 

neurosignaling, 675 

see a/so Dendrite(s) 
Axonal transport 

anterograde vs. retrograde, 1048 

herpesviruses, 1516-1517, 1516F 
Axonemal dyneins, 1015, 1033, 1033F 
Axoneme, sperm, 1293, 1293F 
Axon hillock(s), ion channels, 689 
Azide, cytochrome oxidase binding, 834 
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B1 elements, 323F 
B7 proteins, 1592T, 1594,1 595F 
antigen presentation, 1590 
CTLA4 binding, 1591-1592 
T-cell activation, 1 591 F 
Bacilli, 1490F 
Bacillus, \&, 18T 
Bacillus anthracis, 1493 
Bacillus subtilis, 18T,20F 
BACs {bacterial artificial chromosomes), DNA 

cloning vectors, 541, 552 
Bacteria, 16, 16F, 1488, 1489-1490 
antibiotic action and targets, 1494F 
archaea and eucaryotes vs., 16, 17F 
bacteriophage adhesion, 1 505 
chemical composition, 55T, 63T 
chemiosmotic coupling, 839-840, 839F 
chromosomes, 202, 1491 F 

origin of replication, 282, 283F 
diversity, 15 
DNA replication, 280, 282, 283F 

see also DNA replication 
DNA sequences as immunostimulants, 1 527 
E. coli as model, 24-25, 25F 
electron transport, 873F 
evolution, 871-872 
see also Electron transport chain(s) 
energy sources, 839 
entry into host cells by phagocytosis, 

1507-1508 
epigenetics, 472 
gene expression, 345F 

see also Transcription 
gene regulation see Bacterial gene expression 

regulation 
genomes 

sequenced, 18T 
size, 17, 18T, 1488 
Gram-negative, 1490F, 1527 
Gram-positive, 1490F, 1527 
intracellular movement, 1 514-1 51 7 
invasion of host cells, 1 507-1 508, 1 509F 
lithotrophic, at hydrothermal vents, 12 
mRNA, 346F,380,381F 
normal flora, 1486, 1501 
pathogenic, 1489-1494 
facultative pathogens, 1490 
genome organization, 1492F 
invasion, 1507-1511 
nonpathogens vs., 1492F 
obligate pathogens, 1490-1491 



opportunistic pathogens, 1490 
phase variation, DNA rearrangement, 454-455 
photosynthetic, 840, 863, 872-875 

see also Cyanobacteria; Photosynthesis; Purple 
bacteria 
porin insertion, 717, 71 7F 
protein synthesis, 345 F, 380 
inhibitors, 384, 385T 

see also Antibiotics 
ribosomes, 612F 

see also Protein synthesis; Translation 
riboswitches, 478-479 
structure(s), 1490F 
capsules, 1532 

cell-surface projections, 1490F 
cell wall, 14, 1490F, 1527 
flagella see Flagella 
outer membrane, 1490F 
pili (fimbriae), 1490F 
plasma membrane, 839-840, 1490F 
ribosomes, 612F 
shapes, 14F, 1490F 
sizes, 14F, 17 
toxins see Bacterial toxins 
transcription see under Transcription 
translation quality control, 387, 387F 
virulence genes and factors, 1491 
viruses of see Bacteriophage(s) 
see also specific organisms 
Bacterial artificial chromosomes (BACs), 541, 552 
Bacterial gene expression regulation, 336-338, 
339F, 345F 
gene regulatory proteins, 416 
DNA sequence recognition, 41 8T 
repressors, 416 

see also Genetic switches; Repressor protein(s) 
operons, 433F, 462-463 

see also Lac operon (Escherichia coli) 
transcription attenuation, 477-478 
see also Transcription 
Bacterial toxins, 1492 
Bacillus anthracis, 1493 
cholera, 1492, 1504-1505 
diarrhea due to, 1491 
edema toxin (anthrax), 1493 
lethal toxin (anthrax), 1493 
mechanism of action, 1493-1494 

cAMP overproduction due to, 1503 
type III secretion system for, 1493, 1494F, 1504 
type IV secretion system for, 1493-1494 
Bacteriophage^), 21, 22F, 1492, 1492F 
bacterial adhesion, 1505 
genome integration, 324, 325F 

see also Transposition 
virulence gene transfer, 1491-1492 
see also individual viruses 
Bacteriophage lambda 
bacterial adhesion, 1505 
conservative site-specific recombination, 324, 
325F 
excisionase, 326 
integrase, 294, 324, 325F 
gene regulation, 457-458, 458F 
compression, 457, 458F 
Cro repressor, 421 F, 457-458, 458F 
dimerization, 142, 142F 
sequence recognition, 421 F 
lambda repressor, 41 6, 41 8T, 457, 458F 
see also Cro repressor; Lambda repressor 
life cycle, 325F 
excision, 324 
heritable state, 458, 458F 
integration, 324, 325F 
lytic state, 458, 458F 
prophage, 457, 458F 
transcription circuits, 459 
Bacteriophage T4, 22F 
Bacteriorhodopsin, 640-642, 640F, 641 F 
Bacteroides, 1501 

BAD protein, PI 3-kinase signaling, 934, 934F 
Bak, apoptosis, 1122 
Balbiani rings, mRNA export, 359, 359F 
Ball-and-chain model, voltage-gated potassium ion 

channels inactivation, 678F 
Barr body, X-inactivation, 473 
Basal cell carcinoma, 951 
Basal cells 

epidermis, 1419, 1419F 
olfactory epithelia, 1429F, 1430 
Basal lamina, 1131, 11 32F, 1164-1169, 1419F 



distribution/occurrence, 1 1 64-1 1 65, 1 1 64F 
epidermal stem-cell maintenance, 1421, 1426 
functional roles, 1164 
barrier functions, 1168 
diversity, 1167-1169 

kidney glomerulus, 1 1 64-1 1 65, 1 1 64F, 1 1 67 
mechanical, 1 165 

regeneration and, 1 1 68-1 1 69, 1 1 68F 
mammary gland, 1428 
structure/composition, 1165, 1165F, 1166F, 
1167F 
laminins, 1165-1166 
type IV collagen, 1 165, 1 166-1 167 
synthesis, 1165 
Base(s) (chemical), 53, 109F 

measurement see PH 
Base(s) (nucleic acid) 

DNA, 197-199, 198F, 198FF 
damage, 296, 296F 
natural, 301 F 

tautomers, 268-269, 270F 
unnatural, 301 F 
nucleotide structure, 61, 116F 
pairing see Base-pairing 
purine see Purine base(s) 
pyrimidine, 61, 116F 
RNA, 300 
Base excision repair (BER), 297-298, 299F 
Base-flipping, 298, 300F, 306, 306F 
Basement membrane see Basal lamina 
Base-pairing (nucleotides) 

DNA structure and, 197-199, 198F, 198FF 

see also DNA structure 
DNA synthesis and, 266, 266F, 268-269 

see also DNA replication; DNA synthesis 
origin of life, 401 
RNAand,332,333F,334F 
splicing mechanism, 351 
see also Ribosomal RNA; RNA structure 
unusual, 269, 332 

RNA folding, 333F 
wobble base-pairing, 369F 
Base substitution mutation(s), 246 
Basic helix-loop-helix (bHLH) proteins, 1 356, 1 362, 

MOOT 
Basophils, 1452F, 1556, 1557F 
Bax, apoptosis, 1122 
Bcell(s), 1540 

activation, 1543-1544, 1543F, 1552, 1554F, 
1568, 1568F, 1596F 
antibody gene-pool selection, 1565-1566, 

1566F 
B-cell receptor, 1595, 1596F 
class switching, 482-483 
helper T-cells (T H ), 1574-1575, 1597-1598, 
1597F 
antibody synthesis, 1552 

post-transcriptional control, 482-483 
B1 cells, 1599 
B2 cells, 1599 
B-cell receptor, 1416F, 1595-1596, 1596F 

T-cell receptor vs., 1595-1596 
co-receptors, 1594-1595, 1596F 
development, 1543-1544, 1543F, 1552, 
1553-1554, 1554F 
antibody gene-pool selection, 1565-1566, 

1566F 
class switching, 482-483, 1 568F 
see also Immunoglobulin(s) 
hybridoma production, 508-509, 509F 
immunological synapse, 1597T 
memory, 1 546 
monospecificity, 1565-1566 
recirculation, lymph node lymphoid follicles, 

1550 
regulation, Fc receptor signaling, 1596 
resting, 1544F 

signaling events, 1416F, 1596F 
Bci2 proteins, 1121-1124, 1123F 

in cancer, 1127, 1245 
BcI-Xl protein, membrane integrity, 1122-1123 
Bcr-Abl protein, 1261 
Gleevecand, 1262F 
Bcr gene, chronic myelogenous leukemia, 1 261 , 

1261F 
"Beads on a string" chromatin, 21 1, 21 1 F, 21 2F 
BEAF insulator-binding protein, 453F 
Beggiatoa, 15F 

Behavioral mutants, 556, 557F 
Benzopyrene, role in cancer, 1225 
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Beta-barrel proteins, as transmembrane proteins, 
632, 635F 
see also specific proteins 
Beta-cells (insulin-secreting), 1444, 1444F 
Betaglycan, 1184, 1184T 
Beta-sheet motif, 131, 134F, 135, 135F 
DNA-binding,422,423F 
see also Protein structure 
BH3-only proteins, apoptosis, 1 123-1 124 
BH1 23 proteins, apoptosis, 1 1 22, 1 1 23F 
Bicoid protein, 1333, 1333F, 1334, 1334F 
Drosophila development, 1022-1023 
eve gene activator, 448-449, 449F 
mRNA localization, 486-487, 1022-1023 
Bid, apoptosis, 1124 
Bidirectional signaling, Eph receptors and ephrins, 

922 
Binocularly driven cells, in visual cortex, 1 395-1 396 
Biological order, 65-87 

structural, 64F, 65F 

thermodynamics, 66-68 
entropy, 67, 67F 
see also Thermodynamics 
Bioluminescence, 596-597 
Biosynthesis, 66 

activated carriers, 79F, 81, 81 F 
see also specific carriers 

biological order, 65-87 

catabolism vs., 67F 

cholesterol, 83F 

feedback inhibition, 1 70, 1 70F 

macromolecules, 63 

energy requirements, 84-87 

see also specific pathways 
Biosynthetic-secretory pathway(s), 749-787, 750F, 
800 

pH control, 770, 770F 

see also Exocytosis; Protein transport 
Biotin,84F, 167, 167T 
BiP, 736, 736, 767 

Birthdays, neuronal, 1385-1386, 1386F 
1,3-Bisphosphoglycerate, 121 F 
Bithorax complex, 1342, 1342F 
Bivalent formation, 1274, 1274F, 1276F 
Black membranes, 622, 622F 
Bladder 

cancer, 121 OF, 1228, 1229 

infections, uropathogenic E. coll, 1502, 1503F 
Blastocyst, 1379-1380, 1379FF, 1381F 
Blastoderm, 1330-1331, 1330F 
Blastomere, 1365, 1365F, 1379F 
Blastopore, 1366, 1367-1368, 1367F 
BLAST sequence alignment, protein analysis, 531, 

531F 
Blastula, 1363F, 1365 
Blie, 1443 

Blood cells, 1451-1453, 1451 F, 1452-1453, 1452F, 
1453T 

formation see Hemopoiesis 

functions, 1453T 

see also individual types 
Blood coagulation 

anti-clotting agents, disintegrins, 1 193 

ECM degradation, 1194 
Blood sinuses, in bone marrow, 1454 
Blood vessels, 1445-1450, 1445F, 1446F 

branching mechanism, 1448 

development, 1446 

endothelial cells see Endothelial cells 

formation and sprouting see Angiogenesis 

see also specific types 
Bloom syndrome, DNA repair disorders, 295T 
Blotting 

membranes, 538 

Northern blotting, 538-539, 539F 

Southern blotting, 539-540, 539F 

western blotting, 51 9F 

see also DNA hybridization 
B lymphocyte leukemia, origin, 1219 
B lymphocytes see B cell(s) 
BMP (bone morphogenetic protein) family, 1336, 

1441 
Bone(s), 1469-1474 

canaliculi, 1470, 1470F 

cartilage replacement by, 1470-1471, 1471 F 

cells, 1467, 1470F 

see also Osteoblast(s); Osteoclast(s) 

composition, 1469-1470 

development, 1470-1472 

cartilage model, 1470-1471, 1471F 



ECM production, 1179 

fibroblasts transformation to, 1468, 1469-1471 

formation, endochondral, 1470 

joints, 1471-1472 

lacunae, 1470 

long, development, 1470-1471, 1471 F 

matrix 

erosion by osteoclasts, 1471 
secretion by osteoblasts, 1469-1470 
as source of signal proteins, 1471 
uncalcified (osteoid), 1469-1470, 1470F 

remodeling, 1472-1473, 1473F 
ECM degradation, 1193 
stress response, 1474 

repair, 1474 

structure, 1470F 
Bone marrow 

apoptosis, 1117 

blood cell production, 1453-1456, 1453F, 1541F 
control, 1459 
see also Hemopoiesis 

blood sinuses, 1454 

histology, 1453-1462, 1455F 

stromal cells, role, 1458-1459 

transfusion, 1456, 1456F 

transplantation, 1456, 1456F 

X-ray effect on, 1 455-1 456, 1 456F 
Bone morphogenetic proteins (BMPs), 1336, 1441 
Bordetella pertussis, 1 503, 1 504 
Borrelia burgdorferi, chronic illness, 1500 
Boundary elements see Insulator elements 
Bovine spongiform encephalopathy (BSE), 397-398, 
1498 

see also Prion proteins (PrP) 
Brachyury ("short tail") protein, 1370 
Brain, sensory maps, 1 391 -1 395, 1 392FF 
Branchial arches, 1345 

Branching morphogenesis, 1381-1382, 1382F 
Branchless gene, 1382 
Branch migration, 307F, 308, 308F, 312F 
Branch point-binding protein (BBP), 350F 
Brassinosteroids, 957, 1403 
Brat protein, 1361 
Brca1/Brac2 proteins, 310 
BrdU (bromodeoxyuridine) 

cell cycle analysis, 285, 285F, 1059, 1059F 

epidermal cell division, 1422 

muscle cell division, 1425 
Breast cancer 

Brca1/Brac2 proteins, 310 

chromosome abnormalities, 1215F 

DNA repair defects, 295T 

incidence dependent on reproductive history, 
1229, 1229F 

typical growth, 1208F 
Breathless gene, 1382 
BRI1 receptor kinase, 957 
Bride-of-sevenless mutant, Drosophila, 927 
Brightfield microscopy, 583 
Bromodeoxyuridine see BrdU (bromodeoxyuridine) 
Bronchiole, 1435F 
Brown fat, 838 
Brown, Louise, 1301 

46BR patient, DNA repair disorders, 295T 
Brush-border, 806, 1437F, 1438F 
Bsl transposon, 318T 
Budding yeast(s), 34F 

alternative splicing, 479-480 

cell type specification, 455-457, 456F, 457F 
see also Mating-type switching 

E. coli vs., 34 

genome, 18T, 34 

haploid vs. diploid state, 34 

as model eucaryote, 33-34 

replication origins, 286, 287F 

reproductive cycles, 41 3F 

see also Saccharomyces cerevisiae 
Bundle-sheath cells, 846 
a-Bungarotoxin, effect on acetylcholine receptors, 

685 
Burkitt's lymphoma 

translocation activating Myc gene, 1239 

viral causation, 1228T, 1499 
BY2 immortal cell lines, cell culture micrograph, 

504F 
BZIP gene regulatory proteins, family members in 
different eucaryotes, MOOT 



C2C2 (Zn finger)/GATA gene regulatory proteins, 

MOOT 
C2H2 (Zn finger) gene regulatory proteins, MOOT 
C 3 plants, 846-847, 847F 
C 4 plants, 846-847, 847F 
C6 (Zn finger) gene regulatory proteins, MOOT 
Ca 2 + ATPase see Calcium pump (Ca 2+ -ATPase) 
Ca 2+ /calmodulin see under Calmodulin 
Ca 2+ pump see Calcium pump (Ca 2+ -ATPase) 
Cadherin(s), 1133-1150 
bacterial invasion and, 1508 
p-cateninand, 949, 1142 
Ca 2+ -mediated cell-cell adhesion, 1135-1136, 

1137-1138 
classical cadherins, 1 1 36, 1 1 38T, 1 1 77T 
cadherin domain repeats, 1 1 37 
catenin link to actin cytoskeleton, 1142, 

1142F 
E-cadherin, 11 36, 1 1 38T, 1141,1 249-1 250 
N-cadherin, 1 1 36, 1 1 38T, 1 388 
P-cadherin, 1136, 1138T 
signaling, 1145 
structure, 1139F 
VE-cadherin, 1138T, 1145 
developmental expression 

embryo compaction, 1 1 36, 1 1 36F 
neural development, 1140, 1140F, 1141 F, 1388 
selective adhesion, 1139-1140, 1139F, 1140F 
selective assortment, 1140-1141, 1141F, 
1142F 
distribution, 1135, 1136, 1136F 
diversity, 1136-1137, 1137F, 1138T 
family members, 1138T 
genes, in neuronal guidance, 1388 
homophilic adhesion, 1137-1139, 1138F 
binding affinity, 1138 
tissue segregation and, 1 140-1 141 
"Velcro principle," 1 1 38-1 1 39, 1 1 39F 
nervous tissue, 1 1 36, 1 1 36F, 1 388 
nonclassical cadherins, 1 1 36-1 1 37, 1 1 38T, 1 1 77T 
cadherin 23, 1138T 
cadherin domain repeats, 1 1 37 
desmocollins, 1136, 1138T 
desmogelins, 1136, 1138T 
Fat proteins, 1137, 1138T 
Flamingo protein, 1 1 37, 1 1 38T, 1 145 
protocadherins, 1136, 1138T 
signaling, 1145 

T-cadherin, 1136-1137, 11 38T 
signaling functions, 1136-1137, 1145 
structure, 1 1 36-1 1 37, 1 1 37F, 1 1 39F 
cadherin domain motif, 1 1 37 
conformational changes, 1138 
intracellular domains, 1142 
synapse formation and, 1 148 
see also specific types 
Caenorhabditis eiegans, 37f, 39F, 1321-1328, 1322F 
adult anatomy, 1321 
apoptosis, ICE, 1118 
cell numbers, 1321 
development see Caenorhabditis eiegans 

development 
gene regulatory proteins, MOOT 
genes 

dosage compensation, 475, 476F 
for multicellular development, 1 323 
see also under Caenorhabditis eiegans 
development 
genome, 18T, 20T, 552 
hermaphrodites, 1321 
microtubules, 982F 
as model organism, 36-37, 39 
molt cycle, 1322 
mutants, 557F, 564F 
protein interaction maps, 188 
RNA interference (RNAi), 571, 571 F 
sex determination, 1282F, 1286 
sexual reproduction, 1321 
size, 1 489 
Caenorhabditis eiegans development, 1 321 -1 328 
ABa cell, 1324, 1324F, 1324FF 
ABp cell, 1324, 1324F, 1324FF 
asymmetric cell division, 1323-1324, 1323F 
cell death, 1327-1328, 1327F 
cell fate, 1322 

cell lineage, 1321, 1322, 1322F, 1324 
cell patterning, 1324 
Ecell, 1324, 1324FF 
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egg cell, 1323 

EMS cell, 1324,1 324F,1324FF 

epithelial apico-basal polarity mechanisms, 

1155-1156 
founder cells, 1322 
gene regulatory proteins, 1400T 
genes, 1323 

heterochronic genes and timing, 1 326-1 327, 

1326F 
maternal effect, 1323-1324 
mom (more mesoderm), 1 325 
pop (plenty of pharynx), 1325 
see a/so individual genes 
germ cells, 1322-1 323, 1323 
gut, 1322F 

inductive cell-cell interactions, 1324, 1324FF 
MS cells, 1324, 1324FF 
P 2 cell, 1324,1 324FF 
P granules, 1323-1324, 1323F 
polarizing signal, 1323 
pop- 1,1325 
precision, 1322 

sex determination, 1282F, 1286 
Vasa protein, 1324 

Wnt signaling pathway, 1324, 1324F, 1325 
Caged molecules, 594-595, 595F, 596F 
Cajal bodies, 241 , 241 F, 363-364, 365F, 594F 
Calcium-activated potassium channel(s), 690 
Calcium/calcium ion <Ca 2+ ) 

cadherin-mediated cell-cell adhesion, 

1135-1136 
cytosolic concentrations, 660 
egg activation, 1299 
gap junction permeability, 1 1 61 -1 1 62 
intracellular measurement, 596-597, 597F 
as intracellular messenger, 912-916 
regulation of concentration in cytosol, 912-913 
regulation of cytosol concentration, 912-913 
signaling see Calcium signaling 
storage, sarcoplasmic reticulum, 725-726 
transmitter release at synapses, 804, 808, 
912-913 
Calcium channel(s), 91 2, 913 

IP 3 -gated Ca 2+ -release channels, 910, 91 1 
ryanodine receptors, 91 2 
Calcium pump (Ca 2+ -ATPase), 660-661 
autophosphorylation, 660 
mechanism, 661 F 
muscle contraction, 660 
structure, 660-661, 661 F 
Calcium-sensitive fluorescent indicators, 91 2, 91 2F 
Calcium signaling, 912-916 
Ca 2+ -induced Ca 2+ release, 914 
Ca 2+ oscillations, 912-914, 91 3F 
Ca 2+ spikes, 913, 91 3F 
Ca 2+ wave, fertilization, 91 2, 91 2F, 1 299 
calcium release, 910, 913 
calmodulin and, 914-916, 914F, 915F, 1030 
entry into cytosol, 910-912 

see also Calcium channel(s) 
IP 3 -mediated, 910, 91 OF 

see also Inositol phospholipid(s) 
mechanisms keeping cytosolic Ca 2+ levels low, 

912, 912F 
oscillation frequency, 912-914 
PKCand,911 
at synapses, 912 

ubiquitous intracellular messenger, 912 
Calcium wave, egg cells, 912, 91 2F, 1299 
Callus formation, plant cells, 504-505, 568 
Calmodulin, 914-916 

action on CaM-kinase II, 915, 91 5F 
Bordetella adenylyl cyclase binding, 1503 
Ca 2+ binding, 1030 
Ca 2+ /calmodulin, 914 
structure, 144F 
Calnexin, as chaperone protein, 738-739, 767 
Calreticulin, protein chaperone function, 738-739 
Calvin cycle see Carbon-fixation cycle 
CaM-kinase (Ca 2+ /calmodulin-dependent protein 

kinase), 915-916, 91 5F, 91 6F 
CAMP see Cyclic AMP 
Canaliculi, 1442 
Cancer, 1205-1267 

angiogenesis in cornea, 1448 
apoptosis, 1127 

benign vs. malignant, 1206, 1207F 
causes/pathogenesis, 1208 
carcinogens see Carcinogens 
DNA repair defects, 277-278, 295-296, 295T, 



1216-1217 

see also DNA repair 

environmental factors, 1224, 1224F 

epigenetic change, 1208, 1213 

genes see Cancer-critical genes 

preventable, 1224-1230 

somatic mutations, 265, 1208, 1209 

see also Oncogene(s); Tumor suppressor 
genes (TSGs) 
cells see Cancer cells 
clonality 

clonal evolution, 1212-1213, 1212F, 1227 

origin from single mutant cell, 1207-1208, 
1209F 

see also Cancer cells 
ECM degradation, 11 94, 1 1 94F 
epidemiology, 265, 1 229-1 230 
epigenetic vs. genetic change, 1 214F 
epithelial-mesenchymal transitions, 1141 
growth/progression, 1210 

cell proliferation, 1107-1108 

contributing properties, 1223 

defective cell death, 1215-1216 

defective cell differentiation, 1215-1216 

ECM degradation, 1193 

invasiveness, 1220 
see also Metastases 

metastasis see Metastases 

natural selection, 1212-1213 

oxygen as limiting factor, 1 220 
growth rate, 1212 
incidence 

environmental factors, 1224, 1224F 

as function of age, 1 209, 1 209F 

Mormons, 1225 

USA, 1207F, 1229F 

variation between countries, 1228T 
initiation, 1226, 1227 

as a microevolutionary process, 1205-1224 
mortality, USA, 1229F 
nomenclature, 1206 
prevention, 1224-1230 
primary, 1207 
progression, 1210 

colorectal cancer, 1250-1255, 1255F 

genetic instability effect, 1214-1215 

microenvironment, 1222, 1222F 

natural selection, 1212-1213 
propagation, stem cell role, 1218 
recurrence, 1219 
stem cells, 1217-1218 

origin, 1218-1219 

properties, 1219 
treatment, 1256-1265 

antibodies targeting oncogenic proteins, 1260 

evolution of resistance to therapy, 1 259 

new therapies, 1 257-1 360 

sensitivity, 1219 

taxol, 988 

traditional therapies, 1257 
see also Mutation(s); specific cancers 
Cancer cells, 1241-1255 

blood supply, dependence, 1262 

cell crawling, 1008 

cell division rate, 1209 

cell lines, 506T 

colonization without restraint, 1206-1207 

epigenetic changes, DNA methylation, 1213 

fluorescence-activated cell sorter, 1218, 121 8F 

heritable properties, 1206 

loss of contact inhibition, 1 233F 

molecular characterization, DNA microarrays, 

412,414F 
reproduction without restraint, 1206-1207 
stem cells, 1217-1219 
telomerase activity, 1217 
Cancer-critical genes, 1230-1241 

analysis in developing embryos, 1241-1242 
analysis in transgenic mice, 1241-1242 
cell proliferation regulation, 1242-1244 
identification, 1230-1241 

cell transformation assay, 1232-1233 

chromosome abnormalities, 1231-1 232 

dominant vs recessive (oncogene vs tumor 
suppressor gene), 1232F 

hereditary cancer syndromes, 1 234-1 235 

loss of heterozygosity, 1 236 

Ras gene, 1233-1234 

Rb gene, 1234-1235, 1336F 

retroviruses, 1233 



p53 gene/protein and, 1 106 

Rb gene/protein, 1 1 04-1 1 05, 1 234-1 235, 1 344F 

varied types of mutation, 1 234-1 236 

see also Mutation(s); Oncogene(s); Tumor 

suppressor genes (TSGs); individual 
genes 
Cancer syndromes, hereditary, 1234-1235 
Canine parvovirus, microtubule-based movement 

in axons, 1517 
Capacitation, 1294, 1297 
Cap-binding complex (CBC), 346-347 
Capillaries, 1446-1447, 1446F 

response to wounding, 1448, 1448F 
see also Angiogenesis 
Capping, of actin filaments see Actin/actin 

filaments 
5' Capping of eucaryotic mRNA, 344, 346-347, 
346F 
cap-binding complex, 346-347 
decapping, 492 
functions, 346-347 
guanyl transferase reaction, 346 
methyl transferase reaction, 346 
phosphatase reaction, 346 
pre-mRNA ordering, 352 
reaction, 346, 347F 
Capping protein, actin filament, 1003 
Capsid(s) (viral), 148, 148F, 149F, 150F, 1489F, 1497, 

1513 
Cap snatching, 1517 
Capsules, phagocytosis resistance mechanisms, 

1532 
CapZ protein, 1003 
Carbamoyl phosphate synthase, molecular 

tunneling, 167-168, 168F 
Carbohydrates, 55 
biosynthesis, 844 

see also Carbon fixation 
see also Polysaccharides; Sugars 
Carbon 

atomic structure, 46F 
cell components, 54-55 
double bonds between, 106F 
skeletons, 106F 
Carbon-carbon double bonds, 106F 
Carbon dioxide (CO2) 

fixation see Carbon fixation 
metabolic production, 98-99 
see also Citric acid cycle 
Carbon dioxide (CO2) pump, carbon fixation, 

846-847 
Carbon fixation, 13,69 
ATP hydrolysis, 845, 845F 
chloroplasts, 69, 843-848 
C0 2 (carbon dioxide), 13, 844-845 
C0 2 pump, 846-847 
cyanobacteria, 840, 873-875 
evolution, 872-875 
NADPH,845,845F 
reactions, 71 F, 844-847, 845F 
ribulose bisphosphate carboxylase, 844, 844F 
Carbon-fixation cycle, 71 F, 844-847, 845F 
Carbonic anhydrase, reaction rates, 1 61 F 
Carbon monoxide, as intercellular signal, 889 
Carboxylated biotin, 84F 
Carboxyl groups, fatty acids, 1 1 4F 
Carboxypeptidase, zinc ion dependence, 1 67 
Carcinogenesis, 1208 
Carcinogens, 1208 

cancer after exposure, 1210, 1210F, 1225 
identification for cancer avoidance, 1229-1230 
industrial, weighing risks, 1230 
metabolic activation, 1225, 1225F 
mutagenicity, 1225, 1225F 
types, 1225-1229 
Carcinoma, 1206, 1234 
Cardiac muscle, 1031 

adherens junctions, 1 142 
cells, 1463-1464, 1463F 

mitochondria, 81 6F 
mutational effects, 1 031 F 
sarcomeres, 1026 
Cardiolipin, 867-867, 868F 
Cardiovascular disease, 1500 
Cargo, 749, 750F, 767F 
Cargo receptor(s), 754 
Carrier protein(s), 652-667 
active transport, 654-667 
asymmetric cellular distribution, 658-659 
ATP-driven pumps, 656F, 659-660, 660F 
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conformational changes, 653F, 655, 655F 
coupled carriers, 656F 

coupling to proton gradients, 822-823, 823F 
evolution, 655 
kinetics, 655, 655F 
light-driven pumps, 656F 
localization, 1 151 
mechanism, 655F 
membrane transport, 652-643 
reversibility, 826-827, 826F 
specificity, 652 

see also ABC transporter(s); Active transport; 
specific proteins 
Cartilage, 1468-1469 
bone repair, 1471 
cells, 1467, 1468-1469, 1469F 

see also Chondrocytes 
erosion by osteoclasts, 1472-1473 
growth, 1468-1469, 1469F 

defective in achondroplasia, 1471-1472, 
1472F 
mineralization, 1471 

'models' in bone development, 1 470, 1 471 F 
replacement by bone, 1470-1471, 1471 F 
Casein kinase 1, 949 
Caspase(s), 1118-1120 

activation, 1118, 11 19R1120F 

see also Procaspase(s) 
caspase-3, 1119 
caspase-8, 1573F 

caspase recruitment domain, 1119 
human, 1119T 

inhibition, 1124-1125, 1127 
interleukin-1 -converting enzyme, 1118 
signaling pathways, 1119 
Caspase recruitment domain (CARD), caspases, 

1119 
Catabolism, 88-103 
activated carriers, 79F 
anabolism vs., 67F 
definition, 66, 88 

oxidation of organic molecules, 70, 1 0OF 
citric acid cycle see Citric acid cycle 
glycolysis see Glycolysis 
oxidative phosphorylation see Oxidative 

phosphorylation 
sugars, 55-58,88 
Catabolite activator protein (CAP), 41 8T, 420F, 436 
Catalase, 144F, 721 
Catalysis 

autocatalysis and origin of life, 7F, 401 
catalysts, 73, 158-159 

see also Enzyme(s); Ribozymes 
in controlled energy use by cells, 65-87 
energy barriers, 72-73 
by RNA see Catalytic RNA; Ribozymes 
Catalytic antibodies, 1 60, 1 61 F 
Catalytic RNA 

origin of life, 401 , 402, 402 F 
self-replication, 404, 407F 
ribosomes, 378-379 
spliceosome active site, 352 
see also Ribozymes; Self-splicing RNA 
Catastrophe factors, 1080 
Catastrophe, in dynamic instability, 1003, 1080 
Catenin(s) 
p-Catenin 

cadherin binding, 949, 1 1 42 
colorectal cancer, 1253 
p120-Catenin, 1142 

y-Catenin (plakoglobin), cadherin binding, 1142 
cell-cell adhesion and intracellular signaling, 

949,1142,1145 
classical cadherin link to actin filaments, 1 142, 
1142F 
Cation, definition, 47 
Cation-transporting ATPases see Calcium pump 

(Ca 2+ -ATPase) 
Cauiobacter cresentus, cytoskeleton shape, 991 
Caveolae, 790, 790F 
Caveolin, 790 

CBC (cap-binding complex), 347 
C-Cbl protein, 926 

CCR5, HIV receptor, 1505-1506, 1505F 
CD3 complex, T cells, 1590F, 1592T, 1599F 
CD4 protein, 1580-1581, 1580T, 1581F, 1592T, 
1599F 
antigen presentation, 1590 
HIV receptor, 1505, 1505F 
role in viral entry into cells, 765F 



CD4T-cell(s), 1580, 1592T, 1599F 
negative selection, 1587, 1587F 
positive selection, 1586, 1587F 
see also Helper T-cell(s) (Th) 
CD8 protein, 1580-1581, 1580T, 1581 F, 1592T, 
1599F 
antigen presentation, 1590 
CD8T-cell(s), 1580, 1592T, 1599F 
negative selection, 1587, 1587F 
positive selection, 1586, 1587F 
see also Cytotoxic T- eel Is (Tc) 
CD9, sperm-egg binding, 1298-1299 
CD28, 1590, 1591F, 1592T 
CD40ligand, 1590, 1592T, 1594, 1595F, 1597 
CD40 receptor protein, 1 590, 1 592T 
on B cell, 1597, 1598T 
on macrophage, 1593F, 1594 
CD80, 1592T 
CD86, 1592T 
Cdc6, 288, 289F, 1068 
Cdc20, APC/C regulation, 1064, 1066, 1066T, 1087, 

1100 
Cdc25 phosphatase, 1063, 1071, 1071F 
Cdc42, 1 042F, 1 043, 1 1 56, 1 5 1 6 
Cdc genes, 1056-1057 

temperature-sensitive mutants, 1057, 1057F 
Cdh1, APC/C regulation, 1064, 1066T, 1 101 
Cdk (cyclin-dependent kinase) see Cyclin- 

dependent kinases (Cdks) 
Cdk4, in cancer, 1243, 1244F 
cDNA, 542 

clones, 542, 543, 543F, 544 

PCR cloning, 546F 
libraries, 542, 543F, 544 
synthesis, 543F, 574 
Cdt1,ORC binding, 1068 
Ced3 gene, 1327, 1327F 
Cec/4 gene, 1327, 1327F 
Ced9 gene, 1327, 1327F 
C. elegans see Caenorhabditis elegans 
Cell(s) 
behavior 

cytoskeleton involvement, 1025-1050 
see also Cell motility/movement 
chemical components, 45-65 
carbon compounds, 54-55 
small molecules, 55 
free energy 

information transmission, 8 

inorganic chemical sources (lithotrophic cells), 

12, 13FF 
light sources (phototrophic cells), 12 
from living organisms (organotrophic cells), 
12 
genome see Genome(s) 
isolation, 501-517 

mixed suspensions, 502 
separation techniques, 502, 503F, 503FF 
origins of life, 400-408 
polarization see Polarity/polarization 
procaryotes, diversity, 14-15, 14F, 15F 
self-reproduction by autocatalysis, 7F 
terminal differentiation, 1103 
tree of life 

archaean cells, 16 

bacterial cells, 14F, 15-16, 15F, 25, 25F 
eucaryotic cells, 14, 26-32 
unit of living matter, 1 
universal properties 

ATP as energy currency, 8-9 

DNA as hereditary information store, 2, 3F 

gene families in common, 23, 24T 

plasma membrane, 9-1 0, 1 OF 

proteins as catalysts and executive molecules, 

5-6, 6F 
ribosomal machinery for protein synthesis, 

6-7, 8F 
RNA as intermediary in information transfer, 4, 

5F 
small molecules and fundamental chemistry, 
8-9,45-123 
vehicle for hereditary information, 2 
see also entries beginning cell/cellular; specific 
components; specific types 
Cell adhesion, 1131-1204, 1177T 
bacterial invasion of host cells, 1 508 
cell-cell see Cell-cell adhesion 
cell-matrix see Cell-matrix adhesions 
traction for cell movement, 1040-1041 
see also Cell adhesion molecules (CAMs); Cell 



junction(s); Extracellular matrix (ECM) 
Cell adhesion molecules (CAMs), 1 1 77T 
cadherins see Cadherin(s) 
cell-cell adhesion, 1 1 77T 

see also Cadherin(s); Cell-cell adhesion 
cell-matrix adhesion, 1 1 77T 

see also Cell-matrix adhesions; Integrin(s) 
ICAMs, 411, 1146, 1592T 
immunoglobulin superfamily, 1145, 1146-1147 
integrin superfamily see Integrin(s) 
selectins, 1145-1146 
synapse formation and, 1147-1148 
T-cell function, 1571 
see also specific types 
Cell-cell adhesion 
assays, 1135 
Ca 2+ role,1135-1136 
cell adhesion molecules, 1 1 77T 

cadherins, 1133-1150 

dendritic cells, 1571 

immunoglobulin superfamily, 1 145, 
1146-1147 

integrins, 1134, 1145, 1146 

selectins, 1145-1146 
cell proliferation and, 1 1 53-1 1 55 
heterophilic vs. homophilic binding, 1138F 
scaffold proteins, 1145, 1148-1149 
selective adhesion, 1139-1140, 1139F, 1140F 
selective assortment, 1 1 40-1 141,11 40F, 1 1 41 F, 

1142F 
signaling, 1145 

synapse formation and, 1147-1149, 1149F 
see also Cell junction(s); Cell-matrix adhesions; 
Extracellular matrix (ECM) 
Cell-cell contact 

actin polymerization via Rac, 1047 
bone marrow, 1458 
capillary sprouting, 1448 
see also Cell-cell adhesion; Cell junction(s) 
Cell coat (glycocalyx), 636, 637F 
Cell communication, 879-903 
adaptation, 902,920, 920F 
autocrine, 881 

budding yeast, mating, 880, 880F 
carbon monoxide, 889 

cell-surface receptors see Cell-surface receptor(s) 
contact dependent, 881 , 881 F 

see also Ephrin(s); Notch receptor protein 
different responses in target cell types, 885, 

885F 
endocrine, 882-883, 882F, 883F 

see also Hormones 
evolution, 955 
extracellular signals, combinatorial actions, 884, 

885F 
gap junctions, 884, 884F 
nitric oxide, 887-889, 888F 
paracrine, 881, 882F, 883 
plants, 955-962 

speed of response, 886-887, 887F 
see also Cell junction(s); Neurotransmitter(s); 
Receptor(s); Signaling 
molecule(s)/pathway(s); Signal 
transduction; individual signaling 
molecules and pathways 
Cell crawling, 1006, 1008, 1036, 1036F 
activities involved, 1036 
leading edge, 1040F 
see also Cell motility/movement 
Cell culture, 501-517 

anchorage dependence, 1 1 75-1 1 76, 1 1 75F 

bacterial, cloning vector production, 540, 541 F 

cells in culture, 504F 

definitions, 502-504 

historical aspects, 504 

mammalian 

cell cycle analysis, 1059, 1059F 

'immortalized,' 1059 

replicative senescence, 1059, 1 107 
plant, 504-505, 568 
primary cultures, 504 
secondary cultures, 504 
tissue segregation in, 1 1 40-1 1 41 , 1 1 42F 
see also Cell lines; Tissue culture 
Cell cycle, 554F, 1 053-1 1 1 3, 1 054F 
analysis 

animal embryos, 1057-1058, 1057F, 1058F 

BrdU labeling, 285, 285F, 1059, 1059F 

DNA microarrays, 1065 

flow cytometry, 1059-1060, 1060F 
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mammalian cell culture, 1059, 1059F 
progression analysis, 1059 
yeast mutants, 1056-1057, 1056F, 1057F 
arrest, 1061 

abnormal proliferation signals and, 

1107-1108, 11 08F 
DNA damage and, 303-304, 1 1 05-1 1 06, 

1106F 
G phase, 1103 
male gametes, 1281 
cancer cells, 1107-1108 
checkpoints see Cell cycle control 
chromosome changes, 208-209, 208F, 209-210, 

209F 
control systems see Cell cycle control 
nucleolar changes, 363, 363F 
overview, 1054-1060 

phases, 208F, 285F, 1054-1055, 1054F, 1055F 
G (G zero), 488-489, 489, 1 103 

elF-2 regulation, 488-489 
Gt phase, 285F, 1 055, 1 1 00-1 1 01 , 1 1 0OF 

mitogen actions, 1 103 
G 2 phase, 285F, 1055 
interphase, 208F, 209, 1055 
M phase, 208F, 285F, 1054, 1071 
cytokinesis see Cytokinesis 
mitosis see Mitosis 
see also Meiosis 
S phase (DNA synthesis), 285F, 1067-1071 
chromatin protein production, 1074 
chromosome duplication, 1054, 1067, 

1068F, 1069-1070 
DNA replication, 284, 285, 1067-1069 
histone synthesis, 289-290 
labelled cells, 1059F 
meiotic, 1090, 1272 
sister chromatid cohesion, 1070-1071 
timing, 284, 285 
see also DNA replication 
universal characteristics, 1053 
start (restriction) point, 1055, 1061, 1066, 1105 
timing, 1056 

see also Cell division; Cell growth; Cell 
proliferation 
Cell cycle control, 177-178, 1060-1067, 1066F 
analysis 

animal embryos, 1057-1058, 1057F, 1058F 
importance, 1053 

mammalian cell culture, 1059, 1059F 
yeast mutants, 1056-1057, 1056F, 1057F 
cancer and, 1216-1217, 1243, 1244F 
checkpoints, 505, 1061 

defects making cancer cells vulnerable, 

1216-1217 
DNA damage, 303, 1 105-1 107 
G 2 /M checkpoint, 1 061 , 1 062, 1 066, 1 1 05 
meiotic, 1281 
metaphase-to-anaphase transition, 1 061 , 

1066,1071 
spindle assembly checkpoint, 1088, 1088F 
start (restriction point), 1055, 1061, 1066, 
1105 
cyclical proteolysis, 1064, 1065F, 1066T 
APC/C see Anaphase promoting complex 

(APC/C) 
SCF enzyme complex, 1064, 1065F 
eucaryotic similarities, 1056 
functions, 1060-1061 
intracellular triggering of cell-cycle events, DNA 

replication, 1067-1069, 1068F 
molecular/biochemical switches, 1061, 

1065-1066,1075 
regulatory proteins, 1066T 

Cdks see Cyclin-dependent kinases (CDKs) 
cyclinsseeCyclin(s) 
E2F proteins, 1103-1105 
inhibitory phosphorylation, 1063-1064 
p53 and, 1105 

Rb proteins, 1 104-1 105, 1 104F 
ubiquitin ligases see Ubiquitin ligase(s) 
see also specific proteins 
resetting, 1069 
as timer/clock, 1060, 1061F 
transcriptional regulation, 1065, 1104, 1104F 
Cell death 

apoptotic see Apoptosis (programmed cell 

death) 
cell number and, 1102 
defective in cancer, 1215-1216 
neurons, 1389-1390 



see also specific types 
Cell determination, 1311-1312, 1312F 

combinatorial control, 465-466 
Cell differentiation, 41 1, 454-477 
cancer and, 1215-1216 
common processes, 412 
differing response to extracellular signals, 41 5, 

464 
DNA rearrangement, bacterial phase variation, 

454-455, 455f 
genome constancy, 41 1 -41 2, 41 3F 
patterns of gene expression, 41 2, 464-465, 464F, 
465 F 
see also Combinatorial control 
protein differences, 41 2 
specialization, 412 
terminal, 1103 

see also Developmental genetics; Gene 
expression regulation 
Cell diversification, role of Notch, 1 362 
Cell division, 1053, 1055F 
asymmetric, 1099, 1099F 

C elegans embryo, 1323-1324, 1323F 
oocytes, 1289-1290, 1289F 
plant development, 1400 
cell death balance, 1102 
cell number and, 1102 
control, 1101-1112 

density-dependent (contact inhibition), 1110, 

1110F 
DNA damage response, 1105-1107 
mitogens see Mitogen(s) 
see also specific factors 
coordinated growth and division, 1 1 08-1 110, 

1109F 
cytoskeletal role, 966-967, 967F 
delay, 1103 

density-dependent inhibition (contact 
inhibition), 1110, 1233-1234 
limits, 1059, 1107 

density-dependent, 1110, 1233-1234 
see also Replicative cell senescence 
plane of, 1095-1097 
plant cell(s), 1195 
stem cells, 1425 

total cell mass control, 1111-1112 
see also Cell cycle; Cell growth; Cell proliferation; 
Cytokinesis; Meiosis; Mitosis 
Cell doctrine, 579 

Cell extracts (homogenates ), 510, 51 2 
Cell fate determinants, asymmetric cell division, 

1099 
Cell fractionation, 510-512 

cell-free systems see Cell-free systems 
cell lysis, 510 

chromatography, 512-514 
electrophoresis, 517, 518F, 521-522 
macromolecule/organelle separation, 510-511, 
51 IF 
mitochondria, 81 7F 
ultracentrifugation, 510-511, 510F, 51 IF, 512F 
see also Protein analysis 
Cell-free systems 

biological process reconstruction, 511-512, 516 
cell cycle analysis, 1058, 1058F 
cell fractionation, 516 
vesicular transport study, 752, 752F 
Cell growth, 1053 

anchorage dependence, 1 1 75-1 1 76, 1 1 75F 

control, 1101-1112 

coordinated growth and division, 1108-1110, 

1109F 
organ growth, 1108 
organism growth, 1 108 

see also Cell cycle; Cell division; Cell proliferation 
Cell homogenate(s), 510, 512 
Cell junction(s), 1131-1204 

anchoring junctions, 1 132, 11 32F, 1133-1150, 
1133TJ135T 
cell-cell, 1 1 32, 1 133-1 1 50, 1 1 33T, 1 1 35T 
adherens junction see Adherens junctton(s) 
desmosome see Desmosome(s) 
immunoglobulin superfamily, 11 45, 

1146-1147 
selectins, 1145-1146 
selective adhesion, 1 1 39-1 1 40, 1 1 39F, 

1140F 
selective assortment, 1140-1141, 1140F, 

1141F, 1142F 
signaling, 1145 



see also Cad he rin(s) 
cell-matrix, 1 1 33T, 1 1 35T, 1 1 69-1 1 78 
actin-linked, 1 1 33T, 1 1 34, 1 1 35T 
focal adhesions, 1170 
hemidesmosomes see Hemidesmosome(s) 
see also Integrin(s) 
intermediate filament attachment, 1 1 33T 
see also Cell-cell adhesion; Cell-matrix 
adhesions 
channel-forming junctions, 1132, 1132F, 1133T, 
1158-1164 
gap junction see Gap junction(s) 
plasmodesmata, 1158, 1162-1163, 1163F 
epithelial, 1133-1135 

see also Epithelia 
functional classification, 1 1 32, 1 1 33T 
homophilic vs. heterophils, 1 1 37, 1 1 38F 
occluding junctions, 1 1 32, 1 1 32F, 1 1 33T, 
1150-1158 
septate, 1154-1155, 11 54F 
tight junction see Tight junction(s) 
signal-relaying junctions, 1 1 32, 1 1 32F, 1 1 33T 
chemical synapse see Chemical synapse(s) 
immunological synapse, 1132 
transmembrane signaling, 1 1 32, 1 1 33T 
transmembrane adhesion proteins, 1 134-1 135, 
1134F, 1135T 
cadherin superfamily, 1133-1150 
immunoglobulin superfamily, 1 145, 

1146-1147 
integrin superfamily, 1 134, 1145, 1146, 

1169-1178 
selectins, 1145-1146 
see also Cell adhesion molecules (CAMs); 
specific types 
see also Cell adhesion; Extracellular matrix (ECM); 
specific types 
Cell lines 

eucaryotic, 505, 506T 
hybrid cells, 509F 

see also Hybridomas 
immortal, 504F, 505, 506T, 1059 
primary vs. secondary, 504 
transformed, 505, 506T 
see also Cell culture 
Cell-matrix adhesions, 1 1 33T, 1 1 34, 1 1 34F, 1 1 35T, 
1169-1178 
actin-linked, 1 1 33T, 1 1 34, 1 1 35T 
CAMs, 1177T 
fibronectin and, 1191 

hemidesmosomes see Hemidesmosome(s) 
see also Cell-cell adhesion; Cell junction(s); 

Extracellular matrix (ECM); Integrin(s) 
Cell-mediated immune responses, 1540, 

1540-1551, 1540F, 1569-1589 
intracellular pathogens, 1572 
transplantation reactions, 1575 
see also MHC (major histocompatibility 

complex); T cell(s);T cell receptor(s) 
Cell memory, 454, 458, 458F, 466 

see also Cell differentiation 
Cell migration 

developmental, 1140, 11 40F, 1373-1375 
ECM degradation, 1 1 93, 1 1 94, 1 1 94F 
external signals/guidance molecules, 1045, 1140 
gut epithelial cells, from crypts to villi, 1 440F 
integrins and, 1170-1171 
neuronal, 1385F 

see also Cell motility/movement 
Cell motility/movement 

in animal development, 1 363-1 378 

see also Cell migration; Development 
contribution of myosin II, 1039F 
crawling see Cell crawling 
microscopy, 583 
protrusion, 1037-1038, 1039F 
traction, 1040-1041, 1041 F 
via actin polymerization, 1037-1039 
Cell number, 1102 
Cell plate, 1097 
Cell proliferation, 1053 

abnormal signals, cell cycle arrest, 1107-1108 
anchorage dependence, 1 1 75-1 1 76, 1 1 75F 
cancer cells, 1107-1108, 1217 

DNA tumor virus proteins, 1 248, 1 249F 
see also Cancer 
coordinated growth and division, 1 108-1 110 
density-dependent inhibition, 1 1 10, 1 1 10F 
integrins and, 1 1 75-1 1 76 
limitation, telomere length, 293 



INDEX 



1:9 



requirements, 1244, 1245F 

scaffold proteins and junctional complexes, 

1153-1155 
total cell mass control, 1111-1112 
see also Cell cycle; Cell division; Cell growth; 
Replicative cell senescence 
Cell renewal and turnover 
epidermis, 1417-1428 
in liver, 1443 

mammary gland, 1426-1428 
small intestine, 1436-1438, 1436F, 1439 
see also Regeneration; Stem cell(s) 
Cell senescence 

macrophage scavenging, 787 
replicative see Replicative cell senescence 
telomere shortening, 293 
Cell signaling, 879-974 
all-or-none response, 901 
general principles, 879-903 
see also Cell communication; Signaling 

molecule(s)/pathway(s); Signal 
transduction 
Cell size, control, 1109-1110 
Cell-surface receptor(s), 891-895, 893F, 894F, 936F 
enzyme-linked, 956, 958F 
G-protein linked see G protein-coupled receptors 

(GPCRs) 
intracellular receptors vs., 881, 881 F 
ion-channel-linked see Ion channel(s) 
in NFkB pathway (Toll family), 1530 
plants, 956-960 
see also individual types 
Cell survival 

integrins and, 1175-1176 
regulation of hemopoiesis, 1462 
see also Cell death 
Cell suspension(s), mixed, 502 
Cell transformation assay, for oncogenes, 1232 
Cellular interactions, mathematical modelling, 

35-36, 35F 
Cellularization, 1099 
Cellulase, protein structure, 529F 
Cellulose 

control of oriented plant cell expansion, 1406 
microfibrils, 1197-1198, 1198F 

cross-linking glycans, 1196,1198, 1198F, 

1199T 
direction of growth and, 1199-1200, 1199F 
structure, 1197, 11 97F 
plant cell wall, 1180, 1196, 1197-1198, 11 98F, 
1199T 
microtubules and deposition, 1200-1202, 
1200F, 1201F 
Cellulose synthase, 1199-1200, 1201 
Cellwall(s) 

plant see Plant cell wall 
procaryotes, 14, 1490F, 1527 
CENP-A, 230-231, 232F 
Central dogma, 331 

Central lymphoid organ, 1 541 F, 1543, 1543F 
Central nervous system (CNS) 
apoptosis, 1115-1116 
early embryonic origins, 1 367F 
progenitor cells, 1386F 
protein aggregation vulnerability, 397 
repair, 1479 

see also Nervous system 
Central spindle stimulation model, 1096F, 1097F 
Centrifugation techniques, 510-51 1, 51 OF, 51 1 F, 
512F 
molecular weight determination, 522-523 
sedimentation coefficient, 511, 522 
Centriole(s), 993, 993F, 1076, 1076F 
replication, 1078, 1078F 
zygote, 1301, 1301 F 
Centromere{s), 228-230 

chromatin structure, 231-233, 231 F 
heterochromatin, 228-229, 232F 
histones, H3 CENP-A variant, 230-231 , 232F 
memory circuits, 231, 233F 
see also Chromatin 
chromosome replication, 209-210, 210, 210F, 

1076, 1076F 
DNA sequence, 210, 229-230 
plasticity, 229-230, 230F 
structure, 229-230, 229F 
Centrosome, 1076, 1076F 

center-seeking behavior, 996F 

composition, 992 

duplication and spindle assembly, 1078, 1078F, 



1079 

maturation, 1079 

microtubules emanating from, 992-996, 993F 

reorientation, in cell locomotion, 1046 

'search and capture' of chromosomes, 1082, 
1084F 
Ceramide, biosynthesis, 744-745 
Cerebral cortex, 1386F 

homunculus, 1391 

neuronal migration, 1385F 

somatosensory region, 1392F 
Cervical cancer, 1211-1212, 1211F 
Cesa (cellulose synthase) genes, 1 199-1200 
Cesium chloride gradients, 51 1 
CG (CpG) islands, 434, 470-471, 1527 

DNA damage, 300-301, 470 

evolution, 434, 470 

role in innate immunity, 1 530 
CGN see Cis Golgi network (CGN) 
Ch4 elements, 31 8T 
Chagas' disease, 1509 
Chain-terminating nucleotides, DNA sequencing, 

550 
Channel-forming junctions, 1 132, 1 132F, 1133T, 
1158-1164 

see also specific types 
Channel protein(s), 652-653 

conformational changes, 653F 

passive transport, 653, 654F 

see also Ion channel(s); specific types 
Chaperones 

bacterial, GroEL, 390F 

eucaryotic, 388-390, 390F, 715, 71 6F, 717 

mitochondrial protein import, 71 5, 716-71 7, 
71 6F 

protein folding role, 130-131,388-390 

see also specific molecules 
Charcot-Marie-Tooth disease, 1048 
CheA, 943, 944, 944F 
Checkpoints in cell-cycle see under Cell cycle 

control 
Chemical biology, 527 
Chemical bonds, 46-50, 48F, 106F 

electron interactions, 46-47 

energy carriers, 61 , 69F 

see also ATP (adenosine triphosphate) 
Chemical groups, 107F 
Chemical reactions, free energy, 75F 
Chemical synapse(s), 682, 684 

acetylcholine receptors, 684 

cell signaling, 882 

excitatory, 684 

inhibitory, 684 

mechanism of action, 683F 

see also Neuromuscular junction (NMJ); 
Neurotransmitter(s) 
Chemiosmotic coupling, 813-814, 814F 

ATP production, 817-819, 81 9F 
see also ATP synthesis 

bacterial, 839-840, 839F 

see also Electron transport chain(s) 
Chemokine(s), 1533-1534, 1550-1551, 1550F 

in inflammatory response, 1453-1454 

proteoglycans and, 1 183 

receptor, HIV binding, 765F 
Chemorepulsion, 1140 
Chemotaxis, 1045, 1140 

bacterial, 941-945, 943F, 945F 

growth cone guidance, 1 388, 1 388F 

neutrophils, 1045, 1045F 
Chiasma formation, 1274, 1274F, 1276, 1276F 
Chick embryo 

limb development, 1312-1313, 1313F, 1355 
see also Limb buds (vertebrate) 

neural development, 1 384F 

somites, 1372F 
Chickenpox virus, 1516-1 51 7 
Chimeras, mouse, 1380, 1380F, 1381F 
Chimeric proteins, transcription activator proteins, 

442F 
Chimpanzee(s), evolutionary relationships, 247, 

247F, 248F 
Chk1 protein kinase, 1 105, 1 106F 
Chk2 protein kinase, 1 1 05, 1 1 06F 
Chlamydia pneumoniae, 1500, 1500F 
Chlamydia trachomatis, 1511F, 1512F, 1513 
Chiamydomonas, flagella, 1032, 1033 
Chloramphenicol, 384, 385T 
Chloride channels, 666, 673, 674F 
Chlorophyll(s),848F 



photochemistry, 847-848, 848F, 849-850, 849F, 
850F 
see also Photosynthesis; Photosystem(s) 
see also Chloroplast(s) 
Chloroplast(s), 30F, 840-855 
bacterial resemblance, 857, 863 
£} barrel proteins, 635 
biogenesis, 856, 856F, 867 
biosynthetic reactions, 855 
cell-free systems, 51 1 
development, 698, 699F 
distribution during cytokinesis, 1098 
electron transport see Photosynthetic electron 

transport chain(s) 
energy interconversions, 842 
evolutionary origin, 29, 31 F, 840-841, 859-860, 
874, 875F 
endosymbiont hypothesis, 859-860, 863-864 
maintenance, 868-870 
organelle-nuclear gene transfer, 859-860 
function, 696 

genetic system, 868-870, 869F 
genome, 856-870 
copy number, 858 
diversity, 857T, 859 
evolution, 859-860, 868-870 
genes, 863, 864F 
gene transfer, 864 
higher plants, 863-864, 864F 
introns, 863 
liverwort, 864F 

maternal inheritance, 866, 866F 
mutants, 867 
replication, 858 
variegation and, 866, 866F 
see also Non-Mendelian inheritance 
glycolysis, 854-855 
growth and division, 857-858 
lipid synthesis, 867 
mitochondria vs., 842-843, 843F 
nuclear-encoded tissue-specific proteins, 867 
photosynthesis see Photosynthesis 
protein import, 719-720, 720F 
protein synthesis, 856-857, 856F, 869F 
starch granules, 94F, 95, 841, 842F 
structure, 71 3F, 842-843, 842F, 843F 
transport, 854-855 
see a/so Chlorophyll(s) 
Chloroquine, Plasmodium falciparum resistance, 

666 
Cholecystokinin, in enteroendocrine cells, 1437 
Cholera toxin, 629, 906, 1492, 1493, 1504 
Cholera, transmission, 1491 
Cholesterol 

biosynthesis, 83F, 743, 744-745, 791 
membranes, 620, 623 
structure, 620, 620F,620FF 
transport see Low-density lipoproteins (LDLs) 
Choline, 11 4F 

Chondroblasts, ECM production, 1 1 79 
Chondrocytes, 1468-1469, 1469F 
Chondrodysplasias, 1187 
Chondroitin sulfate, 1 1 79, 1 388 
Chondroma, definition, 1206 
Chondrosarcoma, definition, 1206 
Chordates, 1370 
Chordin, 940, 1336 
Chromatids see Sister chromatid(s) 
Chromatin, 202, 365F 

chromatin assembly factors (CAFs), 290 
condensation, 243, 286, 288, 1070 
see also Chromosome condensation; 
Heterochromatin 
euchromatin, 220 

heterochromatin see Heterochromatin 
immunoprecipitation, 431-432, 432F 
nuclear sites, 239-240, 239F, 240F, 241 F 
packing, 243, 244F 

remodeling see Chromatin remodeling 
structure see Chromatin structure 
see also Chromosome structure; Genome(s); 
Nucleosome(s) 
Chromatin assembly factors (CAFs), 290 
Chromatin remodeling, 215-216, 442-443, 443F 
"barrier sequences," 227-228 
"code-writer enzyme," 226-227, 227F, 228F 
complexes, 215, 21 5F, 343, 344 
histones, 216, 216F, 432, 433F 
nucleosomes, 216, 432, 433F 
position effect variegation, 226-227 
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RNA interference (RNAi), 443 
RNA polymerase(s), 433 

transcriptional, repressor protein-mediated, 445, 
446F 
Chromatin structure, 21 1 -218, 21 1 -21 9 

30nm fiber, 21 1, 21 1F, 21 2F, 216-218, 21 7F, 21 8F, 
222 
see also Histone(s); Nucleosome(s) 
"beads on a string," 211, 21 IF, 21 2F 
centromeric see Centromere(s) 
direct inheritance, 230-234, 232F 
duplication during S phase, 1069-1070 
effect on replication timing, 285-286 
epigenetics, 472 
historical aspects, 220 
inheritance, 473-476 

see also Heterochromatin 
interphase, loops, 234-236, 235F 
mitotic chromosomes, 243, 243F, 245 
regulation, 219-233 

see also Chromatin remodeling; Chromosome 
structure 
Chromatography, 512-514, 513F, 534F 

see also Protein purification; specific types 
Chromocenter, Drosophila polytene chromosomes, 

237F 
Chromokinesins, 1077 
Chromomeres, 234, 235F 
Chromoplast(s), development, 699F 
Chromosomal crossing-over see Homologous 

recombination (crossing-over) 
Chromosomal instability, 1217 

see also Genetic instability 
Chromosome(s), 1 95-1 96, 202-205, 554F 
aberrant see Chromosome abnormalities 
analysis, 195F, 196, 196F, 202-203, 203F, 203FF, 
237FF,285F, 534, 535F, 590F 
see also specific techniques 
autosomes, 1271, 1284F 
bacterial, 202, 282, 283F, 1491 F 
biological functions, 204 
evolution, 207, 208F 
gene content, 204-205, 204F 

see also Genome(s) 
historical research, 195-196, 196 
homologous see Homologous chromosomes 

(homologs) 
human, 202, 203F, 203FF 

chromosome 3 evolution, 208F 
chromosome 12 translocation, 204F 
chromosome 22, 205F, 206T 
evolution, 207, 208F 
gene organization, 205F 
mouse vs., 249-250, 249F 
replication origins, 287-288 
replication rate, 283 
see also Human genome 
lampbrush chromosomes, 234, 234F, 1288 
mitotic see Mitotic chromosome(s) 
packaging, 1069 

DNA, 202-218, 202-219 
post-mitotic chromosomes, 1090 
see also Chromatin; Chromosome 

condensation; Chromosome structure 
polytene chromosomes see Polytene 

chromosome(s) 
puffs, 220-222, 239, 239F 

see also Polytene chromosome(s) 
rearrangements, 1231 

see also Chromosome abnormalities; specific 
rearrangements 
replication see Chromosome replication 
sex chromosomes see Sex chromosome(s) 
species differences, 204-205, 205F 
structure see Chromosome structure 
see also Cytogenetics; Karyotype; individual 
chromosomes 
Chromosome abnormalities 

analysis see Cytogenetics; Karyotype 
cancer, 1215F, 1231, 1254F 
meiotic errors, 1278-1279 
see also specific types 
Chromosome bands, 202-203, 203FF, 237FF 
Chromosome condensation, 243, 244F, 1 075F 
ATP hydrolysis, 243 

cell cycle variation, 208-209, 208F, 209 
chromatin condensation, 1070 
chromatin packing, 243, 244F 
condensins see Condensin(s) 
M-Cdk role, 1071 



X-chromosome inactivation see X-inactivation 
see also Mitotic chromosome(s) 
Chromosome deletion 

cancer role, 235F, 1234-1236, 1236F 
genome evolution, 246-247 
Chromosome duplication 

centrosome duplication vs,, 1078 
S phase of cell cycle, 1 054 

chromatin duplication, 1069-1070 
regulation of, 1067, 1068F 
Chromosome puffs, 220-222, 239, 239F 

see also Polytene chromosome(s) 
Chromosome replication, 208-209, 209-210, 209F, 
210F 
centromere, 209-210, 210, 210F 
chromatin condensation and timing, 285-286 

see also Chromosome condensation 
control see Cell cycle control 
duplication during S phase of cell cycle, 1 054 
replication origin, 209, 21 OF 
segregation during see Chromosome 

segregation 
sister chromatids see Sister chromatid(s) 
structural changes needed, 1067 
telomere, 210, 210F 

see also Telomere(s) 
yeast, 210 

see also Cell cycle; DNA replication; DNA 
synthesis; Mitosis 
Chromosome segregation 

meiotic {homologous chromosomes), 
1276-1278 
failure (nondisjunction), 1236F, 1278-1279 
mitotic, 865, 865F, 1089-1090, 1089F 
see also Meiosis; Mitosis; Sister chromatid(s) 
Chromosome structure, 554F 

cell cycle changes, 208-209, 208F, 209, 243F 
see also Cell cycle; Interphase chromosome(s); 
Mitotic chromosome(s) 
centromere see Centromere(s) 
chromatin see Chromatin 
DNA packaging, 202-218, 202-219 
global (higher order), 233-245 
chromatin see Chromatin 
condensation see Chromosome condensation 
loops, 234-236, 234F, 235F 
polyteny see Polytene chromosome(s) 
linear, 209-210, 209F, 210F 
replication and, 209, 21 OF, 1067 

see also Chromosome replication; Replication 
origin(s) 
telomere seeTelomere(s) 
X-inactivation see X-inactivation 
Chromosome translocation, 528F 
cancer role, 1261 

Philadelphia chromosome in CML, 1208, 

1208F, 1261J261F, 1262F 
translocation activating Myc gene, 1 239 
chromosome 1 2 and, 204F 
DNA repair, homologous recombination, 309 
genome evolution, 246-247 
Chronic myelogenous leukemia (CML), 1208, 1208F, 

1210, 1218, 1261, 1261F, 1262F 
Chymotrypsin, 138F, 144F 
Cilia, 1031-1034 

basal bodies, 1033, 1033F 
of epithelial cell apical domain, 806 
flagella comparison, 1031 
in left-right asymmetry, 1376F, 1377 
microtubule arrangement, 1032F 
motility, 1031, 1031F 

polarity, of beating in respiratory tract, 1435F 
primary, 1034 
Ciliary (axonemal) dynein, 1031-1032, 1032F, 1033F 

hereditary defects, 1033 
Ciliated cells, respiratory tract, 1434-1436 
Ciliates, 28F 

Circadian clocks, 460-462, 461 F, 462F 
Cis Golgi network (CGN) (intermediate 

compartment), 771 F, 772, 778F 
Cisternal maturation model, of Golgi transport, 778 
Citrate synthase, 122F 
Citric acid, 98, 98F, 122F 
Citric acid cycle, 97-99, 1 02F, 81 7 
electron generation, 817 
pathway, 98F, 122-123FF, 122F, 123F 
Clamp loader, 274, 275F, 276F 
Classical genetics see Genetics,classical 
Class switching, 1567-1568 
B cell activation, 482-483 



V(D)J recombination vs., 1568 
Clathrin coat(s), 751, 754-755, 755F, 756F 

see also Clathrin-coated pit(s); Clathrin-coated 
vesicle(s) 
Clathrin-coated pit(s), 743F, 790F 

LDL endocytosis, 791 -792, 791 F, 793F 

pinocytosis, 789-790, 789F 
Clathrin-coated vesicle(s), 751, 754, 754F, 755F, 
758F 

adaptin, 756F 

cargo receptors, 754-755 

formation, 755-757, 756F 

pinching-off, 754-755, 756F, 758F 

regulation, 795 

structure, 755F 

vesicular traffic, 754F 
Clathrin, structure, 754, 755F 
Claudins, 1153 
CLAVATA1,957F 
Clavatal protein, 1410, 141 OF 
CLAVATA3,956,957F 
Clavata3 protein, 1410, 141 OF 
Cleavage and polyadenylation specificity factor 

(CPSF), 357-358, 357F 
Cleavage furrow, 1093, 1093F 
Cleavage stimulation factor F (CstF), 

polyadenylation, 357-358, 357F, 
482-483 
Clonal anergy, 1548, 1548F 

see also Immunological tolerance 
Clonal deletion, 1548, 1548F 

see also Immunological tolerance 
Clonal expansion, 1546 
Clonal inactivation, 1548, 1548F 
Clonal selection theory, adaptive immunity, 1544, 

1545F 
Cloning, 507-508 

DNA cloning see DNA cloning 

reproductive see Reproductive cloning 

therapeutic, 507F, 508 

vectors (DNA) see under DNA cloning 
Cloverleaf structure, tRNAs, 368, 368F 
Cluster analysis of gene expression, 575, 575F 
CLV1 (CLAVATA 1), shoot meristem, 957F 
CLV3 (CLAVATA 3), 956, 957F 
Coactivator(s),445,447F 
Coat coloration, maternal effects, 474, 474F 

see also X-inactivation 
Coated vesicle(s), 751,754 

formation 

ARF-proteins, 759 

coat-recruitment GTPases, 759-760 
Sari protein, 759 

vesicular transport, 751, 754, 754F 

see also specific types 
Coat-recruitment GTPase(s), 758-760 

see also GTP-binding proteins (GTPases) 
Coaxial stack, RNA structure, 403F 
Cocci, 1490F 

Cockayne's syndrome, 300 
Code reader complex, 225-226, 225F 
"Coding problem," 367 
Codons,6,367,367F, 368F 

Candida CUG codon, 383 

initiation codon, 367F, 380, 489-491 

mitochondrial genome, 861 -862 

redundancy, 246-247 

stop codons, 367F, 381 

synonymous, 247 

wobble, 369, 369F 

see also Anticodon(s) 
Coenzyme A (CoA), 83-84, 83F, 84T, 1 1 7F 

see also Acetyl CoA 
Coenzyme Q, 831, 832F, 835 
Coenzymes, 167, 167T 

see also specific types 
Cofilin (actin depolymerizing factor), 994F, 1001, 
1002F 

lamellipodia, 1038F 

nucleotide hydrolysis, 1002 
Cohesin(s), 1070-1071, 1087, 1272 

N-terminal degradation, 395 

structure, 1070F 
Coiled-coil motif, 135, 135F, 145,349F 
Coilin, nuclear localization, 365F 
Co-immunoprecipitation, 458, 523 
Colchicine (colcemid), 987, 988T, 1021 
Collagen(s), 1131, 1184-1186, 1186T, 1467 

in bone, 1469 

degradation, 1194 
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fibril-associated, 1185, 1187-1189, 1189F 
fibrillar vs., 1188-1189 

fibrillar, 1184-1189, 11 86T 

fibril-associated vs., 1 188-1 189 

genes, 1184-1185 
evolution, 1185-1186 
mutations, 1187 

structure, 144F, 146F, 1 184, 1185F 

fibrils, 1185, 1185F, 1187, 1187F, 1189, 11 89F 

fibrous protein, 146 

post-translational modification, 1186, 1187F 

synthesis, 1186 

collagen fibrils, 1 187, 1188F 
pro-a chains, 1186 
procollagen, 1186, 1187 
propeptides, 1186 

tensile strength, 1 1 87-1 1 88, 1 1 89 

type 1, 1185, 1186T, 1187 

type II, 1186X1187, 1469 

type 111,1186X1 187 

type IV, 1 1 65, 1 1 66-1 1 67, 11 67F, 1 1 68, 1 1 85, 
1186T 

typeV, 1186T 

type VII, 1185, 1186T 

type IX, 11 85, 1189, 1189F 

typeXI, 1186T 

type XII, 1185, 1189 

type XVII, 1185, 1186T 

type XVIII, 1165, 1185, 1186T 
Collagenases, 1194 

Colloidal gold, electron microscopy, 606 
Colon cancer see Colorectal cancer 
Colorectal cancer, 1250-1255, 1352F 

detection and screening, 1251-1252 

genetic abnormalities, 1251-1253, 1252T, 1438 

heterogeneity, 1256 

mismatch repair defects, 277, 295T, 1254 

sequence of mutations, 1255, 1255F 

steps in progression, 1250-1252, 1254-1256, 
1352F 
Columnar epithelia, 1 1 33, 1 1 50-1 1 51 
Combinatorial control, 425 

cell determination, 465-466 

cell differentiation, 464-465, 465 F 

complex formation, 447, 447FF 

Drosophila Eve gene, 448-450, 449F 

heterodimerization, 424-425, 425F, 425FF 

promoter integration, 449-450, 450F 
Combinatorial diversification, antibody chains, 

1563 
Combinatorial regulation, microRNA (miRNA), 

494-495 
Combustion, 820F 
Commissural axon(s), 1388, 1388F 
Commissureless protein, 1389 
Common cold, 1518 
Common evolutionary ancestor, 862 
Compact bone, 1470F 
Compaction, 1379, 1379F 
Compaction ratio, DNA packaging, 210-21 1 
Comparative genomic hybridization (CGH), tumour 

cell identification, 1239, 1239F 
Comparative genomics, 22, 207, 245-260 

human vs chimpanzee, 247, 248F 

human vs mouse, 249F 

phylogenetic footprinting, 431, 431 F 

phylogenetic tree creation, 247, 247F 
Compartment(s), 1352-1355, 1352F 

compartmentalization of cell, 695-704 

of imaginal discs, 1352, 1353-1354 

intracellular see Intracellular compartments 
Competition 

between axons for synaptic territory, 1 393-1 394, 
1394F 

sexual reproduction and, 1271-1272, 1271 F 
Complementary DNA see cDNA 
Complementation, 528F 
Complementation tests, 528F, 558 
Complement-binding co-receptor complex, 1596 
Complement system, 1542 

activation, 1528-1529, 1528F 

adaptive immunity, 1551-1552, 1596, 1596F 

C3 activation, 1528 

C3b and C3a, 1528-1529 

cascade, 1528-1529, 1528F 

early components, 1 528 

Ig activation, 1557T 

late components, 1 529F 

lesions in red blood cells, 1529F 

membrane attack complex, 1 529, 1 529F 



phagocytosis, role, 788 
Complex oligosaccharide(s), 773-775, 774F 
Complex traits, 563 
Concentration gradient, 653 
Condensation, chromosomes see Chromosome 

condensation 
Condensation reactions (macromolecule 
polymerization), 84-87, 85F 

sugars, 56, 57F 
Condensin(s), 243, 244F, 245F, 1075 
Conditional mutation, 528F 

temperature-sensitive, 557, 557F, 1057, 1057F 
Cones (cone photoreceptors), 1432, 1432F 
Confocal microscope, 591 F 
Confocal microscopy, 590-592, 591 F, 592F 
Conformational changes 

allosteric regulation see Allosteric regulation 

cadherins, 1138 

EF-Tu elongation factor, 180-181, 181F, 377 

integrins, 1170-1172, 11 72F 

ionchannel(s),668F 

motor protein(s), 1016-1019 

negative feedback (feedback inhibition), 171, 
172F 

prion protein (PrP), 397F, 398 

ribozymes, 404, 406F 

tryptophan repressor, 433-434 
Congenital abnormality, DNA repair defects, 295T 
Connective tissue, 1131,11 32F, 1 1 78F 

basal lamina and see Basal lamina 

cells, 1467-1474, 1467F 
differentiation, 1468-1470 
types, 1467, 1467F 
see also individual cell types 

collagen see Collagen(s) 

extracellular matrix, 1 1 78-1 195 

cell differentiation influence, 1468-1470 
see also Extracellular matrix (ECM) 

see also Epithelia 
Connexin-26, 1161 
Connexins, 1159-1161, 1160F, 1290 
Connexons, 1 159-1 161, 1 160F 
Consensus sequences 

polyadenylation signals, 357, 357F 

promoter elements, 338-339, 338F 

RNA polymerase start sites, 342F 

splicing signals, 349, 349F 
Conservation, evolutionary see Evolutionary 

conservation 
Conservative site-specific recombination, 316, 324 

gene control mechanism, 325-326 

reversible DNA rearrangement, 324, 324F, 325F 

transgenic applications, 325-326, 326F 
Conserved synteny, 207, 208F 
Constant domain, Ig heavy chain, 1559F, 1560F 
Constitutive secretory pathway, 800, 801 F 

see also Exocytosis 
Contact-dependent signaling, 881, 881 F 

see also Ephrin(s); Notch receptor protein 
Contact guidance, 1 1 40, 1 387-1 388 
Contact inhibition, 1 1 10, 1 1 1 0F, 1 233, 1 234F 
Contractile bundles, 1006-1007, 1007F 

see also Actin/actin filaments 
Contractile ring see Cytokinesis 
Convergent extension, 1368, 1369F 
Coomassie blue, protein staining, 517, 518F 
Cooperativity (protein interactions) 

allosteric transitions, 173-174, 173F, 174F, 175F 
see also Allosteric regulation 

DNA replication fork, 273, 274F, 276 

hemoglobin, 256, 256F 
COPI-coated vesicle(s), 751, 754, 754FF 
COPII-coated vesicle(s), 751, 754F, 759F, 767, 767F 
Copia element, 318T 
Copy number variation, 258 

genome evolution, 259-260, 259F 
Co-receptor, T cell, 1580-1581, 1580T, 1581F 
Corepressor(s), 445, 447F, 457-458, 458-459, 458F 

see also Cooperativity (protein interactions) 
Corn see Maize 

Corneal tumor, angiogenesis, 1448, 1448F 
Coronaviruses, 1496F 

Corral(s), membrane partitioning, 647, 648F 
Cortical reaction, 1299, 1300, 1300F 
Corticotropin (ACTH), 803F 
Cortisol, 463, 889, 889F 

gene expression regulation, 463 

receptor, 891 F 
Costal2 protein, 951 
Costimulatory signals 



antigen-presenting cells, 1542F, 1571, 1572F, 
1593F 
see also B7 proteins; CD40 ligand 

dendritic cells, 1571 

lymphocyte activation, 1548 
Coupled reactions see Reaction coupling 
Covalent bonds, 47, 48-50, 50F, 51 F, 1 06F 

bond length, 48, 53T 

bond strength, 48-49, 53T 

dipoles, permanent, 50 

geometry, 49, 50F 

ionic bonds vs., 48F 

phosphate 

bond energies, 93F 
phosphoanhydride bonds, 61, 93F 
phosphodiester, 61-62, 93F 

see also individual types 
Covalent modification of proteins, 186-187, 186T 

see also Posttranslational modification; specific 
types 
CpG motifs see CG (CpG) islands 
CPI, p-Globin gene expression, 451 F 
CPSF (cleavage and polyadenylation specificity 

factor), 357-358, 357F 
Cranial sensory ganglia, 1384F 
CRE (Cyclic AMP response element), 908 
Creatine phosphokinase, myoblast specialization, 

464 
CREB (CRE-binding protein), 908, 908F 
CREB-binding protein (CBP), 908, 908F 
C region exons, antibody genes, 1 562, 1 563FF 
Cre/Lox system, 567-568, 1348, 1444 
Creutzfeldt-Jacob disease (CJD), 397-398 
Crick, Francis H.C, 196 
Critical period, 1396 
Cro repressor, 142, 142F, 421 F, 457-458, 458F 

see also Lambda repressor 
Cross-beta filaments, 397, 397F 
Crossing-over see Homologous recombination 

(crossing-over) 
Cross presentation, antigen presentation, 1584 
Cross-strand exchange see Holliday junctions 
Cross-talk, prevention between MAP kinases, 930 
Crumbs complex, epithelial apico-basal polarity, 

1156, 1157F 
Cryoelectron microscopy, 610 
Cryptdins, 1437 

Cryptic splicing signals, 352, 352F, 355, 355F 
Cryptochrome, 961 
Crypts of Langerhans, 1526F 
Crypts, of small intestine, 1436, 1436F, 1441 F 
Crystallization, proteins, 528 
Crystallography 

electron, 611-612 

X-ray see X-ray crystallography 
CstF (cleavage stimulation factor F), 357-358, 357F, 

482-483 
CTCF protein, 470F 
C-terminal domain (CTD), RNA polymerase II, 

341-342 
CTLA4, B7 binding, 1591-1592 
CTLA-4 protein, T-cell regulation, 1592T 
CTXcf) bacteriophage, 1492F 
Cubitus interruptus (Ci) protein, 951, 952F 
Culture, cells see Cell culture 
Cut-and-paste transposition, 31 9F, 320F 

see also DNA-only transposons 
CXCR4, 1375, 1505-1506, 1505F 
Cy3 fluorescence, 587, 587F 
Cy5 fluorescence, 587, 587F 
Cyanide, cytochrome oxidase binding, 834 
Cyanobacteria 

evolutionary significance, 873-875 

photosynthesis, 827F, 840 

in tree of life, 16F 
Cyclic AMP (cAMP), 1 1 7F, 905-909, 906F, 907F, 
907T 

bacterial toxin action, 1503 

cyclic-AMP-gated cation channels, 91 7 

gene activation by, 909F 

regulation, G protein, 905-906 

regulation of PKA, 908, 908F 

synthesis and degradation, 905-906, 907F 
Cyclic-AMP phosphodiesterases, 906, 907F 
Cyclic-AMP response element (CRE), 908 
Cyclic GMP(cGMP) 

cyclic-GMP-gated Na + channel, 917 

photoreceptors, 917-918, 91 7F 

response to NO, 887, 888F 
Cyclic-GMP phosphodiesterase, 91 8 
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Cyclic-nucleotide-gated ton channels, 916, 917 
Cyclic photophosphorylation, 853 
Cyclin(s), 1062 

APC/C and, 1087 

Cdk complexes see Cyclin-Cdk complexes 

cyclical changes, 1062, 1063 

Grcyclins, 1062 

G^S-cyclins, 1062 

gene expression, 1101 

M-cyclins, 395, 1062, 1064, 1074, 1087, 1101 

S-cyclins, 1062, 1064, 1087 

signal integrating device, 1 77-1 78, 1 77F 

vertebrates, 1063F 

yeast, 1063F 
Cyclin-Cdk complexes, 177-178, 177F, 1062, 1062F, 
1063F 

CKI binding, 1063-1064 

G r Cdk, 1062, 1066, 1103-1105 

Gi/S-Cdk, 1062, 1066, 1103-1105 

M-Cdk, 1062, 1066 

activation by dephosphorylation, 1074-1075, 

1074F 
entry into mitosis, 1071, 1074 
microtubule dynamics and, 1 080 
spindle assembly role, 1078-1079, 1079F, 
1080 

mechanism of action, 1062-1063, 1062F 

S-Cdk, 1062, 1066, 1067-1069, 1068 

see also Cyclin(s);Cyclin-dependent kinases 
(Cdks) 
Cyclin-dependent kinase-activating kinase (CAK), 

1063, 1063F, 1066T 
Cyclin-dependent kinase inhibitory proteins (CKIs), 

1063-1064, 1064F, 1066T, 1101 
Cyclin-dependent kinases (Cdks), 1062-1063 

activation by dephosphorylation, 75F, 177, 1063, 
1063F, 1066T, 1074-1075, 1074F 

cyclical changes in activity, 1062 

cyclin complexes see Cyclin-Cdk complexes 

evolution, 177F 

G 2 /M checkpoint, 1062 

inactivation/inhibition 
Gi phase, 1100-1101 
phosphorylation, 1063-1064, 1064F, 1066T 

regulation, 177-178, 177F 

vertebrates, 1062, 1063F 

yeast, 1062, 1063F 

see also Serine/threonine kinases; specific kinases 
Cycloheximide, 384, 385T 
Cyclostome see Anaphase promoting complex 

(APC/C) 
Cys-Cys-His-His zinc finger proteins, 422F 
Cysteine, 101,1 29F 
Cystic fibrosis, 560, 666, 768 
Cystic fibrosis transmembrane conductance 

regulator protein (CFTR), 666 
Cystinuria, 652 
Cytochalasin, 988T 
Cytochrome(s), 829, 831, 832-833, 833F, 852F 

see also individual cytochromes 
Cytochrome be-f complex, photosystem 11,851, 

852F 
Cytochrome b 5 62, structure, 1 37F 
Cytochrome b-Ci complex, 832, 833F 

proton pumping, 835 

Q-cycle, 835-836 

structure, 836F 
Cytochrome c, 833F 

apoptosis, 1118, 1121, 1121F, 1122F 

mutation rate, 265 

structure, 144F 
Cytochrome oxidase complex, 832, 833F 

essential nature, 834 

iron-copper center, 834 

mammalian, 834 

oxygen reduction, 832-834, 833F 

poisoning, 834 

structure, 833F, 834, 834F 
Cytochrome P-450 oxidase, carcinogen activation, 

1225 
Cytogenetics, 202-203, 203F, 203FF, 204F 
Cytokine(s) 

antagonists, viral-encoded, 1534 

dendritic cells, 141 3F, 1 592-1 593 

inflammatory response, 1533-1534 

T cells, 1582 

see also Interleukin(s); specific molecules 
Cytokinesis, 1054, 1055, 1055F, 1073F, 1092-1101, 
1093F 

asymmetric, 1099, 1099F 



cleavage furrow, 1093, 1093F 
contractile ring, 1093F 

actin and myosin II as force generators, 
1093-1094 
see also Actin/actin filaments; Myosin 
assembly, 1093-1094, 1093F 
dynamic behavior, 966 
RhoA role, 1094-1095, 1095F 
site of assembly, 1097 
midbody, 1094, 1094F 

mitotic spindle and plane of division in animal 
cells, 1095-1097 
astral relaxation model, 1096-1097, 1096F 
astral stimulation model, 1096, 1096F 
central spindle stimulation model, 1096, 
1096F, 1097F 
organelle distribution, 1098 
phragmoplast role in higher plants, 1097-1098, 

1097F, 1098F 
plasma membrane enlargement, 805 
timing, 1095 
see also Mitosis 
Cytolytic effects, of viruses, 1 496 
Cytomegalovirus (CMV), MHC class I protein 
translocation inhibition, 1536 
Cytoplasm, 74-75, 75F, 696 
division of see Cytokinesis 
egg (ovum), 1287 
strands in plant cells, 782F 
see also Cytosol; specific components 
Cytoplasmic bridges, 1290, 1290F, 1294, 1296F 
Cytoplasmic dyneins, 1014 
Cytoplasmic inheritance see Non-Mendelian 

inheritance 
Cytoplasmic tyrosine kinases, 935 
Cytosine, 116F, 197, 301 F 
base-pairing, 198F 
C-to-U editing, 484 
deamination, 296, 297F, 298F, 301 F 
RNA structure, 332 
Cytoskeletal filaments, 647, 648F, 965-1053 
assembly/formation 

drug-induced changes, 987-989, 988T 
kinetics, 973 

nucleation, 973, 978F, 992 
rate of subunit addition, 977 
self-assembly, 965-991 
subunits, 970-971 
see also Actin polymerization 
behavior, 966-969 
capping, 1003, 1003F 
cross-linking, 1001, 1005-1008 
dynamic behavior, 965-991, 1002-1003 
elongated, 972F, 976 
nucleation regulation, 992-1010 
organization, 1005-1008 
stabilization, 1001-1002 
subunit stoichiometries, 1002-1003 
see also Actin/actin filaments; Intermediate 
filament(s); Microtubule(s) 
Cytoskeleton, 26, 965-1053, 965F 

cell junction attachment, 1 1 31, 1 142, 1 142F 

cellular polarity, 969, 1044 

function, 965 

helical filaments, 143, 145 

influencing cell behavior, 1025-1050 

integrin attachment, 1 170, 1 1 70F 

intracellular anchor proteins, 1 1 42 

motor proteins see Motor protein(s) 

pathogen utilization, 1514-1517 

properties, 965 

Rho proteins, 1041-1043 

RNA localization, 1022-1023 

spectrin (membrane cytoskeleton), 646, 647F 

stability, 969 

see also Actin/actin filaments; Cytoskeletal 

filaments; Intermediate filament(s); 
Microtubule(s); specific components 
Cytosol, 696 

pH regulation, 657-658, 661 -662 
volume, 697T 
Cytotoxic T-cells (T c ), 1535, 1560-1561, 
1572-1573,1581-1582 
activation, helper T-cells (T H ), 1 574-1 575 
antigen presentation, 1581-1582, 1583F 

helper T-cells vs., 1584 
CD8 co-receptor role, 1580-1581, 1581F, 1586 
cell killing mechanism, 1572-1573, 1573F 
interferon-y effects, 1592 
selection in thymus, 1585-1586, 1587F 



transfer experiments, 1 581 F 
virus-infected cells, 1581 F 
see a/so CD8T-cell(s) 



Dae (dachsund) gene, 466F 

Dachsous, planar cell polarity, 1 1 58 

Dally/Dally-like genes, 1 1 84, 1 1 84T 

Dalton, definition, 45 

DAN enzyme, 492F 

DAPI fluorescence, 587F 

Darkfield microscopy, 583 

Databases, 530-531 

Daz gene/protein, spermatogenesis regulation, 

1295-1296 
DCC receptor protein, 1 388, 1 388F 
Deadenylation-independent mRNA decay, 492, 

492F 
Deadenylation-mediated mRNA decay, 492, 492F 
Deafness, connexin-26 mutations, 1 1 61 
Deamination, 296, 297F, 301 F 

DNA methylation role, 300-301, 301 F 

hypoxanthine production, 300, 301 F 

inosine production, 369, 369F, 484, 484F 

5-methylcytosine, 470 

mutagenesis mechanism, 298F, 300-301 
Death (of cells) see Cell death 
Death inducing signaling complex (DISC), 

apoptosis, 1120, 1120F 
Death receptor(s), apoptosis, 1120-1121, 1 120F 
Decapentaplegic (Dpp) gene, 1 335 
Decapentaplegic protein (Dpp), 1335, 1335F, 1353, 

1353F 
Decapping, 492 

Decorin, 1181-1182, 11 82F, 11 84T 
Default pathway, 800, 801 F 

see a/so Exocytosis 
Defensins, 1525-1526, 1533 
Degradation signals, protein(s), 396F 
Dehydrogenation reactions, 71F, 72 
Delayed K + channel(s), 689 
Deletion cassettes, yeast mutagenesis, 570 
Deletions, chromosomal see Chromosome deletion 
8 chain, T-cell receptor, 1571 
Delta G (AG) see Free energy change Delta G (AG) 
Delta-Notch signaling see Notch signaling pathway 
Delta protein 

C elegans early embryo, 1 324 

Drosophila sensory bristle development, 1359 

lateral inhibition, 946-947, 947F, 1358FF 

see also Notch receptor protein; Notch signaling 
pathway 
Denaturation, protein, 130 
Dendrite(s), 1047, 1050F, 1386-1387, 1387F 

microtubule orientation, 1049 

neuronal computation, 688F 

neurosignaling, 675 
Dendritic cells, 1 533, 1 542, 1 571-1 572, 1 571 F 

activation, 1571,1590-1591 

cell-cell adhesion molecules, 1571 

co-stimulatory proteins, 1571 

distribution, 1571 

functions, 1542, 1542F, 1593F, 1594 

maturation, 1571 

MHC expression, 1571,1590 

NOD proteins, 1536 

nonactivated, 1571 

pattern recognition receptors, 1536, 1571 

phagocytosis, 787, 1452 

self-reactive T-cell elimination, 1591 

Toll-like receptors (TLR), 1531, 1536 
Denovo DNA methyltransferases, DNA methylation, 

467 
Dense-core vesicles see Secretory vesicle(s) 
Density gradient centrifugation, 511, 51 2F, 522 
Deoxyribonucleic acid see DNA 
Deoxyribonucleoside triphosphates (dNTPs), 266, 

268F 
Deoxyribose, 116F,332F 
Dephosphorylation, 176, 176F 

see also Protein phosphatase(s) 
Depth of field, electron microscopy, 606, 608 
Depurination, 296, 297-298, 297F, 298F, 299F 
Dermal papillae, 1422F 
Dermatan sulfate, 1179 
Dermis, 1418, 1418F, 1477 
Desensitization see Adaptation 
Desert Hedgehog protein, 950 
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Desmin,985T, 987 
Desmocollins, 1136, 1138T 
Desmogelins, 1136, 1138T 
Desmosome(s), 1 1 34, 1 1 34F, 1 1 35T, 1 1 43-1 1 44, 
1144F, 1177T 

keratin filaments, 986 
Desmotubule, 1162 
Detergent(s), 517, 51 7R 638F, 639F 

ionic vs. nonionic, 636 

membrane protein solubility, 636-640 

see also specific detergents 
Determination see Cell determination 
Detoxification reaction(s), smooth endoplasmic 

reticulum, 725 
Development, 1305-1320 

asymmetric cell division, 1 099, 1 289-1 290, 
1289F, 1313-1315, 1314F 

C eiegons see Caenorhabditis elegons 
development 

cell determination, 1311-1312, 1312F 

cell fate, 1311-1312 

cell lineage tracing, 1310-1311, 1311 F 

cell memory, 1305, 1312,1315 

cell migration, 1140, 1140F 

cleavage of egg, 1307 

descriptive embryology, 1310 

Drosophila melanogaster see Drosophila 
development 

epithelial folding, 1142, 1143F 

epithelial-mesenchymal transitions, 1 141 

evolutionary conservation among animals, 
1 306F, 1 307 

experimental embryology, 1310, 131 OF 

four essential processes, 1305, 1306F 

gap junctions, 1161 

genetics see Developmental genetics and gene 
regulation 

germ layers and gastrulation, 1 307, 1 307F 

inductive interactions and signals, 1310, 1 31 3, 
1313F, 1316T 

lateral inhibition, 1314, 1315F 

mitosis in absence of cytokinesis, 1099-1100 

model organisms, 131 1 

morphogens and gradients, 1316-1317, 1318F 

mouse see Mouse development 

nervous system, 1383-1397 
see also Neuron(s),development 

plants see Plant development and growth 

positional controls, 1111 

positional values, 1312-1313, 1312F, 1313F 

regulatory DNA defines program, 1 309, 
1309-1310, 1309F 

sequential induction, 1319, 1320F 

timekeeping, 1319-1320, 1320F 

total cell mass, 1111 

vertebrates see Vertebrate development 

Xenopus laevis see Xenopus laevis development 

see also Embryo(s)/embryogenesis; Signaling 
molecule(s)/pathway(s); individual 
organisms and processes 
Developmental genetics and gene regulation, 
450-451, 451 F, 1308, 1309 

cell differentiation, 464-465, 465F 

combinatorial gene control, 464-4695, 465F 

Drosophila see under Drosophila development 

genes specially required, 1308 

genetic screens, 1308 

organ formation, 465-466, 466F 

regulatory DNA defines program, 1309, 
1309-1310, 1309F 

transcriptional synergy, 464 

see also Cell differentiation; specific genes 
Diabetes mellitus type 1 , 1 549 
Diacylglycerol, 910 
Diakinesis, 1275, 1276 
Diapedesis see Lymphocyte(s),recirculation 
Diarrhea 

bloody, in dysentery, 1491 

enteropathogenic E. coii, 1504 

Salmonella enterica spread, 1518 

spread of infection, 1487-1488 
Dicer protein, RNA interference (RNAi),496 
Dickkopf protein, 131 6T 
Dictyostelium, 16F 

chemotaxis, 1045 

myosin I and II localization in crawling amoeba, 
1041, 1041 F 
Dideoxy DNA sequencing, 549-550 
Didinium, 28F 
Dietary requirements, nitrogen, 101 



Differential-interference microscopy, 583 

see also Phase-contrast microscopy 
Differentiation of cells see Cell differentiation 
Diffraction patterns, 528, 528F 
Diffusion-limited enzyme catalysis, 163, 163T, 169 
Diffusion, random nature, 74, 75F 
Digestion, 88, 1436 

see also Lysosome(s) 
Dihydrofolate reductase, cancer treatment, 1 260 
Dihydrouridine, tRNA modification, 368F, 369F 
Dihydroxyacetone, 1 12F 
Dihydroxyacetone phosphate, 120F 
Dimer formation 

DNA damage, 296, 298F 
proteins, 142F 

see also DNA-binding proteins 
Dimethylbenz[a]anthracene (DMBA), 1226, 1238 
2, 4-Dinitrophenol, 836 
Dinitrophenyl, 1545, 1545F 
Diploid cells 

classical genetics, 554F 

sexual reproduction, 1269, 1270F 

see also Meiosis 
yeast life cycle, 34, 34F 
Diplotene, 1275-1276, 1280, 1288 
Dipoles, covalent bonds, 50 
Disaccharides, 56, 57F, 113F 
Discs large (Dig) protein, 1 1 48-1 1 49, 1 1 48F, 1 1 54 
Dishevelled gene/protein, planar cell polarity, 1 1 58, 

1359 
Disintegrins, 1 193 
Dislocation, proteins see Retrotranslocation, 

misfolded proteins 
Dissociation constant (Kd), 158F 
Dissociation rate, 158F, 526 
Distal-less gene, 1351, 1351F, 1355 
Disulfide bonds 

amino acids, 129F, 147-148 
electrophoresis, 51 8F 
protein stability, 147-148, 147F 
Diurnal rhythms, 460-462, 461 F 

see also Orcadian clocks 
Divergence (evolutionary) 
mutation rate analysis, 264 
phylogenetics, 247, 247F, 248F 
see also Gene duplication(s) 
"Diverse animalcules," Leeuwenhoek, Anton van, 

501 F 
Divisional asymmetry, in stem cell production, 

1421, 1421F 
DMBA (dimethylbenz[a]anthracene), as mutagenic 

chemical carcinogen, 1226, 1238 
DNA (deoxyribonucleic acid), 3, 3F 

amplification via PCR see Polymerase chain 

reaction (PCR) 
analysis, 532-553 
bending see DNA bending 
catenation, 1071 
cell macromolecule, 62F 
chemical synthesis, 548 
chromatography, DNA-binding proteins, 

428-429, 429F 
cloning see DNA cloning 
compaction, 210-211 
complementary see cDNA 
damage see DNA damage 
fingerprinting, 546, 547F 
helix-passing reaction, DNA topoisomerase II, 

281 F 
historical research, 195-196, 195-197, 329 
identification as genetic material, 1 95-1 96, 

196F, 197, 198F 
Streptococcus pneumoniae experiments, 196F, 

198F 
structure elucidation, 1 95-1 96, 1 96, 1 97, 329 
hybridization see DNA hybridization 
information coding, 199-200, 329, 330F, 331 
DNA makes RNA makes protein, 331, 331 F 
relation to proteins, 199-200, 199F 
as universal information store, 2, 297, 408 
see also DNA sequence; Genetic code; 
Genome(s) 
labeling, 534 
linker, 211 
location, 200-201, 201 F 

see also Nucleus 
maintenance, 263-265 
failure, 263, 265, 1212 

see also Cancer; DNA damage; Mutation(s) 
genome evolution, 246-247 



see also DNA repair 
manipulation see Recombinant DNA technology 
noncoding see Noncoding DNA 
packaging, 202-218, 202-219 
chromatin packing, 243, 244F 

see also Chromatin; Nucleosome(s) 
compaction, 210-211 
see also Chromosome(s); Chromosome 
structure 
polarity, 3F 
protein interactions see Protein-DNA 

interactions 
recombination see Recombination 
regulatory see Regulatory DNA 
repair see DNA repair 
repeats see Repetitive DNA 
replication see DNA replication 
structure see DNA structure 
synthesis see DNA synthesis 
telomeric, 210 

see also Telomere(s) 
templated polymerization, 3-4, 3F 
thermal stability, 296 
unwinding 

replication, 273, 273F 
transcription, 333 
see also entries beginning DNA 
DNA array(s) see DNA microarray(s) 
DNA bending 

nucleosome-DNA interactions, 214F 
proteins, enhancesome formation, 446, 447F 
see also DNA-binding proteins 
DNA-binding dyes, cell cycle analysis, 1 059 
DNA-binding motifs (proteins), 416-454 
P sheet motif, 422, 423F 
DNA-binding homeodomains, paired domains, 

141 
gene regulatory proteins, 41 8-41 9, 41 8T 
helix-loop-helix motif, 425-426, 426F 
helix-turn-helix motif, 419-420 
homeodomain, 420-421, 421 F 

see also Homeodomain proteins 
recognition helix, 41 9, 420F 
structure, 419, 420F 
leucine zipper motif, 423, 424F 
protein-DNA interaction, 41 7, 41 8-41 9 
base-pair recognition, 41 7, 41 7F 
see also DNA structure 
zinc finger motifs, 421-422, 422F, 423F 
see also Protein-DNA interactions 
DNA-binding proteins, 427F 
binding site prediction, 426 
dimerization, 420, 420F, 422 
Cro repressor, 142, 142F 
functional role, 423 
heterodimerization, 424-425, 425FF 
homodimerization, 424, 425F 
DNA-binding motifs see DNA-binding motifs 
DNA structure relationship, 416-41 7, 41 7F, 41 9 

see also DNA structure 
helix-turn-helix proteins, 420-421, 420F 
homeodomain proteins see Homeodomain 
proteins 
histones see Histone(s) 
leucine zipper proteins, 423, 424F, 425F 
major groove-binding, 423, 424F 
minor groove-binding, 423, 424F 
sequence-specific, 430F 

affinity chromatography, 428-429, 429F 
chromatin immunoprecipitation, 432, 432F 
DNA sequence determination, 429-431, 430F 
gel-mobility shift assay, 427-428, 428F 
zinc finger interactions, 427F 
SSBs see Single-strand DNA-binding proteins 

(SSBs) 
zinc finger proteins, 421-422, 422F, 423F 
see also DNA replication; Gene regulatory 

protein(s); Protein-DNA interactions 
DnaB protein, 283F 
DNA catenation, 1071 
DNA cloning, 532, 540-541 

cloning vectors, 540-541, 541 FF, 542F 
genomic library production, 540-541, 542F 
ligation reaction, 540, 541 F 
restriction fragments, 540 
reverse genetics, 575 
see also cDNA; DNA libraries 
DnaC protein, 283F 
DNA crosslinks, 295T, 296, 297F 
DNA damage, 296, 296F, 297F 
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agents causing see DNA-damaging agents 

apoptosis, 1106, 1117 

ATM/ATR signaling pathway, 1 105 

bulky lesions, 298 

cancer and, 1216-1217, 1225, 1246 
see also Carcinogens; Mutation(s) 

cell cycle and, 303-304 

checkpoints, 303, 1105-1107, 1106F 

cross-linking, 296, 297F 

failure to repair, 1106 

see also Cancer; Mutation (s) 

repair see DNA repair 

replication, 296, 298F 
see a/so Mutation(s) 

spontaneous alterations, 296, 296F 
see also specific types 

see also DNA repair 
DNA-damaging agents 

carcinogens see Carcinogens 

ionizing radiation, 300-301 

see also Ultraviolet (UV) radiation 

sensitivity, DNA repair defects, 295T 

see also Mutagenesis 
DNA-DNA interactions, 306F 

hybridization see DNA hybridization 
DNAduplication(s) 

chromosomes see Chromosome duplication 

exon recombination, 257 

genes see Gene duplication(s) 

genome evolution, 246-247 

whole genome, 38-39, 39F, 255 
DNA excision, site-specific recombination, 323, 324, 

324F 
DNA fingerprinting, 546, 547F 
DNA footprinting, 429-431, 430F 
DNA glycosylases, 297-298 

mechanism, 299F 

recognition of DNA damage, 300-301, 300F 
DnaG protein, 283F 
DNA helicase(s), 273, 274F 

assay, 273F 

defects, 295T 

DnaB protein, 283F 

inhibition, DnaC protein, 283F 

mechanism of action, 273, 282, 283F 

TFIIH transcription factors, 340-341 
DNA-helix-passing reaction, DNA topoisomerase II, 

281 F 
DNA hybridization, 306 

chromosome paints, 202-203 

DNA-RNA hybridization, 535-537, 538F 
Northern blotting, 538-539, 539F 

helix nucleation, 306F 

hybridization conditions, 535-536, 537F 

model of recombinational base-pairing, 306F 

nonpairing interactions, 306F 

recombinant DNA technology, 532 

Southern blotting, 539-540, 539F 

see also DNA probe(s) 
DNA labels, 534 
DNA libraries, 540-542, 541-542 

cDNA, 542, 543F, 544 

genomic, 542, 542F 
cDNAvs., 543F, 544 

in yeast two-hybrid system, 524 

see also DNA cloning 
DNA ligase{s) 

DNA cloning, 540-541 , 541 F 

DNA repair, 295T, 302-303 

Lagging strand DNA synthesis, 272 

reaction mechanism, 273F 
DNA looping, 437F, 438F 

transcription regulation, 438, 438F, 480-481 
DNA melting, 537F 

DNA methylase(s) see DNA methyltransferase(s) 
DNAmethylation,472F 

cancer cells, 1213 

CG (CpG) islands, 434, 470-471 

damage recognition, 300-301 

enzymes, 284F, 467 

eucaryotes, 278 

gene expression and, 467-468, 468F 

genomic imprinting, 468-470 

inheritance, 467-468, 467F 

methylated DNA binding proteins, 468 

5-methylcytosine, 467, 467F 

procaryotes, 277-278, 282, 284F 

restriction-modification systems, 532 

recognition functions, 277-278, 532 

role in mutation, 300-301 , 301 F 



strand-directed mismatch repair, 277-278, 277F, 
284F 

uncontrolled, 296F 
DNA methyltransferase(s), 284F, 467 
DNA microarray(s), 574 

cell cycle analysis, 1065 

gene expression analysis, 537, 574-575 
cancer cell typing, 4 14F, 1240, 1249, 1264 
cluster analysis, 575, 575F 
human genes, 412 

metastases, 1249 

methodology, 574, 574F 

replication fork analysis, 285, 286F 

sizes, 574 
DNA mismatch repair see Mismatch repair 
DNA-onlytransposons, 31 8F, 318T, 323 

cut-and-paste transposition, 318T, 31 9F, 320F 
DNA polymerase^), 266 

5'-3' chain elongation, 267, 268F 

catalytic mechanism, 267F, 268F 

cooperativity, 275-276 

DNA repair, 302 

dNTP substrates, 266, 267F, 268F 

eucaryotic, 280, 281 F, 293F, 294F, 295T 

fidelity, 269 

movement along DNA, 273-274 

proofreading, 269-270, 270F 
see also Mismatch repair 

RNA polymerases vs., 334, 335 

sliding clamp, 274-275, 275-276, 275F, 276F 

structure, 268F 

T7 polymerase, 269 

thermophilic, 544 

see also Polymerase chain reaction (PCR) 

viral, 1517 

see also specific enzymes 
DNA primases, 272, 272F, 273, 276F 

bacterial dnaG, 283F 

eucaryotic, 280 

mechanism of action, 282, 283F 
DNA probe(s), 534, 536F 

nucleic acid detection, 539-540, 539F 
see also DNA hybridization 
DNA-protein interactions see Protein-DNA 

interactions 
DNA rearrangement(s) 

bacterial phase variation, 454-455, 455F 

conservative site-specific recombination 
heritable, 325-326 
reversible, 324, 324F 

see also Recombination; specific rearrangements 
DNA recombination see Recombination 
DNA renaturation see DNA hybridization 
DNA repair, 263, 295-304 

base excision repair (BER), 297-298, 299F 

cell-cycle delay, 303-304 

cross-link repair, 295T 

defects in see DNA repair disorders 

direct chemical reversal, 298 

DNA polymerases, 302 

DNA structure importance, 297 

double-strand breaks, 295T, 302-303, 303F 

enzymes, 296, 299F 

error prone, 310-311 

failure see Mutagenesis 

homologous recombination, 295T, 305 

identification, 295 

importance, 295 

mismatch repair see Mismatch repair 

multiple pathways, 295, 297-298 

nucleotide excision repair (NER), 295T, 298, 299F 

RNA polymerase coupling, 299-300 

transcription coupling, 299-300 

translesion synthesis, 295T 

see also DNA damage; individual repair pathways 
DNA repair disorders, 277, 295-296, 295T, 304, 1 106 

see also Cancer; specific disorders 
DNA repeats see Repetitive DNA 
DNA replication, 3, 200, 201 F, 263, 266-295 

analysis techniques, 282-283, 284F, 285, 285F 

bacterial, 280, 282, 283F 
end-replication, 292 

DNA catenation, 1071 

DNA synthesis see DNA synthesis 

DNA templating, 200, 201 F, 266 

end-replication problem, 292 

errors, 246-247, 269, 271-272, 276-277 
see also DNA damage; Mutation(s) 

eucaryotic, 280 

chromatin effects, 285-286 



chromosome ends seeTelomere{s) 

initiation see DNA replication initiation 

multiple replication forks, 283, 284F 

replication forks see under Replication fork 

replication origins see under Replication 
origin(s) 

replication rate, 283 

S phase of cell cycle, 284, 285, 285F, 
1067-1069, 1068F 

timing, 285 

see also Cell cycle 
fidelity, 269-270, 271 FT 

see also DNA repair 
historical research, 266-267 
incorrect model of DNA replication, 269F 
initiation see DNA replication initiation 
machinery, 266-281, 276F 

cooperativity, 275-276 

DNA helicases, 273, 273F, 274F 

DNAIigase,272,273F 

DNA polymerase see DNA polymerase(s) 

DNA primase, 272, 272F, 273, 276F 

DNA topoisomerases, 278-280, 279F, 280F 

primosome, 276 

single-strand DNA-binding proteins, 273, 
274F,275F 
mitochondrial, 856-857, 857F 
nucleosome assembly, 289-290 
origins see Replication origin(s) 
phylogenetic conservation, 280 
proofreading, 269-270, 270FF 
replication fork see Replication fork 
replication origins see Replication origin(s) 
semiconservative nature, 266, 266F, 268F 
strand recognition, 277 

methylation in procaryotes see DNA 
methylation 

nicks in eucaryotes, 278, 278F 
see also Single-strand DNA breaks 

see also Mismatch repair 
transcription vs., 333-334 
"winding problem," 278F 
see also Chromosome replication; individual 
components 
DNA replication initiation, 281-282 

bacterial chromosomes, 282, 283FF, 284F 
eucaryotic, 289F, 1067, 1068F 

preinitiation complex, 1067 

prereplication complex, 1067-1068, 1068F 
ORC (origin recognition complex), 287 
origins see Replication origin(s) 
origins of replication see Replication origin(s) 
proteins, 281, 282, 283F, 1067 
regulation, 282 
DNA-RNA hybridization, 536-537, 538F 

see also RNA-DNA hybrids 
DNA segment shuffling, 19, 19F 
DNA sequence, 199-200 

alterations in genome evolution, 246-247 
analysis techniques see DNA sequencing 
bacterial, as immunostimulant, 1527 
evolutionary conservation, 207, 208F, 250 
human p-globin gene, 199F, 200F 
recognition, 41 8-41 9, 41 8T 

see also DNA-binding motifs 
structural effects, 416-41 7 
DNA sequencing, 548-550 
alignment, 530-531 
automation, 550, 550F 
chain-terminating nucleotides, 550 
gel electrophoresis, 534, 535F 
genomic see Genome sequencing 
historical aspects, 549-550 
human genome, 142, 205-206 

comparative, 207, 247, 247F, 248F 
intron-exon recognition, 551 
open reading frame prediction, 551 
protein sequence prediction, 139, 550-551, 550F 
recombinant DNA technology, 532, 534, 535F 
DNA structure, 62F, 195-201, 197-201 

backbone (sugar-phosphate), 197, 198-199, 

198F 
base-pairing, 197-199, 198F,417F 
complementary chains (strands), 197-199, 266F 

antiparallel nature, 198F, 199, 267 

role in repair, 297 

role in replication, 200, 200F, 201 F, 266 
deformation, coupled to packaging, 213, 21 4F 
double helix, 197-199, 198F, 199F, 338F 

complementary chains (strands) see DNA 
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replication 
hydrogen bonds, 197-198, 198F, 199R417 
major groove, 1 99F, 41 6F 
minor groove, 199F, 416F 
repair, 296-297 

see also DNA repair 
rotation, 278, 278F 
strand separation, 273 
turns, 198-199,199 
elucidation, 195-196, 196, 197 
hairpin helices, 273 
mechanism for heredity, 199-200 

see also Gene(s) 
nucleotides, 62, 1 97-1 99, 1 98F 

see also Nucleotide(s) 
phosphodiester bonds, 199F 
polarity, 198F 
3' to 5' polarity, 197 

protein-DNA interactions, 41 6-41 7, 41 7F 
see also DNA-binding motifs (proteins) 
RNA vs., 408 
sequence effects, 41 6-41 7 

see also DNA bending 
X-ray diffraction analysis, 195-196, 196 
see also DNA topology 
DNA supercoiling see Supercoiling 
DNA synthesis, 266-268 
analysis 

autoradiography, 282-283, 284F, 602-603 
BrdU staining, 285, 285F, 1059, 1059F, 1422, 
1425 
ATP requirement, 86, 87F 
chemistry, 268F 

5'-3' chain elongation, 266-267, 267F, 268F, 

271-272, 271F 
deoxyribonucleoside triphosphates, 266, 
267F, 268F 
DNA replication, 266-268 

5'-3' chain elongation {leading strand), 267, 

267F, 268, 268F, 271-273, 271 R 276F 
lagging strand see Lagging strand synthesis 

(DNA replication) 
primer strand, 267F, 268F 
pyrophosphate release, 267F, 268F 
template strand, 267F, 268F 
see also DNA polymerase; DNA synthesis 
initiation, 281 
mechanism, 268F 

see also DNA polymerase(s) 
relation to histone synthesis, 289-290 
see also DNA replication 
DNA topoisomerases, 278 
catalytic reaction, 279F 
DNA replication role, 278-280 

see also DNA replication 
lambda integrase vs., 324 
mechanisms of action, 278-279, 279F, 280F 
topoisomerase I, 278, 279F 
topoisomerase II, 278-279, 280F, 281 F 
DNA topology 

supercoiling see Supercoiling 
transcription elongation, 343-345, 344F 
see also Nucleosome(s) 
DNA transfer see Horizontal gene transfer 
DNA tumor virus(es), 1247-1249, 1248F 
DNA virus(es), 1496F, 1497F 
in cancer, 1227-1228, 1228T 
see also DNA tumor virus(es) 
dNTPs see Deoxyribonucleoside triphosphates 

(dNTPs) 
Docking proteins, insulin receptor substrate (IRS-1), 

924 
Dolichol phosphate, 115F 
Dolichol, protein glycoslation, 737, 738F, 747F 
Dolly the sheep, 1287 
Domain fusion, activator proteins, 442F 
Domains, living world, 16, 16F 
Dominant negative mutation(s), 528F, 564F 
antisense RNA, 564F 
genetic engineering, 564 
RNA interference, 564F 
Dopamine, gap junction permeability regulation, 

1162, 1162F 
Dormancy, Epstein-Bon virus (EBV), 1499 
Dorsal lip, of blastopore, 1 367-1 368, 1 367F 
Dorsal protein, of Drosophila, 1334-1335, 1335F 
Dosage compensation, 473, 475-476, 476F 
Double bonds, carbon-carbon, 106F 
Double-checking, DNA polymerase proofreading, 
269 



Doublesex (Dsx) gene, Drosophila sex 

determination, 481 F, 482F 
Double-strand breaks (DSBs) 

homologous pairing/meiotic recombination, 

305F,312, 1275,1280 
production, 300-301 

repair, 302-303, 303F, 304-305, 308-309 
defects, 295T 

see also Homologous recombination 
(crossing-over) 
topoisomerase II production, 280F 
Double-stranded RNA (dsRNA), viruses, 1534 
Down syndrome, meiotic nondisjunction, 

1278-1279 
Dpp (decapentaplegic) gene/protein, 1 335, 1 353, 

1353F, 1355 
Drkgene, 928F 

Drosophila development, 1328-1341 
anteroposterior patterning, 1341-1347 
see also Drosophila melanogaster homeotic 
genes 
cell cycle analysis, 1058 
cellularization, 1099, 11 OOF 
early embryo and genesis of body plan, 
1328-1341 
blastoderm, 1330-1331, 1330FF 
body axis specification, 1332F, 1333-1334, 

1334F 
dorsoventral patterning, 1334-1337, 1335F, 

1335FF 
egg, 1330, 1331F 
egg polarity genes see Egg polarity genes 

(Drosophila) 
fate map, 1331F, 1332F 
follicle providing egg-polarizing signals, 

1333F 
gastrulation and mesoderm formation, 1335, 

1336F, 1340 
morphogen gradients, 1333-1334, 1334FF, 

1335F, 1335FF, 1336 
mRNA localization, 487, 488F, 1 333-1 334, 

1334F 
oocyte, 1333F 

planar cell polarity, 1 1 57, 1 1 57F 
regulatory hierarchy, genes, 1338F 
segmentation genes see Drosophila 

melanogaster segmentation genes 
sex determination, 1 286 
syncytial to cellular transition, 1330-1331 
syncytium formation, 1099, 1 100F 
synopsis, 1329F 

vertebrate body plan vs., 1 336F 
genetic control/gene regulation, 408-409, 
447-450, 448F, 449F, 449FF 
Even-skipped see Eve (Even-skipped) gene 
£ygene, 466, 466F 
homeotic genes see Drosophila melanogaster 

homeotic genes 
Ras role in eye development, 927, 927F 
segmentation genes see Drosophila 

melanogaster segmentation genes 
see also specific genes 
genetic model, 1329 

genetic screening for early patterning, 1 332 
genetic techniques, 1329 
imaginal discs, 1331 

see also under Imaginal discs 
key to developmental mechanisms in other 

animals, 38 
oogonia formation, 1 290, 1 290F 
organogenesis and patterning of appendages, 
1347-1362,1347-1363 
compartment boundaries as signaling 

centers, 1353-1355, 1353F 
compartments, 1352-1355, 1352FF, 1353F 
eye development, 466, 466F 
imaginal discs, 1349-1351, 1350F, 1356-1357 
individuality specified by homeotic selector 

genes, 1351 
intercalary regeneration, 1354, 1354F 
size regulation, 1353-1354 
parasegments vs. segments, 1330, 1330F 
sex determination, 481 -482, 481 F, 482F 
Drosophila melanogaster 

adult anatomy, 1328-1330, 1328F 
apoptosis, 1125, 1126 
chromosomes, 237F 
chromosome 2, 330F 
gene-chromosome relationship (proof), 
37-38, 38F 



polytene see Polytene chromosome(s) 
sex chromosomes, 481 
circadian clock, 461-462, 462F 
cytoskeleton, 967, 967F, 983 
development see Drosophila development 
gene expression 

alternative splicing, 479 

chromosome puffs, 220-222 

dosage compensation, 475 

gene regulatory proteins, 41 8T, 447, 448F, 

HOOT 
position effects, 221-222, 221F 
gene regulatory proteins, MOOT 
genome 

information coding, 329, 330F 
mobile genetic elements, 31 8T, 480, 556 
sequencing, 551, 552 
size, 18T 

see also chromosomes (above) 
homeodomain proteins, 429 

see also Homeodomain proteins 
as model organism, 37-38 
mRNA localization, 1022-1023 
multicellular development and, 447-450, 448F, 

449F, 449FF, 465-466, 466F 
oocyte, 1022-1023 
photoreceptor assembly, 927F 
protein interaction maps, 188 
RNA interference (RNAi), 571 
vertebrate homologies, 1306, 1306F, 1308 
wing hair mutants, 1 1 57-1 1 58, 1 1 57F 
Drosophila melanogaster homeotic genes, 
1341-1347 
bithorax and Antennapedia complexes, 1342, 

1342F 
Hox complex, 1342-1347 

see also Hox complex; Hox genes 
memory mechanism, 1344 
modulated repetition strategy, 1341-1342 
mutations, 1342 
Polycomb and Trithorax group genes, 1 344, 

1345F 
positional values, 1344 
sequential Hox gene expression, 1343-1344, 

1343F 
vertebrate homologies, 1344-1347, 1346F, 

1346FF 
see also specific genes 
Drosophila melanogaster segmentation genes, 
1336-1338, 1337F, 1338F 
gap genes, 1 333, 1 337-1 338, 1 337F, 1 338F 
pair-rule genes, 1333, 1337-1338, 1337F 
regulatory DNA controlling pattern (eve gene), 
447-45O, 448F, 449F, 449FF, 
1339-1340, 1339F, 1340F 
segment polarity genes, 1333, 1338F, 1339 
see also individual genes 
Drug-induced changes, cytoskeletal filament 
polymerization, 987-989, 988T 
Drug resistance, 1521-1524 
Dscam gene, alternative splicing, 479F 
Dynamic instability, cytoskeletal filaments, 980, 
981F,982F 
catastrophe, 980, 1003 
catastrophe vs. rescue, 1080 
of microtubules see Microtubule(s) 
nucleotide hydrolysis, 979F 
rescue, 980, 981 F 
Dynamin protein, 756F, 757-758, 758F 
Dynein(s), 1014-1015, 1015F, 1018, 1019F 
ATP hydrolysis, 1019 
attachment to membrane-enclosed organelles, 

1022, 1022F 
axonemal, 1015 

ciliary beating and left-right asymmetry, 1 377 
cytoplasmic, 1014 
force generation, 1018 
linker region, 1019 
mechanochemical cycle, 1019 
mitotic spindles, 1077, 1077F, 1079 
power stroke, 1019, 1019F 
Dysentery, epidemic, 1491 
Dyskeratosis congenita, 294 
Dystrophin, 1466 



E2F proteins, 1103-1105, 1244F 

F6 viral oncogene/protein, 1248, 1249F 
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E 7 viral oncogene/protein, 1248, 1249F 

Ear, embryonic origin, 1384F, 1429 

Early cell plate, 1098 

Early genes, chromatin condensation effect, 

285-286 
Eastern equine encephalitis virus, 1496F 
Ebola virus, cell entry, multivesicular bodies, 797 
E-cadherin, 1136, 1138T 

in cancer invasiveness, 1249-1250 
epithelial-mesenchymal transitions, 1 141 
E. coli see Escherichia coli 
Ectoderm, 1307, 1365 
Edema, 1493 
Edema toxin, 1493 
Edge effects, 582F 

EDTA, cell separation techniques, 502 
EF-1 elongation factor, 377F 
EF-2 elongation factor, 377F 
Effector B cells, 1 543, 1 546, 1 546F, 1 552F 
Effector T cells, 1543, 1544F, 1546, 1546F, 1552F 
Efflux transporter proteins, auxin transport, 959 
EF-G elongation factor, 377, 377F 
EF-Tu elongation factor, 1 81 , 1 81 F 

conformational change, 180-181, 181 R 377 
GTP-binding protein, 1 80-1 81 , 1 81 F, 377 
protein synthesis, 180, 377-378, 377F 
EGF see Epidermal growth factor (EGF) 
Egg (ovum), 1287-1292 
activation, 1287, 1298 
Ca 2+ increases, 1299 
cortical granules, 1288 
cortical reaction, 1299, 1300, 1300F 
sperm binding to, 1270F, 1298-1299, 1298F, 

1300F 
see also Fertilization 
Celegans, 1323-1324 

development stages, 1 288-1 290, 1 289F, 1 291 F 
oocytes see Oocytes 
PGCs see Primordial germ cells (PGCs) 
timing, 1288 
enucleation, 411, 413F 
parthenogenesis, 1287 
size, 1287, 1287F, 1290-1291, 1290F 
specialized cell structure, 288F, 1287-1288, 

1291F, 1298F 
totipotency, 1282 
Egg polarity genes (Drosophila), 1332-1335, 1332F, 

1334F 
Ehlers-Danlos syndrome, 11 87 
elFs (eucaryotic initiation factors), 380F 
elF-2, 488-489, 490F, 1 535 
elF4E, 380,491 
elF4G, 380 
regulation by phosphorylation, 488-489, 

490-491, 490F 
see also Cap-binding complex (CBC) 
Elastase, 138F 

Elastic fibers, 146F, 1189-1191, 1 190F 
Elastin, 1179, 1189-1191, 1191F 

structure, 146-147, 146F, 1190-1191, 1191F 
synthesis, 1190 
Electrical activity, in synapse maintenance and 

elimination, 1393-1397 
Electrical potential, neuron(s), 675 
Electrical synapses, gap junctions, 1 161 
Electrochemical gradients, 654F 
Electrochemical proton gradients, 653, 813, 821 F, 
827-840, 853-854 
across bacterial membranes, 839, 839F 
ATP production, 662, 817-819, 81 9F, 822-824 

see also ATP synthase 
bacterial flagellum, 821-822, 823F 
chemiosmosis, 813-814, 814F 
coupling to active transport, 822, 823F 
generation, 1 00, 1 0OF, 820-82 1 , 82 1 F 

see also Proton pumps 
membranes and, 640, 655, 820-821, 821 F 
mitochondrial protein import, 716-717, 71 6F 
noncyclic photophosphorylation, 850-851, 852F 
pH gradient, 820-821, 821 F 
proton-motive force, 82 1 , 839F 
Electrogenic pumps, 662-663, 671 
Electron(s) 

atomic interactions, 46-48 
see also Chemical bonds 
electron shell, 46, 48F, 49 
orbitals, 46 
Electron crystallography, 611-612 
Electron density maps, X-ray diffraction analysis, 
528 



Electron microscopy, 604-613 

3-D reconstruction, 606, 607F, 61 1-612 

EM-tomography, 61 1 F, 612, 61 2F 
autoradiography, 603F 
chromosome puffs, 239 
contrast generation, 606, 610 
colloidal gold, 606, 607F 
heavy metal stains, 606 
metal shadowing, 608-609, 61 OF 
negative staining, 610, 61 IF 
cryoelectron microscopy, 610 
depth of field, 606, 608 
electron lens, 604 
electron scattering, 605 
freeze-etch, 61 OF 
grids, 605, 605F 
history, 604T 
imaging, 610-612 
immunogold, 606-607, 607F 
microscope, 604 
molecular localization, 606-607 
resolution, 604-605, 604F, 608 
sample preparation, 605-606, 605F 
scanning electron microscopy (SEM), 607-608, 

608F, 609F 
SEM see Scanning electron microscopy (SEM) 
single-particle reconstruction, 61 1, 612F, 613F 
surface analysis, 607-609, 608F, 609F 
tomography, 61 1 F, 61 2, 61 2F, 61 3F 
transmission see Transmission electron 
microscopy (TEM) 
Electron-motive force, 814, 81 5F 
Electron scattering, electron microscopy, 605 
Electron stains, 606, 607F 
Electron transfer, 71-72, 100F, 813, 814F 
energetics, 820-82 1 , 829, 830F 

see also ATP (adenosine triphosphate) 
mitochondria vs. chloroplasts, 81 4, 81 5F 
NADH/NADPH carriers, 82-83, 82F, 818-819, 
818F 
see also Activated carriers in metabolism 
oxidative phosphorylation, 100, 819, 81 9F 
photochemical reaction centers, 848, 850F, 852F 
protons, 827-828, 828F 

redox potentials, 829, 831, 835, 835F, 852F, 853 
see also Electron transport chain(s); Oxidative 
phosphorylation 
Electron transport chain(s), 827-840 
ATP synthesis, 1 00, 1 0OF, 81 7-819, 81 9F 
proton gradients, 821-823, 821 F 
proton movement, 828, 828F, 829F 
uncoupling, 836 

see also ATP synthase; Electrochemical proton 
gradients; Proton pumps 
bacterial, 839-840, 839F, 8734F 

evolutionary implications, 871-872 
chloroplasts see Photosynthetic electron 

transport chain(s) 
cyanobacteria, 872, 873-875 
energetics, 820-821 
evolution, 870-876 

anaerobic bacteria, 871 -872 
cyanobacteria, 872, 873-875 
fermentation, 870-871 
photosynthetic bacteria, 872 
iron-sulfur centers, 101 
mitochondria see Mitochondrial electron 

transport chain(s)(s) 
photosynthesis see Photosynthetic electron 

transport chain(s) 
see also ATP (adenosine triphosphate); Electron 
transfer 
Electrophoresis, 517 
agarose gel, 534 

cell fractionation, 517, 518F, 521-522 
gel staining, 51 7, 51 8F, 534, 535F 
polyacrylamide gel see Polyacrylamide gel 

electrophoresis (PAGE) 
protein analysis, 518, 521-522, 521 F, 522FF 
protein denaturation, 517 
protein purification, 517, 518FF, 521-522, 522FF 
pulsed-field, 534 

recombinant DNA technology, 534 
blotting see Blotting 
DNA sequencing, 532, 534, 535F, 551 
pulsed-field gels, 534, 535F 
Electroporation, 565-566, 598, 598F 
Electrostatic interactions, 54 
Elements (chemical), 45, 46, 47F, 48F 
Elongation factor(s), 1 79-1 80 



EF-G see EF-G elongation factor 

EF-Tu see EF-Tu elongation factor 

transcriptional, 343-345 

translational, 376F, 377-378, 377F 
Embryo(s)/embryogenesis 

anteroposterior axis see Anteroposterior axis 
development 

cadhehns, 1136, 1136F 

cancer-critical gene function analysis, 
1241-1242 

cell crawling, 1036 

cell cycle analysis, 1057-1058, 1057F, 1058F 

descriptive embryology, 1310 

dissociation experiments, 1139-1140, 1139F, 
1140F 

experimental embryology, 1310, 1 31 OF 

Gi phase of cell cycle, 1 1 00 

gastrulation, 1364-1369 

plant, 1400, 1401F, 1402, 1402F 

polarity, 1364 

stem cells see Embryonic stem (ES) cells 

see also Development; individual species; specific 
stages 
Embryonic germ cells, 1283 
Embryonic stem (ES) cells, 505-507, 1 283 

control of differentiation by different factors, 
1481 F 

derivation, 505-507, 506F 

differentiated cell production in vitro, 1481 

drug discovery, 1482 

immune rejection problem, 1481-1482 

mouse, 567F, 1480 

"personalized," 1303, 1303F, 1481-1482 

"therapeutic cloning," 507 

in tissue repair, 1481-1482 
Encephalitis, viral, 1502 
Endochondral bone formation, 1470 
Endocrine cells, 882 

gut (enteroendocrine cells), 1437, 1437F 

in respiratory epithelium, 1435 
Endocrine signaling, 882-883, 882F, 883F 
Endocytic-exocytic cycle, 789 
Endocytic pathway, 749, 750F, 751 F, 795F 
Endocytosis, 749, 787-799 

early endosomes, 782, 792 

fluid-phase, 789 

late endosomes, 782 

LDLs, 791-792, 791 F, 793F 

receptor-mediated see Receptor-mediated 
endocytosis 

signal, 793 

see also Endocytic pathway 
Endoderm, 1307,1365 
Endoglycosidase H (Endo H), oligosaccharide 

processing, 775F 
Endo H-resistant oligosaccharides, 775F 
Endoplasmic reticulum (ER), 723-745 

antibody synthesis, 768F 

antigen processing/presentation, 1583F 

calcium store, 725-726 

ceramide synthesis, 744-745 

cholesterol synthesis, 743, 744-745 

COPII-coated vesicle formation, 767, 767F 

development, 702, 704 

distribution during cytokinesis, 1098 

evolution, 700F 

giycolipid synthesis, 744-745 

glycoprotein synthesis, 736-738, 737F, 747F 

glycosphingolipid synthesis, 744-745 

lipid bilayer assembly, 743-745, 744F 

lipid metabolism, 626 

membrane, amounts, 697T 

membrane proteins, 486, 630, 635 

microsomes, 511, 726 

microtubule motors, membrane organization, 
1021 

oligosaccharide processing, 775F 

phospholipid exchange proteins, 745 

phospholipid synthesis, 743, 743F 

phospholipid transport, 745 

position in cell, 697 

protein assembly, 736 

GPI anchor attachment, 742-743, 742F 
virus assembly and, 1513, 1514F 

protein degradation see Proteasome(s) 

protein folding, 736 

misfolded protein fate, 739-742, 741 F 
unfolded protein response, 740-742 
see also Protein folding 

protein glycosylation, 736-738, 737F, 747F 
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quality control, 767-768 
resident proteins, 732F, 736, 737F, 747F, 767, 
770-771, 770F 
selective retention, 771 
retention signal, 736 

retrieval pathway, 769-770, 770-771, 770F 
rough ER, 725F, 726, 726F 
smooth ER, 724-725, 725F, 726F 

see also Sarcoplasmic reticulum 
sphingomyelin synthesis, 744-745 
structure, 696, 723, 724F 
T-cell receptor assembly, 768 
unfolded protein response, 740-742 
virus assembly and, 1513, 1514F 
volume, 697T 
Endoplasmic reticulum (ER), protein transport, 
723-745 
co-translational translocation, mechanism, 732F 
to Golgi apparatus, 766-787, 768-769 
MHC class I proteins, inhibition by viruses, 1 536 
misfolded protein fate, 739-741, 741 F 
multipass transmembrane protein integration, 

734-736, 735F 
post-translational translocation, 732F, 736 
quality control, 767-768 
retrotranslocation, 391, 739-740, 740F 
Sec61 protein translocator complex, 730-731, 

730F,731F 
signal recognition particle (SRP), 727-730, 729F 

mechanism of action, 728, 730 
signal sequences, 726-727, 733-734 
single pass transmembrane protein integration, 

724-725, 733F 
SRP receptor, 728 
start-transfer signal, 732 
stop-transfer signal, 733, 733F 
(3-Endorphin, production, 803F 
Endosome(s) 

antigen processing/presentation, 1583F 
early, 782, 784F, 791, 794F 
apical and basolateral, 799F 
epithelial cells, 798, 799F 
material retrieval, 792-794 
function, 696 

late, 782, 784F, 794F, 798, 799F 
membrane amounts, 697T 
protein sorting, 807F 

recycling, 793F, 794, 794F, 797-798, 797F, 798F 
transport to lysosomes, 791-792 
volume, 697T 
see also Endocytosis 
Endosymbiont hypothesis, organelle evolutionary 

origin, 859-860 
Endothelial cells, 1445-1450, 1447F 
embryonic origin, 1446 
inflammatory response, 1534 
structure, 1445 
Endothelin-3, 1375 
End-replication problem, 292 
Energy 

ATP as carrier, 61, 62F 

see also Activated carriers in metabolism; ATP 
{adenosine triphosphate) 
cell use 

catabolism, 70,88-103 
catalysis, 65-87 
energy conversion, 813-815 
oxidation reactions, 70 
see also Chloroplast(s); Mitochondria; specific 
uses 
chemical bonds, 48-49, 49F, 69F 
electrical energy, 69F 
forms, 69F 

free energy see Free energy 
heat energy, 67-68, 69F 

see also Thermodynamics 
interconversions, 69F, 813 
kinetic energy, 69F 
light energy, 69F 

see also Photosynthesis 
potential energy, 69F 
Engrailed gene/protein, 1306F, 1340-1341, 1340F, 
1352, 1352F 
protein structure vs. yeast alpha2 protein, 138F 
Enhancers, 438F, 1309 
Enhancesome, 447, 447F 
Enkephalins, proteolytic processing, 803 
Enolase, 121F 

Enteroendocrine cells, 1437, 1437F 
Entropy (S), 66-67, 67F, 11 9F 



Enveloped viruses, 1497, 1505 

see also specific viruses 
fnvgene, HIV (human immunodeficiency virus), 

1521 
Environmental reservoirs, antibiotic resistance, 

1523-1524 
Enzyme(s), 6, 62-63, 72-75, 1 58-1 59 
antibiotic resistance and, 1 522, 1 523F 
catalysis see Enzyme catalysis 
classification, 159, 159T 
coenzymes/cofactors, 1 67 
energetics, 72-73 
mechanism of action, 74F 
multienzyme complexes, 168-169, 169F 
receptor coupling see Enzyme-coupled 

receptor(s) 
regulation, 169-171, 170F 
cooperativity, 172-173, 173F 
phosphorylation, 175-176 
see also Allosteric regulation; Feedback 
regulation 
structure, 72 

active site see Active site 
regulatory sites, 170 
tetrahedral intermediate, 160 
transition state, 160 

see also Metabolic pathway(s); individual 
enzymes 
Enzyme catalysis, 159-169 
catalytic antibodies, 160, 161 F 
diffusion-limited, 163, 163T, 169 
energetics, 72-73 

activation energy, 72, 73F, 160, 160F 
free energy, 72 
heat energy, 74 
enzyme-product complex, 164F, 166, 166F 
enzyme-substrate interaction see 

Enzyme-substrate interactions 
equilibrium points, 76, 78F 
floating ball analogies, 74F 
kinetics see Enzyme kinetics 
lysozyme catalysis, 163-165 
mechanisms, 74F, 159-160, 160, 164F, 166F 
molecular tunneling, 167-168, 168F 
Enzyme-coupled receptor(s), 892, 892, 893F, 
921-945 
classes of, 921 
see also individual receptors 
Enzyme kinetics, 159-160, 162-163FF 
double reciprocal plot, 163 
K cat , 162F, 163 
/C m , 160, 160F, 162F, 163 
Michaelis-Menten kinetics, 162F, 163 
reaction rates, 159-160, 160F, 161F 
inhibitory ligands, 173, 173F 
multienzyme complexes, 169 
steady-state, 162F-163F 
turnover number, 159-160, 162F 
\/ max ,159,160F 
Enzyme-linked immunosorbent assay (ELISA), 

588-589 
Enzyme-substrate interactions, 74, 164F 
diffusion, 74, 75F, 163 
fit, 160 
kinetics effects, 159-160, 162F 

see also Enzyme kinetics 
lysozyme, 164-165, 164F 
substrate binding, 159-160, 164-165, 164F 
Eosinophils 

attack on schistosomes, 1532, 1532F 
cytokinesis, 1556 
lineage and formation, 1452F 
structure, 1452F 

upregulation in parasitic infection, 1451 
Eosin, staining, 585F 
Eph B, gut epithelial cell migration, 1440-1441, 

1441 F 
Ephexin, 931 

Eph receptors, 922, 923T, 1 392-1 393 
Ephrin(s),922,923T,931 
receptors see Eph receptors 
signaling, gut epithelial cell migration, 

1440-1441,1 441 F 
see also specific types 
Ephrin A2, retinotectal axon guidance, 1 393 
Ephrin A5, retinotectal axon guidance, 1 392 
Ephrin A proteins, 1 392-1 393 
Ephrin B2, capillary sprouting, 1447-1448 
Ephrin B family, gut epithelium, 1440-1441 
Epidemics, 1487 



Epidermal cells, 1419-1420 

Epidermal growth factor (EGF), 1 1 03, 1 238 

Epidermal growth factor receptor (EGFR), uptake by 

endocytosis, 794 
Epidermis, 1417-1428, 1418F 

associated structures/appendages, 1418 
cells, 1419-1420 
granular layer, 1419, 141 9F 
interfollicular, 1418-1419 
plants, 1402, 1404F 
renewal by stem cells, 1417-1428 
associated signaling pathways, 1426 
rate, 1 420 
signaling, 1426 
stem cell distribution, 1422F 
waterproof barrier, 1418, 1420-1421 
Epidermolysis bullosa simplex, 986, 1 005 
Epididymis, 1294 

Epigenetic phenomena, 219, 471-473, 472F 
bacteria, 472 
in cancer, 1 208 
chromatin structure, 472 
DNA methylation see DNA methylation 
eucaryotes, 472 
genetic inheritance vs., 21 9F 
histone modification see Histone modification 
imprinting see Genomic imprinting 
inheritance see Inheritance 
mechanisms, 472-473, 472F 
mono-allelic expression, 473 
positive feedback loop, 471 , 472F 
protein aggregation state, 472F 
silencing of tumor suppressor genes, 1 236, 

1236F 
twin studies, 473, 473F 
see also Genomic imprinting 
Epigenome, 473 

Epilepsy, voltage-gated cation channels, 682 
Epinephrine see Adrenaline (epinephrine) 
Epistasis analysis, 558-559 
Epithelia, 1131,1132F 
bacterial adhesion, 1503 

cell-cell adhesion and cell junctions, 1 1 33-1 1 35, 
1134F 
anchoring junctions, 1 1 33-1 1 50 
apico-basal polarity mechanisms, 1155-1157, 

1155F, 1156F 
occluding junctions, 806, 1 1 50-1 1 58 
planar cell polarity mechanisms, 1 1 57-1 1 58, 

1157F 
see also Adherens junction(s); Cadherin(s); Cell 
adhesion; Cell junction(s); 
Desmosome(s) 
columnar, 1133 

epithelial-mesenchymal transitions, 1 141 
folding, 1142, 11 43F 
host defenses, 1 525-1 526, 1 526F 
microbial evasion of, 1502-1504 
mucus and, 1502, 1525 
as permeability barriers, 1 150 
sensory, 1429-1433 
structure, 11 34F, 1150 
see also Connective tissue 
Epithelial cell(s) 

apical domain, 798, 799F, 806, 806F 
endocytosis, 798, 799F 
glycolipids, 628 
IgA transport, 1556F 
membrane protein distribution, 645 
microvilli formation, 1007 
polarity, 798, 7 99 F, 806 F 
surface area, 1007 
tight junctions, 806 
Epithelial-mesenchymal transitions, 1 141 
Epitopes (antigenic determinants), 1 545, 1 558F 
Epitope tagging, 514, 515F, 516F 

see also Protein tags 
Epstein-Barr virus (EBV), 1228T, 1499 
Epuiopiscium fishelsoni, 1 4F 
Equilibrium 

chemical, 76-77, 77F 
free energy changes, 76, 77F 
Equilibrium constant (K) f 157-159 

relation to free energy changes, 76, 77T, 1 57, 
158F, 159F 
Equilibrium sedimentation, 511,51 2F 
ERGIC53 protein, ER to Golgi apparatus transport, 767 
Erk(MAP-kinase),929F 

ERM (ezrin, radixin, mesin) family of proteins, 1 009, 
1010F 
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ERp57, protein folding, 737F, 747F 
Error correction, DNA polymerase, 269-270, 270FF 
Erythroblasts, 1459, 1459F 
Erythrocyte(s), 1453T, 1459 
complement lesion, 1529F 
cytoskeleton, 646, 1008-1009 
development, 1454, 1457F, 1459, 1459F 

see also Erythropoiesis 
lifespan and turnover, 1459 
membrane, 61 8F, 646 
phospholipids, 626 
osmolarity regulation, 663, 663F 
Erythromycin, 385T 
Erythropoiesis, 1292-1293 

colony-stimulating factors role, 1459-1460, 1462 
erythropoietin role, 1103, 1459-1460 
Erythropoietin, 1 103, 1459-1460 
cells responsive to, 1 459 
synthesis, 1459, 1460T 
target cells and receptors, 1460T 
ES cells see Embryonic stem (E5) cells 
Escherichia coli, 14F 

double membrane, 665F 
enteropathogenic, 1504, 1504F 
flagella, tumbling, 942, 943F 
gene expression regulation, 416, 433F, 439, 439T 
lactose (Lac) operon see Lac operon 
(Escherichia coli) 
genome, 25F, 282 
chromosome, 1491 F 
replication, 282, 283F, 284F 
sequencing, 552 
size, 18T 
as model organism, 24-25, 25F 
mutations, 264, 276-277 
phylogenetics, tree of life, 1 6F 
Ppili, 1502 

promoter sequences, 338F 
RecA protein see RecA protein 
replication, 282, 283F 

clamp loader structure, 275F 
DNA polymerase structure, 268F 
refractory period, 282, 284F 
strand-directed mismatch repair, 277 
transposons, 318T 
uropathogenic, 1502, 1503F 
E-selectin, 1146 
Estradiol, 889F 
Estrogen, receptor, 891 F 
Ethical issues, reproductive cloning, 1 302-1 303 
Ethylene, 957-959, 958F 
Ethylenediaminetetraacetic acid (EDTA), 502 
Ethylene receptor, 958 
Etioplasts, 841 
Eubacteria see Bacteria 
Eucaryote(s), 14, 16F 

cells see Eucaryotic cell(s) 
epigenetics, 472 
evolutionary origin, 26-27 
general features, 26-32, 27F 
genomes 

DNA methylation, 278 
hybrid, 30 

Regulatory DNA, 31-32 
size, 18T, 30-31, 31 F 
non-coding DNA, 31, 204 

see also Noncoding DNA 
predators, 26-27 
transmembrane proteins, 635 
Eucaryotic cell(s) 

cell cycle see Cell cycle 
cell lines, 505, 506T 
cell types, 411 

see also Cell differentiation 
chemical composition, mammalian, 63T 
compartmentalization, 169 
DNA localization, 200-201 

see also Nucleus 
intracellular membranes, 169 
see also Organelle^); specific compartments 
division see Cell division 
DNA packaging, 202-203 

see also Chromatin; Chromosome(s) 
DNA replication see DNA replication 
gene expression, 345F, 41 2 
coordination, 462, 463F 
regulatory proteins see under Gene regulatory 

protein(s) 
transcriptional regulation see Transcriptional 
control of gene expression 



see also Gene expression; Transcription 

gene structure see Gene structure,eucaryotic 

metabolic rate, 169 

mRNA see Messenger RNA,eucaryotic 

protein phosphorylation, 176-177 

protein synthesis see Translation 

RNA polymerase(s) see RNA polymerase(s) 

rRNAssee Ribosomal RNA,eucaryotic 

transcription see Transcription 
Euchromatin, 220, 1070 

see also Heterochromatin 
Euglena, tree of life, 16F 
Eve (Even-skipped) gene 

combinatorial control, 448-450, 449F 

modular control of expression pattern, 448-449, 
448R449F, 1337,1339, 1339F 
Evolution 

Algae, 874, 875F 

animal-plant divergence, 955 

ATP synthesis, 870-876, 871-872, 872-875 

cancer as a microevolutionary process, 
1205-1224 

carrier protein(s), 655 

conservation in see Evolutionary conservation 

electron transport chain(s), 870-876 

gene(s), 16-17 

collagen genes, 1 185-1 186 
duplication/divergence see Gene 

duplication(s) 
see also Genome evolution 

genomes see Genome evolution 

Hemoglobin (Hb), 256-257, 256F 

homologous recombination and, 305 

innate immune system, 1524 

major events, 874F 

metabolic pathway(s), 870-872 

aerobic metabolism, 12, 27, 873-874 
carbon fixation, 872-875 

molecular see Molecular evolution 

multicellularity and cell communication, 955 

mutation rate analysis, 264 
see also Mutation rates 

myosin and kinesins, 1015-1016 

organelle(s), 697-699, 700F 
see also specific organelles 

pathogens, 1520-1521 

plant(s), 36, 840-841 

proteins see Protein evolution 

sexual reproduction and, 1271-1272, 1 271 F 

tree of life, 15-17, 16F, 23 

see also Origin of life 
Evolutionary conservation, 17, 17F 

cytoskeletal elements, 982-983 

genome sequence, 39-40, 207, 208F, 246, 250, 
292 

histones, 213 

homeodomain proteins, 138F, 420-421 

meiosis, 1280, 1286 

multicellular development, 1306F, 1307 
Evolutionary time, units, 265 
Evolutionary tracing, protein family binding-sites, 

155-156, 155F 
Excisionase, phage lambda, 326 
Excitatory postsynaptic potential (EPSP), 688 
Exocytosis, 750F, 799-809 

constitutive, 800, 801 F 

default pathway, 800, 801 F 

extracellular matrix, 800 

neurotransmitters see Synaptic vesicle(s) 

proteolytic processing of cargo, 803 

regulated see Regulated secretory pathway 

secretory proteins, 800 

secretory vesicles see Secretory vesicle(s) 
Exon(s), 346, 347 

gene structure (eucaryotic), 206 

length variation, 352, 353F 

recombination in evolution, 255 

skipping, 352, 355, 355F 
"Exon definition hypothesis," 352, 353F 
Exon-junction complexes (EJC), nonsense- 
mediated mRNA decay, 386 
Exonucleolytic proofreading, 269-270, 270F 
Exosome, 358, 485 
Experimental embryology, 1310-1311, 131 OF, 

131 IF 
Exportin(s), HIV mRNA transport, 485, 486F 
Export-ready mRNA, 327, 357, 358-359, 359F 
Expressed (DNA) sequences see Exon(s) 
Extracellular matrix (ECM), 1 1 78-11 95 

basal lamina see Basal lamina 



cell adhesion/interaction see Cell-matrix 

adhesions 
cell secreting/synthesis, 1 1 79, 1 1 79F 
components, 1165 

fibrous proteins, 145-146, 147F, 1179 

see also specific proteins 
shapes/sizes, 1166F 
see also Glycoprotein(s); specific 
macromolecules 
compressive forces and, 1 1 80-1 1 81 
degradation, 1193 
localisation, 1194 
matrix metalloproteinases, 1 194 
serine proteinases, 1194 
diversity, 1178 
exocytosis, 800 
matrix receptors, 1 169 

see also Integrin(s) 
mechanical interactions, 1189, 1189F, 1190F 
fibronectin fibril assembly and, 1 1 91 -1 1 93, 
1192F 
plant(s), 1180,1195 

see also Plant cell wall 
tensile forces and, 1 042F, 1 1 87-1 1 89 
tissue morphogenesis and repair, 1180-1181 
see also Bone; Cell adhesion; Cell junction(s) 
Extracellular signal molecule(s)/pathway(s), 
880-881,880-886 
cell response, 885 
cell size/number regulation, 1 102 
classified by range of action, 881-883 
CO, 889 

combinatorial action, 884 
competition for, 1110 
development, gonad specification, 1283 
growth factors see Growth factors 
hydrophobic, 889 
inhibitory, 1 1 02, 1 1 03, 1 1 1 0, 1 1 1 1 F 
mitogens see Mitogen(s) 
NO, 887-889, 888F 
survival factors see Survival factor(s) 
see also specific types/pathways 
Extracellular space, 1164 
Eya (Eyes absent) gene, 466F 
Eye color, as polygenic trait, 563 
Eye development, genetic control, 466, 466F 

see also specific genes 
Ey (Eyeless) gene/protein, 466, 466FF, 1 306F, 1 352 
Ezrin, 1009 



FqFt ATPase see ATP synthase 

Facilitated diffusion, 653 

FACS see Fluorescence-activated cell sorter 

Factor V, secretion from ER, 767 

FactorVIII,348F,767 

Facultative pathogens, 1490 

FAD/FADH2 

citric acid cycle, 98, 817 

electron carrier, 818-819, 81 9F 

fatty acid oxidation, 97F 

structure, 99F 
FAK (focal adhesion kinase), 937, 1176-1177, 

1176F 
Familial adenomatous polyposis coli (FAP), 1253, 

1253F 
Familial hypertrophic cardiomyopathy, 1031 
Fanconi anaemia groups A-G, DNA repair disorders, 

295T 
FAP (familial adenomatous polyposis coli), 1253, 

1253F 
Fascicles (fiber tracts), neuronal, 1387 
Fasciclin3, synapse formation, 1147 
Fas ligand, 1 120, 1571, 1573F, 1594 
Fas receptor protein, 1571, 1594 
FASTA sequence alignment, protein analysis, 531 
Fat(s) 

composition, 58-59 
see also Fatty acids 

digestion, 96 

energy source, 91, 94, 96, 824T 

storage, 8F, 91, 94 

structure, 97F 

see also Lipid(s); specific types 
Fat cells, 625, 1474-1476, 1475F, 1475FF 

gene expression regulation, glucocorticoids, 415 
Fat droplets, 1474 
Fat proteins (cadherin superfamily), 1 1 37, 1 1 38T 
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Fatty acids, 58-59, 58F, 95, 114-11 5FF 

cell membrane components, 58-59, 59F 
see also Membrane(s); Phospholipid(s) 

mobilization, 96F 

oxidation, 96-97, 97F 

synthesis, 855 

see also specific types 
Fatty acyl CoA, 96 

F-box proteins, SCF regulation, 1064 
Fc receptor(s), 1555, 1555F, 1596, 1596F, 1599F 

phagocytosis role, 788 

transcytosis, 797F 
Fc region, antibody molecule, 1552F, 1554F, 1555, 

1555F, 1559 
Feedback regulation, 1 70-1 71 

metabolic pathways, 170-171,837-838 

positive see Positive feedback 

see also Allosteric regulation; Negative feedback 
(feedback inhibition); Regulatory 
cascades 
Feed-forward loops, transcription circuits, 459, 459F 
F elements, 31 8T 
FepA protein, structure, 635 
Fermentation(s), 90, 839 

earliest cells, 870-871 

lactic acid production, 90 

pathways, 90F 

see also Glycolysis 
Ferredoxin-NADPH reductase, 852F 
Ferridoxin, 852, 852F 
Fertility, reduced, 1301 
Fertilization, 1297-1303 

capacitation and, 1294, 1297 

egg activation and sperm binding, 1270F, 1287, 
1298, 1298F 
acrosome reaction, 1297, 1298, 1299F 
Ca 2+ increases and egg activation, 912, 91 2F, 

1299 
cortical reaction, 1299, 1300, 1300F 
sperm-egg fusion mechanism, 1298-1299 

marine invertebrates, 1297 

pronuclei fusion, 1 301 , 1 301 F, 1 302F 

sperm-donated centrioles, 1301, 1301 F 

in vitro (IVF) and related procedures, 1301-1303 
capacitation, 1297 
pronuclei fusion, 1302F 

Xenopus embryo, 1057 

zygote formation, 1269 

see also Zygote 
Fever, 1488, 1534 

FGF see Fibroblast growth factor (FGF) 
FGF8 protein, 1371 

Fiber tracts (fascicles), neuronal, 1387 
Fibril-associated collagen(s), 1185, 1187-1189, 

1189F 
Fibrillar collagen(s), 1 1 84-1 189, 1 1 86T 

fibril-associated vs., 1188-1189 

see a/so Collagen(s) 
Fibrillarin, nuclear localization, 365F 
Fibrillin, 1190-1191 
Fibrin, formation, 264 

Fibrinogen, activation, fibrinopeptides, 264 
Fibrinopeptides, mutation rate, 264 
Fibroblast(s) 

cell crawling, 1036 

cell culture, phase-contrast microscopy, 504F 

chondrocyte transformation into, 1468-1469 

differentiation and changes, 464, 1467-1468 

ECM organization, 1189 

ECM synthesis, 1179, 1179F 

functions, 1467-1468 

mature and immature, 1468 

organization of microtubules, 1048F 

replicative senescence, telomeric repeats, 293 

transformation 

to cartilage and bone, 1468-1471 
to fat cells, 1475F 

in wound repair, 1467 
Fibroblast growth factor (FGF) 

in angiogenesis, 1449 

FGF8 protein, 1371 

FGF1 0, in lung development, 1 381 -1 382, 1 382F 

heparan sulfate binding, 1 1 83 
Fibronectin, 1191 

basal lamina, 1165 

connective tissue, 1188 

fibril assembly 

integrins and, 1192 

tension effects, 1191-1193, 11 92F, 11 93F 

gene, 1191 



integrin binding via RGD motif, 1 193 

protein modules, 141 F 

soluble form, 1191 

structure, 1191, 1192F 

type III fibronectin repeat, 1191, 1193 
Fibrous proteins 

basal lamina, 1165 

collagen see Collagen(s) 

ECM see Extracellular matrix (ECM) 

see also specific proteins 
Field emission guns, 604, 609 
Filaggrin protein, 1005 
Filaments, cytoskeletal see Actin/actin filaments; 

Cytoskeletal filaments 
Filamin 

actin filament packing, 994F 

actin filament web formation, 1 008-1 009 

loss, abnormal cell motility, 1008, 1009F 
Filopodia, 1006, 1037 

actin filament nucleation, 996 

angiogenesis, 1449 

steering neuronal growth cone, 1049, 1387, 
1387-1389 
Fimbriae, 1490F 
Fimbrin 

actin cross-linking, 1007, 1007F 

actin filament packing, 994F 
Firing rule, for synapse modification, 1395 
Fission yeast 

cell cycle, 1056-1057, 1056F 

shape mutants, 24F 

see also Schizosaccharomyces pombe 
Fitness (evolutionary), sexual reproduction and, 

1271,1271F 
Fixation 

carbon dioxide see Carbon fixation 

microscopy sample preparation, 585, 605, 605F 

nitrogen, 13, 13, 15F, 100 
Flagella, 1031-1034 

bacterial, 14F, 942, 943F, 1033, 1490F 

proton gradients, 821-822, 823F, 839-840, 
839F 

cilia comparison, 1031 

motility, 1031F 

sperm, 1293, 1293F 
Flagellin, 1033 

gene rearrangements, 454-455, 455F 
Flamingo protein, 1 1 37, 1 1 38T, 1 1 45, 1 1 58 
Fleas, plague transmission, 1 502, 1 502F 
FLIP, fluorescence loss in photobleaching (FLIP) 
Flip-flop switch, 457, 458F, 466 
Floor plate, of spinal cord, in growth cone 

guidance, 1388, 1388F 
Flora, normal microbial, of human body, 1486, 1501 
Flow cytometry, 576, 1059-1060, 1060F 
Flower(s), 1401F 

development, 1413, 1413-1414 

specification by homeotic selector genes, 1414F 
Flowering plants see Plant(s) 
FLP recombinase, 1348, 1349F 
FLP recombinase target (FRT), 1 348 
Fluid-phase endocytosis, 789 
Fluorescein, 586-587 
Fluorescence, 586 
Fluorescence-activated cell sorter (FACS), 502, 503F, 

1218, 1218F 
Fluorescence in situ hybridization (FISH), 

chromosome puffs, 239 
Fluorescence loss in photobleaching (FLIP), 643, 

644F,753F 
Fluorescence microscopy, 586-589 

applications, 587 

confocal, 590-592, 591 F, 592F 

fluorescent dyes, 586-587, 587F 

immunofluorescence, 587F, 588-589, 588F, 589F 

microscopes, 586, 586F, 591 F 

nanoparticles, 587, 588F 

quantum dots, 587, 588F 

single-cell expression analysis, 576 

total internal, 599, 600F 
Fluorescence recovery after photobleaching (FRAP), 

595, 596F, 643, 644F, 753F 
Fluorescence resonance energy transfer (FRET), 

526, 526F, 593-594, 595F 
Fluorescent analog cytochemistry, 597 
Fluorescent dyes, 586-587, 587F 
Foam cells, 1500, 1500F 
Focal adhesion kinase (FAK), 937, 1 1 76-1 1 77, 

1176F 
Focal adhesions, 1 1 70, 1 1 76-1 1 77, 1 1 76F 



Follicle cells, 1290-1291, 1290F 
Follicle stimulating hormone (FSH), 1292 
Follistatin, 940 
Food molecules 

nonfermentable, evolution of ability to use, 872 

obtaining energy, 88-103, 100F 
see also Catabolism 

storage, 91,94-95 
Forensic genetics, 547F 
Formaldehyde, tissue fixation, 585 
Formic acid oxidation, 871, 871F 
Formin, 994F, 998, 999F, 1000F, 1094 
Formylmethionine, 380, 1 526, 1 531 
Fossil record, phylogenetic tree comparisons, 248 
Founder cell, in C elegans, 1 322-1 323 
Foxp3, regulatory Tcell(s), 1574 
Fractionation see Cell fractionation 
Frameshifting, translational see Translational 

frameshifting 
FRAP see Fluorescence recovery after 
photobleaching (FRAP) 
Free energy, 75-78, 1 18-1 19FF 

catalysis, 72, 164 

see also Enzyme catalysis 

changes in see Free energy change Delta G (AG) 

favorable vs. unfavorable reactions, 75F, 
824-825, 825 F 

importance for life, 1, 8 

sources, for organotrophs vs phototrophs vs 
lithotrophs, 12 

translation accuracy, 385 
Free energy change Delta G (AG), 75-78, 1 19F 

calculation, redox potentials, 830 

electron transfer, 829 

equilibrium constants, 76, 77T 

reaction concentration effects, 76 

reaction coupling, 75F, 78F, 825, 825F 

sequential reactions, 77-78 
Free radicals, 832-833, 874, 1 532 
Freeze-etch electron microscopy, 61 OF 
FRET see Fluorescence resonance energy transfer 

(FRET) 
Fringe family of glycosyltransferases, 947 
Frizzled gene/protein, 1316T 

C. elegans development, 1 324, 1 325 

planar cell polarity, 1 1 58 

polarity-signaling pathway, 1359 
Frog(s),411,413F 

see also Xenopus laevis 
Frozen sections, 585 
Fructose 1, 6-bisphosphate, 120F 
Fructose 6-phosphate, 1 20F 
Fructose, structure, 112F 
Fruit fly see Drosophila meianogaster 
F-site (ribosome binding), 375F, 376 
FtsZ protein, 989-991, 989F, 989FF 
F-type pumps (ATPases), 659-660, 660F 
Fugu rubripes see Puffer fish (Fugu rubripes) 
Fumarase, 123F 
Fumarate, 122F, 123F 

Functional genomics, tagged knockouts, 569-571 
Fungi 

antibiotic production, 383-384 

dimorphism, 1494 

evolutionary origin, 29-30 

pathogenic, 1494 

prion proteins, 398 
Fushitarazu mutant, Drosophila, 1337 
Fusion proteins 

virus, membrane fusion, 764-765 

see also Protein tags; individual fusions 



Delta G (AG) see Free energy change Delta G (AG) 

G phase of cell cycle see Cell cycle 

G r Cdk, 1062, 1066, 1103-1105 

Gi-cyclins, 1062 

G] phase of cell cycle see Cell cycle; Cell cycle 
control 

Gi/S-Cdk, 1062, 1066, 1103-1105 

Gi/S-cyclins, 1062 

G 2 /M checkpoint, 1061, 1062, 1066, 1 105 

G2 phase of cell cycle see Cell cycle 

GABA (y-aminobutyric acid) 
neurotransmitter role, 684, 808 
receptors, psychoactive drug effects, 686 

Gag {viral capsid) proteins, translational 

frameshifting, 383, 384F, 485F, 1498F, 
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1521F 
GAGs see Glycosaminoglycans (GAGs) 
Gain-of-function mutation(s), 528F, 557-558, 564F 
Gal4 gene/protein, 1350F 

DNA sequence recognition, 41 8T 
GAL4/UAS technique, 1 350F 
Galactocerebroside, 628F 
Galactose, in N-linked oligosaccharides, 773F 
Gamete(s), 1269 

haploid state, 1269,1272, 1281 
production by cell division see Meiosis 
see also Egg (ovum); Spermatozoa 
Gamma aminobutyric acid (GABA) see GABA 

(y-aminobutyricacid) 
Ganglioside(s) 
G M1 ,628F,629 
membranes, 628 
GAP see GTPase-activating proteins (GAPs) 
Gap genes, 1 333, 1 337-1 338, 1 337F, 1 338F 
Gapjunction(s), 1158-1164 
cell communication, 884, 884F 
channel proteins, 667 
columnar epithelium, 1 1 34, 1 1 34F 
dynamics/turnover, 1160-1161, 1160F 
electrical coupling, 1 1 58-1 1 59 
functions, 1161 

metabolic coupling, 1 1 58-1 1 59 
oocyte-follicle cell, 1290 
permeability regulation, 1161-1162, 1162F 
plasmodesmata vs., 1 163 
size determination, 1 1 59, 1 1 59F 
structure, 11 59, 11 59-1 161, 11 59F, 11 60F 

see also Connexins 
structure-function relationship, 1159-1160 
GARP gene regulatory proteins, family members in 

different eucaryotes, HOOT 
Gastric acid secretion, 1436 
Gastric cancer, Helicobacter pylori association, 1 503 
Gastrula, 1363F 
Gastrulation, 1307 

gut origin by, 1307, 1307F 
sea urchin, 1307F 

Xenopus see Xenopus laevis development 
GATA-1, 418T, 451 F, 1456 
Gated transport, between nucleus and cytoplasm, 

700, 701 F 
Gating, ion channels see Ion channel(s) 
Gcn4 protein, 41 8T, 424F, 490-491 
GEF see Guanine nucleotide exchange factor 
Geiger counter, 601 
Gel(s) 

GAGs and, 1179-1180 
gel-forming proteins, 1006, 1009F 
Gel-filtration chromatography, 512, 514F 
matrices, 534F 
protein purification, 535F 
Gel-mobility shift assay, 427-428, 428, 428F 
Gelsolin,994F, 1000 
Gel transfer see Blotting 
Geminin, destruction by APC/C, 1068-1069 
GEMS (gemini of coiled bodies), 363-364, 364, 365F 
Gene(s) 

amplification 

in cancer, oncogenes, 1237-1239 
PCR see Polymerase chain reaction (PCR) 
cancer-critical see Cancer-critical genes 
chromosomal location, 204-205, 204F, 554F 

see also Genetic mapping 
definition, 7-8, 195, 204, 440, 554F 
history of concept, 480-481 
modification, 480-481 
DNA structure and, 199-200, 202 

see also DNA structure; Genetic code 
evolutionary innovation, 19 
expression see Gene expression 
function(s) 

antibiotic resistance, 31 8F 
biochemical analysis, 23-24 
deduction from mutant phenotype, 23-24, 
553 
tagged knockouts, 570 
targeted mutations, 575 
housekeeping, 434, 470 
prediction from sequence homology, 22 
generation from pre-existing genes, 18-19 
homologs, 20-21, 20F, 21 F, 246 
identification, 207, 329, 429, 560F 

see also specific methods 
imprinted see Genomic imprinting 
localization see Genetic mapping 



loss, cancer cells, 1232 

see also Cancer-critical genes 
manipulation see Recombinant DNA technology; 

Transgenic organism(s) 
numbers, 204, 204F 
orthologs, 20-21, 20F, 21 F 
paralogs, 20-21, 20F, 21 F 
pseudogenes see Pseudogenes 
sequencing see DNA sequencing 
structure see Gene structure 
transcription units, 336 
see also entries beginning gene/genetic 
Gene activator proteins see Transcriptional 

activator(s) 
Gene batteries, coordinated by critical regulatory 

protein, 463-464 
Gene clock, 461 F 
Gene cloning, 542 

PCR cloning, 544-546, 545F 
"shotgun" approach, 542 
see also DNA cloning 
Gene control region(s), 438F, 440, 441 F 
enhancers, 438F 
insulators, 452F, 453 
LCRs, 450-452, 451 F 
promoters see Promoter elements 
regulatory modules 

in Drosophila {eve gene), 448, 449F 
in mammals, 450-453, 450F, 45 1 F, 452F 
see also Gene regulatory protein(s) 
Gene conversion, 3 1 4-315 

Holliday junctions, 311,314-315 
mismatch conversion, 31 5F 
in tumor suppressor gene loss, 1 236F 
Gene duplication(s), 18, 19F, 40F, 246 
collagen genes, 1185-1186 
fate of duplicates, 255 
functional divergence, 40F, 246, 254-255 

see also Gene families 
genome evolution, 253-254 
protein kinase evolution, 1 76 
in protein module shuffling, 140-141, 141F 
rates, 255 

vertebrate evolution, 38-39 
Xenopus genome, 38-39, 39F 
zebrafish genomes, 255 
see also Gene families; Genetic redundancy; 
Pseudogenes 
Gene expression, 6, 199F, 200F 
analysis, 414FF, 553-576 
2D-PAGE,412,414F 
approaches, 553 

see also Genetics 
DNA microarrays, 34-35, 41 2, 41 4F, 537, 
574-575, 574F, 575F 
cluster analysis, 575, 575F 
reporter gene assay, 572-573, 572F 
cancer and, 1237-1239 
chromatin condensation, 285-286 
heterochromatin role, 220 
see also Chromatin; Chromosome 

condensation; Gene silencing; 
Heterochromatin 
control/regulation see Gene expression 

regulation 
DNA methylation and, 467-468, 468F 
DNA signals for transcription, 336-340 

see also Gene control region(s) 
efficiency, 331, 332F 
eucaryotic, 345F, 41 2, 462, 463f 
heritability, 454, 458, 458F, 466 

see also DNA methylation; Epigenetic 
phenomena; Gene silencing 
human, 412 

position effects, 220, 221 F 
position effect variegation (PEV), 221, 221 F 
procaryotic, 345F 

proportion of genome expressed, 412 
transcription units, 334F, 336 
variation between cell types, 412 
see also Transcription; Translation 
Gene expression regulation, 41 1 -499, 45 1 F 
combinatorial control, 424-425, 425, 425F, 465, 

465F 
controllable steps, 379F, 41 5 
coordinated control, 462-463, 462F, 464, 464F 
DNA structure role, 41 6-41 7, 41 7F 
evolution, 439-440 

gene control regions see Gene control region(s) 
genetic switches see Genetic switches 



hormonal, glucocorticoid regulation, 41 2, 41 5, 

463, 463 F 
manipulation site-specific recombination, 

325-326 
memory devices, 458, 458F 
mRNA degradation control, 379F, 415, 492-493 
"noise," 476-477 
oscillatory devices, 461 , 461 F 
post-transcriptional see Post-transcriptional 

regulation 
protein activity control, 379F, 41 5 
regulatory proteins see Gene regulatory 

protein(s) 
transcriptional see Transcriptional control of 

gene expression 
see also Chromosome puffs; Drosophila 
development 
Gene families 

common to archaea, bacteria, and eucaryotes, 

23, 24T 
evolution, 19-21 

duplication and divergence, 255 
globin genes, 256-257, 256F 
see a/so Globin genes 
see also Protein families 
Gene knockouts 

deletion cassettes, 570 
functional genomics, 569-571 
mouse, 566-568, 567F, 568F 
production, 567F 
Gene oscillation, synthetic biology, 461 F 
General recombination see Homologous 

recombination 
Gene regulatory circuits, mathematical modeling, 

35, 35F 
Gene regulatory protein(s), 222, 343, 343F, 
379-471, 41 8T 
analysis techniques 

affinity chromatography, 428-429, 429F 
chromatin immunoprecipitation, 431-432, 

432F 
DNA sequence determination, 429-430, 430F 
gel mobility shift assay, 427-428, 428F 
discovery, 41 6 

DNA sequence recognition, 41 8-41 9, 41 8T 
prediction, 426 

sequence determination, 429-430, 430F 
see also DNA-binding motifs 
eucaryotic, 399-409, 434, 440-441 

activators see Transcriptional activator(s) 
complex assembly, 445-447, 447, 447FF, 
464-465 
see also Combinatorial control 
coordinated gene control, 462-463, 463F, 464, 

464F 
developmental expression, 448, 448F, 449FF 
diversity, 31 

downstream gene batteries, 463-464 
enhancesome, 446-447, 447F 
glucocorticoid receptor protein, 463, 463F 
major families, MOOT 
mechanisms, 438F, 442F, 445, 445F, 446F 
modular nature, 442, 442F, 447-448, 449FF 
multicellular development and, 465, 465F 

see also Drosophila development 
regulation, 450-451, 451 F 
repressors see Repressor protein(s) 
transcriptional synergy, 444-445, 444F, 

464-465 
see also Transcription factors,general (TFII) 
heterodimerization, 424-425, 425F 

see also Combinatorial control 
latent see Latent gene regulatory proteins 
negative control, 434, 436F 

see also Repressor protein(s) 
positive control, 434, 436F 

see also Transcriptional activator(s) 
protein-DNA interactions, 41 6, 41 7F, 41 8-41 9, 
41 9F 
see also DNA-binding motifs (proteins); 
Protein-DNA interactions 
protein-protein interactions see Combinatorial 

control 
recognition sites see Genetic switches 
transcription factors see Transcription 

factors,general (TFII) 
transcription regulation see Transcriptional 

control of gene expression 
see also specific proteins 
Gene replacement, genetic engineering, 566F 
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Gene repressor proteins see Repressor protein(s) 
Gene silencing 

CG {CpG) islands, DNA damage, 300-301 
heterochromatin role, 220, 221 F 
mating-type switching in yeast, 456, 457F 
tumor suppressor genes, 1236, 1236F 
see also DNA methylation; Genomic imprinting; 
Repetitive DNA; X-inactivation 
Gene structure 

eucaryotic, 347, 348F 
exons see Exon(s) 
gene control region see Gene control 

region(s) 
human, 205-207, 206, 206T, 348F 
intron-exon organization, 348 
intronsseelntron(s) 
pre-mRNA processing, 347 
see also Splicing 
procaryotic 

bacterial chromosomes, 202, 1491 F 
operons, 433F, 462-463 
Genetic code, 199-200 
codonsseeCodons 
DNA structure and heredity, 199 
evolution, 300, 407-408 
reading frames, 368, 368F 
redundancy, 247, 354, 367, 368 
universal, 367 
variations, 382-383 

Candida CUG codon, 383 
mitochondrial variations, 367, 383, 862, 862T 
translational recoding, 383, 383F, 384F 
see also Translation 
Genetic disease 

integrin mutations, 1172-1174 
somatic nuclear transplantation, 508 
Genetic diversity, meiosis role, 1271, 1279-1280, 

1279F 
Genetic engineering, 532, 564-568 

dominant negative mutant generation, 564, 

564FF 
gene replacement, 566F 
germ cell gene insertion, 565-566 
inducible promoters, 564 
protein sequence alteration, 564-565, 565F 
sequence tags, 570 

transgenic organisms see Transgenic organism(s) 
see also Genetics; Recombinant DNA technology; 
specific techniques 
Genetic exchange, homologous recombination, 

305 
Genetic information 

defining feature of life, 2 
not necessarily encoded in nucleic acid (prions), 
1498 
Genetic instability 

cancer, 1214-1215, 1254, 1254F 
DNA repair defects, 295T 
from p53 loss after telomere shortening, 1217 
prevention, 31 5F 
Genetic mapping 

based on meiotic recombination, 554F 
linkage analysis, 559-560, 560F 
see also DNA sequencing 
Genetic mosaic, Drosophila development, 1348 
Genetic recombination see Recombination 
Genetic redundancy, 39 
Genetics 

classical, 553, 554-527 

chromosomes see Chromosome(s) 
complementation tests, 528F, 558 
genes see Gene(s) 
genotype, 553 
haploid-diploid cycle, 554F 

see also Meiosis 
heterozygote, 554F 
historical aspects, 195-196, 195-197 
homozygote, 554F 
linkage analysis, 560F 
mutation see Mutation(s) 
phenotype, 553, 554F 
recombination see Recombination 
two genes or one?, 528F 

see also Complementation tests 
reverse, 553, 563, 563-564, 575 

dominant negative mutant generation, 564FF 
gain-of-function mutant generation, 564F 
gene replacement, 567F 

see also Recombination; Transgenic 
organism(s) 



germ-line mutations, 575 

protein sequence alteration, 565F 

RNA interference (RNAi), 572 

site-directed mutagenesis, 565F 

techniques, 563-564 

in vitro mutagenesis, 575 

see also DNA cloning; Genetic engineering 
Genetic screens, 553, 553, 555 
mutant identification, 556-557 

behavioral, 556, 557F 
Genetic switches 

complex developmental, 447-448, 448F 

see also Eve (even-skipped) gene 
DNA sequences, 418 

see also DNA-binding motifs 
evolution, 439-440 
mechanisms, 432-454, 436F 

operators, 433-434, 433F, 434F 

repressors see Repressor protein(s) 

transcriptional activators see Transcriptional 
activator(s) 
Genital ridge, 1283 
Genome(s), 8, 200 

Arabidopsis thaliana, 1 42 
chromosomes see Chromosome(s) 
cloning, 541-542 

see also DNA cloning; DNA libraries 
completely sequenced organisms, 18T 
complexity, 142, 204-205 

CG (CpG) islands, 470-471 

gene numbers in different organisms, 18T, 
863-864, 864F 

noncoding regions see Noncoding DNA 

variation in intron/exon length, 352, 353F 
composition, 329 
conserved synteny, 207, 208F, 250 
diversity, 11-26 
d u pi i cation {s),Xenopus, 38-39 
evolution see Genome evolution 
HIV, 485F 

human see Human genome; Human genome 
information coding, 329, 330F, 331, 331F 

central dogma see Messenger RNA; 
Transcription; Translation 

potential, 348 

see also Genome evolution 

see also DNA (deoxyribonucleic acid); Gene 
expression; Genetic code 
mitochondrial see Mitochondrial genome 
mouse see Mouse 

multicellular development control, 31-32, 32F 
organelles 

chloroplast see Chloroplast(s), genome 

mitochondrial see Mitochondrial genome 
replication, 200 

see also Chromosome replication; DNA 
replication 
sequencing see Genome sequencing 
size variations, 18T, 30-31, 200, 246, 251 

minimal for life, 10, 1 1 F 

minimum gene complement (Mycoplasma), 
10,570-571 

see also Genome evolution 
transposon movements, 323 
see also individual species 
Genome evolution, 245-260 
accelerated change, 253, 254F 
additions, 250, 258 
CG (CpG) islands, 434, 470 
chromosome(s), 207, 208F 
gene duplications, 253-254 
gene number vs complexity, 246 
human variation, 258-260, 259-260, 259F 
human vs chimpanzee, 247, 247F, 248F 
human vs mouse, 249-250, 249F 
human vs puffer fish, 251 F 
intron-exon gene structure, 348 
losses, 250, 258 
mechanisms, 246 

exon shuffling, 257 

gene duplication see Gene duplication(s) 

genome duplication, 38-39, 140-141, 255 

mutation see Mutation(s) 

see also Mutation rates 
neutral mutations, 257-258 
organellas 859-860, 860F, 861 F 
protein module integration, 140 
sequence analysis, 207, 246 

conservation, 39-40, 246 

conserved sequence identification, 252-253, 



252F 
conserved synteny, 207, 208F, 250 
mammals, conservation in, 39-40 
single nucleotide polymorphisms (SNPs), 259 
see also Genome sequencing 

sequence divergence, 247 
ancestor tracing, 247, 248F 
human-mouse, 249F 
molecular clock hypothesis, 248-249 
phylogenetic trees, 247, 247F 
see also Comparative genomics 

size changes, 251 

synteny bfocks, 250 

transposons, 249F, 250F, 323, 323F 

zebrafish, 255 
Genome instability 

DNA repair defects, 295T 

prevention, 31 5F 
Genome sequencing, 18T, 142, 551-552 

comparative see Comparative genomics 

conserved vs. nonconserved regions, 207 

homology searches, domain fusions, 560F 

see also Genome evolution 
Genomic DNA clone, 542 

PCR cloning, 544-546, 545F, 546F 
Genomic DNA libraries, 542 

human, 542F 

vs cDNA, 543F, 544 
Genomic imprinting, 468-470, 469F, 1 287 

as epigenetic phenomena, 472-473 

insulin-like growth factor-2, 469-470 

see also Gene silencing; X-inactivation 
Genomic plasticity, retroviruses, 1520-1521 
Genomics 

comparative see Comparative genomics 

genome sequence analysis, 552 

tagged knockouts, 569-571 
Genotype, 553, 554F 
Geochemical energy, fueling life, 12 
Germ cell(s), 1271 

Caenorhabditis elegans f 1 322-1 323 

determinants, 1282, 1282F 

gene insertion, genetic engineering, 565-566 

see also Egg (ovum); Gamete(s); Spermatozoa 
Germinal centers, antibody affinity maturation, 

1566 
Germination, energy reserves, 95 
Germ layers, 1365-1366, 1365F 
Germ-line 

DNA, antibody genes, 1 563F 

functions, 265F 

gene replacement see Transgenic organism(s) 

mutations, 264, 265, 575 
reverse genetics, 575 

P granules role, 1323-1324 

Vasa protein in determination of, 1324 
GFP see Green fluorescent protein (GFP) 
Giant chromosomes see Polytene chromosome(s) 
Giardia, in tree of life, 16F 
Gibberellicacid(GA3), 1406F 
Gibberellins, 957, 1403 
Gibbs free energy see Free energy 
Glandular fever, Epstein-Barr virus (EBV), 1499 
Glanzmann's disease, 1174 
GlcNAc phosphotransferase, 785, 786F 
Gleevec (STI-751), chronic myelogenous leukemia 

therapy, 1261 
Glial cells, 680F, 1383-1384 

myelin production, 678 

radial, 1385F 
Glioblastoma, epidermal growth factor (EGF), 1 238 
Gli protein, 951 
Globin(s), 256 

fetal, 256, 256F 

genes see Globin genes 

homologies and evolutionary tree, 21 F, 256F 

see also specific types 
p-Globin gene, 256 

abnormal pre-mRNA processing, 355F 

chromatin structure, 288 

gene expression regulation, 450-452, 451 F 

human DNA sequence, 199F, 200F 

human vs mouse, 250F, 323F 

mutations, 288, 288F, 355F, 452 

see also Thalassemia, p-globin abnormalities 

structure, 348F 
Globin genes 

expression regulation, 450-452, 45 IF, 452F 
see also Locus control region(s) 

gene clusters, 450-451 , 451 F, 452F 
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evolution by duplication, 256-257, 256F 

a-globin, 256-257 

p-globin see p-Globin gene 

5-globin, 256-257, 451,452F 

y-globin,451,452F 

pseudogenes, 257 
Glomeruli, of olfactory bulb, 1430 
Glucagon, 907T 
Glucocorticoid receptor protein, gene expression 

regulation, 463, 463F 
Glucocorticoids, gene expression regulation, 41 2, 

415,463,463F 
Gluconeogenesis, 102 
Glucose 

disaccharide biosynthesis, 57F 

oxidation, 57, 1 0OF, 120F 
total energy yield, 823, 824T 
see also Glycolysis 

structure, 56F, 11 2F 

transport, 657F, 798 
Glucose 6-phosphate, 120F 
Glucose transporter, transcytosis, 798 
Glucosyl transferase, protein folding function, 739 
Glutamate (glutamic acid) 

neurotransmitter role, 684, 691, 808 

structure, 129F 
Glutamine, structure, 129F 
Glutamine synthase, ATP hydrolysis, 81 F 
Glutaraldehyde, 585, 605, 605F 
Glutathione dismutase, 868 
Glutathione S-transferase (GST), protein tags, 

514-515, 515F,516F 
Glyceraldehyde, 112F 

Glyceraldehyde3-phosphate,91,92F, 120F, 121F, 
846 

carbohydrate source, 854 

carbon-fixation cycle, 845, 845F 

glycolysis, 854-855 
Glyceraldehyde 3-phosphate dehydrogenase, 91, 

121F 
Glycine, 129F 

neurotransmitter role, 684, 808 
Glycocalyx, 636, 637F 
Glycogen, 91, 94, 94F 

Glycogen synthase kinase 3p (GSK30), 949, 949F 
Glycolipid(s) 

aggregates, 115F 

membranes, 628-629, 628F 

side chain diversity, 636 

structure, 115F 

synthesis in ER, 744-745 
Glycolysis, 88-91 

anaerobic, 88, 839 

see also Fermentation(s) 

ATP production (net gain), 89, 121 F 

chloroplasts, glyceraldehyde 3-phosphate, 
854-855 

energetics, 91 

energy storage by reaction coupling, 91, 92F 

enzymes, 88-89, 88-89, 91, 92F, 120-1 21 FF 
see also specific enzymes 

evolution, 88 

metabolic interrelationships, 102F 

NAD+/NADH role, 89 

pathway, 89F, 91, 92F, 93F, 120-1 21 FF 
Glycoprotein(s), 635-636 

cell-cell adhesion, 636 

synthesis, 736-738, 737F, 747F 
Glycosaminoglycans (GAGs), 1179-1180 

basal lamina, 1165 

compressive forces, 1180-1181 

protein attachment see Proteoglycan(s) 

proteoglycan production, 776 

structure, 1179, 11 79F, 11 80F 

see also specific types 
Glycosphingolipid(s) 

lipid rafts, 807 

protein sorting, 806-807 

synthesis, endoplasmic reticulum (ER), 744-745 
Glycosylation, 776-777 

proteins see Protein glycosylation 
Glycosylphosphatidylinositol (GP!) anchor, 630, 636, 
742-743, 742F, 807 

cell-surface proteoglycans, 1 183 
Glyoxylate cycle, 722 
Glyoxysome(s), 722, 722F 
Glypican, 1184 
Goblet cells, 1434-1436 

Golgi apparatus, 772 

intestinal, 1526F 



mucus secretion, 772, 774F, 1437 
respiratory epithelium, 1435, 1435F 
structure, 774F 
Golgi apparatus (Golgi complex), 766 
antigen processing/presentation, 1583F 
cellular distribution, 772 
cis Golgi network (CGN) see Cis Golgi network 

<CGN) 
cisternae, 771-772, 771F,778F 
cisternal maturation model, of transport 

through, 778 
clathrin-coated vesicle formation, 754-758, 

756F 
distribution during cytokinesis, 1098 
enzymes, importance, 731 
exit face, 772, 778F 

functional compartmentalization, 778F 
membrane amounts, 697T 
membrane proteins, glycosylation, 635 
microtubule motors, membrane organization, 

1021 
molecular compartmentalization, 773 
oligosaccharide chain processing, 773-775, 

775F, 777 
position in cell, 697 
proteoglycan assembly, 775-776 
selective protein retention, 731 
structure, 696, 771-773, 771 F, 778F 
trans face (exit face), 772, 778F 
trans Golgi network (TGN) see Trans Golgi 

network (TGN) 
transport from ER, 766-787 

see also Endoplasmic reticulum (ER) 
transport through, 777-778 
transport to cell surface from, 799-809 

see also Exocytosis 
transport to lysosomes, 779-787, 785F 
GlcNAc phosphotransferase, 785, 786F 
M6P addition, 785 
M6P receptor, 783-784 
vesicular transport model, of transport through, 

777-778 
volume, 697T 
Gonad(s), development 

PGC migration, 1283, 1283F, 1285 
5ry gene and testes development, 1 283-1 285, 
1286F 
Gooseberry mutant, Drosophila, 1 337 
Gp 1 20 protein, 765F 
GPI-anchored proteins see 

Glycosylphosphatidylinositol (GPI) 
anchor 
G protein(s) {trimeric GTP-binding 

proteins/ATPases), 1 79, 892, 905 
asubunit, 905, 906F 
py complex, 905, 906F 
activation, 905-906, 906F 
downstream effects 
CAMP, 905-906 

inositol-phospholipid signaling, 909-91 1 
on ion channels, 906-907 
Gi2,906 

Gj (inhibitory G protein), 906-907, 91 9T 
G ,919T 
G if, 91 7, 91 9T 
G q ,919T 

G 5 (stimulatory G protein), 906-907, 91 9T 
Gt(transducin), 918, 919T 
inactivation, 906F 

monomeric GTPases vs G proteins, 896 
receptor coupling see G protein-coupled 
receptors (GPCRs) 
G protein-coupled receptors (GPCRs), 641, 892, 

893F, 903F, 904, 904-921, 904F, 905F 
cyclic AMP and, 905-908, 906F, 907F, 907T 
desensitization, 920, 920F 
Flamingo proteins, 1 145 
inositol phospholipids and, 91 OF, 91 0T, 91 1 F, 

91 1FF, 919-921 
olfactory receptors, 917 
photoreceptors, 917-918 
regulating ion channels, 916 
signal amplification, 919-920, 920F 
visual transduction, 91 7-918, 91 8F 
see also specific receptors 
G protein-linked receptor kinases (GRKs), 920, 920F 
Graft rejection, 1575 
Gram-negative bacteria, 1490F, 1527 
Gram-positive bacteria, 1490F, 1527 
Granular layer, of epidermis, 1419, 1419F 



Granules, phagocytic cells, 1532 
Granulocyte(s), 1460, 1460T 

see also Basophils; Eosinophils; Neutrophil(s) 
Granulocyte CSF (G-CSF), 1460T 
Granulocyte/macrophage CSF (GM-CSF), 1460T, 

1461F 
Granulocyte/macrophage (GM) progenitor cells, 

1460 
Granulosa cells, 1291, 1291F, 1298 
GranzymeB protease, 1571, 1573F 
Grb-2 protein, 928 

Green fluorescent protein (GFP), 592-593, 594F 
gene expression regulation, 476-477 
intracellular visualization see Fluorescence 

resonance energy transfer (FRET) 
structure, 593F 

vesicular and intracellular transport studies, 753F 
Green sulfur bacteria, electron transport, 873F 
Grids, electron microscopy, 605, 605F 
GroEL chaperone, 390F 
Ground tissue, of plants, 1402, 1404F 
Group I introns, 334, 356, 356F 
Group II introns, self-splicing mechanism, 356, 356F 
Growth cones, neuronal, 1048, 1049F, 1386-1389, 
1386F, 1447F 
change of sensibilities, 1389 
collapse, 932F 

steering developing neurite, 1387-1389 
Growth factors, 1102 
competition for, 1110 
mechanisms of action, 1 1 08, 1 1 09F 
see also Mitogen(s); specific factors 
Growth regulators in plants (plant hormones), 

957-961 
Growth, retardation, DNA repair defects, 295T 
GSK3p,950,952F 
GTP (guanosine triphosphate), 98 
protein phosphorylation, 178-179 
structure, 99F 
GTPase see GTP-binding proteins (GTPases) 
GTPase-activating proteins (GAPs), 179, 180F,896, 

896F 
GTP-binding proteins (GTPases), 178-181 
cellular regulators, 178-179, 179F, 180F 

phosphorylation vs., 180F 
monomeric, 179-180, 179F, 180F, 758-760 
Rab family (vesicle docking), 760-761 
Ran GTPase, 708-709, 709F, 71 OF 
Ras family see Ras GTPase(s) 
Rho family see Rho GTPase(s) 
Sari (vesicular coat assembly), 759, 759F 
trimeric (G proteins) vs., 896 
see also specific GTPases 
movement generators, 179-181, 181 
regulation, 179, 180F 

see also GTPase-activating proteins (GAPs); 
Guanine nucleotide exchange factor 
(GEF) 
trimeric see G protein(s); G protein(s) (trimeric 
GTP-binding proteins/ATPases) 
GTP caps, microtubules, 977, 979, 979F 
GTP exchange factors, 377F 
Guanine, 197 

base-pairing, 198F 
deamination, 301 F 
RNA structure, 332 
structure, 116F, 301 F 
Guanine nucleotide disocciation inhibitors (GDIs), 

931 
Guanine nucleotide exchange factor (GEF), 896, 
896F 
elF-2 recycling, 488-489, 490F 
functional role, 179, 180F 
microtubule association, 1 043-1 044 
mitotic spindle formation, 1081 
Ras GEF, 179 
RhoA activation, 1095 
Guanosine triphosphate see GTP (guanosine 

triphosphate) 
Guanyl transferase, 5' mRNA capping, 346, 347F 
Guanylyl cyclase 

regulation, 888F, 889 
see also Receptor guanylyl cyclases 
Guide RNAs, 361, 362F 

RNA editing, 483, 484F 
Gut 

cell migration, 1440-1441, 1440F 
cell turnover, 1436-1437 
development, 1307, 1307F, 1322F 
endocrine cells (enteroendocrine cells), 1 437, 
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1437F 
histology, 1436, 1436-1444, 1436F 
Vibrio choleroe colonization of, 1 503 



H 

H + (proton) see Hydrogen ion (H+, proton) 
Haemophilus influenzae, genome sequence, 

551-552 
Hair cells, 609F, 1429, 1430-1432, 1431 FF 
Hair follicle, 1418F, 1426 
Hairpin structures, RNA, 338, 403F 
Hairy gene, 1337, 1371 
Half-life, radioisotopes, 601, 601 T 
Halobactehum salinarum, 640-642, 640F 
Haloferax, in tree of life, 1 6F 
Haploid state 

diploid state vs., 34 

in sexual reproduction, 1269-1271, 1270F, 1272, 
1281 
see a/so Meiosis 

yeast, 34 
Haplotype blocks, 561-562, 562F 
Haplotype maps (hapmaps), 561, 562 
HATS see Histone acetyltransferases (HATS) 
HDACs see Histone deacetylases (HDACs) 
HDVfold,406F 

Head polymerization, 86, 87F, 373 
Heart muscle see Cardiac muscle 
Heat energy, 67-68, 69F, 11 8F 

brown adipose tissue, 838 

enzyme catalysis, 74 
Heat-shock 

induced somatic recombination, 1348 

proteins (HSPs), 388 

see also Chaperones; specific proteins 

see also DHA repair 
Heavy (H) chain (antibodies), 1 57F, 1 553-1 557, 
1554F 

constant domains, 1558-1559, 1559F, 1560F 

gene segment pool, 1 563F 

IgM, 1555F 

repeating domains, 1559 

variable domains, 1558-1559, 1559F, 1560F 
Heavy isotopes, equilibrium sedimentation, 51 1 
Hebb rule, 1396 
Hedgehog protein, 950 
Hedgehog signaling pathway, 950 

absent in Arabidopsis, 1400 

animal development, 1316F 

Drosophila, 1340, 1353, 1353F 

in gut, 1441 

in skin, 1426 
Helicobacter pylori 

chronic illness, 1500 

colonization of stomach, 1503 

genome size, 18T 

stomach cancer causation, 1 228, 1 229 
Helix 

common molecular structure, 143, 145 

double <DNA) see DNA structure 

handedness, 143 

properties, 146F 
Helix destabilizing proteins see Single-strand DNA- 

binding proteins (SSBs) 
Helix-loop-helix motif, 425-426, 426F 
Helix-loop-helix proteins, 425-426, 426F 
Helix-turn-helix motif, 419-420, 420F 

sequence recognition, 420-421, 421 F 

structure, 420F 
Helix-turn-helix proteins, 420, 420F 

DNA-binding motif see Helix-turn-helix motif 

homeodomain proteins see Homeodomain 
proteins 
Helper T-cell(s) Oh), 1570, 1574-1575, 1583-1585 

antigen recognition, 1583-1585 

B cell activation, 1 574- 1 575, 1 597-1 598 

CD4 co-receptor role, 1580-1581, 1581F 

class switching, 1568 

cytokine production by, 1598T 

immunological synapse, 1597T 

selection in thymus, 1585-1586, 1587F 

T H 1 vs. T H 2 choice, 1574F, 1592-1594, 1593F 

typel (T H 1), 1574F, 1575, 1592 

activation of macrophages, 1594-1595, 

1595F 
dendritic cell activation, 1 593F 
T H 2 vs., 1574F, 1592-1594, 1593F 

type 2 (T H 2), 1 574F, 1 575, 1 592-1 593 



activation of B cells, 1598 
cytokines, 1593 
dendritic cell activation, 1593F 
T H 1 vs, 1574F, 1592-1594, 1593F 
see also CD4T-cell(s);Tcell(s) 
Hemangioblastoma, 1449 
Hematoxylin, 585, 585F 
Heme groups, 830F, 831 

iron coordination, 166, 166F 
porphyrin rings, 830F, 831 
Hemidesmosome(s), 1 1 34, 1 1 34F, 1 1 35T, 1 1 70, 
1171F 
intermediate filament attachment, 1144F, 1171F 
keratin filaments, 986 
Hemoglobin (Hb), 21 F 
cooperativity, 256, 256F 
evolution, 256-257, 256F 
gene homologies, 20 

see a/so Globin genes 
iron coordination, 166, 166F 
structure, 144F, 166F, 256-257 
globin chains, 256-257, 256F 
heme groups see Heme groups 
see also Globin(s) 
subunit interactions, 142-143, 143F 
Hemolysin, Listeria monocytogenes, 1511-1512 
Hemophilia, mutations, 348F 
Hemopoiesis, 1450-1463 
bone marrow, 1453-1456 
cell death in, 1462 
commitment, in, 1456-1458 

see also Transit amplifying cells 
control, 1454 

by apoptosis, 1462 

by colony-stimulating factors, 1459-1461 
multiple control points, 1461 F 
lineage diagram (scheme), 1457F 
from multipotent stem cells, 1456 
see also Erythropoiesis; Hemopoietic stem cells 
Hemopoietic cells, 1461-1462 
Hemopoietic stem cells, 1450-1463 
in bone marrow, 1454-1462 
cell-surface markers, 1456 
commitment, 1456-1458 
dependence on stromal cell contact, 1458-1459, 

1458F 
division rate, 1458 
gene therapy potential, 1456 
pluripotent, lymphocyte origin, 1543 
rescue of irradiated individuals, 1456 
self-renewal, 1458 
see also Hemopoiesis 
Hemopoietic tissues, origin of lymphocytes, 1 543 
Hemorrhagic fever, viral, 1 502 
Heparan sulfate, 881, 1183 
Hepatitis, 1505 
Hepatitis B infection, liver cancer causation, 

1227-1228, 1227T 
Hepatitis virus(es) 
chronic illness, 1500 
liver cancer and, 1227-1228 
receptors, 1505 
Hepatocyte(s), 1442, 1443-1444 
electron micrograph, 698F 
glucocorticoid-mediated gene expression, 412, 

415,463 
organelles, membrane amounts, 697T 
smooth endoplasmic reticulum, 725 
Hepatocyte growth factor (HGF), 1443 
Her2 protein, in cancer therapy, 1 260 
Hereditary nonpolyposis colorectal cancer 

(HNPCC), 277, 1254 
Heredity, 195 

defining feature of life, 1 
see also Inheritance 
Hermaphrodite, C. eiegans, 1321 
Herpes simplex virus (HSV), 1487-1488, 1496 
MHC class I protein translocation inhibition, 

1535-1536 
receptors, 1505 
Herpes virus(es), 1496F 
assembly, 1514F 
envelope acquisition, 1514F 
microtubule-based movement in axons, 
1516-1517, 1516F 
Hes 7 gene, 1371 

Heterocaryon(s), 509F, 643, 643F 
Heterochromatin, 220-222, 238-239, 1070 
biological functions, 220-221 
centromeric (centric), 220 



see also Centromere(s) 

dynamics, 220 

gene expression regulation, 366, 445, 452-453 
gene silencing, 220, 221 F 
position effects and position effect 

variegation, 221, 221 F 
X-inactivation, 285 
see also Gene silencing; Insulator elements 

heritability,221,22lF 

histone modification, 238-239 

multiple forms, 238-239, 238F 

organization, 220-222 

replication, 285 

short tandem repeats, 547F 

X-inactivation, 474-475 
Heterochromatin protein 1 (HP1), 238 
Heterochronic genes, 1326-1327, 1326F 
Heteroduplex joints, 305F, 306, 324, 325F 
Heteroduplex, meiotic recombination, 314, 31 4F 
Heterogeneous nuclear ribonucleoproteins, 353F, 

358 
Heterophilic binding, 1 146 

homophilic vs., 1138F 
Heterotypic membrane fusion, 768 
Heterozygote, definition, 554F 
Hexokinase, 120F, 159 
Hexoses, 112F 
High-density protein arrays, protein-protein 

interactions, 523 
High-mannose oligosaccharide(s), 774, 774F 
High-performance liquid chromatography (HPLC), 

513 
Hinge region, antibody molecule, 1552, 1553F 
Hippocampus, glutamate receptors, 691-692 
HIS gene, Saccharomyces cerevisiae, 285F 
Histamine, 804, 1556 

mast cells, 1557F 

release, 804, 804F 

secretion, 804, 804F, 1557F 
Histidine, 128F,548 
Histidine-kinase-associated receptors, 922, 

941-942 
Histochemical staining, 554-555, 579, 585F 
Histocompatibility molecule, 1575 

see also MHC (major histocompatibility complex) 
Histology 

airways, 1434-1436 

blood vessels, 1445-1450 

endothelial cells, 1445-1450 

epidermis, 1417-1428 

gut, 1436-1444 

liver, 1442-1444 

sensory epithelia, 1429-1433 
Histone(s), 211-216 

addition to DNA (postreplication), 290 

analysis, 21 1F, 21 2F 

evolutionary conservation, 213 

fold, 212, 21 3F 

genes, 289 

histone octamer, 21 1, 212 
assembly, 212, 214F 
identification, 21 2F 
S phase, 1070 

inheritance following replication, 290-291, 291 F 

in vitro mutagenesis, 213 

modification see Histone modification 

mutations, 213 

protein-DNA interactions, 212-213 

structure, 212, 213, 213 F, 21 4F 

synthesis in S phase, 289-290 

variants, 224, 224F 

see also individual histones 
Histone acetyltransferases (HATs), 222, 343, 343F 
Histone chaperones see Chromatin assembly 

factors (CAFs) 
Histone code hypothesis, 224-226, 226F 
Histone deacetylases (HDACs), 222, 446F 
Histone H1, 218, 218F 
Histone H2A, 211 

H2Bdimer,290 

interactions, 212, 21 4F 

modifications, 21 6F 

variants, 224F, 474-475 
Histone H2B, 211 

H2Adimer,290 

interactions, 212, 21 4F 

modifications, 216F 
Histone H3, 211 

centromere-specific variant CENP-A, 231-232, 
232F 
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H4 tetramer, 290-291 

histone code hypothesis, 225, 225F 

interactions, 212, 214F 

modifications, 187, 187F, 226, 226F, 290-291, 
291 F 

variants, 224F, 475 
Histone H4, 211 

evolutionary conservation, 213, 265 

H3 tetramer, 290-291 

interactions, 212, 21 4F, 217-218, 21 8F 

modifications, 290-291 , 291 F 

variants, X-inactivation, 475 
Histone modification, 222-224, 223F 

acetylation/deacetylation, 290 

amino acid side chains, 222, 222F 

chromatin immunoprecipitation analysis, 432, 
432F 

chromatin remodeling, 432, 433F 

functional roles 

epigenetic, 472, 472F 

histone code hypothesis, 224-226, 226F 

transcriptional, 343 

histone acetyl transferases {HATs), 222, 343, 343F 

histone deactylase complexes (HDACs), 222, 
446F 

histone tail modification, 222, 223F 

polytene chromosomes, 238F 
Histoplasma capsulatum, 1494, 1494F 
HIV (human immunodeficiency virus), 1496F, 1581 

cancer and, 1228T 

cell entry and uncoating, 764-765, 765F, 
1505-1506, 1506F, 1507 
multivesicular bodies, 797 

chemokine receptor binding, 765F 

co-receptors, 1505-1506, 1505F 

drug resistance, 1521 

fusion peptide, 1507 

genome, 485F, 1498F, 1521 

life cycle, 321 F, 486, 486F 

microtubule-based movement in axons, 1517 

mutation rate, 1520-1521 

Nef protein, 485F 

receptors, 1505-1506, 1505F 

regulated nuclear transport, 443FF, 485-486, 
485F, 486F 

reverse transcriptase structure, 32 1 F 

Rev protein, 443F, 485-486, 485F, 486F 

Rev responsive element (RRE), 485, 485F, 486F 

Tat protein, 478, 485F 

transcription attenuation, 478 

translational frameshifting, 383, 384F 

vaccine development, 1521 

see also specific types 
HIV-1,1521,1521F 

HIV-2, diversification and origin, 1 521 F 
HLA (human-leucocyte-associated) antigen see 
MHC (major histocompatibility 
complex) 
HLA-DM protein, 1406 
HNPCC see Hereditary nonpolyposis colorectal 

cancer (HNPCC) 
hnRNPs, 353F, 358 
Holliday junctions, 311, 312F 

double branch migration, 312, 31 2F 

gene conversion, 31 1, 314-315 

isomerization, 31 IF 
Homeobox genes see Homeotic (homeobox) genes 
Homeodomain DNA-binding motifs, 420-421, 
421 F 

see also Homeodomain proteins 
Homeodomain proteins, 420-421 

DNA-binding, 420-421, 421 F, 426-432, 430F 

evolutionary conservation, 138F, 420-421 

heterodimerization, 425F 

homologies, 138, 138F, 421 

isolation, 429 

plants, 1399 

Wuschel, 1410F 
Homeotic (homeobox) genes, 420, 1 338F, 
1341-1347 

chromosomal ordering in Hox complexes, 
1343-1344 

Drosophila see Drosophila melanogaster 
homeotic genes 

family members in different eucaryotes, 1400T 

flower development, 1413-1414, 1414F 

Hox complex see Hox complex 

see also Homeodomain proteins; Hox genes 
Homeotic mutation 

Arabidopsis, 1413F 



Drosophila, 1342, 1342F 
Homing receptors, lymphocytes, 1550 
Homologous chromosomes (homologs), 202, 1090, 
1271 
axial cores, 1275 

meiotic pairing, 1091-1092, 1271, 1273-1274, 
1274-1275, 1274F, 1276F 
bivalent formation, 1274, 1274F, 1276F 
double strand breaks, 1275, 1280 
genetic diversity and, 1271, 1279-1280, 

1279F 
morphological changes, 1275 
nondisjunction, 1236F, 1278-1279 
pairing sites, 1274 
sex chromosomes, 1275 
synapsis, 1275, 1276F 
synaptonemal complex, 1274F, 1275-1276, 

1275F 
telomeres and, 1274, 1274F 
transverse filaments, 1276, 1277F 
see also Homologous recombination 
(crossing-over) 
meiotic segregation, 1276-1278, 1277F 
nondisjunction, 1236F, 1278-1279 
Homologous end-joining, DNA double-strand 

break repair, 303 
Homologous genes, 20-2 1 , 20F, 2 1 F 
Homologous recombination (crossing-over), 
304-316 
biological functions, 304-305, 305F 
common features, 305 
control via mismatch correction, 31 5F 
double-strand break repair, 302-303, 303F, 
304-305, 305F, 308-309, 309F 
defects, 295T 
heterozygosity loss, 310 
regulation, 310-311 
"repair factories," 3 1 0, 31 OF 
evolutionary benefit, 305 
genetic exchange/assortment, 305, 1348 
see also meiotic recombination (below) 
mechanism 

base-flipping, 307F 
base-pairing, 305-306 
branch migration, 307F, 308, 308F 
heteroduplex joints, 306, 308 
Holliday junctions see Holliday junctions 
hybridization (synapsis), models, 306F 
strand invasion, 306 
meiotic recombination, 305F, 312-314, 1092, 

1092F, 1271, 1274-1275, 1274F, 1280F 
chiasma formation, 1274, 1274F, 1276, 1276F 
crossover control, 3 1 4 
crossover interference, 1280 
double Holliday junction, 312, 313F 
double-strand breaks, 305F, 312 
gene conversion, 314 
genetic diversity and, 1271, 1279-1280, 

1279F 
heteroduplex, 314, 314F 
hot spots and cold spots, 1280 
non-crossovers, 313-314 
recombination complex, 1275, 1280 
regulation, 314, 1280 
Spoil, 1280 
proteins, 31 2F 

RecA, 306-307, 307-308, 307F 
strand exchange see Holliday junctions 
strand invasion, 305 
see also Linkage analysis; Meiosis 
Homology, 20F, 246 
Homophilic adhesion, 1 137-1 1 39 

heterophilic vs., 1138F 
Homo sapiens see Humans (Homo sapiens) 
Homotypic membrane fusion, 768, 769F 
Homozygote, definition, 554F 
Homunculus, 1391 
Horizontal gene transfer, 19F, 21-22 
antigenic variation, 1520 
evolutionary innovation, 19 
Hormones, 882 

gene expression regulation, 41 2, 41 5, 463, 463F 
see also individual hormones 
Host behavior, modification to facilitate pathogen 

spread, 1518 
Host cells 

invasion by pathogens, 1501, 1504-1505 
bacteria, 1507-151 1, 1508F, 1509F 
evasion of defenses, 1502-1504 
intracellular parasites, 1508-1511 



viruses, 1505, 1506F 
membrane traffic and pathogen strategies, 

1511-1514,1511F 
response to pathogens, 1 488 
transcription shut-down, 1517 
virus-induced metabolic changes, 1517-1518 
Host defenses, 1501-1502 
epithelial surfaces, 1525-1526 
evasion by pathogens, 1502-1504 
antigenic variation, 1519-1520 
Housekeeping genes, CG (CpG) islands, 434, 470 
HoxB complex, expression in mouse, 1 346F 
HOX code, anteroposterior axis determination, 

1342-1346 
Hox complex 

anteroposterior axis determination, 1342-1346 
expression in mouse, 1346F 
expression pattern in Drosophila, 1343F 
insect and mammal compared, 1 346F 
sequential gene expression, 1 343-1 344 
Hox genes, 1345, 1355 
HPLC (high-performance liquid chromatography), 

513 
HS4 protein, 227-228 
hsp40 proteins, 390F 
Hsp60 protein, 388, 389 

Hsp60 protein, mitochondrial protein import, 717 
Hsp70 protein, 388, 389, 390F 
Hsp70 protein, mitochondrial protein transport, 

715,716F,717 
HTH motif see Helix-turn-helix motif 
Human accelerated regions (HARs), 253 
Human genetics, 556 
Human genome, 205-206, 206T 
chromosomes, 202, 203FF 

chromosome 12 translocation, 204F 
chromosome 22, 205F, 206T 
gene organization, 205F 
replication origins, 287-288 
replication rate, 283 
t-loop, 293F 
complexity, 142, 205-206 
composition, 207F 
Alu elements, 323F 
mobile elements, 207F, 31 8T, 323 

see also Mobile genetic elements 
see also Noncoding DNA 
genes, 142, 206T 
regulatory, 206 
size, 206 

structure, 205-207, 205F, 206, 206T 
see also Gene structure 
genomic DNA library, 541, 542F 
individual variational 
mitochondrial, 861-862, 861 F 
nonretroviral-like retrotransposons, 321-322 
phylogeny reconstruction, 248F, 25 1F 
scale, 206F, 206T, 209F 
sequencing, 142, 205-206 

comparative, 207, 247, 247F, 248F 
Human Genome Project, 205-206, 206, 532 
size, 18T 
Human Genome Project, 205-206, 206, 532 

see also Human genome 
Human immunodeficiency virus see HIV 
Human papillomavirus, cervical cancer, 1499 
Humans (Homo sapiens) 

genome see Human genome 
as model organism, 40 
in tree of life, 16F 
Humoral immune response, 1552-1568 

see also B cell(s); Immunoglobulin(s) 
Hunchback protein, eve gene activation, 448-450, 

449F 
Hunter's disease, 785 
Huntingtin, sequence comparisons, 251 F 
Huntington's disease, 397, 560 
Hurler's disease, 785 
Hyaluronan (hyaluronic acid; hyaluronate), 1 1 79, 

1180-1181, 11 80F 
Hyaluronidase, 1181 
Hybrid cells 

hybridomas see Hybridomas 
production, 509F 
Hybridization, DNA see DNA hybridization 
Hybridomas, 508-509, 509F 
Hydra, asexual reproduction, 1269, 1269F 
Hydride ions, electron carriers, 82 
Hydrocarbon(s), 1 06F, 623, 623F 
Hydrogenation reactions, 71 F, 72 
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Hydrogen, atomic structure, 46F 
Hydrogen bonds, 54, 108F, 1 10F 

aquaporin(s), 675 

ball and stick model, 54F 

bond length/strength, 53T 

DNA structure, 197-198, 198F, 199F, 417 

protein-DNA interactions, 41 7, 41 7F 

protein folding, 130F 

protein-ligand binding, 153, 154, 154F 

protein structure, 1 26, 1 27, 1 30F 

in water, 108F, 11 OF 
Hydrogen ion (H + , proton) 

acids, 52 

exchange, 109F 

gradients to drive ATP synthesis, 100F 

intracellular ion concentration, 596 

movement, 827-828, 835, 835F 

see also Electrochemical proton gradients; 
Proton pumps 

pH measurement, 596 

water, 828, 828F 
Hydrogen ion pump (H + pump) see Proton pumps 
Hydrogen peroxide, peroxisomes, 721 
Hydrolysis, 85F 

ATP see ATP hydrolysis 

macromolecules, energetics, 84-87 

nucleotide see Nucleotide(s),hydrolysis 

sugars, 56, 57F, 164-165 
Hydrolytic editing, aminoacyl-tRNA synthetases, 

372, 372F 
Hydronium ion, 52, 53, 109F, 828F 
Hydropathy plots, transmembrane protein(s), 632, 

633F 
Hydrophilic molecules, 52, 108F, 618, 620, 621 F 

see also Water 
Hydrophobic chromatography, 512 
Hydrophobic interactions, 1 1 1 F, 381 
Hydrophobic molecules, 52, 1 08F, 61 8, 620, 621 F 

proteins, 390-391 

see also Water 
Hydrothermal vents, 12, 13FF 
Hydroxyl ion, 109F 

Hydroxylysines, collagens, 1186, 1187F 
Hydroxyprolines, 1186, 1187F, 1190 
5-Hydroxytryptamine (5-HT) see Serotonin 
(5-hydroxytryptamine, 5-HT) 
Hypervariable region, antibody, 1560-1561 
Hypodermis (subcutaneous layer), 1418, 1418F 
Hypopigmentation (albinism), 786 
Hypothalamus, circadian clock, 461 
Hypoxanthine, 300, 301 F 
Hypoxia, angiogenesis activation, 1 221 
Hypoxia-inducible factor 1 (HIF1), 1221, 1448-1449 



ICAMs see Intercellular cell adhesion molecules 

(ICAMs) 
IFNssee Interferon(s) 
IgA see Immunoglobulin A (IgA) 
IgD see Immunoglobulin D (IgD) 
IgE see Immunoglobulin E (IgE) 
IgG see Immunoglobulin G (IgG) 
IgM see Immunoglobulin M (IgM) 
IHog protein, 950 
IkB kinase (IKK), 953 
IkB protein, 953 
ILssee Interleukin(s) 
Image processing 

electron microscopy, 610-612 
light microscopy, 583-584, 584F 
Imaginal discs, 1349-1351 
growth control, 1353-1354 
wing imaginal disc, 1356F 
Imago, 1329 

Immature naive B cell, 1553 
Immediate early gene(s), 929 
Immortal cell lines/immortalization, 504F, 506T, 
1059 
see also Cell cycle control 
Immortal strand hypothesis, 1424-1426, 1425F 
Immune complexes, 1553F 
Immune response/system 

adaptive, 1541-1542, 1542 F, 1545 

cell-mediated vs antibody, 1540, 1540F 
complement system see Complement system 
innate vs., 1539, 1540FF 
MHC proteins see MHC (major 

histocompatibility complex) 



primary vs secondary, 1546 
see also Bcell(s); lmmunoglobulin(s);Tcell(s) 
innate, 1485, 1524-1537, 1539, 1540FF 
adaptive vs., 1539, 1540FF 
elicitors, 1526-1527, 1527 
evolution, 1524, 1531 
function, 1524 

recognition of pathogens, 1 526-1 528 
response to viruses, 1534-1536 
Toll-like receptors role, 1530 
see also Phagocytosis 
Immunoblotting, 518, 519F 
Immunoglobulin a (Iga), 1599F 
Immunoglobulin p (IgfJ), 1599F 
Immunoglobulin(s), 1540, 1552-1561, 1552-1568, 
1552F, 1554F, 1557T 
affinity, 1558 
applications 

cell separation techniques, 502 

FACS, 502, 503F 

fluorescent labeling, 502, 588 

immunoelectron microscopy, 606-607, 607F 

immunofluorescence microscopy, 587F, 

588-589 
immunoprecipitation, 431-432, 432F 
microinjection, 597, 598F 
monoclonal antibodies, 509 
avidity, 1558 
classes, 1553-1557, 1557T 

see also specific types 
class switching see Class switching 
diversification, 1562-1569, 1568F 
evolution, exon recombination, 257F 
genes 

gene-pool selection, B cell development, 

1566F 
post-transcriptional regulation, 370 
segments, 1562, 1563, 1563FF, 1564-1565 
immunoglobulin a (Iga), 1599F 
immunoglobulin [5 (Igp), 1599F 
membrane-bound form, 1 557 
monoclonal see Monoclonal antibodies 
responses, 1540, 1540F 

primary and secondary, 588, 1 546F 
see also specific types 
structure, 156, 157F, 1558-1559, 1558F, 1559F, 
1560F 
binding sites see Antigen-binding site 
gene-segment pool, 1563F 
heavy (H) chain see Heavy (H) chain 

(antibodies) 
hinge region, 1552, 1553F, 1554F 
hypervariable region, 157F, 1559, 1559F, 

1560-1561 
Ig domains, 1 559, 1 559F, 1 560F 
immunoglobulin fold, 140F 
light (L) chains see Light (L) chain (antibodies) 
paired domains, 141 

tail (Fc) region, 1552F, 1554F, 1555, 1555F, 
1559 
synthesis 

in endoplasmic reticulum, 768F 
in vitro via hybridomas, 508-509, 509F 
see a/so Bcell(s) 
Immunoglobulin A (IgA), 1556, 1556FF, 1557T 
class switching, 1567 
heavy chain, 1553 
Immunoglobulin D (IgD), 1 557T 
B-cell development, 1553 
class switching, 1567 
heavy chain, 1553 
Immunoglobulin E (IgE), 1556, 1557T 
allergic reactions, 1557F 
class switching, 1567 
heavy chain, 1553 
histamine secretion, 1557F 
Immunoglobulin fold, 140F 
Immunoglobulin G (IgG), 1553, 1554-1556, 1557T, 
1559F, 1560F 
class switching, 1567 
structure, 1 561 F 
Immunoglobulin-like domain, 1560F, 1599 
Immunoglobulin M (IgM), 1553-1554, 1555F, 
1557T 
class switching, 1 567, 1 568F 
complement activation, 1554 
transmembrane, 1599F 
Immunoglobulin superfamily, 1560, 1599-1600, 
1599F 
alternative splicing, 1 146-1 147 



|32 integrin ligands, 1174 

cell adhesion, 1145, 1146-1147, 11 77T 
synapse formation and, 1 147-1 148 

evolution, 257F 

immunoglobulin fold, 140 

structures, 1147F 

see also Immunoglobulin(s); specific members 
Immunogold electron microscopy, 606-607, 607FF 
Immunological memory, 1546, 1546F 
Immunological synapse, 1573, 1591 

B cells/helper T cells, 1597T 

signal-relaying junctions, 1132 
Immunological tolerance, 1547-1549 

acquired, 1546F, 1547, 1547, 1547F 

autimmune regulator (AIRE) protein, 1588 

clonal deletion, 1548, 1548F 

nonactivated dendritic cells, 1571 

peripheral, 1587-1588 

receptor editing, 1548, 1548F 

self, 1548F 
Immunoprecipitation, 432, 432F, 458 
Immunostimulants, pathogen-associated, 

1526-1527,1527 
Immunosuppression, tuberculosis, 1488 
Imprinting see Genomic imprinting 
Inclusion-cell disease (l-cell disease), 785-786 
Indian Hedgehog protein, 950 
Indirect immunofluorescence, 587F, 588, 588F 
Induced proximityy, 185-186 
Induced somatic recombination, 1348, 1349F 
Inducible promoters, 564 
Inductive signals, in development, 1313 

gastrulation role, 1368F 

sequential, 1319-1320 

see also specific signals 
Infection(s) 

as carcinogens, 1227 

cell biology, 1501-1524 

chronic diseases, 1499-1500 

intracellular pathogens see Intracellular 
pathogens 

response to see Host defenses; Immune 
response/system 

route, 1501-1502 

signs/symptoms, 1487-1488 

see also specific infections/organisms 
Infectious mononucleosis, 1499 
Infertility, 1301 

Inflammation/inflammatory response, 1526, 1527, 
1533-1534, 1534F 

proteoglycans and, 1 183 
Inflammatory mediators, 1533-1534 
Influenza virus(es), 1496F, 1521 

cell entry, 765 

entry and uncoating, 1 506F, 1 507 

microtubule-based movement in axons, 1517 

pandemics, 1521, 1522F 
Influx transporter proteins, auxin transport, 959 
Information transmission 

defining property of life, 1 

energetics, 8 

molecules 

DNA see DNA (deoxyribonucleic acid) 
origin of life, 401, 402-403, 402F 
RNA see RNA (ribonucleic acid) 

quantification, 1 

see also Inheritance 
Inheritance, 1, 195 

epigenetic, 291,702,704 

chromatin structure, 230-234, 232F, 473-476 
DNA methylation, 467-468, 467F 
histone modifications, 290-291 , 291 F 

information transmission, 1-5 

non-Mendelian, 864-865, 865F 
chloroplast(s), 866, 866F 
maternal, 866 

'Protein-only inheritance/yeast, 398, 398F 
Inherited spinal muscular atrophy, SMN protein 

mutation, 364-365 
Inhibitors of apoptosis (lAPs), 1 1 24-1 1 25 
Inhibitory G protein (GO, 906-907 
Inhibitory postsynaptic potential (IPSP), 688 
Initiation codon, 367F, 380, 489-491 
Initiation complex, transcriptional, 341 
Initiation factors (eucaryotic) see elFs (eucaryotic 

initiation factors) 
Initiator tRNA, 380 

Innate immune response see Immune 
response/system, innate 
Innexins, 1159 
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lnosine,369,369F,484,484F 

Inositol 1,4, 5-triphosphate (IP 3 ), 910, 910F, 91 1 FF 

egg activation role, 1 299 

membranes, 910 
Inositol phospholipid(s), 624, 627F, 757, 757F, 910F, 
933 

cellular location, 758F 

phosphatidylinositol (PI), 743, 91 OF 

phosphatidylinositol 4, 5-bisphosphate [Pl(4, 
5)P 2 ],909,910F,911F 

phosphoinositide 3-kinase, in cancer, 1244 

retromers, 756 

signaling pathway, 909-91 1, 91 OF, 91 1 FF 
see also Phospholipase C 

see also specific types 
Input-output devices ("microchips"), protein 

kinases, 177-178, 177F, 179F 
Insect(s) 

as vectors for microbial pathogens, 1 501 -1 502 

see also Drosophila melanogaster 
Insertional mutagenesis, 528F, 556 

see a/so Transposon(s) 
In situ hybridization (ISH), 572-573, 573F 

fluorescence (FISH), chromosome puffs, 239 
Instars, 1329 

Insulator-binding proteins, 452-453, 453F 
Insulator elements, 452-453, 452F, 453, 453F 
Insulin, 144F 

proteolytic cleavage, 151, 152F 

receptor, 798, 798F, 923F, 923T, 924 

secretion (p cells), 1444, 1444F 
Insulin-like growth factor-1 (IGF-1), 923T, 924, 934 

receptor, 923F, 924 
Insulin-like growth factor-2 (IGF-2), imprinting, 

469-470, 470F 
Integra (skin substitute), 1477 
Integrase, 318T 

lambda integrase, 324, 325F 
Integrin(s), 1 1 69-1 1 78, 1 1 73T, 1 1 77T 

actin linkage, 1170, 1170F 

adhesion junctions, 1 1 34, 1 1 45 

allosteric regulation, 1 1 71 -1 1 72, 1 1 72F 

Pi integrins, 1173, 1422, 1422F 

p2integrins, 1174 

p3 integrins, 1174 

bacterial invasion and, 1508 

bi-directional signaling, 1 169 

cell migration and, 1 1 70-1 171 

cell proliferation and survival, anchorage 

dependence, 1 1 75-1 1 76, 1 1 75F 

clustering and strong adhesions, 1 1 74-1 1 75 

counterreceptors, 1 1 74 

diversity of function, 11 72-1 1 74, 1 1 73T, 1 1 74F 

fibronectin binding, 1193 

fibronectin fibril assembly, 1 1 92 

heterophiilic binding, 1 146 

inside-out activation, 1 1 72 

intracellular signaling, 1 1 72, 11 76-1 1 77, 1 1 76F 

in mammary gland, 1428 

mutation and genetic disease, 1172-1174 

outside-in activation, 1172 

structure, 1 1 70-1 1 72, 1 1 70F, 1 1 72F 

tension effects, 1 169, 1 171, 1 171 F 
Interaction domain(s), cell signaling, 897 
Intercalary regeneration, 1354, 1354F 
Intercellular cell adhesion molecules (ICAMs), 41 1 , 

1146,15921 
Intercellular gene transfer see Horizontal gene 

transfer 
Interchromatin granule clusters (speckles), 241, 

241 F, 363-364, 365F 
Interdigitated solenoid model, chromatin structure, 

217,217F 
Interference effects, 581, 582F 
Interferon(s) 

interferon-a (IFNa), 1 534-1 535 

interferon-p (IFNP), 1534-1535, 1536 

interferon-y(IFNy) 

cytotoxic T-cells, effects on, 1592 
T-cell development, 1582 
ThI cell secretion, 1592, 1595F, 1598T 
viral infections, effects on, 1406F 
Interfollicular epidermis, 1418-1419 
Interleukin(s), 1591F, 1598T 

receptors, 1 591 F 

see also specific interleukins 
lnterleukin-1 -converting enzyme (ICE), 1118 
lnterleukin-2 (IL2), 1591, 1591 F 
lnterleukin-2 receptor, 1591 F 
lnterleukin-3 (IL3), 1460, 1460T, 1461 F 



lnterteukin-4 (IL4), 1 593, 1 593F, 1 598, 1 598T 

lnterleukin-5 (IL5), 1593, 1598T 

lnterleukin-1 (IL1 0), 1 575, 1593, 1598T 

lnterleukin-1 2 (IL12), 1593F, 1594, 1595F, 1598T 

lnterleukin-1 3 (IL1 3), 1593 

lnterleukin-17 (IL17), 1594 

Intermediate compartment see Cis Golgi network 

(CGN) 
Intermediate filament(s) 

anchoring junction attachment, 1133T, 1134, 
1143, 1144, 1144F, 1171F 
see also Desmosome(s) 
assembly, 145, 984, 984F 
cellular location, 966 
dynamic behavior, 966 
elongated, 972F 
global distribution, 983 
isoforms, 985 
keratins see Keratin(s) 
lateral bundling and twisting of coiled coils, 

983-985, 984FF 
mechanical stability, 985-987 
organization, 1005-1006 
polypeptide structure, 984 
properties, 965 
role in cell division, 966-967 
structure, 968, 968F, 983F 
subunits, 971 
Internal membranes, 26 
Internal ribosome entry sites (IRESs), 491, 491 F, 

1517 
Interneurons, 1386F 
Internode (plant stem), 1398F, 1407 
Interphase, 1055 

chromosomes see Interphase chromosome(s) 
microtubule dynamics, 1080 
stress fibers, 1093 
Interphase chromosome(s), 208F, 209 
chromatin organization, 236F, 243 

see also Heterochromatin 
histone modifications, 238, 238F 
lampbrush chromosomes, 234, 234F, 235F, 1288 
looping, 234-236, 235F 
polyteny, 236-238 

see also Cell cycle; Chromosome structure; 
Polytene chromosome(s) 
Interpolar microtubules, 1075, 1076F, 1079 
Intervening (DNA) sequences see Intron(s) 
Intestinal defenses, against pathogens, 1526F 
Intracellular compartments, 695-748, 696F 
endocytic pathway, 751 F 
maintenance, 750-766 
see a/so Organelle(s) 
Intracellular ion concentration(s), 596, 597F, 652T 
Intracellular movement, bacteria and viruses, 

1514-1517 
Intracellular pathogens, 1 504-1 505 

entry/exit from host cells, 1504-1505, 1512F 
movement using actin polymerization, 

1515-1516 
parasites, invasion of host cells, 1508-1511 
strategies against host membrane traffic, 
1511-1514, 151 IF 
escape from phagolysosome, 1511-1512, 

151 IF 
lysosomal fusion prevention, 151 IF, 1512 
modification by bacteria, 1512, 1512F 
modification by viruses, 1513-1514, 151 4F 
survival in phagolysosome, 151 IF, 1513 
Intracellular processes 
manipulation, 598-599 
visualization 

electroporation, 598, 598F 
GFP, 592-593 

see also Green fluorescent protein (GFP) 
ion concentration, 596-597, 597F 
microinjection, 597-598, 598F 
radioisotopic tracing, 602-603 

see also Radioisotopes 
see also Fluorescent dyes 
Intracellular receptors see Nuclear hormone 

receptor(s); Receptor(s),intracellular 
Intracellular signal molecule(s)/pathway(s), 880F, 
893-899, 894F 
input-output devices, 1 77-1 78 

see also Protein kinase(s) 
integrins, 1 1 72, 1 1 76-1 1 77, 1 1 76F 
mitogens, 1103 
phosphorylation 

cfGTP-binding, 180F 



tyrosine residues see Tyrosine kinase(s) 
see also Protein kinase(s) 

positive feedback, 901 F 

second messengers, 893 

signal enhancement, 897 

signaling complexes, 897, 898F 

signaling proteins, 893, 894F, 897-899 
see also Signaling molecule(s)/pathway(s); 
specific molecules 

signal integration, 897F 

threshold-like response, 900-901, 901 F 

tyrosine phosphorylation see Tyrosine kinase(s) 

see also Signal transduction; specific 
molecules/pathways 
Intracellular transport 

GFP-fusion protein studies, 753F 

membrane-enclosed organelles, 1021-1022 

see also Cytoskeleton; Motor protein(s) 
Intracytoplasmic sperm injection (ICSI), 1 302, 1 302F 
Intragenic mutation(s), 18, 19F 
Intron(s) 

evolution, 348 

fate signals, 358 

gene structure, 206 

group I, 334, 356, 356F 

group II, 356, 356F 

human genes, 205F, 206, 206T 

length variation, 349, 352, 353F 

optional, 863 

organellas 863 

protein evolution role, 257 

removal by RNA splicing, 345-346, 347-348 
order, 352-353, 353F 
splicing signals see Splicing signals 
see also Splicing 

sequence ambiguity, 480 
Intron-exon mapping, 538F 
Intron-exon recognition, 551 
Invariant chain, 1584, 1596F 
Invariant pocket, MHC protein, 1577-1578, 1579F 
Invasin, 1508 

tnversion(s), genome evolution, 246-247 
Inverted repeats, DNA-only transposons, 318T, 319F 
In vitro, definition, 503-504 
In vitro fertilization (IVF) and related procedures, 

1297,1301-1303, 1302F 
In vitro mutagenesis, 21 3, 575 
In vitro RNA selection experiments, 404F, 408 
In vivo, definition, 503-504 
Involucrin, 1419 
lon(s), 47 

concentration, 596, 597F, 652T 

see also Electrochemical proton gradients 

membrane channels see Ion channel(s) 

see also specific ions 
Ion channel(s), 667-692 

abundancy, 668 

conformational changes, 668F 

cyclic-nucleotide-gated, 916 

families, 692T 

function, 667, 668 

gating, 668, 668F, 1431, 143 IF 

ligand-gated see Ligand-gated ion channel(s) 

mutations, 682 

patch-clamp recordings, 651 F, 680-682, 681 FF 

selectivity, 667-669 

transport efficiency, 667 

voltage-gated, 676-678 

see also individual types 
Ion channels, Cyclic-nucleotide-gated, 91 7 
Ion-exchange chromatography, 512, 51 4F 

matrices, 534F 

protein purification, 535F 
Ionic bonds, 47, 11 IF 

aqueous solutions, 11 IF 

bond length/strength, 53T 

covalent bonds vs., 48F 

protein folding, 130F 
Ionizing radiation, DNA-damaging agents, 300-301 

see also Ultraviolet (UV) radiation 
lonotropic receptors see Ion channel(s) 
Ion-sensitive indicators, 596, 597F 
IP 3 see Inositol 1,4,5-triphosphate (IP 3 ) 
IRESs see Internal ribosome entry sites (IRESs) 
Iron-copper center, cytochrome oxidase complex, 

832-834 
Iron, endocytic uptake, 794 
Iron-sulfur center, 830F, 831, 852 
Iron-sulfur proteins, 830F, 831 
IS1 transposons, 31 8T 
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Islet gene, 1 385 

Islets of Langerhans, 1444, 1444F 

Isocitrate, 122F, 123F 

Isocitrate dehydrogenase, 123F 

Isoelectric focusing, 2D-PAGE, 521, 521 F, 522F 

Isoleucine, structure, 129F 

Isomers, 56 

amino acids, 128F 

sugars, 112F 
Isotopes, 601, 601T 

radioactive see Radioisotopes 
Izumo protein, 1298 



Jak (Janus kinase), 937-939 

Jak-STAT signaling pathway, 937-939, 938F, 939T 

Janus kinase (Jak), 937-939 

J (joining) chain 

IgA, 1556F 

IgM, 1554, 1555F 
Joint molecules (recombination) 

conservative site-specific recombination, 324, 
325F 

synapsis, 305F 
Junctional complexes, 11 53, 11 54F 

cell proliferation and, 1153-1155 

organization, 1 1 48-1 1 49, 1 1 48F, 1 1 49F 

see also Scaffold protein(s) 
Junctional diversification, antibody gene segment 

joining, 1564-1565 
Junctional epidermolysis bullosa, 1 165 
Junk DNA, 204, 860 

see also Noncoding DNA 



Kaposi's sarcoma, 1228, 1228T 
Kartagener's syndrome, 1033 
Karyotype, 203 

analysis, 203FF 

human, 203F 

muntjac deer, 204F, 205F 

see also Mitotic chromosome(s) 
Katanin, 770, 995F, 1000, 1001F 
KDEL sequence, ER retrieval pathway, 770 
Keratan sulfate, 1179 
Keratin(s), 985-986, 985T, 986F 

a-keratin, 145 
Keratin filaments, epidermal prickle cells, 1419, 

1419F 
Keratinocytes, 1038F, 1418F, 1419 

control of differentiation in culture, 1422, 1426 

migratory, 1037, 1037F 
a-Ketoglutarate,99, 122F, 123F 
a-Ketoglutarate dehydrogenase complex, 123F 
Ketoses, monosaccharides, 11 2F 
Kidney glomerulus, basal lamina, 1 1 64-1 1 65, 

1164F, 1167 
Kinases see Protein kinase(s) 
Kinesin(s), 527, 1014, 101 4F 

ATP hydrolysis, 101 8F 

axonal transport, 1021 

central core and force generation, 1016 

evolutionary origin, 1015-1016 

force generation, 1016-1019 

linker region, 1016 

mechanochemical cycles, 1016-1017, 1018F 

microtubule-binding site, 1014, 1014F 

microtubule disassembly, 995F 

mitotic spindles, 1034, 1077, 1079, 1082F 

movement speed, 1020 

processivity, 1020 

structure, 1014F 

see also individual kinesins 
Kinesin-related proteins (KRPs), 1014, 1014F 
Kinesin superfamily see Kinesin(s); Kinesin-related 

proteins (KRPs); Motor protein(s) 
Kinetics 

enzyme catalysis see Enzyme kinetics 

motor proteins, 1020-1021 
Kinetochore(s), 210, 1076F, 1082-1083 

bipolarity, 1075-1076, 1076F 

meiosis 1, 1277-1278 

microtubules 

attachment, 1082, 1083F, 1084F 
forces acting on, 1085-1087, 1085F, 1086F, 
1089 



mitotic, 1035F, 1076, 1076F 
tension, 1084, 1086 

spindle assembly checkpoint, 1088 

structure, 1083F 
Kinetochore fiber, 1084 

Kin recognition, selective protein retention, 731 
Kissing hairpins, RNA structure, 403F 
Kit protein, 1375, 1458, 1458F 
KKXX sequence, ER retrieval pathway, 770 
Knockout animals see Gene knockouts 
K-Ras gene, 1 25 1 -1 252, 1 252T, 1 255 
Krebs cycle see Citric acid cycle 
Kringle domain, 140F, 141 
Kruppel mutant, Drosophila, 1337 
Kruppel protein, 41 8T, 448-450, 449F 
Kupffer cells, 1443F 
Kuru, 1499F 



l_1 elements see LINE (LI) elements 

Labeling index, 1059 

Lac operon (Escherichia coii), 9F, 41 6, 41 8T 

DNA looping, 438 

lac gene regulation, 9F, 436F 
CAP activation, 41 8T, 420F, 436 
lac repressor see Lac repressor 

mutations, 264 

two-signal switching, 436F 

see also Repressor protein(s) 
Lac repressor, 9F, 41 6, 41 8T 

discovery, 416 

DNA sequence recognition, 41 8T 

synthetic biology, 461 F 

see also Repressor protein(s) 
Lactate production, 90 

see also Glycolysis 
Lactation, 1427-1428, 1427F 
Lactose permease, 657, 658F 
Lagging strand synthesis (DNA replication), 268, 
269F, 272F, 276F 

end-replication problem, 292 
see a/so Telomerase 

eucaryotic, 280, 281 F 

ligation, 272 

Okazaki fragments, 267-268, 269F, 272, 281 F 

RNA primers, 272, 272F 
Lakritz mutant, 556 

Lambda bacteriophage see Bacteriophage lambda 
Lambda repressor, 457-458, 458F 

DNA sequence recognition, 41 8T 

gene regulatory protein, 416 

helix-turn-helix motif, 420F 

synthetic biology, 461 F 
Lamellipodia, 1037 

actin filament nucleation, 996 

actingels, 1008 

cell adhesion, 1040 

cell crawling, 1008, 1036F 

cofilin in, 1038F 

growth cones, 1040F, 1049 

movement mechanism, 1037-1038 

organization cf. mitotic spindle, 1039 

ruffling at leading edge, 1040F, 1041 

web formation by ARP complex, 1038, 1038F 
Laminin(s) 

basal lamina, 1165-1166 

collagen binding, 1 166-1 167, 1 167F 

epithelial apico-basal polarity, 1 1 55F 

isoforms, 1166 

laminin-1 (classical laminin), 1165-1166 

mutations, 1167 

neuronal guidance role, 1 388 

NMJ, 1168 

structure, 1166F 
Lampbrush chromosomes, 234, 234F, 235F, 1288 
Langerhans cell see Dendritic cells 
Lariat structure, pre-mRNA splicing, 347, 348F, 350F 
Larva (Drosophila), 1329, 1343F 

extended germ band, 1330F 

imaginal discs, 1349, 1350F 
Laser capture microdissection, 502, 503F 
Laser tweezers, 599, 599F 
Late genes, chromatin condensation effect, 

285-286 
Latent gene regulatory proteins 

p-catenin, wnt signaling, 948-950, 949F 

NFkB, 952-954, 953F, 1530 

Notch, 946-948 



proteolysis, 946-954 
Lateral diffusion (membrane proteins), 643, 646F 
Lateral inhibition 

Delta-Notch signaling, 946-947, 947F 

Drosophila sensory bristle development, 
1357-1358 

Wnt-Notch signaling, 1439-1440, 1439F 
Latrunculin, 987, 988T 
Lck, 1581,1 590F 

see also Src family of protein tyrosine kinases 
LCRs see Locus control region(s) 
LDL-receptor-related protein (LRP), 949, 949F 
LDLs see Low-density lipoproteins (LDLs) 
Leading strand synthesis (DNA replication), 267, 

267F, 268, 268F, 269F, 271-273, 271 F, 
276F 
Leaky scanning, 490 
Lectin(s), 1145 

complement activation, 1 529 

glycocalyx interaction, 636 

mannan-binding, 1529 

trans-Golgi network, 808 
Lectin complement activation pathway, 1529 
Leeuwenhoek, Anton van, 501, 501 F 
LEF1/TCF gene regulatory proteins, 950 
Left-right asymmetry development, 1 376-1 377 
Leftyl gene, 1377 
Legionella pneumophilia 

invasion of host cells, 1 508, 1 508F 

strategy against host membrane traffic, 1 51 1 F, 
1512-1513, 1512F 

transmission, 1508 
Legionnaire's disease, 1508 
Leishmaniasis, 1488 
Lens(es) 

electron microscope, 604 

light microscope, 581-582, 582F 

see also Light waves 
Lepromatous leprosy, 1593-1594 
Leprosy, 1593-1594 
Leptin, 1475 

deficiency, 1476F 

function/effects, 1475-1476, 1476F 

gene sequence comparisons, 248F, 249F 

secretion from fat cells, 1475-1476 
Leptogene, 1275 
/_ef7gene, 1327 
Lethal mutation(s), 528F 
Lethal toxin, 1493 

Leucine-rich repeat (LRR) proteins, 956, 956F 
Leucine, structure, 129F 
Leucine zipper proteins, 423, 424F, 425F 
Leucocyte(s), 1451J453T 

adhesion to endothelial cells, 1453, 1454F 

inflammatory response, 1453-1454, 1454F 

integrins, 1174 

migration from capillaries, 1453-1454, 1454F 

structure, 1452F 

types, 1451-1453 
Leucocyte adhesion deficiency, 1 1 74 
Leucoplasts, 841 
Leukemia 

B lymphocyte leukemia, origin, 1219 

chronic myelogenous see Chronic myelogenous 
leukemia (CML) 

DNA repair defects, 295T 
Lex A protein, domain fusion, 442F 
Leydig cells, development, 1 285 
LFA1 (lymphocyte-function-associated protein 1), 

1592T 
Life, 1 

autocatalytic process, 7F 

universal features, 1, 1-11 
Ligand(s) 

ion channel opening see Ion channel(s) 

protein binding sites, 1 53 

see also Substrate(s),enzyme binding 
Ligand-gated ion channel(s) 

ion-channel-linked receptors, 892, 893F 

mechanically-gated, 1431, 1431F 

neurotransmitter-gated see Transmitter-gated 
ion channel(s) 
Ligase fold, 406F 
Ligation 

DNA cloning, 540, 541 F 

lagging strand synthesis (DNA replication), 272 

see also DNA ligase(s) 
Light (L) chain (antibodies), 1552-1553, 1554F, 
1556, 1557T, 1558F 

constant domains, 1 558-1 559, 1 560F 
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generation of diversity, 1562-1566 

repeating domains, 1559, 1560F 

variable domains, 1558-1559, 1560F 
Light energy, 69F, 598-599 

see also Photosynthesis 
Light-harvesting complexes, 848, 848F 
Light microscopy, 581-583 

3-D imaging, 589-590 

confocal microscopy, 591, 592F 
image deconvolution, 590, 590F 

brightfield, 583 

contrast generation, 583, 583F, 584, 584F 

darkfield, 583 

differential-interference microscopy, 583 

fluorescence see Fluorescence microscopy 

history, 579 

image processing, 583-584, 584F 

microscope, 581, 581 F, 604F 

phase-contrast see Phase-contrast microscopy 

resolution, 580-583, 581 F, 582F, 590 
electron microscopy vs., 609F 

sample preparation, 554-555, 579, 585, 585F 
Light waves, 582FF 

contrast generation, 583F, 584F 

diffraction effects, 548, 581 F, 582F, 583 
phase, 580, 582F, 583 

interference effects, 581-582, 582F 

momentum, 598-599 
Lignin, plant cell wall, 1 1 95, 1 1 96, 1 1 99T 
Limb buds (vertebrate), 1355F, 1374F 
Limb regeneration, newt, 1480, 1480F 
Urn gene, 1385 
Lin4ger\e, 1326 
L/n74gene, 1326F 
Lineage tracing, Caenorhabditis elegans, 1 322, 

1322F 
LINE (L1) elements, 31 8T, 321-322, 322F 

human genome, 207F 

human vs. mouse genome, 323F 

see also Repetitive DNA 
Linkage analysis, 559-560 

gene localization, 559-560, 560F 

gene mapping, 559-560, 560F 

see also Homologous recombination (crossing- 
over) 
Linkage, protein-ligand interactions, 171-172, 

172FF 
Linker DNA, 211 
Linker region, (kinesin), 1016 
Lipid(s),58,115F 

aggregates, 115F 

bilayer formation see Lipid bilayer(s) 

cell membrane(s), 617-629, 624 
see also Lipid bilayer; Membrane(s) 

droplets, 94F, 97F, 1 1 5F, 625-626, 626F 

synthesis, 867 

see also Fat(s); specific types 
Lipid anchor(s), membrane proteins, 630 
Lipid bilayer(s), 9, 617-629, 61 8F, 625F 

assembly, 620-621, 622, 743-745, 744F 
see also Phospholipid(s) 

asymmetry, 626-627, 626F 

barrier function, 651 

black membranes, 622 

composition, 622-624, 624F 

diffusion, 622, 652 

domain formation, 624-625 

fluidity, 621-624, 621 F 

fusion, SNARE proteins, 763 F, 764 

membrane protein association see Membrane 
protein (s) 

mobility, 622, 623F 

origin, 405-406, 407F 

permeability, 624, 631 , 652 

relative permeability, 652F, 653F 

signaling, 626 

spontaneous formation, 620-621 

structure, 620F, 621-622 
Lipid kinase(s), 627 

see also specific kinases 
Lipid raft(s), 625, 625F, 807, 807F 

assembly, 742-743, 742F 

caveolae formation, 790 
Lipopolysaccharide (LPS), 1490F 

immunostimulation, 1527 

macrophage activation by, 1 530F 

structure, 1527F 
Liposome(s), 621-622, 622F, 625F 

membrane protein solubility, 638 
p-Lipotropin, 803F 



y-Lipotropin, 803F 

Liquid chromatography, 522 

Listeria monocytogenes 

actin-based movement, 1514, 1515F, 1516, 
1516F 

escape from host phagolysosome, 1511-1512, 
157 IF 

hemolysin secretion and action, 1511-1512 

invasion of host cells, 1 508, 1 5 1 2F 

phagosomal membrane destruction, 1512-1513, 
1512F 

receptors, 1508 
Lithotroph(s), 12, 15, 15F 
Liver, 1442-1444 

cancer, 1227-1228, 1229 

cells see Hepatocyte(s) 

structure, 1443F 
Lkbl gene/protein, epithelial apico-basal polarity, 

1155-1156, 11 56F 
Locus control region(s), 450-452, 452, 452F 
Long interspersed nuclear element see LINE (LI) 

elements 
Long-term depression (LTD), synaptic control, 691 
Long-term potentiation (LTP), effect, 691, 692F 
Loss-of-function mutation(s), 528F, 557-558 
Loss of heterozygosity (LOH), 310, 1236 
Lou Gehrig's disease, 987 
Low-density lipoprotein (LDL) receptors, 791-794, 

791 F 
Low-density lipoproteins (LDLs), 791-792, 791 F, 

792, 793F 
Lox recombination element see Cre/Lox system 
LPS see Lipopolysaccharide (LPS) 
LRR (leucin-rich repeat proteins), 956, 956F 
L-selectin, 1145, 1550, 1550F 
Luminescent molecules, 596-597 
Lung(s), 1434-1435, 1434-1436 
Luteinizing hormone (LH), 907T, 1292 
Lyme disease, 1500 
Lymphatic vessel, function, 1446 
Lymph node(s), 1550, 1551 F 
Lymphocyte(s), 1452-1453, 1453T 

activation, costimulatory signal, 1 548 

cytotoxic, apoptosis, 1 1 20 

effector cells, 1546 

general principles, 1540-1551 
adaptive immunity, 1 540-1 541 
B ceil development, 1543-1544 
clonal selection, 1544 
dependence on innate immune system, 

1541-1542 
immunological memory, 1546 
T cell development, 1 543-1 544 
tolerance, 1548 

hemopoietic stem cells, commitment to 
lymphoid pathway, 1456-1457 

homing receptors, 1550 

memory cells, 1546 

naive cells, 1546 

origin(s), 1541 F, 1543 

recirculation, 1549-1551, 1549F 

see also B cell(s); T cell(s) 
Lymphocyte-function-associated protein 1 (LFA1), 

1592T 
Lymphocytic choriomeningitis (LCM) virus, 1496F 
Lymphoid follicles, 1550, 1551 F 
Lymphoid organ(s), 1540, 1 541 F 

central, 1541 F, 1543, 1543F 

central vs. peripheral, 1543 

peripheral, 1543, 1549, 1549F, 1553F 

see also Bone marrow; Lymph node(s); Thymus 
Lymphoma(s) 

Burkitt's see Burkitt's lymphoma 

DNA repair defects, 295T 

non-Hodgkins, metastasis, 1206F 
Lysine, 128F, 1186, 11 87F 
Lysis/lysogeny, phage lambda, 458, 458F 
Lysosomal hydrolase, 785F, 786F 
Lysosomal secretion, 786 
Lysosomal storage diseases, 785-786 
Lysosome(s), 779-787, 780F 

acid hydrolases, 779-780 

degradation pathways, 784F 

distribution, 780-781 

exocytosis, 786 

function, 696, 779-780 

fusion with phagosomes, 1531-1532 

intracellular digestion, 779-780, 784F 

material import, 782, 783, 783F, 784FF 

maturation, 781 F 



melanocytes, 786 

membranes, 697T 

plant vacuoles, 781-782 

proton pump, 780, 780F 

structure, 780F 

transport 

from endosomes, 791-792, 795F, 796F 
from Golgi apparatus, 779-787, 783-784, 
785F 

Trypanosoma cruzi invasion, 151 1 

volume, 697T 

yeast vacuoles, 781-782 

see also Lysozyme 
Lysozyme 

catalysis, 161, 164-165, 164F, 165F 

disulfide-bonds, 148 

functions, 163-164 

structure, 144F, 164F 

see also Lysosome(s) 



M 

M6P see Mannose-6-phosphate (M6P) 
Macromolecular interactions 
DNA-DNA interactions, 306F 
see also DNA hybridization 
enzyme-substrate interactions see 

Enzyme-substrate interactions 
hydrophobic interactions, 1 1 1 F 
noncovalent interactions, 55F, 156, 158F 
protein-DNA interactions see Protein-DNA 

interactions 
protein-ligand interactions see Protein-ligand 

interactions 
see also specific methods 
Macromolecule(s) 
assembly, 148-152 
biosynthesis, 63, 85F 

energy requirements, 84-87 
orientation, 86, 87F 
cell chemistry, 62-63 
abundance, 62F 
composition, 335-336 
mammalian vs bacterial cells, 63T 
protein subunits, 148 
cell fractionation, 510-51 1 
interactions see Macromolecular interactions 
polymerisation, 56, 84-87 
shape specification, 63-64, 64F 
subunits, 62F, 63F, 148-152 
see also specific types 
Macrophage(s) 

activation, 1594-1595 

helper T-cells (T H ), 1575, 1593F, 1595F 
lipopolysaccharide, 1530F 
recruitment of cells, 1534 
apoptosis, 1117 
cell crawling, 1036 
effect of NO, 888 
foam cells, 1500 
function, 1452 
HIV infection, 1581 

interleukin secretion and sensitivity, 1598T 
leprosy, 1593-1594 
lifespan and turnover, 1460 
lineage and formation, 1457F 
lung, 1434-1436 
NADPH oxidase complex, 1532 
phagocytosis, 787, 788F, 1531-1532 
antibody-activated, 1555, 1555F, 1557T 
Mycobacterium tuberculosis, 1507-1508 
Salmonella enterica, 1533 
receptors, 1531-1532 
respiratory tract, 1434-1436 
Toll-like receptors (TLR), 1531 
tuberculosis, 1507-1508, 1593-1594 
see also Antigen-presenting cells 
Macrophage-colony-stimulating factor (M-CSF), 
1460T 
macrophage production role, 1460-1461 
Macropinosomes, 1508 
Mad2 protein, spindle assembly checkpoint, 1 088, 

1088F 
Mad cow disease see Bovine spongiform 

encephalopathy (BSE) 
MADS box gene regulatory proteins, 1 400T, 1 41 4 
Magnesium, long-term potentiation, 691 
Magnetic tweezers, 527 
Maintenance methyltransferase, DNA methylation, 
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467 
Maize, 16F, 318T, 1410-141 1, 1411F 
Major groove recognition, 41 9, 41 9F 

DNA-binding proteins, 41 9F, 423, 424F 

helix-turn-helix motif, 419, 420F 

see also DNA structure 
Major histocompatibility complex see MHC (major 

histocompatibility complex) 
Malaria, 1494-1495, 1495, 1495F 

spread, 1501-1502 

see also Plasmodium falciparum 
Malate, 122F, 123F 
Malate dehydrogenase, 1 23F 
MALDI-TOF mass spectrometry, 519-520 
Malignancy see Cancer 
Mammalian cell(s) 

cell culture, cell cycle analysis, 1059, 1059F 

chemical composition, 63T 

DNA replication, 281 F, 283-284 

mitochondrial DNA, 859 

mRNA editing, 484 

replicative senescence, 1059 
Mammalian development, 1378-1383 

branching morphogenesis (of lung), 1381-1382, 
1382F 

early stages and extraembryonic membranes, 
1379-1380, 1379F 

embryonic stem cells (ES cells), 1 380-1 381 , 
1381F 

left-right asymmetry, 1376-1377, 1376F 

regulative properties of early embryo, 1380 

see also Mouse development; individual species 
Mammalian genetics 

gene control regions, 450-453 

gene regulatory proteins, 41 8T 

genetic conservation, 39-40 

imprinting, 468-470 
Mammary gland 

alveoli, 1427-1428, 1427F 

basal lamina, 1428 

development and regression cycles, 1426-1428, 
1427F 

myoepithelial cells, 1427F 
Mannan-binding lectins, 1529 
Mannose-6-phosphate (M6P) 

lysosomal import, 783-784 

lysosomal sorting, 785, 786F 

receptors, 755, 784 

structure, 784F 
Mannose-6-phosphate (M6P) receptor(s), 755, 784 
MAP2 protein, 995F, 1001, 1001F 
MAP-kinase(s), 928, 928-931, 929FF, 936F 

EGF effects, 929F 

Erk,929,929F 

evolution, 177F 

neuronal, 930 

NGF effects, 930 

plants, 958 

see also Ras GTPase(s) 
MAP-kinase-kinase (MAPKK), 929, 929FF 
MAP-kinase-kinase-kinase (MAPKKK), 929, 929F 
MAP-kinase signaling pathway, 928-931, 929FF, 
1103 

see also specific components 
MAPs see Microtubule-associated proteins (MAPs) 
Maps (brain), 1391-1393, 1392F 

retinotectal, 1 391 F, 1392-1394 
Marfans syndrome, 1 190-1 191 
Marine invertebrates, fertilization in, 1297 
Marker genes see Reporter gene(s) 
Mass spectrometry 

accuracy, 519 

liquid chromatography combination, 522 

MALDI-TOF, 519-520 

membrane composition, 624 

principies, 520F 

protein analysis, 519-521, 520F 

tandem, 519-521 
Mast cells, 1451,1556 

histamine secretion, 804, 804F, 1557F 

interleukin secretion, 1598T 
Master regulatory proteins, 1362 
Matocl protein, 456 
Mata2 protein, 41 8T, 425 F, 456 
Maternal inheritance 

chloroplast genome, 866, 866F 

mitochondrial genome, 865F, 866 
Mathematical modelling, 35-36, 35F 
Mating factors (yeast), 455-456 
Mating type (yeast), 455-456 



Mating-type cassette, 455-456, 457F 
Mating-type switching, 455-456, 456F, 457F, 458F 

see also Gene conversion; Gene silencing 
Matrix-assisted laser desorption ionization-time-of- 
flight (MALDI-TOF) spectrometry, 
519-520 
Matrix metalloproteinases (MMPs), 1 194 
Matrix receptors, 1169 

integrinsseelntegrin(s) 
Mature naive B cell, 1 553 
Mbl protein, 989, 990F 
M-Cdk f 1062, 1066 

activation by dephosphorylation, 1074-1075, 

1074F 
entry into mitosis, 1071, 1074 
microtubule dynamics and, 1080 
spindle assembly role, 1078-1079, 1079F, 1080 
Mem proteins, 289F, 456F, 1068 
M-cyclins, 1062, 1064, 1074, 1087, 1101 
Mdm2 protein, 1 1 05, 1 1 06F, 1 1 07 
Mechanoreceptor(s), mechanosensory bristles, 

1356F 
Mef2 genes/proteins, 464F 
Megakaryocytes, 1099, 1453T, 1454, 1455, 1457F 

see also Blood cells; Platelet(s) 
Meiosis, 34, 554F, 1090-1092, 1269, 1272-1274, 
1272-1281 
checkpoints, 1281 
errors, 1278-1279 

evolutionary conservation, 1280, 1286 
genetic diversity and, 1271, 1279F 
homolog pairing see under Homologous 

chromosomes (homologs) 
homolog recombination/crossing-over see under 
Homologous recombination 
(crossing-over) 
homolog segregation, 1276-1278, 1277F, 

1278-1279 
lampbrush chromosomes, 234F 
meiosis 1, 1090-1092, 1 091 F, 1273-1274, 1273F 
kinetochores, 1277-1278 
prophase I stages, 1275-1276, 1276F 
specific protein complexes, 1278, 1280 
meiosis II, 1090, 1091F, 1092, 1273F, 1274, 1278 
meiotic S phase, 1090 
micromanipulation experiments, 1278 
mitosis vs., 1091F, 1272, 1273F, 1276-1277, 

1277F 
oocytes, 1280, 1288, 1290, 1292 
regulation, 1280-1281 
sex differences, 1 280-1 281 
timing differences, 1 280, 1 281 F 
see also Mitosis 
Meiotic recombination see under Homologous 

recombination (crossing-over) 
Mek kinase, 929F 
Melanocytes, 1418F 
fish, 1023-1024 
lysosomal exocytosis, 786 
melanosomes see Melanosomes 
Melanocyte stimulating hormone (MSH), 

production, 803F 
Melanoma, definition, 1206 
Melanosomes, 786, 1022, 1022F, 1024F 
Melatonin, circadian clock, 461 
Melting temperature, 537F 
Membrane(s) 
composition 

fatty acids, 58-59 

lipid(s), 61 7-629, 645, 645F 

see also Lipid bilayer(s); Phospholipid(s); 
individual lipids 
lipid to protein ratio, 629 
phosphoinositides, 757 
proteins see Membrane protein(s) 
disruption, 510 
electrical properties, 667-692 

see also Membrane potential 
fluidity, 621-622 
fusion, 763F, 764, 768, 769F 
intracellular, 26, 169, 697-698, 697T 
intracellular pathogen strategies, 1511-1514, 

151 IF 
liver cell (hepatocyte), 697T 
pancreatic exocrine cell, 697T 
strength, 646-648 
structure, 61 7-648, 61 7-649, 61 8F 

see also Lipid bilayer(s) 
transport across, 651 -692 

impermeant molecules, 597-598 



see also Membrane transport; Vesicular 
transport 
viscosity, 644 

see also Lipid bilayer(s); Lipid raft(s); Plasma 
membrane; specific types/locations 
Membrane attack complex, 1 529, 1 529F 
Membrane potential, 653, 669, 671 F, 821 F 

mitochondrial inner membrane, 820-821, 821 F 
resting, 669, 669-671, 671 
Membrane protein(s), 629-649, 630F 
amounts, 629 

association with lipid bilayer, 629-630, 631 F 
cell surface exposure, 630 
diffusion, 642-645, 643F, 646-648 
endoplasmic reticulum, 468 
glycosylation, 635-636 
glycosylphosphatidylinositol anchor, 742-743, 

742F 
integral, 630 

interchain disulfide bonds, 636 
lipid-linked, 630 

localization, 468-487, 645, 645F 
multicomponent complexes, 642-643 
peripheral, 630 
single-particle tracking, 644 
solubility, 636-640 
structure, 135 

transmembrane see Transmembrane protein(s) 
see also specific types/proteins 
Membrane shuttle, mannose-6-phosphate (M6P) 

receptor(s), 784 
Membrane trafficking, intracellular pathogen 

strategies, 1511-1514, 151 1 F 
Membrane transport 
importance, 651 
intracellular, 750-766 
principles, 651-654 
of proteins, 695-736 

see also Endoplasmic reticulum (ER); Protein 
sorting; Protein translocation; Protein 
transport 
of small molecules, 651-692 

carrier proteins and active transport, 9-10 
Thermotoga maritima, 10F 
see also Active transport; Carrier protein(s); 
Channel protein(s); Ion channel(s) 
vesicular traffic see Vesicular transport 
see also Active transport; Channel protein(s) 
Memory B cells, 1546 

Memory cells, adaptive immunity, 1546, 1546F 
Memory, synaptic plasticity, 1396 
p-Mercaptoethanol, 517 
Meristem(s), 956, 1195, 1403 
apical see Apical meristem 
flower, 1413 

maintenance, 1409-1410, 141 OF 
plasmodesmata, 1163 
regulation by CLV1 , 956, 957F 
sequential development, 1403 
size, 1409-1410 
MERRF, 866 

Mesenchymal cells ("immature" fibroblasts), 1468 
Mesenchymal stem cells, 1468 
Mesenchyme, 1 141 

Mesoderm, 1307, 1307F, 1335, 1365, 1445 
Mesophyll cells, 846 
Mesophyll (parenchyma), cells, 846 
Messenger RNA (mRNA), 4, 331, 334 
cap snatching, 1517 
degradation, 485 

gene expression regulation, 379F, 41 5, 

492-493 
nonsense-mediated decay see Nonsense- 
mediated mRNA decay 
pathways, 492, 492F 
RNA interference, 495 
eucaryotic, 346F, 487-488 

3' end generation, 357-358, 357F, 482-483 

see also Polyadenylation (mRNA 3' end) 
5'-caps see 5' Capping of eucaryotic mRNA 
cDNA clones, 543F, 544 
editing, 483-485 

expression levels, human cell types, 41 2 
internal ribosome entry site (IRES), 491, 491 F, 

1517 
localization, 486-487, 487F 
nuclear export see under Nuclear-cytoplasmic 

transport 
processing see RNA processing (eucaryotic) 
splicing see Splicing 



Page numbers in boldface refer to a major text discussion of the entry; page numbers with an F refer to a figure, with an FF to figures that follow consecutively; 
page numbers with a T refer to a table; vs means compare/comparison. 



1:30 



INDEX 



translation start sites, 379-380, 380 
untranslated regions, 487, 488 

function, 336T 

localization by the cytoskeleton, 1022-1023 

polyadenylation see Polyadenylation (mRNA 3' 
end) 

procaryotic, 346F, 380, 381 F, 488 

ribosome interactions see Messenger 
RNA-ribosome interactions 

splicing see Splicing 

stability, 492, 492-493 

transfer RNA interactions, 180-181 

translation signals, 379-380, 383 

see also Transcription; Translation 
Messenger RNA-ribosome interactions 

path through, 376F 

ribosome binding-site, 375F, 376 
bacterial mRNA, 381 F 
bacterial Shine-Dalgarno sequence, 380, 

381 F, 488 
internal ribosome entry site (IRES), 491, 491 F, 
1517 

see also Protein synthesis; Ribosome(s) 
Messenger RNA-transfer RNA interactions, 180-181 

see also Transfer RNA (tRNA); Translation 
Metabolic balance, 102 
Metabolic map(s), 67F, 102F 
Metabolic pathway(s) 

enzyme catalysis, 66F 

regulation, 169-171, 170F 

evolution, 870-872 

interrelationship, 67F, 100 

organization, 66, 66F, 101, 102F, 102FF 

see also Biosynthesis; Catabolism; specific 
pathways 
Metabolic rate enhancement, 168-169, 169 
Metabolism, 66 

activated carrier molecules, 79-86, 79F 

anabolism/biosynthetic see Biosynthesis 

balance, 102 

catabolism see Catabolism 

cooperation, 102 

rate increases by multienzyme complexes, 
168-169 

regulation, 101-102 

tissue differences, 102 

see 0/50 Metabolic pathway(s) 
Metal affinity chromatography, 514 
Metal-shadowing, electron microscopy, 609, 61 OF 
Metamorphosis, Drosophila, 1329, 1352 
Metaphase, 1054, 1072F 

chromosome movement, 1086 

chromosome structure, 243, 243F 

microtubule dynamics, 1080 

see also Mitotic chromosome(s) 
Metaphase plate, 1085 
Metaphase-to-anaphase transition, 1061, 1066, 

1071,1087 
Metastases (secondary tumors), 1206, 1206F, 1220, 
1220 

barriers, 1220, 1250F 

colonization of distant tissues, 1220, 1250 

easy and difficult steps, 1 250F 

ECM degradation, 1 1 93, 1 1 94, 1 1 94F 

entry into circulation, 1220, 1 221 F, 1250 

local invasion/invasiveness, 1220, 1249 

monitoring, 1250 

mutations leading to, 1249-1250 

process (steps), 1220, 1221F 
Methanobacterium, 16F 
Methanococcus, 1 6F, 1 8T 
Methionine, 101, 129F 
Methionine aminopeptidases, 395 
Methylated DNA binding proteins, DNA 

methylation, 468 
Methylation 

DNA see DNA methylation 

histone tails, 222, 223F 

rRNA,361,362F 

see also Methyltransferase(s) 
5-Methylcytosine, 428F 

deamination, 301 F, 470 

DNA methylation, 467, 467F 

structure, 301 F 
Methyltransferase(s) 

DNA see DNA methyltransferase(s) 

5' mRNA capping, 346, 347F 
Met repressor, p sheet motif, 423F 
MHC (major histocompatibility complex), 1 397F, 

1 569-1 589, 1571,1 575, 1 575, 1 576F 



class restriction, 1579-1580 
dendritic cells, 1571 

H-2 (histocompatibitity-2) antigen, 1575 
human genes, 1577F, 1588 
invariant pocket, 1578, 1579F 
MHC class 1,1575,1 577-1 578, 1 580T, 
1581-1582, 1599F 
coreceptor recognition, 1580-1581, 1 581 F 
expression on viral-infected cells, 1535-1536 
nonclassical, 1577 

peptide loading, 1581-1582, 1583F 
recognition by coreceptors, 1581 F 
recognition by natural killer cells, 1 535-1 536 
structure, 1576F, 1577-1578, 1578F, 1579F 
subtypes, 1576F 
MHC class II, 1575, 1577, 1577-1578, 1580T, 
1581F, 1583, 1599F 
coreceptor recognition, 1581, 1581 F 
peptide loading, 1583-1585 
recognition by coreceptors, 1581 F 
structure, 1576F, 1577 
subtypes, 1576F 
peptide-binding, 1574F, 1577-1579, 1579F, 
1580F 
see also Protein-protein interactions 
polymorphism, 1576, 1588 
restriction, 1406, 1581, 1584-1585 
specificity pocket, 1577-1578, 1579F 
Tcell interactions, 1569-1589 
Tcell selection, 1585-1586 
transplantation reactions, 1575-1576 
Micelle(s), 11 5F, 620, 621 F, 637 
Michaelis-Menten equation, 162F 
Microbial flora 

normal, 1486, 1501 
pathogenic see Pathogen(s) 
see also Bacteria 
Microdissection techniques, 502, 503F 
Microelectrode(s), intracellular ion measurement, 

596 
Microfibrils, 1 1 90-1 191,1 197-1 1 98, 1 1 97F 
Microfilaments see Actin/actin filaments 
(^-Microglobulin, MHC protein/antigen 

interactions, 1580F 
Microinjection, impermeant molecules, 597-598, 

598F 
MicroRNA (miRNA), 336, 493-495, 494F 
Microsatellite DNA, hypervariable (VNTRs), 547F 
Microscopy, 579 

cell cycle analysis, 1059 
cell structure analysis, 579-591 
electron microscopy see Electron microscopy 
historical aspects, 501, 501 F 
image processing, 583-584, 584F 
light microscopy see Light microscopy 
resolving power, 581 F 
sample preparation, 554-555, 585 
fixation, 585, 605, 605F 
sectioning, 585, 585F, 605 
staining, 554-555, 579, 585, 585F 
scale, 580, 580F 
in situ hybridization, 585, 586F 
Microsome(s), 51 1,726, 726F 
Microsporidia, 1510-1511, 151 IF 
Microsurgery, C elegans, 1325 
Microtome, 585, 585F 
Microtubule(s), 965F 

accessory proteins, 992, 995F 
assembly, 973, 974, 975 

see also Microtubule(s),nucleation; Tubulin 
bundles, organization by MAPs, 1001, 1002F 
capping, 1003 
cell division and, 969 

see also mitosis (below) 
cell polarization, communication with actin 

cytoskeleton, 1046 
cellular location, 969 
configuration, astral, 993 
depolymerization, 1085, 1085F, 1089, 1201 
dynamic instability, 969, 980, 981 F, 982F, 1080, 
1081F 
centrosome role, 1046 
endoplasmic reticulum organization, 1021 
fibroblasts, 1048F 
Golgi apparatus organization, 1021 
growth rate of plus/minus ends, 975-976, 976F 
interpolar, 1035, 1035F 
kinesin binding site, 1014F 
mitochondria association, 81 5, 81 6F 
mitosis, 1075-1076, 1076F 



chromosome effects on, 1081, 1082F 
cytokinesis and, 1095-1097, 1096F, 1097F 
instability/dynamics, 1080, 1081 F 
kinetochore attachment, 1082, 1083F, 1084F 
spindle formation, 1035, 1035F, 1075-1077, 

1079 
see also Mitosis; Mitotic spindle 
motor protein types, 1014-1015 

see also Kinesin(s) 
neurons, 1021, 1047-1048, 1048F 
nucleation, 992, 992F, 1034 
pathogen movement based on, 1516-1517 
plant cells, 1097-1098, 1200-1202, 1200F, 

1201F 
plus and minus ends, 974-975, 978F 
properties, 965 
structure, 968, 968F, 974-975, 974F 

see also Tubulin 
subunits, 971 

treadmilling, 976-982, 977FF 
Microtubule-associated proteins (MAPs), 995F, 1080 
Microtubule-dependent motor protein(s), 
1014-1015 
see also Kinesin(s) 
Microtubule flux, 1085, 1086F, 1089 
Microtubule-organising center (MTOC), 992 

see also Centrosome 
Microvilli 

actin filament nucleation, 996 
bundle of actin filaments, 969, 970F 
egg (ovum), 1298, 1298F 
membrane transport, 659 
structure, 1008F 
Mid-blastula transition, 1365 
Midbody (cytokinesis), 1094, 1094F 
Migration see Cell migration 
Milk, production and secretion, 1427, 1427F 
Minimum gene compliment, 10,570-571 
Minor groove-binding, DNA-binding proteins, 423, 

424F 
Miranda protein, 1361 

Mismatch conversion, gene conversion, 31 5F 
Mismatch proofreading, 277-279, 277F, 315-316, 

315F 
Mismatch repair, 272 

deaminated methylated cytosines, 302-303 
exonucleolytic proofreading, 270, 270F 
strand-directed mismatch repair, 272, 276-278, 
277F 
dysfunction and cancer, 277-278, 295-296, 
295T 
see also DNA repair disorders 
Mitochondria, 29F, 696, 81 5-827 

apoptosis role, 1121, 1121F, 1122F, 1124F, 1125F 
membrane potential, 1118 
protein release, 1121, 1123 
[3 barrel proteins, 635 
biogenesis, 856, 856F, 866-867, 867F 
cardiolipin synthesis, 867-868, 868F 
cell aging, 868 
cell-free systems, 511 
cell metabolism, 838-839, 839F 
distribution during cytokinesis, 1098 
DNA see Mitochondrial genome 
energy conversion, 81 5-827 

citric acid cycle see Citric acid cycle 
electron transport see Mitochondrial electron 

transport chain(s)(s) 
energetics, 819, 81 9F 
high-energy electrons, generation, 817 
oxidation of food molecules, 97F, 81 9F 
oxidative phosphorylation see Oxidative 

phosphorylation 
respiratory enzyme complex, 820 
see also ATP synthesis 
evolution, 27-28, 29F, 699, 700F, 859-860, 860F, 
874, 875F 
bacterial resemblance, 857, 859 
fractionation, 511, 817F 

see also Cell fractionation 
genome see Mitochondrial genome 
glutathione peroxidase, 868 
growth and division, 857-858, 858F 
heat production, 838 

high ATPrADP ratio maintenance, 823-824 
historical aspects, 815-816 
import into, 745 

see also Mitochondrial protein import 
lipid import, 867-868 
localization, 815, 81 6F 
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microtubule association, 81 5, 81 6F 
nuclear-encoded tissue-specific proteins, 867 

import see Mitochondrial protein import 
poisons, 834, 836 

protein import see Mitochondrial protein import 
protein synthesis, 856-857, 856F, 867, 869F 

see also Mitochondrial genome 
RNA editing, 483, 484F 
structure see Mitochondrial structure 
superoxide dismutase, 868 
urea cycle, 867 
variability with cell type, 857 
volume, 697T 
Mitochondrial diseases, 866 
Mitochondrial DNAsee Mitochondrial genome 
Mitochondrial electron transport chain(s){s), 814, 
819-820, 827F, 831-839 
chloroplasts vs., 814, 81 5F, 853-854, 855F 
electron carriers, 829, 831 

coenzyme Q (ubiquinone), 831, 832F 

cytochromes, 829, 830F, 831 
see also Cytochrome(s) 

iron-sulfur proteins, 830F, 831 

random collisions, 834 

spectroscopy, 831, 832F 

see also Respiratory enzyme complexes 
electron tunneling, 823-824 
energetics, 819, 820F 
energy storage, 821 
enzyme complexes see Respiratory enzyme 

complexes 
natural (brown fat), 838 
redox potentials, 835, 835F 
respiratory control, 837-838 
reverse electron flow, 838 
uncoupling, 836 

see also Oxidative phosphorylation 
Mitochondrial genome, 856-857, 861 F, 868-870, 
869F 
copy number, 858 
diversity, 859, 861 F 

copy number, 85 7T, 858 

sizes, 859F 
DNA replication, 856-857, 857F, 858 
evolution, 859-860, 861 F 

maintenance, 868-870 

molecular clock rate, 862 
gene expression, 483, 484F, 861-862 

transcription, 863 
genetic code variation, 367, 862, 862T 
human, 861 -862, 861 F 

mutations, 866 
Mitochondrial diseases, 866 
non-Mendelian inheritance, 864-865, 865F, 866 
nucleoids, 859 
Mitochondrial protein import, 713-719, 716FF 
chaperone proteins, 715, 71 6F, 717, 717 
contact sites, 715-716, 715F, 718F 
energy use, 716-717, 71 6F 
into inner membrane/intermembrane space, 

717-719, 718F 
mechanisms, 716FF 
OXA complex, 714F, 715, 718, 718F 
precursor proteins, 71 3, 71 5 
protein translocators, 714-715 
SAM complex, 715 
signal sequences, 714, 714F 
stop-transfer sequence, 718 
TIM complex see TIM complex 
TOM complex see TOM complex 
Mitochondrial reticulum, in yeast, 857, 858F 
Mitochondrial structure, 71 3, 71 3F, 815-81 7, 81 7F, 
81 8F 
chloroplasts vs., 841-842, 843F 
cristae,817,817F 
inner membrane, 713, 817, 81 7F, 818F, 821 F 

electrochemical gradient, 820-821, 821 F 

electron tunneling, 834-835 

respiratory enzyme complexes, 820, 831-832 

transport across, 822 
intermembrane space, 71 3, 81 6, 81 8F 
matrix, 81 6, 81 8F, 853 
matrix space, 713 
membrane composition, 697T 
outer membrane, 713, 713, 816, 81 7F, 81 8F 
porin insertion, 717, 717F 
role in respiration, 821 

see also Electron transport chain; Oxidative 
phosphorylation 
see also Mitochondrial genome 



Mitogen(s), 1102-1103 
competition for, 1110 
G r Cdk stimulation, 1103-1105 
d/S-Cdk stimulation, 1103-1105 
intracellular signaling pathways, 1 1 03, 1 1 07, 

1107-1108, 11 08F 
specificities, 1102-1103 
see also Growth factors; specific mitogens 
Mitogen-activated protein kinase(s) see MAP- 

kinase(s) 
Mitogen-activated protein kinase signaling see 

MAP-kinase signaling pathway 
Mitosis, 208F, 1054, 1055F, 1071-1092 
in absence of cytokinesis, 1099-1100 
APC/C and completion, 1087, 1088F 
asymmetric cell division, 1099 
Cdk inactivation, 1100-1 101 
centriole function, 1076 
centrosome duplication, 1078 
checkpoints, 1061, 1066, 1071, 1088 

see also Cell cycle control 
chromosome packaging, 1070-1071, 1090 
chromosome segregation, 1089-1090 
chromosome structure see Mitotic 

chromosome(s) 
condensinand, 1075 
entry into 

Aurora kinases, 1074, 1084 
M-Cdkand, 1071, 1074-1075, 1078-1079 
microtubule dynamics, 1080, 1081F 
Polo-like kinases, 1074 
kinetochoreand, 1082-1083 
meiosis vs., 1 091 F, 1 272, 1 273F, 1 276-1 277, 

1277F 
microtubule instability and, 1080, 1081 F 
mitotic spindle see Mitotic spindle 
nuclear lamina, dissociation, 710-712, 712F 
phases, 1071,1 072FF 

anaphase, 1054, 1073F, 1089-1090, 1089F 
metaphase, 243, 243F, 1054, 1072F 
prometaphase, 1072F 
prophase, 1054, 1072F 
telophase, 1055, 1073F, 1090 
see also specific phases 
see also Cell cycle; Cytokinesis; Meiosis; 
Microtubule(s) 
Mitotic chromosome(s), 208-209, 208F, 209, 243, 
243 F, 244F,245 
bipolar spindle assembly, 1081-1082 
centrosome duplication, 1078 
condensation, 243, 244F, 1075F 
chromatin condensation, 1070 
condensin role, 1075 
M-Cdk and, 1071 
metaphase, 243, 243F 
separation/segregation, 865, 865F, 1087, 1087F, 

1089-1090 
sister chromatids, 243, 243FF 

bi-orientation mechanism, 1083-1085, 1084F 
separation, 1087, 1087F, 1088, 1088F 
spindle attachment, 1082-1083 
spindle attachment, 1082-1083, 1084F 

bi-orientation mechanism, 1083-1085, 1084F 
forces acting on, 1085-1087, 1085F, 1086F, 

1089 
unattached chromosomes and, 1088 
see also Kinetochore 
Mitotic index, 1059 
Mitotic spindle, 208F, 1 054 

assembly, 71 2F, 1077-1082, 1077F, 1078F, 1079F, 

1082F, 1088 
asymmetric cell division, 1099, 1099F 
chromosome attachment, 1082-1083, 1084F 
bi-orientation mechanism, 1083-1085, 1084F 
forces acting on, 1085-1087, 1085F, 1089 
unattached chromosomes and, 1088 
see also Kinetochore 
cytokinesis plane of division, 1095-1097 
disassembly, 1054-1055, 1090 
intermediate filaments, role, 966-967 
lamellipodia vs., 1039 
length, 1079 

as microtubule-based machine, 1075-1077, 
1076F, 1079 
assembly, 1034-1035, 1035F 
bipolar array, 1034-1035, 1075-1076, 1077, 

1081 
classes, 1034, 1035F 
instability, 1035, 1080, 1081 F 
see also Microtubule^); Motor protein(s) 



photoactivation visualization, 594-595, 595F 
see also Mitotic chromosome(s) 
Mobile genetic elements, 316 
bacterial, 318F 

Drosophila melanogaster, 31 8T 
human genome, 207F, 31 8T 
Alu family, 250F, 323FF 
DNA transposons, 207F 
LINEs, 207F 

retroviral-like elements, 207F 
SINEs, 207F 
insertion mechanisms see Site-specific 

recombination 
see also Repetitive DNA;Transposon(s);Virus(es); 
specific elements 
Model organism(s), 25 

Arabidopsis, 36, 36F, 1398-1399 
Caenorhabditis elegans, 36-37, 367 F 
Drosophila melanogaster, 37-38, 37F, 

1328-1329 
Eco//, 24-25, 25 F 
humans, 40, 43F 
mouse, 39-40, 43F 
yeast, 33-34, 34F 
see also individual models 
Modular organization, Eve gene, 1 339 
Moesin, 1009 
Molarity, definition, 46 
Molecular chaperones see Chaperones 
Molecular clock(s), 248-249, 862 
Molecular evolution, 245-260 
clock hypothesis, 248-249 
mitochondria, 862 
phylogenetics, 247, 247F, 248F 
Molecular mimicry, release factors (translational), 

381,382F 
Molecular motions, 74-75, 75F 
Molecular motors see Motor protein(s) 
Molecular switches, 895, 895-897, 895F 
cell cycle control, 1061, 1065-1066, 1075 
GTPases, 1 79, 1 79F 
Ran GTPase, 708-709 
Rho family GTPases, 1042-1043 
in vesicular coat assembly, 759-760 
see also individual proteins 
Molecular weight, definition, 45 
Molecular weight determination, centrifugation 

techniques, 522-523 
Molecule(s), 48 
hydrophilic, 52 
hydrophobic, 52 
large see Macromolecule(s) 
representation, 51 

small cellular organic see Organic molecules 
{small, cellular) 
Mollusk, eye development, Pax-6, 1306F 
Molten globule, 387-388, 388F 
Mom (more mesoderm) mutants, 1325 
Mono-allelic expression, epigenetics, 473 
Monoclonal antibodies, 508-509, 509F 
in cancer therapy, 1 260 
immunofluorescence microscopy, 587F, 588F, 
589 
Monoclonal response (lymphocyte), 1545 
Monocytes, 1452, 1452F, 1453T, 1457F 
Monomeric GTPases see under GTP-binding 

proteins (GTPases) 
Monosaccharides, 55, 57F, 1 1 2F 
Monoubiquitinylation, 926 
Morphogenic gradient(s) 

cell fate dependent on position, 886, 886F 
definition, 1318 

Drosophila egg, 1333-1335, 1334F, 1335F 
Drosophila wing imaginal disc, 1352 
see also specific morphogens 
Morula, 1379, 1379F 
Mosquitoes, Plasmodium transmission, 1494, 

1501-1502 

Motor neuron(s), 688, 688F, 1385, 1386F 

Motor protein(s), 181-182, 1010-1025 

actin-based see Myosin(s) 

ATP hydrolysis, 1016-1019, 1017F 

see also ATP hydrolysis 
biological functions, 181, 1010, 1021-1022 
axonal transport, 1021, 1048F 
microtubule dynamics and, 1080 
mitotic spindle assembly/function, 1077, 

1077F, 1079, 1079F 
mitotic spindle movements, 1034-1035 
muscle contraction, 1026-1030 
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organelle transport, 1021-1022 
RNA localization, 487F, 1022-1023 
spindle pole separation, 1089 
see also Mitosis; Mitotic spindle 
conformational changes, 1016-1019 
evolutionary origins, 1015-1016 
kinetics, 1020-1021 
mechanisms, 1016-1020 
microtubule-based, 1014-1015 
see also Dynein(s); Kinesin(s) 
regulation, 1023-1025 
see also specific types 
Mouse 

development see Mouse development 
genome 

chromosomes, 293-294 
comparative genomics, vs human, 249F 
conserved synteny, 207, 208F, 250 
imprinting, 469F, 470F 

see also Genomic imprinting 
transposons, 323, 323F 
knockouts 

production, 567F 

transgenic organism(s), 566-568, 568F 
see also Gene knockouts 
as model organism, 39-40, 43F, 1 378 
chimeras, 1380, 1380F, 1381 F 
genetics, vs. humans, 1378 
transgenic mouse production, 567F 
Mouse development, 1378-1383 

branching morphogenesis (of lung), 1381-1382, 

1382F 
early stages and extraembryonic membranes, 

1379 
egg, 1379 

embryo, 1379, 1379F, 1380 
embryonic stem cells (ES cells), 1380-1381, 
1381F 
M phase of cell cycle see under Cell cycle 
MreB protein, 989 
Mrf4 gene regulatory protein, 464 
MRNA see Messenger RNA 
MSH, production, 803F 
MTOR, protein kinase, 934, 934F 
mu chain, IgM, 1553 
Mucociliary escalator, 1435, 1503 
Mucus 

host defense mechanism, 1 502, 1 525 
secretion, 772, 774F, 1435, 1437 
Multicellularity 

cell-cell interactions, 1131-1132 

see also Cell adhesion; Cell junction(s); 
Extracellular matrix (ECM) 
cell specialization see Cell differentiation 
evolution, 955 

see also Development; Tissue(s) 
Multidrug resistance, 665-666 
Multienzyme complexes, 168-169, 169, 169F 

see also Protein complexes (assemblies) 
Multinucleated cells, mitosis without cytokinesis, 

1099 
Multipass proteins see Transmembrane protein(s) 
Multiple sclerosis, demyelination, 678 
Multipotent stem cells see Stem cell(s) 
Multispecies conserved sequences, 252-253, 252F 
Multiubiquitylation, 795 
Multivalent antigen(s), 1557 
Multivesicular bodies, 795-797, 795F, 796F, 797F 
Mumps virus, 1496F 
Muntjac deer, chromosome differences, 204-205, 

205F 
Muscarinic acetylcholine receptors (mAChRs), 916 
Muscle, 1463-1467 
basal lamina, 1164F 
cardiac see Cardiac muscle 
cells see Muscle cell(s) 
classes, 1463, 1463F 
contraction, 1026-1030 

control by troponin, 1029, 1030F 
role of Ca 2+ , 1028-1030 
sliding-filament model, 1026-1028, 1028F 
development, 464, 464F, 1464 

see also Myoblast(s); Myogenic protein(s) 
dilator, 1464 
fibers see Muscle fiber(s) 
heart see Cardiac muscle 
insect flight muscle, 1027F 
motor neuron innervation, 1385 
skeletal see Skeletal muscle 
stem cell differentiation, 1425 



structure, 1026, 1027F, 1028F, 1029, 1029F 
Muscle cell(s) 

cardiac, 1463-1464 

differentiation, 464, 464F, 1464, 1480 
see also Myoblast(s); Myogenic protein(s) 

protein isoforms, 1465 

skeletal see Skeletal muscle 

smooth, 888, 1450, 1463-1464, 1463F 

syncytium, 1464 

uncoordinated growth and division, 1 109 

visualizing, 1425 
Muscle fiber(s), 1026F, 1064, 1464-1465, 1465F 
Muscular atrophy (inherited spinal), SMN protein 

mutation, 364-365 
Muscular dystrophy, 1005, 1 169, 1173, 1466 
Mutagenesis 

carcinogenesis, 1208 

gene expression analysis, 556 

mechanisms, 296, 298f, 303F, 528F, 556 
see also Mobile genetic elements 

random, 553, 556 

site-directed, 565, 565F 

in vitro mutagenesis, 575 

yeast, 570 

see a/so Mutation(s) 
Mutation(s), 16-17,263, 528F 

in cancer, 265, 1106, 1208, 1209, 1209-1210, 
1217,1261 

conditional, 528F 

temperature-sensitive, 557, 557F, 1057, 1057F 

consequences, 264, 570 

cancer development, 1106, 1209-1210, 1217 

see also Cancer; Cancer-critical genes 
evolutionary innovation, 18, 19F, 246 
see also Genome evolution 

deletion, 528F 

destabilizing/deleterious, 265, 1214 

dominant, 558 

dominant negative see Dominant negative 
mutation(s) 

elimination by natural selection, 264, 265 

gain-of-function, 528F, 557-558, 564F, 1231 

germ-line, 264, 265 

histones, 213 

identification, 556-557, 559-560 
see also Genetic screens 

intragenic, 18, 19F 

inversion, 528F 

lethal, 528F 

loss-of-function, 528F, 557-558, 1231 

mechanisms, 296, 298F 
see also Mutagenesis 

neutral mutations, 17 

null, 528F 

point mutation, base substitution, 246 

protein, effect on, 558F 

rate see Mutation rates 

recessive, 558 

screening, IVF and, 1 302 

silent, 264 

somatic, 265, 1208, 1209 

suppressor, 528F 

synonymous, 247 

translocations, 528F, 1261 

see also specific types 
Mutation rates, 246, 263-264 

Caenorhabditis elegans, 264 

DNA repair defects, 296 

Escherichia coli, 263-264 

evolutionary significance, 248-249, 264 

fibrinopeptides, 264 

germ-line, 264 

increases, 265 

see also Cancer; Genetic instability 

limitation on number of essential genes, 265 

mammals, 264 

measurement, 264 

mitochondrial DNA, 862 

natural selection effects, 264, 265 

underestimation, 263-264 
Mutator gene(s), 276-277 
MutH protein, 277F 
MutL protein, 277F 
MutS protein, 277F 
Myasthenia gravis, 1549 
Myb gene regulatory proteins, MOOT 
Myc gene/protein, 950 

amplified in carcinoma, 1238F 

Burkitt's lymphoma, 1239 

chromosome translocation activating, 1239 



mitogen activation and, 1 103 

as oncogene, 1107, 1238-1239, 1242, 1242F 
Mycobacterium tuberculosis, 1 487 

genome size, 18T 

host cell membrane traffic and, 1512, 151 2F 

persistence, 1507-1508 

phagocytosis and survival inside phagocytic 
cells, 1507-1508 

phagosome maturation prevention, 1512 

transmission, 1507 
Mycoplasma, 14F, 1500 
Mycoplasma genitaiium, 10, 11F, 18T, 570-571 
Myelin sheath 

action potential propagation, 678-680, 679F 
see also Action potential(s) 

structure, 680F 
Myoblast(s), 464 

cell culture micrograph, 504F 

differentiation, 1464 

fusion, 1464, 1464F 

maintenance, signals, 1466 

persistence as satellite (stem) cells, 1466 

recruitment, muscle fiber growth, 1466 

skeletal muscle fiber formation, 1465-1466 

somite origin, 1 373-1 374 
Myoclonic epilepsy and ragged red fiber disease 

(MERRF), 866 
MyoD gene, 464F 
MyoD gene regulatory proteins, 41 8T, 464, 464F, 

1464 
Myoepithelial cells, 1427F, 1463-1464, 1463F 
Myofibril, 1026, 1027F 
Myofibroblasts, 1468 
Myogenic protein(s), 464, 464F 

see also MyoD gene regulatory proteins 
Myogenin gene regulatory protein, 464 
Myoglobin, 20, 21 F, 144F 
Myosin(s), 1011-1014 

ATP hydrolysis, 1012, 101 7F 

cycle of structural changes, 1016 

enzymatic digestion, 1012 

evolutionary origin, 1015-1016 

force generation, 1016 

motor activity of head, 1 01 2F 

movement speed, 1020 

muscle contraction, 1026-1028 

non-muscle, 1012-1013 

power stroke, 1016 

rigor state, 1019 

structure, 1012, 1013, 101 5F, 1016 

subtypes, 1013, 1013F 

see also individual types 
Myosin 1, 1013, 1013F 
Myosin II, 1011 

cell motility, 1039F, 1041 

contractile ring (cytokinesis), 1093-1094, 1095, 
1095F 

filament arrangement, 1026 

lever arm, 1020, 1020F 

mechanochemical cycles, 1017F 

muscle contraction, 1026-1028 

processivity, 1020 

regulation by phosphorylation, 1024-1025, 
1025F 

structure, 101 IF 

thick filament, 1012F 
Myosin V, 1013, 1020, 1022 
Myosin light-chain kinase (MLCK), 1024, 1025F, 

1030 
Myostatin, 1110, 1111F 

gene mutation, effects, 1455-1466 

muscle growth control, 1465, 1466F 

mutation in mice, 1466F 
Myotendinous junctions, 1170 
Myotonia, voltage-gated cation channels, 682 



N 

W-acetylneuraminic acid see Sialic acid (NANA) 
NAC gene regulatory proteins, MOOT 
NADH dehydrogenase complex, 832, 833F, 836 
NADH/NAD+, 78, 82-83, 92F 

citric acid cycle, 98, 817 

electron carrier, 82-83, 818-819, 81 8F, 819-820 
see also NADH/NAD+ 

functional roles, 83 

generation, 819-820 

glycolysis, 89, 92F, 93F 

production from pyruvate, 96 
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NADPH/NADP+, 78, 82-83, 82F 
electron carriers, 82-83, 82F 
functional role, 83 
photosynthesis, 94 

carbon-fixation, 845, 845F 
evolutionary significance, 872 
noncyclic photophosphorylation, 850-853 
NADPH oxidase complex, 1532 
Naive cells, immune response, 1 546, 1 546F 
Nanos protein, 1283, 1333F, 1334, 1334F 
Nasal placode, 1384F 
Natural immunological tolerance, 1 547 

see also Immunological tolerance 
Natural killer (NK) cell(s), 1453, 1453T, 1535-1536 
activation, interferon-p (IFNP), 1536 
MHC class I recognition, 1 535-1 536 
virus-infected cell destruction, 1 535-1 536 
virus-infected celt recognition, 1536 
Natural selection, 16-17 
alleles, 561-562, 562F 
mutations, 264, 265 

see also Mutation(s) 
protein conformation, 137 
purifying selection, 247 
self-replicating molecules, 404-406 
sexual reproduction and, 1271 
in tumor progression, 1212-1213 
see also Evolution 
N-cadherin (neural cadherin), 1 1 36, 1 1 38T, 1 388 
NCAM (neural cell adhesion molecule), 1388 
Nebulin, 1028, 1028F 
Necrosis, 1115 

Negative feedback (feedback inhibition), 902, 902F 
allostery, 171J72F 

see also Allosteric regulation 
conformational coupling, 171, 172F 
multiple, 170F, 171 
rate of inhibition, 1 72, 1 73F 
regulation, 170F, 171 
transcription circuits, 459, 459F 
Negative selection, thymus, 1586-1587, 1587F 
Negative staining, electron microscopy, 610, 61 1 F 
Negative supercoiling, 344F, 345 
Negative translational control, 488, 490F 
Neisseria, 1 520 

Neisseria gonorrhoeae, 22, 456 
Nematodes see Caenorhabditis elegans 
N-end rule see N-terminal degradation 
Neoplasm (tumor) see Cancer 
Nernst equation, 669, 670 
Nerve cells see Neuron(s) 
Nerve growth factor (NGF), 923F, 923T, 926, 930, 

1109 
Nervous system 

apoptosis, 1115-1116 
cadherin expression, 1 1 36, 1 1 36F 
development see Neural development 
neurons see Neuron(s) 

see also entries beginning neuro-Zneural; specific 
regions/cell types 
Netrin protein, 1049, 1388, 1388F 
Neural cell adhesion molecule (NCAM), 1 146-1 147 
Neural crest cell migration, 1036, 1140, 1140F, 1374, 

1374F 
Neural development, 1383-1397 

axon-dendrite distinction, 1, 1383,1386-1387, 

1387F 
cadherins and, 1 140-1141, 1140F, 1141F 
early embryonic origins, 1 383 

brain and spinal cord (CN5), 1367F 
chick embryo, 1384F 
neural crest, 1374 

peripheral nervous system (PNS), 1384 
see also Neural crest cell migration 
effects of experience, 1395-1396, 1395F, 

1396-1397 
gap junctions, 1161 

growth cones, 1049, 1049F, 1386-1388, 1386F, 
1447F 
guidance, 1 1 40, 1 387-1 388, 1 388F, 1 393F 
integrinsand, 1177 
neural birthdays and diversification, 1385-1386, 

1386F 
neural crest cells see Neural crest cell migration 
neural map formation, 1391-1392, 1391 FF, 
1392F, 1393F 
activity-dependent remodelling, 1393-1395 
neural migration, 1385F 

see also Neural crest cell migration 
neural plate, 1367F, 1370-1371, 1370F 



neural specificity, 1391-1394 
neural tube, 1370, 1370F, 1384, 1385F 
neurite(s), 1390F 
neuroblasts, 1360 

neurogenesis, 1360, 1360F, 1383-1385 
neuron death, 1389-1390 
neurotrophic factors, 1389-1390, 1390F 
neurulation, 1370-1371, 1370F, 1371 
phases, 1383, 1384F 
synapse elimination and remodeling, 
1393-1395, 1394F, 1395F 
synapse formation, 1149F, 1393-1394 
CAMs, 1147-1148 
scaffold proteins, 1148-1149, 11 48F 
Neural maps, 1391-1393, 1391F 
Neural plate, 1161, 1370-1371, 1371F 
Neural retina see Retina 
Neural stem cells, 1478-1479, 1480 
Neural tube, 1140, 1141F, 1370, 1371F, 1385F 
Neuraminidase, subunit association, 143, 143F 
Neurites, definition, 1047 
Neuroblasts, 1360-1361 
Neurodegeneration, 1048 
protein aggregates, 396-397 

see also Prion diseases 
see also specific diseases 
Neurofibromatosis, 1010 
Neurofilaments, 985T, 987, 987F, 1048 
organization, 1005 
see also Intermediate filament(s) 
Neuroligin, synapse formation, 1 147-1 148 
Neuromuscular junction (NMJ), 684F 

acetylcholine receptors, basal lamina and, 1168 
ion channels, 684-686 
regeneration, basal lamina and, 1 1 68-1 1 69, 
1168F 
Neuron(s) 

apoptosis, 1126, 1126F 

character assignment, 1385-1386 

cytoskeleton 

dynamic instability, 1049 

neurofilaments, 985T, 987, 987F, 1005, 1048 

see also Intermediate filament(s) 
organization of microtubules, 1048F 
specialization dependent on cytoskeleton, 
1047-1050 
development, 1383-1397 
migration, 1385F 
specification (neurogenesis), 1360, 1360F, 

1383-1385 
see also Neural development 
electrical properties, 688-690 
see also Action potential(s) 
gap junctions, 1161 
interconnections, 1385-1386 
plasma membrane, 645 
schematic, 675F 
size control, 1109, 1 1 1 OF 
structure 

axons see Axon(s) 

cytoskeleton see cytoskeleton (above) 
dendrites see Dendrite(s) 
function relationship, 675-676, 688F, 1049F 
polarized cells, 806, 806F 
see also Neurotransmitter(s); Synapse(s); specific 
types 
Neuronal doctrine, 504 
Neurospheres, 1478 
Neurospora, gene definition, 481 
Neurotransmitter(s), 682, 683, 684, 882, 888F 
ion channel gating see Transmitter-gated ion 

channel(s) 
see also individual neurotransmitters 
Neutral mutations, 17 
Neutrophil(s), 1451 
apoptosis, 1117 
cell crawling, 1036 

cytoskeleton rearrangement, 969, 969F 
lineage and formation, 1457F, 1460-1461 
NADPH oxidase complex, 1 532 
phagocytosis, 787, 788F, 1531-1532, 1557T 
polarization, 1045F, 1046F 
rapid turnover by apoptosis, 1462 
receptors, 1531-1532 
structure, 1452F 
Toll-like receptors (TLR), 1531 
Newt, limb regeneration, 1480, 1480F 
NFkB proteins, 952-954, 953F, 1530 
NGF see Nerve growth factor (NGF) 
/V-glycanase, misfolded protein degradation, 740F 



Nicotinamide adenine dinucleotide see 

NADH/NAD+ 
Nicotinamide adenine dinucleotide phosphate see 

NADPH/NADP+ 
Nicotinic acetylcholine receptors (nAChRs), 916 
Nidogen, basal lamina, 1 1 65, 11 66, 1 1 67F 
Nitric oxide (NO), 887-889, 888, 1450 
Nitric oxide synthase, 888, 888F 
Nitrocellulose membranes, 519F, 538-539, 539F 
Nitrogen cycle, 100-101 
Nitrogen fixation, 13,13, 15F, 100 
Nitroglycerine, 888 
NK cells see Natural killer (NK) cell(s) 
/v-linked oligosaccharide(s) 

formation, 774-775, 774F 

protein folding function, 739F 

protein glycosylation, 738, 776F 

three-dimensional structure, 777F 
NMDA receptors, 691-692 

NMR spectroscopy, 1 39, 529-530, 529F, 530, 530T 
Nocodazole, 988T, 1021 
Nodal protein, 1376-1377 
Node, in plants, 1398F, 1407 
Nodes of Ranvier, 680, 680F 
NOD proteins, 1 531 -1 532, 1 536 
Noggin protein, 940, 1316T 
Noise, electron microscopy, 61 0-61 1 
Nomanski differential-interference contrast 

microscopy, 583 
Noncoding DNA, 31,204 

genome rich in, 331 

intronsseelntron(s) 

from mitochondria, 860 

repetitive elements see Repetitive DNA 
Noncoding RNA, 360-361 

see also specific RNA types 
Noncovalent interactions, 53-54, 1 1 0-1 1 1 FF 

hydrogen bonds see Hydrogen bonds 

hydrophobic forces, 1 1 1 F 

ionic bonds see Ionic bonds 

macromolecules, 55F, 64, 64F, 126-127, 156, 
158F 

van der Waals attractions see Van der Waals 
forces 

weak interactions, 1 10F 
Noncyclic photophosphorylation, 850-853, 852FF 

ATP synthesis, 94-95, 95 F, 851 

NADPH synthesis, 94, 851 
Nondisjunction, 1236F, 1278-1279 
Non-Hodgkins lymphoma, metastasis, 1206F 
Nonhomologous end-joining, DNA double-strand 

break repair, 302-303, 303F 
Non-Mendelian inheritance, 864-865, 865F, 866, 

866F 
Nonretroviral transposons, 31 8T 
Nonsense-mediated mRNA decay, 385-387, 386F 
Normal microbial flora, 1486, 1501 
Northern blotting, nucleic acid hybridization, 

538-539, 539F 
Nose, embryonic origin, 1 384F, 1 429 
Notch receptor protein, 777, 946, 947, 948F, 

1356-1358 
Notch signaling pathway, 946-948 

angiogenesis, 1449 

animal development, 1316F 

Arabidopsis, absence, 1400 

at Drosophila wing margin, 1 352, 1 352F 

endothelial cells, 1448 

gut cell differentiation, 1439-1440, 1439F 

in neurogenesis, 1360-1361, 1360F 

in non-neural tissue, 1 362 

pancreas, 1444 

in sensory bristle development, 1 356-1 358, 
1356F, 1358FF 

in skin, 1426 

in vertebral limb, 1355 
Notch signal protein, C elegans early embryo, 1324 
Noxa, apoptosis, 1 1 23 
NPCs see Nuclear pore complex(es) 
NSF, 764, 764F, 769F 
N-terminal degradation, 395-396 
Ntrc regulatory protein, DNA looping, 438F 
Nuclear-cytoplasmic transport 

directionality, 709, 709F 

export mechanisms, 708-709, 709F 

import, 705-710 

mechanisms, 708-709, 708F, 709F, 71 OF 
receptors, 707-708, 708F, 71 OF 
regulation, 71 1F 

mRNA export, 329F, 358-360, 359F, 383 
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export-ready mRNA, 329F, 357, 358-359, 359F 
gene expression regulation, 485-486 
HIV, 485-486, 486F 

localization mechanisms, 486-488, 487F 
regulation, 710 
nuclear localization signals (NLSs), 705-707, 

706F 
nuclear pore complexes see Nuclear pore 

complex(es) 
nuclear pores see Nuclear pores 
nucleoporins, 705 
Ran GTPase, 708-709, 709F, 71 OF 
receptors, 708 
regulation, 709-710, 71 1F 
shuttling proteins, 709-710 
Nuclear envelope, 26, 200-201, 201 F, 704 

mitotic cells, 71 2F, 1071, 1079-1080, 1080, 1090 
structure, 201 F, 705F, 706F 
see also Nuclear lamina 
Nuclear export receptor(s), 708 
Nuclear export signal(s), 708 
Nuclear hormone receptor(s) 

gene duplication and divergence, 254, 255F 
gene regulatory proteins, HOOT 
zinc finger proteins, 421-422, 423F 
Nuclear import receptor(s), 707-708, 708F 
Nuclear lamina, 200, 201 F, 704, 705F, 706F 
electron micrograph, 711 F 
mitosis, 71 1-71 2, 71 2F 
structure, 710 
Nuclear lamins, 984, 985T 
Nuclear localization signals (NLSs), 705-707, 706F 
Nuclear magnetic resonance (NMR) spectroscopy 

see NMR spectroscopy 
Nuclear matrix (scaffold), 242 
Nuclear membrane, 704, 705F, 706F 
Nuclear pore complex(es), 358, 705 
mechanism of action, 706-707 
mRNA transport, 329F, 358, 359F, 487 
Ran GTPase, 708-709 
size exclusion, 705, 706F 
structure, 358, 359F, 705, 706F 
visualization, 706-707, 707F 
Nuclear pores, 200, 201 F, 609F 

see also Nuclear pore complex(es) 
Nuclear receptor superfamiiy, 889-890, 891 F, 892F 
Nuclear scaffold see Nuclear matrix 
Nuclear shuttling protein(s), 709-710 
Nuclear transplantation, 41 1, 41 3F 
Nuclear transport receptor(s), 708 
Nucleation, of cytoskeletal polymer, 973, 978F, 992 

see also Actin polymerization; Microtubule(s) 
Nucleic acid(s), 117F 

biosynthesis, 85, 85F, 86, 87F 

see also DNA synthesis; RNA,synthesis 
chemistry, 116-117F 
DNA see DNA (deoxyribonucleic acid) 
hybridization 

hybridization conditions, 535-536, 537F 
Northern blotting, 538-539, 539F 
recombinant DNA technology, 532, 537F 
Southern blotting, 539-540, 539F 
see also DNA hybridization 
nucleotide subunits, 61-62 
origin of life, 401 -402 

see also Origin of life; RNA world hypothesis 
properties, 401 

base-pairing see Base-pairing 
RNA see RNA (ribonucleic acid) 
structure, 1 1 7F 
Nucleocapsid, viral, 1497F 
Nucleofilaments, RecA protein-DNA interactions, 

307F 
Nucleoids, 859 
Nucleolus, 362-363, 362F, 363F, 364F 

ribosome synthesis see Ribosome(s) 
Nucleoporin(s), 705 

Nucleoside triphosphates, RNA polymerization, 334 
Nucleosome(s), 211-213,211-218 
30-nmfiber,211F,212F 

see also Chromatin 
chromatin remodeling, 215-216, 432, 433F 
core particle, 212-214, 21 2F 

see also Histone(s) 
DNA interactions, DNA bending, 21 4F 
DNA replication, 280, 289-290 
dynamic structure, 21 5-21 6, 21 5F 
histoneHI binding, 218, 218F 
historical aspects, 220 
linker DNA, 211 



nonhistone proteins, 21 1 
protein binding, 215 
structural organization, 21 1, 21 IF, 21 2F 
structure determination, X-ray diffraction, 21 2F 
transcriptional effects, 344 
see also Chromatin 
Nucleosome sliding, 215-216, 215F 
Nucleotide(s), 3, 3, 3F, 61 -62, 116-11 7FF 
biosynthesis, 101 
damage, 296, 296F, 297F 
energy carriers, 61 

see also ATP (adenosine triphosphate) 
hydrolysis 

actin filament assembly, 974, 1002 
cofilin, 1002 
FtsZ protein, 989 

microtubule assembly, 974, 979, 979F 
motor protein force generation, 1016-1019 
treadmilling, 976, 977F 
see also ATP hydrolysis 
nomenclature, 11 7F 
nucleic acid subunits, 61 , 62, 1 97, 332 
see also DNA (deoxyribonucleic acid); RNA 
(ribonucleic acid) 
structure, 61, 61 F, 116F, 197 
intRNA,370F 

see also individual nucleotides 
Nucleotide excision repair (NER), 295T, 298, 299F 
Nucleus 

compartments, 241-243, 241 F, 242F 
DNA localization, 200-201 
see also Chromosome(s); DNA 
(deoxyribonucleic acid) 
membrane amounts, 697T 
proteins, 358-359 
structure, 200-201, 201 F 
subnuclear bodies, 363-366, 365F 
Cajal bodies, 363-364, 364, 365F 
functions, 364-366 
GEMS, 363-364, 364, 365F 
nucleolus see Nucleolus 
speckles, 363-364, 365F 
visualization, 365F 
transport into/out of see Nuclear-cytoplasmic 

transport 
volume, 697T 

see also entries beginning nucleo-Znuclear 
Nucleus, egg (ovum), 1 287 
Null mutation(s), 528F 
Numb protein, 1 358, 1 359F 
Numerical aperture, microscope lens, 581, 582F 
Nurse cells, 1290, 1290F 
Nutrient absorption, liver, 1442 
N-WASp, pathogen movement, 1516, 1516F 
Nylon membranes, Northern blotting, 538-539, 
539F 



Obesity, 1215, 1475 

Obligate pathogens, 1490-1491 

Occludin, 1153 

Occluding junction(s), 1 1 32, 1 1 32F, 1 1 33T 

see also specific types 
Octl Pou domain protein, 41 8T, 425F 
Octylglucoside, membrane protein solubility, 637 
Ocular dominance columns, 1395F 
Odorant receptor genes, 1430 
Odorant receptor proteins, in axon guidance, 1430 
Okazaki fragments, 267-268, 269F, 272, 281 F 
Oleic acid, 11 4F 
Olfactory bulb, neural stem cells and neuronal 

turnover, 1430 
Olfactory sensory (receptor) neurons, 917, 

1429-1430, 1429F 
Oligoclonal response, 1545 

see also Immune response/system,adaptive 
Oligodendrocyte(s), 504F, 678 
Oligonucleotides, affinity chromatography, 428, 

429F 
Oligosaccharide(s), 55-56, 11 3F 

complex, 113F 

ER processing, 775F 

Golgi apparatus processing, 773-775, 773F, 
775F 

sialic acid addition, 774F 

side chain diversity, 636 
Oligosaccharyl transferase, 737, 737F, 747F 
O-linked oligosaccharide(s), protein glycosylation, 



738, 776, 776F 
OMP decarboxylase, reaction rates, 161 F 
Oncogene(s), 1107, 1231 

activation, 1107, 1231-1234, 1238-1239, 
1241-1242 

identification, 1232 

new methods of identification, 1 239 

overexpression, 1237-1239 

viral, 1232 

see also Cancer-critical genes; individual 
genes/proteins 
Oncogenesis, retroviruses, 1232 
Oocytes, 1 288, 1291F 

asymmetrical division, 1289, 1289F 

development (oogenesis), 1 288-1 290, 1 289F, 
1291F 

Drosophila, oogonium, 1288, 1290, 1290F 

follicle/accessory cells and, 1290-1292, 1290F, 
1291F 

gene expression, 1290 

lampbrush chromosomes, 1288 

maturation, 1288-1289, 1291-1292, 1291F 

meiosis, 1280, 1281, 1288, 1290, 1292 

primary, 1288, 1289F, 1291F, 1292 

secondary, 1289, 1289F, 1292 

size acquisition mechanisms, 1290-1291, 1290F 
Oogenesis, 1288-1290, 1289F, 1291F 
Oogonia, 1288, 1290, 1290F 
Opa membrane protein family, Neisseria, 1520 
Open reading frames (ORFs), 490, 551 
Operator(s), 433F, 434, 434F 

lambda, 457-458 
Operons,433F, 462-463 

see also Lac operon (Escherichia coii) 
Opportunistic pathogens, 1490 
Opsin, 1433 

Opsonization, 1555, 1555F, 1557T 
Optical diffraction, 581 
Optical sectioning, 590-592, 590F, 592F 
Optical tweezers, 527 
Optic nerve, regeneration, in frog, 1391 
Optic tectum, 1391, 1391 F 
Optional introns, 863 

ORC see Origin recognition complex (ORC) 
Organelle(s), 695-697 

analysis methods, 509-512 

chloroplasts see Chloroplast(s) 

cytokinesis and, 1098 

development, 702, 704 

energy converting, 81 3 

see also Chloroplast(s); Mitochondria 

evolution, 697-699, 700F 

inheritance, 864-865 

membranes, 697T 

mitochondria see Mitochondria 

phosphoinositides, 757 

position, 697 

topological relationships, 700F 

volume, 696, 697T 

see also specific organelles 
Organic chemistry, 45 
Organic molecules (small, cellular), 55 

amino acids see Amino acid(s) 

as building blocks, 56F 

diffusion rate in cell, 74-75 

fatty acids see Fatty acids 

metabolism, 55 

nucleotides see Nucleotide(s) 

photosynthetic synthesis, 68-70 

sugars see Sugars 
Organism growth 

cell growth and, 1102,1108 

inhibitory factors, 1110, 111 1F 

total cell mass control, 1 1 02, 1 1 11 -1 1 1 2 

see also Development; Multicellular^ 
Organizer (Spemann's Organizer), 1 368, 1 370 
Organ of Corti, 1430, 143 IF 
Organogenesis, 1347-1363 

see also Development; specific organs 
Organotrophs, 12 
Origin of life, 400-408 

DNA as hereditary material, 408 

electron transport chain evolution, 871-872, 
872-875 

lipid bilayers, 405-406 

pre-RNA world, 402-403, 402F 

protein synthesis, 407-408 

RNA world see RNA world hypothesis 

self-replicating molecules 

autocatalysis requirement, 401 
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natural selection, 404-406 
polynucleotides, 401-402 
polypeptides, 401 
see also Catalytic RNA 

time line, major events, 874F 

see also Evolution 
Origin of replication see Replication origin(s) 
Origin recognition complex (ORC), 287, 288, 288F, 

289F, 1068 
Orphan receptors, nuclear receptor superfamily, 

889 
Orthologous genes, 20-21, 20F, 21 F 
Oscillation 

circadian see Circadian clocks 

of gene expression in somitogenesis, 1371-1 373, 
1372F, 1373F 
Osmium, electron microscopy stains, 606 
Osmium tetroxide, EM sample preparation, 605, 

605F 
Osmolarity regulation, 663, 664, 664F, 664FF 
Ossification, process, 1470-1472, 1471 F 
Osteoblast(s), 1458, 1469-1470, 1470F 

ECM production, 1179 

lineage and formation, 1467F 

precursor cells, 1470F 

remodeling of bone, 1 473F 
Osteoclast(s), 1470, 1472, 1472F 

bone erosion, 1471 

cartilage erosion, 1472-1473, 1472F 

cell crawling, 1036 

remodeling of bone, 1473F 
Osteocytes, 1470, 1470F, 1472F 
Osteogenesis imperfecta, 1 187 
Osteogenic cells, 1470F 
Osteoid, 1469-1470, 1470F 
Osteopetrosis, 1474 
Osteoporosis, 1474 

Ouabain, sodium-potassium pump effects, 663 
Ovaries 

accessory cells, 1290-1292, 1290F, 1291F 

cancer, DNA repair defects, 295T 

development, 1285, 1288 

gap junctions, 1161 
Ovulation, meiosis completion after, 1280, 1290, 

1292 
Ovum see Egg (ovum) 
OXA complex, 715,718, 71 8F 
Oxaloacetate, 98, 98F, 99, 122F, 123F 
Oxidation reactions, 71, 71 F 

biological vs combustion, 820F 

p oxidation, peroxisomes, 722 

cellular energy production, 70, 88-103 
see also Respiration 

electron transfers, 72, 828 

peroxisomes, 722 
Oxidation-reduction (redox) reactions, 828-829, 
831 

see also Redox potential(s) 
Oxidative damage, DNA, 296, 296F 
Oxidative metabolism 

energetics, 70 

glycolysis, 91, 92F 
see also Glycolysis 
Oxidative phosphorylation, 81 9, 81 9F, 820F 

ATP synthesis, 98, 100F, 821-822, 822F, 823 

electron transport see Mitochondrial electron 
transport chain 

evolution, 870-872, 871 F 
Oxygen 

atmospheric, 873-874, 874F 

electron affinity, 832-833 

evolutionary significance, 873-875, 874F 

free radical generation, 833, 874 

reduction by cytochrome oxidase, 832-834, 
833F 

requirement for catabolism, 96-97, 81 7-81 8 
see also Aerobic metabolism 

toxic compounds, 1532 

transport molecules, 256-257 
Oxygen-carrying molecules, 256-257 
Oxytocin, 1428 



P 2 , cells in C elegans embryo, 1 324-1 325, 1 324FF 
p16 (INK4) CKI protein, in cancer, 1243 
p21 gene/protein, 1105, 1106F, 1246 
p53 gene/protein 

apoptosis, 1123, 1127 



cell-cycle arrest in response to telomere 
shortening, 1217 

DNA binding, 41 8T, 423, 424F 

DNA damage checkpoint, 1 1 05, 1 1 06F 

Mdm2 protein interactions, 1 105 

modes of action, 1246, 1246F 

modifications, 187F 

mutant forms overexpressed, 1246 

tumor suppressor, mutated in cancer, 1 106, 1 127 
colorectal cancer, 1251-1252, 1252T 
deficiency targeted in cancer therapy, 1257 
inactivation by DNA tumor virus proteins, 
1517-1518 
Pachytene, 1275 

PAGE see Polyacrylamide gel electrophoresis (PAGE) 
Pair-rule genes, 1333, 1337-1338, 1337F 
Pancreas, 1444 
Pancreatic exocrine cell(s), organelles, membrane 

amounts, 697T 
Pandemics, influenza virus, 1521, 1522F 
Paneth cells, 1437-1438, 1437F, 1440, 1526F 
Papanicolaou technique, Cervical cancer, 1212 
Papillomas, 1226-1227 
Papillomaviruses, 1228, 1228F 

cancer role, 1228T, 1248 

oncogenes (E6 and E7), 1248, 1249F 

subversion of cell-cycle control, 1248-1249 
PAPS see 3'-Phosphoadenosine-5'-phosphosulfate 

(PAPS) 
Pap smear, 1212 
Part gene/protein, epithelial apico-basal polarity, 

1156-1157, 11 57F 
Par3/Par6/aPKC complex, neuroblast cell division, 

1361 
Par4 gene/protein, epithelial apico-basal polarity, 

1155-1156 
Par6 gene/protein, epithelial apico-basal polarity, 

1156-1157, 11 57F 
Paracellular transport, tight junction(s), 1 1 52 
Paracortex, lymph node, 1550, 155 IF 
Paracrine signaling, 881, 882F, 883 
Paralogous genes, 20-21, 20F, 21 F 
Paramecium, in tree of life, 16F 
Parasegments, Drosophila, 1330 
Parasites 

intracellular, host cell invasion, 1508-1511 

parasitic worms, 1489 

protozoan, 1494-1495 
Parathyroid hormone, 907T 
Par genes/proteins, epithelial apico-basal polarity, 
1155-1157, 1156F, 1157F 

see also specific genes/proteins 
ParM protein, 990, 990F 
Parthenogenesis, 1287 
Parvoviruses, 1496, 1496F 
Passive transport, 653, 654F 
Patch-clamp recording(s), gated channels studies, 

680-682, 681 F 
Patched protein, 950, 952 F 
Patch electrodes, 527 
Paternity testing, 547F 
Pathogen(s), 1485-1537, 1539, 1539 

colonization of host, 1 501 -1 502 

drug resistant, 1521-1524 

evasion of host defenses, 1 502-1 504 

evolution (rapid), 1520-1521 

facultative, 1490 

host behavior modification, 1518 

host cell membranes and, 1511-1514, 1511F 

host cytoskeleton and, 1514-1517 

host interactions, 1486-1487 

host membrane traffic changes, 1511-1514 

host response, signs/symptoms, 1487-1488 

human evolution, effects on, 1487 

immune response to see Immune 
response/system 

infections see Infection(s) 

infectious process, 1501-1502 

intracellular see Intracellular pathogens 

nonpathogens vs., 1492F 

obligate, bacterial, 1490-1491 

opportunistic, 1490 

phagocytosis, 1531-1532 

phylogenetic diversity, 1488-1489 

primary, 1486-1487 

recognition by host cells, 1526-1528 

requirements to multiply in host, 1487 

spread, host behavior modification, 1518 

tissue damage, 1488 

see also Bacteria; Fungi; Parasites; Protozoa; 



Virus(es) 
Pathogen-associated immunostimulants, 1526, 

1527,1539 
Pathogenesis, definition, 1489-1490 
Pathogenicity islands, 1491, 1491 F 
Pathway guidance, of migrating cell, 1045, 1 140 

neuronal, 1 1 40, 1 1 47, 1 1 77, 1 1 88F, 1 385-1 389 
Pattern recognition receptors, 1527, 1541-1542 

dendritic cells, 1536,1571 

pathogen recognition by phagocytes, 1531 

Toll-like proteins, 1530-1531 
Pax3, 1464 

Pax6 (and homologs), eye development, 466, 466F 
Pax7, 1464 

P-cadherin, 1136, 1138T 
PCNA protein, 275F 

PCR see Polymerase chain reaction (PCR) 
PDGF see Platelet-derived growth factor (PDGF) 
PDI see Protein disulfide isomerase (PDI) 
PDZ domains, scaffold proteins, 1 148 
Pectins, plant cell wall, 1 1 98, 1 1 98F, 1 1 99T 
P elements, 318T, 480, 556 
Pemphigus, 1144 
Penicillin, 1521-1522 
Pentoses, 11 2F 
Peptide-binding groove, MHC protein, 1 577, 1 578F, 

1579F 
Peptide bonds, 59, 60F 

bond angles, steric limitations, 127F 

formation, 128F, 407 

hydrolysis, 161 
Peptide fragments, immune response, 1581-1585 
Peptide-MHC complex, 1577-1579, 1578F, 1579F, 
1580F 

see also MHC {major histocompatibility complex) 
Peptide nucleic acid (PNA), 402, 402F 
Peptide transporters, class I MHC proteins loading, 

1582, 1582F 
Peptidoglycan(s), bacterial cell wall, 1490F, 1527 
Peptidyl transferase, 376, 379, 379F, 407 
Peptidyl-tRNA-ribosome binding, 375, 375F 
Per clock gene, 462F 
Perforin, 1573, 1573F 
Pericentriolar matrix, 1076 
Pericytes, 1446F, 1450 
Perinuclear space, 705F 
Periodic table, 49f 

Peripheral lymphoid organ(s), 1 541 F, 1 543, 1 543F 
Peripheral tolerance, 1587-1588 
Periplasmic space, 1490F 
Peritonitis, 1486 

Perlecan, 1 165, 1 166, 1 167F, 1 184T 
Permeases see Carrier protein(s) 
Peroxin(s), peroxisome protein import, 722 
Peroxisome(s) 

electron micrograph, 721 F, 723F 

enzymes, 721 

functions, 696, 721-722 

hydrogen peroxide, 721 

membrane amounts, 697T 

P oxidation, 722 

oxidative reactions, 721-722 

phospholipid import from ER, 745 

plasmalogens, 721 

production, 723F 

protein import, 721-723 

structure, 721 

volume, 697T 

see also Glyoxysome(s) 
Per protein, 462F 
Pertussis, 1503 

Pertussis toxin, 906-907, 1 503 
Petite mutants, 866-867 
Peutz-Jeghers syndrome, 1 156 
PEV (position effect variegation), 221, 221 F 
Pgranules, 1323, 1323F 
pH,52, 109F 

gradient, 820-821, 821 F 

intracellular, 596 

see also Acid(s); Base(s) (chemical) 
Phagocytic cells, 1531-1534 

see a/50 Macrophage(s); Neutrophil(s) 
Phagocytosis, 26, 27F, 28F, 784F, 787-789, 1115, 
1531F 

antibody-activated, 1555, 1555F 

bacterial entry into host cells, 1507-1508 

complement role, 788, 1528-1529 

electron micrographs, 788F 

induction by bacteria, 1508, 1509F 

macrophages, apoptosis, 1117 
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pathogens, 1531-1532 

process, 1531-1532 

resistance mechanisms, 1531-1532 

respiratory burst, 1532 
Phagolysosome, 1511-1512, 1511F, 1513, 

1531-1532 
Phagosome(s), 782-783, 787-788, 1452, 1531-1532 
Phalloidin,987,988T 
Phase, 581,583 

Phase-contrast microscopy, 504F, 583 
Phase transition, membranes, 623 
Phase variation, DNA rearrangement, 454-455, 
455F 

see also Site-specific recombination 
Phenobarbital, effect on liver, 1444 
Phenol extractions, nucleic acids, 538 
Phenotype, 553, 554F 

analysis, 556-557 

see also Genetic screens 

heritability see Genomic imprinting 
Phenylalanine, structure, 129F 
Pheromones, 917 
Philadelphia chromosome, in chronic myelogenous 

leukemia, 1208, 1208F, 1261, 1 261 F 
Phorbol esters, as tumor promoters, 1226 
Phosphatase(s) see Protein phosphatase(s) 
Phosphate bonds, 107F 

bond energies, 93F 

phosphoanhydride bonds, 61, 93F 

phosphodiester, 61-62, 93F 
Phosphate groups, 107F, 175 
Phosphate transfer reactions, 80, 84F 

dephosphorylation, 176, 176F 

glycolysis, 91, 92F 

phosphorylation see Phosphorylation 

see also ATP (adenosine triphosphate); Protein 
kinase(s); Protein phosphatase(s) 
Phosphatidylcholine, 618, 61 9F, 743, 743F 
Phosphatidyiethanolamine, 618, 61 9F, 743 
Phosphatidylinositol (PI), 627, 743, 796-797, 909F, 

933 
Phosphatidylinositol 3, 4-bisphosphate [Pl(3, 4)P 2 ], 

1010 
Phosphatidylinositol 3'-kinase (PI 3-kinase), 
932-935 

actin organization, 1043F, 1046 

extracellular signaling, 627, 627F 

growth factor signaling, 1 1 08, 1 1 09F 
Phosphatidylinositol 4, 5-bisphosphate [Pl(4, 5)P 2 ], 

909, 909, 91 OF, 91 IFF 
Phosphatidylinositol 4-phosphate [PI(4)P], 91 OF 
Phosphatidylinositol-specific phospholipase C, 630 
Phosphatidylserine, 61 8, 61 9F, 626, 788, 1117 
3'-Phosphoadenosine-5'-phosphosulfate (PAPS), 

776, 776F 
Phosphoanhydride bonds, 61, 93F 
Phosphodiesterase, base excision repair (BER), 299F 
Phosphodiester bonds, 61 -62, 93F, 1 99F 
Phosphoenolpyruvate, 1 21 F 
Phosphofructokinase, 120F 
Phosphoglucose isomerase, 120F 
2-Phosphoglycerate, 1 21 F 
3-Phosphoglycerate, 91, 92F, 121 F 
Phosphoglycerate kinase, 91, 1 21 F 
Phosphoglycerate mutase, 1 21 F 
Phosphoglyceride(s), 618, 618-620, 61 9F 

see also individual molecules 
Phosphoinositide-dependent protein kinase I, 934 
Phosphoinositide kinases, 757, 757F 
Phosphoinositides see Inositol phospholipid(s) 
Phosphoinositol (PI), 757, 757F 
Phosphoinositol kinases, 757, 757F 
Phospholipase(s), 627 

see also specific phospholipases 
Phospholipase C (PLC), 909, 91 OF, 1 299 
Phospholipid(s), 9,618 

aggregates, 115F 

amphipathic molecules, 58-59 

bilayers, 59F, 405-406, 407F 
see also Lipid bilayer(s) 

biosynthesis, 58, 867-868 

cell macromolecules, 62F 

lipid droplets, 625-626 

membrane composition, 61 8-620 

properties, 620 

structure, 58-59, 59F, 1 14F, 618 

synthesis, ER, 743-745 

see also specific molecules 
Phospholipid exchange proteins, endoplasmic 
reticulum (ER), 745 



Phospholipid flippase(s), lipid bilayer assembly, 

744, 744F 
Phospholipid scramblase, 627, 744-745, 744F 
Phospholipid transfer proteins, 745 
Phospholipid translocator(s) 
apoptosis, 627 
flippases, 744, 744F 
lipid bilayer assembly, 622, 743-745 
scramblases, 627, 744-745, 744F 
Phosphorus-32, DNA label, 534 
Phosphorylation 
ATP-mediated, 84F 
cascades, 895 
GTP-binding vs., 180F 
histone tails, 222, 223F 
noncyclic photophosphorylation see Noncyclic 

photophosphorylation 
oxidative see Oxidative phosphorylation 
protein see Protein phosphorylation 
Phosphotyrosine-binding domain (PTB domain, 

PTB site), 898, 899F 
Photoactivation, 594-595, 595F, 596F 
Photochemical reaction centers, 848, 849F, 854F 
bacterial, 850, 850F, 853-854 
electron transfer, 849 
photosystem I, 854F 
Photophosphorylation, 850-853, 852FF 

see also Photosystem(s) 
Photoprotein, 960-961 

Photoreceptors (photoreceptor cells), 91 8F, 1429, 
1432-1433, 1433F 
see also Cones; Rod photoreceptors (rods) 
Photorespiration, 846-847 
see also Carbon fixation 
Photosynthesis, 69-70, 840-855, 844F, 849-850, 
850F 
bacterial, 840, 863, 872-875 

see also Cyanobacteria; Purple bacteria 
carbon fixation (dark reactions) see Carbon 

fixation 
chlorophyll photochemistry, 847-848, 847F, 848F 

see also Chlorophyll(s); Photosystem(s) 
chloroplast(s), 843-848, 844F 
cyclic photophosphorylation, 853 
electron transport see Photosynthetic 

electron transport chain(s) 
noncyclic photophosphorylation, 850-853, 

852FF 
proton-motive force, 853-854 
cyclic photophosphorylation, 853 
electron-transfer (light reactions) see 

Photosynthetic electron transport 
chain(s) 
energetics, 68-70 
equation, 70 
evolution, 840-841, 853 

bacterial, 872-875 
noncyclic photophosphorylation see Noncyclic 

photophosphorylation 
photosystems see Photosystem(s) 
relation to respiration, 70, 70F 
stages, 69, 70F 
see also specific components 
Photosynthetic electron transport chain(s), 827F, 
843-844, 844F, 849-850, 850F, 852F, 
873F 
across thylakoid membrane, 852F 

see a/so Thylakoid(s) 
electron carriers, 850-853, 852F 
evolution, 872-875 
mitochondrial electron transport vs., 81 5F, 

853-854, 855F 
reaction centers, 848, 849-850, 850F 
redox potentials, 852F, 853 
water-splitting enzyme, 851, 852F, 873 
Z scheme, 852 

see also Carbon fixation; Chloroplast(s) 
Photosynthetic organisms, 840 
bacteria, 840, 863 
evolution, 872-875 
see also Cyanobacteria; Purple bacteria 
evolution, 840-841, 872-875 
plants see Plant(s) 
Photosynthetic reaction centers, 848, 849F, 854F 
bacterial, 642, 642F 
electron transfer, 849-850, 850F 
photosystem II complex, 642 
redox potentials, 852F 
Photosystem(s), 848 

antenna complex, 848, 848F 



bacterial, 850F, 853, 854F 

electron transfer see Photosynthetic electron 

transport chain(s) 
evolution, 853 

light-harvesting complexes, 848, 848F 
photosystem I, 851-852, 852F, 853, 854F 
photosystem II, 851, 851 F, 852F, 853, 854F 
reaction centers see Photosynthetic reaction 

centers 
redox potentials, 852F, 853 
structural relationships, 853 
see also Chlorophyll(s); Cytochrome(s) 
Phototrophs, 12 
Phototropin, 961 
Phototropism, 961 
Phragmoplast, cytokinesis in higher plants, 

1097-1098 
Phylogeneticfootprinting, 431, 431 F 
Phylogenetictree(s), 16F, 247, 247F, 248 
Physical gene mapping, 560, 560F 
Phytochrome, 960-961, 961 F 
PI see Phosphatidylinositol (PI) 
PI 3-kinase see Phosphatidylinositol 3'-kinase 

(PI 3-kinase) 
Pigment cells, fish, 1023-1024, 1024F 
Pili, 1490F 

Pilin protein, Neisseria, 1 520 
Pineal gland, circadian clock, 461 
Pinocytosis, 787, 789-790, 789F 
Pit fields, 1162 

PKC see Protein kinase C (PKC) 
Placenta, 1379 
Placodes, 1384F 
Plague, bubonic, 1502, 1502F 
Plakoglobin (y-catenin), cadherin binding, 1 142 
Planar cell polarity, 1 1 57-1 1 58, 1 1 57F, 1 358, 1 359F, 

1360F 
Plant(s) 

comparative anatomy, C 3 vs C 4 plants, 846-847, 

847F 
development see Plant development; Plant 

development and growth 
evolution, 36, 840-841 
extracellular matrix, 1 180, 1 195 
flowering, recent evolutionary origin, 36 
as model organisms see Arabidopsis thaliana 

(wall cress) 
plasmodesmata, 1 1 58, 1 1 62-1 1 63, 1 1 63F 
transgenic, 568-569, 569F, 570F 
Plant cell(s), 846 

bundle-sheath cells, 846 

carbon fixation see Carbon fixation; 

Photosynthesis 
cell differentiation, genome preservation, 41 1, 

413F 
eel walls see Plant cell wall 
centrosome lack, 1081 
culture, 504-505, 568 
cytokinesis in, 1097-1098, 1097F, 1098F 
development see Plant development and growth 
division, 1195 
evolutionary origin, 29 
mesophyll cells, 846 
microtubules, 1097-1098, 1200-1202, 1200F, 

1201F 
photosynthetic organelles see Chloroplast(s) 
plasticity, 5568 
plastids, 841 -842, 841 F 
totipotency, 568 
transformation, 570F 
turgor pressure, 1197 
vacuoles see Plant vacuoles 
Plant cell wall, 1195-1202 
composition/structure, 1199F 
cell type and, 1195-1197 
cellulose, 1180, 1196, 1197-1198, 11 97F, 

1198F 
cross-linking glycans, 1196, 1198, 1198F, 

1199T 
lignin, 1195, 1196, 11 99T 
pectins, 1198, 1198F, 1199T 
proteins, 1198, 1199T 
development and growth, 1 1 95-1 1 97, 1 1 96F 
actin filaments and, 1 202 
microtubules and, 1 200-1 202, 1 200F 
orientation of deposition, 11 99-1 200 
primary cefl walls, 1195, 1197-1198, 1198F 
secondary cell walls, 1 195, 1 196 
functional roles and specialization, 1 1 96-1 1 97, 
1197F 
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compression resistance, 1 196 
tensile strength, 1196, 1197, 1199 

plasmodesmata, 1 1 62-1 1 63, 1 1 63F 
Plant development and growth, 1398-1415, 
1401 F 

Arabidopsis see Arabidopsis thaliana (wall cress) 

cell signaling, 1409-1410 

cell wall formation, 1 196F see under Plant cell 
wall 

egg (zygote), asymmetric division, 1400 

embryogenesis, 1400, 1401F, 1402F 

environmental influences, 1403 

flower development, 1413-1414, 1413F, 1414F 

hormonal signals, long-range, 957, 1403, 1406, 
1406F 

meristems see Meristem(s) 

microtubules and, 1200-1202, 1201 F 

plant module growth from primordia, 
1407-1408 

regulatory mutations in maize, 1410-1412, 
1411FF 

repetitive patterning, 1408F 

sequential, by meristems, 1403 

sexual reproduction, haploid cells, 1270 

shaping and by oriented cell division, 1406 

signals for seedlings, 1399F, 1401, 1403 

turgor pressure and, 1197, 1199 
Plant hormones (growth regulators), 957, 1403, 

1406, 1406F 
Plant modules, 1407-1408 
Plant vacuoles, 781-782, 781 F, 782F 
Plant viruses, plasmodesmata and, 1 163 
Plasma cells (effector B cells), 1543, 1544F, 1547, 
1552 

see also Bcell(s) 
Plasmalogen(s),721,722F 
Plasma membrane(s), 9, 61 7 

actin filament contact, 1009-1010, 101 OF 

actin filament nucleation, 996-998 

domains, tight junctions, 1 151 

enlargement, regulated secretory pathway 
(exocytosis), 805, 806F 

lipids see Lipid bilayer(s) 

origin, 405-406, 407F,408F 

proteoglycans, 1 183-1 184 

protrusion driven by actin polymerization, 
1037-1039 

universality, 9-10 

see also Membrane(s) 
Plasmid(s), 21, 540-541, 541FF, 1491-1492, 1491F, 

1492F 
Plasmin, 1194 
Plasminogen, 1 194 
Plasminogen activators, 1 1 94 
Plasmodesmata, 1 1 58, 1 1 62-1 1 63, 1 1 63F 
Plasmodium falciparum, 1 494-1 495 

antigenic variation, 1519 

life cycle, 1495F, 1502 

mitochondrial DNA, 859 

time-dependent transcription, 1495F 

transmission and vector, 1494-1495, 1501-1502 

see also Malaria 
Plastid(s), 696, 841-842 

amyloplasts, 841, 841 F 

chloroplasts see Chloroplast(s) 

development, 698, 699F, 841 

diversity, 841 F 

etioplasts, 841 

evolution, 699 

genomes, 855-856 

growth and division, 857-858 

leucoplasts, 841 
Plastocyanin, 852F 
Plastoquinone, 852F 
Platelet(s), 1102, 11 02F, 1174 
Platelet-derived growth factor (PDGF), 923T, 925F, 

1102,1441, 1450 
Pleckstrin homology (PH) domain, 933 
Plectin,995F, 1005, 1006F 
Ploidy variation, 38-39, 39F, 255, 1 1 1 1, 11 1 1 F, 

1112F 
Pluripotent hemopoietic stem cells, lymphocyte 

origin, 1543 
Plus end tracking proteins (+TIPs), microtubule 

formation, 995F, 1004, 1004F 
PNA (peptide nucleic acid), 402, 402F 
Pneumocystis carinii, antigenic variation, 1519 
Point mutations, 246 
Point spread function, 590 
Polar body, 1289, 1289F 



Polar covalent bonds, 50, 51 F 
Polar ejection force, 1 085, 1 086F 
Polarity/polarization, 1042 

cytoskeleton and, 969, 974-975, 1044, 1046 

developmental, 1364 

see also under Drosophila development 

DNA strands, 3F 

epithelia 

apico-basal polarity mechanisms, 1 1 55-1 1 57, 

1155F, 1156F 
epiethial cells, 798, 799F, 806F 
planar cell polarity mechanisms, 1 1 57-1 1 58, 
1157F 

exocytosis control, 805-806 

kinetochore, 1075-1076, 1076F 

mitotic spindle, 1077-1078, 1081-1082 

neutrophil(s), 1045F, 1046F 

Saccharomyces cerevisiae, 1 044 

Tcell(s), 1047, 1573 

tissues, 1155 

Yeast budding, 880, 1044, 1045F 
Polar molecules, 52-53 
Poleward flux, 1035, 1035F 
Poliomyelitis, eradication by vaccination, 1 498F 
Poliovirus, 1496F, 1513-1514, 1515F 

capsid, 149F 

entry and uncoating, 1506F, 1507 

host cell transcription shut-off, 1517 

structure, 1489F 

TFIID cleavage, 1517 
Pollen, 592F, 1269 
Polo-like kinases, 1074 

Pol proteins, translational frameshifting, 383, 384F 
Poly-A-binding protein, 357F 
Polyacrylamide, 517 
Polyacrylamidegel electrophoresis (PAGE), 517 

2D-PAGE,414F, 521-522 

DNA sequencing gels, 534, 535F 

gel-mobility shift assay, 426-427, 428F 

SDS-PAGE,517,518F,637 

staining, 517, 518F 
Polyadenylation (mRNA 3' end), 346, 357-358, 357F 

CPSF protein, 357-358, 357F 

CstF protein, 357-358, 357F 

cytoplasmic, 493 

deadenylation-mediated mRNA decay, 492, 492F 

messenger RNA 3' end, 357-358, 357F 

nonretroviral transposons, 318T 

nonsense-mediated mRNA decay, 386 

poly-A-binding protein, 357F 

poly-A polymerase, 357F, 358 

post-transcriptional gene regulation, 370-371, 
372F 
Poly-A polymerase, 357F, 358 
Poly-A tails see Polyadenylation (mRNA 3' end) 
Polycistronic mRNA, 380 
Polyclonal response, 1545 

see also Immune response/system; 
Immunoglobulin(s) 
Polycomb group, 1344, 1345F 
Polygenic traits, 563 
Polyisoprenoid(s), 1 1 5F 
Polymerase chain reaction (PCR), 544-546, 545F, 

546F, 547F 
Polymerases 

DNA see DNA polymerase(s); DNA replication 

RNA see RNA polymerase(s) 

thermophilic, 544 

see also DNA replication; specific enzymes 
Polymerization reactions 

actin see Actin polymerization 

DNA see DNA replication 

energetics, 84-87 

head polymerization, 86, 87F, 373 

tail polymerization, 86, 87F 

see also Macromolecule(s) 
Polymorphism, MHC proteins, 1575, 1588 
Polymorphonuclear leucocytes see Neutrophil(s) 
Polynucleotides 

DNA see DNA (deoxyribonucleic acid) 

origin of life, 401 

see also RNA world hypothesis 

properties, 401 

RNA see RNA (ribonucleic acid) 

see also Nucleic acid(s) 
Polypeptide backbone protein models, 132F 
Polypeptides see Protein (s) 
Polyprotein(s), 803 
Polyribosome(s), 381-382, 382F, 728, 730 

see also Ribosome(s) 



Polysaccharides, 56, 11 3F 

biosynthesis, 85, 85F 

cell macromolecules, 62F 

digestion, 88F, 164-165 
Polyspermy, 1300 
Polytene chromosome(s), 236-238 

chromosome puffs, 220-222 

Drosophila melanogaster, 38, 38F, 236-237, 
236-238, 237FF 
deconvolution microscopy, 590F 
gene expression, 220-222 
histone modifications, 238F 
insulator-binding, 413, 453F 
microscopy, 590F 
see a/so Chromosome structure 

RNA synthesis, 239F 

visualization, 236 

see also Chromosome puffs 
Polyubiquitylation, 795 
Polyvalent antigen(s), 1557 
Pop1 gene regulatory protein, 1325 
Pop (plenty of pharynx) mutants, 1 325 
Porin(s), 144F, 635, 667, 707, 707F, 717, 71 7F 
Porphyrin rings, heme groups, 830F, 831 
Porphyromonas gingivalis, chronic illness, 1500 
Positional information, imaginal discs, 1351 
Positional value, 1312-1313, 1392-1393 
Position effects, 221, 221 F 
Position effect variegation (PEV), 221, 221F 
Positive feedback control, 901 , 901 F 

cell asymmetry, 1314-1315 

cell memory, 458, 458F, 466 

conformational coupling, 171, 172F 

epigenetics, 471,472F 

transcription circuits, 459, 459F 
Positive selection, in thymus, 1 585-1 586, 1 586F, 

1587F 
Positive supercoiling, 344F, 345 
Post-capillary venule, 1549, 1549F, 1551F 
Postsynaptic potential (PSP), 689F 
Post-transcriptional regulation, 477-499, 477F 

alternative splicing, 479-480 

immunoglobulin genes, 370, 482-483 

m RNA polyadenylation, 370-371, 372F 

RNA localization control, 379F, 41 5 

RNA processing control, 379F, 41 5, 477-485 
see also Splicing 

RNA transport control, 379F, 41 5, 485-486 

translational see Translational control of gene 
expression 
Post-translational modification, 388F 

collagens, 1186, 1187F 

covalent, 388F 

see also Protein glycosylation; Protein 
phosphorylation 

elastin, 1190 

function regulation, 186-187, 186T 

noncovalent, 388F 

see also specific modifications 
Potassium channel(s) (K + channel) 

bacterial, structure, 671-673, 672F 

chloride channel vs., 674F 

conductance, 671 

gating, 672 

leak channels, 669 

selectivity filter, 673F 

voltage-gated see Voltage-gated potassium 
channel(s) 
Potassium leak channels (K + leak channels), 669 
POU-domain, structure, 425F 
Power stroke, of myosin, 1016 
Poxviruses, 1496, 1496F 
Pre-B cell, 1553 

Precursor oligosaccharide(s), 737, 737F, 747F 
Preimmune antibody repertoire, 1562 
Preinitiation complex, 1067 
Premature ageing, DNA repair defects, 295T 
Pre-mRNA modifications 

capping see 5' Capping of eucaryotic mRNA 

hnRNP packaging, 353F, 358 

polyadenylation see Polyadenylation (mRNA 
3' end) 

splicing see Splicing 
Pre-pro-protein(s), 803 

Prereplication complex (pre-RC), 1067-1068, 1068F 
Pre-replicative ORC complex, 288 
Pre-RNA world, origin of life, 402-403 
Presenilin, 947 
Presomitic mesoderm, 1371 
Prickle cells, 1419, 1419F 
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Primary antibodies, immunofluorescence 

microscopy, 588 
Primary antibody response, 1 546F 

see also Immune response/system,adaptive 
Primary cell walls, 1 195, 1 197-1 198, 1 1 98F 
Primary cilium, 899 
Primary cultures, 504 
Primary immune response, 1 546, 1 546 

see also Immune response/system, adaptive 
Primary lymphoid organ see Central lymphoid 

organ 
Primary oocytes, 1288, 1289F 
Primary pathogens, 1486-1487 
Primer strand, DNA synthesis, 267F, 268F 
Primordial follicle cells, 1291, 1291F 
Primordial germ cells (PGCs), 1282-1286 

migration into developing gonad, 1283, 1283F, 
1285,1288 

pluripotency, 1283 

specification by germ cell determinants, 1 282, 
1282F 

specification by neighboring cells in mammals, 
1282-1283 
Primosome, 276, 282, 283F 
Prion diseases, 397-398, 397F 
Prion proteins (PrP), 1489, 1498-1499 

conformational change, 397F, 398 

fungi, 398 

infectivity, 397-398 

misfolding/aggregation, 397-398 

neural degeneration due to, 1498, 1499F 

normal cellular PrP, 397-398 

positive cellular roles, 398 

replication, 1498 

structure and aggregation, 1 498-1 499 
Procaryotes, 14 

cells, 14-15, 14F, 15F 

diversity, 15 

DNA methylation see DNA methylation 

gene expression, 345F 

mRNA,346F,380,381F,488 
repressors see Repressor protein(s) 
transcription see under Transcription 
transcriptional control, eucaryotes vs., 
434-436 

protein synthesis, 345F, 380 

strand-directed mismatch repair, 277 

Translation initiation, 1527 

see also Archaea (archaebacteria); Bacteria 
Procaspase(s), 1 1 18, 1 1 19F, 1 120F, 1 121, 1573F, 

1574 
Processing bodies (P-bodies), 494, 495F 
Procollagen, 1186, 1187 
Profilin, 999-1000, 999F, 1000F 
Progenitor cells, 1386F, 1456-1458 

see also Stem cell(s) 
Programmed cell death see Apoptosis 
(programmed cell death) 
Proinsulin, 151.152F 
Projection, retinotectal, 1391 
Proline, 129F, 1187, 11 87F 
Prometaphase, 1072F, 1080, 1086 
Promiscuous recombination, 315-316 
Promoter elements, 336, 338-339 

asymmetry, 339, 339F 

bacterial, 37F, 336-337, 338-339, 338F 

eucaryotic, 440 

CG (CpG) islands, 434, 470-471 
consensus sequences, 339 
transcriptional integration, 449-450, 450F 
Promoters, inducible, 564 
Pronuclei fusion, 1 301 , 1 301 F, 1 302F 
Proofreading by DNA polymerase, 269-270, 270FF 

see also Mismatch repair 
Proopiomelanocortin, proteolytic processing, 803F 
Pro-peptide(s), 803 
Prophase 

meiosis, 1 091 -1 092, 1 274, 1 275-1 276, 1 276F 

mitosis, 1054, 1072F, 1078-1079, 1080 
see also Mitotic spindle 
Proplastids,698,841 
Prospero protein, 1361 
Protease(s), 1183, 1194, 11 94F 

see also Proteasome(s) 
Protease inhibitors, regulation, 1 183 
Proteasome(s), 391-394, 392F 

antigen processing/presentation, 1582, 1583F 

misfolded protein degradation, 740F 

processivity of proteolysis, 392, 392F 

protein quality control, 391-393, 767 



structure, 392-393, 392F 
see also Ubiquitin pathway 
Protein(s), 125-194 
abundance in cells, 62F 
aggregation see Protein aggregation 
allosteric, 171, 172-173 

see also Allosteric regulation 
analysis methods see Protein analysis 
assemblies see Protein complexes (assemblies) 
basic principles, 5 
binding site(s), 142, 153-157 

see also Active site; Protein complexes 
(assemblies) 
as catalysts, 5-6, 6F 

see also Enzyme(s) 
chemical synthesis, 548 
classification, 138-139 
conformational changes see Conformational 

changes 
crystals, 528, 528F 
degradation see Proteolysis 
denaturation, 130, 517 

DNA interaction see Protein-DNA interactions 
evolution see Protein evolution 
families see Protein families 
fibrous, 145-146 

see also specific types 
folding see Protein folding 
functions see Protein function 
gene regulation by see Gene regulatory 

protein(s) 
GFP tagging for visualization in living cells, 

592-593, 594F 
globular, 143, 145 

glycosylation see Protein glycosylation 
identification see Protein identification 
as machines see Protein machines 
macromolecularsubunits, 148 
modules, 136, 136F, 140-141, 140F 

phosphorylation, 175, 187 

see also Protein structure,domains 
motor see Motor protein(s) 
movements (shape changes), 179-181, 1 81 F 
nonprotein components, 166-167 
phosphorylation see Protein phosphorylation 
protein interaction see Protein-protein 

interactions 
quality control, 390-391, 391 F 
sequencing see under Protein analysis 
shape/size, 131 

see also Protein structure 
sorting see Protein sorting 
stability, 137, 147F 
structure see Protein structure 
subunits, 141 F, 142-143, 142F, 143F, 148 

see also Protein complexes; Protein-protein 
interactions 
synthesis see Protein synthesis 
translocation see Protein transport 
Protein aggregation 
CNS vulnerability, 397 
human disease, 397-398, 397F 
proteolytic resistance, 397 
state, epigenetics, 472F 
see also Prion diseases 
Protein analysis 

chromatography, 512-514, 513F, 514F, 515F 
dynamics, visualization by GFP tagging, 593, 

594F 
electrophoresis, 51 8FF 

2D-PAGE, 521-522, 521-522, 521 F, 522FF 

isoelectric focusing, 522F 

Western blotting, 518, 519F 
hydrodynamic measurements, 522-523 
mass spectrometry, 519-521, 520F 
protein-protein interactions, 523-524, 524F, 

525-526, 525F 
proteomics, 519 
sequencing 

homology searches, 139, 139F 

ligand-binding site identification, 155 

mass spectrometry, 519-521 

sequence alignment, 531, 531 F 

see also DNA sequencing 
structure determination, 139, 517-532 

NMR studies, 139, 529-530 

nuclear magnetic resonance, 139 

SDS-PAGE,517 

X-ray diffraction, 1 39, 527-529, 528F 
see also Microscopy; Protein identification; 



Protein purification 
Protein-coding RNA, transcription, RNA pol II, 340T 
Protein complexes (assemblies), 142-143, 145F 
assembly, 148-152 
assembly factors, 151 
dimers, 142, 142F 
phosphorylation effects, 175 
ribosomes, 149 
self-assembly, 149-151 
viruses, 148, 148F, 149 
cooperativity, 172-173, 173F 

see also Allosteric regulation 
eucaryotic gene-regulatory proteins complexes, 

447, 447FF 
protein-to-protein binding, 141F, 142-143, 143F 
see also Protein machines; Protein-protein 
interactions 
Protein databases, 139, 139F 
Protein disulfide isomerase (PDI), 736 
Protein-DNA interactions, 144F 
binding site prediction, 426 
common interactions, 426, 427 
DNA structure effects, 41 6-41 7 

see also DNA structure 
enhancesome, 447, 447F 
gene regulatory proteins, 41 6-41 7, 41 7F, 

41 8-41 9, 41 9F 
histones, 212-213 
Rad51 protein, 307F 
RecA protein, 307F 
replication initiation, 282 
RNA polymerases, structural changes and, 

336-337,338 
see also DNA-binding motifs; DNA-binding 
proteins 
Protein evolution, 136-142 

amino acid changes, 1 37, 264, 265 
domain fusion, 560F 

domain shuffling, 140-141, 140F, 257, 257F 
facilitated by introns and splicing, 257, 348 
modules, 140-141,257 

see also Immunoglobulin superfamily 
mutation rates, 264 
protein families, 137-139, 155-156 

see also Gene families 
sequence homology, 138, 138F, 139, 155-156 
unit evolutionary time, 265 
Protein expression vectors, 546, 548, 548FF 
Protein families, 137-139 

evolutionary tracing, 155-156, 155F 
sequence homology, 138, 138F, 139, 139F 
structural resemblance vs sequence 

resemblance, 138-139, 138F 
see also Gene families; specific types 
Protein folding, 126,387-390, 1498-1499 
abnormal, 391, 391 F, 397-398 

see also Prion proteins (PrP); Proteolysis 
amino acid side-chain role, 130, 130F 
binding site creation, 1 54F 
calreticulin and calnexin, 738-739 
chaperones, 388-390, 390F, 738-739, 739F, 767 

see also Chaperones 
constraints, 126 

co-translational, 387-388, 388F 
energetics, 130-131 
ER, 736 

ERp57 role, 739F 

glucosyl transferase role, 739, 739F 
misfolded proteins, 739-741, 740F 
molten globule, 387-388, 388F 
monitoring, 738-739 
AMinked glycosylation role, 737F, 747F 
noncovalent interactions, 64, 64F, 126-127, 130F 
pathways, 388F 
prion proteins, 1498-1499 

see also Prion proteins (PrP) 
quality control, 390-391, 391 F, 767-768 
refolding, 131, 131F, 391, 391 F 

see also Chaperones 
relation to synthesis, 387 
Protein function, 152-196 
enzymes see Enzyme(s) 
phosphorylation effects see Protein 

phosphorylation 
protein-ligand binding see Protein-ligand 

interactions 
structure-function relationship, 154-155 
Protein glycosylation, 388F, 736-738, 737F, 747F 

see also Endoplasmic reticulum (ER) 
Protein identification, 519-532 
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mass spectrometry, 519-521 

see also Protein analysis; Protein purification 
Protein interaction maps, 188-190, 189F, 190F 
Protein kinase(s), 176-178,895 

Cdk inactivation, 1063-1064, 1064F, 1066T 

evolution, 176, 177F 

regulation, "microchips," 1 77-1 78, 1 77F, 1 79F 

sequence homology, 176, 177F 

structure, 141, 176F 

tyrosine kinases see Tyrosine kinase(s) 

see also individual enzymes 
Protein kinase B (Akt; PKB), 934, 1244 
Protein kinase C (PKC), 185, 626, 91 1 
Protein-ligand interactions, 153 

amino acid side-chains, 154-155, 154F, 155F 

antigen-antibody see Antigen-antibody binding 

binding sites, 153, 153F, 154F, 155F 

binding strength, 157-158 

see also Equilibrium constant (K) 

cooperativity, 172-173, 173F 
see also Allosteric regulation 

enzyme-substrate see Enzyme-substrate 
interactions 

linkage, 171-172, 172FF 

noncovalent interactions, 153, 153F, 154, 154F, 
155F 

regulation, 169-170 

allostery see Allosteric regulation 

specificity, 153 

water exclusion, 154 

see also Protein-DNA interactions; 

Protein-protein interactions 
Protein machines, 184-186 

activation, 185-186 

gene minimization, 184-185 

interchangeable parts, 184-185 

position in cell, 185-186, 186F 

scaffold proteins, 184-185, 185-186 

structure, 184 

see also Protein complexes (assemblies) 
Protein microarrays, 188 
Protein modules, 140-141, 175-176 
Protein motifs, DNA-binding motifs see DNA- 

binding motifs (proteins) 
'Protein-only inheritance,' yeast, 398, 398F 
Protein phosphatase(s), 176-177, 176F, 895 

Cdk activation, 1063, 1063F, 1066T, 1074-1075 

5'mRNA capping, 347F, 3446 

protein tyrosine phosphatases (PTPs), 938-939 
Protein phosphorylation, 175-178 

ATP as phosphate donor, 84F 

energetics, 177 

enzymes involved, 176, 176F 
see also Protein kinase(s); Protein 
phosphatase(s) 

functional effects, 175-176, 186-187, 187F 

gene regulatory proteins, 450, 451 F 

GTP-GDP exchange as alternative to 
phosphorylation, 178-179 

GTP-mediated seeGTP-binding proteins 
(GTPases) 

phosphorylation cycles, 177 

phosphorylation site as recognition signal for 
protein binding, 175-176 

Rb protein regulation, 1 1 04 
Protein-protein interactions, 187-190 

analysis methods, 523-527 

dimerization, 142 

interface types, 1 56, 1 56F 

map construction, 188-190, 524 

optical methods, 524-526 

quaternary structure, 136 

SH2 domains see SH2 domain (Src homology 2 
domain) 

single molecules, 526-527 

subunits, 140F, 141 F, 142-143, 148F 
Protein purification 

chromatography, 512-514, 515F, 535F 

electrophoresis, 517, 518FF, 521-522, 522FF 

Western blotting, 51 9F 

see also Protein analysis; Recombinant proteins 
Protein sorting 

apical domain, 806-807 

basolateral domain, 807 

endosomes, 807F 

glycosphingolipids, 807 

lipid rafts, 807 

signals, 699, 701-702, 701 F, 702T, 703F 

trans Golgi network, 807F 
Protein stains, Coomassie blue, 51 7, 51 8F 



Protein structure, 59-60, 125-152 
amino acid sequence, 125-137 

see also Amino acid(s) 
analysis, 139 
components, 126F 
computational analysis, 139, 139F 
conformation, 130-131, 154-155, 154F, 155F 

see also Allosteric regulation; Conformational 
changes 
core structure, 131,135, 135F 
C-terminus, 59, 1 26F, 482-483 
depicting, 132-133FF, 144F 
determination, 139, 517-532 

SDS-PAGE,517 

X-ray diffraction, 139, 527-529, 528F 
domains, 131,136,348 

3-D models, 131, 132-133FF, 137F 

evolutionary tracing, 155-156, 155F, 257F 

fusion, 560F 

large kinase domain, 1 36F 

modules, 140-141, 140F 

paired, 141 

SH2 domain, 131, 132-133FF, 136F, 155, 156 

SH3 domain, 136F 
helical filaments, 143, 144F, 145, 145F 

actin, 145F 

see also Actin/actin filaments 

assembly, 145F 

electron microscopy, 61 OF 

handedness, 143 
limited fold numbers, 1 36-1 37 
membrane spanning regions, 1 35 
noncovalent forces, 126-127 
N-terminus, 59, 126F 
polypeptide backbone, 125 
primary structure, 136 

coding, 199-200, 199F,200F 
see also Genetic code 

prediction, 550-551 
quaternary structure, 1 36 
secondary structure, 1 36 

othelix, 131J34F, 135, 135F 

P sheets, 131, 134F, 135, 135F,422,423F 
stability, 137, 147-148, 147F 
structure-function relationship, 154-155 
subunits, 140F, 141 F, 142-143, 143F 
tertiary structure, 136 
unstructured polypeptide chains, 146-147 
see also Peptide bonds; Protein folding 
Protein structure-function relationship, 154-155 
Protein synthesis, 85, 85F, 366-400 
elongation cycle, 373, 373F, 376F, 377F 

elongation factors, 179-180, 377-378, 377F 

peptidyl transferase, 376, 379, 379F 

steps, 375-376, 376F 
ER and see under Endoplasmic reticulum (ER) 
eucaryotic, 345F, 380, 380F, 399F 
evolution, 407-408 

experimental analysis, cell-free systems, 512 
inhibitors, 384, 385T 

see also Antibiotics 
initiation, 379-380, 380F 
levels of regulation, 399 
location see Ribosome(s) 
peptide bond formation, 59, 60F, 125, 373F 
post-translational alterations see Post- 
radiational modification 
procaryotic, 345F, 380 
reading frames, 368, 368F 
relation to DNA see Gene expression; Genetic 

code; Messenger RNA 
termination, 381, 381 F 
see also Protein folding; Translation 
Protein tags 

affinity chromatography, 51 5F 
cleavable, 515 
GST, 514-515, 516F 
histidinetag, 548 

see also Affinity chromatography; Epitope 
tagging 
Protein translocation, 703F 

across chloroplast membranes, 719-720 

see also Chloroplast(s),protein import 
across mitochondrial membranes, 713-718 

see also Mitochondrial protein import 
in endoplasmic reticulum see Endoplasmic 
reticulum (ER) 
Protein translocators, 714-715 
Protein transport 

across membranes, 695-748 



chloroplasts see Chloroplast(s), protein import 

endoplasmic reticulum see Endoplasmic 

reticulum (ER), protein transport 

gated, 700, 701 F 

general flow, 701 F 

mitochondria see Mitochondrial protein import 

nucleus see Nuclear-cytoplasmic transport 

between organelles, 699-701 

peroxisomes see Peroxisome(s), protein import 

signal hypothesis, 726-727, 727F, 728F 

studies, 703F 

through secretory pathway, 749-787 

see also Protein sorting; Transmembrane 
protein(s); Vesicular transport 
Protein tyrosine kinases see Tyrosine kinase(s) 
Protein tyrosine phosphatases (PTPs), 938-939 
Proteoglycan(s), 1184T 

aggregates, 1182, 11 82F, 1183F 

assembly/synthesis, 775-776, 1 1 81 , 1 1 81 F 

basal lamina, 1165 

cell surface co-receptors, 1183-1184 

connective tissues, 1 1 79, 11 81 -1 1 82 

membrane proteins, 636 

as molecular sieves, 1 1 82-1 1 83 

protein regulation by, 1182-1183 

sizes, 1181-1182, 1181 F 

see also specific types 
Proteolysis, 391 

apoptosis (programmed cell death), caspases see 
Caspase(s) 

cell cycle control, 1064 

degradation signals, 396F 

insulin, 151,152F 

protein aggregate resistance, 397 

regulated destruction, 395-396, 396F 

regulatory function, 395-396 

see also Proteasome(s); Ubiquitin pathway 
Proteomics, 188 
Protists, 32-34, 33F 
Protocadherins, 1136, 1138T 
Protofilament, 971-972, 972F 
Proton see Hydrogen ion (H + , proton) 
Proton gradients see Electrochemical proton 

gradients 
Proton-motive force, 821, 821 F, 853-854 
Proton pumps, 827-828, 829F 

allostery, 836 

atomic detail, 835 

bacteriorhodopsin, 640-642 

evolution, 871 

general mechanism, 837F 

lysosomes, 780, 780F 

photosynthesis, 851, 854 

redox potentials, 835, 835F 

see also Respiratory enzyme complexes 
Proto-oncogenes, 1231, 1237, 1237F 
Protozoa, 28F, 32-33 

as parasites, 1494-1495, 1508-1511 
Protrusion, in cell movement, of actin meshwork at 

leading edge, 1039F 
Proxl protein, 1448 
P-selectin, 1145-1146 
Pseudogenes, 248, 255, 257 
Pseudoknot, RNA structure, 403F 
Pseudomonas aeruginosa, 1 491 
Pseudopodia, 1037 

Pseudouridine, transfer RNA modification, 368F 
P-site (ribosome binding), 375F, 376 
Psychoactive drugs, 686, 690 
PTB domain (phosphotyrosine-binding domain), 

898, 899F, 925 
PTEN phosphatase (tumor suppressor), 933 
P-type pumps (ATPases), 659, 660F 

see also Calcium pump; Sodium-potassium 
pump (ATPase) 
Puberty, 1288-1289, 1292, 1293 
Puffer fish (Fugu rubripes), 30, 31 F, 251 F 
Pulse-chase experiments, 602, 602F 
Pulsed-field gel electrophoresis, 534 
Puma, apoptosis, 1 1 23 
Pumps see Carrier protein(s) 
Pupa, Drosophila, 1 329 
Purifying selection, 247 
Purine base(s), 61, 11 6F 

depurination, 296, 297F 

see also Adenine; Guanine 
Purkinje cell, intracellular calcium, 597F 
Puromycin, 384, 385T 
Purple bacteria 

mitochondrial evolution, role, 860 
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photosynthetic reaction centers, 827F, 850, 850F, 
854F 
Pus, 1531 

Pyranosyl-RNA(p-RNA),402F 
Pyrimidine base(s), 61 , 1 1 6F 

see also Cytosine; Thymine; Uracil 
Pyrophosphate release, DNA synthesis, 268F 
Pyruvate 

anaerobic breakdown, 89-90, 90F 

citric acid cycle, 122F 

glycolysis, 88, 89F, 121 F 

oxidation to acetyl CoA, 96, 96F 

transport, 822 
Pyruvate carboxylase, biotin use, 84F 
Pyruvate decarboxylase, 96F 
Pyruvate dehydrogenase complex, 96, 96F 
Pyruvate kinase, 121 F 



Q-cycle, 835-836 

Quantitative reverse transcriptase polymerase 

chain reaction (RT-PCR), 573-574, 

573F 
Quantum dots, 587, 588F 
Quaternary structure, 136 
Quinone(s), 832F, 835, 851, 852F 



Rab5, 762, 762F 

Rabeffector(s),76l,761F 

Rab GTPase(s), 760-761, 761, 761 T 

Rabies virus, 1496F, 1517, 1518 

receptor, 1 505 
RacGTPase(s),931 

activation, microtubules role, 1043 

effect on actin organization, 1042F, 1046-1047 

epithelial apico-basal polarity, 1 1 55F, 1 1 56 

growth cone steering, 1387 
Rad51 protein, 307F 

see a/so RecA protein 
Rad52 protein, 308, 308F 
Radiation, DNA effects, 300-301 

see also Ultraviolet (UV) radiation 
Radiofrequency, NMR, 529 
Radioisotopes, 600-601, 601T 

applications, 534, 602, 602-603, 602F, 603F 
see also Autoradiography 

measurement, 601 
Radiolabeling, 534, 602-603, 603F 
Radixin, 1009 
Raf kinase (MAP-kinase-kinase-kinase), 929, 929F, 

1239 
RAG protein, 1563, 1564, 1564F, 1600 
Ramachandran plot, peptide bond angles, 127F 
Random mutagenesis, gene expression analysis, 

553,556 
Ran-GAP, 708 
Ran-GEF, 708 
Ran-GTP,711 
Ran GTPase(s) 

compartmentalization, 709F 

mitotic spindle formation, 1081 

nuclear transport, 708-709, 709F, 71 OF 

Ran-GTPase-activating protein (Ran-GAP), 708 

Ran-guanine exchange factor (Ran-GEF), 708 
Ras gene, 1107, 1233-1234, 1241, 1242, 1242F 
Ras GTPase(s), 179, 180F, 926-927, 936F 

activation of MAP-kinase pathway, 929F 

activation of PI 3-kinase, 934 

active and inactive forms, 1 81 F 

mitogen activation, 1 1 03 

mutated in cancers, 11 07, 1233-1234, 1241 

normal Ras proteins, 1234 

as oncogene, 1107 

regulation, GEFs, 927, 928F 

structure, 180F 
Ras guanine nucleotide exchange factor (Ras GEF), 

927, 928F 
Ras-MAP-kinase signaling pathway, 928-930, 929F, 

1103,1176 
Rb (retinoblastoma) gene/protein 

cancer, 1104-1105, 1234, 1235 

cell cycle control, 1 103-1 105, 1 104F, 1 244F 

knock-out in mice, 1242 

regulation by phosphorylation, 1 104 

ways of losing, 1236F 



Reaction coupling, 75F, 78F, 824-825 
activated carrier formation, 79-80 
ATP role, 81 
glycolysis, 91 
mechanical model, 80F 
see also Free energy 
Reading frames 

open see Open reading frames (ORFs) 
protein synthesis, 368, 368F 
RecA protein 

homologous recombination, 306-307, 307F 

synapsis, 307-308, 307F 
homologs, 307-308, 307F, 308F 

Brca1/Brac2 proteins, 310 
protein-DNA interactions, 307F 
Receptor(s) 

cell-surface see Cell-surface receptor(s) 

down-regulation, 795 

editing, 1 548, 1 548F, 1 565, 1 565 

see also Immunological tolerance 
intracellular, 881, 881 F 

see also Nuclear hormone receptor(s) 
ligand-gated ion channels see Ligand-gated ion 

channel(s) 
see also Receptor-mediated endocytosis; specific 
types 
Receptor guanylyl cyclases, 922 
Receptor-like tyrosine phosphatase, 922 

see also Protein tyrosine phosphatases (PTPs) 
Receptor-mediated endocytosis, 784, 791-799 
EGF uptake, 794 
extracellular macromolecule import, 791-792, 

793F 
iron uptake, 794 
material degradation, 793F 
membrane protein sorting, 794F 
receptor recycling, 792-794, 793F 
signal peptides, 792 
transcytosis, 793F, 797-798, 797F 
Receptor proteins see Receptor(s) 
Receptor serine kinase family, evolution, 1 77F 
Receptor tyrosine kinases (RTK), 922-924, 923F, 
923T, 924FF, 925, 925F 
animal development, 1 31 6F 
docking with SH2-domain proteins, 924-925, 

924F, 925F 
Ras pathway activation, 926-930, 928F, 929FF 
Recessive mutation(s), 558 
Redinomonas, 862 

Recombinant DNA technology, 532-553 
basic principles, 532 
DNA cloning see DNA cloning 
DNA labeling, 534, 536F 
see also DNA probe(s) 
DNA libraries, 540-542, 541-542, 542F, 543F 
DNA microarrays, 537 
DNA sequencing see DNA sequencing 
electrophoresis see Electrophoresis 
historical aspects, 532, 533T 
human replication origin identification, 288 

see also Human genome 
nucleic acid hybridization, 532, 537F, 538F, 539F 

in situ hybridization, 532 
PCR, 544-546, 545F, 546F 
protein production see Recombinant proteins 
reporter genes see Reporter gene(s) 
restriction enzymes, 532, 532-533, 534F 
RNA production, 546 
site-specific recombination applications, 

325-326 
transgenic organisms see Transgenic organism(s) 
see also Genetic engineering; Recombinant 
proteins; specific techniques 
Recombinant proteins 
applications, 546, 548 
expression vectors, 546, 548F 
protein tags, 514-516, 548 
see also Protein purification 
Recombination, 263, 305F, 554F 
antibody chains, 1564-1565 
DNA repair role, homologous end-joining (DSB 

repair), 303 
general (homologous) see Homologous 

recombination (crossing-over) 
genetic engineering see Recombinant DNA 

technology 
meiotic see Homologous recombination 

(crossing-over); Meiosis 
promiscuous, 315-316 
site-specific see Site-specific recombination 



Recombination complex, 1280 
Recombination signal sequences, V(D)J joining, 

1564F 
Red blood cell(s) see Erythrocyte(s) 
Redox pairs, 828 

Redox potential(s), 828-829, 830F 
concentration changes, 830 
AG calculation, 830 

electron transfer, 829, 831 , 835, 835F, 852F, 853 
measurement, 830, 830F 
standard, 830T 
Reduction-oxidation (redox) reactions, 828-829, 

831 
Reduction reactions, 71-72, 71 F, 828 
Redundancy, genetic, 39 
Refractory period, E. coli replication, 282, 284F 
Regeneration 

basal lamina and, 1 1 68-1 1 69, 1 1 68F 
intercalary, 1354, 1354F 
optic nerve, in frog, 1 391 
skeletal muscle, 1463-1467 
Regulated secretory pathway (exocytosis), 800, 
801 F, 804 
localization, 804, 804F 
plasma membrane enlargement, 805, 806F 
see also Exocytosis 
Regulation by multisite modifications, 186-187, 

187F 
Regulatory cascades, 1 70, 482-483, 482F 
Regulatory DNA, 7-8, 206 

eve gene modular organization, 1 340F 
patterning of Drosophila embryo, 1 332 
programming of development and, 1 309, 1 309F 
as speciality of eucaryotes, 31-32 
Regulatory myosin light chain (RMLC), Rho- 

mediated phosphorylation, 1095 
RegulatoryTcell(s), 1543-1544, 1569-1570, 

1574-1575 
Release factors (translational), 381 , 381 F, 382F 
Remodeling, of synaptic connections, 1 393 
Reoviruses, host defences, 1 535 
"Repair factories," homologous recombination, 310, 

310F 
Repeat-induced recombination, prevention, 31 5F 
Repetitive DNA 

centromeric sequences, 210-21 1 

genetic markers, VNTRs, 547F 

human genome, 207F, 323F 

multiplication in genome, 323F 

repeat-induced recombination, 31 5F 

RNApolllCTD,346 

short tandem repeats, hypervariable 

microsatellites, 547F 
telomeric sequences, 210, 292 
see also Transposon(s); specific types 
Replication errors, 269, 271-272, 276-277 

see also Mismatch repair 
Replication fork, 266, 267-268, 269F 
arrangement, 276F 
asymmetry, 266, 268 
bacterial, 266, 276-277, 282 
cooperativity, 275-276 
DNA microarray analysis, 285, 286F 
eucaryotic, 280, 1067 

chromatin assembly factors (CAFs), 290 
DNA damage response, 1 1 05 
experimental analysis, 284F 
histone association, 290 
mammalian, 281 F, 283-284 
movement rate, 284 
nucleosome assembly, 289-290 
nucleosome interaction, 280 
initiation, 282, 282F 

see also Replication origin(s) 
lagging strand synthesis, 267-268, 269F, 272, 

272F,276F,281F 
leading strand synthesis, 268, 269F, 276F, 281 F 
strand separation, 273 

see also DNA helicase(s); DNA topoisomerases 
structure, 268, 269F 

template strand, 200, 200F, 201 F, 266, 266F 
see also DNA replication 
Replication origin(s), 282F, 1067 
bacterial, 282, 283FF 

hemimethylation, 282, 284F 
chromosomal, 209, 210F 
DNA sequences, 210, 281, 286, 287F, 288 
DNA synthesis, 281-282 
eucaryotic, 283, 285F 
human, 288 
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origin recognition complex (ORC), 287, 288F 
replication units, 283-284 
5. cerevisiae, 286, 287F, 288F 
licensing, 1067 

prereplication complex (pre-RC) binding, 1067 
Replication units, eucaryotic replication origins, 

283-284 
Repticative cell senescence, 293, 505, 1059, 1 107, 

1217 
Reporter gene(s) 

gene expression analysis, 572-573, 572F 
GFP see Green fluorescent protein (GFP) 
LacZ, 442F 
Repressor protein(s) 
discovery, 416 

DNA-binding, 41 8T, 420F, 421 F, 423F 
eucaryotic, 445, 445, 445-447, 447F, 447FF 
mechanisms, 433-434, 434F, 436F, 445, 446F 
procaryotic 

Cro repressor see Cro repressor 
lac repressor see Lac repressor 
lambda repressor see Lambda repressor 
Met repressor, 423F 
operator binding, 434-434, 434F 
tryptophan repressor see Tryptophan 
repressor 
translational repressors, 488, 490F 
Reproductive cloning, 507, 507F, 1302-1303, 1303F 

Dolly the sheep, 1287 
Reproductive hormones, effect on cancer 
incidence, 1229 
see also specific hormones 
Rescue, in dynamic instability, 980, 981 F, 1 080 
Respiration 

evolutionary aspects, 874-875 
oxidation of organic molecules, 70 
relation to photosynthesis, 70, 70F 
Respiratory burst, phagocytosis, 1532 
Respiratory chain see Electron transport chain(s) 
Respiratory control, 837-838 
Respiratory enzyme complexes, 819-820, 831-839 
cytochrome b<i complex, 832, 833F, 835-836, 

836F 
cytochrome oxidase complex, 832-834, 833F, 

834, 834F 
energetics, 819, 81 9F, 835, 835F 

see also Proton pumps 
identification, 831-832 
localization, 831-832, 833F 

see also Mitochondrial structure 
NADH dehydrogenase complex, 832, 833F, 836 
Respiratory epithelium, 1435, 1435F 
colonization, 1502, 1503 
function, 1502 

mucus, as host defense mechanism, 1 502, 1 525 
Restriction enzymes 

biological function, 532-533 

recombinant DNA technology, 532, 532-533, 

534F, 540-541 
sequence specificity, 533 
Restriction-modification systems, 532 
Restriction (start) point of cell cycle, 1055, 1061, 

1066,1105 
Retina 

gap junctions, 1 1 62, 1 1 62F 

histology and cell renewal, 1432-1433 

neuronal specificity and retinotectal map, 

1392-1393, 1393F 
structure, 1432-1433, 1432F 
Retinal, 166, 166F, 640, 1432 
Retinal ganglion cells, 1391, 1432 
Retinoblastoma, 1104, 1234-1235, 1235F 
Retinoblastoma gene/protein see Rb 

(retinoblastoma) gene/protein 
Retinoic acid, 889F, 890, 1285 
Retinoic acid receptor, 891 F 
Retinotectal map, refinement, 1393-1395, 1394F 
Retinotectal projection, 1391, 1393F 
Retromers, 755-756, 756, 756F 
Retrotranslocation, misfolded proteins, 391, 

739-740, 740F 
Retrotransposons, 318T, 323 

see also Retroviral-like retrotransposons 
Retroviral-like retrotransposons, 317, 318T, 320, 
322F 
human genome, 207F 
LTRs,318T 

reverse transcriptase, 318T 
seeaiso Retroviruses 
Retroviruses 



Gag proteins, translational frameshifting, 383, 
384F 

genome, 321 F 

genome integration, 321 F 
see also Transposition 

HIV see HIV (human immunodeficiency virus) 

integrase see Integrase 

life cycle, 321 F 

mutation rate, 1450 

oncogenesis, 1232 

Pol proteins, translational frameshifting, 383, 
384F 

reverse transcriptase see Reverse transcriptases 

translational frameshifting, 383, 384F 

seeaiso Retroviral-like retrotransposons 
Reverse electron flow, 838, 872, 873F 
Reverse genetics see Genetics,reverse 
Reverse transcriptases, 264F, 292 

applications, cDNA cloning, 542, 543F 

retrotransposons, 318T 

structure, HIV, 321 F 

see a/so Telomerase 
Reverse two-hybrid system, 524 
Rev protein, HIV mRNA transport, 485-486, 485F, 

486F 
Rev responsive element (RRE), 485, 485F, 486F 
RGD motif, integrin-fibronectin binding, 1193 
R-gene product, 1522, 1523F 
RhoA GTPase(s), 1094-1095, 1095F, 1 177 
RhoCgene, in metastasis, 1249 
Rhodamine, 586-587 
Rhodopsin,918,918F, 1432 

binding todynein, 1022 

ER membrane integration, 735F 

retinal binding, 166, 166F 
RhoGEF, 1095 
RhoGTPase(s) 

actin organization, 1041-1043, 1042F, 
1043-1045 

growth cone steering, 1 387 

monomeric, 931, 1042-1043 

RhoA see RhoA GTPase(s) 
Rhombomeres, 1345 
Ribbon protein models, 132F, 137F 
Ribonuclease, size, 1 1 82F 
Ribonucleic acid see RNA {ribonucleic acid) 
Ribose, 112F, 116F,332, 332F 
Ribosomal proteins, 373 

bacterial, 378F, 379F 

ribosomal location, 378-379, 379F 

translational control, 488 
Ribosomal RNA (rRNA), 6-7, 6-7, 373 

bacterial, 378F, 379F 

eucaryotic, 360-361, 361 F 

functions, 336T, 378 

genes, 360-361 , 361 F, 363, 863, 1 290 

modifications, 361, 361 F, 362F 

processing, 360-361, 361 F, 362F 
seeaiso SnoRNAs 

synthesis, 360-361 

see also RNA polymerase I; RNA polymerase III 

transcription, 334F, 335-336, 340T 
Ribosome(s), 6-7, 8F, 374F 

assembly, 488 
bacterial, 149-151 
eucaryotic, 362-363, 364F, 374-375 
seeaiso Nucleolus 

bacterial, 1 49-1 51 , 378F, 379F 
antibiotic binding, 384, 384F, 385T 
eucaryotic vs„ 374F 

composition see Ribosomal proteins; Ribosomal 
RNA 

efficiency, 375 

electron microscopy single-particle 
reconstruction, 61 2F 

eucaryotic, 360-361, 362-363, 374F 

free, 728, 729F 

hydrophobic interactions, 381 

internal ribosome entry site (IRES), 491 

leaky scanning, 490 

membrane-bound, 728, 729F 

polyribosomes (polysomes) see Polyribosome(s) 

protein synthesis, 373-376, 488 

seeaiso Protein synthesis; Translation 

ribozyme activity, 378-379 

rough ER, 724-725 

structure, 373, 374-375, 374F, 378-379, 378F, 
379F 
binding-sites, 375-376, 375F 
catalytic site, 378-379 



large subunit, 375 
protein location, 378-379, 379F 
RNA location, 378, 378F 
small subunit, 375 
Ribosome recycling factor, 381 F 
Riboswitches, 478-479, 478F, 489F 
Ribozymes, 378-379, 404FF, 406F 
allostery, 406F 

conformational changes, 404, 406F 
protein catalysts vs , 404 
ribosomes, 378-379 
ribozyme-catalysed reactions, 405T 
RNA folding, 404, 404F 
synthetic, 404F, 408 
see also Catalytic RNA 
Ribulose, 112F 

Ribulose bisphosphate carboxylase, 844, 844F, 846 
Rickettsia prowazekii, genome size, 18T 
Rickettsia rickettsia, actin-based movement, 

1515-1516 
Rifamycin, 385T 
Rigor state, of myosin, 1 01 9 
RISC (RNA-induced silencing complex), 494, 494F 
RITS (RNA-induced transcriptional silencing 

complex), 496 
River blindness, 1518 
RNA (ribonucleic acid), 4-8, 62F 
biosynthesis, 85, 85F, 86, 87F 
cellular types, 336T 

mRNA see Messenger RNA 
noncoding, 360-361, 362F 
see also specific types 
composition, 335-336 
degradation, 379F, 41 5, 485, 492-493, 495 
double-stranded (dsRNA), 1535 
editing see RNA editing 
evolutionary significance, 300, 408, 408F 

see also RNA world hypothesis 
genetic material, 300 
information transfer, 331, 331 F 

see also Transcription 
localization, gene expression regulation, 379F, 

415,486-488 
molecular conformation, 4-5, 5F, 7F 
processing see RNA processing (eucaryotic) 
recombinant DNA technology, 546, 548 
in situ hybridization, 585, 586F 
splicing see Splicing 
structure see RNA structure 
synthesis, 335-336,408 
autoradiography, 602 
error frequency, 270 
polytene chromosome(s), 239F 
see also Transcription 
RNA cleavage regulation, 439-441 
RNA-DNA hybrids, 338, 536-537, 538F 
RNA editing, 483 

evolutionary significance, 483-484 
gene expression regulation, 483-485 
mammalian mRNA, 484 
rRNA processing, 361, 361 F, 362F 
transfer RNA, 368F, 369, 369F 
trypanosome mitochondrial mRNA, 483, 484F 
see also Guide RNAs 
RNA endonuclease, 492, 492F 
"RNA factory" concept, RNA polymerase II, 346, 

347F 
RNA-induced silencing complex (RISC), 494, 494F 
RNA-induced transcriptional silencing complex 

(RITS), 496 
RNA interference (RNAi), 495-497, 571-572 
Caenorhabditis elegans, 571, 571 F 
chromatin remodeling, 443 
Dicer protein, 496 
Drosophila, 571 

experimental applications, 497, 564F 
in human disease, 497 
mechanism of action, 571 
reverse genetics, 572 
technique, 571-572 
transposable element reduction, 496 
tumor cell identification, 1240 
viral dsRNA-directed ssRNA destruction, 1535 
RNA-like polymers, origin of life, 402, 402 F 
RNA polymerase(s), 334, 334F, 335F 
bacterial, 336-337, 337F, 438F 
RNA pol II vs., 339-340, 340F 
sigma factors, 336, 439, 439T 
structure, 337F 
transcription cycle, 337F 
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transcription regulation, 438-439, 439T, 440 
chromatin remodeling, 433 
DNA polymerases vs„ 334, 335 
DNA repair coupling, 299-300 
elongation factor association, 343-344 
energetics, 337 
eucaryotic, 339-340, 340T 

chromatin effects, 343, 343F, 344 

functions, 339, 340T 

RNA polymerase I, 339, 340T 

RNA polymerase II, 339, 340T 

RNA polymerase III, 339, 340T 

transcription regulation, 440 

see also Transcription factors,general (TFII) 
evolution of, 335F 
fidelity, 270, 334-335 
nucleoside triphosphate substrates, 334 
orientation, 339, 339F 
promoter binding, 337 
proofreading, 334-335 
protein-DNA interaction, structural changes 
coupled to DNA-binding, 336-337, 
338 
reaction mechanism, 334 
structure, 334F 

superhelical tension generation, 344-345, 345F 
see also individual enzymes 
RNA polymerase I, 339, 340T, 360, 363 
RNA polymerase II, 339, 340T 

general transcription factors, 340-342 

see also Transcription factors,general (TFII) 
processivity, 357-358 
"RNA factory" concept, 346, 347F 
structure, 340F 

bacterial polymerase vs, 339-340, 340F 

C-terminal domain (CTD), 341, 346 
transcription initiation, 341-342, 341 F, 343F 

chromatin-modifying proteins, 343 

transcriptional activators, 342-343, 343F, 440 

transcriptional mediator, 343 
transcription termination, 358 
RNA polymerase III, 339, 340T, 361 
RNA primers, DNA replication, 272, 272F 
RNA probe(s), 537 

RNA processing (eucaryotic), 345-346 
editing, 483-485 
gene expression regulation, 379F, 41 5, 477-485 

3' changes, 482-483 

splicing, 477-481 

see also Alternative splicing 
mRNA capping, 346 
nuclear subdomains, 364-366 
polyadenylation, 346 
splicing see Splicing 
RNA-RNA rearrangements 
snRNPs,351 
splicing mechanism, 350-351, 350F, 351 F 

AT-AC spliceosome, 353-354 

ATP hydrolysis, 351-352 
RNA splicing see Splicing 
RNA structure, 272, 332-333, 332F, 333F 
base-pairing, 332, 333F, 403 
DNA comparison, 408 

double-stranded, RNA interference (RNAi), 495 
folding, 332, 333F, 403-404, 403F, 404 

noncovalent interactions, 64, 64F, 403 

tRNAs, 368, 368F 
nucleotides, 62, 332 

see also Nucleotide(s) 
phosphodiester backbone, 332F 
secondary, 403, 403F 

hairpins, 338, 403F 
Tar, transcription attenuation role, 478 
tertiary, 403-404, 403F 
RNA transcripts see Messenger RNA 
RNA transport, 486-488 

gene expression regulation, 379F, 41 5, 485-486, 

486-488 
mechanisms, 486-487, 487F 

3' UTR, 487-488, 488F 
nuclear see Nuclear-cytoplasmic transport 
regulation, 710 
RNA virus(es) 

assembly, 149-151 
defense, RNAi, 495 

double-stranded RNA (dsRNA), 1534, 1535-1536 
genomes, 1497F 
mutation rate, 1520-1521 
structures, 1496F 
RNA world hypothesis, 300, 400-404, 402F, 408F 



limited genetic code?, 407 

natural selection, 404-406 

self-replicating system, 401, 402F 

transition from pre-RNA world, 402-403 

see also Catalytic RNA; Ribozymes 
Rock kinase, 1095 
Rod photoreceptors (rods), 1432, 1433F 

amplification of signal in visual transduction 
cascade, 91 9-920, 920F 

outer segment, renewal, 1433, 1433F 

renewal, 1433F 

response to light, 91 8F 
Root(s) 

apical meristem, 1163, 1401F 

in plant development, 1400, HOOF 

structure, 1404-1405F 

tip, 1407F 
Root cap, 1407F 
Root hairs, 1405F, 1407F 
Rosettes, cellulose synthase, 1199, 1 201 
Rotational diffusion, membrane proteins, 642 
Rotaviruses, 1496F 
Rough endoplasmic reticulum see Endoplasmic 

reticulum (ER), rough 
Roundabout receptor protein, 1388F, 1389 
Rous sarcoma virus, 1 232, 1 233F, 1 499 
rRNA see Ribosomal RNA 
RTK (receptor tyrosine kinase) see Receptor tyrosine 

kinases (RTK) 
Rubisco see Ribulose bisphosphate carboxylase 
Ruffling, at leading edge of cells, 1 040F 
RuvA, Holliday junctions, 312F 
RuvB, Holliday junctions, 312F 



SI fragment, myosin, 1012 
S6 kinase (S6K), 1108 
Saccharomyces cerevisiae 

actin, interactions with other proteins, 983, 983F 

cell cycle, 1056-1057, 1056F 

cell polarization, 1044 

centromere structure, 229, 229F 

DNA replication, 286-287, 287F 

genome, 286-287, 287F, 552 

His gene, 285 F 

as model genetic organism, 286-287, 864 

mutants, 287 

protein interaction maps, 188, 190F 

role of cytoskeleton, cellular polarity, 969-970 

"shmoo,"1044, 1044F 
Saccharomyces pombe, genome, sequencing, 552 
Salivary gland, Drosophila polyteny, 236-238, 237F, 

237FF 
Salmonella enterica, 1491 

invasion of host cells, 1 508, 1 509F 

pathogenicity islands, 1491 F 

phagocytosis, 1533 

spread, 1518 

strategy against host membrane traffic, 1 51 1 F, 
1512J512F 
Salmonella, phase variation, 454-455, 455F 
Saltatory conduction, action potential propagation, 

680 
SAM complex, 715 

Sari protein, coated-vesicle formation, 759, 759F 
Sarcoma(s), 1206, 1232 
Sarcomere, 1026, 1028, 1028F 
Sarcoplasmic reticulum, 660, 725-726, 1029 
Satellite cells, skeletal muscle, 1466 
Satellite tobacco necrosis virus, capsid, 149F 
Scaffold protein(s), 894, 897, 930, 93 IF, 1 145, 1148F 

PDZ domains, 1148 

synapse formation and, 1 1 48-1 1 49, 1 1 49F 

tight junction(s), 1153-1155 

unstructured polypeptide chains, 147 
Scale, 580, 580F 
Scanning electron microscopy (SEM), 607-608, 

608F, 609F 
S-Cdk, 1066, 1067-1069, 1068 
S. cerevisiae see Saccharomyces cerevisiae 
SCF (stem-cell factor), 1375 
SCF enzyme complex, 184-185, 185F, 1064, 1065F, 
1066T 

protein interaction maps, 188, 190F 
Schistosoma haematobium, bladder cancer 

causation, 1228 
Schistosomes, eosinophil attack, 1532, 1532F 
Schizosaccharomyces pombe, 24F, 1056-1057, 



1056F 
Schwann cell(s), myelin production, 678 
Scintillation counter, 601 
Scrapie, 397-398, 1498 
Scribble complex, epithelial apico-basal polarity, 

1156, 1157F 
Scurvy, 1186 
Scute gene, 1 356 
S-cyclins, 1062,1064, 1087 
SDS (sodium dodecyl sulfate), 517, 517F 

membrane protein solubility, 637 

SDS-PAGE,517,518F,637 

see also Polyacrylamide gel electrophoresis 
(PAGE) 
Sealing strands, tight junction(s), 1152, 1152F, 

1153F 
Sea urchin, gastrulation, 1307F 
Sebaceous gland, 1418F 
Secondary antibody response, 1 546F 
Secondary cell walls, 1195, 1196 
Secondary cultures, 504 
Secondary immune response, 1546 

see also Immune response/system,adaptive 
Secondary lymphoid organ see Peripheral 

lymphoid organ 
Secondary oocytes, 1289, 1289F 
Second messenger(s), 893 

see also Cyclic AMP (cAMP) 
y-Secretase, 947 
Secretory component, 1556F 
Secretory C-terminal, 482-483 
Secretory granule(s) see Secretory vesicle(s) 
Secretory protein(s), exocytosis, 800 
Secretory vesicle(s), 800 

egg (ovum), 1288 

exocytosis, 802F 

formation, 801-802, 801 F 

membrane fusion, 803 

membrane recycling, 805 

proteolytic processing of cargo, 803, 803F 

sperm, 1293 

synapses see Synaptic vesicle(s) 
Sectioning, microscopy sample preparation, 585, 

585 F, 605 
Securin, destruction by APC/C, 1 064, 1 066, 1 087 
Sedimentation coefficient, 511, 522 
Seed(s), 1402, 1404F 
Seedlings, 1403 

Arabidopsis mutants, 1403F 
Segmentation 

genes controlling, 1336-1338, 1337F, 1338F 
Drosophila see Drosophila melanogaster 

segmentation genes 
see also specific genes 

insect body, 1329, 1330F 

see also Drosophila development 

vertebrates, 1345, 1346F, 1371-1372, 1372F 
Segmentation clock, 1372-1373 
Segmentation, Drosophila, 1329 
Segment-polarity genes, 1 333, 1 338F, 1 339 
Segment shuffling, DNA, 19, 19F 
Selectin(s), 1145-1146, 11 77T 

functions, 1145, 1146F 

L-selectin, 1145, 1550, 1550F 

structure, 1145-1146, 1146F 
Selective pressure, Antibiotic resistance, 1523 
Selectivity filter(s), ion channels, 668, 672 
Selenocysteine, translational recoding, 383, 383F 
Self-assembly, lipid bilayers, 9, 10F 
"Selfish DNA," definition, 317 
Self-replicating molecules, evolution 

autocatalysis requirement, 401 

catalytic RNA, 404, 407F 

natural selection, 404-406 

polynucleotides, 401 

polypeptides, 401 
Self-splicing RNA, 356-357, 356F 

see also Catalytic RNA 
SEM see Scanning electron microscopy 
Semaphorin protein, 1388F, 1389 
Semiconservative DNA replication, 266, 266F 
Seminiferous tubules, 1293 
Semliki forest virus, 1 507 
Senescence see Cell senescence 
Sensory bristles, 1356, 1356F 
Sensory cells, as transducers, 1429 

see also specific types 
Sensory epithelia, 1429, 1429-1433 
Sensory mother cell, 1 356-1 357, 1 356FF 
Sensory neuron(s), 1386F, 1390 
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Sepallata, Arabidopsis mutant, 1414 

Separase, 1087, 1278 

Sepsis, 1534 

Septate junctions, 1154-1155, 1154F 

Septic shock, 1534 

Septins, 1097 

Sequence tags, genetic engineering, 570 

Sequential induction, 1319-1320 

see also Drosophila development 
Serine protease(s) 

active site, 154, 155F 

extracellular matrix degradation, 1 1 94 

protein family evolution, 138-139, 140-141, 
140F 

structural homologies, 137-138, 138F 
Serine, structure, 129F 
Serine/threonine kinases, 941 

cell signaling, 939, 940F 
plants, 956-959 

receptor-linked, 177F 

regulation, 177 

see also individual kinases 
Serine/threonine phosphatases, 938 
Serotonin {5-hydroxytryptamine, 5-HT), 904 

endocrine cells of respiratory epithelium, 1 435 

neurotransmitter role, 684 
Sertoli cells, development, 1284-1285, 1286F 
Seveniess mutant, Drosophila, 927 
Sev protein, Drosophila, 927 
Sex chromosome(s), 202, 473, 554F 

meiotic pairing, 1275 

sex determination and, 1271, 1284F 

see also X-chromosome; Y-chromosome 
Sex determination, 1 282-1 286 

Drosophila melanogaster, 481-482, 481 F, 482F 

gonad development, 1283, 1283-1285, 1283F, 
1286F 

variation between species, 1286 

see also Primordial germ cells (PGCs) 
Sex differences, meiosis regulation, 1280-1281 
Sex hormone, 890 
Sex-lethal (Sxl) gene, Drosophila sex determination, 

481F,482F 
Sex-reversed mice, 1284 
Sexual reproduction, 1269-1304 

evolutionary perspective 

competitive advantage, 1271-1272, 1 271 F 
genetic diversity, 1 271 , 1 279-1 280, 1 279F 

fertilization see Fertilization 

gamete production, 1090, 1269, 1272-1274 
see also Egg (ovum); Meiosis; Spermatozoa 

germ cell development see Primordial germ cells 
(PGCs) 

haploid vs. diploid cells, 1269-1271, 1270F 

horizontal gene transfer, 22 

overview, 1269-1272 

sex determination see Sex determination 

species differences, 1285-1286 

strategy differences, 1282 

SH2 domain (Src homology 2 domain) 

cell signaling proteins, 898, 899F, 924-926, 925F, 
936 
Src family, 178, 178F 
STATs, 937-938, 938F 

evolutionary tracing, 155, 155F 

protein phosphorylation, 175 

regulatory role, 178, 178F 

structure, 131, 132-133FF, 136F, 141, 144F 

surface-string interactions, 156, 156F 
SH3 domain (Src homology 3 domain), 898, 899F, 
925, 936 

regulatory role, 178, 178F 

structure, 136F, 141 
Shigella flexneri, 1491 

actin-based movement, 1516, 1516F 

antibiotic-resistant, 1524 

virulence plasmid, 1491 F 
Shine-Dalgarno sequence, 380, 488, 489F 
"Shmoo," of yeast, 1044, 1044F 
Shoot apical meristem, 1163, 1401 F, 1408FF, 

1409-1410, 141 IF 
"Shotgun" gene cloning, 542, 551 
Shugoshins, 1278 

Sialic acid (NANA), 628F, 774F, 775F 
Stcl protein, 1101 
Sickle-cell anemia, 1495 
Sidekicks proteins, synapse formation, 1 1 47 
Sigma factors, 336, 337F, 439, 439T 
Signal hypothesis, protein transport, 726-727, 727F, 
728F 



Signaling cascades, 895 
Signaling cell(s), 880 
Signaling molecule(s)/pathway(s) 
BH3-only proteins, 1123 
cancer role, 1242-1245, 1243F 
caspases, 1119 

cell adhesion and, 1 1 36-1 1 37, 11 45, 1 1 72 
developmental, 1316, 1316T 
Ceiegans, 1325-1326 
inhibitors, 1317-1318 
mechanisms of signal spread, 1318-1319, 

1318F 
morphogens, 1316-1317, 1317F, 1318F 
see also Development; Morphogenic 
gradient(s) 
DISC, 1120 

extracellular see Extracellular signal 
molecule(s)/pathway(s) 
intracellular see Intracellular signal 
molecule(s)/pathway(s) 
kinases as input-output devices, 1 77F 
proteoglycans, 1183 
signaling proteins, 627F, 630-631 
signal transduction see Signal transduction 
turnover rate of signaling molecules, 886-887, 

887F 
see also specific pathways/components; specific 
types/pathways 
Signaling networks, "microchip" analogy, 177-178, 

177F 
Signal patch(es), 701-702, 701 F 
M6P addition recognition, 785 
nuclear localization signals, 705-707 
Signal-recognition particle(s) (SRP) 
protein transport, 727-730, 728, 730 
structure, 728, 728F 
Signal-relaying junction(s), 1 1 32, 1 1 32F, 1 1 33T 

see also specific types 
Signal sequence(s), 701-702, 701 F, 703F 
chloroplast proteins, 719-720 
mitochondrial proteins, 713, 714F 
nuclear localization signals, 705 
peroxisomes protein import, 722 
rough endoplasmic reticulum protein import, 

726-727 
types, 702T 
Signal transducers, 641 , 891 
Signal transduction 

adaptation (desensitization), 902, 920, 920F 

cyclic AMP, 905-909 

G proteins see G protein(s) (trimeric GTP-binding 

proteins/ATPases) 
NFkB, Toll-like receptors, 1530 
nuclear receptor superfamily, 889-891, 889F, 

891 F 
photoreceptors, plant photoproteins, 960-961, 
961 F 
Silent mutation(s), 264 
Simian immunodeficiency virus (SIV), diversification 

and origin, 1 521 F 
Simian virus 40 (SV40), capsid, 149F 
SINEs (short interspersed nuclear elements), 207F, 

322 
Single-cell expression analysis, 575-576, 576F 
Single-nucleotide polymorphisms (SNPs), 560 
genome evolution, 259 
inheritance tracking, 562F 

see also Haplotype blocks 
physical gene mapping, 560, 560F 
Single-pass proteins see Transmembrane protein(s) 
Single-strand DNA-binding proteins (SSBs) 
cooperativity, 273, 274F 
DNA hybridization role, 306 
DNA replication machinery, 273, 274F, 276F 
eucaryotic, 280 
structure, 275F 
Single-strand DNA breaks (nicks), 277, 277F, 278, 
279F 
see also Strand-directed mismatch repair 
Single-strand DNA, effect of SSBs, 273, 274F 
Sister chromatid(s), 243FF, 1054 
meiotic, 1273 
mitotic, 243, 243FF 

cohesion, 1054, 1070-1071, 1070F 

resolution, 1075 

separation, 1075, 1087, 1087F, 1088, 1088F, 

1089-1090, 1089F 
spindle attachment, 1083-1085, 1084F 
Site-directed mutagenesis, 565, 565F 
Site-specific recombination 



applications, 325-326 

consequences 

gene control, 325-326 

genome rearrangement, 324, 324F, 455, 455F 

mutations, 565, 565F 

mechanisms 

conservative, 324, 324F, 325-326 
transpositional see Transposition 
see also lntegrase;Transposase 
Skeletal muscle, 1463-1467 

actin filaments, 1 003, 1012 

see also Actin/actin filaments; Myosin(s) 

cells, 1463 

cell size, 1026, 1026F, 1463 
differentiation, 464, 464F 
satellite cells, 1466 
see also Muscle fiber(s) 

genesis, 1463-1467 

modulation, 1463-1467 

regeneration, 1463-1467 

structure, 1012F, 1026, 1027F, 1029, 1029F 
Skin 

architecture, 1418, 141 8F 

cancer, 295T, 1226 

function, 1417 

Mammalian, 1418F 

replicative senescence, telomeric repeats, 293 

supportive functional requirements, 1417-1418 
Sliding clamp, 274, 275F, 281 F 
Slit protein, 1388F, 1389 
SLRNP, 353-354, 354F 
Slug gene/protein, epithelial-mesenchymal 

transitions, 1141 
Smad4 gene, colorectal cancer, 1255 
Smad-dependent signaling pathway, 940F 
Smad family, 939-940, 941 
Small-cell lung cancer, associated mutations, 1256 
Small interfering RNA (siRNA), 336 
Small intestine, histology, 1436-1442, 1436F 
Small-molecule inhibitors, 527, 527F 
Small molecules 

organic see Organic molecules (small, cellular) 

transmembrane transport, 651 

universality, 8-9 
Small nuclear ribonucloproteins see SnRNPs 
Small nuclear RNAs see SnRNAs 
Small nucleolar RNAs see SnoRNAs 
Smallpox virus, 1496 
SMC proteins, 243, 244F, 1070 
SMN (survival of motor neurons) protein, 364-365 
Smooth endoplasmic reticulum see Endoplasmic 

reticulum {ER),smooth 
Smoothened protein, 951 
Smooth muscle 

calcium influx, 1030 

cells, 1463-1464, 1463F 
blood vessels, 888, 1450 

mechanism of action, 1030 
Snail gene/protein, epithelial-mesenchymal 

transitions, 1 141 
SNARE proteins 

disassembly, 764, 764F 

membrane fusion, 762-764, 763F, 768, 769F 

Rab effector interaction, 761 

target membrane SNAREs, 762-763, 763FF 

tetanus toxin target, 760 

vesicle SNAREs, 762-763, 763F 

vesicular transport guidance, 760 
SnoRNAs, 361 

function, 336T, 362F 

nuclear localization, 365F 

transcription, 340T 
snRNAs, 349-350 

catalytic functions, 351 

function, 336T 
see also Splicing 

nuclear localization, 364, 365F 

protein complexes (snRNPs) see SnRNPs 

SL snRNA, 354F 

transcription, 336, 340T 

Ul snRNA,349,351,352 

U2snRNA,349,352 

U4snRNA,349 

U5snRNA,349, 352 

U6 snRNA, 349, 351, 352, 363 

see also Spliceosome 
snRNPs, 349-350 

assembly, 365F 

AT-AC spliceosomal, 354-355, 354F 

recycling, 364 



Page numbers in boldface refer to a major text discussion of the entry; page numbers with an F refer to a figure, with an FF to figures that follow consecutively; 
Daae numbers with a T refer to a table: vs means comDare/comDarison. 



1:44 



INDEX 



RNA-RNA rearrangement, 351-352 

SL RNR 353-354, 354F 

sm proteins, 365F 

splicing mechanism, 350F, 352 

U1snRNP,350F,351,351F 

U2 snRNP, 350F, 351 F 

U4/5/6 triple snRNP, 350F 

U4 snRNP, 350F 

U5snRNP,350F,351F 

U6 snRNP, 350F, 351 F 
assembly, 363 

see also SnRNAs; Splicing 
Sodium-calcium exchanger, 660 
Sodium channel{s), voltage-gated see Voltage- 
gated sodium channel(s) 
Sodium dodecyl sulfate see SDS (sodium dodecyl 

sulfate) 
Sodium dodecyl sulfate polyacrylamide gel 

electrophoresis (SDS-PAGE), 517, 
518F,637 
Sodium independent chloride-hydrogen 

carbonate exchanger, 658 
Sodium-linked symporter, 659 
Sodium-potassium pump (ATPase), 661-663, 662F 

osmotic regulation, 669 

pumping cycle, 662F 

selectivity, 671 
So (sine oculis) gene, 466F 
Sog gene, 1335 
Solvent(s), water, 109F 
Somatic cell(s), 1271 

functions, 265F 

heat-shock induced recombination, 1348 

induced recombination, 1348, 1349F 

mutations, 265, 1208, 1209 
Somatic hypermutation, antibodies, 1566-1567, 

1567F 
Somatic mutation(s), 265, 1208, 1209 

see also Cancer; Mutation(s) 
Somatic nuclear transplantation, 507-508, 507F 
Somatosensory region, cerebral cortex, 1 392F 
Somatostatin, 908 
Somite(s), 1371,1372F 
Sonic Hedgehog protein, 950 
Son-of-sevenless mutant, Drosophila, 928 
SOS response, 310-311 

see also DNA repair 
Southern blotting, 539-540, 539F 
Sox9 gene/protein, testes development, 1285 
Spl protein, DNA sequence recognition, 41 8T 
Space-filling models, 50 

lysozyme, 164F 

protein molecules, 144F 

SH2 protein domain, 1 33F 
Species differences 

chromosomes, 204-205, 205F 

sex determination, 1 286 

sexual reproduction, 1282, 1285-1286 

Xenopus laevis, 38-39 
Species divergence, mutation rates, 264 
Specificity pocket, MHC protein, 1577-1578, 1579F 
Spectrin 

actin cross-linking, 1007F 

actin filament packing, 994F 

actin filament web formation, 1008 

membrane cytoskeleton, 646, 647F 

structure, 1008 
Spectroscopy, electron carriers, 831, 832F 
Sperm see Spermatozoa 
Spermatids, 1293,1294 
Spermatocytes, 1293 

meiosis in, 1281 

primary, 1293 

secondary, 1293,1294 
Spermatogenesis 

genes regulating, 1295-1296 

spermatids, 1293, 1294 

spermatocytes see Spermatocytes 

spermatogonia, 1293 

stages, 1293-1294, 1294F, 1295F 

syncytia, 1294-1296, 1296F 
Spermatogonia, 1293 
Spermatozoa, 1292-1296 

adaptation for DNA delivery, 1292-1293 

binding to egg, 1270F, 1298-1299, 1298F 
acrosome reaction, 1297, 1298, 1299F 
cortical reaction and, 1 299, 1 300F 
see also Fertilization 

capacitation, 1294,1297 

mitochondria, 81 6F 



motility, 1293, 1293F 

plasma membrane domains, 645, 646F 

precursors see Spermatocytes 

production see Spermatogenesis 

structure, 1292-1293, 1293F 
Spermatozoon see Spermatozoa 
S phase of cell cycle see under Cell cycle 
Sphingolipid(s), membrane lipid(s), 618-620, 625 

see also specific types 
Sphingomyelin, 618,619F, 744-745 
Sphingosine, 61 8, 61 9F, 628, 744-745 
Spinal cord 

development, 1385-1386, 1386F 

tissue culture, 504 
Spinal muscular atrophy, inherited, SMN protein 

mutation, 364-365 
Spindle assembly checkpoint, 1 088 
Spindle pole body, 993 
Spirochaetes, structure, 1490F 
Spiroplasma, 14F 

Spisula development, cell cycle analysis, 1058 
Spliceosome(s), 349-351, 349-351, 354F 

active site, 352 

assembly, 350 

AT-AC spliceosome, 353-354, 354F 

ATP hydrolysis, 351-352 

catalytic RNAs, 352 

components, 350F, 352 
see also SnRNAs 

evolution, 355-357 

fidelity mechanisms, 352 

mode of action, 350-35 1 , 350F 

RNA-RNA rearrangements, 351-352, 351 F, 353 

SR proteins, 352, 353 

frans-spliceosome, 354-355, 354F 
Splice sites, 350F 
Splicing, 346, 347-348 

alternative see Alternative splicing 

AT-AC splicing, 353-354, 354F 

cellular location, 366 

errors, 347-348, 352, 352F, 355, 355F 

evolution, 355-357 

evolutionary advantage, 348 

gene expression regulation, 477-481 

Drosophila sex determination, 481 -482, 481 F, 
482F 

intron removal, 347-348 

mechanisms, 347-348, 350F, 354F 
base-pairing, 351 
lariat structure, 347, 348F, 350F 
machinery see Spliceosome{s) 
release, 352 
RNA-RNA rearrangements, 350-351, 351-352, 

351F 
self-splicing see Self-splicing RNA 
transesterification, 347, 348F 

muscle cell isoforms, 1465 

plasticity, 355, 480 

regulation, 480, 480F 

signals, 349, 349F 

splice sites, 350F, 352-353, 353F 

trans-splicing, 354-355, 354F 

tRNA, 369, 369F 
Splicing signals, 349, 349F, 352, 352F, 355, 355F 
Spol 1 meiosis-specific protein, 1280 
S.pombe, 552 
Spore(s), 1269 

SPR see Surface plasmon resonance (SPR) 
Squames, 1419, 1419F 
Squid giant axon, classical experiments, 678, 679, 

679F 
Squint, 1396 
Src family of protein tyrosine kinases, 936 

deletions leading to cancer, 1232 

evolution, 177F 

membranes, 631 

regulation, 178, 178F 

Src see Src protein 

structure, 178F 
Src homology 2 domain (SH2) see SH2 domain 
Src homology 3 domain (SH3) see SH3 domain 
Src-like kinase, B-cell signaling, 1 596F 
Src protein 

functional role, 178, 179F 

regulation, 178, 178F 

structure, 136, 136F, 178F 
SRP see Signal-recognition particle(s) (SRP) 
SR proteins, 352, 353 

Sry gene/protein, 1283-1285, 1285, 1286F 
S-S bonds see Disulfide bonds 



Staining, histochemical, 554-555, 579, 585F 
Staphylococcal nuclease, reaction rates, 161 F 
Staphylococcus, infection route, 1501 
Starch 

biosynthesis, 846 

granules 

amyloplasts, 841, 841 F 
chloroplasts, 94F, 95, 841, 842F 

structure, 94F 

synthesis, 95 
Start (restriction) point of cell cycle, 1055, 1061, 

1066,1105 
Start-transfer signal(s), ER protein transport, 732 

mechanism, 733F 
Starvation, Cortisol release, 463 
Stathmin, 995F, 1000 
Stearic acid, 114F 

Steel factor (SCF; stem-cell factor), 1375 
Stem cell(s), 1417-1484 

asymmetry of daughter-cell fates, 1421, 1421 F 

division rate, 1420, 1425 

embryonic (ES) see Embryonic stem (ES) cells 

engineering, 1476-1483 

epidermal, 1417-1428 

founder cell population, 1424F 

gut epithelium, 1436-1437 

hemopoietic see Hemopoietic stem cells 

immortal strand hypothesis, 1424-1426, 1425F 

multipotent, hemopoietic, 1450-1463 

production, 1421-1422, 1421 F, 1424 

role in cancer, 1217-1218, 1219 
Stem cell factor (SCF), 1375 
Stem cell niche, signaling pathways, 1441, 1442F 
Stem-loops, RNA structure, 403F 
Stereocilia, 1430, 1430F, 143 IF 
Stereo vision, 1396 

Steric limitations, peptide bonds, 127F 
Steriocillia, 609F 
Steroids, structure, 115F 
Sterol(s), membrane lipid(s), 618-620 
STI-751 (Gleevec), chronic myelogenous leukemia 

therapy, 1261 
Stimulatory G protein (G s ), 906-907, 91 9T 
Stomach 

bacterial colonization, 1503 

cancer, Helicobacter pylori association, 1228, 
1229,1503 

epithelium, 1436 

ulcers, 1228, 1503 
Stomata, 846 
Stopcodons, 367F, 381 
Strand-directed mismatch repair, 272, 276-278, 

277F 
Strand invasion, homologous recombination, 305, 

306 
Stratified squamous epithelium, 1434 
Streptococcus, 14F, 1501 
Streptomycin, 385T 
Stress, Cortisol release, 463 
Stress fibers, 1093 
Stroma, 1165 

cells, hemopoiesis control, 1458-1459, 1458F 

chloroplasts, 71 3F, 719-720, 842, 843F, 853 

mammary gland, 1428 

protein transport, 71 9-720, 720F 

role in tumor progression, 1222 
Strychnine, mode of action, 684 
Subcutaneous layer (hypodermis) of skin, 1418, 

1418F 
Substrate(s), enzyme binding, 72-73, 159-160 

see also Ligand(s) 
Subunits (macromolecules), 148-152 
Succinate, 122F, 123F 
SuccinylCoA, 122F, 123F 
Succinyl-CoA synthetase, 123F 
Sucrose, 511,846 
Sugars, 55-58, 11 2-11 3FF 

biosynthesis via photosynthesis, 69, 70F, 
845-846, 845F 
see also Photosynthesis 

as building blocks, 56 

derivatives, 113F 

energy source, 55-58, 88F, 96-97, 824T 

polymerization, 56 

storage, 94F 
see also Starch 

structure, 55-56, 57F, 1 1 2F, 1 1 3F 

see also Carbohydrates; Monosaccharides; 
Polysaccharides; individual sugars 
Sulfolobus, tree of life, 1 6F 
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Superceding, 344-345, 344F 

Superhelical tension, 344-345, 344F 

Superoxide, aging, 868 

Superoxide dismutase (SOD), 860, 868 

Superoxide radical, 833 

Supporting cells, olfactory epithelium, 1429, 1429F 

Suppression, self-reactive lymphocytes, 1548 

see also Immunological tolerance 
Suppressor mutation(s), 528F 
Suprachiasmatic nucleus (SCN), circadian clock, 461 
Surface plasmon resonance (SPR), 525-526, 525F 
Surfactant, 1434, 1435F 
Surrogate light chains, pre-B cell, 1 553, 1 565-1 566, 

1566F 
Survival factor(s), 1 1 02, 1 1 26, 1 1 27F 
Suspensor, 1400, 1400F, 140 IF 
Swinholide, 988T 

Switch helix, GTP-binding proteins, 181 
Switch sequences, class switching, 1568 
Syk kinase, B-cell signaling, 1596F 
Symbiosis 

chloroplast origins, 29 

mitochondrial origins, 27-28 
Sympathetic ganglion, 1390F 
Sympathetic neurons, NGF dependence, 1390 
Symporter(s), 656, 656F, 659 
Synapse(s), 682 

chemical see Chemical synapse(s) 

electrical, 1161 

elimination, 1393-1395, 1394F 

formation, 11 49F, 1393-1394 

cell adhesion molecules and, 1 147-1 148 
scaffold proteins and, 1 1 48-1 1 49, 1 1 48F 

immunological, signal-relaying junctions, 1 1 32 

modification, dependence on electrical activity, 
1395F 

organization/structure, 1149F 

regeneration, 1168-1169, 1168F 

remodeling, 1393, 1396-1397 

signaling, 882, 882, 882F 

synaptic cleft, 682 

vesicles see Synaptic vesicle(s) 
Synapsis, 305F, 306F, 1275, 1276F 
Synaptic cleft(s), definition, 682 
Synaptic connections, remodeling, 1393, 1396-1397 
Synaptic signaling, 882, 882F 
Synaptic vesicle(s), 807-808 

membrane fusion, 803 

structure, 809F 

see also Neurotransmitter(s) 
Synaptobrevin, v-SNARE, 763F 
Synaptonemal complex, 1274F, 1275-1276, 1275F 
Synchronized cells, 285 
Synchrony, of electrical activity in synapse 

modification, 1393-1395 
Synchrotron X-ray sources, 529 
Syncytia, 1099 

Drosophila embryo, 448F, 1330 

muscle cells, 1464 
Syndecan-1, 1184T 
Syndecans, 1183-1184 
Synechocystis, genome size, 1 8T 
Synonymous mutation, 247 
Syntaxin,t-SNARE, 763F 
Synteny 

blocks, genome evolution, 250 

conserved, 207, 250 
human-mouse, 208F 
Synthetic biology, 460, 461 F 
Synthetic lethality, 559 
Synthetic phenotype, 559 



T4 bacteriophage, 22F 

T7 polymerase, proofreading, 269 

Tagged knockouts, functional genomics, 569-571 

Tail polymerization, 86, 87F 

Tail region see Fc region,antibody molecule 

Talin, 1170, 1170F 

Tam3 transposon, 31 8T 

Tandem affinity purification tagging (tap-tagging), 

515-516 
Tandem mass spectrometry, 519-521, 522 
Tap-tagging, 515-516 
Target proteins, apoptosis, 1118-1119 
Tar RNA structure, transcription attenuation role, 

478 
TATA-binding protein (TBP), 340, 341 FF, 342F 



TATA box 

consensus sequence, 342F 

location, 340 

transcription factor-binding, 340-341, 341 F, 342F 
Tat protein, transcription attenuation, 478 
Tau protein, 995F, 1001, 1002F 
Tautomeric shift, DNA bases, 269, 270 
Taxol, 988, 988F, 988T 
TBP see TATA-binding protein (TBP) 
T c see Cytotoxic T-cells (T c ) 
TCA cycle see Citric acid cycle 
T-cadherin, 1136-1137, 11 38T 
Tcell(s), 1392-1409, 1540, 1569-1589 

accessory receptors, 1580-1581 

activation, 71 IF, 1543F, 1571-1572, 1572T, 
1590F, 1591-1592, 1591F 

antigen recognition, 1575 

CD4 see Helper T-cell(s) (T H ) 

cytoskeletal polarization in, 1047, 1573 

development, 1543F 

direction to targets, 1579-1580 

general properties, 1540-1551 

helper see Helper T-cell(s) (T H ) 

integrins, 1174, 1174F 

lymph node, 1551 F 

main classes, 1543 

MHC class restriction, 1579-1580 

MHC protein interaction, 1569-1589, 1575 

recirculation, 1549-1551, 1550 

resting, 1544F 

selection in thymus, 1585-1586, 1585-1586, 
1587F 

tolerance, nonactivated dendritic cells, 1571 

see also CD4 T-cell(s); CD8 T-cell(s); Cytotoxic 

T-cells (T c ); Regulatory T cell(s);T cell 
receptor(s) 
T cell co-receptors 

CD4, 1580-1581, 1580T, 1581F 
see also Helper T-cell(s) (T H ) 

CD8, 1580-1581, 1580T, 1581F 
see also Cytotoxic T-cells (T c ) 

structure, 1599F 

T cell development, 1 586, 1 587, 1 587F 
T cell-independent antigen, 1 598-1 599 
T cell-mediated immune response, 1540 
Tcell receptor(s), 1570-1571, 1580F, 1599F 

antigen presentation to, 1590 

assembly in ER, 768 

B-cell receptors, 1595-1596 

CD3 complex association, 1 590F 

7 chain, 1571 

MHC protein/antigen interactions, 1574F, 
1577-1579, 1579F, 1580F 

signaling function, 1 590F 

structure, 1570-1571, 1570F, 1571 
TCF4 gene regulatory protein, colorectal cancer, 

1439 
TCP-I chaperone, 390F 
Tectorial membrane, 1430, 1430F, 143 IF 
Tectum, optic, 1391, 1391 F 

neuronal specificity and retinotectal map, 
1392-1393 
Teichoic acids, 1527 
Telomerase, 292-294, 505 

cell senescence and, 1 1 07 

nucleolar assembly, 363 

structure, 292F 

see also Reverse transcriptases 
Telomere(s), 292-294 

cancer, 1217 

heterochromatin role see Heterochromatin 

length regulation, 293-294 
senescence role, 293, 1 107 
yeast, 293, 294F 

meiotic homolog pairing and, 1 274, 1 274F 

repetitive DNA sequences, 210 
evolutionary conservation, 292 
replicative loss, 293 

replication, 210, 21 OF, 292, 294F 
T-loops, 292, 293F 
see also Telomerase 

replicative cell senescence and, 293, 1 107 
Telophase, 1055, 1073F, 1090 
TEM see Transmission electron microscopy 
Temperature-sensitive mutation(s), 557, 557F, 1057, 

1057F 
Templated polymerization, 3-4, 3F 

DNA, 200, 200F, 201 F, 266, 266F, 267F, 268F 

information transmission, 3-4, 3-5, 3F, 4F, 5F, 
333, 334F 



see also DNA replication; Transcription 
Teosinte, 141 1,141 1F 
Teosinte branched-! (Tbl) gene, 141 1 
Terminal differentiation, 1 103, 1420 
Terminator elements, transcription, 337, 338, 339 
Testes 

development, 1283-1285, 1286F 

spermatogenesis, 1293-1294, 1294F, 1295F 

structure, 1293, 1294F 
Testosterone, 889F 
Tetracyclin, 318F, 385T 
Tetradecanoyfphorbol acetate, as tumor promoter, 

1226 
Tetraspanin, 1298-1299 
Tet repressor, synthetic biology, 461 F 
TFIIH transcription factors, 340-341 
TFII transcription factors see Transcription 

factors,general 
TGFp superfamily see Transforming growth factor-B 

(TGF{5) family 
TGN see Trans Golgi network (TGN) 
T H see Helper T-cell(s) (T H ) 
T H 1 7 cells, 1594 
Thalassemia, p-globin abnormalities, 288, 288F, 

355F,452 
Therapeutic cloning, 507, 507F, 508 
Thermal motions, 49, 49F 
Thermodynamics, 66-67, 1 18-1 19FF 

entropy, 66-67, 67F 

first law, 67-68, 11 8F 

second law, 66-67, 1 18F 

see also Free energy 
Thermofilum, tree of life, 1 6F 
Thermomicrobium, tree of life, 16F 
Thermotoga maritima, 1 0F, 1 8T 
Thick filaments, 1012F 

see also Myosin 
Thoracic duct, 1549, 1549F 
Threading, computational protein analysis, 139 
Threonine, structure, 129F 
Thrombin, 159 
Thyl protein, 1599, 1599F 
Thylakoid(s), 719-720, 842-843, 843 

membrane, 71 3F, 843, 843F, 852F 

protein transport, 71 9-720, 720F 

vesicles, generation, 698 
Thylakoid space, 843, 843F 
Thymidine, 3 H-thymidine autoradiography, 

282-283, 284F, 602-603 
Thymine, 197 

base-pairing, 198F 

dimer formation, 296, 297F 

structure, 11 6F, 301 F, 332F 
Thymosin, actin filament modification, 999 
Thymus, 1541 F, 1585-1586 

organization, 1585F 

organ-specific protein expression, 1587-1588 

Tcell selection, 1585-1586 
Thyroid hormone, 890, 907T 

receptor, 891 F 
Thyroxine, 889F 
Tight junction(s), 1 1 50-1 1 58 

epithelial cells, 806, 1133, 1134F, 1150-1151 

functions, 1150-1153 

junctional complexes, 1 1 53-1 1 55, 1 1 54F 

paracellular transport, 1 152 

as permeability barriers, 1151-1152, 1152F 

scaffold proteins, 1 1 53-1 1 55 

structure, 1 1 52-1 1 53, 1 1 52F, 1 1 53F 

transcellular transport, 1151, 1151 F 

transmembrane adhesion proteins, 1153 
TIM23, 718 
Tim clock gene, 462F 
TIM complex, 714-715 

membrane association, 714F 

mitochondrial protein import, mechanism, 
716-717, 716F,718F, 719 
Timeless protein, 462F 
Tip cell, capillary sprouting, 1447 
Tissue(s) 

basal lamina see Basal lamina 

cell-cell interactions, 1 131-1132, 1 132F 

selective adhesion, 1 1 39-1 1 40, 1 1 39F, 1 1 40F 
selective assortment, 1 1 40-1 1 41 , 1 1 41 F, 

1142F 
see also Cell adhesion; Cell junction(s); 
Extracellular matrix (ECM) 

cell growth and, 1102, 1108 

connective see Connective tissue 

developmental segregation, 1140-1141, 1141F, 
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1142F 
engineering, 1479 
epithelial see Epithelia 
explants, 504 

extracellular factors, 1 1 02, 1 1 1 
organization and maintenance, 1 140 
polarity, 1155 
renewal, 1417-1484 
repair, 1193,1477 
sectioning, microscopy sample preparation, 585, 

585F,605 
specialized, 1417-1484 
variety, 1131 
see also specific tissues 
Tissue culture, 503-504 
explants, 504 
historical aspects, 504 
requirements, 503-504 
spinal cord, 504 
see also Cell culture 
Tissue explants, 504 
Tissue-inhibitors of metalloproteinases (TIMPs), 

1194 
Tissue slices, microdissection techniques, 502, 503F 
Tissue staining, 554-555, 579, 585F 
Tissue-type plasminogen activator (tPA), 1 194 
Titin, 1028, 1028F 
Tjp protein family, 1 1 54 
T-loops, 292, 293F 

T lymphocytes (T cells) see Lymphocyte(s); T cell(s) 
Tn3 transposons, 31 8T 
TNFct see Tumor necrosis factor-a (TNFct) 
Tobacco cells, cell culture micrograph, 504F 
Tobacco mosaic virus (TMV), self assembly, 1 49 
Toll-like proteins, 1530-1531 
Toll-like receptors (TLRs), 1530-1531, 1530F, 1542 
activation, 1533-1534 
dendritic cells, 1531,1536 
inflammatory response, 1 533-1 534 
Tomato bushy stunt virus, capsid, 149F 
TOM complex, 714-715, 714F, 716-717, 716F, 718F 
Tomography, electron microscope (EM), 61 1 F, 61 2, 

612F 
Tonoplast(s), plant vacuoles, 782F 
Topoisomerases see DNA topoisomerases 
TOR kinase, 934, 1108 
Torso protein, 1333F, 1334, 1334F 
Total cell mass, control, 1 1 02, 1 1 1 1 -1 1 1 2 
Total internal reflection fluorescence microscopy 

(TRIP), 599, 600F 
Totipotency, plant cells, 568 
Toxin(s), 1493 

bacterial see Bacterial toxins 
cholera, 1492 

mechanism of action, 1493-1494 
Toxoplasma gondii 
conoid, 1 509 

host cell membrane traffic, 1512 
invasion of host cells, 1 509, 1 51 OF 
structure, 1489F 

vacuole formation and replication, 1509, 151 OF 
Toy (twin of eyeless) gene, 466 F 
TP53 see p53 gene/protein 
TPA (tetradecanoylphorbol acetate), as tumor 

promoter, 1226 
Trabecular bone, 1470, 1470F 
Traction, cell movement, 1040-1041, 1041F 
Transcellular transport, 658-659, 659F 

tight junction(s), 1151, 1151F 
Transcription, 4-5, 4F, 331-343, 334F, 335F 
basic principles, 4F, 331 , 332-333, 333, 334, 
334F, 335-336 
see also Messenger RNA 
directionality, 339, 339F 
DNA signals, 336-340 

consensus sequences, 338-339, 338F 
heterogeneity, 338-339 
promoters, 337 
elongation, 343-346, 344F 
eucaryotic, 339, 339-346, 341 F, 343F, 399F 
elongation, 343-345 
initiation, 341 F, 343F 
mitochondrial DNA, 863 
RNA polymerases, 339, 339-340, 340T 

see also RNA polymerase^) 
RNA processing, 345-346 
seeaiso Transcription factors, general (TFII) 
fidelity, 270, 334 

gene expression regulation see Transcriptional 
control of gene expression 



initiation, 336-337 
abortive, 337F 
complex, 341-342 

eucaryotic, 339, 340-342, 341 F, 343F 
histone code reading, 444F 
promoters, 338-339 
transcriptional activators see Transcriptional 

activator(s) 
see also Promoter elements 
procaryotic (bacterial), 336-338, 337F, 338-339, 
338F, 339F, 345F 
directionality, 339, 339F 
promoters, 337, 337F, 338F 
RNA polymerase, 336-337, 337F 
sigma factors, 336, 337F 
terminators, 337, 339 
transcription attenuation, 477-478 
rates, 334, 337, 370 
replication vs., 333 

RNA polymerase orientation, 339, 339F 
rRNA genes, 361 F 
termination, 337, 339 
universal principles, 4-5 
Transcriptional activator(s), 341, 342-343, 343F, 435 
eucaryotic, 436F 

coactivators, 445, 447F 
complex formation, 447, 447FF 
holoenzyme interaction, 438F 
repressor interactions, 445, 446F 
structure, 441, 442, 442F 

seeaiso DNA-binding motifs 
transcriptional synergy, 444-445, 444F 
seeaiso RNA polymerase II 
mechanisms, 436F 

procaryotic, 41 8T, 420F, 435-436, 436, 436F 
Transcriptional control of gene expression, 379F, 
415,432-477 
activators see Transcriptional activator(s) 
eucaryotes 

"action at a distance," 438F 

cell cycle control, 1065, 1 104, 1 104F 

complexity, 434-436 

DNA looping, 438F 

gene regulatory proteins see under Gene 

regulatory protein(s) 
procaryotes vs., 434-436 
promoters, 339, 438F, 440 

see also Promoter elements 
regulatory elements, 438F, 440 
insulators, 453 
locus control regions (LCRs), 450-452, 

452F 
TATA box see TATA box 
transcriptional attenuation, 478 
transcriptional synergy, 444-445, 444F, 

464-465 
seeaiso DNA-binding motifs; Transcription 
factors,general (TFII) 
evolution, 439-440 
heritability, 458, 458F 

procaryotes (bacterial), 436FF, 438-439, 439F, 
439T, 477-478 
eucaryotes vs., 434-436 
Lac operon see Lac operon (Escherichia coli) 
regulatory proteins see Gene regulatory 

protein(s) 
repressors see Repressor protein(s) 
see also RNA polymerase(s) 
Transcriptional mediator, 343 
Transcriptional repressors see Repressor protein(s) 
Transcriptional silencing see Gene silencing 
Transcriptional synergy, 444-445, 444F, 464-465 
Transcription attenuation, 477-478 
Transcription circuits, 459-460, 459F, 460F 
Transcription-coupled repair, 299-300 
Transcription factors, general (TFII), 340-342 
activator protein interaction, 438F 
function, 340 
nomenclature, 340 

repressor protein interaction, 445, 446F 
TATA box-binding, 340-341, 341-342, 341F, 342F, 

438F 
transcription initiation, 341 F 
Transcription initiation complex, 341-342 
Transcription unit(s), 336, 477 

see a/so Gene(s) 
Transcytosis, 793, 793F, 797-798, 797F 
glucose transporters, 798 
IgA, 1556F 
insulin receptor, 798, 798F 



recycling endosomes, 798 
Transducers, 1429 
Transesterification reactions, pre-mRNA splicing, 

347-348, 348F 
Transfection, DNA cloning vectors, 540-541 
Transferrin receptor, 794 
Transfer RNA (tRNA), 6, 7F, 368-369 

aminoacylation reaction, 370-371, 371 F, 372F 
accuracy, 371-373 
origin, 407 

synthetase recognition, 372-373, 372F 
see also Aminoacyl-tRNA synthetases 
aminoacyl-tRNA-ribosome binding, 375-376, 

375F 
anticodons see Anticodon(s) 
evolutionary significance, 407-408 
function, 336T 
initiator tRNAs, 380 
modification, 368F, 369, 369F, 370F 
peptidyl-tRNA-ribosome binding, 375-376, 375F 
structure, 368, 368F, 370F, 373F 
transcription, 335-336, 863 
see a/so Genetic code 
'Transformed' phenotype, cancer cells, 506T, 1 232, 

1233 
Transformer (Tra) gene, Drosophila sex 

determination, 481 F, 482F 
Transforming growth factor-p (TGFp) family 
activation of Smad-dependent signaling 

pathway, 939-941, 940F 
animal development, 1316F 
Arabidopsis absence of, 1400 
Drosophila development, 1 336 
inhibitor signals, 1103,1110, 1111F 
loss of function mutations, 1243 
myostatin, 1466 
proteoglycans and, 1 1 83 
regulatory Tcell(s), 1575 
skia 1426 
Transgenic organism(s), 566-568 
plants, 568-569, 569F, 570F 
production, 566-568, 570F 

site-specific recombination, 325-326 
Cre/Lox system, 567-568 
see also Gene knockouts 
Trans Golgi network (TGN), 772 
electron micrograph, 771 F 
functional compartmentalization, 778F 
lipid raft assembly, 807, 807F 
M6P receptor proteins, 784 
protein sorting pathways, 801 F, 807F 
structure, 771 F 
transport, 779-787, 785F 
seeaiso Exocytosis 
Transit amplifying cells, 1422-1424, 1422F 
benefits, 1425-1426 
growth control strategy, 1423 
gut epithelium, 1437 
Transitional endoplasmic reticulum see 

Endoplasmic reticulum (ER),smooth 
Transition state, 160 

activation energy, 160, 160F 
analogs, 160, 161 F 

stabilization, enzyme catalysis, 1 59-1 60, 1 60, 
164 
Translation, 4-8, 4F, 331, 366-383, 376F 
accuracy, 270, 382 
EF-Tu role, 377-378 
free energy expenditure, 385 
basic principles, 4-8, 4F 
decoding, 367, 367F 

adaptor molecules, 371, 372F 
reading frames, 368, 368F 
see also Genetic code; Transfer RNA 
eucaryotes, 345F, 380, 380F, 399F 

ribosomes, 360-361, 362-363, 374F 
gene expression regulation seeTranslational 

control of gene expression 
historical aspects, 366-367 
inhibitors, 384, 385T 
initiation, 379-380, 380F 
bacteria, 380, 488 
eucaryotes, 380, 380F, 487-488 
formylmethionine, 380 
initiation codon, 380, 489-490 
initiation factors (elFs), 380, 488-489, 490F 
initiator tRNA, 380 

internal ribosome entry site (IRES), 491 
leaky scanning, 490 
mRNA cap, 1517 
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mRNA signals, 379 
procaryotic, 1 527 
viruses, 1517 
peptide chain elongation, 373F, 375F, 376F 
elongation factors, 377-378 
peptidyl transferase, 375-376, 379, 379F 
see also Protein synthesis 
quality control, 382-383, 385-387, 387F 

see also Nonsense-mediated mRNA decay 
regulation, 489F 
termination, 381, 381 F 
viral RNA, 1517-1518 
see also Messenger RNA (mRNA); Protein 
synthesis; Ribosome(s) 
Translational control of gene expression, 379F, 415, 
488 
cytoplasmic polyadenylation, 493 
initiation regulation, 489-491, 490F 
internal ribosome entry site (IRES), 491 
mRNA degradation, 492-493, 492F 
negative, 488, 490F 
see also Messenger RNA (mRNA) 
Translational frameshifting, 383, 384F 
Translational recoding, 383, 383F, 384F 
Translational repressors, 488, 490F 
Translation release factor (eRFI), molecular mimicry, 

382F 
Translocase of the inner mitochondrial membrane 

see TIM complex 
Translocase of the outer mitochondrial membrane 

see TOM complex 
Translocations, chromosomal see Chromosome 

translocation 
Transmembrane adhesion proteins, tight 

junction(s), 1153 
Transmembrane protein(s), 629-630, 630F 
assembly, 634 

(3 barrel conformation, 632, 634-635, 635F 
cell-cell adhesion, 1 1 34-1 1 35, 1 1 34F, 1 1 35T, 
1153 
see a/so Cadherin(s) 
cell-matrix adhesion, 1 169, 1 177T 

see also Integrin(s) 
domains, 632 

a-helix conformation, 631-632, 632F 
a-helix interactions, 632-634 
hydrogen bonding, 632 
hydropathy plots, 632 
multipass, 632, 634F, 634FF, 653, 734-735, 

734-736, 735F 
single-pass, 632, 636F, 733-734, 733F 
transport, small molecules, 651 
see also Porin(s); specific proteins/types 
Transmission electron microscopy (TEM), 604-607, 
604F 
3-D reconstruction, 606, 607F 
immunoelectron microscopy, 606-607, 607F 
microscope, 604-605, 604F 
sample preparation, 605-606, 605F 
surface analysis, 608-609 

metal shadowing, 608-609, 610F 
Transmitter-gated ion channel(s), 682-684, 892 
acetylcholine receptors as, 684-686 
localization, 683 

neuromuscular junctions, 684-686, 687-688 
permeability changes, 683 
psychoactive drug effects, 686-687 
see also specific receptors 
Transplantation reaction, 1575-1576 
Transporters/transport proteins see Carrier 

protein(s) 
Transport vesicle(s), 749, 760 
Transposable elements see Transposon(s) 
Transposase, 317, 318F 
alternative splicing, 480 
DNA-only transposons, 317, 31 8T 
genes, 31 8F 
Transposition, 316 

DNA-only transposons, 31 8T 

cut-and-paste mechanism, 31 9F, 320F 
enzymes, 31 8T 

see also specific enzymes 
mechanism see Site-specific recombination 
retrotransposons, 318T 
viral integration, 321 F 
Transpositional site-specific recombination see 

Transposition 
Transposon(s), 31 7, 1 599 
bacterial genome, 318F 
classification, 31 8T 



DNA-only, 317, 318F, 318T 
retrotransposons, 317, 318T 

see also Retrotransposons; Retroviral-like 
retrotransposons 

evolutionary movements, 323 
(3-globin genes, 250F 
human genome, 207F 

insertional mutagenesis, 528F, 556 

phylogenetic variation, human vs. mouse, 323F 

transposition mechanism see Transposition 

see also Transposase; individual elements 
Trans-splicing, 354-355, 354F 

SLRNR353, 354F 

trypanosomes, 354, 354F 
Transverse filaments, meiotic homolog pairing, 

1276, 1277F 
Treadmilling, 976-980, 977FF 
Tree of life, 12-13, 12-13,15-16 
Treponema pallidum, 1 4F 

genome size, 18T 
Triacylglycerols, 115F 

energy storage, 95, 96F 

fatty acid storage, 58 

structure, 97F, 11 4F 
Tricarboxylic acid cycle see Citric acid cycle 
Tricellulin, 1153 
Trichomonas, tree of life, 16F 
Trichothiodystrophy, mouse knockouts, 568F 
Triglyceride(s) see Triacylglycerols 
Trimeric GTP-binding proteins (G proteins) see 
G protein(s); G protein(s) (trimeric 
GTP-binding proteins/ATPases) 
Triosephosphate isomerase, 120F, 161 F 
Trioses, 112F 

Triple response, growing seedling, 957, 958F 
Triple snRNP, splicing mechanism, 350F 
Trithorax group, 1344 
Triton, membrane protein solubility, 637 
Trk A, 923T 

TrkA receptor protein (NGF receptor), 1390 
tRNA see Transfer RNA 
tRNA-splicing endonuclease, 369, 370F 
Tropoelastin, 1190 
Tropomodulin, 1003 
Tropomyosin, 349F, 480, 994F, 1001 
Trypanosoma, 16F, 354, 354F, 483, 484F 
Trypanosoma brucei, 1502, 1519F, 1520 
Trypanosoma cruzi, 1 509, 1 5 1 1 , 1 51 1 F 
Trypanosome(s) 

antigenic variation, 1519F, 1520 

tsetse flies, 1502 
Trypsin, cell isolation techniques, 502 
Tryptophan, 129F, 433F 
Tryptophan operon, 433F 
Tryptophan repressor, 433-434, 433F, 434F 

helix-tum-helix motif, 420F 
T segments, Drosophila, 1 329 
Tsetse fly, 1 502 
T-SNAREs see SNARE proteins, target membrane 

SNAREs 
Tubercle, 1507 

Tuberculoid leprosy, 1593-1594 
Tuberculosis, 1488, 1507-1508, 1508 

see also Mycobacterium tuberculosis 
Tube worms, 13F 
Tubulin(s) 

bacterial homolog, 989 

evolutionary conservation, 982-983 

filament assembly, 973-975 

isoforms, 982 

photoactivation visualization, 594-595, 595F 

polymerization, time course, 914F 

structure, 973 

T-form and D-form, 980 

y-tubulin, 992,992F, 998 

see also Microtubule(s) 
y-Tubulin ring complex (y-TuRC), 992, 992F, 994F, 

1077,1079 
Tumor(s) see Cancer; specific types/locations 
Tumor initiators, 1226, 1227 
Tumor necrosis factor-a (TNFa), 953, 953F, 1592, 

1598T 
Tumor promoter(s), 1226, 1227 
Tumor suppressor genes (TSGs), 1231, 1232F 

genetic vs. epigenetic change, 1237F 

hereditary cancer syndromes, 1234-1235 

identification, 1234-1235 

loss by mutation, 1235-1236 

mechanisms of losing, 1 236F 

see also individual genes/proteins 



TUNEL technique, apoptotic cells, 1117, 111 8F 

Turgor pressure, 781 , 1 1 97, 1 1 99 

Turnover number, enzyme catalysis, 1 59-1 60 

Twin studies, epigenetics, 473, 473F 

Twist gene/protein, 1 141, 1335 

Two-component signaling pathway/system, 942, 

944F 
Two-dimensional polyacrylamide gel 

electrophoresis (2D-PAGE), 521-522 

gene expression analysis, 412, 41 4F 

isoelectric focusing step, 521, 521 F, 522F 

SDS-PAGE step, 521-522 

visualization, autoradiography, 522F 
Two-hybrid methods, 523-524, 524F 

see also Protein-protein interactions 
Two-hybrid system, 523-524, 524F 
"Two-photon" effect, confocal microscopy, 592, 

592F 
Ty1 retrotransposon, 318T 
Type III secretion system, bacterial toxins, 1493, 

1494F, 1504 
Tyrosine aminotransferase, gene expression 

regulation, 415 
Tyrosine kinase(s), 177F 

B cell receptor signaling, 1 590F 

receptor-linked see Receptor tyrosine kinases 
(RTK) 

T-cell receptor signaling, 1590F 

see also Src family of protein tyrosine kinases 
Tyrosine-kinase-associated receptor, 922, 935 
Tyrosine, structure, 129F 



U 

U1 snRNP, splicing mechanism, 350F 

U2AF spliceosomal protein, 350F 

U2 snRNP, splicing mechanism, 350F 

U4 snRNP, splicing mechanism, 350F 

U6 snRNP, splicing mechanism, 350F 

UAS (upstream activating sequence) element, 

1350F 
Ubiquinone see Coenzyme Q 
Ubiquitin, 392-393, 740F 
Ubiquitin-activating enzyme, 393, 394F 
Ubiquitin ligase(s), 393, 394F, 396F 

cell cycle control, 1064, 1065F, 1066T 

SCF, 184-185, 185F, 188 

see also specific enzymes 
Ubiquitin pathway, 393-394, 394F, 396F 

activation, 393-394, 394F 

enzymes, 393, 394F 

N-terminal degradation, 395-396 

proteasome targeting, 393-394 

protein labeling, 395F 

ubiquitin, 393, 740F 

see also Proteasome(s) 
Ubiquitylation, histone tails, 223F 
Ultracentrifugation 

cell fractionation, 510-511, 51 1 F 

microsome purification, 726 

preparative ultracentrifuge, 510-51 1, 51 OF 
Ultraviolet (UV) radiation 

DNA damage, 296, 297F, 298 

see also DNA damage; DNA repair 

sensitivity, DNA repair defects, 295T 
UncSH, netrin receptor, 1389 
Unc5, netrin receptor, 1389 
Unc6 gene, 1388 
Unc genes, 1388 
Uncoupling agents, 836, 838 
Uniporter(s), 656,656F 

Unit evolutionary time, protein evolution, 265 
Universe, time line see Origin of life 
Untranslated regions 

3' untranslated region, 487, 488, 488F 

5' untranslated region, 488 
Upstream ORFs, 490 
Uracil, 116F, 332, 332F, 333F 
Uranium, electron microscopy stains, 606 
Urea, 101 

Urea cycle, mitochondria, 867 
Urease, 1503 

3 H-Uridine autoradiography, 603 
Uridine isomerization, rRNA modifications, 361, 

362F 
Urokinase plasminogen activator (uPA), 1194 
UTR see Untranslated regions 
UV light see Ultraviolet (UV) radiation 



Page numbers in boldface refer to a major text discussion of the entry; page numbers with an F refer to a figure, with an FF to figures that follow consecutively; 
page numbers with a T refer to a table; vs means compare/comparison. 
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Vaccines/vaccination, 1498, 1498F 

HIV prevention, development, 1521 
Vaccinia virus, 1514R 1516F, 1517 
Valine, structure, 129F 
Valium, mechanism of action, 686 
Vancomycin, resistance, 1523 
VanderWaalsforces,51,51F,54, 1 1 0F 
bond length/strength, 53T 
protein folding, 130F 
Van derWaals radius, 51, 110F 
Variable domain 
genes, 1 563 

heavy Ig chains, 1559F, 1560F 
light Ig chains, 1559F, 1560F 
Variable number tandem repeats (VNTRs), 547F 
Varicella zoster virus, microtubule-based 

movement, 1516-1517 
Variegation, 866, 866F 
Vasa family proteins, 1 282 
Vascular cell adhesion molecules (VCAMs), 1 146 
Vascular endothelial growth factor (VEGF) 
cell-cell junctions and, 1 1 45 
ECM binding, 1183 
endothelial cells, 1448, 1449 
role in tumour progression, 1221 
Vascular tissue 

blood vessels see Blood vessels 
of plants, 1402, 1404F 
Vasopressin, 907T 
VE-cadherin, 1138X1145 
Vectors 

DNA cloning, 540-541, 541 FF 

see a/so DNA cloning 
of pathogens, 1501-1502 
protein expression, 546, 548, 548F, 548FF 
virus transmission by, 1502 
"Velcro principle" 

cadherin adhesion, 1 1 38-1 1 39, 1 1 39F 
integrin adhesion, 1174 
Velocity sedimentation, 51 0-51 1 , 51 2F 
Vertebrate(s) 

development see Vertebrate development 
times of evolutionary divergence, 43F 
Vertebrate development, 1363-1378 

cadherins and, 1 1 39-1 140, 1 140-1 141, 1 140F, 
1141F 
see a/50 Cadherin(s) 
cell movements and shaping of body plan, 
1363-1378 
blastula, 1363F, 1365, 1366F 
cell adhesion and sorting-out, 1 369-1 370, 

1370F 
cleavage, 1365, 1365F 
convergent extension, 1368-1369, 1369F 
egg asymmetries in amphibia, 1364, 1364F 
gastrulation, 1365-1369, 1366F, 1367F 
left-right asymmetry, 1376-1377, 1376F 
mid-blastula transition, 1365 
migratory cells, 1373-1375 
myoblast migration, 1373-1374, 1374F 
neural crest, 1374, 1374F 
neurulation, 1370-1371, 1370F, 1372F 
notochord, 1370-1371 
pigment cells, 1375, 1376F 
somite formation, 1371-1372, 1372F 
see also under Xenopus laevis development 
mouse see Mouse development 
neural crest, 1140, 1140F 
neural tube formation, 1140, 1 141 F 
Xenopus laevis see Xenopus laevis development 
see also individual processes and species 
Vesicular transport, 701, 701 F, 750-766, 750F 
coated vesicles see Coated vesicle(s) 
docking specificity, 760-761, 761-762 
ER to Golgi apparatus, 766-787 
exocytosis, 799-809 
see also Exocytosis 
Golgi apparatus transport 
to cell surface, 799-809 

see also Exocytosis 
to lysosomes, 779-787 
model, 777-778 
investigations, 752, 752F, 753, 753F 
organelle relationship, 701 F 
secretory vesicles see Secretory vesicle(s) 
SNARE proteins and see SNARE proteins 
Vesicular tubular clusters, 768-769 
electron micrograph, 769F, 772F 



formation, 769, 769F 

retrieval pathway, 769, 769F, 770F 

transport from ER to Golgi apparatus, homotypic 
membrane fusion, 768, 769F 
Vibrio cholerae, 1 4F 

bacteriophage, 1491 

chromosome, 1492F 

colonization of gut, 1503 

CTX(J> bacteriophage, 1492F 

gene map, 1492F 

pandemics, 1491 

structure, 1489F 

toxin see Cholera toxin 

virulence genes, 1491 
Villin, actin cross-linking, 1007 
Villi, of small intestine, 1436, 1436F, 1441 
Vimentin,985,985T 
Vinblastine (vincristine), 988T 
Viral infection(s) 

interferon-y, effects of, 1406F 

vaccination, 1498, 1498F 

see also Virus(es); specific infections/organisms 
Virion, definition, 1496 
Virulence factors, 1491 

anthrax, 1493 

type III secretion system for, 1493, 1494F, 1504 

see also Bacterial toxins 
Virulence genes, bacterial, 1491 
Virulence plasmids, 1491, 1491 F, 1492F 
Virus(es), 1488, 1496-1498, 1534-1536 

adhesion to host cells, 1 505-1 506 

bacterial see Bacteriophage(s) 

bloodborne, 1502 

as carcinogens, 1227-1228 
see also DNA tumor virus(es) 

cytolytic effects, 1496 

diversity, 1488 

DNA tumor virus(es), 1247-1249, 1248F 

entry and uncoating, 1505, 1506F 

enveloped, 1497, 1497F, 1505, 1513, 151 4F 

enzymes encoded, 1517-1518 

evolution, 1520-1521 

genome, 1496, 1497, 1497F 

genome integration see Mobile genetic 

elements; Site-specific recombination 

host cell membrane traffic changes, 1513-1514, 
1514F 

host metabolism changes, 1517-1518 

intracellular movement, 1514-1517 

life cycles, 1496-1497, 1496F 

MHC class I expression inhibition, 1535-1536 

morphology, 1496, 1496F 

nonenveloped, 1497, 1507 

plant, plasmodesmata and, 1 163 

proteins, 1497 

protein synthesis, 1517, 1535-1536 

receptors, 1505-1506 

replication, 1497 

retroviruses see Retroviruses 

RNA viruses see RNA virus(es) 

size, 1496 

structure, 148-149, 1496 

capsids, 148, 148F, 149F, 150F, 1489F, 1497, 

1513 
icosahedral, 61 1F, 612 
nucleocapsid, 1497F 
self assembly, 149 
spherical, 150F 

transmission by vectors, 1502 

see also specific infections/organisms 
Visual connections, in mammalian cortex, 
1395-1396 

molded by experience, 1 395-1 396, 1 395F 

segregation of right- and left-eye inputs, 1395, 
1395F 
Visual cortex, ocular dominance columns, 1 395F 
Visual experience, influence on ocular dominance 

columns, 1395F 
Visual transduction, 917-918 

adaptation, 920, 920F 

amplification of signal in rod photoreceptors, 
919-920, 920F 

photoreceptor cells, 918 

see also Arrestin; Rhodopsin; Rod photoreceptors 
(rods) 
Vitamin(s) 

as coenzymes, 167, 167T 

see also individual vitamins 
Vitamin A, 640, 890 
Vitamin C, deficiency, 1 1 86 



Vitamin D, 890 

receptor, 891 F 
Vitamin D3, 889F 
Vitelline layer, 1287 
Vitreous ice, 605, 610 
V(D)J joining, 1564-1566 

class switching vs., 1568 

control, 1565-1566 

recombination signal sequences, 1564F 
V-J joining process, k light chain production, 1563F 
V(D)J recombinase, 1564 
Voltage-gated calcium channel(s), 682, 687-688 
Voltage-gated ion channel(s), 676-678 

see also specific types 
Voltage-gated potassium channel(s), 677 

delayed, 678, 689 

epilepsy, 682 

evolutionary relationship, 682 

inactivation, ball-and-chain model, 678F 

mutants, 678 
Voltage-gated sodium channel(s) 

inactivating mechanism, 676 

neuromuscular junctions, 687 

states, 676-677 
Volvox, 501 F 

Von Hippel-Lindau (VHL) syndrome, 1449 
V-SNAREs, 762-763, 763F 
V-type pumps (ATPases), 660 



W 

Wall cress see Arabidopsis thaliana 
Warburg effect, 1245 
WASp family, 1043, 1516, 1516F 
Water, 108-1 09FF 

cell content, 51-52 

channel(s) see Aquaporin channel(s) 

chemical properties, 51-52 

electron source, 851,873 

functional roles, proteins, 154 

proton behavior, 828, 828F 

solvent, 109F 

structure, 52F, 108F 
Water-splitting enzyme, 851, 852F, 873 
Watson, James D., 196 
Wee1 kinase, 1063, 1074, 1074F 
Werner syndrome, DNA repair disorders, 295T 
Western blotting, 518, 51 9F 
White blood cells see Leucocyte(s); individual cell 

types 
Whooping cough (pertussis), 1503 

see also Pertussis toxin 
"Winding problem," DNA replication, 278F 
Wing bud, 1355F 
Wing imaginal disc, 1356F 
Wingless (Wg) gene/protein, 1353, 1353F 

in Drosophila segmentation, 1 340 

in vertebrate limb, 1355 

see also Wnt signaling pathway 
Wingless signaling pathway, 1353 

see also Wnt signaling 
Wing margin, 1201, 1352 
Wire protein models, 133F 
Wnt signaling pathway 

absenting rabidops is, 1 400 

animal development, 1316F 

p-catenin, 948-950, 949F 

in bone marrow, 1458 

cell-cell adhesion and, 1 145 

colorectal cancer, 1253 

gut stem-cell population maintenance, 
1438-1441 

planar cell polarity, 1 1 58 

in skin, 1426 
Wobble base-pairing, 369, 369F 
Wolbachia 

host behavior modification, 1518 

infection treatment, 1518 

microtubule-based movement, 1517, 151 7F 

vertical spread, 1517, 1518 
Worm see Caenorhabditis elegans 
Wound(s) 

angiogenesis, 1448, 1448F 

healing 

angiogenesis, 1448 

control of cell turnover, 1425-1426 

roleofTGFp, 1426 

infection route, 1501-1502 
WRKY (Zn finger) gene regulatory proteins, family 



r 
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members in different eucaryotes, 
MOOT 
Wuschel protein, 1410, 141 OF 



X-chromosome{s),202,473, 1271, 1284 
inactivation see X-inactivation 
meiotic pairing, 1275 
Xenografts, transplantation reactions, 1575 
Xenopus laevis, 39, 39F 

development see Xenopus loevis development 
genome duplication, 38-39 
species differences, 38-39 
Xenopus laevis development, 1 363F 

cell movement during development, 1363 
effect of blocking Notch signaling in, 1 360F 
egg, 1364F 

cell-free systems, 1058, 1058F 
fertilization, 1057 
growth, 1058F 

microtubule dynamics during mitosis, 1080, 
1081F 
embryo 

cell cycle analysis, 1057-1058, 1057F 
cleavage, 1058F 
gastrulation, 1365-1369, 1366F, 1367F, 1368F, 
1369F 
cell movements, 1365-1367, 1365F, 1366F, 

1367F 
cell packing changes (convergent extension), 

1368-1369, 1369F 
signals controlling, 1367 
Xenopus ruwenzoriensis, 39 
Xenopus tropicalis, 39, 39F 
Xeroderma pigmentosum (XP), 295T, 1208 
XIC (X-inactivation center), 474 
X-inactivation, 473-475, 473F, 474F 
Barr body, 473 

dosage compensation, 473, 475-476 
histone H2A variants, 474-475 
histone H3 variants, 475 
histone H4 variants, 475 
mechanism, 285, 473-475, 473F 
mosaics, 1209F 

random chromosome choice, 473-474 
see also Gene silencing; Genomic imprinting 
X-inactivation center (XIC), 474 
XISTRNA,473F,474 
XMAP215, microtubule formation, 995F, 1004, 

1004F 
Xpd gene/protein, mouse knockouts, 568F 
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X-ray{s), 527-528 

sensitivity, DNA repair defects, 295T 

synchrotron sources, 529 
X-ray crystallography 

DNA, 195-196,197 

electron density maps, 528 

historical aspects, 530T 

IgG structure, 1 561 F 

protein structure determination, 1 39, 527-529, 
528F 



YACs see Yeast artificial chromosomes (YACs) 
Y-chromosome, 202, 473, 1 271 , 1 284 

meiotic pairing, 1275 

Sry gene, 1 284 
Yeast(s), 33, 606F, 1494 

budding see Budding yeast(s) 

cell cycle control, 1056-1057, 1056F, 1057F, 
1062, 1063F 

chromosome replication, 210 

E. coii vs., 34 

fermentations, 90 
see also Glycolysis 

fission yeast see Schizosaccharomyces pombe 

gene expression analysis, 34-35 

genetics see Yeast genetics 

genome, 34, 31 8T 

metabolic map, 102F 

mitochondria, 857, 858F, 867F 

DNA, 863, 864-865, 865F, 866-867 

as model eucaryote, 33-34 

'protein-only inheritance,' 398, 398F 

protein transport studies, 703F 

reproductive cycles, 34F 

sexual reproduction, haploid vs. diploid cells, 
1269-1270, 1270F 

"shmoo,"1044, 1044F 

in tree of life, 16F 

two-hybrid methods, protein-protein 
interactions, 523-524, 524F 

utility as model organisms, 1 056 

vacuoles, lysosomal properties, 781-782 
Yeast artificial chromosomes (YACs), 541, 542F 
Yeast budding 

cellular polarity, 880, 1044, 1045F 

mating types, 1044 

signaling pathway, 880, 880F, 1045F 
Yeast genetics 

gene regulatory proteins, DNA sequence 
recognition, 41 8T 



mitochondrial DNA, 863 
inheritance, 864-865, 865F 
petite mutants, 866-867 
mutagenesis, deletion cassettes, 570 
Yersinia pestis, 1502, 1502F, 1520 
Yersinia pseudotuberculosis, 1508, 1520 
Yolk, 1287, 1290 
Yolk granules, 1287 



Zap70 tyrosine kinase, 1590F 

Zdisc, 1026, 1027F 

Zebrafish, 255, 556, 592F, 1 391 F 

Zellweger syndrome, 723 

Zigzag model, 30-nm fiber formation, 21 7, 21 7F 

Zinc finger motifs, 421, 421-422, 422F 

Zinc finger proteins, 421-422, 422F, 423F 

DNA-binding,422F 

sequence specificity, 427F 

structure, 421, 422F 

types, 421-422, 422F,423F 

zinc finger repeats, 421 
Zinc protease, lethal toxin, 1493 
Zona adherens (adhesion belt), 1 142, 1 143F 
Zona occludens see Tight junction(s) 
Zona pellucida, 1 287, 1 288F, 1 291 F, 1 298, 1 298F, 

1300 
ZO proteins, 1154 
ZP1 glycoprotein, 1298 
ZP2 glycoprotein, 1298, 1300 
ZP3 glycoprotein, 1298, 1300 
Z-ring, formation during cell division, 989, 989F 
Z scheme (photosynthesis), 852 
Zygote, 1 269 

centrioles, 1301, 1301 F 

reconstruction (cloning), genome preservation, 
411,413F 

zygotic-effect genes, 1 337 

see also Fertilization 
Zygotene, 1 275 
Zygotic-effect genes, 1 337 
Zymogens, 793 
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AMINO ACIDS AND 
THEIR SYMBOLS 




A 


Ala 


Alanine 


GCA GCC GCG GCU 


C 


Cys 


Cysteine 


UGC UGU 


D 


Asp 


Aspartic acid 


GAC GAU 


E 


Glu 


Glutamic acid 


GAA GAG 


F 


Phe 


Phenylalanine 


UUCUUU 


G 


Gly 


Glycine 


GGAGGCGGGGGU 


H 


His 


Histidine 


CAC CAU 


1 


He 


Isoleucine 


AUA AUC AUU 


K 


Lys 


Lysine 


AAAAAG 


L 


Leu 


Leucine 


UUA UUG CUA CUC CUG CUU 


M 


Met 


Methionine 


AUG 


N 


Asn 


Asparagine 


AACAAU 


P 


Pro 


Proline 


CCA CCC CCG CCU 


Q 


Gin 


Glutamine 


CAACAG 


R 


Arg 


Arginine 


AGA AGG CGA CGC CGG CGU 


S 


Ser 


Serine 


AGC AGU UCA UCC UCG UCU 


T 


Thr 


Threonine 


ACA ACC ACG ACU 


V 


Val 


Valine 


GUAGUCGUGGUU 


W 


Trp 


Tryptophan 


UGG 


Y 


Tyr 


Tyrosine 


UAC UAU 



